[00:45:39] 10Analytics-Radar, 10Technical-blog-posts: Story idea for Blog: The Best Dataset on Wikimedia Content and Contributors - https://phabricator.wikimedia.org/T259559 (10Milimetric) Wow, thanks so much. I think I cleaned up most of my messes, can you take another look? And then maybe we can do a quick live sync... [06:11:55] good morning! [06:12:21] razzius: hello! [06:19:59] I also believe that I have another new colleague with nickname joal, I haven't met him in a while :D <3 [06:24:48] 10Analytics, 10CAS-SSO: Allow login to JupyterHub via CAS - https://phabricator.wikimedia.org/T260386 (10MoritzMuehlenhoff) >>! In T260386#6393823, @jbond wrote: >>>! In T260386#6393108, @Ottomata wrote: >> Thanks @jbond, I'll leave this as a low/medium priority one for now and discuss with Luca when he gets b... [06:31:56] RECOVERY - Check the last execution of mediawiki-history-drop-snapshot on an-launcher1002 is OK: OK: Status of the systemd unit mediawiki-history-drop-snapshot https://wikitech.wikimedia.org/wiki/Analytics/Systems/Managing_systemd_timers [06:35:46] !log restart mediawiki-history-drop-snapshot on an-launcher1002 to check that it works [06:35:48] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [06:50:13] elukey!!!! hello :) Welcome back :) [06:54:13] joal: bonjoooooouurrrrrrrrr o/ [06:54:49] How are you elukey? how was the trip on the roof of europe? [06:58:42] joal: it was amazing, next time I'll try to do more trekking, this time we visited cities too, but I'd love to spend time climbing mountains :) [06:58:49] there are also a ton of trail runs to do [06:59:09] * joal will never keep up with elukey next we run together [07:01:54] joal: I mentioned that there are trail runs to do, not that I am able to do them :D [07:02:06] :) [07:04:27] how about you? All good? [07:06:03] all good elukey - holidays were refreshing, and I managed not to break anything last week :) [07:06:51] :) [07:07:04] I see that we are again at ~2PB [07:07:05] sigh [07:07:32] yeah elukey - I deleted 2 snapshots yesterday, but it feels our data is growing :( [07:07:59] elukey: should we devise a daily scan helping in understanding which data grows? [07:08:40] could be an option yes [07:08:59] inside home dirs there are a lot of TBs stored IIRC [07:09:23] hm [07:09:57] but I might remember wrong [07:10:33] joal: the other thing that I am planning to do is upgrade ram on an-master* during the next couple of weeks [07:10:49] to 128G, we have ~44M files, time to bump the heap of the namenodes [07:10:55] elukey: there a some Tb in users folders, but not that much in comparison of where is stored for prod [07:11:07] ah yes sure [07:14:43] elukey: users data folders of more than 1Tb sums up to about 200Tb - While this is not small this is not the bulk of our usage [07:15:21] elukey: bumping RAM will be great - IIRC the hardware is ready, sync about shutdown restart is needed [07:16:24] joal: sure not the bulk of usage but if there is garbage in there I'd love to remove it :D it is surely time consuming since we'd need to reach out to people though [07:16:47] makes sense elukey - and it is indeed time consuming [07:19:30] (03CR) 10Joal: [V: 03+2 C: 03+2] "Code looks good - I'm not a big fan of having druid+cassandra serving layers in the same file, but this is a detail and I can't find a bet" [analytics/aqs] - 10https://gerrit.wikimedia.org/r/593660 (https://phabricator.wikimedia.org/T238365) (owner: 10Lex Nasser) [07:20:19] (03Merged) 10jenkins-bot: Add editors by country data to AQS [analytics/aqs] - 10https://gerrit.wikimedia.org/r/593660 (https://phabricator.wikimedia.org/T238365) (owner: 10Lex Nasser) [10:32:47] * elukey lunch! [10:32:53] (be back in a couple of hours) [11:23:45] (03CR) 10Mforns: "> Patch Set 2: Verified+2 Code-Review+2" [analytics/aqs] - 10https://gerrit.wikimedia.org/r/593660 (https://phabricator.wikimedia.org/T238365) (owner: 10Lex Nasser) [12:06:18] yay, we have a lukey :) [12:40:44] 10Analytics: Location of prior analyst folder - https://phabricator.wikimedia.org/T261203 (10EYener) [12:47:30] 10Analytics: Location of prior analyst folder - https://phabricator.wikimedia.org/T261203 (10elukey) 05Open→03Resolved a:03elukey @EYener everything was deleted as part of https://phabricator.wikimedia.org/T252364, it is the usual process that we follow for these things :( We don't have backup so data can... [12:51:19] 10Analytics: Location of prior analyst folder - https://phabricator.wikimedia.org/T261203 (10EYener) Thank you @elukey ! Unfortunately, I can't view that task. Would you be able to grant me access to https://phabricator.wikimedia.org/T252364 so that I can potentially plan on future use cases? [12:54:19] (03PS8) 10Milimetric: Allow more than one dimension to be filtered in Wikistats [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/612574 (https://phabricator.wikimedia.org/T255757) (owner: 10Fdans) [13:05:56] woohoo hello luca! [13:06:10] o/ [13:08:21] 10Analytics, 10CAS-SSO: Allow login to JupyterHub via CAS - https://phabricator.wikimedia.org/T260386 (10Ottomata) > Regarding google auth/TOTP specificly, we did test it during the prototyping phase and it should be easy to enable again however there is an admin overhead we need to consider untill we have a p... [13:10:43] 10Analytics: Location of prior analyst folder - https://phabricator.wikimedia.org/T261203 (10EYener) 05Resolved→03Open [13:10:58] 10Analytics, 10CAS-SSO: Allow login to JupyterHub via CAS - https://phabricator.wikimedia.org/T260386 (10MoritzMuehlenhoff) >>! In T260386#6409120, @Ottomata wrote: >> Regarding google auth/TOTP specificly, we did test it during the prototyping phase and it should be easy to enable again however there is an ad... [13:26:06] 10Analytics, 10CAS-SSO: Allow login to JupyterHub via CAS - https://phabricator.wikimedia.org/T260386 (10Ottomata) Hm, I guess we could enable CAS + HW 2FA, but keep the ssh tunnel support for users without the HW? [13:30:54] welcome back elukey =] [13:31:38] 10Analytics, 10CAS-SSO: Allow login to JupyterHub via CAS - https://phabricator.wikimedia.org/T260386 (10MoritzMuehlenhoff) >>! In T260386#6409164, @Ottomata wrote: > Hm, I guess we could enable CAS + HW 2FA, but keep the ssh tunnel support for users without the HW? That'll unlikely work on the same installat... [13:32:46] 10Analytics, 10CAS-SSO: Allow login to JupyterHub via CAS - https://phabricator.wikimedia.org/T260386 (10elukey) >>! In T260386#6409164, @Ottomata wrote: > Hm, I guess we could enable CAS + HW 2FA, but keep the ssh tunnel support for users without the HW? I like this option, seems feasible. The only thing tha... [13:33:07] mforns: hola Marcel :) [13:33:45] 10Analytics, 10CAS-SSO: Allow login to JupyterHub via CAS - https://phabricator.wikimedia.org/T260386 (10Ottomata) > httpd + mod_cas for the authentication part if possible, not relying on jupyterhub's one Hm, makes sense. Might be easier to set up too. [13:51:24] (03PS6) 10Milimetric: Add filter/split component to Wikistats [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/613114 (https://phabricator.wikimedia.org/T249758) (owner: 10Fdans) [13:53:19] elukey: who are you [13:54:15] jk elukey I miz u [14:00:35] <3 [14:01:22] 10Analytics: Check home/HDFS leftovers of lulu - https://phabricator.wikimedia.org/T261089 (10mforns) All /home/lulu folders on stat100[4-8] are empty. Also, /user/lulu in HDFS is empty. Moving to the done column! [14:01:38] 10Analytics, 10Analytics-Kanban: Check home/HDFS leftovers of lulu - https://phabricator.wikimedia.org/T261089 (10mforns) [14:03:55] 10Analytics: Location of prior analyst folder - https://phabricator.wikimedia.org/T261203 (10elukey) @EYener you should be able to see the task now :) [14:04:11] 10Analytics: Check home/HDFS leftovers of drossi/fsalutari - https://phabricator.wikimedia.org/T258788 (10mforns) ping @Gilles :] [14:05:09] 10Analytics: Location of prior analyst folder - https://phabricator.wikimedia.org/T261203 (10EYener) 05Open→03Resolved Thank you @elukey, I can! [14:06:51] 10Analytics: Check home/HDFS leftovers of drossi/fsalutari - https://phabricator.wikimedia.org/T258788 (10elukey) Forgot to mention on this task - I pinged Gilles a while ago and he pinged Flavia, but I think we've never got any response. [14:27:21] mforns: yt? got a few mins to brain bounce some event utils stuff with me? [14:28:18] 10Analytics: Check home/HDFS leftovers of drossi/fsalutari - https://phabricator.wikimedia.org/T258788 (10mforns) @elukey should I delete the data then? [14:34:16] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Event-Platform, and 2 others: Automate ingestion and refinement into Hive of event data from Kafka using stream configs and canary/heartbeat events - https://phabricator.wikimedia.org/T251609 (10Ottomata) Update. wikimedia-event-utilities is rea... [14:36:06] ottomata: yes [14:36:12] bc? [14:36:45] ya! [14:37:06] am there mforns [14:37:17] ok, give me 1 min! [14:40:13] 10Analytics-Radar, 10Technical-blog-posts: Story idea for Blog: The Best Dataset on Wikimedia Content and Contributors - https://phabricator.wikimedia.org/T259559 (10srodlund) @Millimetric Great! I will take a look at this today or tomorrow. We could set up a synch meeting for Thursday Aug 27) or early next w... [14:51:22] hiya gehel yt? [14:51:34] i'm not sure if I've done something wrong with my package/module naming [14:51:35] https://archiva.wikimedia.org/#artifact~releases/org.wikimedia/eventutilities/1.0.0 [14:51:47] i can't seem to add it as a dependency in the refinery-core pom [14:51:59] Dependency 'org.wikimedia:eventutilities:1.0.0' not found [14:52:42] you have the full error? is it using the correct maven repo? [14:52:54] java: package org.wikimedia.eventutilities.core.event does not exist [14:52:55] ? [14:53:09] link to the code? [14:53:43] (03PS1) 10Ottomata: [WIP] Add wikimedia eventutilities as a dep [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/622369 [14:54:05] gehel: [14:54:05] hm [14:54:23] hm i get a different error on the CLI mvn [14:54:27] so this must be an intellij problem [14:54:39] (03CR) 10jerkins-bot: [V: 04-1] [WIP] Add wikimedia eventutilities as a dep [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/622369 (owner: 10Ottomata) [14:55:06] mvn package -pl refinery-core on the cli downloaded eventutilities 1.0.0 and then gave me a difffernet compile error [14:55:06] hm [14:55:49] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review: [Entropy alarms] Restrict the RSVD analysis to the last N data-points - https://phabricator.wikimedia.org/T257691 (10mforns) [14:59:01] there are 2 JsonLoadingException, one in org.wikimedia.eventutilities.core.json and one in org.wikimedia.analytics.refinery.core.jsonschema [14:59:46] EventLoggingSchemaLoader.load() (L111) as an incoherent signature with the overriden method [15:00:04] yeah, i get that too on compile on CLI [15:00:22] thanks for looking gehel i think its a problem with intellij [15:00:26] there are some visibility issues with schemaLoader.cache (which point to design issues) [15:00:29] yes yes [15:00:47] do you have intellij configured to reload the project when the pom.xml changes? [15:00:48] i see those too am working on that now, but i had thought there was another issue with archiva or class module [15:00:53] my intellij won't compile [15:00:59] but mvn cli will and I see the same things you do [15:01:03] I think it's not enabled by default [15:01:07] enabled? [15:01:43] archiva you mean? [15:01:47] https://github.com/wikimedia/analytics-refinery-source/blob/master/pom.xml#L90-L94 [15:01:51] that should do it, right? [15:02:27] File -> Settings -> Build, Execution, Deployment -> Build Tools -> Reload project after changes in the build script [15:03:08] or CTRL-SHIFT-A -> Reload All Maven Projects [15:03:58] 10Analytics: Check home/HDFS leftovers of drossi/fsalutari - https://phabricator.wikimedia.org/T258788 (10Gilles) Sure, go ahead and delete the data. [15:03:58] thanky ou that seemed to work [15:04:05] i had to enable to reload for Any Changes [15:04:09] thank you!!! [15:04:21] no idea why reloading isn't on by default, I can see any reason not to [15:07:02] ok another q for you [15:07:12] i need to make changes to eventutilities now, clearly [15:07:28] i'd like to use those changes in refinery-core locally for testing before I release a new eventutilities [15:07:34] shoudl I be able to [15:07:36] mvn install [15:07:39] eventutilities [15:07:49] and then use 1.0.0-SNAPSHOT in refinery-source pom while I develop? [15:07:52] yep, that shoudl work [15:08:08] meeting, I'll keep an eye here, but focus is elsewhere [15:09:07] hm do I need to configure intellij to use the local .m2 repo with snapshots? [15:09:12] k thanks [15:09:31] I don't think you need any special configuration in intellij [15:16:33] 10Analytics-Radar, 10Performance-Team, 10MW-1.36-notes (1.36.0-wmf.4; 2020-08-11): Invalid navigation timing events - https://phabricator.wikimedia.org/T254606 (10Gilles) @ottomata is there a way for us to track a metric of how often these schemas are now hitting their min/max values in Grafana? [15:35:54] 10Analytics, 10Analytics-EventLogging, 10Event-Platform, 10Performance-Team: Parse user agents in navtiming instead of relying on eventlogging to do it - https://phabricator.wikimedia.org/T260580 (10Gilles) Maybe we should switch to something like https://github.com/ua-parser/uap-python instead? [15:37:49] (03PS2) 10Ottomata: Use EventSchemaLoader from org.wikimedia.eventutilities [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/622369 (https://phabricator.wikimedia.org/T251609) [15:39:57] 10Analytics, 10Analytics-EventLogging, 10Event-Platform, 10Performance-Team: Parse user agents in navtiming instead of relying on eventlogging to do it - https://phabricator.wikimedia.org/T260580 (10Gilles) Ah, I see that it does use that library 🙂 [15:44:03] (03CR) 10jerkins-bot: [V: 04-1] Use EventSchemaLoader from org.wikimedia.eventutilities [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/622369 (https://phabricator.wikimedia.org/T251609) (owner: 10Ottomata) [15:47:43] !log restart mariadb@analytics_meta on db1108 to apply a replication filter (exclude superset_staging database from replication) [15:47:44] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [15:50:06] 10Analytics, 10Analytics-EventLogging, 10Event-Platform, 10Performance-Team: Parse user agents in navtiming instead of relying on eventlogging to do it - https://phabricator.wikimedia.org/T260580 (10Gilles) @Ottomata were you previously taking care of updates for python-ua-parser's Wikimedia deb package? I... [15:55:28] 10Analytics, 10Analytics-EventLogging, 10Event-Platform, 10Performance-Team: Parse user agents in navtiming instead of relying on eventlogging to do it - https://phabricator.wikimedia.org/T260580 (10Gilles) Also I'm not sure where `http.request_headers` is supposed to come from in the context of navtiming.... [15:55:30] 10Analytics, 10Analytics-EventLogging, 10Event-Platform, 10Performance-Team: Parse user agents in navtiming instead of relying on eventlogging to do it - https://phabricator.wikimedia.org/T260580 (10Nuria) @Gilles : anyone in our team (or others) can do updates, see: https://wikitech.wikimedia.org/wiki/Ana... [15:55:41] 10Analytics, 10Analytics-EventLogging, 10Event-Platform, 10Performance-Team: Parse user agents in navtiming instead of relying on eventlogging to do it - https://phabricator.wikimedia.org/T260580 (10Gilles) p:05Triage→03Medium [15:55:47] 10Analytics: Add urlshortener button to Turnilo - https://phabricator.wikimedia.org/T233336 (10elukey) I tried the following: ` export https_proxy=http://webproxy.eqiad.wmnet:8080 elukey@an-tool1005:~$ curl 'https://api-rw.discovery.wmnet/w/api.php?action=shortenurl&url=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FA... [15:56:57] 10Analytics, 10Analytics-EventLogging, 10Event-Platform, 10Performance-Team: Parse user agents in navtiming instead of relying on eventlogging to do it - https://phabricator.wikimedia.org/T260580 (10Gilles) @nuria I just want to know if you'll keep taking care of those updates for the Python library, or if... [16:01:31] ping ottomata [16:02:28] ottomata: ping ping :) [16:04:20] 10Analytics, 10Analytics-EventLogging, 10Event-Platform, 10Performance-Team: Parse user agents in navtiming instead of relying on eventlogging to do it - https://phabricator.wikimedia.org/T260580 (10Ottomata) > were you previously taking care of updates for python-ua-parser's Wikimedia deb package? If so,... [16:16:53] 10Analytics, 10Analytics-Kanban, 10Product-Analytics (Kanban): Whitelist new VisualEditorFeatureUse fields - https://phabricator.wikimedia.org/T256048 (10MNeisler) Thanks @Ottomata and @mforns! I'll make sure to add the #Analytics tag to these tasks in the future. [16:17:07] 10Analytics, 10Analytics-Kanban: Whitelist new VisualEditorFeatureUse fields - https://phabricator.wikimedia.org/T256048 (10MNeisler) [16:26:10] 10Analytics, 10Analytics-Kanban, 10Operations, 10netops: Add more dimensions in the netflow/pmacct/Druid pipeline - https://phabricator.wikimedia.org/T254332 (10Nuria) a:03fdans [16:57:44] 10Analytics, 10Analytics-EventLogging, 10Event-Platform, 10Performance-Team: Parse user agents in navtiming instead of relying on eventlogging to do it - https://phabricator.wikimedia.org/T260580 (10Gilles) @ottomata So I should be looking for it in `meta['event']['http']` where `meta` is the json object w... [17:09:59] 10Analytics-Clusters, 10Operations, 10ops-eqiad: (Need By: TBD) upgrade ram in an-master100[12] - https://phabricator.wikimedia.org/T259162 (10elukey) @Jclark-ctr I'd say 5/10 minutes for each host to do proper failover, and the host can stay down even for half an hour but better if less of course :) [17:18:10] 10Analytics, 10Analytics-EventLogging, 10Event-Platform, 10Performance-Team: Parse user agents in navtiming instead of relying on eventlogging to do it - https://phabricator.wikimedia.org/T260580 (10Ottomata) Hm, no, if `meta` is the variable that contains the full event object (including the 'capsule', wh... [17:23:43] I am going offline in a bit :) [17:26:04] elukey: what was the database name you said? I can not find it :[ [17:27:08] 10Analytics-Radar, 10Technical-blog-posts: Story idea for Blog: The Best Dataset on Wikimedia Content and Contributors - https://phabricator.wikimedia.org/T259559 (10Milimetric) @srodlund: both work, but maybe since you have Andrew's 3 to get through, let's do next week? I'll pencil something in. [17:29:01] 10Analytics-Radar, 10Technical-blog-posts: Story idea for Blog: The Best Dataset on Wikimedia Content and Contributors - https://phabricator.wikimedia.org/T259559 (10srodlund) Feel free to look at my calendar and invite me to a meeting. It is up to date :-) [17:30:04] 10Analytics-Radar, 10Technical-blog-posts: Story idea for Blog: The Best Dataset on Wikimedia Content and Contributors - https://phabricator.wikimedia.org/T259559 (10Ottomata) Sarah has already reviewed mine, i'm mostly just waiting for reviews from you all (and also for response from Confluent about a potenti... [17:34:42] mforns: should be flavia_eventtiming_total in theory [17:35:07] elukey: there's no such database. or I'm missing sth.. [17:35:32] maybe she already dropped that [17:35:41] mforns: then I think the data is abandoned, we can drop it from HDFS [17:35:44] yep yep [17:35:48] ok, cool thanks! [17:36:25] super [17:36:30] going offline! ttl! [17:36:43] byeeeee [17:39:51] 10Analytics: Check home/HDFS leftovers of drossi/fsalutari - https://phabricator.wikimedia.org/T258788 (10mforns) OK, deleted fsalutari's data in Hive warehouse directory as well. The only thing remaining now is the stat100* home folders. [17:40:30] 10Analytics, 10Analytics-Kanban: Check home/HDFS leftovers of drossi/fsalutari - https://phabricator.wikimedia.org/T258788 (10mforns) a:03elukey [17:41:36] 10Analytics, 10Analytics-Kanban, 10Analytics-Wikistats: Stats for newer projects not available - https://phabricator.wikimedia.org/T258033 (10mforns) a:03mforns [17:48:32] Test T258788 [17:48:33] T258788: Check home/HDFS leftovers of drossi/fsalutari - https://phabricator.wikimedia.org/T258788 [17:54:12] (03PS3) 10Ottomata: Use EventSchemaLoader from org.wikimedia.eventutilities [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/622369 (https://phabricator.wikimedia.org/T251609) [17:54:29] James_F: hi! meet razzi, our newest team member. Could you pretty please add him to #wikimedia-staff? (btw you're still in the docs as the one to ask for this, do correct me if there are better ways) [17:54:55] Welcome razzi! [17:55:24] Hi James! Thank you [17:55:33] milimetric, razzi: Done. Just /join. [17:55:39] thanks so much! [17:55:51] milimetric: And yeah, I'm one of the good people to ask still. :-) [17:56:01] <3 [17:56:05] Thanks! [18:56:14] razzi: WELCOME! [19:47:24] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review, 10Platform Team Workboards (Initiatives): reportupdater Pingback reports are broken and need to be refactored - https://phabricator.wikimedia.org/T246154 (10CCicalese_WMF) Interesting. Seven times smaller does seem large, but we haven't had a good sense... [20:49:59] java is the wooooorrrrst [20:50:01] https://stackoverflow.com/questions/1998544/method-has-the-same-erasure-as-another-method-in-type/8467804