[00:00:52] Analytics, Product-Analytics: Re-process webrequests from 2020-05-18 so that page views from latest Wikipedia app releases are counted - https://phabricator.wikimedia.org/T256516 (Nuria) Given that after the changes done to the pageview definition for the apps (https://gerrit.wikimedia.org/r/c/analytics/...
[00:59:02] Analytics-Radar, Core Platform Team, Dumps-Generation: HTML Dumps - June/2020 - https://phabricator.wikimedia.org/T254275 (RBrounley_WMF)
[06:52:46] good morning :)
[06:53:02] I am wondering if archiva1001 (the old one) can be finally dropped
[06:59:11] elukey: it seems so!
[06:59:17] elukey: hi :)
[07:00:07] ack! will wait for gehe*l's confirmation before proceeding just in case, but I should be able to drop it today :)
[08:23:30] tried the cookbook for the rollback and one little thing didn't work, but it was a minor problem, will send a patch
[08:23:47] I need to take care of another weird corner case and we should be ok after it
[09:57:54] Analytics-Clusters, Analytics-Kanban, Patch-For-Review: Move Archiva to Debian Buster - https://phabricator.wikimedia.org/T252767 (elukey) Verified on backup1001 that archiva1002 is correctly using bacula: ` Terminated Jobs: JobId Level Files Bytes Status Finished Name =========...
[10:06:03] Analytics-Clusters, Analytics-Kanban, Patch-For-Review: Move Archiva to Debian Buster - https://phabricator.wikimedia.org/T252767 (ops-monitoring-bot) cookbooks.sre.hosts.decommission executed by elukey@cumin1001 for hosts: `archiva1001.wikimedia.org` - archiva1001.wikimedia.org (**PASS**) - Downti...
[10:08:05] Analytics-Clusters, Analytics-Kanban, Patch-For-Review: Move Archiva to Debian Buster - https://phabricator.wikimedia.org/T252767 (elukey) Tried to run the decom cookbook and got: ` elukey@cumin1001:~$ sudo cookbook sre.hosts.decommission -t T252767 archiva1001.wikimedia.org START - Cookbook sre.hos...
[10:12:27] !log decom archiva1001
[10:12:30] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[10:25:49] Analytics-Clusters, Analytics-Kanban, Patch-For-Review: Move Archiva to Debian Buster - https://phabricator.wikimedia.org/T252767 (elukey)
[10:26:05] Analytics, Analytics-Kanban: Unique devices, retrofit with bot detection code - https://phabricator.wikimedia.org/T250744 (JAllemandou) See https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Traffic/Unique_Devices/Automated_traffic_correction
[10:26:17] elukey: if you have a minute to proofread: https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Traffic/Unique_Devices/Automated_traffic_correction
[10:26:22] Analytics-Clusters, Analytics-Kanban, Patch-For-Review: Move Archiva to Debian Buster - https://phabricator.wikimedia.org/T252767 (elukey) archiva1001 decommed, task completed!
[10:28:26] sure
[10:31:09] Analytics, Product-Analytics, Epic: API pageview counts for 'Mobile app' are incorrect since switch to mobile-html - https://phabricator.wikimedia.org/T256508 (JoeWalsh) @Nuria sounds good, I'd be happy to talk about that. I set up a meeting to discuss.
[10:34:05] joal: looks good to me!
[10:34:14] cool elukey :)
[10:34:22] thanks!
[10:34:34] interesting reading :)
[10:34:45] now I get more what you mean with "Actors" :D
[10:35:21] going afk for lunch!! ttl :)
[12:58:24] Analytics, Analytics-Kanban: Rename pageview_actor_hourly to pageview_actor - https://phabricator.wikimedia.org/T256415 (Ottomata) I defer to Joseph as well but I think renaming it is the right thing to do for consistency's sake.
[13:23:06] elukey: dunno maybe you already do something like this
[13:23:07] https://gist.github.com/peelman/1773863
[13:23:10] it works great
[13:24:17] ottomata: Riccardo has been keeping up a similar script, lemme check where it is
[13:26:12] heh, i bet i'd find some interesting things taking a look in his puppet admin module files eh...? :)
[13:26:39] IIRC there was a phab paste somewhere
[13:28:15] so it is https://phabricator.wikimedia.org/rGUTI8abfa629ad8fb31f38bde053ac8899ec02189e2b
[13:28:20] I didn't know about wmf-utils
[13:28:22] TIL
[13:31:51] ottomata: it also populates known_hosts etc...
[13:40:50] huh!
[13:41:08] yeah 100% Riccardo's quality :D
[13:44:30] elukey: there's also wmf-sre-laptop
[13:44:47] cdanis: I am so n00b, wasn't aware
[13:44:56] it hasn't been well-publicized
[13:45:01] and it really should be a few different packages
[13:45:44] ah there is also pws! <3
[13:50:33] indeed :)
[14:12:42] cdanis: am looking for a quick +1 before I merge this https://gerrit.wikimedia.org/r/c/operations/puppet/+/610065
[14:12:52] and i see you have added a standard_package before so why not you? :p
[14:12:58] looking
[14:13:24] hah I've never heard of this program but I love the demo screenshots
[14:13:53] lgtm
[14:13:59] :)
[14:14:15] does it have an analogue of curl's --resolve ?
[14:15:12] you can set Host header ofc but it's nicer when you don't have to also disable SSL verification
[14:15:55] https://github.com/jakubroztocil/httpie/issues/99 :(
[14:20:44] OH REALLY daw
[14:20:49] yeah i use that all the time esp in k8s land
[15:01:22] milimetric: what was the name of the team that evan was on that you did wikimetrics with?
[15:08:18] I honestly lost track, I can't remember ottomata, like global programs grants learning evaluation independent GLAM 6th floor people who want to make the world a better place?
[15:09:00] hahaha
[15:09:02] yeah something like that
[15:29:13] Analytics, Analytics-EventLogging, Discovery: SearchSatisfaction has validation errors for event.query - https://phabricator.wikimedia.org/T257331 (Ottomata)
[15:30:47] hey a-team check it out! finally a useful logstash validation errors dashboard
[15:31:01] https://logstash.wikimedia.org/app/kibana#/discover/AXMlVWkuMQ_08tQas2Xi?_g=(refreshInterval%3A(display%3A'1%20minute'%2Cpause%3A!f%2Csection%3A2%2Cvalue%3A60000)%2Ctime%3A(from%3Anow-1h%2Cmode%3Aquick%2Cto%3Anow))
[15:31:06] dentist time, back laters
[15:40:20] Analytics, Product-Analytics (Kanban): Collect metrics/tables which might be touched by IP masking feature - https://phabricator.wikimedia.org/T255816 (jwang)
[17:06:16] ping fdans
[17:18:09] Analytics, Product-Analytics: Re-process webrequests from 2020-05-18 so that page views from latest Wikipedia app releases are counted - https://phabricator.wikimedia.org/T256516 (LGoto) a:kzimmerman
[17:20:50] Analytics-Radar, Better Use Of Data, Product-Analytics: session_tick stream configs - https://phabricator.wikimedia.org/T256311 (LGoto) p:Triage→Medium
[17:20:57] Analytics, Better Use Of Data, Event-Platform, Product-Analytics: EventLogging MEP Upgrade Phase 3 (Stream cc-ing) - https://phabricator.wikimedia.org/T256165 (LGoto) p:Triage→Medium
[18:07:07] Analytics-Clusters, Product-Analytics: Request admin access to Superset - https://phabricator.wikimedia.org/T255207 (cchen) Thanks for granting us the access @elukey. I just checked with Maya, both of us can see the user list and update dashboards now.
[18:31:00] Analytics, Operations, Traffic: Add backend field to webrequest Hive table - https://phabricator.wikimedia.org/T257354 (Ottomata)
[18:31:10] Analytics, Operations, Traffic: Add backend field to webrequest Hive table - https://phabricator.wikimedia.org/T257354 (Ottomata)
[18:42:53] Analytics, Analytics-EventLogging, Analytics-Kanban, Event-Platform, and 2 others: EventGate validation errors should be visible in logstash - https://phabricator.wikimedia.org/T116719 (Ottomata) @Krinkle, finally got something good here. > Where do they end up in Logstash exactly? https://logs...
[18:59:19] Analytics, Analytics-Kanban: unique devices for project families, bug in signature computation for automatic traffic detection - https://phabricator.wikimedia.org/T257358 (Nuria)
[18:59:37] Analytics, Analytics-Kanban: unique devices for project families, bug in signature computation for automatic traffic detection - https://phabricator.wikimedia.org/T257358 (Nuria) a:JAllemandou
[18:59:56] Analytics, Analytics-Kanban: Unique devices, retrofit with bot detection code - https://phabricator.wikimedia.org/T250744 (Nuria) Docs are good, still https://phabricator.wikimedia.org/T257358 pending
[19:03:32] Analytics, Analytics-EventLogging: NameError: name 'urlopen' is not defined - https://phabricator.wikimedia.org/T257360 (dbarratt)
[19:13:18] Analytics, Analytics-EventLogging: NameError: name 'urlopen' is not defined - https://phabricator.wikimedia.org/T257360 (dbarratt) I suppose a solution to this problem would be to have the devserver in a custom image. That would also make it easier to use with Docker. :) Though that might be unrelated to...
[19:47:17] Analytics-Clusters, Analytics-Radar, Operations, ops-eqiad: Renamed notebook1003 and notebook1004 - https://phabricator.wikimedia.org/T256397 (Jclark-ctr) @elukey notebook1004 to an-scheduler1001 i do not see this in netbox. either names.
[19:51:02] Analytics-Clusters, Analytics-Radar, Operations, ops-eqiad: Renamed notebook1003 and notebook1004 - https://phabricator.wikimedia.org/T256397 (Dzahn) @Jclark-ctr They are [[ https://netbox.wikimedia.org/dcim/devices/210/ | device 210 ]] and [[ https://netbox.wikimedia.org/dcim/devices/702/ | devi...
[19:52:24] Analytics-Clusters, Analytics-Radar, Operations, ops-eqiad: Renamed notebook1003 and notebook1004 - https://phabricator.wikimedia.org/T256397 (Jclark-ctr) @Dzahn sorry that was my mistake. Thanks!
[20:00:13] Analytics-Clusters, Analytics-Radar, Operations, ops-eqiad: Renamed notebook1003 and notebook1004 - https://phabricator.wikimedia.org/T256397 (Jclark-ctr) Open→Resolved Relabeled host and resolved ticket
[20:00:16] Analytics-Clusters, Analytics-Kanban, Patch-For-Review: Repurpose notebook100[3,4] - https://phabricator.wikimedia.org/T256363 (Jclark-ctr)
[20:06:38] Analytics-Radar, Operations, Traffic, Privacy: Connect Hadoop records of the same request coming via different channels - https://phabricator.wikimedia.org/T113817 (Ottomata) Open→Resolved a:Ottomata Going to resolve this instead of declining. For EventBus generated events, including...
[20:07:12] (PS1) Joal: Correct unique-devices per-project-family bug [analytics/refinery] - https://gerrit.wikimedia.org/r/610151 (https://phabricator.wikimedia.org/T257358)
[20:20:20] Analytics, Analytics-EventLogging, Analytics-Kanban, Event-Platform, and 2 others: Add examples to all event schemas - https://phabricator.wikimedia.org/T242454 (Ottomata) a:Ottomata
[20:21:19] (PS1) Joal: Update clickstream rename pageview_actor_hourly [analytics/refinery/source] - https://gerrit.wikimedia.org/r/610153 (https://phabricator.wikimedia.org/T256415)
[20:27:33] Analytics, Analytics-EventLogging: Error: Additional properties are not allowed ('clientValidated' was unexpected) - https://phabricator.wikimedia.org/T257364 (dbarratt)
[20:28:53] Analytics, Analytics-EventLogging, Patch-For-Review: NameError: name 'urlopen' is not defined - https://phabricator.wikimedia.org/T257360 (dbarratt) a:dbarratt→Krinkle
[20:28:55] Analytics, Analytics-EventLogging, Patch-For-Review: NameError: name 'urlopen' is not defined - https://phabricator.wikimedia.org/T257360 (dbarratt) Open→Resolved a:dbarratt
[20:29:39] Analytics-Radar, Operations, Traffic, Privacy: Connect Hadoop records of the same request coming via different channels - https://phabricator.wikimedia.org/T113817 (Tgr) The task was about connecting webrequest data and MediaWiki API logs (or more generally, MediaWiki logs), though, and webreques...
[20:31:55] Analytics-Radar, Operations, Traffic, Privacy: Connect Hadoop records of the same request coming via different channels - https://phabricator.wikimedia.org/T113817 (Ottomata) Resolved→Open Hm, you are right, but that is not clear from the task description. I'll edit it and leave open. I...
[20:32:22] Analytics-Radar, Operations, Traffic, Privacy: Add request_id to webrequest logs as well as other event records ingested into Hadoop - https://phabricator.wikimedia.org/T113817 (Ottomata)
[20:32:45] Analytics-Radar, Operations, Traffic, Privacy: Add request_id to webrequest logs as well as other event records ingested into Hadoop - https://phabricator.wikimedia.org/T113817 (Ottomata)
[20:33:35] (PS1) Joal: Rename pageview_actor_hourly to pageview_actor [analytics/refinery] - https://gerrit.wikimedia.org/r/610159 (https://phabricator.wikimedia.org/T256415)
[20:35:38] ok, done for tonight - see you tomorrow team (kids day, will start around 3:30pm CEST)
[20:35:44] byeeooo
[20:39:23] (CR) Ottomata: [C: +1] Rename pageview_actor_hourly to pageview_actor [analytics/refinery] - https://gerrit.wikimedia.org/r/610159 (https://phabricator.wikimedia.org/T256415) (owner: Joal)
[20:55:51] Analytics, Product-Analytics, Epic: Calculate impact of missing mobile app pageviews to high-level metrics - https://phabricator.wikimedia.org/T257373 (kzimmerman)
[20:56:11] Analytics, Product-Analytics: Calculate impact of missing mobile app pageviews to high-level metrics - https://phabricator.wikimedia.org/T257373 (kzimmerman)
[20:56:22] Analytics, Product-Analytics: Calculate impact of missing mobile app pageviews to high-level metrics - https://phabricator.wikimedia.org/T257373 (kzimmerman) p:Triage→High
[21:35:06] kinda random hive-ish question, how transactional is replacing a table content (such as how canonical_data.wikis is unpartitioned, but is sometimes updated). I'm thinking of a similar approach for a new thing i'm doing that needs to source live configuration from the wikis, there is little or no value to historical data so it could be simplified to wholly unpartitioned.
[21:35:35] Basically, should i expect problems if we simply `INSERT OVERWRITE TABLE ...` without partitions?
[21:45:47] Analytics, Analytics-EventLogging, dev-images: EventLogging dev image should have verbose output enabled - https://phabricator.wikimedia.org/T257378 (dbarratt)
[21:46:02] Analytics, Analytics-EventLogging, dev-images: EventLogging dev image should have verbose output enabled - https://phabricator.wikimedia.org/T257378 (dbarratt)
[22:08:23] ebernhardson: is the table large or small?
[22:11:19] Analytics, Product-Analytics: Calculate impact of missing mobile app pageviews to high-level metrics - https://phabricator.wikimedia.org/T257373 (Nuria) >based on the data we lost? ( Data is not lost, app interactions have changed significantly though so w/o understanding the new app behaviour it will be...
[22:12:20] nuria: relatively tiny, it's (wikiid, namespace_id, elasticsearch_index_name), so ~800 * 50 or so
[22:12:27] in hive terms, extra tiny :)
[22:17:23] ebernhardson: i do not think there are any issues of not having partitions if tables are small
[22:17:45] ebernhardson: cause if your table is small enough there is no issue with reading the whole table every time
[22:17:58] nuria: alright, excellent. That makes sense. Thanks!
[22:22:15] (Abandoned) Nuria: Correcting docs [analytics/refinery/source] - https://gerrit.wikimedia.org/r/609558 (owner: Nuria)
[22:24:57] Analytics-Radar, Operations, Traffic, Privacy: Add request_id to webrequest logs as well as other event records ingested into Hadoop - https://phabricator.wikimedia.org/T113817 (Tgr) >>! In T113817#6287140, @Ottomata wrote: > Hm, you are right, but that is not clear from the task description. I...
[22:30:57] (CR) Nuria: [C: +1] Update clickstream rename pageview_actor_hourly [analytics/refinery/source] - https://gerrit.wikimedia.org/r/610153 (https://phabricator.wikimedia.org/T256415) (owner: Joal)
[22:31:42] (CR) Nuria: [C: +1] "Super thanks for doing these changes" [analytics/refinery] - https://gerrit.wikimedia.org/r/610159 (https://phabricator.wikimedia.org/T256415) (owner: Joal)
[22:36:47] (CR) Nuria: "Let me know if you strongly disagree with my suggestion" (1 comment) [analytics/refinery] - https://gerrit.wikimedia.org/r/610151 (https://phabricator.wikimedia.org/T257358) (owner: Joal)
[23:04:35] (CR) Nuria: "I have a hard time CR this cause I do not know how to best test it, maybe you can show me how can it be tested (besides running the denorm" (1 comment) [analytics/refinery/source] - https://gerrit.wikimedia.org/r/609465 (https://phabricator.wikimedia.org/T255548) (owner: Joal)
[23:57:16] Analytics, Product-Analytics: Re-process webrequests from 2020-05-18 so that page views from latest Wikipedia app releases are counted - https://phabricator.wikimedia.org/T256516 (JoeWalsh) There's at least one bug with the updated definition and it's my fault. The iOS app is sending the wrong `User-Age...