[00:13:40] 10Analytics, 10Analytics-Kanban, 10Fundraising-Backlog, 10MediaWiki-extensions-CentralNotice: Refining is failing to refine centranoticeimpression events - https://phabricator.wikimedia.org/T244771 (10Nuria) Closing as all data is re-refined and accessible. [00:13:50] 10Analytics, 10Analytics-Kanban, 10Fundraising-Backlog, 10MediaWiki-extensions-CentralNotice: Refining is failing to refine centranoticeimpression events - https://phabricator.wikimedia.org/T244771 (10Nuria) 05Open→03Resolved [00:16:33] 10Analytics, 10Fundraising-Backlog, 10WMDE-Analytics-Engineering, 10WMDE-FUN-Team, 10WMDE-Fundraising-Tech: Find a better way for WMDE to get impression counts for their banners - https://phabricator.wikimedia.org/T243092 (10Nuria) ping @kai.nissen that data is accessible in event table in hive, to query... [00:31:35] 10Analytics, 10Inuka-Team: Update EventLogging to accept events from KaiOS app - https://phabricator.wikimedia.org/T246295 (10Nuria) >It seems that EventLogging will need to be updated to whitelist the user agent of the KaiOS app so that the app's events are properly processed. mmm, no, that is not needed. Ca... [00:34:39] 10Analytics, 10Inuka-Team, 10Product-Analytics: Set up pageview counting for KaiOS app - https://phabricator.wikimedia.org/T244547 (10Nuria) When the app fetches this url: https://en.wikipedia.org/api/rest_v1/page/summary/Domesticated how does the UA look like? [00:41:51] 10Analytics, 10Inuka-Team: Update EventLogging to accept events from KaiOS app - https://phabricator.wikimedia.org/T246295 (10mpopov) @Nuria: how will the system know to accept events from the KaiOS app? Since the iOS and Android apps don’t send events from a Wikimedia domain I assumed the system had to be con... [00:45:15] 10Analytics, 10Inuka-Team: Update EventLogging to accept events from KaiOS app - https://phabricator.wikimedia.org/T246295 (10Nuria) >how will the system know to accept events from the KaiOS app? It works the same for iOs and android right? the events come from the users device with just a UA that follows a c... [00:53:31] 10Analytics, 10Inuka-Team: Update EventLogging to accept events from KaiOS app - https://phabricator.wikimedia.org/T246295 (10mpopov) > It works the same for iOs and android right? the events come from the users device with just a UA that follows a convention to parse the app, makes sense? So, in theory, we... [05:01:12] 10Analytics, 10Inuka-Team: Update EventLogging to accept events from KaiOS app - https://phabricator.wikimedia.org/T246295 (10Nuria) The UA formatting is only needed to parse the app version, nothing else. See data example below, UA does not appear as a UA for an app which is expected to send an app versio... [05:04:17] 10Analytics: Virtual pageviews should set access_type to mobile if webhost is a mobile one - https://phabricator.wikimedia.org/T246309 (10Nuria) [07:57:24] leila: sorry, awk for a while, but Martin and I are in touch and I will take a look at the etherpad too. [08:22:55] Hi team [08:27:12] hey! [08:27:17] Hi dcausse :) [08:27:37] joal: looked at T246237, thanks! :) [08:27:38] T246237: Extract some statistics on the use of the isBlank() function in wdqs query logs - https://phabricator.wikimedia.org/T246237 [08:27:39] I guess now is a better timing dcausse :) [08:27:44] \o/ [08:28:04] dcausse: I hope this helps, and as I say, I can get deeper into more precise info [08:28:24] quick question tho, function names are case-insensitive did you do case-insensitive matching? [08:28:28] dcausse: I gaining expertise in jena AST :) [08:28:34] oh... [08:28:46] you parsed the queries, awesome! [08:28:47] dcausse: I use the AST, so it's actually case unrelated :) [08:30:56] joal: next thing where I might need your help is T246238 [08:30:57] T246238: Investigate common qualifiers for “unknown value” statement main snaks - https://phabricator.wikimedia.org/T246238 [08:32:06] not sure if the dump you imported in hdfs might be a good fit [08:32:08] hm - a quick parse shows I could do with some explanations :) [08:32:13] dcausse: --^ [08:33:00] joal: do you have a couple mins now for quick chat? [08:33:09] for sure - batcave? [08:33:18] where is that? [08:33:34] https://meet.google.com/_meet/rxb-bjxn-nip [08:33:40] joining [08:33:53] I didn't dare say 'use notes A and B on the piano' [09:18:11] 10Analytics, 10Inuka-Team: Update EventLogging to accept events from KaiOS app - https://phabricator.wikimedia.org/T246295 (10hueitan) >>! In T246295#5921936, @nshahquinn-wmf wrote: > @hueitan what will the app's user agent look like? Here's one UA example from the device `Mozilla/5.0 (Mobile; Nokia_2720_Fli... [11:18:42] dcausse: I think I lost you :) [12:13:52] back! [12:30:42] Hi elukey - I hope news are good [12:34:00] yeah not bad :) [12:34:05] I'll update the email list in a bit [12:34:52] <3 [12:51:29] 10Analytics, 10Inuka-Team: Update EventLogging to accept events from KaiOS app - https://phabricator.wikimedia.org/T246295 (10mpopov) >>! In T246295#5922409, @Nuria wrote: > The UA formatting is only needed to parse the app version, nothing else. > See data example below, UA does not appear as a UA for an ap... [12:53:38] 10Analytics, 10Inuka-Team: Update EventLogging to accept events from KaiOS app - https://phabricator.wikimedia.org/T246295 (10mpopov) >>! In T246295#5922749, @hueitan wrote: >>>! In T246295#5921936, @nshahquinn-wmf wrote: >> @hueitan what will the app's user agent look like? > > Here's one UA example from the... [12:53:42] (03PS7) 10Fdans: Add language selection functionality to Wikistats [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/564047 (https://phabricator.wikimedia.org/T238752) [12:53:46] (03CR) 10Fdans: Add language selection functionality to Wikistats (031 comment) [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/564047 (https://phabricator.wikimedia.org/T238752) (owner: 10Fdans) [12:54:41] dcausse: I had a flash :) [12:56:59] dcausse: you can see my comments in the etherpad line 151 when ou're back :) [13:33:30] joal: looking :) [13:43:56] not sure I fully understand :/ [14:05:17] I just found out that on bigtop we have oozie 4.3, vs 4.1 on cdh [14:05:31] and for some reason the sharedlib stuff is a bit different [14:06:04] but I am getting closer [14:27:53] 10Analytics, 10Discovery, 10Wikidata, 10Wikidata-Query-Service: Add pageviews total counts to WDQS - https://phabricator.wikimedia.org/T174981 (10Lydia_Pintscher) Ordering by relevancy is a good use case. But I believe relevancy is about more than page views. We have T143424 to come up with a good measure... [14:35:41] dcausse: I had it wrong about the joins (joining to df645 or joining to df12) [14:36:15] dcausse: joining to df12 does the same thing as the filtering step we used, but in a clean way :) [14:50:39] awight: Hello :) [14:50:52] awight: I figured I ping you there instead of on wiki :) [14:52:09] awight: I'm willing to revert your change on https://wikitech.wikimedia.org/w/index.php?title=Analytics/Data_Lake, as the ORES data is currently only made available as events - Since your change was based on mine, I prefer to talk first and not start an edit-war :-P [14:57:33] joal: Ah! Okay, good to hear the reason behind the indent, please do revert :-) [14:58:36] Thanks awight - I'll also add a precision on the ores line [15:00:47] elukey: hello :) is now a correct moment to ask a question? [15:01:28] sure! [15:01:34] elukey: batcave? [15:01:41] coming [15:30:49] ottomata: want to talk about stream config loading? [15:43:50] joal: all good! [15:43:53] you can try to ssh now [15:43:54] \o/ [15:45:01] elukey: confirmed :) [15:46:05] super [15:58:20] I'm the worst train conductor! I'll do it now [15:58:24] deploying [15:58:53] mforns: holaaa [15:59:51] hip: hang on one bit i have another patch coming that willl make it much cleaner and clearer [16:06:56] (03PS1) 10Milimetric: Bump changelog.md to v0.0.115 [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/575275 [16:07:15] (03CR) 10Milimetric: [C: 03+2] Bump changelog.md to v0.0.115 [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/575275 (owner: 10Milimetric) [16:12:46] (03Merged) 10jenkins-bot: Bump changelog.md to v0.0.115 [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/575275 (owner: 10Milimetric) [16:13:17] ok hip just pushed new patch [16:13:39] much cleaner and avoids putting too much code in the generated JS snipped, and also uses mw.track('eventstream.' ...) very explicitly [16:13:42] let's talk! : [16:13:44] :) [16:13:51] snippet* [16:17:13] (03CR) 10Milimetric: [C: 03+2] Add language selection functionality to Wikistats [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/564047 (https://phabricator.wikimedia.org/T238752) (owner: 10Fdans) [17:01:05] ping milimetric [17:01:30] yep was in the bathroom [17:35:38] (03CR) 10DannyS712: "Recheck" [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/575289 (owner: 10L10n-bot) [17:36:58] (03CR) 10DannyS712: [C: 03+1] Localisation updates from https://translatewiki.net. [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/575289 (owner: 10L10n-bot) [17:56:30] 10Analytics, 10Fundraising-Backlog, 10WMDE-Analytics-Engineering, 10WMDE-FUN-Team, 10WMDE-Fundraising-Tech: Find a better way for WMDE to get impression counts for their banners - https://phabricator.wikimedia.org/T243092 (10kai.nissen) @Nuria, @AndyRussG I compared the numbers we received from `pgehres.... [18:06:13] 10Analytics, 10Analytics-Cluster: Hadoop Hardware Orders FY2019-2020 - https://phabricator.wikimedia.org/T243521 (10Ottomata) @wiki_willy We'd like to place these orders soon. I know there are issues with rackspace, etc. Can we plan out exactly what we need to do and get quotes and place orders? We'll need... [18:31:13] 10Analytics, 10Analytics-Kanban, 10Product-Analytics: Presto: missing partitions causes queries to fail - https://phabricator.wikimedia.org/T246034 (10nettrom_WMF) 05Open→03Resolved Queries in Hive and Presto now work as expected on this data from what I can tell, closing this as resolved. [18:33:58] 10Quarry, 10User-DannyS712: Unable to access Quarry: 504 timeout - https://phabricator.wikimedia.org/T246364 (10zhuyifei1999) Can't reproduce. Probably related to NFS maintenance a few hours ago. [18:38:53] 10Quarry, 10User-DannyS712: Unable to access Quarry: 504 timeout - https://phabricator.wikimedia.org/T246364 (10zhuyifei1999) 05Open→03Invalid [18:47:12] nuria: do you want a quick walkthrough of this patch? [18:47:51] ottomata: want to come to https://meet.google.com/kzt-kodc-hha? [18:48:13] I marcel online? [18:53:23] 10Analytics: Should reportupdater Pingback reports be refactored? - https://phabricator.wikimedia.org/T246154 (10CCicalese_WMF) @mforns Actually, I don't think that will work. Since the reports are cumulative, we need the old data to correctly accumulate usage values for past time periods. [18:59:40] sorry elukey, I got disconnected for some reason :[ [18:59:45] here if you need me [19:00:05] mforns: np! So there is a little issue [19:00:14] aha [19:00:48] root@an-launcher1001:/srv/reportupdater# ls output/metrics/published_cx2_translations/ [19:00:51] pages_with_unreviewed_translations.tsv [19:00:56] this is the current one, the hdfs [19:00:59] hdfs/hive [19:01:06] root@an-launcher1001:/srv/reportupdater# ls /home/elukey/reportupdater/output/metrics/published_cx2_translations/ [19:01:09] pages_with_unreviewed_translations.tsv published_cx2_translations.tsv published_cx2_translators.tsv [19:01:23] the tsv names are clashing [19:01:40] joal: dumps mountpoints on an-launcher1001 working! \o/ [19:01:44] elukey, oh! I think one of those reports got moved from mysql to hive [19:02:01] so the hive file should be bigger, should contain more recent dates, can you check that? [19:02:22] the hive one should be the most current, and the one that should be kept. but better to confirm [19:02:46] mforns: btw i refactored a little bit of code since yesterday but the concept is the same. would love comments. [19:02:53] ottomata, ok [19:02:57] will look today [19:05:43] mforns: let's check together if you have a sec [19:05:50] 10Analytics, 10Fundraising-Backlog, 10WMDE-Analytics-Engineering, 10WMDE-FUN-Team, 10WMDE-Fundraising-Tech: Find a better way for WMDE to get impression counts for their banners - https://phabricator.wikimedia.org/T243092 (10Nuria) I do not think those two datasets have the same sample size sample sizes,... [19:05:50] you should have access to an-launcher [19:05:55] elukey, sure to the batcave! [19:06:00] yes I do [19:12:41] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review: Make stats.wikimedia.org point to wikistats2 by default - https://phabricator.wikimedia.org/T237752 (10Nuria) 05Open→03Resolved [19:12:54] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review: Fix non MapReduce execution of GeoCode UDF - https://phabricator.wikimedia.org/T238432 (10Nuria) 05Open→03Resolved [19:13:54] 10Analytics, 10Analytics-Kanban, 10Operations, 10ops-eqiad, 10Patch-For-Review: Degraded RAID on analytics1044 - https://phabricator.wikimedia.org/T245910 (10Nuria) 05Open→03Resolved [19:15:21] 10Analytics, 10Analytics-Kanban, 10Fundraising-Backlog: Turnilo no longer showing sample-rate adjusted data for banner activity - https://phabricator.wikimedia.org/T241162 (10Nuria) mmm... i do not see it on turnilo: https://turnilo.wikimedia.org/#banner_activity_minutely/ cc @Milimetric [19:17:12] 10Analytics, 10Analytics-Kanban: Analytics datasets should be under a free license - https://phabricator.wikimedia.org/T244685 (10Nuria) 05Open→03Resolved [19:17:22] nuria: I probably have to restart turnilo, doing so now [19:17:26] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review: The guava error still persists in data quality bundles - https://phabricator.wikimedia.org/T241375 (10Nuria) 05Open→03Resolved [19:17:43] milimetric: i still do not see where that value comes from, is it in that datasource? [19:18:35] it's in the same HQL that loads the request_count, it just divides it to normalize and inserts into the table. It's been available in Druid this whole time, but I guess nobody needed it for a couple of years [19:19:25] mforns: all done, an-launcher1001 is now configured with all RU jobs! [19:19:28] * elukey dances [19:19:40] yyyyaaaay! [19:19:50] should we force the execution of one? [19:19:56] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10MW-1.35-notes (1.35.0-wmf.19; 2020-02-11), 10Performance-Team (Radar): EventLogging needs to enque events to avoid draining users' battery on mobile - https://phabricator.wikimedia.org/T225578 (10Nuria) 05Open→03Resolved [19:20:01] like reportupdater-published_cx2_translations_mysql [19:20:03] ok, nuria, turnilo restarted and measure is there [19:20:08] ok, I doubt thought that it will run any reports until tomorrow [19:20:18] and nuria: I added that you have to restart to the docs, so future us know [19:20:18] milimetric: k [19:20:25] mforns: okok, then next timer run should be in ~40 mins [19:20:31] elukey, do you want me to delete some data points, so that RU runs? [19:20:39] milimetric: closing ticket [19:20:47] you can schedule for rerunning too mforns / elukey [19:21:02] 10Analytics, 10Analytics-Kanban, 10Fundraising-Backlog: Turnilo no longer showing sample-rate adjusted data for banner activity - https://phabricator.wikimedia.org/T241162 (10Nuria) 05Open→03Resolved [19:21:27] milimetric, yes, although I added that feature I never used it, so I'm a bit afraid of that, but theoretically should work! [19:21:48] mforns: I have restarted reportupdater-ee-beta-features.service [19:21:54] basically running manually the timer [19:22:02] and I get [19:22:03] : RuntimeError: pymysql can not execute query ((1146, "Table 'testwiki.user_properties' doesn't exist")). [19:22:14] hmmmmmmm [19:22:30] 10Analytics, 10Analytics-Kanban, 10Fundraising-Backlog: Whitelist CentralNotice banner history events for sanitization and long-term storage - https://phabricator.wikimedia.org/T245285 (10Nuria) 05Open→03Resolved [19:22:49] ah nothing it was already broken on stat1006 [19:22:54] mforns: --^ [19:23:06] oh [19:23:11] ok, fiu [19:23:59] mforns: if you are ok I am inclined to go to dinner and check laterz [19:24:04] seems all ok atm [19:24:21] elukey, of course! we check tomorrow! it's not critical job/data [19:24:23] I am kinda surprised that this change went all done like this [19:24:32] no errors from puppet, etc.. [19:24:52] good puppeteer [19:25:42] 10Analytics, 10Analytics-Kanban: Fix wikitext-generation jobs (use 0.0.114 jar) - https://phabricator.wikimedia.org/T245496 (10Nuria) 05Open→03Resolved [19:25:55] nono I wasn't going to say that, usually if this happens I am suspicious that I've done something horribly wrong :D [19:26:01] anyway, let's see! [19:26:06] milimetric: i have my 1 on 1 with joseph in 5 mins, we can talk after? [19:26:14] ok [19:26:27] thanks mforns/milimetric! [19:26:30] ttl :) [19:27:01] 10Analytics, 10Analytics-Kanban: Fix hdfs-rsync`prune-empty-dirs` feature - https://phabricator.wikimedia.org/T243832 (10Nuria) 05Open→03Resolved [19:27:06] 10Analytics, 10Analytics-Kanban, 10Research-Backlog, 10Wikidata, 10Patch-For-Review: Copy Wikidata dumps to HDFS + parquet - https://phabricator.wikimedia.org/T209655 (10Nuria) [19:27:20] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review, 10User-Elukey: Allow all Analytics tools to work with Kerberos auth - https://phabricator.wikimedia.org/T226698 (10Nuria) [19:27:22] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review, 10User-Elukey: Shorten the time it takes to move files from hadoop to dump hosts by Kerberizing/hadooping the dump hosts - https://phabricator.wikimedia.org/T234229 (10Nuria) 05Open→03Resolved [19:27:43] 10Analytics, 10Analytics-Kanban: Fix sqoop after changes - https://phabricator.wikimedia.org/T242015 (10Nuria) 05Open→03Resolved [19:27:45] nuria: I have mine with Lex, but I'll see if he has time to reschedule [19:28:03] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Event-Platform, and 2 others: Modern Event Platform: Schema Registry: Implementation - https://phabricator.wikimedia.org/T206789 (10Nuria) [19:28:05] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Event-Platform, and 5 others: Public schema.wikimedia.org endpoint for schema.svc - https://phabricator.wikimedia.org/T233630 (10Nuria) 05Open→03Resolved [19:28:19] 10Analytics, 10Analytics-EventLogging, 10Event-Platform, 10Core Platform Team Legacy (Watching / External), 10Services (watching): Modern Event Platform: Stream Intake Service - https://phabricator.wikimedia.org/T201068 (10Nuria) [19:28:21] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Event-Platform, and 2 others: Set up TLS for eventgate-main and eventgate-analytics - https://phabricator.wikimedia.org/T241073 (10Nuria) 05Open→03Resolved [19:28:30] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Event-Platform, and 2 others: Modern Event Platform: Schema Registry: Implementation - https://phabricator.wikimedia.org/T206789 (10Nuria) 05Open→03Resolved [19:28:32] 10Analytics, 10Analytics-EventLogging, 10Event-Platform, 10Core Platform Team Legacy (Watching / External), 10Services (watching): Modern Event Platform: Schema Registry - https://phabricator.wikimedia.org/T201063 (10Nuria) [19:28:42] 10Analytics, 10Analytics-Kanban: Hive data quality alarms pipeline - https://phabricator.wikimedia.org/T235486 (10Nuria) 05Open→03Resolved [19:28:44] 10Analytics, 10Analytics-Kanban: Data Quality Alarms - https://phabricator.wikimedia.org/T198986 (10Nuria) [19:29:24] 10Analytics, 10Analytics-Kanban, 10Tool-Pageviews: Add mediarequests metrics to wikistats UI - https://phabricator.wikimedia.org/T234589 (10Nuria) 05Open→03Resolved [19:29:29] 10Analytics, 10Multimedia, 10Tool-Pageviews: Statistics for views of individual Wikimedia images - https://phabricator.wikimedia.org/T210313 (10Nuria) [19:29:30] elukey: let's move the resync jobs tomorrow? [19:29:37] 10Analytics, 10Analytics-Kanban, 10Release-Engineering-Team (Deployment services): issues with artifact cache in an-coord1001 - https://phabricator.wikimedia.org/T227132 (10Nuria) 05Open→03Resolved [19:29:56] 10Analytics, 10Analytics-Kanban: Make hdfs-rsync process sub-folders recursively - https://phabricator.wikimedia.org/T238326 (10Nuria) 05Open→03Resolved [19:30:14] 10Analytics, 10Analytics-Kanban: Remove tranquility and banner-impressions streaming from refinery-job - https://phabricator.wikimedia.org/T245151 (10Nuria) 05Open→03Resolved [19:30:22] ok, if joal/nuria/ottomata want we can talk about design doc in cave now [19:30:36] milimetric: in 1-1 now - 1/2 [19:30:37] h [19:30:43] 10Analytics, 10Analytics-Kanban, 10Research-Backlog, 10Wikidata, 10Patch-For-Review: Copy Wikidata dumps to HDFS + parquet - https://phabricator.wikimedia.org/T209655 (10Nuria) 05Open→03Resolved [19:30:45] 10Analytics: Provide data dumps in the Analytics Data Lake - https://phabricator.wikimedia.org/T186559 (10Nuria) [19:30:46] oh! [19:30:47] 10Analytics, 10Article-Recommendation, 10Patch-For-Review: Generate article recommendations in Hadoop for use in production - https://phabricator.wikimedia.org/T210844 (10Nuria) [19:30:54] sorry joal - after I meant [19:39:49] 10Analytics, 10Analytics-Cluster: Hadoop Hardware Orders FY2019-2020 - https://phabricator.wikimedia.org/T243521 (10wiki_willy) Hi @Ottomata - I could be mistaken, but I thought things were currently pending on getting the GPUs tested out first via T242149 and T238587...before proceeding with the remaining ser... [19:48:01] 10Analytics, 10Analytics-Kanban, 10Release Pipeline, 10Patch-For-Review, and 2 others: Migrate EventStreams to k8s deployment pipeline - https://phabricator.wikimedia.org/T238658 (10Ottomata) Hm, @akosiaris I have been trying to vary CPU and/or memory limits to see what makes a difference in the throughput... [20:03:08] 10Analytics, 10Analytics-Cluster: Hadoop Hardware Orders FY2019-2020 - https://phabricator.wikimedia.org/T243521 (10Ottomata) The 6 x Hadoop GPU workers HW is blocked on the GPU testing but the other HW is not! We'd like to order these soon and begin a Presto + Hadoop Worker colocation project on the new nod... [20:03:33] 10Analytics, 10Analytics-Cluster: Hadoop Hardware Orders FY2019-2020 - https://phabricator.wikimedia.org/T243521 (10Ottomata) I can make an official procurement request for the 16 x Hadoop workers if so. [20:05:56] milimetric: I need to leave now - Will talk tomorrow :) [20:06:04] joal: np [20:08:44] ottomata, reviewed your code, changes loog good! [20:08:58] ottomata, I don't understand though the mw.track part [20:10:56] k [20:11:16] mforns_brb: do you understand the way eventlogging does mw.track('event.SchemaName', event) currently? [20:15:34] 10Analytics, 10Gerrit, 10Gerrit-Privilege-Requests, 10User-MarcoAurelio: Give access to Wikistats 2 to l10n-bot - https://phabricator.wikimedia.org/T245805 (10MarcoAurelio) 05Stalled→03Resolved [20:16:05] 10Analytics, 10Gerrit, 10Gerrit-Privilege-Requests, 10User-MarcoAurelio: Give access to Wikistats 2 to l10n-bot - https://phabricator.wikimedia.org/T245805 (10MarcoAurelio) [20:16:10] 10Analytics, 10Analytics-Wikistats, 10translatewiki.net, 10Patch-For-Review: Add stats.wikimedia.org to translatewiki.net - https://phabricator.wikimedia.org/T240621 (10MarcoAurelio) [20:24:32] 10Analytics, 10Better Use Of Data, 10Desktop Improvements, 10Product-Infrastructure-Team-Backlog, and 6 others: Client side error logging production launch - https://phabricator.wikimedia.org/T226986 (10Ottomata) [20:24:55] 10Analytics, 10Better Use Of Data, 10Desktop Improvements, 10Product-Infrastructure-Team-Backlog, and 6 others: Client side error logging production launch - https://phabricator.wikimedia.org/T226986 (10Ottomata) [20:25:23] ottomata, not, but I haven't dug into that, I can look at the code [20:26:51] oh ottomata saw your comment on the CR [20:33:37] 10Analytics: Should reportupdater Pingback reports be refactored? - https://phabricator.wikimedia.org/T246154 (10mforns) @CCicalese_WMF Hmm, all queries have the same first step, which is to isolate the last ping from each wiki. Only the last ping is considered for the calculations of new data points. It is true... [20:42:04] 10Analytics, 10Analytics-Kanban, 10Product-Analytics: Give clear recommendations for Spark settings - https://phabricator.wikimedia.org/T245897 (10nshahquinn-wmf) FYI, from my perspective, this is done. Thanks again! [21:09:59] (03PS1) 10Milimetric: Update aqs to f3ea76f [analytics/aqs/deploy] - 10https://gerrit.wikimedia.org/r/575355 [21:10:20] (03CR) 10Milimetric: [V: 03+2 C: 03+2] Update aqs to f3ea76f [analytics/aqs/deploy] - 10https://gerrit.wikimedia.org/r/575355 (owner: 10Milimetric) [21:16:29] !log tried to deploy AQS but it failed with the same integration test on mediarequests, sending email [21:16:31] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [21:18:50] 10Analytics, 10Analytics-EventLogging, 10VisualEditor: New EventLogging queue doesn't log events in window.unload - https://phabricator.wikimedia.org/T246382 (10DLynch) [21:22:06] ottomata: wanna help me track down a scap failure? [21:22:14] deploying aqs [21:22:31] 10Analytics: Should reportupdater Pingback reports be refactored? - https://phabricator.wikimedia.org/T246154 (10Nuria) >We even can create a table in top of the backup data to allow for retroactive queries there. Since this is not needed to get the same reports we have been getting to date let''s just please no... [21:22:33] it says Executing check 'endpoints' \ Check 'endpoints' failed: [21:23:59] and I'm trying to figure out where these checks are [21:27:31] milimetric: hey sorry sure!@ [21:27:45] cave might be easier [21:27:55] the checks are defined in the scap deploy repo [21:27:57] looking at aqs [21:29:08] oh its an nrpe check [21:29:10] probalby defind in puppet [21:29:20] yeah it says command: check_endpoints_aqs [21:29:45] looked in puppet could find: https://github.com/wikimedia/puppet/search?q=check_endpoints_aqs&unscoped_q=check_endpoints_aqs [21:29:56] *couldn't [21:30:54] https://github.com/wikimedia/puppet/blob/production/modules/service/manifests/node.pp#L397-L404 [21:31:23] heh, well, that's impossible to find unless you know it's there [21:31:31] should put a comment above it or something [21:31:34] https://github.com/wikimedia/puppet/blob/production/modules/service/templates/check-service.erb [21:31:46] milimetric: i went to modules/aqs/init.pp [21:32:02] saw that it used service::node [21:32:18] modules/service/manifests/node.pp [21:32:21] searched for nrpe [21:32:32] ok, still, I don't know what nrpe is [21:32:49] 10Analytics, 10Analytics-EventLogging, 10Editing-team, 10VisualEditor, 10Patch-For-Review: New EventLogging queue doesn't log events in window.unload - https://phabricator.wikimedia.org/T246382 (10DLynch) [21:32:54] yeah its confusing so don't worry about it [21:33:00] but its like a remote nagios check [21:33:21] 10Analytics, 10Analytics-EventLogging, 10VisualEditor, 10Editing-team (Q3 2019-2020 Kanban Board), 10Patch-For-Review: New EventLogging queue doesn't log events in window.unload - https://phabricator.wikimedia.org/T246382 (10DLynch) [21:33:21] ok, anyway, but I still don't see what it does -> why it's failing -> how to fix it and [21:33:24] anyway it runs service-checker-swagger [21:33:26] haha yeah am looking too [21:33:43] dunno what service-checker-swagger is [21:34:12] ah but it is ondeploy1001 [21:34:45] and it's an old version of service-checker-swagger 'cause we're not on latest [21:35:25] or... I guess maybe it's a new version... we just use an old version of template-node-service [21:35:30] ah but it runs a sa local script so [21:35:39] what node? e.g. aqs1001 ? [21:35:44] 1004 [21:35:48] k [21:35:58] (1001-1003 have been decomissioned for a while I thought) [21:36:46] Check 'endpoints' failed: /analytics.wikimedia.org/v1/mediarequests/per-file/{referer}/{agent}/{file_path}/{granularity}/{start}/{end} (Get per file requests) is CRITICAL: Test Get per file requests returned the unexpected status 404 (expecting: 200) [21:36:58] 21:36:39 [@aqs1004:/home/otto] $ /usr/bin/service-checker-swagger -t 5 $(hostname -i) http://$(hostname -i):7232/analytics.wikimedia.org/v1 [21:36:58] All endpoints are healthy [21:37:08] right, I rolled back when that failed [21:37:11] aye k [21:37:21] so, that command apparently is failling when tyou deploy [21:37:22] that's the full error I got above, I think it's what Fran was trying to fix but it didn't work so I was going to give it a shot [21:37:28] k [21:37:56] heh, so it is doing the right thing ! :) [21:39:31] well, I think if we knew what the hell that was, we could fix it [21:39:53] ah the actual url? [21:40:00] it is using the service swagger spec to get that [21:40:02] ok [21:40:49] e.g., [21:40:49] curl aqs1004.eqiad.wmnet:7232/analytics.wikimedia.org/v1/?spec | jq . [21:41:14] i think it uses the x-amples to curl it [21:44:04] here's the spec for that path [21:44:14] (coudlln't be bothhered to figure out better .jq, so used grep -A) [21:44:15] curl aqs1004.eqiad.wmnet:7232/analytics.wikimedia.org/v1/?spec | jq '.paths' | grep "/mediarequests/per-file/{referer}/{agent}/{file_path}/{granularity}/{start}/{end}" -A 113 [21:44:23] https://www.irccloud.com/pastebin/lECrQ1if/ [21:44:30] so hyou can reconstruct the url using those params [21:44:38] ah ok [21:45:43] e.g. [21:45:44] curl aqs1004.eqiad.wmnet:7232/analytics.wikimedia.org/v1/mediarequests/per-file/en.wikipedia/all-agents/-/daily/1970010100/1970010100 | jq . [21:45:46] which works right now [21:45:53] so after deploy that isn't working [21:45:59] did the swagger spec change? [21:46:05] gotcha [21:46:07] maybe the URL changed but the speec wasn't updated? [21:46:32] this is what Fran did to try and fix this, I think: https://gerrit.wikimedia.org/r/#/c/analytics/aqs/+/573998/ [21:47:39] and then maybe this messed it up? https://github.com/wikimedia/analytics-aqs/commit/469f143000fcaef28aefebeab33946249c0d2e89 [21:48:31] is it ok if I deploy again to see what's going on? [21:52:23] ok, I'm just going to revert the xample and test for "-" as a file_path. It might be primitive but at least we can deploy and we can fix later [21:54:11] (03PS1) 10Milimetric: Revert "Fix per file mediarequests integration tests" [analytics/aqs] - 10https://gerrit.wikimedia.org/r/575359 [21:54:37] (03CR) 10Milimetric: [V: 03+2 C: 03+2] "This needs to be revisited, right now it just breaks the integration tests." [analytics/aqs] - 10https://gerrit.wikimedia.org/r/575359 (owner: 10Milimetric) [21:55:45] ok! [21:55:58] (03PS1) 10Milimetric: Update aqs to dec493f [analytics/aqs/deploy] - 10https://gerrit.wikimedia.org/r/575360 [21:58:02] (03CR) 10Milimetric: [V: 03+2 C: 03+2] Update aqs to dec493f [analytics/aqs/deploy] - 10https://gerrit.wikimedia.org/r/575360 (owner: 10Milimetric) [22:05:38] ok, looks like the deploy is fine now, and the fix that Fran tried to do is in. I am not 100% sure the fix is correct, but it looks like it works for single-encoded file paths [22:10:32] 10Analytics, 10Analytics-Kanban, 10Multimedia, 10Tool-Pageviews: Fix double encoding of urls on mediarequests api - https://phabricator.wikimedia.org/T244373 (10Milimetric) Ok, I deployed this, seems to fix the original problem as reported in T234590#5850051, as not it allows the single-encoded form: http... [22:19:27] 10Analytics, 10Analytics-EventLogging, 10DiscussionTools, 10VisualEditor, and 3 others: New EventLogging queue doesn't log events in window.unload - https://phabricator.wikimedia.org/T246382 (10JTannerWMF) [22:30:07] 10Analytics, 10Analytics-Cluster: Hadoop Hardware Orders FY2019-2020 - https://phabricator.wikimedia.org/T243521 (10wiki_willy) @Ottomata - gotcha, we should be able to proceed forward then with the 16x nodes for the refresh. Go ahead and shoot open a procurement request for them. Thanks, Willy [22:39:50] 10Analytics, 10Analytics-Kanban, 10Event-Platform, 10Patch-For-Review, 10Services (watching): jsonschema-tools should add a 'latest' symlink - https://phabricator.wikimedia.org/T245859 (10Ottomata) [23:20:55] 10Analytics, 10Analytics-EventLogging, 10DiscussionTools, 10VisualEditor, and 3 others: New EventLogging queue doesn't log events in window.unload - https://phabricator.wikimedia.org/T246382 (10matmarex) [23:28:10] 10Quarry, 10User-DannyS712: Unable to access Quarry: 504 timeout - https://phabricator.wikimedia.org/T246364 (10Framawiki) Site was down for 8 minutes from 2020-02-27 18:08:10 UTC+1.