[04:26:50] Analytics-Kanban, Contributors-Analysis, Patch-For-Review: Provide cumulative edit count in Data Lake edit data - https://phabricator.wikimedia.org/T161147#3526959 (Neil_P._Quinn_WMF) @Nuria, I really apologize; this got swallowed by other work and then Wikimania. I did some spot-checks today, and...
[06:32:54] Analytics-Kanban, Operations, Patch-For-Review, User-Elukey: Analytics Kafka cluster causing timeouts to Varnishkafka since July 28th - https://phabricator.wikimedia.org/T172681#3526998 (elukey) {F9085708} Seems definitely solved, but some follow ups would need to be done: 1) Make sure to insta...
[06:33:02] Analytics-Kanban, Operations, Patch-For-Review, User-Elukey: Analytics Kafka cluster causing timeouts to Varnishkafka since July 28th - https://phabricator.wikimedia.org/T172681#3526999 (elukey) p:High>Normal
[10:30:13] Analytics-Kanban, Operations, Patch-For-Review, User-Elukey: Analytics Kafka cluster causing timeouts to Varnishkafka since July 28th - https://phabricator.wikimedia.org/T172681#3527270 (elukey) Created report https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=872327
[10:40:12] Analytics-Kanban, Patch-For-Review, User-Elukey: Refactor puppet code for the Hadoop Analytics cluster to roles/profiles - https://phabricator.wikimedia.org/T167790#3527293 (elukey)
[10:58:18] * elukey lunch!
[12:29:06] What's up with Wikistats? It still shows May 2017, https://stats.wikimedia.org/wiktionary/EN/TablesWikipediaRU.htm
[13:05:16] (CR) Mforns: [V: 1 C: 1] "LGTM! Andrew any concerns with the addition of group permits?" (1 comment) [analytics/reportupdater] - https://gerrit.wikimedia.org/r/371955 (https://phabricator.wikimedia.org/T173333) (owner: Bearloga)
[13:21:09] (CR) Mforns: [C: 2] README.md: update link [analytics/reportupdater] - https://gerrit.wikimedia.org/r/371964 (owner: Bearloga)
[13:21:26] (CR) Mforns: "Thanks for fixing this!" [analytics/reportupdater] - https://gerrit.wikimedia.org/r/371964 (owner: Bearloga)
[14:23:41] Analytics-Kanban, Analytics-Wikistats: Wikistats2 bugs (1/4) - Dashboard and general UI - https://phabricator.wikimedia.org/T170933#3527671 (fdans)
[14:25:02] Analytics-Kanban, Analytics-Wikistats: Wikistats2 bugs (1/4) - Dashboard and general UI - https://phabricator.wikimedia.org/T170933#3447985 (fdans)
[15:00:05] ping mforns elukey
[15:00:48] Analytics-Kanban, Operations, Patch-For-Review, User-Elukey: Analytics Kafka cluster causing timeouts to Varnishkafka since July 28th - https://phabricator.wikimedia.org/T172681#3527754 (elukey) a:elukey
[15:01:33] elukey, joal: fyi: https://www.eventbrite.com/e/next-generation-cassandra-conference-ngcc-tickets-36160855091
[15:01:52] also nuria_ :) ^^^
[15:02:06] urandom: on meeting will read in abit
[15:02:19] urandom: nice! Will you be presenting?
[15:02:46] elukey: no, i'm organizing
[15:12:05] Analytics-Kanban, Analytics-Wikistats: Wikistats2 bugs (2/4) - Wiki selector - https://phabricator.wikimedia.org/T170936#3527780 (fdans)
[15:12:07] Analytics-Kanban, Analytics-Wikistats: Wikistats2 bugs (1/4) - Dashboard and general UI - https://phabricator.wikimedia.org/T170933#3527781 (fdans)
[15:34:37] Analytics-Kanban, User-Elukey: Archive PageContentSaveComplete in hdfs while we continue collecting data - https://phabricator.wikimedia.org/T170720#3527872 (elukey) Sanity check from my side: ``` HDFS: select count(*) from PageContentSaveComplete_5588433_15423246 ; 1291270247 dbstore1002: mysql:resear...
[15:49:06] Analytics-Kanban, User-Elukey: Archive PageContentSaveComplete in hdfs while we continue collecting data - https://phabricator.wikimedia.org/T170720#3440380 (Nuria) Dropped from slaves but not master.
[15:58:15] urandom: nice
[15:58:26] urandom: are you encouraging us to attend/present?
[15:59:32] fdans: THAT WAS SOME KARAOKE
[16:00:00] nuria_: haha yeah it was the "VIP room" of a chinese karaoke
[16:00:10] the song selection was weeeeird
[16:00:31] nuria_: yes
[16:00:39] LA2: wikistats is normally two months behing and this time its processing has moved machines, which has delayed it further
[16:01:25] nuria_: i mean, i will probably be otherwise disposed during, and have something of a conflict of interest maybe wrt to point out issues
[16:01:45] which probably means i'm harder on people, really, but i digress
[16:02:14] urandom: so gist, we submit a presentation in which we say bad things about cassandra, juas!
[16:02:17] nuria_: anyway, it's probably not easy to send someone, but i thought i'd mention it
[16:02:22] heh
[16:02:48] urandom: it will be not hard to send someone from us
[16:03:12] urandom: we try to have budget to attend 1 tech conference each (either tech conf or wikimania)
[16:03:29] ok
[16:04:39] i can't promise any submitted talk would be selected (i'm delegating that to the PMC), but there would be plenty of opportunity to hear about what is coming up, and to provide input on what doesn't work well (i.e. what should be prioritized in upcoming work)
[16:06:02] nuria_: and yikes, i just noticed that submissions need to be in by saturday
[16:06:12] nuria_: this whole thing has been a bit rushed (mea culpa)
[16:08:17] urandom: then it is going to be hard to have something ready , half of team is out now and probably elukey and joal are the best ones to put a presentation together, if you want a technical presentation, which you probably do
[16:09:00] nuria_: an abstract and title is all that is needed by saturday
[16:26:46] Analytics, ChangeProp, EventBus, MediaWiki-JobQueue, and 4 others: Investigate use-cases for delayed job executions - https://phabricator.wikimedia.org/T172832#3528070 (GWicke) Some use cases from [today's eventbus sync discussion](https://etherpad.wikimedia.org/p/JobQueue): - Batching & rate li...
[16:52:17] elukey: that tiny little blip is so sad: https://grafana.wikimedia.org/dashboard/file/server-board.json?refresh=1m&orgId=1&var-server=dbstore1002&var-network=bond0&panelId=17&fullscreen&from=now-7d&to=now
[16:52:33] yeah :(
[16:52:43] urandom: I don't think we can make it before Saturday :(
[16:53:49] * elukey going afk people!
[16:54:43] byeeee elukey !
[16:58:49] Analytics, Scoring-platform-team-Backlog, revscoring, artificial-intelligence: [Investigate] Use PMML for prediction model serialization - https://phabricator.wikimedia.org/T173244#3522058 (Halfak) p:Triage>Normal
[17:00:29] Analytics, Analytics-EventLogging, Performance-Team, Scap (Scap3-Adoption-Phase1): Use scap3 to deploy eventlogging/eventlogging - https://phabricator.wikimedia.org/T118772#3528180 (greg)
[17:47:37] Analytics-Kanban: Run mediawiki edit reconstruction 2017-07 snapshot with new set of wikis - https://phabricator.wikimedia.org/T172463#3528400 (Nuria) Open>Resolved
[17:48:07] Analytics-Kanban, RESTBase-API, Services (later), User-mobrovac: Expose pageview data in each project's REST API - https://phabricator.wikimedia.org/T119094#3528401 (Nuria) Open>declined
[17:48:43] Analytics-Kanban, Analytics-Wikistats: Fix wikistats 2.0 footer links - https://phabricator.wikimedia.org/T173043#3528427 (Nuria) a:fdans
[17:49:46] Analytics-Kanban: Vet Analysis on June 2017 Data - https://phabricator.wikimedia.org/T171914#3528432 (Nuria) Open>Resolved
[18:30:52] Analytics, ChangeProp, EventBus, Services (later): Generalise deduplication in ChangeProp - https://phabricator.wikimedia.org/T157090#3528641 (GWicke)
[19:49:16] PROBLEM - eventlogging Varnishkafka log producer on cp1008 is CRITICAL: Return code of 255 is out of bounds
[19:49:25] PROBLEM - Webrequests Varnishkafka log producer on cp1008 is CRITICAL: Return code of 255 is out of bounds
[19:50:06] PROBLEM - statsv Varnishkafka log producer on cp1008 is CRITICAL: Return code of 255 is out of bounds
[20:10:45] RECOVERY - Webrequests Varnishkafka log producer on cp1008 is OK: PROCS OK: 1 process with args /usr/bin/varnishkafka -S /etc/varnishkafka/webrequest.conf
[20:11:25] RECOVERY - statsv Varnishkafka log producer on cp1008 is OK: PROCS OK: 1 process with args /usr/bin/varnishkafka -S /etc/varnishkafka/statsv.conf
[20:11:45] RECOVERY - eventlogging Varnishkafka log producer on cp1008 is OK: PROCS OK: 1 process with args /usr/bin/varnishkafka -S /etc/varnishkafka/eventlogging.conf
[20:45:45] Analytics, ChangeProp, EventBus, MediaWiki-JobQueue, Services (doing): Generalized rate limiting, deduplication, and job scheduling module - https://phabricator.wikimedia.org/T173447#3529091 (mobrovac)
[20:57:28] Analytics-Kanban, Patch-For-Review: Add QuickSurvey schemas to EventLogging white-list - https://phabricator.wikimedia.org/T172112#3529149 (leila) @mforns: re QuickSurveyInitiation: dropping clientIp (which is hashed) and userAgent for the survey we ran towards the end of June 2017 is too risky from the...
[21:04:26] Analytics-Kanban, Patch-For-Review: Add QuickSurvey schemas to EventLogging white-list - https://phabricator.wikimedia.org/T172112#3529173 (leila) @mforns: re QuickSurveyResponses: is `event_surveySessionToken` the exact same value as in QuickSurveyInitiation? If yes, it's fine to drop userAgent from Qu...
[21:04:51] Analytics-Kanban, Research, Patch-For-Review: Add QuickSurvey schemas to EventLogging white-list - https://phabricator.wikimedia.org/T172112#3529174 (leila)
[21:10:29] Analytics, ChangeProp, EventBus, MediaWiki-JobQueue, Services (doing): Generalized rate limiting, deduplication, and job scheduling module - https://phabricator.wikimedia.org/T173447#3529193 (GWicke)
[21:11:38] Analytics, ChangeProp, EventBus, MediaWiki-JobQueue, Services (doing): Generalized rate limiting, deduplication, and job scheduling module - https://phabricator.wikimedia.org/T173447#3528627 (GWicke)
[21:17:38] Analytics, ChangeProp, EventBus, MediaWiki-JobQueue, Services (doing): Generalized rate limiting, deduplication, and job scheduling module - https://phabricator.wikimedia.org/T173447#3529199 (GWicke)
[21:18:45] Analytics, ChangeProp, EventBus, MediaWiki-JobQueue, Services (doing): Generalized rate limiting, deduplication, and job scheduling module - https://phabricator.wikimedia.org/T173447#3528627 (GWicke)
[21:19:55] Analytics, ChangeProp, EventBus, MediaWiki-JobQueue, Services (doing): Generalized rate limiting, deduplication, and job scheduling module - https://phabricator.wikimedia.org/T173447#3529203 (GWicke)
[21:22:54] Analytics, Analytics-EventLogging, Contributors-Analysis, EventBus, Community-Tech-Sprint: Add index to mediawiki_page_create_1 table - https://phabricator.wikimedia.org/T170990#3529208 (kaldari) @Ottomata: @Nettrom (who is working as a contract analyst for the CommTech team) is looking into...
[22:33:34] Analytics, Analytics-EventLogging, Performance-Team, Scap (Scap3-Adoption-Phase1): Use scap3 to deploy eventlogging/eventlogging - https://phabricator.wikimedia.org/T118772#3529414 (Krinkle) Discussed this today. Given we're only using the eventlogging library as simple abstraction around python-...