[00:39:33] 10Analytics, 10Analytics-Kanban, 10Analytics-Wikistats: Wikistats 2.0: Page heading style varies - https://phabricator.wikimedia.org/T187412#4089922 (10Krinkle) @fdans I can still reproduce the issue: 1. View in a new tab. – Heading not bold. 2. In that tab, click "Reading", t... [01:24:49] 10Analytics, 10Analytics-Wikistats, 10Patch-For-Review: Display of radio buttons in Wikistats 2 is somewhat confusing - https://phabricator.wikimedia.org/T183185#4089978 (10Amitjoki) @Nuria if that was offensive, sorry about that. Didn't mean to. Will condense and remove the inappropriate details from the co... [01:34:59] (03PS2) 10Amitjoki: Bug T183185 Display of radio buttons in Wikistats 2 is somewhat confusing [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422407 [01:37:56] (03PS3) 10Amitjoki: Bug T183185 Display of radio buttons in Wikistats 2 is somewhat confusing [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422407 [01:39:23] (03CR) 10Amitjoki: ">" (031 comment) [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422407 (owner: 10Amitjoki) [05:17:33] 10Analytics, 10Analytics-Wikistats, 10Patch-For-Review: Display of radio buttons in Wikistats 2 is somewhat confusing - https://phabricator.wikimedia.org/T183185#4090105 (10Nuria) >if that was offensive, sorry about that. Didn't mean to. No, not at all. Really. Just wanted to emphasize that the approach for... [06:37:41] 10Analytics, 10Analytics-Kanban, 10Analytics-Wikistats, 10Google-Summer-of-Code (2018): GSoC Proposal 2018 : [Analytics] Improvements to Wikistats2 front-end - https://phabricator.wikimedia.org/T189964#4090136 (10sahil505) [07:30:00] !log Manually reparing hive mediawiki_private_cu_changes table after manual sqooping of 2018-01 data, and add _PARTITIONNED file to the folder [07:30:01] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [07:34:28] * elukey imagines joseph doing mechanical work in his Hive car [07:34:43] :D [07:34:46] Hi elukey [07:35:10] elukey: The hive actually self-repares itself - It's pretty cool ;) [07:36:18] 10Analytics, 10Patch-For-Review, 10User-Elukey: latest varnishkafka fails to build on Debian - https://phabricator.wikimedia.org/T186250#4090214 (10elukey) 05Open>03Resolved a:03elukey Just merged the change, thanks a lot! [07:36:51] contributions for varnishkafka --^ [07:36:52] super nice [07:37:35] great :) [07:39:06] Yay - As expected, geowiki data for 2018-01 is being computed now out of manually sqooped cu_changed [07:39:28] It;s always a relief to see that stuff works, particularly when dealing with stuff that *doesn't*g [07:42:06] :) [07:43:47] commuting to the co-working, bbiab [07:48:07] (03PS6) 10Fdans: Fix responsive glitches [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422422 (https://phabricator.wikimedia.org/T187440) (owner: 10Milimetric) [07:49:54] (03CR) 10Fdans: "Added a patch to fix a weird margin issue between the header and "Monthly overview" on the dashboard. Also changed a bit the css of the ta" [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422422 (https://phabricator.wikimedia.org/T187440) (owner: 10Milimetric) [08:01:45] (03CR) 10Fdans: "I only found one thing that bother me, which you didn't add. I'll remove it myself, test on the canary with my phone, and merge." (031 comment) [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422422 (https://phabricator.wikimedia.org/T187440) (owner: 10Milimetric) [08:04:34] joal: going to do the cassandra rolling restart for aqs, any objection? [08:04:46] (openjdk-8 upgrades) [08:04:46] (03PS7) 10Fdans: Fix responsive glitches [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422422 (https://phabricator.wikimedia.org/T187440) (owner: 10Milimetric) [08:04:51] then druid is next :) [08:05:12] I am really curious to know if the trick about doing zk first and then druid overlord works again [08:05:31] I can imagine elukey [08:05:38] no objection for restart nope :) [08:06:33] 10Analytics, 10Analytics-Kanban, 10User-Elukey: Add trash folder to hadoop - https://phabricator.wikimedia.org/T189051#4090253 (10elukey) [08:07:05] goood! proceeding then [08:09:57] (03CR) 10Fdans: [C: 032] Fix responsive glitches [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422422 (https://phabricator.wikimedia.org/T187440) (owner: 10Milimetric) [08:27:29] 10Analytics, 10Operations, 10Traffic: Investigate and fix odd uri_host values - https://phabricator.wikimedia.org/T188804#4090285 (10ema) p:05Triage>03Normal [08:28:30] 10Analytics, 10Operations, 10Traffic: Spammy events coming our way for sites such us https://ru.wikipedia.kim - https://phabricator.wikimedia.org/T190843#4090289 (10ema) p:05Triage>03Normal [08:40:47] moritzm: o/ - should I also roll restart aqs on aqs*? (seeing some pending updates with lsof) [08:43:25] yeah, 1008 and 1009 seem to need a restart of cassandra [08:43:34] yep still in progress :) [08:45:40] the AQS stuff itself is written in nodejs, it currently needs a restart for the ICU update, but I held it back since there's also the openssl update and the nodejs update [08:45:43] _but_ [08:46:14] the nodejs update happened yesterday night and the changes are negligable, so I don't think we'll update it [08:46:30] ack [08:46:39] nodejs as distributed by upstream bundles OpenSSL, so it needs an update whenever OpenSSL is updated [08:46:47] but we link against OpenSSL from Debian proper [08:47:01] and the other two node-specific issues are harmless [08:47:18] so, let me upgrade openssl on AQS and then we can restart the nodejs service [08:47:26] +1 [08:50:05] elukey: there is a long awating doc patch for AQS [08:50:12] elukey: should we deploy if we restat? [08:51:36] joal: ah yes this is a good way to couple two things! [08:54:58] elukey: hm - ok, let's not do it :-P [08:56:49] joal: why? [08:57:11] those are new ICU/openssl updates, I don't expect anything to explode [08:57:18] elukey: I was wondering if "coupling" could be wrong :) [08:57:20] Ah ok [08:57:29] nono it is fine this time :) [08:57:47] I can restart one aqs daemon first, check that it is fine and then let you deploy [08:58:25] elukey: As you wish - The patch to deploy is actually only doc, I'm even wondering if it's worth [08:58:39] elukey: Mwah - Let's not do it :) [09:04:05] elukey: give me 5 mins, then I'll upgrade openssl, currently completing something else [09:04:12] ack! [09:06:17] 10Analytics, 10Contributors-Analysis, 10Performance-Team, 10VisualEditor, and 2 others: Statsv down, affects metrics from beacon/statsv (e.g. VisualEditor, mw-js-deprecate) - https://phabricator.wikimedia.org/T141054#4090458 (10Neil_P._Quinn_WMF) p:05High>03Triage [09:06:52] 10Analytics-Kanban, 10Contributors-Analysis, 10DBA, 10MW-1.27-release (WMF-deploy-2016-02-02_(1.27.0-wmf.12)), and 2 others: Edit schema needs purging, table is too big for queries to run (500G before conversion) {oryx} - https://phabricator.wikimedia.org/T124676#4090482 (10Neil_P._Quinn_WMF) p:05Normal>... [09:07:46] 10Analytics-Dashiki, 10Analytics-Kanban, 10Contributors-Analysis, 10Contributors-Team, 10Patch-For-Review: Time selector on https://edit-analysis.wmflabs.org/compare/ is only followed (?) by the first and fifth elements {crow} [3 pts] - https://phabricator.wikimedia.org/T112183#4090532 (10Neil_P._Quinn_WM... [09:08:23] 10Analytics, 10Analytics-Backlog, 10Analytics-Dashiki, 10Contributors-Analysis, and 4 others: Limit wikis on https://edit-analysis.wmflabs.org/compare/ to top 50 Wikipedias {lion} - https://phabricator.wikimedia.org/T112222#4090541 (10Neil_P._Quinn_WMF) p:05Normal>03Triage [09:10:36] elukey, joal: openssl upgraded on AQS, you can go ahead [09:11:12] moritzm: thanks! [09:11:19] elukey: note that there's also a few other services pending for restart, but thanks to the beauty of wmf-auto-upgrade these are all handled automatically [09:11:35] (nrpe, lldpd, diamond) [09:11:51] Nice moritzm ! [09:12:40] moritzm: \o/ [09:14:12] joal: restarted aqs on 1004, all good, shall we deploy? [09:15:29] elukey: I was actually thinking it's not necessary [09:15:55] This doc is not even visible to the oustside world, so let's have it deployed next time a real change is needed [09:16:05] elukey: sorry for the noise :) [09:16:07] as you prefer :) [09:19:13] (03PS1) 10Fdans: Release 2.2.0 [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422899 [09:19:52] (03CR) 10Fdans: [C: 032] Release 2.2.0 [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422899 (owner: 10Fdans) [09:22:28] (03Merged) 10jenkins-bot: Release 2.2.0 [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422899 (owner: 10Fdans) [09:27:26] (03PS1) 10Fdans: Release 2.2.0 [analytics/wikistats2] (release) - 10https://gerrit.wikimedia.org/r/422900 [09:27:30] (03CR) 10Fdans: [C: 032] Release 2.2.0 [analytics/wikistats2] (release) - 10https://gerrit.wikimedia.org/r/422900 (owner: 10Fdans) [09:29:40] (03Merged) 10jenkins-bot: Release 2.2.0 [analytics/wikistats2] (release) - 10https://gerrit.wikimedia.org/r/422900 (owner: 10Fdans) [09:43:42] aqs restarted! [09:44:25] now since we are around the clock, I'd stop the middlemanager on the druid node acting as overlord leader on the druid private cluster [09:44:31] (drain+stop_ [09:44:46] to see if the trick with zookeeper works [09:44:54] elukey: works for me elukey [09:45:23] that is druid1002 [09:46:00] goood! so disabled the druid1002's middle manager [09:50:10] ah snap joal I just realized one thing [09:50:39] we have openjdk-8 installed on druid because of the last failed attempt to upgrade [09:50:53] but currently (correct me if I am wrong) druid runs on jdk7 [09:52:02] I also remember that [09:52:48] moritzm: so we have a plan to upgrade druid next quarter, so probably better to just leave openjdk in there and then purge 7 once done [09:52:51] would it be ok? [09:53:35] ack [09:56:22] ok to also upgrade java on thorium?all the services running there are node or python based, so seems only installed for clients anyway [09:59:50] yep [10:04:20] elukey: pivot, hue and superset services need a restart after the openssl update on thorium, are those suitable candidates to wmf-auto-update? [10:05:32] moritzm: not sure, usually I do it without too many checks but during the EU morning (less chance to find analysts working on those) [10:07:39] ok, if a restart of any of those as a user-visible affect, then it's not a candidate :-) [10:07:46] effect [10:15:30] elukey: ok to restart all three of them now or do you want to do it when time is right? [10:16:20] moritzm: now it is fine [10:16:36] do you want me to do it? [10:19:12] sure, go ahead please [10:20:48] BTW, we can also extend wmf-auto-update to run the check at a specific time, right now it's spread out randomly across the day, but for cases like pivot, hue and superset we can also configure a fixed weekly maintenance window where the check and automatic restart would run [10:21:30] moritzm: it could be good, I think that monday early EU time would be fine (after the weekend, early EU morning, nobody using those) [10:21:48] (restared all in the meantime) [10:22:06] k, I'll make a note to extend wmf-auto-update with a parameter to use a fixed time and then we can revisit this [10:22:55] super [10:43:10] 10Analytics-Kanban, 10User-Elukey: Refresh zookeeper nodes in eqiad - https://phabricator.wikimedia.org/T182924#4090850 (10elukey) Another idea to reduce the amount of restarts would have been to add DNS CNAMEs for each zookeeper host, in order for example to avoid touching the zookeeper.connect settings in ea... [11:19:52] 10Analytics-Kanban, 10Operations, 10Patch-For-Review, 10User-Elukey, 10User-Joe: rack/setup/install conf1004-conf1006 - https://phabricator.wikimedia.org/T166081#4090905 (10elukey) Before proceeding I'd wait for @Joe's confirmation. I'd like to: 1) add static IPv6 addresses to conf100[456] with https://... [11:21:53] * elukey lunch! [12:20:55] PROBLEM - Kafka main-eqiad consumer group lag for kafka-mirror-main-eqiad_to_jumbo-eqiad on kafkamon1001 is CRITICAL: CRITICAL: Group is in an error state. Worst Lag: eqiad.mediawiki.revision-create/p0 - lag:5 offset:713084097 [12:21:54] RECOVERY - Kafka main-eqiad consumer group lag for kafka-mirror-main-eqiad_to_jumbo-eqiad on kafkamon1001 is OK: OK: Group is in an error state. [12:46:40] 10Analytics: Add UI in mobile view to switch to table view - https://phabricator.wikimedia.org/T191019#4091091 (10fdans) [12:51:28] elukey: kafka alerts happens every once in a while - SHould I do something about them? [12:57:16] joal: those are new ones that we are still testing, you can skip them :) [12:57:33] elukey: Thanks for the confirmation ;) [12:58:04] they indicate some temporary lag but nothing major.. if you don't see any recovery after a while ping me or andrew :) [13:09:40] (03PS1) 10Fdans: Prevent Wikimedia Statistics header from bolding in certain cases [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422925 (https://phabricator.wikimedia.org/T187412) [13:09:41] 10Analytics, 10Analytics-Kanban, 10Analytics-Wikistats: Wikistats 2.0: Page heading style varies - https://phabricator.wikimedia.org/T187412#4091150 (10fdans) @Krinkle thank you so much for the report! I've uploaded a change to get rid of this. [13:10:05] 10Analytics, 10Analytics-Wikistats, 10Patch-For-Review: Display of radio buttons in Wikistats 2 is somewhat confusing - https://phabricator.wikimedia.org/T183185#3846152 (10ezachte) Thanks @Amitjoki , new version looks clean and quite intuitive to me. [13:13:18] 10Analytics-Kanban, 10Analytics-Wikistats: Make the Wikistats 2 UI responsive - https://phabricator.wikimedia.org/T186812#4091170 (10fdans) [13:13:23] 10Analytics-Kanban, 10Patch-For-Review: Adapt SimpleLegend component to match responsive design - https://phabricator.wikimedia.org/T188727#4091173 (10fdans) [13:32:05] PROBLEM - Kafka main-eqiad consumer group lag for kafka-mirror-main-eqiad_to_jumbo-eqiad on kafkamon1001 is CRITICAL: CRITICAL: Group is in an error state. Worst Lag: eqiad.mediawiki.job.cirrusSearchCheckerJob/p0 - lag:237 offset:164348134 [13:34:04] RECOVERY - Kafka main-eqiad consumer group lag for kafka-mirror-main-eqiad_to_jumbo-eqiad on kafkamon1001 is OK: OK: Group is in an error state. [13:34:58] still a bit sensitive [13:35:36] yeah [13:35:39] lag 237??? [13:35:46] also strange part is the 'error state' [13:35:58] it seems burrow is always reproting 'error' state even for things that don't look like errors [13:36:18] i had to add code yesterday to not do alerts for 'error' when all that was happening is there were no new messages in a topic [13:36:19] hey fdans, I’ll work on the geowiki stuff for a bit, you got everything under control on the responsive stuff? [13:36:30] if not, lemme know [13:36:38] elukey: what's the status of burrow 1.0? i forget. service folks asked me yesterday about it [13:37:31] also, elukey why are we getting paged about this error? i have it set to critical = false i thought [13:37:36] nagios_critical = false [13:37:42] but i'm getting text messages [13:38:18] ottomata: I am not sure, we get all the alarms fired to 'analytics', probably it is a nagios setting? Never checked it, I find it very handy when I am not at the keyboard [13:39:30] yeah for some we shoudl get paged [13:39:33] but not for these [13:39:34] :) [13:39:40] even if these weren't false alarms [13:40:09] elukey: burrow 1.0? i forget, go dependencies? stretch something something? [13:40:32] is it a matter of doing it, or are we blocked waiting on something ? [13:41:05] milimetric: yeah your patched is merged [13:42:49] 10Analytics, 10Analytics-Wikistats, 10Patch-For-Review: Display of radio buttons in Wikistats 2 is somewhat confusing - https://phabricator.wikimedia.org/T183185#4091227 (10Amitjoki) @ezachte glad it worked out :) [13:42:54] ottomata: now it is a matter of doing it, all written in the task.. I think that the new version is a bit different, it controls multiple kafka clusters from the same daemon (rather than 1:1) [13:43:00] ah task duh i should read that... :p [13:43:03] 10Analytics, 10Wikidata, 10Wikidata-Ministry-Of-Magic: Add Wikidata website extract oozie job - https://phabricator.wikimedia.org/T191022#4091228 (10Jonas) [13:43:05] it is scheduled for next quarter, they know it :) [13:45:14] ok cool [13:45:17] was just wondering about this alert [13:45:26] since the burrow lag checker is a litlte werid... [13:45:26] well [13:45:29] burrow is weird [13:45:34] and seems to report incorrect things [13:45:37] even just via the http api [13:45:47] so i was considering making the alert prometheus based instead [13:45:48] since we [13:45:56] we'd have more control over the query function that generates the alert [13:46:02] than just relying on burrow /status endpoint [13:46:26] hmm, looks like they merged https://github.com/jirwin/burrow_exporter/issues/8 [13:47:12] since that's true, maybe basing on prometheus is better aftter alla [13:47:16] elukey: what do you think? [13:47:40] (03PS1) 10Jonas Kress (WMDE): Add Wikidata website extract oozie job [analytics/refinery] - 10https://gerrit.wikimedia.org/r/422930 [13:48:10] ottomata: I think so yes! If you want I can try to prioritize burrow 1.0 [13:48:32] well, i was thinking maybe burrow 1.0 would be better about these false positives, maybe [13:48:33] but [13:48:40] the check script uses the v2 api anyway [13:48:50] so maybe it is cleaner to bypass it and rely on prometheus [13:48:53] ? [13:49:19] 10Analytics, 10CirrusSearch, 10Discovery, 10Discovery-Search, 10EventBus: Exception thrown while running DataSender::sendData in cluster codfw: Data should be a Document, a Script or an array containing Documents and/or Scripts - https://phabricator.wikimedia.org/T191024#4091269 (10dcausse) [13:49:41] probably would be easier to migrate if the metrics change, we'd just change the prometheus query, rather than having to modify the script that makes api requests and parses the responses [13:52:23] yep this is a good point [13:52:43] ok i will try that now then [13:52:59] one thing that I don't get is why those flapping alarms still trigger, since we should have a wider retry interval? [13:53:05] am I missing something? [13:53:32] yeah that i don't get either [14:01:14] PROBLEM - Kafka main-eqiad consumer group lag for kafka-mirror-main-eqiad_to_jumbo-eqiad on kafkamon1001 is CRITICAL: CRITICAL: Group is in an error state. Worst Lag: eqiad.mediawiki.job.categoryMembershipChange/p0 - lag:107 offset:171029910 [14:05:14] RECOVERY - Kafka main-eqiad consumer group lag for kafka-mirror-main-eqiad_to_jumbo-eqiad on kafkamon1001 is OK: OK: Group is in an error state. [14:37:35] (03CR) 10Ottomata: Add Wikidata website extract oozie job (031 comment) [analytics/refinery] - 10https://gerrit.wikimedia.org/r/422930 (owner: 10Jonas Kress (WMDE)) [14:42:13] 10Analytics, 10Patch-For-Review, 10User-Elukey: latest varnishkafka fails to build on Debian - https://phabricator.wikimedia.org/T186250#4091437 (10Jrdnch) Great, thanks for your help all along the way! [14:42:17] PROBLEM - Kafka main-eqiad consumer group lag for kafka-mirror-main-eqiad_to_jumbo-eqiad on kafkamon1001 is CRITICAL: CRITICAL: Group is in an error state. Worst Lag: eqiad.mediawiki.revision-create/p0 - lag:52 offset:713223957 [14:49:28] PROBLEM - Kafka MirrorMaker main-eqiad_to_jumbo-eqiad max lag in last 10 minutes on einsteinium is CRITICAL: CRITICAL - scalar(max(max_over_time(kafka_burrow_partition_lag{group=kafka-mirror-main-eqiad_to_jumbo-eqiad} [10m])): bad_data: parse error at char 108: unclosed left parenthesis https://grafana.wikimedia.org/dashboard/db/kafka-mirrormaker?var-datasource=eqiad%2520prometheus%252Fops&var-mirror_name=main-eqiad_to_jumbo-eqi [14:49:33] (03CR) 10Jonas Kress (WMDE): Add Wikidata website extract oozie job (031 comment) [analytics/refinery] - 10https://gerrit.wikimedia.org/r/422930 (owner: 10Jonas Kress (WMDE)) [14:50:00] ottomata: ---^ [14:50:19] didn't see it sorry, the query needs a ")" [14:52:33] fixing it [14:54:12] 10Analytics, 10CirrusSearch, 10Discovery, 10Discovery-Search, and 3 others: Exception thrown while running DataSender::sendData in cluster codfw: Data should be a Document, a Script or an array containing Documents and/or Scripts - https://phabricator.wikimedia.org/T191024#4091516 (10dcausse) Also seeing:... [14:54:47] oh me too elukey [14:54:50] did you fix? [14:56:40] 10Analytics, 10CirrusSearch, 10Discovery, 10Discovery-Search, and 3 others: Exception thrown while running DataSender::sendData in cluster codfw: Data should be a Document, a Script or an array containing Documents and/or Scripts - https://phabricator.wikimedia.org/T191024#4091543 (10dcausse) [14:56:53] (03CR) 10Joal: Add Wikidata website extract oozie job (031 comment) [analytics/refinery] - 10https://gerrit.wikimedia.org/r/422930 (owner: 10Jonas Kress (WMDE)) [14:57:02] elukey: wtheckay? mm down? [14:57:06] ERROR org.apache.kafka.common.metrics.JmxReporter - Error getting JMX attribute: [14:57:06] javax.management.AttributeNotFoundException: Could not find attribute response-rate [14:58:17] wait a sec, where was this error logged? [14:58:36] in kafka-mirror log on analytics broker nodes [14:58:37] 10Analytics, 10CirrusSearch, 10Discovery, 10Discovery-Search, and 3 others: Exception thrown while running DataSender::sendData in cluster codfw: Data should be a Document, a Script or an array containing Documents and/or Scripts - https://phabricator.wikimedia.org/T191024#4091544 (10Pchelolo) Is this some... [14:58:49] mm stopped replicating [14:58:50] https://grafana-admin.wikimedia.org/dashboard/db/kafka-mirrormaker-new-consumer?refresh=5m&orgId=1&from=now-30m&to=now [14:58:52] wth? [14:58:55] i haven't touched them [14:58:58] trying to bounce them now [15:02:35] on 1023 I can see a lot of ERROR Error when sending message to topic eqiad.mediawiki.job.cirrusSearchIncomingLinkCount with key: null, [15:02:59] yea [15:03:06] i think that's beacuse there's too much to produce [15:03:13] elukey: i wonder if we should run more mirror maker instances [15:03:15] not just more consumer streams [15:03:27] Batch Expired [15:03:37] ottomata: could we do 6 (one on every node?) [15:03:48] 6 for main to jumbo [15:03:51] i'd have to refactor the profile to a define [15:03:52] but probably [15:04:00] ah yeah [15:04:53] elukey: also v strange [15:04:54] https://grafana-admin.wikimedia.org/dashboard/db/kafka-mirrormaker-new-consumer?refresh=5m&panelId=5&fullscreen&orgId=1&from=now-30m&to=now [15:05:03] no consume lag at all from burrow during that time?! [15:05:12] or, there was lag, but it didn't increase?! [15:05:31] mirror maker should have nothing to do with burrow's lag cacl [15:05:32] calc [15:05:40] 10Analytics, 10CirrusSearch, 10Discovery, 10Discovery-Search, and 3 others: Exception thrown while running DataSender::sendData in cluster codfw: Data should be a Document, a Script or an array containing Documents and/or Scripts - https://phabricator.wikimedia.org/T191024#4091556 (10dcausse) I think it's... [15:11:12] of course zookeeper doesn't work out of the box with the jmx agent [15:14:33] RECOVERY - Kafka MirrorMaker main-eqiad_to_jumbo-eqiad max lag in last 10 minutes on einsteinium is OK: OK - scalar(max(max_over_time(kafka_burrow_partition_lag{group=kafka-mirror-main-eqiad_to_jumbo-eqiad} [10m]))) within thresholds https://grafana.wikimedia.org/dashboard/db/kafka-mirrormaker?var-datasource=eqiad%2520prometheus%252Fops&var-mirror_name=main-eqiad_to_jumbo-eqiad [15:15:15] 10Analytics, 10CirrusSearch, 10Discovery, 10Discovery-Search, and 3 others: Exception thrown while running DataSender::sendData in cluster codfw: Data should be a Document, a Script or an array containing Documents and/or Scripts - https://phabricator.wikimedia.org/T191024#4091594 (10Pchelolo) Ok, I think... [15:19:07] 10Analytics, 10Patch-For-Review, 10User-Elukey: Add the prometheus jmx exporter to all the Zookeeper daemons - https://phabricator.wikimedia.org/T177460#4091601 (10elukey) Testing in deployment-prep, it seems not working: zookeeper doesn't come up and java agent doesn't bind its port. I added set -x to the i... [15:21:05] 10Analytics: Open source in its own github repo EL refine - https://phabricator.wikimedia.org/T191034#4091603 (10Nuria) [15:36:54] 10Analytics, 10Patch-For-Review, 10User-Elukey: Add the prometheus jmx exporter to all the Zookeeper daemons - https://phabricator.wikimedia.org/T177460#4091648 (10elukey) I've tested this also with zookeeper on Stretch, and systemd pointed out that a PEBKAC was happening, namely trying to bind the jmx agent... [15:47:42] 10Analytics, 10CirrusSearch, 10Discovery, 10Discovery-Search, and 3 others: Exception thrown while running DataSender::sendData in cluster codfw: Data should be a Document, a Script or an array containing Documents and/or Scripts - https://phabricator.wikimedia.org/T191024#4091681 (10dcausse) I can return... [15:56:13] (03CR) 10Nuria: "Please be aware that this data needs to be purged every 90 days. You need a companion change to this one to execute teh data drop:" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/422930 (owner: 10Jonas Kress (WMDE)) [16:00:28] (03CR) 10Nuria: [C: 04-1] "Please take a look at the info I have included about tags and let's talk to understand why this is needed." (032 comments) [analytics/refinery] - 10https://gerrit.wikimedia.org/r/422930 (owner: 10Jonas Kress (WMDE)) [16:01:11] ping fdans mfournier [16:01:21] sorry [16:08:33] 10Analytics, 10CirrusSearch, 10Discovery, 10Discovery-Search, and 3 others: Exception thrown while running DataSender::sendData in cluster codfw: Data should be a Document, a Script or an array containing Documents and/or Scripts - https://phabricator.wikimedia.org/T191024#4091710 (10Pchelolo) > I think th... [16:15:35] 10Analytics, 10Analytics-Wikistats, 10Easy: [Wikistats2] The detail page for tops and maps metrics does not indicate time range - https://phabricator.wikimedia.org/T182990#4091725 (10sahil505) a:03sahil505 [16:16:12] 10Analytics: wqsq_extrac jobs should probably be stopped and deleted - https://phabricator.wikimedia.org/T191037#4091727 (10Nuria) [16:20:52] 10Analytics: wqsq_extrac jobs should probably be stopped and deleted - https://phabricator.wikimedia.org/T191037#4091739 (10leila) @mkroetzsch can you let us know if you still need fresh data for wdqs research. If not, we will stop the script that extracts the data. [16:21:04] 10Analytics, 10Research: wqsq_extrac jobs should probably be stopped and deleted - https://phabricator.wikimedia.org/T191037#4091741 (10leila) [16:34:06] 10Quarry, 10cloud-services-team, 10Patch-For-Review: Quarry server errors caused by Cloud VPS shared proxy failures - https://phabricator.wikimedia.org/T190218#4091752 (10Bstorm) Ok, I've confirmed that the logs are rotated either daily or on the hour if they are above 2GB in size. That kept the main proxy... [16:34:22] 10Analytics, 10CirrusSearch, 10Discovery, 10Discovery-Search, and 3 others: Exception thrown while running DataSender::sendData in cluster codfw: Data should be a Document, a Script or an array containing Documents and/or Scripts - https://phabricator.wikimedia.org/T191024#4091753 (10dcausse) I have to che... [16:34:38] 10Quarry, 10cloud-services-team: Quarry server errors caused by Cloud VPS shared proxy failures - https://phabricator.wikimedia.org/T190218#4091754 (10Bstorm) 05Open>03Resolved [16:42:32] (03CR) 10Nuria: "You can ammend commit message before comitting just use on this change" [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422407 (owner: 10Amitjoki) [16:45:19] ping joal [16:45:39] hm - I was here, not anymore [16:46:28] 10Analytics, 10Research: wqsq_extrac jobs should probably be stopped and deleted - https://phabricator.wikimedia.org/T191037#4091814 (10fdans) p:05Triage>03Low [16:46:44] 10Analytics, 10Research: wdqs_extract jobs should probably be stopped and deleted - https://phabricator.wikimedia.org/T191037#4091816 (10Nuria) [16:47:40] 10Analytics: Open source the Eventlogging refine code, move to its own depot - https://phabricator.wikimedia.org/T191034#4091827 (10Nuria) [16:48:02] 10Analytics: Open source Spark DataFrame to hive refine job - https://phabricator.wikimedia.org/T191034#4091830 (10Ottomata) [16:49:31] 10Analytics: Open source Spark DataFrame to hive refine job - https://phabricator.wikimedia.org/T191034#4091850 (10fdans) p:05Triage>03Normal [16:51:06] 10Analytics, 10Analytics-Kanban, 10Wikidata, 10Patch-For-Review, 10Wikidata-Ministry-Of-Magic: Add Wikidata website extract oozie job - https://phabricator.wikimedia.org/T191022#4091853 (10fdans) p:05Triage>03Normal [16:52:27] 10Analytics, 10Analytics-Wikistats: Add UI in mobile view to switch to table view - https://phabricator.wikimedia.org/T191019#4091857 (10fdans) [16:52:40] 10Analytics, 10Analytics-Wikistats: Add UI in mobile view to switch to table view - https://phabricator.wikimedia.org/T191019#4091091 (10fdans) p:05Triage>03Low [16:54:32] 10Analytics, 10Analytics-Kanban, 10Analytics-Wikistats: Scroll should not bounce horizontally when swiping left/right - https://phabricator.wikimedia.org/T190959#4091867 (10fdans) p:05Triage>03Normal [16:57:55] 10Analytics, 10Analytics-Wikistats: Improve scoping of CSS - https://phabricator.wikimedia.org/T190915#4091878 (10fdans) p:05Triage>03Low [16:58:29] 10Analytics: Build prototype for MediaWiki content processing - https://phabricator.wikimedia.org/T190858#4091885 (10fdans) p:05Triage>03High [16:58:54] 10Analytics: Archive old geowiki data (editors per country) and make it easily available at WMF - https://phabricator.wikimedia.org/T190856#4091888 (10fdans) p:05Triage>03Normal [16:59:46] 10Analytics: Enable automatic ingestion from eventlogging into druid for some schemas - https://phabricator.wikimedia.org/T190855#4091891 (10fdans) p:05Triage>03High [17:00:14] 10Analytics: Upgrade main Kafka clusters to 1.0 - https://phabricator.wikimedia.org/T190853#4091893 (10fdans) p:05Triage>03High [17:01:12] 10Analytics: Spike: Quantify how many EventLogging requests we get from non-wiki* hostnames or apps - https://phabricator.wikimedia.org/T190840#4085113 (10fdans) p:05Triage>03Normal [17:01:27] 10Analytics, 10Analytics-Data-Quality: Spike: Quantify how many EventLogging requests we get from non-wiki* hostnames or apps - https://phabricator.wikimedia.org/T190840#4085113 (10fdans) [17:02:29] 10Analytics, 10Analytics-Kanban: Define battery of smoke tests to run by hand before realease - https://phabricator.wikimedia.org/T190837#4091905 (10fdans) p:05Triage>03Normal [17:03:24] 10Analytics: Notebook machine to double as RStudio Server? - https://phabricator.wikimedia.org/T190769#4091910 (10fdans) p:05Triage>03Lowest [17:03:45] 10Analytics, 10Patch-For-Review, 10User-Elukey: Add the prometheus jmx exporter to all the Zookeeper daemons - https://phabricator.wikimedia.org/T177460#4091912 (10elukey) So using the following config it seems clearer: ``` --- lowercaseOutputLabelNames: true lowercaseOutputName: false whitelistObjectNames:... [17:04:00] 10Analytics, 10Analytics-Kanban, 10SEO: Make Google API Python Client Library available on stat* machines - https://phabricator.wikimedia.org/T190767#4091913 (10fdans) [17:06:21] 10Analytics, 10CirrusSearch, 10Discovery, 10Discovery-Search, and 3 others: Exception thrown while running DataSender::sendData in cluster codfw: Data should be a Document, a Script or an array containing Documents and/or Scripts - https://phabricator.wikimedia.org/T191024#4091923 (10EBjune) p:05Triage>... [17:06:35] 10Analytics, 10CirrusSearch, 10Discovery, 10Discovery-Search, and 3 others: Exception thrown while running DataSender::sendData in cluster codfw: Data should be a Document, a Script or an array containing Documents and/or Scripts - https://phabricator.wikimedia.org/T191024#4091924 (10Pchelolo) > I guess we... [17:09:18] 10Analytics, 10EventBus, 10ORES, 10Reading-Infrastructure-Team-Backlog, and 3 others: Emit revision-score event to EventBus and expose in EventStreams - https://phabricator.wikimedia.org/T167180#3319947 (10fdans) p:05Triage>03Normal [17:09:21] 10Analytics, 10EventBus, 10ORES, 10Reading-Infrastructure-Team-Backlog, and 3 others: Emit revision-score event to EventBus and expose in EventStreams - https://phabricator.wikimedia.org/T167180#3319947 (10fdans) p:05Normal>03Triage [17:12:20] 10Analytics: Changes to map projection in wikistats - https://phabricator.wikimedia.org/T188927#4091939 (10fdans) p:05Triage>03High [17:13:48] 10Analytics, 10Operations, 10User-Elukey: Tune Varnishkafka delivery errors to be more sensitive - https://phabricator.wikimedia.org/T173492#4091941 (10fdans) p:05Normal>03Low [17:15:26] 10Analytics, 10ORES, 10Scoring-platform-team: Enable ores::base on stat1006 - https://phabricator.wikimedia.org/T181646#3796974 (10fdans) @Halfak hey! Are you aware that this is already enabled in stat1005? [17:16:08] 10Analytics, 10Analytics-Kanban, 10ORES, 10Scoring-platform-team: Enable ores::base on stat1006 - https://phabricator.wikimedia.org/T181646#4091946 (10fdans) a:03Ottomata [17:16:18] 10Analytics, 10Analytics-General-or-Unknown, 10Language-Team, 10Mobile-Apps, and 3 others: there should be a comparison of clicks count on interlanguage links on different platforms - https://phabricator.wikimedia.org/T78351#843122 (10Nuria) @Nuria can work on this on BCN hackathon maybe? There is a new... [17:17:02] 10Analytics, 10Analytics-Kanban, 10ORES, 10Scoring-platform-team: Enable ores::base on stat1006 - https://phabricator.wikimedia.org/T181646#3796974 (10fdans) p:05Triage>03Normal [17:19:18] 10Analytics, 10Analytics-Kanban, 10ORES, 10Scoring-platform-team: Enable ores::base on stat1006 - https://phabricator.wikimedia.org/T181646#4091968 (10Halfak) Yes. Stat1005 is over-used because it has access to everything. [17:19:44] 10Analytics: Jupyter Notebooks TLC 2018-2019 - https://phabricator.wikimedia.org/T188275#4091970 (10Ottomata) [17:19:47] 10Analytics, 10Discovery-Analysis: Get 'sparklyr' working on stats1005 - https://phabricator.wikimedia.org/T139487#4091969 (10Ottomata) [17:20:31] 10Analytics, 10Analytics-Kanban, 10Analytics-Wikistats: Wikistats Beta – Put view settings in URL so it can be shared. bookmarks - https://phabricator.wikimedia.org/T179444#4091974 (10fdans) p:05Triage>03High [17:21:22] 10Analytics, 10Trash: ---------------- Discussed above -------------------- - https://phabricator.wikimedia.org/T169900#4091976 (10Milimetric) 05stalled>03Invalid [17:23:04] 10Analytics-EventLogging, 10Analytics-Kanban, 10Patch-For-Review: Implement EventLogging Hive refinement - https://phabricator.wikimedia.org/T162610#4091980 (10fdans) [17:23:07] 10Analytics, 10Patch-For-Review, 10Performance-Team (Radar): Explore NavigationTiming by faceted properties - EventLogging refine - https://phabricator.wikimedia.org/T166414#4091979 (10fdans) 05Open>03Resolved [17:23:23] 10Analytics-EventLogging, 10Analytics-Kanban, 10Patch-For-Review: Implement EventLogging Hive refinement - https://phabricator.wikimedia.org/T162610#3168526 (10fdans) [17:23:26] 10Analytics, 10Patch-For-Review, 10Performance-Team (Radar): Explore NavigationTiming by faceted properties - EventLogging refine - https://phabricator.wikimedia.org/T166414#3295705 (10fdans) 05Resolved>03Open [17:24:28] 10Analytics, 10Analytics-Wikistats: Deploy Wikistats and analytics.wikimedia.org via SCAP - https://phabricator.wikimedia.org/T170429#4091989 (10fdans) p:05Normal>03Low [17:25:30] 10Analytics: User limits for stat machines. Limit space on /home dir and possibly /tmp - https://phabricator.wikimedia.org/T151904#4092007 (10fdans) p:05Normal>03Low [17:25:47] 10Analytics: User limits for stat machines. Limit space on /home dir and possibly /tmp - https://phabricator.wikimedia.org/T151904#2831653 (10fdans) 05Open>03declined [17:25:49] 10Analytics-Cluster, 10Analytics-Kanban, 10Operations, 10Patch-For-Review: rack/setup/install replacement to stat1005 (stat1002 replacement) - https://phabricator.wikimedia.org/T165368#4092012 (10fdans) [17:28:26] 10Analytics, 10Analytics-Cluster: Import Kafka messages into HDFS authenticating with TLS/SSL - https://phabricator.wikimedia.org/T166832#4092027 (10fdans) p:05High>03Lowest [17:29:52] 10Analytics, 10Research: wdqs_extract jobs should probably be stopped and deleted - https://phabricator.wikimedia.org/T191037#4092034 (10leila) [17:57:17] fdans, milimetric : do we use a polyfill for promises? [18:01:11] (03CR) 10Amitjoki: "> You can ammend commit message before comitting just use on this" [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422407 (owner: 10Amitjoki) [18:06:27] ottomata: ping? [18:06:43] hiya [18:07:04] ottomata: I can't connect to the JDBC thrift server [18:07:15] on my client: Could not open client transport with JDBC Uri:jdbc:hive2://localhost:10000:null [18:07:36] on terminal where I create the ssh tunnel: channel 2: open failed: connect failed: Connection timed out [18:09:25] hm [18:09:35] ::null hm [18:09:42] chelsyx: [18:11:15] timed out [18:11:32] hmm, i was able to just at the very least connect to it via tcp from my local laptop througha tunnel [18:13:30] chelsyx: i can connect via a local beeline client too [18:13:35] so your server is up and running [18:13:39] but, i don't see the hive databases [18:13:39] ottomata: hm... Do you have a minute to hangout so that I can share screen with you? [18:13:41] fdans, milimetric : did you tested the responsive changeset with opera mobile/opera mini? [18:13:42] sure! [18:13:57] chelsyx: come to the batcave! https://hangouts.google.com/hangouts/_/wikimedia.org/a-batcave?authuser=1 [18:14:05] https://hangouts.google.com/hangouts/_/wikimedia.org/a-batcave?authuser=1https://hangouts.google.com/hangouts/_/wikimedia.org/a-batcave [18:14:07] gah [18:14:08] this one [18:14:08] https://hangouts.google.com/hangouts/_/wikimedia.org/a-batcave [18:14:38] chelsyx: did you remember to source the spark-env.sh file? [18:14:39] :) [18:15:03] ottomata: I did. but let me do it again [18:15:28] ottomata: I think that I forgot to ensuer /etc/prometheus as directory in the refactoring.. it does not auto-create it [18:15:36] :) [18:15:52] makes sense elukey, maybe just put that in some promethues class somewhere [18:16:34] yeah... I confused it with the fact that if the parent dir is defined in puppet, it is auto-required [18:16:37] * elukey cries [18:29:59] chelsyx: !connect jdbc:hive2://localhost:10000 [18:30:35] * elukey off! [18:39:56] nuria_: no, haven't tested with that yet, I'll try to get my hands on an old school android tablet tonight [18:40:14] milimetric: i just tested with opera mobile and found couple unrelated bugs [18:40:26] milimetric: so it was mostly fine [18:41:06] oh, great [19:05:28] 10Analytics-Kanban, 10Patch-For-Review: Checklist for geowiki pipeline - https://phabricator.wikimedia.org/T190409#4092625 (10Milimetric) [19:05:48] hahaha, weeeeeeird phab glitch [19:06:02] if you have a task open for editing, notifications from gerrit don't come in until you save [19:06:33] (03PS1) 10Milimetric: Partition successfuly sqooped cu_changes [analytics/refinery] - 10https://gerrit.wikimedia.org/r/423019 (https://phabricator.wikimedia.org/T190409) [19:26:59] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review, 10SEO: Make Google API Python Client Library available on stat* machines - https://phabricator.wikimedia.org/T190767#4092654 (10Ottomata) Done! Notice: /Stage[main]/Packages::Python_google_api/Package[python-google-api]/ensure: created Notice: /Stage[m... [19:27:18] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review, 10SEO: Make Google API Python Client Library available on stat* machines - https://phabricator.wikimedia.org/T190767#4083192 (10Ottomata) a:03Ottomata [19:35:06] 10Analytics, 10Analytics-Kanban, 10ORES, 10Scoring-platform-team, 10Patch-For-Review: Enable ores::base on stat1006 - https://phabricator.wikimedia.org/T181646#4092684 (10Ottomata) Done! [19:37:22] elukey: are you still there? [19:37:24] nope [19:37:26] i see your off [19:37:28] up ^^^ :) [19:37:29] nM! [19:45:36] (03CR) 10Abbe98: [C: 031] Enhance top menu by adding links for quick access to database tables [analytics/quarry/web] - 10https://gerrit.wikimedia.org/r/361299 (owner: 10XXN) [20:04:12] PROBLEM - Kafka MirrorMaker main-eqiad_to_jumbo-eqiad average message produce rate in last 30m on einsteinium is CRITICAL: CRITICAL - scalar(sum(avg_over_time(kafka_producer_producer_metrics_record_send_rate{client_id=kafka-mirror-main-eqiad_to_jumbo-eqiad-.+} [30m]))): 0.0 = 0.0 https://grafana.wikimedia.org/dashboard/db/kafka-mirrormaker?var-datasource=eqiad%2520prometheus%252Fops&var-mirror_name=main-eqiad_to_jumbo-eqiad [20:05:33] PROBLEM - Kafka MirrorMaker main-eqiad_to_jumbo-eqiad average message consume rate in last 30m on einsteinium is CRITICAL: CRITICAL - scalar(sum(avg_over_time(kafka_consumer_consumer_fetch_manager_metrics_all_topics_records_consumed_rate{mirror_name=main-eqiad_to_jumbo-eqiad} [30m]))): 0.0 = 0.0 https://grafana.wikimedia.org/dashboard/db/kafka-mirrormaker?var-datasource=eqiad%2520prometheus%252Fops&var-mirror_name=main-eqiad_to_ [20:06:31] 10Analytics-Kanban, 10WikimediaUI Style Guide, 10Patch-For-Review: Setup & integrate analytics on design.wikimedia.org - https://phabricator.wikimedia.org/T188786#4092724 (10Volker_E) [20:12:26] !log blacklisted mediawiki.job topics from main -> jumbo MirrorMaker again, don't want to page over the weekend while this still is not stable. T189464 [20:12:28] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [20:12:29] T189464: Fix Mirror Maker erratic behavior when replicating from main-eqiad to jumbo - https://phabricator.wikimedia.org/T189464 [20:12:42] RECOVERY - Kafka MirrorMaker main-eqiad_to_jumbo-eqiad average message consume rate in last 30m on einsteinium is OK: OK - scalar(sum(avg_over_time(kafka_consumer_consumer_fetch_manager_metrics_all_topics_records_consumed_rate{mirror_name=main-eqiad_to_jumbo-eqiad} [30m]))) within thresholds https://grafana.wikimedia.org/dashboard/db/kafka-mirrormaker?var-datasource=eqiad%2520prometheus%252Fops&var-mirror_name=main-eqiad_to_jumb [20:13:13] RECOVERY - Kafka MirrorMaker main-eqiad_to_jumbo-eqiad average message produce rate in last 30m on einsteinium is OK: OK - scalar(sum(avg_over_time(kafka_producer_producer_metrics_record_send_rate{client_id=kafka-mirror-main-eqiad_to_jumbo-eqiad-.+} [30m]))) within thresholds https://grafana.wikimedia.org/dashboard/db/kafka-mirrormaker?var-datasource=eqiad%2520prometheus%252Fops&var-mirror_name=main-eqiad_to_jumbo-eqiad [20:17:13] PROBLEM - Kafka MirrorMaker main-eqiad_to_jumbo-eqiad max lag in last 10 minutes on einsteinium is CRITICAL: CRITICAL - scalar(max(max_over_time(kafka_burrow_partition_lag{group=kafka-mirror-main-eqiad_to_jumbo-eqiad} [10m]))): 652106.0 100000.0 https://grafana.wikimedia.org/dashboard/db/kafka-mirrormaker?var-datasource=eqiad%2520prometheus%252Fops&var-mirror_name=main-eqiad_to_jumbo-eqiad [20:18:16] (03CR) 10Milimetric: "joal / nuria: this is tested and ready to review" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/423019 (https://phabricator.wikimedia.org/T190409) (owner: 10Milimetric) [20:18:48] (03CR) 10Milimetric: [C: 031] Partition successfuly sqooped cu_changes [analytics/refinery] - 10https://gerrit.wikimedia.org/r/423019 (https://phabricator.wikimedia.org/T190409) (owner: 10Milimetric) [20:24:22] RECOVERY - Kafka MirrorMaker main-eqiad_to_jumbo-eqiad max lag in last 10 minutes on einsteinium is OK: OK - scalar(max(max_over_time(kafka_burrow_partition_lag{group=kafka-mirror-main-eqiad_to_jumbo-eqiad} [10m]))) within thresholds https://grafana.wikimedia.org/dashboard/db/kafka-mirrormaker?var-datasource=eqiad%2520prometheus%252Fops&var-mirror_name=main-eqiad_to_jumbo-eqiad [20:55:32] !log accidentally killed mediawiki-geowiki-monthly-coord, and then restarted it [20:55:33] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [21:22:36] 10Analytics-Kanban, 10Patch-For-Review: Checklist for geowiki pipeline - https://phabricator.wikimedia.org/T190409#4093002 (10Milimetric) [21:23:26] 10Analytics-Kanban, 10Patch-For-Review: Checklist for geowiki pipeline - https://phabricator.wikimedia.org/T190409#4072764 (10Milimetric) @JAllemandou I was trying to delete the daily data from druid, and I used the refinery/bin script, and it said everything was ok, but the segment is still there. I then tri... [21:31:35] there's all kinds of disagreements on - vs. _ on here: https://superset.wikimedia.org/druiddatasourcemodelview/list/ [21:31:39] which is it yall? [21:53:00] oh man, I worked with superset for a couple hours and my eyes are bleeding [21:53:32] (03PS1) 10Nuria: Fixing formatting of dates on time selector [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/423051 (https://phabricator.wikimedia.org/T188208) [22:06:11] 10Analytics-Kanban, 10Analytics-Wikistats: Broken "1-month" interval on timerange selector - https://phabricator.wikimedia.org/T191097#4093239 (10Nuria) [22:06:18] 10Analytics-Kanban, 10Analytics-Wikistats: Broken "1-month" interval on timerange selector - https://phabricator.wikimedia.org/T191097#4093249 (10Nuria) [22:07:08] 10Analytics-Kanban, 10Analytics-Wikistats: Broken "1-month" interval on timerange selector - https://phabricator.wikimedia.org/T191097#4093239 (10Nuria) Patch: https://gerrit.wikimedia.org/r/#/c/423051/ [22:08:50] (03PS2) 10Nuria: Fixing formatting of dates on time selector [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/423051 (https://phabricator.wikimedia.org/T189266) [22:09:03] 10Analytics, 10Analytics-Kanban, 10Analytics-Wikistats, 10Google-Summer-of-Code (2018): GSoC Proposal 2018 : [Analytics] Improvements to Wikistats2 front-end - https://phabricator.wikimedia.org/T189964#4093256 (10sahil505) [22:13:48] (03PS3) 10Nuria: Fixing formatting of dates on time selector [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/423051 (https://phabricator.wikimedia.org/T191097) [22:23:44] 10Analytics-Kanban, 10WikimediaUI Style Guide, 10Patch-For-Review: Setup & integrate analytics on design.wikimedia.org - https://phabricator.wikimedia.org/T188786#4093302 (10Volker_E) [22:27:06] 10Analytics-Kanban, 10WikimediaUI Style Guide: Setup & integrate analytics on design.wikimedia.org - https://phabricator.wikimedia.org/T188786#4019366 (10Volker_E) 05Open>03Resolved a:05Prtksxna>03Volker_E [22:28:57] (03CR) 10Nuria: "Please see: https://www.mediawiki.org/wiki/Gerrit/Commit_message_guidelines and see if you can edit your commit message a bit to abide to " [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/422407 (owner: 10Amitjoki) [22:35:50] (03CR) 10Nuria: "Where is this code appears that we are waiting for _SUCCESS from scoop job?" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/423019 (https://phabricator.wikimedia.org/T190409) (owner: 10Milimetric) [22:40:20] milimetric: i can help with dashboards [22:45:59] 10Analytics, 10Analytics-Kanban, 10Wikidata, 10Patch-For-Review, 10Wikidata-Ministry-Of-Magic: Add Wikidata website extract oozie job - https://phabricator.wikimedia.org/T191022#4091228 (10Nuria) @Jonas: do you want all requests to www.wikidata.org to be included, correct? Do you care about request to w... [22:54:04] 10Analytics-Kanban, 10Analytics-Wikistats: Broken "1-month" interval on timerange selector - https://phabricator.wikimedia.org/T191097#4093361 (10Nuria) a:03Nuria [22:55:02] 10Analytics-Kanban, 10Patch-For-Review: Zooming in or out in wikistats shouldn't alter the number of metrics shown - https://phabricator.wikimedia.org/T190782#4093363 (10Nuria) 05Open>03Resolved [22:56:25] 10Analytics-Kanban: Adapt TableChart to compact size - https://phabricator.wikimedia.org/T188952#4093370 (10Nuria) [22:56:28] 10Analytics-Kanban, 10Patch-For-Review: Limit length of table and add "Load more rows" button - https://phabricator.wikimedia.org/T188953#4093369 (10Nuria) 05Open>03Resolved [22:56:43] 10Analytics-Kanban: The size of metric areas in the dashboard should scale to available window space - https://phabricator.wikimedia.org/T187345#4093372 (10Nuria) [22:56:45] 10Analytics-Kanban: Allow switching metrics in a dashboard widget - Carrousel - https://phabricator.wikimedia.org/T187440#4093371 (10Nuria) 05Open>03Resolved [22:57:34] 10Analytics-Kanban, 10Patch-For-Review: Oozie job to compute geowiki on top of sqooped data - https://phabricator.wikimedia.org/T188113#4093380 (10Nuria) How do we ensure this job runs after the partition job? [22:57:47] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review: Phasing away one of the mobile apps session metrics jobs. - https://phabricator.wikimedia.org/T190459#4093381 (10Nuria) 05Open>03Resolved [22:57:58] 10Analytics-Kanban, 10Patch-For-Review: unique devices data for january not in cassandra - https://phabricator.wikimedia.org/T189740#4093382 (10Nuria) 05Open>03Resolved [22:58:09] 10Analytics-Kanban, 10User-Elukey: Reboot all Analytics hosts for Kernel upgrade - https://phabricator.wikimedia.org/T188594#4093383 (10Nuria) 05Open>03Resolved [22:58:14] nuria: I got a basic one with a table but for some reason the world map doesn’t work with this datasource [22:58:30] it’s called Geowiki Monthly if you wanna mess with it [22:58:40] 10Analytics-Kanban, 10Patch-For-Review: Correct mediawiki-reduced loading job - https://phabricator.wikimedia.org/T189448#4093384 (10Nuria) 05Open>03Resolved [22:59:29] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review: Correct documentation for the referer_class field in pageview_hourly - https://phabricator.wikimedia.org/T190579#4093385 (10Nuria) 05Open>03Resolved [22:59:53] 10Analytics-Cluster, 10Analytics-Kanban, 10Operations, 10Patch-For-Review, 10User-Elukey: rack/setup/install analytics107[0-7] - https://phabricator.wikimedia.org/T188294#4093386 (10Nuria) 05Open>03Resolved [23:00:23] 10Analytics-Kanban, 10Patch-For-Review, 10User-Elukey: Move away from jmxtrans in favor of prometheus jmx_exporter - https://phabricator.wikimedia.org/T175344#4093390 (10Nuria) [23:00:26] 10Analytics-Kanban, 10Patch-For-Review, 10User-Elukey: Add the prometheus jmx agent to AQS Cassandra - https://phabricator.wikimedia.org/T184795#4093388 (10Nuria) 05Open>03Resolved [23:01:17] 10Analytics-Kanban, 10RESTBase-API, 10Patch-For-Review, 10Services (done): Update AQS pageview-top definition - https://phabricator.wikimedia.org/T184541#4093395 (10Nuria) 05Open>03Resolved [23:01:36] 10Analytics-Kanban, 10Analytics-Wikistats: Make the Wikistats 2 UI responsive - https://phabricator.wikimedia.org/T186812#4093396 (10Nuria) 05Open>03Resolved [23:02:10] (03PS1) 10Nuria: [WIP] Add wikidata tag [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/423064 (https://phabricator.wikimedia.org/T191022) [23:02:20] 10Analytics, 10Services (watching): Add Accept header to webrequest logs - https://phabricator.wikimedia.org/T170606#4093398 (10Nuria) p:05Triage>03Normal [23:04:16] 10Analytics: Varnishkafka does not play well with varnish 5.2 - https://phabricator.wikimedia.org/T177647#4093403 (10Nuria) ping @elukey is this still true? I though we migrated to varnish 5 awhile back [23:05:12] (03CR) 10jerkins-bot: [V: 04-1] [WIP] Add wikidata tag [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/423064 (https://phabricator.wikimedia.org/T191022) (owner: 10Nuria) [23:06:29] 10Analytics: Track overall traffic, without any filtering, broken down into major categories, for internal use. - https://phabricator.wikimedia.org/T117236#4093409 (10Nuria) See data in pivot for pageviews and webrequest: https://pivot.wikimedia.org/#pageviews-hourly https://pivot.wikimedia.org/#webrequest [23:06:38] 10Analytics: Track overall traffic, without any filtering, broken down into major categories, for internal use. - https://phabricator.wikimedia.org/T117236#4093410 (10Nuria) 05Open>03Resolved [23:08:47] 10Analytics: Weird performance of sqoop job on Edit Reconstruction - https://phabricator.wikimedia.org/T172579#4093415 (10Nuria) I ma going to close this ticket as since then we have revamped quite a bit our scoop jobs. [23:08:51] 10Analytics: Weird performance of sqoop job on Edit Reconstruction - https://phabricator.wikimedia.org/T172579#4093416 (10Nuria) 05Open>03Resolved [23:09:15] 10Analytics: Measure portal pageviews - https://phabricator.wikimedia.org/T162618#4093417 (10Nuria) Tagging might be coming soon and we can FINALLY do this.