[04:55:02] (03Abandoned) 10Mrcornacchio: Fix stylelint errors and TopicExplorer placeholder contrast [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/423176 (https://phabricator.wikimedia.org/T185533) (owner: 10Mrcornacchio) [06:08:25] 10Analytics: Varnishkafka does not play well with varnish 5.2 - https://phabricator.wikimedia.org/T177647#3665564 (10R4q3NWnUx2CEhVyr) From what I can see, the change seem pretty straightforward (call to VUT_Init/VUT_Setup to replace all non kafka setup and the VUT_Main as main loop) except for the polling of k... [06:50:01] Morning team [06:51:09] (03PS1) 10Mrcornacchio: Fix stylelint errors and TopicExplorer placeholder [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/426848 (https://phabricator.wikimedia.org/T185533) [06:51:51] o/ [07:03:21] elukey: any objection to me deploying this morning? [07:04:37] 10Analytics-Kanban, 10Patch-For-Review: Use spark to split webrequest on tags - https://phabricator.wikimedia.org/T164020#4132759 (10JAllemandou) [07:13:27] joal: not sure, I think I am going to say no [07:13:35] :D [07:13:44] elukey: no problem - I was sure you'd be on the paranoid side :-P [07:13:54] hahahaha nono I am joking! [07:14:02] please go ahead, I have no opposition [07:14:05] I am, too :) [07:14:09] Cool ,thanks [07:14:21] spark finally gave away, after a phoenix revival [07:14:28] whenever you have time I'd have to brain bounce on IRC about some ideas for zookepeer [07:14:57] When you wish [07:16:01] all right lemme dump my ideas in here :) [07:16:13] premise: [07:16:52] - we are running zookeeper 3.4.5 in all our clusters since everything is still Debian Jessie based [07:17:39] - Debian stretch offers 3.4.9, that it is still compatible but resolves a lot of bugs etc.. [07:18:02] - the new conf100[456] nodes are on stretch, meanwhile conf100[123] are on Jessie [07:20:35] - the upgrade process is a bit tricky, since all the zk nodes have the list of hosts in their ensemble in their configs and swapping one means that during a rolling restart at some point there will be a little "inconsistency" between nodes having the new list of hosts in the ensemble and old ones still running the current on [07:21:03] the idea from the zk doc is to limit the upgrade to one host at the time, and roll restart all the clients [07:21:17] (since they need to have the list of zk nodes too) [07:21:25] elukey: first dumb question: `the upgrade` == moving conf100[234] to stretch ? [07:21:39] s/4/1 [07:21:51] nono the upgrade is to swap 123 to 456 [07:21:57] right [07:22:13] so swap 1 with 4, then roll restart, 2 with 5, etc.. [07:22:22] ideally doing it in different days [07:22:46] now my "fear", especially after https://phabricator.wikimedia.org/T182924#4129301, is that the swap will mix two different things [07:22:55] and when we say roll-restart, we talk about 4,2,3 at first (since 1 and 4 have been swapped) [07:22:56] 1) upgrade of zk from 3.4.5 to 3.4.9 [07:23:05] 2) change list of nodes in the ensemble [07:23:39] so the swap procedure (expanding the "roll restart") would be something like: [07:23:43] (03CR) 10VolkerE: Darken subdued text and alter footer area for WCAG AA compliance (031 comment) [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/419958 (https://phabricator.wikimedia.org/T185533) (owner: 10Sturmkrahe) [07:24:34] 1) starting with conf100[123] in the ensemble, swap 1 with 4 in puppet and run on all the hosts. This will change the zk config on the config files but not restart zk [07:24:43] 2) conf1004 will join the ensemble as "follower" [07:25:19] 3) the swap procedure requires to swap the other two "followers" (say conf100[12], assuming conf1003 the leader) [07:25:29] err requires to restart the other two [07:25:35] and finally the leader [07:25:51] so eventually the cluster will be conf100[234] [07:26:01] after this, all "clients" needs to be restarted [07:26:13] since they have conf100[123] in their config [07:26:22] not a big deal but better to be consistent [07:26:46] (and of course conf1001 needs to stay down otherwise the clients contacting it will get confused) [07:27:22] elukey: So I'm assuming you shut-off conf1001 after the restarts of 2 and 3 [07:27:46] I can mask zookeeper with systemctl in there, not shutoff since etcd runs in there [07:27:50] elukey: another question: What are the clients of conff100X currently? [07:28:07] makes sense elukey about shut-off [07:28:07] Hadoop master nodes, Kafka Jumbo/Analytics/Main [07:28:25] elukey: ok [07:28:35] Well, the procedure que explain makes sense [07:28:50] Now about upgrading first or not, I really can't say [07:29:08] super :) if you have patience I'd need to annoy you for 5mins more :D [07:29:34] elukey: so far, nothing has eannoyed me :D [07:29:43] Moritz helped a lot on Friday and built 3.4.9 for jessie, that I've tested in labs, worked fine :) [07:29:43] elukey: You're far less a pain than spark is ;) [07:30:01] so now I am wondering what to do [07:31:00] 1) upgrade all our clusters to 3.4.9, starting from Druid, in order to test it carefully in less busy environments before main and unlocking some good things like being able to reimage one druid host at the time to Stretch transparently [07:31:17] (since zk wouldn't change) [07:32:13] 2) limit the upgrade to 3.4.9 to conf100[123], as a way to upgrade first and then swap the nodes [07:34:17] elukey: with 1 you get more out, but it's also more complicated :( [07:34:53] yeah, but at the same time I am wondering the risks to upgrade main directly [07:35:06] since in labs it looks fine, but in prod is always a different thing [07:35:23] yeah - I know ! [07:37:05] so an alternative solution could be to think about upgrading only the "main" zk clusters, starting from codfw [07:37:15] so both of them will be 3.4.9 [07:37:26] and we wouldn't touch druid [07:37:43] but we'd start from a less impactful zk cluster [07:38:05] if it goes well in codfw, we'd be less concerned in proceeding with eqiad [07:38:21] elukey: less busy for sure, but still impactfull -- kafka is using it, and I think kafka is used for prod jobs and all [07:39:05] yes true, but kafka main codfw is definitely less busy/important than eqiad [07:39:11] Makes sense [07:40:00] lemme have a chat with the services team :) [07:40:04] thanks for the brain bounce! [07:40:16] np elukey :) [07:40:37] if you need one for spark lemme know, but probably I wouldn't be able to help a lot :( [07:42:05] (03PS4) 10Joal: Update MediawikiHistory-denormalize [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/425798 [07:42:25] elukey: Thanks for offering :) [07:42:50] (03CR) 10Joal: [C: 032] "Merging for deploy" [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/425798 (owner: 10Joal) [07:53:38] (03Merged) 10jenkins-bot: Update MediawikiHistory-denormalize [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/425798 (owner: 10Joal) [07:59:30] (03PS1) 10Joal: Update changelog.md to v0.0.62 [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/426855 [07:59:38] elukey: --^ if you have a minute :P) [08:00:18] (03CR) 10Elukey: [C: 031] Update changelog.md to v0.0.62 [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/426855 (owner: 10Joal) [08:02:22] Thanks mate :) [08:02:44] (03CR) 10Joal: [V: 032 C: 032] "Merging for deploy" [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/426855 (owner: 10Joal) [08:03:12] !log Deploying refinery source v0.0.62 from tin [08:03:13] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [08:03:47] !log Correction - Deploying refinery-source v0.0.62 using Jenkins ! [08:03:48] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [08:12:21] (03PS3) 10Joal: Correct Mediawiki-history denormalize job [analytics/refinery] - 10https://gerrit.wikimedia.org/r/425569 [08:13:16] (03CR) 10Joal: [V: 032 C: 032] "Merging for deploy" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/425569 (owner: 10Joal) [08:21:27] (03PS1) 10Joal: Update wikidata-articleplaceholder jar version [analytics/refinery] - 10https://gerrit.wikimedia.org/r/426859 [08:21:45] (03CR) 10Joal: [V: 032 C: 032] "Merging for deploy" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/426859 (owner: 10Joal) [08:22:10] !log Deploying refinery from tin [08:22:11] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [08:25:46] 10Analytics-Cluster, 10Analytics-Kanban, 10Datasets-Archiving, 10Datasets-Webstatscollector, 10Patch-For-Review: Mediacounts missing top1000 files after 2018-01-01 - https://phabricator.wikimedia.org/T185419#3915720 (10JAllemandou) a:03JAllemandou [08:26:18] 10Analytics-Kanban, 10Patch-For-Review: Improve mediwiki-history performance - https://phabricator.wikimedia.org/T189449#4132840 (10JAllemandou) [08:29:59] !log Deploying refnery onto HDFS [08:30:00] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [08:35:49] !log Restarting wikidata-articleplaceholder oozie job after last week's failures [08:35:50] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [08:41:00] !log Restart Mediawiki-history job after new patches [08:41:01] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [08:51:45] (03PS4) 10Joal: Update sqoop script to allow for parquet import [analytics/refinery] - 10https://gerrit.wikimedia.org/r/423235 [08:52:50] (03PS5) 10Joal: Update sqoop script to allow for parquet import [analytics/refinery] - 10https://gerrit.wikimedia.org/r/423235 [09:21:14] 10Analytics-Tech-community-metrics: Affiliations/enrollments not always correctly synced between user data in database and frontend indices - https://phabricator.wikimedia.org/T191779#4132965 (10Aklapper) [09:22:24] 10Analytics-Tech-community-metrics: Affiliations/enrollments not always correctly synced between user data in database and frontend indices - https://phabricator.wikimedia.org/T191779#4116189 (10Aklapper) Yes, keeping this ticket open for further investigation. [09:48:37] 10Analytics, 10Analytics-Wikistats: Routing code allows invalid routes - https://phabricator.wikimedia.org/T188792#4132994 (10Liuxinyu970226) >>! 在T188792#4114911中,@Aklapper写道: > This task has "Unbreak Now!" priority for five weeks now. That means [[ https://www.mediawiki.org/wiki/Phabricator/Project_managemen... [09:54:12] !log Update wmf.mediawiki_metrics table for T190058 [09:54:14] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [09:54:14] T190058: Make 'metric' field not a partition in mediawiki_metrics - https://phabricator.wikimedia.org/T190058 [10:04:27] !log Restart metrics job after table update [10:04:28] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [10:06:02] Jonas_WMDE: Hello :) [10:06:23] Jonas_WMDE: When running long queries, can you please follow this procedure? https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Queries#Run_long_queries_in_a_screen_session_and_in_the_nice_queue [10:06:31] Jonas_WMDE: Thanks ! [10:07:15] sorry joal ! [10:07:30] no worries Jonas_WMDE - Just a quick note :) [10:07:53] Jonas_WMDE: It'll be easier when data will be partitioned into smaller tables :) [10:28:03] (03CR) 10Lucas Werkmeister (WMDE): "> Don’t you need to add the script to the cron file as well?" (032 comments) [analytics/wmde/scripts] - 10https://gerrit.wikimedia.org/r/425872 (https://phabricator.wikimedia.org/T191891) (owner: 10Ladsgroup) [10:32:50] joal: with my immese joy, I discovered this https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=838385 [10:32:53] :D [10:41:55] * elukey lunch + errand! bb in ~2h [12:41:04] joal: https://github.com/wikimedia/restbase/pull/984/commits/f2309e244e7b8235ef3503a84405addc1b4bcf2d [12:57:47] I am checking if Druid's brokers need to be tuned for this [12:57:57] in theory no [13:09:51] 10Analytics-Kanban, 10Patch-For-Review, 10User-Elukey: Refresh zookeeper nodes in eqiad - https://phabricator.wikimedia.org/T182924#4133441 (10elukey) Interesting discovery today while testing zookeeper on stretch. I tried to clean up /etc/zookeeper/conf and ran puppet to check if everything was going to be... [13:19:37] (03PS5) 10Amire80: WIP Analyzing failed ULS searches [analytics/limn-language-data] - 10https://gerrit.wikimedia.org/r/413270 (https://phabricator.wikimedia.org/T190630) [13:20:17] (03PS6) 10Amire80: WIP Analyzing failed ULS searches [analytics/limn-language-data] - 10https://gerrit.wikimedia.org/r/413270 (https://phabricator.wikimedia.org/T190630) [13:46:00] o/ [13:47:41] ottomata: o/ [13:52:16] Hey ottomata - Refinery version 0.0.62 is available, you could possibly re-productionize refine :) [13:54:22] joal: anything against druid + connection reuse from restbase? [13:55:38] elukey: I have seen the Restbase one, but not druid [13:55:46] ok for restbase, but druid? [13:56:19] joal: not sure I follow [13:56:36] hm - I have seen the patch for restbase, works for me [13:56:38] elukey: -^ [13:56:50] However I don't get what the change for duid is [13:57:33] ahh! There shouldn't be any, namely rb will keep the tcp connection open and will do more than one http request over it. Druid should be able to support it transparently [13:57:57] but I might miss something so this is why I am asking [13:58:00] sounds:) [13:58:08] super :) [13:58:32] elukey: I am completely unable to say if Druid will do it, but you know way better than me in TCP oriented stuff [13:59:14] joal: cool will do, its deployed etc/ i just need to fix cron job jar? [14:08:15] 10Analytics, 10Analytics-Kanban, 10Analytics-Wikistats: Upgrading Wikistats 2.0 footer UI/design - https://phabricator.wikimedia.org/T191672#4133626 (10sahil505) @mforns Thanks for the feedback. I'll submit a patch for this after we merge the 'NaN' patch. > In my prior comment about text brightness and con... [14:25:16] 10Analytics-Kanban: Compile a request data set for caching research and tuning - https://phabricator.wikimedia.org/T128132#2064831 (10dayyoung0324) Hello Danielsberger and everyone, Since I am figure out an algorithm to analyze workloads (e.g., page read or page create) over a certain period of time (e.g., o... [14:32:24] 10Analytics-Kanban, 10Patch-For-Review, 10User-Elukey: Refresh zookeeper nodes in eqiad - https://phabricator.wikimedia.org/T182924#4133785 (10elukey) Tested in labs a migration with two stretch hosts running zk 3.4.9 and one jessie host running zk 3.4.9 (Moritz's backport) and the host swap happened without... [14:38:29] ok I managed to swap a zookeeper host in labs without any issue [14:38:47] ottomata: do you have a min for a question (even on IRC) ? [14:38:53] elukey: sure [14:39:00] irc ok for now? [14:39:04] thanks! yep yep [14:39:30] (03PS1) 10Fdans: Release 2.2.2 [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/426932 [14:39:33] so the TL;DR is that Moritz built zookeeper 3.4.9 for jessie (picked stretch version, rebuilt, tested in labs) [14:39:54] and this version fixes a weird bug that sometimes happens to 3.4.5 [14:40:04] (03CR) 10Fdans: [C: 032] Release 2.2.2 [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/426932 (owner: 10Fdans) [14:40:04] (and I'd like to avoid it happening while I am swapping hosts) [14:40:21] so what I'd like to do is to upgrade zk main to 3.4.9 and then swap the hosts [14:40:29] k [14:40:43] sounds easy enough? [14:40:53] my plan would be to start in main-codfw (for consistency and less things might explode if I am not lucky) [14:41:02] and then main eqiad [14:41:06] yep yep seems fine [14:41:11] does it sound good to you? [14:41:21] I'd avoid upgrading druid for the moment [14:41:22] ya sounds good! [14:41:42] even if, if we upgrade those clusters as well, we could reimage one host at the time to stretch ! [14:41:50] maybe later on :) [14:42:09] +1 to that too! [14:42:14] \o/ [14:42:16] are the new conf nodes stretch? [14:42:19] yep [14:42:21] cool [14:46:19] 10Analytics, 10Performance-Team (Radar): Possible statsv corruption? - https://phabricator.wikimedia.org/T189530#4133814 (10Ottomata) Not sure I totally understand the problem. What is an example of a good metric name and a corrupted one here? You are saying that metric names are coming in with dots in the n... [14:46:36] 10Analytics, 10Services (watching): Update to latest kafkacat - https://phabricator.wikimedia.org/T182163#4133815 (10Ottomata) That timeline sounds fine to me! [14:47:13] elukey: btw, thanks so much for being such a great varnishkafka maintainer :) [14:47:35] elukey: when you have time, i'd like to brain bounce kafka main migration [14:47:38] it seems alittle tricky [14:47:41] with mirror maker in the mix [14:55:16] ottomata: wow thanks! Sure, whenever you want (maybe after meetings?) [14:56:06] ya [14:57:16] (03Merged) 10jenkins-bot: Release 2.2.2 [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/426932 (owner: 10Fdans) [14:58:51] (03PS1) 10Fdans: Release 2.2.2 [analytics/wikistats2] (release) - 10https://gerrit.wikimedia.org/r/426939 [15:03:13] (03CR) 10Fdans: [C: 032] Release 2.2.2 [analytics/wikistats2] (release) - 10https://gerrit.wikimedia.org/r/426939 (owner: 10Fdans) [15:03:38] 10Analytics-Kanban: Compile a request data set for caching research and tuning - https://phabricator.wikimedia.org/T128132#4133855 (10Nuria) @dayyoung0324 Please do not post on tickets that are closed, as I mentioned the data available that most resembles your request is at: https://analytics.wikimedia.org/data... [15:05:19] (03Merged) 10jenkins-bot: Release 2.2.2 [analytics/wikistats2] (release) - 10https://gerrit.wikimedia.org/r/426939 (owner: 10Fdans) [15:05:51] 10Analytics, 10Analytics-Data-Quality, 10Analytics-Kanban, 10New-Readers, and 5 others: Proxies information gone from Zero portal - https://phabricator.wikimedia.org/T187014#4133871 (10ema) >>! In T187014#4129884, @Nuria wrote: > +1 let me know when it is in place and i can help check things square again o... [15:21:47] 10Analytics-Kanban: Compile a request data set for caching research and tuning - https://phabricator.wikimedia.org/T128132#4133924 (10dayyoung0324) @Nuria Since I have some questions related to this task T128132, should I create a new task to post my question? For example, the dataset sizes at https://analytics.... [15:38:25] 10Analytics: update piwik to latest stable - https://phabricator.wikimedia.org/T192298#4133946 (10Nuria) [15:42:11] 10Analytics, 10User-Elukey: Update piwik to latest stable - https://phabricator.wikimedia.org/T192298#4133981 (10elukey) p:05Triage>03Normal a:03elukey [15:51:33] 10Analytics, 10Analytics-Wikistats: Unexpected increase in traffic for 4 languages in same region, on smaller projects - https://phabricator.wikimedia.org/T136084#4134016 (10mforns) p:05Triage>03Normal [15:52:21] 10Analytics, 10Analytics-Wikistats: Unexpected increase in traffic for 4 languages in same region, on smaller projects - https://phabricator.wikimedia.org/T136084#4134017 (10mforns) p:05Normal>03Triage [16:03:26] 10Analytics, 10User-Elukey: Update piwik to latest stable - https://phabricator.wikimedia.org/T192298#4134063 (10fdans) p:05Normal>03Low [16:07:17] 10Analytics: Wikistats2 and SEO - https://phabricator.wikimedia.org/T192172#4134075 (10fdans) p:05Triage>03Normal [16:10:45] 10Analytics, 10Services (watching): Update to latest kafkacat - https://phabricator.wikimedia.org/T182163#4134088 (10mforns) p:05Triage>03Normal [16:11:10] 10Analytics: Decomission old analytics kafka cluster - https://phabricator.wikimedia.org/T183303#4134089 (10mforns) p:05Triage>03Normal [16:13:27] 10Analytics, 10Discovery, 10Wikidata, 10Wikidata-Query-Service, 10Wikimedia-Stream: Increase kafka event retention to 14 or 21 days - https://phabricator.wikimedia.org/T187296#3970994 (10mforns) We'll have this on our radar, until things are stable. [16:13:46] 10Analytics, 10Discovery, 10Wikidata, 10Wikidata-Query-Service, 10Wikimedia-Stream: Increase kafka event retention to 14 or 21 days - https://phabricator.wikimedia.org/T187296#4134096 (10mforns) p:05Triage>03Low [16:14:47] 10Analytics-Cluster, 10Analytics-Kanban, 10Patch-For-Review: Provision new Kafka cluster(s) with security features - https://phabricator.wikimedia.org/T152015#4134113 (10mforns) [16:14:53] 10Analytics, 10Analytics-Cluster, 10Operations, 10Traffic, and 2 others: Encrypt Kafka traffic, and restrict access via ACLs - https://phabricator.wikimedia.org/T121561#4134111 (10mforns) 05Open>03Resolved a:03mforns [16:15:48] 10Analytics, 10Discovery, 10Patch-For-Review: Send Mediawiki Kafka logs to Kafka jumbo cluster with TLS encryption - https://phabricator.wikimedia.org/T126494#4134114 (10mforns) p:05Normal>03Triage [16:25:15] 10Analytics, 10Patch-For-Review: Update druid to latest release (0.11) - https://phabricator.wikimedia.org/T164008#4134127 (10mforns) p:05High>03Triage [16:25:19] 10Analytics, 10Patch-For-Review: Update druid to latest release (0.11) - https://phabricator.wikimedia.org/T164008#3218113 (10mforns) p:05Triage>03High [16:25:50] 10Analytics, 10Pageviews-API, 10User-Elukey: Improve user management for AQS - https://phabricator.wikimedia.org/T142073#4134129 (10mforns) p:05Normal>03Low [16:26:08] 10Analytics, 10Patch-For-Review, 10User-Elukey: Update druid to latest release (0.11) - https://phabricator.wikimedia.org/T164008#4134130 (10elukey) [16:26:10] 10Analytics, 10Operations, 10Traffic, 10User-Elukey: Add VSL error counters to Varnishkafka stats - https://phabricator.wikimedia.org/T164259#4134131 (10mforns) p:05Normal>03Low [16:30:23] 10Analytics: Investigate adding user-friendly testing functionality to Reportupdater - https://phabricator.wikimedia.org/T156523#4134132 (10mforns) p:05Triage>03Low [16:30:37] 10Analytics, 10Performance-Team (Radar): Eventlogging client needs to support offline events - https://phabricator.wikimedia.org/T162308#4134134 (10mforns) p:05Triage>03Low [16:32:33] 10Analytics: Alarm on data quality issues - https://phabricator.wikimedia.org/T159840#4134136 (10mforns) [16:32:35] 10Analytics: Productionize analysis of editcount vs per_user_revision_count - https://phabricator.wikimedia.org/T168648#4134135 (10mforns) [16:32:41] 10Analytics: Productionize analysis of editcount vs per_user_revision_count - https://phabricator.wikimedia.org/T168648#4134137 (10mforns) p:05Triage>03Low [16:33:31] 10Analytics: Add pagecounts by article and top pagecounts to AQS - https://phabricator.wikimedia.org/T173720#4134139 (10mforns) p:05Triage>03Low [16:33:44] 10Analytics, 10Analytics-Cluster: Combine Hive Year / Month / Day / Hour partitions into ISO date string - https://phabricator.wikimedia.org/T177097#4134140 (10mforns) p:05Triage>03Low [16:35:40] 10Analytics: Easter Egg: wikistats classic style on wikistats 2.0 - https://phabricator.wikimedia.org/T177408#4134141 (10mforns) p:05Triage>03Normal [16:37:31] 10Analytics, 10Analytics-Wikistats: Make Wikistats data easily embedable on-wiki - https://phabricator.wikimedia.org/T178016#4134152 (10mforns) p:05Triage>03Low [16:37:58] 10Analytics, 10Analytics-Wikistats: Add overall ORES scores to Wikistats - https://phabricator.wikimedia.org/T178019#4134159 (10mforns) p:05Triage>03Low [16:38:08] 10Analytics, 10Services (watching): Update to latest kafkacat - https://phabricator.wikimedia.org/T182163#4134160 (10faidon) kafkacat 1.3.1-1~bpo9+1 should be available from Debian's stretch-backports on all stretch hosts: ``` $ rmadison -a amd64 kafkacat kafkacat | 1.3.0-1+b1 | stable | amd64... [16:38:09] 10Analytics: Survey dashboard layout for dashiki - https://phabricator.wikimedia.org/T178399#4134161 (10mforns) p:05Triage>03Low [16:38:32] 10Analytics: Survey dashboard layout for dashiki - https://phabricator.wikimedia.org/T178399#4134163 (10mforns) p:05Low>03Triage [16:39:06] 10Analytics: Remove "bot" from metrics/pageviews/per-article - https://phabricator.wikimedia.org/T178448#4134166 (10mforns) p:05Triage>03Low [16:39:41] 10Analytics-Kanban: Compile a request data set for caching research and tuning - https://phabricator.wikimedia.org/T128132#4134168 (10Nuria) >Since I have some questions related to this task T128132, should I create a new task to post my question? Using analytics@ e-mail list would be fine. [16:44:29] 10Analytics, 10Analytics-Kanban: Index and store page preview agreggates on Druid so they are visible in pivot/superset - https://phabricator.wikimedia.org/T192305#4134215 (10Nuria) p:05Triage>03High [16:44:42] 10Analytics, 10Analytics-Kanban: Index and store page preview agreggates on Druid so they are visible in pivot/superset - https://phabricator.wikimedia.org/T192305#4134227 (10Nuria) a:03mforns [16:51:30] elukey: want to talk kafka for a few mins? [16:52:57] ottomata: sure [16:53:30] I am in bc [17:10:15] 10Analytics, 10Analytics-Kanban, 10Analytics-Wikistats, 10Google-Summer-of-Code (2018): GSoC Proposal 2018 : [Analytics] Improvements to Wikistats2 front-end - https://phabricator.wikimedia.org/T189964#4058903 (10Nuria) [17:10:35] 10Analytics, 10Analytics-Kanban, 10Analytics-Wikistats, 10Google-Summer-of-Code (2018): Proposal: [Analytics] Improvements to Wikistats2 front-end - https://phabricator.wikimedia.org/T190949#4088715 (10Nuria) [17:13:25] 10Analytics, 10New-Readers, 10Easy: Split opera mini in proxy or turbo mode - https://phabricator.wikimedia.org/T138505#2402369 (10Nuria) p:05Normal>03Low [17:16:32] 10Analytics, 10Analytics-Wikistats: Please update Tulu Language(tcy)in Wikipedia Statistics. - https://phabricator.wikimedia.org/T160630#4134279 (10Nuria) 05Open>03Resolved Please see avaiilable stats: https://stats.wikimedia.org/v2/#/tcy.wikipedia.org 35K pageviews per month, 30 editors and more Resol... [17:20:43] 10Analytics, 10Analytics-Wikistats, 10Wikimedia-Site-requests: Add li: Wikibooks to stats.wikimedia.org - https://phabricator.wikimedia.org/T165634#4134297 (10Nuria) 05Open>03Resolved a:03Nuria Wikistats2 has data now for li.wikibooks.org: https://stats.wikimedia.org/v2/#/li.wikibooks.org 4K pageview... [17:22:46] * elukey off! [17:57:24] 10Analytics, 10Analytics-Kanban: Gather all constants related to mobile/responsiveness in config - https://phabricator.wikimedia.org/T190339#4134378 (10mforns) a:05fdans>03Amitjoki [18:49:35] (03CR) 10Joal: "Comment inline." (033 comments) [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/423336 (https://phabricator.wikimedia.org/T191714) (owner: 10Jonas Kress (WMDE)) [19:04:12] 10Analytics, 10ChangeProp, 10EventBus, 10MediaWiki-JobQueue, and 3 others: Enable CP4JQ support for private wikis - https://phabricator.wikimedia.org/T191464#4134457 (10mobrovac) [19:04:18] 10Analytics, 10Analytics-Kanban, 10ChangeProp, 10EventBus, and 2 others: Disable MirrorMaker for job queue events - https://phabricator.wikimedia.org/T192005#4134454 (10mobrovac) 05Open>03Resolved Thank you, @Ottomata ! [19:14:01] 10Analytics, 10EventBus, 10MediaWiki-JobQueue, 10MediaWiki-Recent-changes, and 6 others: Wikidata doesn't update recentchanges - https://phabricator.wikimedia.org/T192198#4134484 (10mobrovac) [19:27:10] 10Analytics, 10Analytics-Kanban, 10EventBus, 10Patch-For-Review, 10Services (watching): Use --new.consumer for main codfw <-> eqiad Kafka MirrorMaker - https://phabricator.wikimedia.org/T190940#4088273 (10Ottomata) [19:32:03] 10Analytics, 10ChangeProp, 10EventBus, 10MediaWiki-JobQueue, and 4 others: Select candidate jobs for transferring to the new infrastucture - https://phabricator.wikimedia.org/T175210#4134513 (10mobrovac) [19:59:44] 10Analytics, 10Operations, 10Traffic, 10User-Elukey: Refactor kafka_config.rb and and kafka_cluster_name.rb in puppet to avoid explicit hiera calls - https://phabricator.wikimedia.org/T177927#4134556 (10Ottomata) Hm, I just thought about this a little bit, and I'm not so sure we should do it. The hiera in... [20:00:49] 10Analytics, 10Analytics-Kanban: Upgrade Kafka on main cluster with security features - https://phabricator.wikimedia.org/T167039#4134560 (10Ottomata) [20:02:19] 10Analytics, 10Analytics-Kanban: Upgrade Kafka on main cluster with security features - https://phabricator.wikimedia.org/T167039#3315367 (10Ottomata) [20:02:21] 10Analytics, 10EventBus, 10Services (blocked), 10User-Elukey: Investigate group.initial.rebalance.delay.ms Kafka setting - https://phabricator.wikimedia.org/T189618#4134564 (10Ottomata) [20:10:06] 10Analytics, 10Analytics-Cluster, 10Analytics-Kanban, 10Patch-For-Review: Migrate Mediawiki Monolog Kafka producer to Kafka Jumbo - https://phabricator.wikimedia.org/T188136#4134590 (10Ottomata) Ah @elukey I looked more into this and remembered that this might actually cause TCP issues after all. I think... [20:10:39] 10Analytics, 10Analytics-Cluster, 10Patch-For-Review: Port Kafka clients to new jumbo cluster - https://phabricator.wikimedia.org/T175461#4134591 (10Ottomata) [20:43:05] 10Analytics, 10Analytics-Kanban: Upgrade Kafka on main cluster with security features - https://phabricator.wikimedia.org/T167039#4134659 (10Ottomata) For reference, just tested in labs. MirrorMaker 0.9 works (but is flaky and buggy) with both 0.9 and 1.x brokers, but 1.x MirrorMaker will only work with 1.x b... [21:37:08] 10Analytics, 10ChangeProp, 10EventBus, 10MediaWiki-JobQueue, and 4 others: Select candidate jobs for transferring to the new infrastucture - https://phabricator.wikimedia.org/T175210#4134725 (10Pchelolo) [21:38:12] (03PS1) 10Framawiki: Add export to HTML [analytics/quarry/web] - 10https://gerrit.wikimedia.org/r/427020 (https://phabricator.wikimedia.org/T117644) [21:38:27] (03CR) 10jerkins-bot: [V: 04-1] Add export to HTML [analytics/quarry/web] - 10https://gerrit.wikimedia.org/r/427020 (https://phabricator.wikimedia.org/T117644) (owner: 10Framawiki) [21:38:48] 10Analytics, 10EventBus, 10MediaWiki-JobQueue, 10MediaWiki-Recent-changes, and 6 others: Wikidata doesn't update recentchanges - https://phabricator.wikimedia.org/T192198#4134732 (10mobrovac) 05Open>03Resolved a:03Pchelolo It took us a while to find the root cause of this. Essentially, the problem wa... [21:40:02] (03PS2) 10Framawiki: Add export to HTML [analytics/quarry/web] - 10https://gerrit.wikimedia.org/r/427020 (https://phabricator.wikimedia.org/T117644) [21:40:06] 10Analytics, 10EventBus, 10MediaWiki-JobQueue, 10MediaWiki-Recent-changes, and 6 others: Wikidata doesn't update recentchanges - https://phabricator.wikimedia.org/T192198#4134737 (10mobrovac) [21:40:12] 10Analytics, 10EventBus, 10MediaWiki-JobQueue, 10Goal, and 2 others: FY17/18 Q4 Program 8 Services Goal: Complete the JobQueue transition to EventBus - https://phabricator.wikimedia.org/T190327#4134736 (10mobrovac) [21:40:30] 10Quarry, 10Patch-For-Review: Add 'download in HTML format' option (Quarry) - https://phabricator.wikimedia.org/T117644#4134738 (10Framawiki) p:05Triage>03Low a:03Framawiki [21:40:44] (03CR) 10Zhuyifei1999: [C: 04-1] "compiled.js is pre-generated from a nunjucks file. I'll do that change later." [analytics/quarry/web] - 10https://gerrit.wikimedia.org/r/427020 (https://phabricator.wikimedia.org/T117644) (owner: 10Framawiki) [21:45:24] (03CR) 10Zhuyifei1999: [C: 04-1] "It's the two html files in templates/, if you want to do that." [analytics/quarry/web] - 10https://gerrit.wikimedia.org/r/427020 (https://phabricator.wikimedia.org/T117644) (owner: 10Framawiki) [21:49:17] 10Quarry: Show the execution time in the table of queries - https://phabricator.wikimedia.org/T71264#4134752 (10Framawiki) [21:49:21] 10Quarry: Include query execution time - https://phabricator.wikimedia.org/T126888#4134754 (10Framawiki) [21:53:00] 10Quarry: Show replication lag - https://phabricator.wikimedia.org/T60841#4134769 (10Framawiki) [21:55:22] 10Quarry: Provide a way to add hyperlink in Quarry results/output - https://phabricator.wikimedia.org/T74874#4134774 (10Framawiki) [21:56:44] 10Quarry: REPORTS-52 Add option to output report results to a wiki page - https://phabricator.wikimedia.org/T60833#4134782 (10Framawiki) [21:56:47] 10Quarry: Setup an easy way to have Quarry dump information / results on a wiki page - https://phabricator.wikimedia.org/T137179#4134785 (10Framawiki) [21:58:40] 10Quarry: REPORTS-49 IP block SQL Query on en-wiki - https://phabricator.wikimedia.org/T60835#4134787 (10Framawiki) [21:58:43] 10Quarry: Recurring queries - https://phabricator.wikimedia.org/T101835#4134790 (10Framawiki) [22:08:37] (03PS1) 10Framawiki: Verify user before updating meta query informations [analytics/quarry/web] - 10https://gerrit.wikimedia.org/r/427024 [22:10:39] (03PS1) 10Framawiki: Don't use "None" if the query has no title [analytics/quarry/web] - 10https://gerrit.wikimedia.org/r/427025 [22:18:04] (03CR) 10Zhuyifei1999: "Needs nunjucks to generate compiled.js" [analytics/quarry/web] - 10https://gerrit.wikimedia.org/r/427025 (owner: 10Framawiki) [22:22:40] (03CR) 10Zhuyifei1999: "Oh wait, never mind." [analytics/quarry/web] - 10https://gerrit.wikimedia.org/r/427025 (owner: 10Framawiki) [23:30:20] 10Analytics, 10Performance-Team (Radar): Possible statsv corruption? - https://phabricator.wikimedia.org/T189530#4134878 (10Krinkle) >>! In T189530#4133814, @Ottomata wrote: > Not sure I totally understand the problem. What is an example of a good metric name and a corrupted one here? You are saying that met...