[00:05:15] 10Analytics, 10Analytics-SWAP: Cannot instantiate multiprocessing.RLock - https://phabricator.wikimedia.org/T211163 (10EBernhardson) I can also note that PAWS is running the same version of jupyterhub as SWAP, that is 5.5.0. On PAWS `multiprocessing.RLock` works as expected. So this isn't specifically a jupyt... [00:19:31] 10Quarry, 10Patch-For-Review: Quarry should refuse to save results that are way too large - https://phabricator.wikimedia.org/T188564 (10zhuyifei1999) >>! In T188564#4798799, @Framawiki wrote: > What I had in mind: > 1. send the work request from **web server** to a **worker**. > 2. when the **worker** has res... [00:32:48] 10Analytics, 10ChangeProp, 10Operations, 10Services (designing), 10Wikimedia-Incident: Separate dev Change-Prop from production Kafka cluster - https://phabricator.wikimedia.org/T199427 (10Nuria) ping on this issue, has this work been planned? [00:43:27] 10Analytics, 10Analytics-SWAP: Cannot instantiate multiprocessing.RLock - https://phabricator.wikimedia.org/T211163 (10EBernhardson) I figured out how to get jupyterhub to inject strace in front of the kernel it runs, the appropriate part is: ` stat("/usr/lib/python3.5/multiprocessing", {st_mode=S_IFDIR|0755,... [00:44:06] 10Analytics, 10ChangeProp, 10Operations, 10Services (designing), 10Wikimedia-Incident: Separate dev Change-Prop from production Kafka cluster - https://phabricator.wikimedia.org/T199427 (10Pchelolo) p:05Normal>03Low We're currently not using CP in the dev cluster since we have no new major RESTBase f... [06:26:37] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Marostegui) From my chat with @Banyek the approach now is to create tables populated with the following queries... [06:47:49] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Marostegui) >>! In T210693#4798691, @Milimetric wrote: > And to follow up on my first bullet from before: > > 1... [07:43:26] logs on druid are nicely rotated now [07:43:28] yesss [07:43:33] going to roll restart also public [07:43:52] !log restart middlemanager/broker/historical on druid-public to pick up new log4j settings [07:43:53] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [08:12:51] all restarted and logs cleaned up [08:15:52] joal: if ok with you, I'll do a test backfilling of one day for per family uniques and then proceed to backfill all if successful [08:29:20] 10Analytics: "Edit" equivalent of pageviews daily available to use in Turnilo and Superset - https://phabricator.wikimedia.org/T211173 (10Reedy) [08:57:09] 10Analytics, 10Analytics-Kanban: Upgrade Matomo to 3.6.1 or 3.7.0 - https://phabricator.wikimedia.org/T209808 (10elukey) ` *** Update *** Database Upgrade Required Your Matomo database is out-of-date, and must be upgraded before you can continue. Matomo database will be upgraded from version... [09:13:05] !log matomo read only + upgrade to matomo 3.7.0 on matomo1001 [09:13:06] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [09:20:21] piwik upgraded to 3.7.0 [09:20:25] seems also very fast [09:22:42] 10Analytics, 10Analytics-Kanban: Upgrade Matomo to 3.6.1 or 3.7.0 - https://phabricator.wikimedia.org/T209808 (10elukey) Piwik/Matomo upgraded, but while testing the users I noticed that the `piwik` user outlined in https://wikitech.wikimedia.org/wiki/Analytics/Systems/Piwik#Access seems having a different pas... [09:22:50] 10Analytics, 10Analytics-Kanban: Upgrade Matomo to 3.6.1 or 3.7.0 - https://phabricator.wikimedia.org/T209808 (10elukey) [09:23:49] o/ joal :D [09:24:19] sorry, got totally bogged down by stuff yesterday, and only really have 35 mins free this morning before it all starts again# [09:24:48] 10Analytics: ReadingDepth schema is whitelisting both session ids and page ids - https://phabricator.wikimedia.org/T209051 (10Tbayer) For the record: decided with @ovasileva to remove the session IDs and keep the page IDs. I'll see to submit the patch soon. [09:32:28] 10Analytics, 10Product-Analytics: Metrics request on portal namespace usage - https://phabricator.wikimedia.org/T205681 (10Tbayer) >>! In T205681#4773575, @AfroThundr3007730 wrote: > @Tbayer Just curious if the analytics team had time to pull any data for this yet. Thanks for the ping! I spent some time workin... [09:34:42] (03PS1) 10Elukey: Upgrade to nodejs-10 [analytics/turnilo/deploy] - 10https://gerrit.wikimedia.org/r/477735 [09:35:16] fdans: --^ [09:35:58] from a labs instance I used https://deb.nodesource.com/setup_10.x, then installed nodejs/npm, and did as the README suggests (rm node_modules, npm install) [09:37:35] right [09:38:03] if everything looks ok I'd just merge, deploy to the labs instance, test and then go to prod [09:38:22] (we have nodejs 10 packages build by Moritz ready to go) [09:44:19] (03CR) 10Fdans: [C: 031] "Don't see anything weird, lgtm!" [analytics/turnilo/deploy] - 10https://gerrit.wikimedia.org/r/477735 (owner: 10Elukey) [09:44:51] thanks! [09:45:05] (03CR) 10Elukey: [V: 032 C: 032] Upgrade to nodejs-10 [analytics/turnilo/deploy] - 10https://gerrit.wikimedia.org/r/477735 (owner: 10Elukey) [09:45:24] * elukey tests deployment-server.analytics.eqiad.wmflabs [10:00:18] deployment done [10:00:26] (in labs) [10:02:59] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review, 10User-Elukey: Move turnilo to nodejs 10 - https://phabricator.wikimedia.org/T210705 (10elukey) >>! In T210705#4797552, @elukey wrote: > Before doing this, we need to probably run npm install for turnilo with the nodejs10... Just realized it Followed t... [10:03:11] all good from my side, I'll ask to the team before proceeding [10:03:38] !log backfilling test for unique project families - start_time=2016-01-01T00:00Z stop_time=2016-02-01T00:00Z [10:03:39] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [10:04:53] fdans: are you interested in https://phabricator.wikimedia.org/T210706 ? [10:05:05] (we can work together if you have time during the next days) [10:05:38] elukey: that sounds great! [10:05:42] \o/ [10:25:23] * fdans the new Hue UI is such an awful mess [10:27:36] fdans: there is a trick to revert to hue 3 [11:34:15] !log backfill test successful. Starting job to backfill family uniques since mar 2017 [11:34:16] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [11:39:54] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Banyek) For testing I created the view as `comment_view_temp` with the query @Bstorm wrote in T210693#4798638 an... [11:43:50] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Marostegui) On which host and on which database? [11:45:06] * elukey lunch! [11:46:10] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Marostegui) Ah, I see, labsdb1010 but why is the view `comment_view_temp` on `enwiki` and not on `enwiki_p` wher... [11:52:25] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Banyek) No, there is no any real reson, but as this is a test only I wanted to keep this 'clean' and enwiki feel... [11:58:23] !log backfilling in progress, killing uniques coordinators within bundle, will restart bundle on Jan 1st [11:58:24] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [12:01:07] a-team we got some per family uniques on AQS! \o/ https://wikimedia.org/api/rest_v1/metrics/unique-devices/all-wikipedia-projects/all-sites/monthly/2017040100/2017050100 [12:19:45] fdans: I guess it's too late for me to suggest backfilling only per-familly and not the whole datawet :) [12:34:38] hmmm joal the monthly part is complete, should i kill the ongoing daily and restart with a modified hql to exclude per project? [12:35:30] fdans: as you wish - It's not a big data-volume, so it;s not very important, but it would have been good to update the HQL to remove the unneded part of the union :) [12:51:37] Hi addshore - I'm sorry wednesday is kids day, so I'm there only during siesta timeor later in the day [12:52:51] addshore: If you have a minute you can help me precise the question: You want to count all items having a given text equals/contained in description or label or alias, that's it? [12:53:20] addshore: And actually, you probably want their count to be unique (if the term appears in multiple places, only count the item once)? [13:21:08] 10Analytics, 10Analytics-Kanban, 10DBA, 10User-Banyek: Migrate dbstore1002 to a multi instance setup on dbstore100[3-5] - https://phabricator.wikimedia.org/T210478 (10Banyek) What I am not sure is what to do with the 'extra' databases (I see there are no previous clusters having them, but if we move them t... [13:24:01] 10Analytics, 10Analytics-Kanban, 10DBA, 10User-Banyek: Migrate dbstore1002 to a multi instance setup on dbstore100[3-5] - https://phabricator.wikimedia.org/T210478 (10Banyek) About the `staging` db: Analytics team uses it, but I am not sure if they need all the data in it (is there any data or just tables... [13:42:15] 10Analytics, 10Analytics-Kanban, 10DBA, 10User-Banyek: Migrate dbstore1002 to a multi instance setup on dbstore100[3-5] - https://phabricator.wikimedia.org/T210478 (10Banyek) maybe I can summon here @Milimetric about the `staging` db? [13:43:03] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Marostegui) Thanks for the clarification - I just wanted to know if there was some specific reason for it that I... [13:44:19] hi joal ! [13:44:49] so, not counting the items, but counting the unique strings used in labels descriptions and aliases [13:45:29] so, if I have an item with an en label of "Foo" and another item with a description "Foo", then the result would be ( 'Foo' => 2 ) [13:49:10] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Banyek) >>! In T210693#4800137, @Marostegui wrote: > Thanks for the clarification - I just wanted to know if the... [13:58:21] 10Analytics, 10Analytics-Kanban, 10DBA, 10User-Banyek: Migrate dbstore1002 to a multi instance setup on dbstore100[3-5] - https://phabricator.wikimedia.org/T210478 (10elukey) This is a very good point, I'll bring it up to my team's standup today and I'll let you know. It has been used, as far as I know, fo... [13:59:24] 10Analytics, 10Analytics-Kanban, 10DBA, 10User-Banyek: Migrate dbstore1002 to a multi instance setup on dbstore100[3-5] - https://phabricator.wikimedia.org/T210478 (10Banyek) >>! In T210478#4800170, @elukey wrote: > This is a very good point, I'll bring it up to my team's standup today and I'll let you kno... [14:23:32] ok new hadoop worker nodes ready to be deployed in https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/477779/1/manifests/site.pp [14:23:41] waiting for Andrew's +1 before proceeding [14:26:01] 10Analytics, 10Analytics-Kanban, 10Patch-For-Review, 10User-Elukey: Decommission old Hadoop worker nodes and add newer ones - https://phabricator.wikimedia.org/T209929 (10elukey) The plan is: 1) Add new nodes to rack awareness config and site.pp, and leave the cluster re-balance for some days. 2) In the m... [14:44:48] o/ [14:45:20] ottomata: o/ [14:48:04] ottomata: when you are ready I'd add moar hadoop nodes [14:48:28] anytime! [14:48:32] emails: checked [14:48:41] new computer: go [14:48:43] loud keyboard: go [14:48:46] new hadoop nodes: go [14:49:55] \o/ [14:49:58] so just to recap [14:50:09] I merged the change for rack awareness [14:50:44] but I don't recall if we need or not to restart the name nodes [14:50:45] I guess yes [14:50:55] hmmmm [14:50:58] i also don't recall [14:51:52] according to http://community.cloudera.com/t5/Cloudera-Manager-Installation/Changing-rack-awareness-in-a-running-Hadoop-cluster-in/td-p/56451 [14:51:58] yes need restart [14:52:49] all right doing it [14:52:53] possibly RM needs restart too? [14:52:59] not sure [14:53:13] let's do it just in case [14:53:22] it is not a huge deal [14:53:28] atye [14:53:28] aye [14:53:45] !log restart hdfs namenodes and yarn rm to update rack awareness config (prep for new nodes) [14:53:46] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [15:05:04] ok done! [15:05:12] great! restarting compy real quick brb [15:12:22] ottomata: I've disabled puppet on all an-worker, shall I proceed with the merge + puppet run on one? [15:13:02] proceed ya! [15:14:36] ack! [15:24:23] one thing that I'd need to investigate is the refresh to the apt-get update refresh [15:24:33] because in my opinion it doesn't work [15:24:51] each time I need to do a manual apt-get update to make everything work [15:26:04] an-worker1078 up [15:26:21] hm [15:27:10] in the puppet log I can see refresh exec apt-get update tc.. [15:28:21] in class apt we have [15:28:23] exec { 'apt-get update': [15:28:23] path => '/usr/bin', [15:28:23] timeout => 240, [15:28:23] returns => [ 0, 100 ], [15:28:25] refreshonly => true, [15:28:27] } [15:28:56] and I think that this one is refreshed each time, but I am wondering what a refresh does to a exec [15:30:29] and in a lot of places in puppet we do notify => Exec['apt-get update'] [15:30:40] (I mean generally, not only cdh) [15:30:53] notify is like a subscribe, but backwards [15:31:00] so the dep might go out of order there [15:31:02] its like [15:31:16] apt-get update subscribe => our stuff [15:31:34] don't we have some require for apt class in some of our classes? [15:31:37] that should make it happen first? [15:32:11] so afaik we have [15:32:11] apt::repository { 'thirdparty-cloudera': [15:32:12] uri => 'http://apt.wikimedia.org/wikimedia', [15:32:12] dist => "${::lsbdistcodename}-wikimedia", [15:32:12] components => 'thirdparty/cloudera', [15:32:14] } [15:32:18] in profile::cdh::apt [15:33:13] and e [15:33:22] apt::repository does [15:33:27] file { "/etc/apt/sources.list.d/${name}.list": [15:33:27] ensure => $ensure, [15:33:27] owner => 'root', [15:33:27] group => 'root', [15:33:27] mode => '0444', [15:33:30] content => "${binline}${srcline}", [15:33:32] notify => Exec['apt-get update'], [15:33:35] } [15:33:52] and do we have something like [15:34:07] class cdh require => Apt::Repository[thirdparty-cloudera] ? [15:35:41] so profile::hadoop::common requires profile::cdh::apt [15:36:08] and we have in there [15:36:08] Class['::profile::cdh::apt'] -> Exec['apt-get update'] -> Class['::cdh::hadoop'] [15:39:03] hm that seems ok [15:39:12] i think [15:45:56] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Bstorm) > Basically, we are importing actor and comment from production replicas, and these include comment text... [15:46:40] 10Analytics, 10Analytics-Kanban, 10Operations, 10ops-eqiad, 10User-Elukey: rack/setup/install cloudvirtan100[1-5].eqiad.wmnet - https://phabricator.wikimedia.org/T207194 (10Cmjohnson) a:05Cmjohnson>03RobH @RobH all have been cabled and switch port updated minus the vlan. Can you please update vlan... [15:46:58] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Bstorm) That's relying on a very particular special case, but I think it is actually true. 😁 [15:47:41] (03PS1) 10GoranSMilovanovic: pre-processing in production [analytics/wmde/TW/AdvancedSearchExtension-Dashboard] - 10https://gerrit.wikimedia.org/r/477797 [15:48:07] (03CR) 10GoranSMilovanovic: [V: 032 C: 032] pre-processing in production [analytics/wmde/TW/AdvancedSearchExtension-Dashboard] - 10https://gerrit.wikimedia.org/r/477797 (owner: 10GoranSMilovanovic) [15:50:07] ottomata: print topology [15:50:08] Rack: /eqiad/A/2 10.64.5.27:50010 (an-worker1078.eqiad.wmnet) [15:50:11] goooood [15:51:06] nice [15:51:51] going to enable other ones [15:53:01] mmm controlling metrics before [15:54:01] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Marostegui) We don't sanitize anything on `actor` or `comment` on a triggers level on sanitarium if that is what... [15:56:01] I need to fix grafana but metrics are good [15:59:17] fixed :) [15:59:56] coool [15:59:57] nice [16:20:03] teammm hi! does anyone want me to say sth specific in scrum of scrums? [16:20:07] PROBLEM - Disk space on Hadoop worker on an-worker1080 is CRITICAL: NRPE: Command check_disk_space_hadoop_worker not defined [16:20:11] a-team ^ [16:20:37] mforns: not I! [16:21:07] RECOVERY - Disk space on Hadoop worker on an-worker1080 is OK: DISK OK [16:22:08] mforns: new nodes :) [16:22:37] elukey, sure for Hadoop cluster or for dbstore replacement? [16:22:42] ah snap sorry I thought you were pointing me to the alarm [16:22:43] :P [16:22:48] oh! ok [16:22:51] nono nothing :D [16:22:52] elukey: we doing ops sync in 8 mins ya? [16:22:53] ok ok [16:23:00] a-team i will miss standup today, there is a better use of data meeting [16:23:26] a better meeting than stand-up? hmmm :P [16:23:30] ottomata: still in the meeting with cloud and persistence team for our usage of cloud db replicas [16:23:37] if you want we can skip [16:24:20] elukey: whenever you like! i'm all in not much ops world these days, so I don't have many updates [16:24:28] but if you want to talk about anything we can do whenever [16:25:03] ottomata: all right let's skip it, nothing to say as well for this round.. if you are ok I'd put in prod now only 3/4 new workers (leaving puppet disabled on the rest) [16:25:13] let them boil for a night (eu night) and then finish tomorrow [16:28:13] proceed! [16:30:50] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Milimetric) Following up on a good problem that @Bstorm raised with my approach. I would love @Anomie to take a... [16:31:26] Oh a-team also! I forgot to mention that i'm using some volunteer time at school in queens tomorrow [16:31:34] gonna talk to kids about programming or something [16:31:38] nice! [16:31:40] uooooo, cool [16:31:43] i'll miss standup but should be on in the afternoon [16:31:50] ACK and then i'm off on friday [16:31:53] SOoSOOOOOoO i guess TTY monday :/ [16:32:48] o/ [16:47:53] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Milimetric) Following up on a good problem that @Bstorm raised with my approach. I would love @Anomie to take a... [16:55:59] 10Analytics-EventLogging, 10Analytics-Kanban, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 3 others: Modern Event Platform (TEC2) - https://phabricator.wikimedia.org/T185233 (10Ottomata) [16:57:15] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Anomie) >>! In T210693#4800694, @Milimetric wrote: > According to the `actor` view definition: https://gerrit.wi... [17:01:03] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Milimetric) >>! In T210693#4800781, @Anomie wrote: >>>! In T210693#4800694, @Milimetric wrote: >> According to t... [17:01:40] ping ottomata [17:01:47] nuria got BUOD meeting [17:07:56] Enjoy school time ottomata :) I like these moments: ) [17:14:12] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Anomie) You said "any reference is sanitized → not present". The contrapositive of that would be "present → not... [17:14:37] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Bstorm) @Anomie I think you actually answered the question in that. As it currently stands, the logic there alr... [17:16:52] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Bstorm) To be clear, I'm deliberately trying to find problems with this because it sounds correct to me. I want... [17:24:06] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Anomie) >>! In T210693#4800871, @Bstorm wrote: > @Milimetric would like to try using the sanitized views of the... [17:31:26] nuria: MEP sync? [17:31:42] ottomata: 2 mins [17:34:15] nuria: the minute that you stopped talking about kafka icinga alerted about kafka1013 down [17:34:18] ahahah [17:43:18] 10Analytics, 10Tool-Pageviews: Statistics for views of individual Wikimedia images - https://phabricator.wikimedia.org/T210313 (10jmatazzoni) [17:44:52] ah a-team tomorrow's a bank holiday in Spain so I won't be here! see yall fri [17:45:03] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Milimetric) Thanks very much @Anomie, I understand my misunderstanding, and your third answer is what I was asking. [17:46:40] 10Analytics, 10Tool-Pageviews: Statistics for views of individual Wikimedia images - https://phabricator.wikimedia.org/T210313 (10jmatazzoni) ! In T210313#4787451, @Milimetric wrote: > To answer both of these questions, all files served on all wikis are first uploaded to upload.wikimedia.org. That's what's c... [17:48:06] * elukey off! [18:09:37] taking off for lunch, will be back later but ping me if you need me [18:22:20] 10Analytics-EventLogging, 10Analytics-Kanban, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 3 others: Modern Event Platform (TEC2) - https://phabricator.wikimedia.org/T185233 (10Ottomata) [18:22:26] 10Analytics-EventLogging, 10Analytics-Kanban, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 3 others: Modern Event Platform (TEC2) - https://phabricator.wikimedia.org/T185233 (10Ottomata) [18:22:42] 10Analytics-EventLogging, 10Analytics-Kanban, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 3 others: Modern Event Platform (TEC2) - https://phabricator.wikimedia.org/T185233 (10Ottomata) [18:23:11] 10Analytics, 10Analytics-Kanban, 10Operations, 10User-Elukey: rack/setup/install cloudvirtan100[1-5].eqiad.wmnet - https://phabricator.wikimedia.org/T207194 (10RobH) a:05RobH>03Ottomata >>! In T207194#4800497, @Cmjohnson wrote: > @RobH > > all have been cabled and switch port updated minus the vlan.... [18:27:04] 10Analytics: "Edit" equivalent of pageviews daily available to use in Turnilo and Superset - https://phabricator.wikimedia.org/T211173 (10Nuria) Importing the mediawiki_history into turnilo I think should be possible, leaving up to @Neil_P._Quinn_WMF to decide whether this is the best format to answer this ques... [18:36:44] 10Analytics, 10Analytics-Kanban, 10Operations, 10User-Elukey: rack/setup/install cloudvirtan100[1-5].eqiad.wmnet - https://phabricator.wikimedia.org/T207194 (10RobH) [18:39:12] Hello A-team! There is no wikidata description edit record from either of the app (ios or android) after Nov 15 on MariaDB, do you know what may cause the issue? [18:39:32] Query on wikidatawiki https://www.irccloud.com/pastebin/UmNOpamw/ [18:42:20] chelsyx: you are querying the Db replicas right? [18:42:34] yep [18:43:17] chelsyx: if so the question might be a better one for platform team. That data is sourced directly from mediawiki prod databases and at this time there are three refactors going on: tags/comments and actor tables [18:44:52] chelsyx: revision comments just moved out [18:46:38] nuria: yes, but would that affect the tag too? [18:46:56] chelsyx: i do not know much about tag refactor but both are ongoing [18:47:21] nuria: do you know who from the platform team should I ping ? [18:48:19] chelsyx: I think all chnages by now will be documented by them somewhere as they have happen some time back, see: https://www.mediawiki.org/wiki/Manual:Revision_comment_temp_table [18:49:07] chelsyx: https://www.mediawiki.org/wiki/Manual:Tag_summary_table [18:51:26] chelsyx: also, this is hard to read but *I think* this is how platform team documents changes: https://www.mediawiki.org/wiki/Manual:Database_layout [18:52:58] nuria: They deprecate the tag_summary table last month, and intend to replacing it with the change_tag table, which is the one I'm using https://lists.wikimedia.org/pipermail/wikitech-l/2018-November/091170.html [18:54:12] chelsyx: and the field you expect to see full that is empty is on the change_tag table or the revision table? [18:58:00] nuria: the change_tag table [18:58:28] chelsyx: i see, sorry but i know little of that refactor Amir1 (in DE) I think is driving it [19:01:21] nuria: thanks [19:50:29] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Banyek) The 'materialized view' for comments is completed. I moved it into `enwiki_p` with name `comment_mat`. I... [19:55:18] 10Analytics, 10Anti-Harassment (AHT Sprint 35): 👩‍👧 Track how often blocked user attempt to edit - https://phabricator.wikimedia.org/T189724 (10TBolliger) [19:58:04] 10Analytics, 10Anti-Harassment (AHT Sprint 35): Set wgEnableBlockNoticeStats to true on 19 more wikis - https://phabricator.wikimedia.org/T211234 (10TBolliger) p:05Triage>03Low [20:00:57] 10Analytics, 10Anti-Harassment (AHT Sprint 35): 👩‍👧 Track how often blocked user attempt to edit - https://phabricator.wikimedia.org/T189724 (10dmaza) [20:01:21] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Banyek) The actor view was empty, and the empty `actor_mat` too. [20:02:59] 10Analytics, 10Anti-Harassment (AHT Sprint 35): 👩‍👧 Track how often blocked user attempt to edit - https://phabricator.wikimedia.org/T189724 (10TBolliger) [20:03:09] 10Analytics, 10Anti-Harassment (AHT Sprint 35): 👩‍👧 Track how often blocked user attempt to edit - https://phabricator.wikimedia.org/T189724 (10TBolliger) [20:07:23] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Anomie) If you want to fake up an `actor_mat` for a size estimate, I think something like this would do it. Obvi... [20:09:30] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Banyek) >>! In T210693#4801411, @Anomie wrote: > If you want to fake up an `actor_mat` for a size estimate, I th... [20:09:46] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Banyek) also we need to create indices for comment_mat [20:15:19] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Marostegui) I am not sure it makes sense to continue that approach. We have more than 900 wikis, even if it take... [20:15:32] 10Analytics, 10Analytics-SWAP, 10Patch-For-Review: Cannot instantiate multiprocessing.RLock - https://phabricator.wikimedia.org/T211163 (10EBernhardson) 05Open>03Resolved a:03EBernhardson Solved! I had to fully shutdown my jupyterhub-singleuser and restart it from the hub. After that multiprocessing lo... [21:33:42] 10Analytics, 10Analytics-Kanban, 10Operations, 10netops: Figure out networking details for new cloud-analytics-eqiad Hadoop/Presto cluster - https://phabricator.wikimedia.org/T207321 (10Andrew) I've created a proof-of-concept VM, hadoop-worker-01.cloud-analytics.eqiad.wmflabs. Please check that out and co... [21:36:50] 10Analytics, 10Analytics-Kanban, 10Operations, 10Patch-For-Review, 10User-Elukey: rack/setup/install cloudvirtan100[1-5].eqiad.wmnet - https://phabricator.wikimedia.org/T207194 (10Andrew) 05Open>03Resolved [21:53:30] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team Backlog (Watching / External), 10Services (watching): Modern Event Platform: Stream Intake Service: Implementation: Deployment Pipeline - https://phabricator.wikimedia.org/T211247 (10Ottomata) p:05Triage>03Normal [21:56:26] 10Analytics, 10Analytics-Cluster, 10Discovery, 10Patch-For-Review: Modern Event Platform: Stream Intake Service: Migrate Mediawiki monolog Kafka uses to EventGate - https://phabricator.wikimedia.org/T188136 (10Ottomata) [21:56:56] 10Analytics-EventLogging, 10Analytics-Kanban, 10EventBus, 10Services (watching): Modern Event Platform: Stream Intake Service - https://phabricator.wikimedia.org/T201068 (10Ottomata) [21:57:00] 10Analytics, 10Analytics-Cluster, 10Discovery, 10Patch-For-Review: Modern Event Platform: Stream Intake Service: Migrate Mediawiki monolog Kafka uses to EventGate - https://phabricator.wikimedia.org/T188136 (10Ottomata) [21:59:01] 10Analytics-EventLogging, 10Analytics-Kanban, 10EventBus, 10Services (watching): Modern Event Platform: Stream Intake Service: Migrate Mediawiki Eventbus events to EventGate - https://phabricator.wikimedia.org/T211248 (10Ottomata) p:05Triage>03Normal [22:07:40] 10Analytics, 10Wikimedia-Stream, 10Patch-For-Review: Create /v2/schema/:schema_uri endpoint for eventstreams that proxies schemas from eventbus - https://phabricator.wikimedia.org/T160748 (10Ottomata) [22:07:43] 10Analytics, 10EventBus, 10Wikimedia-Stream, 10Services (designing), 10User-mobrovac: Puppetize event schema topic configuration - https://phabricator.wikimedia.org/T161027 (10Ottomata) 05Open>03declined We'll be doing this differently in Modern Event Platform. [22:09:45] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 2 others: CI Support for Schema Registry - https://phabricator.wikimedia.org/T206814 (10Ottomata) a:05Ottomata>03None [22:10:02] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 2 others: Git Commit hook that adds a whole new file when a new version of schema is committed - https://phabricator.wikimedia.org/T206812 (10Ottomata) a:05Ottomata>03None [22:10:09] 10Analytics-EventLogging, 10Analytics-Kanban, 10EventBus, 10Services (watching): Modern Event Platform: Stream Intake Service: Migrate Mediawiki Eventbus events to EventGate - https://phabricator.wikimedia.org/T211248 (10Ottomata) a:05Ottomata>03None [22:10:20] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Security-Reviews, and 3 others: T206785: Modern Event Platform: Stream Intake Service: AJV usage security review - https://phabricator.wikimedia.org/T208251 (10Ottomata) a:05Ottomata>03None [22:10:29] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 2 others: Develop a library for JSON schema backwards incompatibility detection - https://phabricator.wikimedia.org/T206889 (10Ottomata) a:05Ottomata>03None [22:11:19] 10Analytics, 10Analytics-Cluster, 10Discovery, 10Patch-For-Review: Modern Event Platform: Stream Intake Service: Migrate Mediawiki monolog Kafka uses to EventGate - https://phabricator.wikimedia.org/T188136 (10Ottomata) a:05EBernhardson>03None [22:16:41] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 2 others: Git Commit hook that adds a whole new file when a new version of schema is committed - https://phabricator.wikimedia.org/T206812 (10Nuria) [22:17:06] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 2 others: CI Support for Schema Registry - https://phabricator.wikimedia.org/T206814 (10Nuria) [22:17:28] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team Backlog (Watching / External), 10Services (watching): Modern Event Platform: Stream Intake Service: Implementation: Deployment Pipeline - https://phabricator.wikimedia.org/T211247 (10Nuria) [22:17:53] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Anomie) Defining all the triggers might be a pain, but https://dev.mysql.com/doc/refman/5.5/en/replication-featu... [22:18:02] 10Analytics-EventLogging, 10Analytics-Kanban, 10EventBus, 10Services (watching): Modern Event Platform: Stream Intake Service: Migrate Mediawiki Eventbus events to EventGate - https://phabricator.wikimedia.org/T211248 (10Nuria) a:03Pchelolo [22:19:37] 10Analytics-EventLogging, 10Analytics-Kanban, 10EventBus, 10Services (watching): Modern Event Platform: Stream Intake Service: Migrate Mediawiki Eventbus events to EventGate - https://phabricator.wikimedia.org/T211248 (10Nuria) [22:20:06] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Security-Reviews, and 3 others: T206785: Modern Event Platform: Stream Intake Service: AJV usage security review - https://phabricator.wikimedia.org/T208251 (10Nuria) [22:20:54] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Security-Reviews, and 3 others: T206785: Modern Event Platform: Stream Intake Service: AJV usage security review - https://phabricator.wikimedia.org/T208251 (10Nuria) Putting this on security's radar, @chasemp Please let us know the best way to drive thi... [22:21:12] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 2 others: Develop a library for JSON schema backwards incompatibility detection - https://phabricator.wikimedia.org/T206889 (10Nuria) [22:23:29] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10EventBus, and 2 others: Prototype in node intake service - https://phabricator.wikimedia.org/T206815 (10Nuria) Pending items to close this ticket is to have a codewalkthrogh. Note that repo is now under wikimedia github: https://github.com/wikime... [22:25:23] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 2 others: Develop a library for JSON schema backwards incompatibility detection - https://phabricator.wikimedia.org/T206889 (10Nuria) Ping @ottomata @Pchelolo is this work we are iaming to do for ne... [22:25:39] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 2 others: Git Commit hook that adds a whole new file when a new version of schema is committed - https://phabricator.wikimedia.org/T206812 (10Nuria) Ping @ottomata @Pchelolo is this work we are iami... [22:27:40] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 2 others: Git Commit hook that adds a whole new file when a new version of schema is committed - https://phabricator.wikimedia.org/T206812 (10Ottomata) This work can happen at anytime, I think the so... [22:28:06] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 2 others: Git Commit hook that adds a whole new file when a new version of schema is committed - https://phabricator.wikimedia.org/T206812 (10Ottomata) We need this stuff settled before we can ask an... [22:29:09] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 2 others: Develop a library for JSON schema backwards incompatibility detection - https://phabricator.wikimedia.org/T206889 (10Pchelolo) >>! In T206889#4801877, @Nuria wrote: > Ping @ottomata @Pchelo... [22:33:37] 10Analytics, 10Analytics-EventLogging, 10EventBus, 10Core Platform Team (Modern Event Platform (TEC2)), and 2 others: Develop a library for JSON schema backwards incompatibility detection - https://phabricator.wikimedia.org/T206889 (10Nuria) Sounds good, this item seems somewhat connected to this one : htt... [22:36:32] 10Analytics, 10Analytics-Kanban, 10DBA, 10Data-Services, and 3 others: Create materialized views on Wiki Replica hosts for better query performance - https://phabricator.wikimedia.org/T210693 (10Milimetric) On the issue of storage, the average per wiki would definitely not come out to 5GB. If you plot all...