[01:13:54] Analytics, ChangeProp, EventBus, MediaWiki-JobQueue, and 4 others: Investigate use-cases for delayed job executions - https://phabricator.wikimedia.org/T172832#3540031 (Mattflaschen-WMF) There are three considerations relevant to Echo: 1. Delayed notifications ({T156808}) 1a. Article reminder not...
[07:05:26] moooorning!
[07:23:29] (CR) Nikerabbit: "Is this still WIP or ready for review?" [analytics/limn-language-data] - https://gerrit.wikimedia.org/r/362877 (https://phabricator.wikimedia.org/T158835) (owner: Amire80)
[08:01:04] Analytics-Kanban, Operations, ops-eqiad: Degraded RAID on analytics1055 - https://phabricator.wikimedia.org/T172809#3540324 (Volans) p:Triage>Normal
[08:18:29] (PS4) Joal: Improve resiliency of Banner streaming job [analytics/refinery/source] - https://gerrit.wikimedia.org/r/359461 (https://phabricator.wikimedia.org/T169101)
[08:18:31] (PS1) Joal: Add tranquility to the banner streaming job [analytics/refinery/source] - https://gerrit.wikimedia.org/r/373030
[08:22:31] (CR) jerkins-bot: [V: -1] Add tranquility to the banner streaming job [analytics/refinery/source] - https://gerrit.wikimedia.org/r/373030 (owner: Joal)
[08:27:16] !log Rerun druid loading jobs after night failures
[08:27:17] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[08:37:46] (PS2) Joal: Add tranquility to the banner streaming job [analytics/refinery/source] - https://gerrit.wikimedia.org/r/373030
[09:39:17] Analytics-Kanban: Troubleshoot Wikimetrics "magic button" - https://phabricator.wikimedia.org/T173585#3540509 (Marostegui) >>! In T173585#3539637, @mforns wrote: > @Marostegui > > Hi! I'm looking into this problem and it looks very much like the one Dan was fixing a couple days ago in: > https://phabricato...
[10:06:40] Analytics-Kanban: Troubleshoot Wikimetrics "magic button" - https://phabricator.wikimedia.org/T173585#3540595 (mforns) Thanks @Marostegui !
[10:22:31] moin moin
[10:23:51] Hi addshore
[10:26:11] Analytics, Contributors-Analysis, DBA, Chinese-Sites, Patch-For-Review: Data Lake edit data missing for many wikis - https://phabricator.wikimedia.org/T165233#3540619 (Marostegui) Hi, Is there anything pending here?
[10:28:31] Analytics, Contributors-Analysis, DBA, Chinese-Sites, Patch-For-Review: Data Lake edit data missing for many wikis - https://phabricator.wikimedia.org/T165233#3540631 (JAllemandou) @Marostegui : We're waiting for august run to happen (at the first days of september) before closing, but we exp...
[10:30:22] Analytics, Contributors-Analysis, DBA, Chinese-Sites, Patch-For-Review: Data Lake edit data missing for many wikis - https://phabricator.wikimedia.org/T165233#3540632 (Marostegui) Excellent! Thanks for the update!
[10:56:30] Analytics, DBA, Research, cloud-services-team (Kanban): Implement technical details and process for "datasets_p" on wikireplica hosts - https://phabricator.wikimedia.org/T173511#3540700 (jcrespo) One thought- after setting up the new labsdbs, we said we were going to consider the #DBA part of the...
[11:24:19] nuria_: I have a question related to privacy for when you are here! Vaguely linked to analytics stuff
[11:25:02] addshore: nuria_ is on holiday for 10 days - ottomata is the one for now :)
[11:25:29] ack! well, I'll just ask my question in here and see who comes up with answers :D
[11:28:34] So this relates to the Campaigns mediawiki extension. Currently when a user lands on the registration page with campaign=foo in the url and registers, this gets recorded in an event logging table.
[11:28:34] When a user hits a regular wikipage with campaign=foo and then navigates to the registration page there is no tracking done.
[11:28:34] Is there any reason that I shouldn't go and write the code that allows landing on pages other than the registration page and then tracking once the user gets there?
[11:28:34] basically this would be, when a user lands on a wikipage with a campaign in the URL param a session cookie would be set, and the registration page would then look for this as well as the url param?! :)
[11:31:31] addshore: if I understand correctly, this would mean full tracking of our users - any user get a session cookie when coming to us - which I think we don't want
[11:32:02] "any user get a session cookie when coming to us", well, only when they have a campaign=XXX set
[11:32:27] Arh, forgetting that campaign point sorry
[11:32:36] hm ... I don't know !
[11:32:48] also, I guess this will only apply for anon users, as there is no need to do it for logged in users
[12:13:52] taking a b
[12:13:54] reak
[12:42:15] addshore, do you know which EventLogging schema is used to store the campaigns data?
[12:44:16] I assume the related schema is correctly white-listed to purge/sanitize it after 90 days to avoid storing privacy sensitive information
[12:45:02] maybe the reason it is not logging events in the normal wiki pages, is that it would be too much traffic?
[12:45:48] EventLogging currently supports an average flow of 10 events per second per schema (with some exceptions)
[13:14:40] (CR) Ottomata: [C: 1] Improve resiliency of Banner streaming job [analytics/refinery/source] - https://gerrit.wikimedia.org/r/359461 (https://phabricator.wikimedia.org/T169101) (owner: Joal)
[13:16:43] (CR) Ottomata: "Cool! Did you get this to work?!" (1 comment) [analytics/refinery/source] - https://gerrit.wikimedia.org/r/373030 (owner: Joal)
[13:39:43] mforns: yes!
[13:40:31] mforns: the schema is https://meta.wikimedia.org/wiki/Schema:AccountCreation
[13:41:23] And it won't be logging events on normal wiki pages, still only on account creation. But if the user hits http://en.wikipedia.org/wiki/LandingPage?campaign=foo and then navigates to the create account page and then creates an account that would be recorded in event logging, where right now it is not
[13:42:35] I wouldn't expect the number of events to increase really, I mean, these campaigns are only used in select locations / on select occasions. For example WMDE's use case is for banners which go to a landing page, and then to record how many people register after hitting the landing page.
[13:42:54] addshore, understand
[13:43:12] the AccountCreation schema is not white-listed, so it's set up for full auto-purging after 90 days
[13:44:08] fyi i'm looking into these failed oozie jobs
[13:44:14] druid jobs*
[13:44:23] this looks similar to what happened when we first upgraded to 0.9.2
[13:44:30] ottomata, I think joal re-ran them this morning?
[13:44:33] so i must have missed something, (or something isn't puppetized 100%)
[13:44:33] yes
[13:44:39] and they will succeed sometimes
[13:44:42] if it is the same problem
[13:44:44] ok
[13:44:49] but fail others
[13:45:18] addshore, then I think it's totally safe to add those logs
[13:45:37] mforns: awesome! I'll get on to writing the code at some point then! :)
[13:48:46] addshore, just a heads-up, we've notified this lots of times in different channels, but just in case: this schema is not white-listed, so its data will be full-purged after 90 days. If you wish to keep data for longer, it should be analyzed for privacy sensitive structures and white-listed accordingly. Having a first look at the schema, I think at least the internalReferrer field would have to be purged in any case.
[13:57:44] !log sudo -u hdfs hdfs dfs -rm /tmp/druid-indexing/classpath/guava.jar (guava 11.0.2 is conflicting with guava 16.0.1. from druid-hdfs-storage-cdh extension). Not sure how guava 11.0.2 got there, but let's see if it doesn't come back
[13:57:46] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[14:17:34] (PS1) Mforns: [wIP] Close all sessions to avoid database connection errors [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/373078 (https://phabricator.wikimedia.org/T173585)
[14:18:26] (CR) jerkins-bot: [V: -1] [wIP] Close all sessions to avoid database connection errors [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/373078 (https://phabricator.wikimedia.org/T173585) (owner: Mforns)
[14:46:46] ottomata, yt?
[14:52:22] Analytics, Analytics-Cluster, Operations, ops-eqiad, and 2 others: rack/setup/install new kafka nodes kafka-jumbo100[1-6] - https://phabricator.wikimedia.org/T167992#3541450 (Cmjohnson)
[14:54:10] mforns: sorry
[14:54:10] ya
[14:54:11] hey
[14:54:22] ottomata, hello
[14:54:59] I have one question about wikimetrics-web service, do you recall how to restart it?
[14:55:06] I can not see it...
[14:55:23] I can see wikimetrics-queue, wikimetrics-scheduler
[14:55:29] but not wikimetrics-web
[14:56:26] I know mostly dan and madhuvishy knew more about this, but just in case
[14:56:53] Analytics, Analytics-Cluster, Operations, ops-eqiad, User-Elukey: kafka-jumbo1004 h/w problem most likely raid card - https://phabricator.wikimedia.org/T173837#3541461 (Cmjohnson)
[14:57:02] hmmmm
[14:57:11] can't say I do mforns...
[14:57:31] ottomata, I can't even find apache...
[14:57:38] mforns: maybe
[14:57:45] uwsgi-wikimetrics-web
[14:57:45] ?
[14:57:57] mmmmmm yea maybe
[14:58:02] just looking in puppet
[14:58:04] not sure
[14:58:53] ottomata, yes!
[14:59:08] thank youuuuuuu!
[15:01:54] AH
[15:01:59] SATANDUP
[15:02:01] whoa!
[15:02:04] satan dup
[15:13:39] (CR) Ottomata: [wIP] Close all sessions to avoid database connection errors (1 comment) [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/373078 (https://phabricator.wikimedia.org/T173585) (owner: Mforns)
[15:55:08] (PS2) Mforns: [wIP] Close all sessions to avoid database connection errors [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/373078 (https://phabricator.wikimedia.org/T173585)
[15:56:16] (CR) jerkins-bot: [V: -1] [wIP] Close all sessions to avoid database connection errors [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/373078 (https://phabricator.wikimedia.org/T173585) (owner: Mforns)
[16:07:19] joal mforns whenever you folks want we can catch up for a couple mins
[16:07:27] fdans, sure
[16:07:30] anytime
[16:29:52] when you want guys :)
[16:37:44] Analytics, Analytics-Cluster, Operations, ops-eqiad, User-Elukey: kafka-jumbo1004 h/w problem most likely raid card - https://phabricator.wikimedia.org/T173837#3541879 (Cmjohnson) open a self dispatch with Dell for a new raid card
[16:39:00] actually, will be back after family dinner
[16:58:14] joal: no luck i guess huh?
[16:58:35] damn, I just saw your message joal
[16:58:43] do you want to do it now mforns ?
[16:58:58] fdans, ok!
[16:59:52] I'm at the batcave having batcaviar
[17:02:24] hehe ok
[18:23:48] Analytics, Analytics-EventLogging, AbuseFilter, CirrusSearch, and 29 others: Possible WMF deployed extension PHP 7 issues - https://phabricator.wikimedia.org/T173850#3542316 (Reedy)
[18:34:15] Analytics, Analytics-EventLogging, Contributors-Analysis, EventBus, and 2 others: Visualize page create events for all wikis - https://phabricator.wikimedia.org/T170850#3542387 (Nettrom) I'm working on this and got ReportUpdater working locally. A couple of questions: # Most of the examples I'v...
[18:46:14] fdans, mforns: still here?
[18:46:48] joal I am! had a chat with mforns already
[18:46:59] ok fdans
[18:46:59] we can talk tomorrow after standup if you want
[18:47:07] sounds good :)
[18:49:56] ottomata: nice post on event-oriented architecture !
[18:55:18] ottomata: no luck no ... Interesting post on class-path on spark: https://stackoverflow.com/questions/37132559/add-jars-to-a-spark-job-spark-submit
[18:55:26] ottomata: I'm gonna stop for now ...
[18:56:54] Analytics, Analytics-EventLogging, AbuseFilter, CirrusSearch, and 29 others: Possible WMF deployed extension PHP 7 issues - https://phabricator.wikimedia.org/T173850#3541977 (Anomie) I'm not impressed with the false positive rate for this tool. The warnings in OAuth, CentralAuth, and Scribunto al...
[18:57:37] ok joal
[18:57:48] ottomata: I'm a bit sad :(
[18:59:12] joal
[18:59:12] • --jars vs SparkContext.addJar: These are identical, only one is set through spark submit and one via code. Choose the one which suits you better. One important thing to note is that using either of these options does not add the JAR to your driver/executor classpath, you'll need to explicitly add them using the extraClassPath config on both.
[18:59:30] Indeed
[18:59:34] did you try that?
[18:59:39] haha i guess you did
[18:59:42] you pasted this link to me :)
[19:00:18] What is needed (if I understand correctly), is the --jars AND the --extraClassPath
[19:03:10] right, did you try that?
[19:03:31] yes I did
[19:03:37] it didn't work :(
[19:04:34] rats
[19:05:10] logs say the 2 jars are present, but still no luck
[19:06:15] :/
[19:15:50] Analytics, Analytics-EventLogging, Contributors-Analysis, EventBus, and 2 others: Visualize page create events for all wikis - https://phabricator.wikimedia.org/T170850#3542638 (kaldari) >What measurements are we interested in? Whatever you need for ACTRIAL, plus some basic ones: - Page creatio...
[19:49:01] mforns: you still there?
[19:49:06] ottomata, yes!
[19:49:13] sup?
[19:49:14] got a few mins for a python brain bounce?
[19:49:19] sure!
[19:49:25] batcave?
[19:49:27] ya
[19:49:29] omw
[19:52:25] !log Drop / recreate wmf.mediawiki_history table for naming correction
[19:52:26] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[20:42:49] Analytics, Analytics-EventLogging, Contributors-Analysis, EventBus, and 2 others: Visualize page create events for all wikis - https://phabricator.wikimedia.org/T170850#3542935 (Ottomata) @mforns has more reportupdater know how than me, I'll let him respond :)
[20:51:49] Analytics, Analytics-EventLogging, Contributors-Analysis, EventBus, and 2 others: Visualize page create events for all wikis - https://phabricator.wikimedia.org/T170850#3542975 (mforns) @Nettrom @kaldari > Most of the examples I've seen create datasets with a single measurement, meaning there a...
[20:52:07] Analytics, Operations, Ops-Access-Requests, Research, and 2 others: NDA, MOU and LDAP (analytics cluster) for Shilad Sen - https://phabricator.wikimedia.org/T171988#3542976 (herron) Shell account `shiladsen` has been added to puppet and deployed to stat systems: stat1003:~$ id shiladsen uid=...
[20:52:39] Analytics, Operations, Ops-Access-Requests, Research, and 2 others: NDA, MOU and LDAP (analytics cluster) for Shilad Sen - https://phabricator.wikimedia.org/T171988#3542977 (herron)
[21:47:33] Analytics, Operations, Ops-Access-Requests, Research, and 2 others: NDA, MOU and LDAP (analytics cluster) for Shilad Sen - https://phabricator.wikimedia.org/T171988#3543170 (DarTar) Thanks @herron. @Shilad: Aaron is currently traveling, but let me know if you need any assistance. If we have an...
[23:17:23] Analytics, ChangeProp, EventBus, MediaWiki-JobQueue, and 4 others: Investigate use-cases for delayed job executions - https://phabricator.wikimedia.org/T172832#3543355 (ggellerman) @dr0ptp4kt @Phuedx @Fjalapeno @kaldari @TrevorParscal adding you to make you aware of this request from Services t...