[00:33:41] Analytics-EventLogging: Investigate duplicate EventLogging rows - https://phabricator.wikimedia.org/T142667#2542598 (Tbayer)
[07:57:05] (PS1) Joal: Update maven jar building in refinery [analytics/refinery/source] - https://gerrit.wikimedia.org/r/304184
[07:58:50] (PS2) Joal: Update maven jar building in refinery [analytics/refinery/source] - https://gerrit.wikimedia.org/r/304184
[08:17:17] a-team: I need to restart the kafka analytics cluster for JVM upgrades, if you see anything weird please let me know :)
[08:17:27] elukey: Sure !
[08:17:27] * elukey looks at eventlogging
[08:28:19] EL seems fine
[08:28:21] woa
[08:33:36] elukey: woa?
[08:34:11] I meant "WOOOOOOOO!"
[08:34:24] Ahhhh :)
[08:34:55] I can see errors in the logs but then everything recovers with a metadata refresh
[08:40:20] great work from Andrew
[08:53:18] (PS18) Joal: [WIP] Refactor Mediawiki History scala code [analytics/refinery/source] - https://gerrit.wikimedia.org/r/301837 (https://phabricator.wikimedia.org/T141548)
[10:00:53] (PS1) Joal: Modify bin/camus script to follow refinery update [analytics/refinery] - https://gerrit.wikimedia.org/r/304194
[10:04:47] hi a-team!
[10:04:53] Hi mforns !
[10:05:08] mforns: I pushed some changes on scala code if you want to rebase :)
[10:05:44] joal, ok, cool, I pushed yesterday after logging off, so I think I won't need to
[10:05:55] mforns: Arf, I didn't update
[10:05:58] mforns: will do !
[10:06:23] joal, oh, np I can do that, don't worry
[10:06:35] mforns: nono, really, I should have double checked!
[10:13:11] mobrovac: I'd need to restart kafka on kafka200[12] for jvm upgrades, anything against it?
[10:13:31] nope elukey, go ahead
[10:13:39] elukey: will you need to do it in eqiad too?
[10:19:31] mobrovac: it will be done by Andrew next week as part of the kafka 0.9 upgrade
[10:20:03] kk awesome
[10:27:32] elukey: loading of 2nd month done
[10:27:39] elukey: waiting for compaction
[10:28:14] goooood
[10:29:04] let me know after lunch when you'll have 20 mins to chat about manual loading and next steps for the password thing
[10:31:40] elukey: sure, in like 15mins?
[10:31:57] elukey: I'd like to finish something with mforns :)
[10:32:11] mforns: I have an issue with some tests you wrote: they fail :(
[10:32:23] joal, oh...
[10:32:36] they were passing for me... in mvn at least
[10:32:48] mforns: I don't understand because mvn passed for your patch, yes exactly
[10:33:08] joal, batcave?
[10:33:10] sure
[10:33:13] ok
[10:38:28] joal: let's do it in a couple of hours ok?
[10:47:33] (PS19) Joal: [WIP] Refactor Mediawiki History scala code [analytics/refinery/source] - https://gerrit.wikimedia.org/r/301837 (https://phabricator.wikimedia.org/T141548)
[10:47:40] mforns: --^
[10:47:49] thanks a lot joal
[10:47:50] mforns: Sorry for the mess of rebases :(
[10:48:04] mforns: I added comments with todo on tests that failed for me
[10:48:09] np, thanks for changing the subgraph thing
[10:48:14] ok
[10:48:15] a-team: taking a break :)
[10:59:14] * elukey lunch!
[11:39:40] Analytics: Test - https://phabricator.wikimedia.org/T142702#2543554 (SaiLahariMedikonda)
[11:40:11] Analytics: Test - https://phabricator.wikimedia.org/T142702#2543575 (SaiLahariMedikonda) a:SaiLahariMedikonda>None
[12:16:22] Analytics-Cluster: Queries in Hue always return an empty result set - https://phabricator.wikimedia.org/T128039#2543720 (akosiaris)
[12:56:11] team I am going to restart cassandra on aqs100[123] for security upgrades
[12:59:03] Analytics-Cluster, Operations: Migrate titanium to jessie (archiva.wikimedia.org upgrade) - https://phabricator.wikimedia.org/T123725#2543774 (MoritzMuehlenhoff) a:MoritzMuehlenhoff>None
[13:06:52] aqs1001 done, will proceed in a bit with 1002
[13:09:05] (CR) Ottomata: [C: 2] Update maven jar building in refinery [analytics/refinery/source] - https://gerrit.wikimedia.org/r/304184 (owner: Joal)
[13:09:10] (CR) Ottomata: [C: 1] Update maven jar building in refinery [analytics/refinery/source] - https://gerrit.wikimedia.org/r/304184 (owner: Joal)
[13:09:27] (CR) Ottomata: [C: 1] Modify bin/camus script to follow refinery update [analytics/refinery] - https://gerrit.wikimedia.org/r/304194 (owner: Joal)
[13:42:19] joal: I have a short-medium term idea for the cassandra admin username
[13:44:22] but I'd need to know first what credentials a 'aqs-loader' user will need
[13:44:37] atm 'aqs' has CREATE, ALTER, DROP, SELECT, MODIFY
[13:45:05] not sure if INSERT is allowed but I guess so
[13:45:44] anyhow, the idea would be to create the new user with minimal permissions and assign the pass that we are using for 'cassandra'
[13:46:26] then we'll be able to 1) avoid using the admin user 2) change the admin password 3) force oozie to use a user with less priviledges
[13:46:55] in this way we wouldn't solve completely the problem but at least we'd limit the security exposure
[13:47:11] then the last step would be to secure the aqs-loader password
[13:59:39] elukey: sounds a very good idea :)
[14:00:32] elukey: we'll need
to test, but I think SELECT and MODIFY are enough for the loader (i'd even say modify only, but select is cheap ;)
[14:02:49] ah MODIFY == INSERT, DELETE, UPDATE, TRUNCATE
[14:04:33] joal: my idea is to create a similar adduser.cql file (like addaqsloaderuser.cql) storing the initial password in the private repo
[14:04:54] push the file to our aqs hosts
[14:05:08] and create the user with it
[14:05:22] so it will be easy to redo it
[14:05:25] and more documented
[14:05:30] then we just try it
[14:06:43] let me see if I can create a puppet patch
[14:07:13] sounds good elukey
[14:07:33] elukey: only thing I don't yet forsee is how to pass the password to oozie job
[14:08:17] I tried to think something today and failed miserably
[14:08:31] huhu :)
[14:08:41] I mean, failed == ended up with a big hack
[14:10:15] Analytics-Cluster, Graphoid, Operations, Services, and 2 others: Graphoid access logs are missing from Hadoop - https://phabricator.wikimedia.org/T99372#2544017 (akosiaris) parsoidcache has been deprecated and graphoid is now exposed via the text cluster. That solves the problem and graphoid logs...
[14:10:37] Analytics-Cluster, Graphoid, Operations, Services, and 2 others: Graphoid access logs are missing from Hadoop - https://phabricator.wikimedia.org/T99372#2544018 (akosiaris) Open>Resolved a:akosiaris
[14:16:03] Analytics, Operations, Traffic: Correct cache_status field on webrequest dataset - https://phabricator.wikimedia.org/T142410#2544038 (akosiaris) p:Triage>Normal
[14:16:44] Analytics, Operations, Traffic: Correct cache_status field on webrequest dataset - https://phabricator.wikimedia.org/T142410#2544040 (BBlack) Open>Resolved a:BBlack I think we're done here, assuming the data looks sane on the analytics end.
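Editor's note: the least-privilege idea discussed above (a separate loader user holding only SELECT and MODIFY) can be sketched as a small generator for the statements that would go into a file like addaqsloaderuser.cql. This is only an illustration under assumptions: the function name `loader_user_cql`, the user name `aqsloader`, and the keyspace name `example_keyspace` are hypothetical, and the real file, keyspaces, and password handling were not shown in the channel.

```python
from typing import List


def loader_user_cql(user: str, password: str, keyspaces: List[str]) -> List[str]:
    """Build CQL statements for a non-superuser limited to SELECT and MODIFY.

    Hypothetical helper for illustration; names are not taken from the log.
    """
    # CQL string literals escape a single quote by doubling it.
    escaped = password.replace("'", "''")
    statements = [f"CREATE USER {user} WITH PASSWORD '{escaped}' NOSUPERUSER;"]
    for keyspace in keyspaces:
        # MODIFY covers INSERT, DELETE, UPDATE and TRUNCATE; SELECT is read-only.
        for permission in ("SELECT", "MODIFY"):
            statements.append(
                f"GRANT {permission} ON KEYSPACE {keyspace} TO {user};"
            )
    return statements


if __name__ == "__main__":
    # Placeholder values only; a real password would come from the private repo.
    for line in loader_user_cql("aqsloader", "changeme", ["example_keyspace"]):
        print(line)
```

Granting per keyspace rather than on all keyspaces is a design choice made for this sketch; the channel only agreed on which permissions the loader needs, not on their scope.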
[14:22:33] ouch not that easy to put into the aqs module
[14:22:50] it references a lot of cassandra instance puppet class things
[14:22:51] grr
[14:23:55] Analytics-Cluster, Operations: Migrate titanium to jessie (archiva.wikimedia.org upgrade) - https://phabricator.wikimedia.org/T123725#2544059 (akosiaris) p:Triage>Normal
[14:37:16] joal, milimetric, hi! do you have 10 mins for opinion on tests?
[14:37:24] Hi mforns
[14:37:28] mforns: sure
[14:37:34] hi mforns, I'm struggling with oozie but I have no opinion on tests
[14:37:39] I'll do it however you guys like
[14:37:44] milimetric, xD
[14:38:04] milimetric, continue with oozie, it's OK I'll ask joseph
[14:38:08] joal, batcave?
[14:38:13] mforns: OMW !
[14:38:21] milimetric, join if you think you can't miss this hehe
[14:38:48] :) thx
[14:53:17] dudes my internet at this care seems really really bad
[14:53:21] will try standup but i dunno
[14:53:29] also, my phone is still busted so i can't use phone internet :/
[14:55:40] we could see who's on and just IRC standup
[14:55:57] no prob for me
[15:05:38] Analytics-Kanban: Update refinery to solve dependency conflicts - https://phabricator.wikimedia.org/T142717#2544163 (JAllemandou)
[16:00:30] elukey: re aqsloader user. why not just 'aqs' for name. second: i think it would be totally fine to put the oozie .properties file in puppet as a template.
[16:00:33] don't put any .xml files in there
[16:00:52] just the one .properties file that will be used to launch the job.
[16:00:52] OR
[16:00:53] OR
[16:01:01] elukey: don't put the password anywhere
[16:01:01] actually
[16:01:02] yeah
[16:01:12] the password could be passed in on the CLI.
[16:01:13] HMMMM
[16:01:18] but then again.
[16:01:31] all the properties will be exposed in the job browser / in hue
[16:01:31] yeah today I had a lot of "hmmmmm"s too
[16:01:49] so yeah , hm, it will be readable no matter what we do.
[16:01:50] i thikn
[16:02:33] ottomata, elukey : We could modify cassandra loading job to be given a file with the password for cassandra
[16:02:58] was about to suggest something like that
[16:03:00] This file should be hdfs readable-only on hdfs
[16:03:04] +1
[16:03:29] a-team i can barely hear anything in the meeting, sorry
[16:03:33] if there is something you need me to task let me know
[16:04:14] let me if you want to apply my short to medium solution (so maybe adding the user manually as temp step is fine) or if we want to go straight to the longer term one
[16:04:22] I am ok either way :)
[16:06:08] Analytics-Kanban: Page History: Add unit tests to PageHistoryDataExtractors and PageHistoryBuilder - https://phabricator.wikimedia.org/T142724#2544366 (JAllemandou)
[16:06:21] Analytics-Kanban: Productionize scala code for edit reconstruction - https://phabricator.wikimedia.org/T142552#2544380 (JAllemandou) a:JAllemandou>None
[16:06:33] Analytics-Kanban: Page History: Add unit tests to PageHistoryDataExtractors and PageHistoryBuilder - https://phabricator.wikimedia.org/T142724#2544366 (JAllemandou) p:Triage>Normal
[16:06:41] Analytics-Kanban: User history: Adapt the user history reconstruction to use scaling by clustering - https://phabricator.wikimedia.org/T141774#2544383 (Nuria) This task was about parallelization strategy, which worked, but code is not final. Following up will be done on a different task.
[16:10:52] Analytics-Kanban: Productionize scala code for edit reconstruction - https://phabricator.wikimedia.org/T142552#2544395 (Nuria)
[16:13:17] Analytics-Kanban: Page History: Add unit tests to PageHistoryDataExtractors and PageHistoryBuilder - https://phabricator.wikimedia.org/T142724#2544401 (Nuria)
[16:13:19] Analytics-Kanban: Productionize scala code for edit reconstruction - https://phabricator.wikimedia.org/T142552#2544400 (Nuria)
[16:13:35] elukey: to puppet?
naw i think its fine not to puppetize the cassandra user
[16:13:39] i'd just add it and document.
[16:14:23] parameterize the username and pw in puppet somehow, but not the creattion/grant in cassandra
[16:15:50] Analytics-Kanban: Page History: Add unit tests to PageHistoryDataExtractors and PageHistoryBuilder - https://phabricator.wikimedia.org/T142724#2544366 (Nuria) Some tests will be units, others -due to nature of code- will only be integration tests. We need to create some fixtures with fake data to test input...
[16:16:32] yeah the adduser.cql needs to be manually run
[16:16:35] ottomata: --^
[16:16:37] so something like that
[16:19:11] ottomata: just configured the new disk on an1045 and rebooted the host
[16:19:14] all good
[16:19:45] nice! thanks elukey
[16:19:55] elukey: yeah, just manually add the user and document, i think thats totally enough for creating the user
[16:21:17] hello, all! any idea why only some pageviews get recorded for Special:Random? http://tools.wmflabs.org/pageviews/?project=en.wikipedia.org&platform=all-access&agent=user&range=latest-20&pages=Special:Random
[16:21:50] Analytics-Kanban: Update Refinery to solve dpendency onflicts - https://phabricator.wikimedia.org/T142717#2544409 (Nuria)
[16:23:46] those numbers are far too low, but obviously Special:Random isn't a page you can actually browse to (and returns a 302 redirect and not a 301, if that matters). So it's interesting we're still seeing some pageviews
[16:24:48] ottomata: all right, will work on that tomorrow, thanks :)
[16:36:15] Analytics-Kanban: Productionize edit history extraction for all wikis using Sqoop - https://phabricator.wikimedia.org/T141476#2544435 (Nuria)
[16:41:46] ottomata: ready for a big deploy time now ?
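Editor's note: the earlier suggestion to give the Cassandra loading job a password file, rather than a job property that shows up in the job browser or Hue, can be sketched roughly as follows. This is a local-filesystem stand-in, not the HDFS implementation: the function name `read_password_file` is hypothetical, and a real job would read the file through an HDFS client and rely on HDFS permissions instead of the POSIX mode check used here.

```python
import os
import stat


def read_password_file(path: str) -> str:
    """Return the password stored in `path`, refusing loosely-permissioned files.

    Hypothetical helper; assumes a POSIX filesystem as a stand-in for HDFS.
    """
    mode = stat.S_IMODE(os.stat(path).st_mode)
    # Reject the file if group or other have any access bits set.
    if mode & (stat.S_IRWXG | stat.S_IRWXO):
        raise PermissionError(
            f"{path} is accessible to group/other (mode {oct(mode)})"
        )
    with open(path, encoding="utf-8") as handle:
        # Drop the trailing newline most editors add to the file.
        return handle.read().rstrip("\n")
```

With this shape, only the file path needs to appear in the job configuration, so the secret itself is not among the properties that the job browser displays.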
[16:42:00] (PS3) Joal: Update maven jar building in refinery [analytics/refinery/source] - https://gerrit.wikimedia.org/r/304184 (https://phabricator.wikimedia.org/T142717)
[16:45:05] (PS1) Joal: Update changelog for v0.0.34 [analytics/refinery/source] - https://gerrit.wikimedia.org/r/304259
[16:45:28] Arf ... looks like ottomata is not here anymore :(
[16:46:36] (PS2) Joal: Modify bin/camus script to follow refinery update [analytics/refinery] - https://gerrit.wikimedia.org/r/304194 (https://phabricator.wikimedia.org/T142717)
[16:48:21] here!
[16:48:21] joal:
[16:48:26] Yay :)
[16:48:26] soryr, was in 1:1
[16:48:30] np ottomata :)
[16:48:31] ja let's do it
[16:48:33] cool
[16:48:42] ottomata: 2 patches on refinery-source to start :)
[16:48:44] please :)
[16:49:41] holy moly slow internet
[16:49:46] waiting for gerrit pages to load
[16:49:54] :d
[16:52:12] joal: +2, merge whatever you can
[16:52:14] i'll do the puppet one
[16:52:17] when its time
[16:52:21] ottomata: Ok, doing
[16:52:26] i think you'llh ave to deploy
[16:52:31] ottomata: no prob
[16:52:41] (CR) Joal: [C: 2] Update maven jar building in refinery [analytics/refinery/source] - https://gerrit.wikimedia.org/r/304184 (https://phabricator.wikimedia.org/T142717) (owner: Joal)
[16:53:12] (CR) Joal: [C: 2] Update changelog for v0.0.34 [analytics/refinery/source] - https://gerrit.wikimedia.org/r/304259 (owner: Joal)
[16:53:56] (CR) Joal: [C: 2] Update maven jar building in refinery [analytics/refinery/source] - https://gerrit.wikimedia.org/r/304184 (https://phabricator.wikimedia.org/T142717) (owner: Joal)
[16:54:09] Man was on the wrong patch set
[16:55:54] YES phone decided to work
[16:55:57] phone internet activated!
[16:56:41] joal: lemme know when you have deployed and I can merge and run puppet
[16:56:59] ottomata1: Yup.
Jenkins taking his time for the moment :)
[16:57:12] (CR) Joal: [V: 2] Update maven jar building in refinery [analytics/refinery/source] - https://gerrit.wikimedia.org/r/304184 (https://phabricator.wikimedia.org/T142717) (owner: Joal)
[16:57:16] (Merged) jenkins-bot: Update maven jar building in refinery [analytics/refinery/source] - https://gerrit.wikimedia.org/r/304184 (https://phabricator.wikimedia.org/T142717) (owner: Joal)
[16:57:18] (Merged) jenkins-bot: Update changelog for v0.0.34 [analytics/refinery/source] - https://gerrit.wikimedia.org/r/304259 (owner: Joal)
[17:01:24] (CR) Joal: [C: 2 V: 2] "Merging for deploy." [analytics/refinery] - https://gerrit.wikimedia.org/r/304194 (https://phabricator.wikimedia.org/T142717) (owner: Joal)
[17:03:58] (CR) Joal: [C: 2 V: 2] Modify bin/camus script to follow refinery update [analytics/refinery] - https://gerrit.wikimedia.org/r/304194 (https://phabricator.wikimedia.org/T142717) (owner: Joal)
[17:07:40] elukey: have you seen aqs latencies since you restarted the machines?
[17:07:52] elukey: I want to know more !!!
[17:12:28] Analytics-Kanban: Update Refinery to solve dependencies conflicts - https://phabricator.wikimedia.org/T142717#2544630 (JAllemandou)
[17:19:10] ottomata: the old build and dpeloy system was less good, but it was faster ;)
[17:27:14] aye
[17:27:16] haha
[17:27:19] well, i mean, sorta
[17:27:26] except for maybe the upload artifacts to archiva
[17:27:28] that was still really slow
[17:39:48] ottomata: I was doing it from stat1002, reasonnably fast
[17:40:20] ottomata: refinery-source done
[17:40:42] ottomata: We need to sync the refinery and puppet deploy
[17:45:00] ottomata: Ready whenever you want :)
[18:02:13] was eating lunch
[22:31:28] (PS5) Milimetric: [WIP] Oozify sqoop import of mediawiki tables [analytics/refinery] - https://gerrit.wikimedia.org/r/303339 (https://phabricator.wikimedia.org/T141476)
[22:48:54] hey milimetric. do you have few min (if you're still around.)?
[23:12:21] hey lzia
[23:12:23] what's up
[23:12:45] ow hey milimetric. :)
[23:13:12] so, I wanted to see if I'm listing the right steps in https://phabricator.wikimedia.org/T142780 if I'm requesting access to webrequest logs for two researchers?
[23:13:58] lzia: task looks good, just tag Ops-Access-Requests on it
[23:14:05] it'll get discussed at the meeting and done
[23:14:09] perfect. thank you milimetric.
[23:14:17] lzia: but they're under the NDA right?
[23:14:20] yup
[23:14:27] NDA and MOU are already signed by all parties. :)
[23:14:47] k, and in the future if you forget, just remember to type "access-requests" in phab and you can see the other examples that tag it
[23:14:54] cool, welcome to more researchers :)
[23:14:54] yay
[23:15:12] oh! thank you! :) and thank you! :)
[23:15:34] Analytics-Kanban, Trash: ---- DISCUSSED BELOW ---- - https://phabricator.wikimedia.org/T114124#2546198 (Danny_B) Hello. Why don't you create another column then actually? You should be aware of that it is highly unreliable way of sorting and absolutely without any guarantee that that sorting will remain...
[23:17:58] Analytics-Kanban, Trash: ---- RUBICON ---- - https://phabricator.wikimedia.org/T104390#2546207 (Danny_B) Hello. Why don't you create another column then actually? You should be aware of that it is highly unreliable way of sorting and absolutely without any guarantee that that sorting will remain the sam...