[11:34:39] Analytics-Kanban, User-Elukey: Improve purging for analytics-slave data on Eventlogging - https://phabricator.wikimedia.org/T156933#3254707 (elukey) Some notes taken while researching: * running the purge script and `eventlogging_syncy.sh` on the same host might generate contention while INSERTING/DELE...
[11:41:19] taking a break lads
[11:41:23] joal: o/
[11:42:01] fdans: hola amigo
[12:22:15] elukey: holaaaa sorry I was at lunch
[12:35:25] Analytics, EventBus, Patch-For-Review, Services (watching): Check eventbus Kafka cluster settings for reliability - https://phabricator.wikimedia.org/T144637#3254899 (elukey) I think I already asked this question to @Ottomata but I forgot the answer :D, so I am going to ask again: any reason to k...
[12:36:18] fdans: o/
[12:41:24] fdans: I think that in beta everything is good, I applied your patch as cherry pick and I didn't end up in detached head
[12:43:44] and before on tin the branch was in detached head
[12:43:57] (el analytics repo I mean)
[12:52:57] elukey: wonderful!! let's deploy then?
[12:53:17] oh I see so the key is that HEAD isn't in a detached state
[12:53:20] that makes sense
[12:55:17] fdans: do you know about the cherry pick automagic from the gerrit UI?
[12:55:54] elukey: nope! is that the piece of the puzzle that I'm missing that will make me looooove gerrit? :D
[12:56:06] https://gerrit.wikimedia.org/r/#/c/352579/ --> top right corner 'Download' --> select anonymous http in the bottom --> Cherry pick
[12:56:37] then you apply that one on deployment-tin
[12:58:18] about the deployment: since https://gerrit.wikimedia.org/r/#/c/352579 has no +1 in there and Andrew is on vacation and we are going to Prague soon, let's postpone the deployment to when we'll be back?
[12:58:40] unless it is super urgent
[12:59:01] I like Joseph's idea of "deployment freeze" for analytics a couple of days before the offsie
[12:59:04] *offsite
[13:03:16] yeah that makes sense
[13:08:44] hey team :]
[13:08:44] elukey: +1
[13:09:07] to deployment freeze from today until we are all in Prague
[13:18:37] mforns: are the tests for refinery-core broken for you in master?
[13:18:58] nuria_, /me checks
[13:24:58] nuria_, refinery-core passes for me...
[13:25:09] mforns: master?
[13:25:17] nuria_, but refinery-jobs fails!
[13:25:19] yes, master
[13:25:33] mforns: ok, for me it fails, with some dependencies not present
[13:25:54] for refinery-job I get: java.io.FileNotFoundException: File /tmp/unittest/refinery-source/refinery-job/TestSubgraphPartitioner/38de5826-c854-4dbc-97f4-a022b28db32e does not exist
[13:29:35] (PS1) Nuria: [WIP] UDF to tag requests [analytics/refinery/source] - https://gerrit.wikimedia.org/r/353287 (https://phabricator.wikimedia.org/T164021)
[13:30:29] mforns: for me master is all broken, ok, we are going to have to look at it in more detail, fdans, do tests for master in refinery work for you?
[13:31:04] refinery_source no?
[13:31:24] nuria_: checking
[13:33:09] (PS2) Nuria: [WIP] UDF to tag requests [analytics/refinery/source] - https://gerrit.wikimedia.org/r/353287 (https://phabricator.wikimedia.org/T164021)
[13:34:08] mforns: right , refinery-core actually
[13:35:01] refinery core seems fine to me after pulling nuria_
[13:35:18] fdans: so tests pass
[13:35:23] yes
[13:35:59] fdans: ok, let me blow up all deps then, i wonder if i need a new vs of scala
[13:36:43] nuria_: I'm running them with mvn test, that's the right way?
[13:37:03] fdans: yaya totally, i must have something not right on my env
[13:39:48] fdans: ok, refinery-job ?
[13:39:53] fdans: i think that one fails
[13:40:02] checking
[13:41:11] nuria_: which error are you getting?
[13:41:30] fdans, mforns all ok now but refinery-job
[13:41:39] there are several tests that fail
[13:42:34] refinery-job fails for me too
[13:43:22] +1
[13:43:28] refinery-job failing
[13:49:08] hey guys - mvn clean package works for me on master
[13:51:05] :/
[13:59:21] joal: it is an issue of deps: no snappyjava in java.library.path
[14:01:17] elukey: hola! standup
[14:05:54] just retested on stat1004 after having removed ~/.m2 folder (force cleaning mvn deps) --> works for me :(
[14:06:09] refinery-source --^
[14:44:05] https://phabricator.wikimedia.org/T164377#3235864
[14:56:29] nuria_: redirect codes we should look at( sorry lost the chat when closing hangout): 301, 302, 304, 307 0 anything else?n
[15:04:37] Analytics, Scoring-platform-team, rsaas-articlequality, Spike: [Spike] Store article quality data inside hadoop and make AQS outputs a public API - https://phabricator.wikimedia.org/T164377#3255338 (Nuria) >Is there a public hadoop task that we could make this task block on? Our plans for next y...
[15:04:51] 304 is pageview
[15:04:56] ~^joal
[15:05:01] so you alredy have that one
[15:05:05] *already
[15:05:14] the others are 307, 301 and 302
[15:05:22] halfak: answered on ticket
[15:11:35] thanks nuria_
[15:12:23] Analytics-Kanban: Cleaning scheme for banner data _SUCCESS files - https://phabricator.wikimedia.org/T164497#3255364 (mforns) a:mforns
[15:14:25] (PS1) Mforns: Add script to delete banner activity _SUCCESS files [analytics/refinery] - https://gerrit.wikimedia.org/r/353309 (https://phabricator.wikimedia.org/T164497)
[15:15:45] (PS1) Joal: Provide RedirectToPageview function and UDF [analytics/refinery/source] - https://gerrit.wikimedia.org/r/353310 (https://phabricator.wikimedia.org/T143928)
[15:53:22] nuria_: There must be something else :(
[15:54:31] there is a HUGE artefact on m.wikidata.org for last-acess-global-set and last-access-not-set
[15:54:38] and redirects don't cover
[15:54:45] There must be something else
[16:14:05] joal: did you look at http codes
[16:15:44] I'm not sure what you mean
[16:16:46] nuria_: was expecting that, adding new redirects to set, we would find a size-equivalent uniques set for global as the ones on m.wikidata.org
[16:16:55] nuria_: But there is not :(
[16:17:13] joal: what about if you do not exclude any http code?
[16:17:24] hm
[16:17:27] will try
[16:17:34] joal: sorry, mi initial reply was not clear
[16:25:37] nuria_: mismo :(
[16:26:10] nuria_: numbers are bigger on some other specific cases, but the artifact is still here
[16:26:15] joal: the number increases when looking at redirects right? (even if there is a disparity)
[16:27:02] nuria_: total number of uniques found increases, yes - but the artifact between global and not on L-A not set while L-A-G set is still here
[16:27:33] joal: ok, we are going to have to do more thinking
[16:27:45] Looks like so :(
[16:27:55] I was so happy we had found a good candidate
[16:28:51] people I am going afk! o/
[16:29:04] bye elukey
[17:21:32] Analytics-Kanban: Create purging script for mediawiki-history data - https://phabricator.wikimedia.org/T162034#3256023 (mforns) @JAllemandou > Can we drop partitions that contain subpartitions, or do we need to iterate over every subpartition? Thanks for the heads-up! In my ignorance, though, I don't unde...
[17:22:32] joal, I commented on the mediawiki snapshot task, about your last comment, when you have time can you have a look, please? :]
[17:56:30] Hi mforns - commenting :)
[17:56:37] :] thx joal
[17:56:38] sorry for the delay mforns
[17:56:41] np
[18:07:17] Analytics-Kanban: Create purging script for mediawiki-history data - https://phabricator.wikimedia.org/T162034#3256238 (JAllemandou) > In my ignorance, though, I don't understand what could be the potential problem of deleting the whole snapshot directory (with sub-partitions in it)? You think Hadoop won't...
[18:10:08] joal, thanks for the comments
[18:19:02] mforns: np ! Does it make sense ?
[18:19:44] joal, yes! I was thinking of a low tech alternative which was to execute the msck repair table after all deletions, but your solution is better
[18:20:02] even... repair table might not work at all...
[18:20:18] mforns: I think repair only adds, doesn't delete unfortunately
[18:20:23] I see
[18:20:30] mforns: not sure though
[18:21:02] anyway, the drop partition command both deletes data AND meta-data right?
[18:21:32] nope mforns, external tables, so only meta-data when droping partitions
[18:21:32] joal, ^
[18:21:37] oh, ok
[18:22:02] K, I think I can finish the script now, thanks a lot!
[18:22:15] no prob mforns, thanks a lot for doing that !
[18:22:23] o/
[18:54:00] Analytics-Kanban, DC-Ops, Operations, ops-eqiad: analytics1030 stuck in console while booting - https://phabricator.wikimedia.org/T162046#3256414 (Cmjohnson) The new system board has been ordered and I will be contacted by a Dell tech to visit the cage and replace. In regards to Service Tag –...
[20:19:16] Analytics, Fundraising-Backlog, MediaWiki-extensions-CentralNotice: Make banner impression counts available somewhere public - https://phabricator.wikimedia.org/T115042#3256732 (AndyRussG) >>! In T115042#3252540, @mforns wrote: > 1) We spoke about the target segment of a campaign, and gave as an exam...
[20:55:10] Analytics, Community-Tech-Sprint: Investigation: How can we improve the speed of the popular pages bot - https://phabricator.wikimedia.org/T164178#3256808 (Niharika) The bot is currently using promises for fetching redirects and it's...lightning fast somehow...Examples below - Before (~20 hours): ``` 20...
[20:57:21] Analytics, Community-Tech-Sprint: Investigation: How can we improve the speed of the popular pages bot - https://phabricator.wikimedia.org/T164178#3256813 (kaldari) @Niharika: Since using promises provides such a dramatic speed improvement, I would hate for us to throw that away for the majority of WikiP...
[21:30:02] Analytics, Community-Tech-Sprint: Investigation: How can we improve the speed of the popular pages bot - https://phabricator.wikimedia.org/T164178#3256908 (MusikAnimal) >>! In T164178#3256808, @Niharika wrote: > The bot is currently using promises for fetching redirects and it's...lightning fast somehow....
[21:38:13] Analytics, Community-Tech-Sprint: Investigation: How can we improve the speed of the popular pages bot - https://phabricator.wikimedia.org/T164178#3256959 (Niharika) >>! In T164178#3256813, @kaldari wrote: > @Niharika: Since using promises provides such a dramatic speed improvement, I would hate for us t...