[04:40:13] (PS2) Terrrydactyl: Added projects to csv output [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/138474 [05:36:37] (PS6) AndyRussG: WIP Create a cohort from campaign membership [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/126927 (owner: Awight) [16:26:28] ottomata, the reducer restarts are still happening, at an earlier stage, even with a restriciton to 10 percent of data [16:27:11] (CR) Nuria: "This is much better. Couple minor points:" [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/133091 (owner: Terrrydactyl) [16:30:30] Ironholds: just curious, what data are you running on? [16:30:37] seeing as uhhh, things are pretty much broken right now :/ [16:31:34] ottomata, define 'broken'? [16:31:40] and webrequest_source='text' [16:31:43] data from June [16:31:47] well, new data since saturday [16:31:50] not coming in 100% [16:31:56] we ahve a broker down, which means lots of dropped messages [16:32:07] i am in the process of taking one datanode out of the cluster, then we will take a couple more probably [16:32:08] yee [16:32:10] kk [16:32:24] so, with this query we have a reducer die-off at 4% and then the same pattern as before [16:32:28] aye [16:32:29] 33%, stalling, stalling, 40%, die. [16:32:41] Ironholds: i'm really sorry I haven't had time to look into your queries since last week really :/ [16:32:44] There shouldn't be any consistency in which machines are assigned in which order, should they? [16:32:46] it's okay. [16:32:52] i have to do these stupid reviews today, and we have more meetings [16:32:52] sigh [16:34:51] ottomata: you're doing awesome work, the pieces will align, don't worry [16:35:26] thanks, ori :) [16:35:27] i'm at some remove and from my vantage point things look like they're coming together [16:35:41] yeah, i think so too, just this past weekend wasn't a good sign :/ [16:35:50] i mean, we knew something like that could happen, hence the planned capacity increases [16:35:50] but ja [16:36:01] also, Ironholds has crazy query problems that are hard to troubleshoot [16:36:19] that will either be solved by me tweaking some yet unknown hadoop knobs [16:36:24] or understanding how to make his queries better [16:37:36] yeah, I have no idea what's going on [16:37:47] but I'm happy I could break everything *before* we had production systems on it :D [17:03:42] hello everyone! are we having a showcase today? [18:01:25] (PS9) Terrrydactyl: Add ability to tag a cohort [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/133091 [18:29:54] milimetric: Google does not allow me to reconnect. [18:30:05] milimetric: Might take a few minutes :-( [20:42:01] (PS1) QChris: Make daily dammit compact script rsync files if monthly step fails [analytics/wikistats] - https://gerrit.wikimedia.org/r/138685