[10:56:59] average are you around?
[13:25:14] hi drdee
[13:25:36] drdee: saw your e-mail, I'm working on fixing that, but I first have to wrap up documenting the package building workflow
[13:25:42] which I'm doing now
[13:26:05] k
[13:30:08] whoaaa, drdee alex k, has a kafka .deb!
[13:30:15] i have some things to do on that now
[13:30:23] should I work on that or on the hadoop node reinstall?
[13:31:22] yEs !!!!
[13:33:09] which one? i was in the middle of hadoop node reinstall puppet stuff on friday :)
[13:36:37] drdee?
[13:36:50] 1 sec
[13:37:09] (I think hadoop thing, it is more doable in the short term :)
[13:46:02] hey everyone
[13:46:32] compy problems - waking up the laptop from sleep is like a 20 minute ordeal, shutdown/startup is 15 seconds. I'm beginning to think sleep isn't worth it :)
[13:46:49] morning milimetric
[13:47:05] mornin
[13:47:07] ottomata: i would say help with kafka
[13:47:14] and let's finish that first
[13:53:27] welll, it sounds like alex is actually still working on it
[13:53:31] i'll let him do his thing for now
[13:58:39] ottomata: is it ok to assume that upstream tag versions have no correlation with debian tag versions ?
[14:01:21] bwer? (good that you are asking faidon :) )
[14:01:46] ottomata: ok :)
[14:12:25] drdee!
[14:12:26] https://blog.cloudera.com/blog/2013/05/cdh-4-3-is-released/
[14:12:49] i know
[14:12:55] i forgot to forward it to you
[14:47:35] drdee, so, alex put hive, pig, sqoop, etc. in our apt, but since he just did it he did it at 4.3, which put the other hadoop stuff in at 4.3
[14:47:44] which means when I reinstall a hadoop node, it will be on 4.3 and others will be on 4.2.1
[14:47:49] which is probably fine, but to avoid potential issues
[14:47:53] i'd rather just upgrade everything now
[14:47:54] thoughts?
[14:48:37] let's not mix versions; that caused issues the last time as well
[14:48:58] can't we just stick at 4.2.1 ?
[14:50:56] i'll ask
[14:51:44] ergh, hm, it looks like our apt only has 4.3.0 .debs right now
[15:08:58] drdee, ^ i asked alex but he's away now.
[15:21:51] New patchset: Erik Zachte; "more specific test for harvesting redirect localization from php message file" [analytics/wikistats] (master) - https://gerrit.wikimedia.org/r/66549
[15:24:55] Change merged: Erik Zachte; [analytics/wikistats] (master) - https://gerrit.wikimedia.org/r/66549
[15:42:39] milimetric; can you help me? i am blind
[15:46:08] milimetric ^^
[15:46:14] sure
[15:46:27] I'm no ophthalmologist
[15:46:32] but what's up
[15:46:57] ugh, why the heck did this thing not beep on your first message but beeped on your second
[15:47:03] exactly i need an ophthalmologist
[15:47:15] a source-code ophthalmologist
[15:47:18] :)
[15:47:26] i have some unit-tests failing (java)
[15:47:30] k, kraken?
[15:47:31] can't figure out why
[15:47:33] yup
[15:47:38] pushed?
[15:47:41] 1 sec
[15:48:00] (lunchtime, back in a bit!
[15:48:01] )
[15:50:12] ok, pushed
[15:50:33] running
[15:51:12] ok so these 10 tests fail starting with testSampleMobileLogLine
[15:51:23] yes
[15:51:36] i think there are two issues at stake
[15:54:06] i think one culprit is refersToSameArticle() but not sure why (in PageviewFilter.java)
[16:12:33] drdee: isn't this backwards: !pageviewFilter.refersToSameArticle(pageviewType, url, pageviewTypeReferer, referer)
[16:12:48] shouldn't that just be pageviewFilter.refersToSameArticle(pageviewType, url, pageviewTypeReferer, referer) without the "not"
[16:12:58] 219 of Pageview.java
[16:14:14] oh, misread
[16:14:20] i don't think so. if the referer and the url refer to the same article then that request should not be counted as a pageview
[16:14:24] "same" not "some"
[16:14:54] i need a real ophthalmologist
[16:15:20] :D
[16:16:27] now only 9 failing :)
[16:17:17] well, so it's just that method that's wrong (isWikistatsMobileReportPageview)
[16:17:39] it should return false if it's a mobile search for the test to work
[16:18:06] 8 failing
[16:19:47] ok
[16:23:09] testNullRequestMethod fails predictably because isPageView requires a request method ;)
[16:24:03] drdee, line 231 PageviewFilter.java ^^
[16:24:03] k
[16:24:09] fixing :)
[16:24:16] i will push some fixes soon
[16:24:19] k
[16:24:24] i'll look at the rest after
[16:24:34] drdee: have you found anything promising that could account for the low numbers?
[16:24:41] not yet
[16:25:10] i turned off the check in the pig script and got roughly twice the counts, which is a bit worrisome
[16:29:57] is this where we're trying to find counts that are 10x bigger erosen?
[16:30:41] are you counting in general where the x-cs is set or looking at a specific partner?
[16:32:11] milimetric: by summing the individual partners
[17:00:43] erosen: scrum
[17:08:29] average: which branch?
[17:08:41] on the dclass repo?
[17:23:27] btw, drdee, erosen, milimetric: pics!
[17:23:27] http://www.flickr.com/photos/ottomatona/sets/72157633905309558/
[17:23:37] ohoho
[17:23:46] cool :)
[17:23:51] awesome
[17:24:10] nice!
[17:26:34] ottomata: newpackage
[17:26:46] ottomata: steps to build so far are in DEBIAN.md
[17:26:56] ottomata: please have a look
[17:27:20] ottomata: is there a way I could get a link to the repo ?
[17:27:34] ottomata: wanted to show it in the scrum but didn't manage to get a link
[17:28:11] ja
[17:28:21] go here
[17:28:21] https://gerrit.wikimedia.org/r/#/admin/projects/
[17:28:26] click on (gitweb)
[17:28:27] aaah, thanks :)
[17:28:28] on the side
[17:29:52] ottomata: you will notice the following switches in git-dch -D --force-distribution
[17:30:03] ottomata: these are only present for newer versions of git-buildpackage
[17:30:28] ottomata: you may get a version which has those features here http://garage-coding.com/releases/git-buildpackage/git-buildpackage_0.6.0~git20130414_all.deb
[17:31:22] ottomata: I've built it from the official repo (located here git://anonscm.debian.org/users/agx/git-buildpackage.git )
[17:33:15] drdee: I got the pageview class working in a scala interpreter which is pretty nice for debugging, in case you are interested
[17:50:44] drdee: Andrew and Faidon reviewed it. There are multiple things that need to happen. A gerrit patchset will follow fixing the problems.
[17:50:51] In the meantime I'm on #738
[17:51:34] (I'll work on a patchset for #716 after I close #738)
[18:35:38] average: awesome!
[18:35:44] thanks for all your hard work
[18:35:49] much appreciated!!
[18:42:17] ottomata milimetric drdee : I figured out how to use the scala interpreter for testing java UDF logic. I haven't gotten it to work for any kraken java code (CLASSPATH issues), but it works for kraken-generic at least: https://gist.github.com/embr/11dc7af47d6356e48afe
[18:42:36] thanks erosen! very cool!!
[18:42:43] np
[18:42:46] i love interpreters
[18:42:49] cool!
[18:43:04] ah very nice
[18:43:08] tab complete yay
[18:43:23] yeah, was pretty excited to discover that
[19:35:17] drdee: thanks
[19:35:26] bb 1h
[19:35:38] k
[19:40:34] drdee, i'm glancing at our hadoop confs, checking things out
[19:40:44] by default we do not have mapreduce output compression turned on
[19:40:45] k
[19:40:56] should we turn that on?
[19:41:36] maybe :)
[19:42:03] intermediate compression is on
[19:42:13] that's good
[20:31:14] drdee: milimetric rounce123: have time to meet with dan?
[20:31:21] i'm in the hangout
[20:31:25] yes
[20:31:27] coming now
[22:04:21] erosen: how can I help?
[22:29:41] milimetric: hey, was talking to liAnna
[22:32:56] hi
[22:33:03] yeah, i'm gonna be helping her out i think
[22:33:10] she's having trouble with the current umapi i heard
[22:33:36] yeah
[22:33:39] i can tell you more about it
[22:33:44] as I looked into it with her briefly
[22:34:05] this is the query she is using as a test: https://metrics.wikimedia.org/cohorts/wep_uscan_spring2013/bytes_added?start=20130101&end=20130601&group=input&aggregator=sum&project=enwiki
[22:34:31] and the results she is getting are much lower than she would expect
[22:34:47] i have a saved json output of the thing she thinks is wrong
[22:34:53] and I was attempting to replicate on stat1
[22:35:00] but I couldn't remember what port it is running on
[22:35:25] milimetric: I just sent you the result from metrics.wikimedia.org
[22:36:34] (the results she thinks are wrong)
[22:40:15] cool, thanks erosen
[22:40:27] i gotta run some errands now but I'll take a look soon
[22:40:31] cool
[22:40:45] if she asks, I'll get to it first thing tomorrow at the latest
[22:40:52] cool
[22:41:31] we also have userstats as a fallback
[23:16:02] hey guys
[23:16:18] how many page views do we get across all projects in a day, on average? (rough order of magnitude ok)
[23:16:55] nevermind, found it on reportcard.wmflabs.org
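
Note on the 16:12 to 16:14 exchange: the rule stated there is that a request whose referer and URL refer to the same article should not be counted as a pageview, which is why the call to refersToSameArticle is negated. The Java below is only a rough sketch of that rule, not the kraken code. The class name, the articleKey helper, and the two-argument refersToSameArticle are all hypothetical (the real method takes pageviewType, url, pageviewTypeReferer, referer), and treating "host plus /wiki/ path" as the article identity is an assumption made for illustration.

```java
import java.net.URI;

public class RefererCheckSketch {

    // Hypothetical helper: reduce a URL to "host/wiki/Title", or return null
    // when the input does not look like an article URL (assumption: articles
    // live under /wiki/).
    static String articleKey(String url) {
        if (url == null) {
            return null;
        }
        try {
            URI uri = URI.create(url);
            String host = uri.getHost();
            String path = uri.getPath();
            if (host == null || path == null || !path.startsWith("/wiki/")) {
                return null;
            }
            return host.toLowerCase() + path;
        } catch (IllegalArgumentException e) {
            return null; // malformed URL
        }
    }

    // Simplified two-argument stand-in for the method discussed in the log.
    static boolean refersToSameArticle(String url, String referer) {
        String urlKey = articleKey(url);
        return urlKey != null && urlKey.equals(articleKey(referer));
    }

    // The negation matters: a request only counts when the referer does NOT
    // point at the same article as the requested URL.
    static boolean countsAsPageview(String url, String referer) {
        return articleKey(url) != null && !refersToSameArticle(url, referer);
    }

    public static void main(String[] args) {
        String foo = "http://en.wikipedia.org/wiki/Foo";
        String bar = "http://en.wikipedia.org/wiki/Bar";
        System.out.println("different referer: " + countsAsPageview(foo, bar));
        System.out.println("same referer:      " + countsAsPageview(foo, foo));
        System.out.println("no referer (-):    " + countsAsPageview(foo, "-"));
    }
}
```

Dropping the "not", as first suggested at 16:12:48, would invert this and count only self-referred requests.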