[09:09:49] (PS51) Nuria: [WIP] Run recurring reports using the scheduler [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/112165 (owner: Milimetric)
[09:26:13] (PS7) Nuria: Adding coding guidelines to README.md file [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/120542
[09:29:15] (CR) Nuria: Adding coding guidelines to README.md file (5 comments) [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/120542 (owner: Nuria)
[10:13:49] (PS52) Nuria: Add ability to create recurrent reports via UI. [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/112165 (owner: Milimetric)
[10:14:30] (PS2) Nuria: Added coalesced recurrent reports [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/122638 (owner: Csalvia)
[13:46:37] mornin qchris!
[13:46:42] Morning, sir!
[13:46:49] did you mean to have two jobs called camus-webrequest-import-* running at once?
[13:46:50] :)
[13:46:55] Yes.
[13:46:58] ok!
[13:47:01] Is that too much?
[13:47:06] naw its fine
[13:47:17] just thought maybe it was an accident, since the second one is called -try4
[13:47:25] i was about to restart some stuff on an12
[13:47:28] The first one is for a directory, where there is already some data in.
[13:47:33] ok cool
[13:47:35] The second one is for a fresh directory.
[13:49:54] (CR) QChris: "Code looks good. Just nit-picking in commit message." (2 comments) [analytics/wikistats] - https://gerrit.wikimedia.org/r/123603 (owner: Stefan.petrea)
[14:38:03] (CR) QChris: [C: -1] "Considering this a WIP commit, as per your email," [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/120542 (owner: Nuria)
[14:47:07] oo hey qchris
[14:47:11] so, i've been testing camus stuff
[14:47:13] using the test topic
[14:47:17] instead of webrequest mobile
[14:47:26] which means you can test using much less data :)
[14:47:28] want to know how?
[14:48:10] also, i just restarted some things on an12, so the task attempts that were running there just got killed
[14:48:20] Sure tell me :-D
[14:48:22] Whaaaaat?
[14:48:33] That means I have to restart my tests. Hahahaha.
[14:49:15] Ok. I'll restart them afterwards.
[14:49:17] ha, nope
[14:49:17] you don't
[14:49:24] those attempts shoudl get reassigned, right?
[14:49:32] Yes. They should.
[14:49:38] i hadn't restarted the hadoop services since I fixed that IPs
[14:49:43] that's why jmx ganglia stuff wasn't working
[14:49:48] But the whole point of testing was that they go through on analytics1012 right away :-)
[14:49:56] aye ok, so
[14:50:12] No worries. I'll have to leave for a few hours anyways.
[14:50:20] so, i'm using Snaps' kafkacat to pipe from webrequest_mobile | test
[14:50:23] I'll ping you afterwards to see if it's safe to restart testing.
[14:50:25] on an26
[14:50:44] https://github.com/edenhill/kafkacat
[14:50:47] kafkacat ... Cool.
[14:51:22] if you wann
[14:51:24] Gonna have to look at that.
[14:51:46] /home/otto/bin/kafkacat
[14:51:50] i've got it compiled and just put there
[14:51:56] i'll make a package for it one day...
[14:51:56] :)
[14:52:04] its there on an26
[14:52:51] kafkacat -b analytics1021.eqiad.wmnet -C -t webrequest_mobile -p 1 | head kafkacat -b analytics1021.eqiad.wmnet -P -t test -z snappy
[14:52:54] :)
[14:52:54] \
[14:53:00] Missing broker ..
[14:53:01] Ah!
[14:53:02] Right.
[14:53:02] kafkacat -b analytics1021.eqiad.wmnet -C -t webrequest_mobile -p 1 | head | kafkacat -b analytics1021.eqiad.wmnet -P -t test -z snappy
[14:53:42] Cool!
[15:01:28] (PS8) Nuria: Adding coding guidelines to README.md file [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/120542
[15:01:46] whoop, early adopter achievement unlocked!
[15:05:04] ooh, chsalvia?!
[15:05:09] I wonder what the h stands for
[15:05:11] herbert?
[15:05:14] hugo?
[15:05:30] hector!
[15:05:42] hermione?!
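Editor's sketch of the corrected pipe posted at [14:53:02], for anyone who would rather script it than type it. It assumes kafkacat is on the PATH and uses only the flags quoted in the log (-b, -C, -t, -p, -P, -z). The helper names `build_pipeline` and `run_pipeline` are hypothetical, and `head -n 10` simply spells out the default of the bare `head` used above.

```python
import subprocess


def build_pipeline(broker, src_topic, dst_topic, partition=1, limit=10):
    """Return argv lists for: kafkacat consume | head | kafkacat produce.

    Hypothetical helper: it only assembles the commands quoted in the log.
    """
    consume = ["kafkacat", "-b", broker, "-C", "-t", src_topic, "-p", str(partition)]
    trim = ["head", "-n", str(limit)]
    produce = ["kafkacat", "-b", broker, "-P", "-t", dst_topic, "-z", "snappy"]
    return [consume, trim, produce]


def run_pipeline(stages):
    """Chain the stages with OS pipes, like `a | b | c` in a shell."""
    procs = []
    prev_stdout = None
    for argv in stages:
        proc = subprocess.Popen(argv, stdin=prev_stdout, stdout=subprocess.PIPE)
        if prev_stdout is not None:
            # Drop our copy so the upstream stage sees a closed pipe
            # once the downstream stage exits.
            prev_stdout.close()
        prev_stdout = proc.stdout
        procs.append(proc)
    output, _ = procs[-1].communicate()
    for proc in procs[:-1]:
        proc.wait()
    return output


if __name__ == "__main__":
    # Broker and topic names are the ones mentioned in the log.
    stages = build_pipeline("analytics1021.eqiad.wmnet", "webrequest_mobile", "test")
    run_pipeline(stages)
```

Keeping the command assembly separate from execution makes it possible to inspect the argv lists before anything is produced to a shared topic.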
[15:06:50] hellcat
[15:07:15] hey Snaps, what version would you say kafkacat is?
[15:07:16] :p
[15:07:18] 0.0.1?
[15:07:25] 0.1.0?
[15:07:41] * Snaps waits for another shift
[15:08:35] ottomata: 1.0.0? :)
[15:08:48] sure!
[15:08:54] Be bald, and all that
[15:08:57] ;)
[15:08:59] bald, right
[15:09:05] was thinking of buildling a package
[15:09:08] wanna tag it?
[15:09:11] cool!
[15:09:11] yeah
[15:09:18] danke
[15:13:17] ottomata: done
[15:13:30] (PS1) QChris: Remove outdated LOCs in README.md [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/124333
[15:14:24] (CR) QChris: "Not sure which part one should measure, but 1.5k seems" [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/124333 (owner: QChris)
[15:14:34] ottomata: it requires rdk 0.8.4 though, which has not been released yet (current master)
[15:15:24] (CR) Milimetric: [C: 2] "That makes sense. Though it's nice to remember that the codebase was once so small. We should aim to increase it only if absolutely nece" [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/124333 (owner: QChris)
[15:16:18] Hey ... milimetric you're too fast. I wanted to use that change in an email to you ... now it's merged and gone :-P
[15:16:44] :) sorry qchris
[15:17:01] if I don't see [WIP] or actual Work In Progress, I merge ruthlessly
[15:17:10] Hahaha :-D
[15:17:13] did you want to talk about it?
[15:17:30] Nono. It was about the dependency injection.
[15:17:43] You're email sounded like wikimetrics is still a small code base.
[15:17:57] It no longer is. And I wanted to point you to that commit ...
[15:18:00] But you were faster.
[15:18:15] hm, I agree it's not still 1.5k
[15:18:21] So no worries. Just merge away if I do not WIP a change.
[15:18:31] but I consider 0-10k small, 10k-100k medium and over that large
[15:18:42] Ja. It's ok.
[15:19:17] Have to leave for some hours. I'll rejoin afterwards.
[15:19:25] have a nice night
[15:19:28] ah ok Snaps
[15:19:31] when do you think 0.8.4 will be out?
[15:20:47] ottomata: I'm pretty much done with it, but since there are some big changes (C++ interface and configure script) I'd like some second eyes on stuff, the debian packaging in particular (faidon)
[15:20:53] ah ok
[15:20:55] hm, ok cool
[15:21:05] ottomata: did ./configure in kafkacat work without problems for you?
[15:22:13] yeah, worked like a charm
[15:22:33] I think kafaktee built statically is a good idea
[15:22:50] could we make it do that by default? or is that a bad idae?
[15:22:58] (or does it already do that by default?)
[15:24:59] ottomata: it doesnt, and I dont think it should because it gets all its features automagically from rdkafka through the config properties interface. so I think it should be an explicit static when desired. Okay?
[15:25:54] yeah that's fine, you are probably right
[15:26:21] sometimes its nice to be able to just copy utils like that around without needing build tools or root privs, ya know?
[15:26:31] but, yeah, I can always just build my own static version of it and use it
[15:26:42] probably won't really matter much
[15:27:03] ./bootstrap.sh builds a static version
[15:27:12] (or ./configure --enable-static)
[15:28:24] ayeyyy,e, yeah I used bootstrap, not ./configure,:)
[15:29:48] and I do agree, a static version is most handy. (albeit wrong from a debian perspective)
[15:42:13] yup
[17:49:00] (PS53) Csalvia: [WIP] Run recurring reports using the scheduler [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/112165 (owner: Milimetric)
[17:49:01] (PS3) Csalvia: Added tests for coalesced recurrent reports [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/122638
[17:49:03] (PS1) Csalvia: Added tests for coalesced public reports [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/124361
[18:15:58] so, the sampled requestlogs
[18:16:03] are those produced internally or provided to us by ops?
[18:40:33] (PS54) Csalvia: Add ability to create recurrent reports via UI. [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/112165 (owner: Milimetric)
[18:41:10] (PS55) Milimetric: Add ability to create recurrent reports via UI. [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/112165
[18:45:54] Ironholds: they come from udp2log
[18:45:59] via the varnishes themselves
[18:46:01] so umm, by ops
[18:51:32] (PS4) Csalvia: Added tests for coalesced recurrent reports [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/122638
[18:54:07] (PS2) Csalvia: Added tests for coalesced public reports [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/124361
[18:57:01] (PS5) Csalvia: Added tests for coalesced recurrent reports [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/122638
[18:58:13] (Abandoned) Csalvia: Added tests for coalesced public reports [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/124361 (owner: Csalvia)
[19:34:31] ottomata, aha, ta
[19:35:19] Ironholds: : https://wikitech.wikimedia.org/wiki/Analytics/Kraken/Overview#Current_Architecture
[19:35:27] uhhh
[19:35:31] that deals with kraken
[19:35:46] in that diagram, replace analytics1006/udp2log kafka producer
[19:35:51] with udp2log log box instances and filters :)
[19:35:58] + an rsync job over to stat1002
[19:36:17] haha
[19:36:19] i love this:
[19:36:20] "Analytics' spring 2013 deliverables requires running analysis on unsampled mobile webrequest log"
[19:36:24] har har har
[20:07:45] explain it to me like i am five years old: why do we need unsampled web logs ?:D
[20:10:30] hashar, I am not analyst, so many things I do not konw
[20:10:32] but here's one:
[20:10:35] funnel analysis
[20:11:05] and behavior analysis
[20:11:06] hm
[20:11:07] right?
[20:12:28] ottomata: yeah that sounds buzz word approved
[20:12:46] i.e. I have no clue but there must be a reason for us to do such analysis hehe
[20:16:38] well, those allow people to figure out how, for example, changes in UI make people behave
[20:16:43] really good for editor experiments stuff
[20:17:13] with sampled you can't do that, because then you can't track individual sessions at all
[20:19:32] No tracking of individual sessions. It's evil!
[20:23:27] haha
[20:23:30] just sayin!
[20:23:36] in theory that is why?
[20:28:33] I'm in a meeting but I have some comments in a little bit
[20:31:31] (CR) Nuria: "Besides the comments the UI side is still missing." (4 comments) [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/122638 (owner: Csalvia)
[20:41:38] anybody here aware of this?
[20:41:38] http://wikistats.wmflabs.org/
[20:46:58] Seeing the page, I recall that I have seen it before some time ago. But I know nothing about it.
[20:48:36] name collision!
[21:16:10] re: unsampled data -- here's something that I wrote a few months ago for the privacy policy.
[21:16:11] https://meta.wikimedia.org/wiki/Talk:Privacy_policy#WMF_Response_to_concerns_about_unsampled_data
[21:17:13] hashar ^ ahh, he's gone
[21:17:22] regarding qchris's points, I do not want to track users, but I think there are mechanisms we can use to get data that we do need without tracking
[21:17:37] ok -- I tried -- I can send him an email
[21:18:58] tnegrin: Antoine Musso
[21:19:15] thanks -- it was not obvious :)
[21:19:43] but everyone calls him hasher right?
[21:28:40] ottomata: I do need permissions to create a kafka topic. Right?
[21:28:55] (kafkacat allows me to produce to the test topic, but not test_qchris)
[21:29:07] % ERROR: Failed to produce message (734 bytes): Local: Unknown topic
[21:29:37] his irc nick is hashar
[21:29:50] yeah, you'd need perms to do that qchris
[21:30:09] qchris: can you just produce to test?
[21:30:11] Ok. Thanks.
[21:30:17] i'd prefer to keep the number of topics low
[21:30:19] Yes, I am doing that right now.
[21:30:32] But I did not want to get in the way of others, so I wanted to append my nick.
[21:30:52] "test" is fine for now.
[21:35:01] naw, it won't hurt anything, test topic is meant to be for whatever you want
[21:35:10] there's currently no way to delete topics in kafka
[21:35:14] so I do'nt want to just create them
[21:35:24] well, you can delete them
[21:35:31] but you have to shutdown kafka and do a buncha manual stuff
[21:36:00] (PS56) Milimetric: Add ability to create recurrent reports via UI [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/112165
[21:56:11] (CR) QChris: [C: -1] "Patch Set 8 has conflicts with master." (1 comment) [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/120542 (owner: Nuria)
[22:52:33] 0'9;i4~['po-{);o'p
[22:53:04] wtf, was tryin to clean my keyboard, dropped some coffee on it