[00:26:41] (CR) Aklapper: [C: 1] "Looks good to me" [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/106636 (owner: Gerrit Patch Uploader) [00:37:01] (CR) Milimetric: [C: 2 V: 2] Link to correct Bugzilla bug entry form [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/106636 (owner: Gerrit Patch Uploader) [12:15:19] heya analytics. may be ooold news for you, but making sure, you saw ezache wired.com article, right [14:19:18] mutante: Thanks :-) Actually not too old news. The link flew by on the analytics lists around end of december as [14:19:24] "Meet the Stats Master Making Sense of Wikipedia’s Massive Data Trove" [14:19:30] What a nice subject :-) [14:20:02] * qchris hails ezachte :-) [14:21:24] In case someone is missing the link: [14:21:28] http://www.wired.com/wiredenterprise/2013/12/erik-zachte-wikistats/ [15:05:00] yes, that one:) [16:40:11] heyyy milimetric [16:40:17] select avg(time_firstbyte) from webrequest_mobile where http_status between '200' and '299' and year=2014 and month=01 and day=09; [16:40:21] 0.04311783260745881 [16:40:21] yay! [16:40:49] woo! [16:40:52] awesome ottomata [16:41:01] so how do we start hive now? [16:41:27] i just emailed [16:41:28] internal [16:41:51] i'm going to start importing via compression today i think, so we'll lose all the previous data [16:41:54] well, we won't lose it [16:41:57] but it will be in an old table [16:42:00] and not updated anymore [16:42:05] but, for now, it is still there [16:46:31] cool, that works for me [16:49:31] when you say compression, do you mean kafka compression? [16:49:36] if so, there are important fixes in master [16:50:16] (CR) Nuria: [V: 1] "+1" [analytics/limn-mobile-data] - https://gerrit.wikimedia.org/r/105851 (owner: Jdlrobson) [16:56:35] Snaps: naw [16:56:38] compression in hadoop [16:56:54] right now it is not compressed, i recently added support to camus to write snappy compressed seq files [16:57:02] sequence files [16:57:15] is a hadoop format for saving compressed files in a manner that can still be split by mappers [16:57:29] https://github.com/linkedin/camus/blob/master/camus-etl-kafka/src/main/java/com/linkedin/camus/etl/kafka/common/SequenceFileRecordWriterProvider.java?source=cc [17:00:27] ok, running to grocery store, making lunch [18:00:52] https://gerrit.wikimedia.org/r/106738 [18:25:15] ottomata, lemme know if you need me to ping the JVM thread on ops@ [18:26:15] hmm, feel free to if you want, especially if you have anything to add, we could wait a couple of days too [18:27:01] I'll wait til next week and will ping if it's getting lost :) [18:28:04] also, totally unrelatedly, would be good to have your further input on the hub/green spaces/csi thing :) [18:28:17] also feel free to process nuria's change in any way, as usual unsure about "waiting periods" [18:28:32] (private data access but existing account) [18:28:56] would like to leave it to analytics decision [18:42:41] (CR) Ori.livneh: [C: 2] Story 1481: Collect graph data only for current month [analytics/limn-mobile-data] - https://gerrit.wikimedia.org/r/105851 (owner: Jdlrobson) [18:48:11] [travis-ci] develop/0dd5a9e (#150 by nuria): The build has errored. http://travis-ci.org/wikimedia/limn/builds/16739665 [18:48:30] (CR) Milimetric: [C: 2 V: 2] "Leeeerooy!" [analytics/limn-mobile-data] - https://gerrit.wikimedia.org/r/105851 (owner: Jdlrobson) [18:50:36] Leeroy ? :) [18:51:14] oh I see [19:00:43] hahaha [19:00:50] yeah, I was just like, man this just gotta get merged [19:01:05] I assume you're familiar with http://www.youtube.com/watch?v=LkCNJRfSZBU [19:10:02] Eloquence: is there another thread? [19:10:06] i'm on one about NYC hub [19:10:24] oh sorry [19:11:00] I responded to that thread yesterday morning, anything else I should add? [19:24:15] ottomata, sorry missed your response :) [19:26:43] no probs [19:26:46] yeah, i think we should try it! [19:48:35] Hey ori. I heard you were talking to nuria about user agent and what it adds to identification. I was wondering if I could get your perspective. [19:49:43] Heya Ironholds, where was that wiki page you made documenting your experiences using hadoop + hive? [19:50:02] ottomata, https://www.mediawiki.org/wiki/Analytics/Kraken/Researcher_analysis I believe [19:50:11] it's kinda junky but will hopefully be useful :) [19:50:11] thanks! [19:50:24] np! It was fun to write [19:50:35] yea its cool, i'm going to write some more hive documentation [19:50:37] will use this for some reference [19:50:40] thanks bunches [20:23:41] hey halfak, I'm around now if you want to chat (re-emerging from PV-land) [20:24:29] ottomata, Ironholds: please link whatever you write for hive from here: https://office.wikimedia.org/wiki/Data_access [20:24:50] ottomata: Andrew ? how do I create a repo ? [20:24:51] ooo ok thanks DarTar [20:24:56] average, what kinda repo? [20:25:14] ottomata: for dclass, they deleted it, I need to create it again [20:25:37] who is they? [20:25:48] did I delete it? [20:26:04] ottomata: no you did not [20:26:10] but we will fix this if I can get a repo [20:26:16] you want analytics/dclass? [20:26:17] I also had to commit something more to the repo [20:26:23] ottomata: yes please [20:27:00] https://gerrit.wikimedia.org/r/#/admin/projects/analytics/dclass [20:29:42] ok, thank you [21:29:46] (PS1) Milimetric: For Bug 59843 [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/106775 [21:31:16] (CR) Milimetric: [C: 2 V: 2] add contact information [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/106634 (owner: Gerrit Patch Uploader) [22:11:55] have good weekend everybody!!! [22:13:11] aw have a good one otto! [22:15:13] nite everyone, good weekend