[08:11:06] Analytics-Tech-community-metrics, Engineering-Community, ECT-July-2015: Automated generation of repositories for Korma - https://phabricator.wikimedia.org/T104845#1451340 (Dicortazar) Hi, I've started to work on this. In first place I've modified the code of Automator to accept external config files w...
[14:57:08] Analytics-Cluster, operations, ops-eqiad: rack new hadoop worker nodes - https://phabricator.wikimedia.org/T104463#1451836 (Ottomata) Heya Chris, any updates?
[15:27:15] Analytics-Cluster, operations, ops-eqiad: rack new hadoop worker nodes - https://phabricator.wikimedia.org/T104463#1451927 (Cmjohnson) 3 of 4 are racked in row D and connected to mgmt. They only need re-install. The 4th one was broken and I didn't get to replace it before I left on vacation. Dell has...
[16:05:35] ottomata, the pageview counts by page name without k-anonymization for 1 day is 7.7GB. It's in stat1002:/home/mforns/mexico/page_view_counts_by_page_name.tsv
[16:08:43] that is much larger than we though!
[16:08:46] thought!
[16:10:05] yea, I've added the same thing with k=100, and will add also k=10
[16:14:58] oh where?
[16:15:02] oh i see 10
[16:15:03] 100
[16:15:08] mforns: ^
[16:15:36] ottomata, yes, I'm executing the query for k=10 now, will ping you when done
[16:15:40] oh k
[16:20:01] mforns: i'm going to compress these files, sok?
[16:20:10] ottomata, sure!
[16:20:21] the k=10 file is already there
[16:20:25] ottomata, ^
[17:16:53] joal|mexico, is there any chance that you have a distribution of revision sizes from enwiki handy?
[17:17:04] Or maybe some metadata that would make that easy to generate?
[17:19:45] halfak: best way to grab info : the worklog I wrote a month ago
[17:19:59] * halfak digs for that
[17:21:56] Analytics-Kanban, Research-and-Data: Pipeline from Research to productization - https://phabricator.wikimedia.org/T105815#1452179 (ggellerman) NEW a:DarTar
[21:16:02] exit
[21:16:04] oops
[21:42:37] joal|mexico,
[21:42:37] mforns: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+WindowingAndAnalytics
[21:42:40] joal|mexico,
[21:42:42] joal|mexico,
[21:42:47] halfak: ?
[21:42:48] halfak: ?
[21:42:50] halfak: ?
[21:45:58] org.wikimedia.wikihadoop.job.MediaWikiDumpXMLToJSON
[21:46:03] halfak: --^