[11:38:38] hi a-team!
[11:43:13] hey marcel
[11:43:33] I'm not really awake yet, but didn't wanna leave you hangin :)
[11:43:41] good morning milimetric! :]
[11:44:03] i thought elukey was here :]
[11:44:36] mforns: he's back tomorrow
[11:49:15] moritzm, oh! thanks
[14:13:08] (PS24) Mforns: [WIP] Refactor Mediawiki History scala code [analytics/refinery/source] - https://gerrit.wikimedia.org/r/301837 (https://phabricator.wikimedia.org/T141548) (owner: Joal)
[14:29:55] (CR) Ottomata: "- Let's default to using underscores in filenames that are not executables, unless there is a reason to do so. E.g. create_project_namesp" (2 comments) [analytics/refinery] - https://gerrit.wikimedia.org/r/306292 (https://phabricator.wikimedia.org/T141476) (owner: Milimetric)
[14:30:43] (CR) Ottomata: Script sqooping mediawiki tables into hdfs (1 comment) [analytics/refinery] - https://gerrit.wikimedia.org/r/306292 (https://phabricator.wikimedia.org/T141476) (owner: Milimetric)
[14:43:45] mforns: re:vital signs, pageviews are working fine cc Deskana (it is the editing metrics that we are going to remove)
[14:44:37] milimetric: let's talk about EL after standup. there are couple things it needs like throttling
[14:47:49] k, cool
[14:49:32] Analytics: Stop vital signs metric creation on wikimetrics - https://phabricator.wikimedia.org/T143715#2579091 (Nuria)
[15:00:32] ottomata: standddupppp
[15:01:48] Analytics-Kanban, Analytics-Wikimetrics: Can't run reports on Wikimetrics - https://phabricator.wikimedia.org/T143399#2579109 (mforns) a:mforns>Nuria
[15:18:17] Analytics: Revamp Eventlogging so anyone can use it - https://phabricator.wikimedia.org/T143794#2579160 (Nuria)
[15:27:15] Analytics: Revamp Eventlogging so anyone can use it - https://phabricator.wikimedia.org/T143794#2579160 (Ottomata) Anyone can use our installation of it? Or anyone can install and use their own installation of it?
[15:49:05] mforns: you wanna chat in the batcave for a bit, about next work?
[15:49:14] milimetric, sure
[15:49:16] omw
[16:35:18] hello people :)
[16:44:42] I checked the AQS alarms and I feel really sad
[16:44:48] latencies went up again with traffic
[16:44:52] :/
[16:48:14] plus https://grafana.wikimedia.org/dashboard/db/aqs-cassandra-compaction looks a bit weird for the 5th month
[16:48:45] all right, will start looking into my dear AQS tomorrow morning
[16:49:03] probably I worry too much :D
[16:49:20] thanks a lot for having loaded the new cluster!
[16:55:34] hi elukey! don't worry about the cassandra compaction, we've been naughty and loaded new months before the old ones finished compacting
[16:55:49] to test, and it looks good for now
[16:57:37] hi Luca :) We're excited to have you back but please don't worry until tomorrow
[16:59:14] o/
[16:59:21] sure sure I worry too much :)
[17:03:42] mforns: will ping you tomorrow for the details! :P
[17:05:00] ok
[17:10:30] going afk, talk with you tomorrow!!
[17:42:39] elukey: do not worry, i can explain the compaction tomorrow
[17:42:52] elukey: i am .. *ahem* doing some experiments .. muahahahaha
[20:12:35] Analytics: ResearchSpike: Pivot UI: react, playwood - https://phabricator.wikimedia.org/T143828#2580389 (Nuria)
[20:14:22] Analytics-Kanban, Analytics-Wikimetrics: Can't run reports on Wikimetrics - https://phabricator.wikimedia.org/T143399#2567093 (Nuria) Open>Resolved
[20:14:41] Analytics-Kanban, Patch-For-Review: Extract edit oriented data from MySQL for simplewiki - https://phabricator.wikimedia.org/T134790#2580410 (Nuria)
[20:14:42] Analytics-Kanban, EventBus, MW-1.28-release-notes, Patch-For-Review, WMF-deploy-2016-08-16_(1.28.0-wmf.15): Propose evolution of Mediawiki EventBus schemas to match needed data for Analytics need - https://phabricator.wikimedia.org/T134502#2580408 (Nuria) Open>Resolved
[20:24:18] nuria_: ClickHouse looks pretty amazing
[20:24:18] "It also offers an opportunity to use external dictionaries, dimension tables loaded from an external source, for seamless joins."
[20:24:34] milimetric: ya, joal was really wanting to try it
[20:24:52] using that ^ we can load once and never re-load unless we change the history reconstruction algorithm
[20:25:05] the spike with Druid is about finding out if that's even possible
[20:25:07] milimetric: that is why on the goals might need to say "test whether data can be efficiently loaded and query in a columnary datastore"
[20:25:28] big data OLAP would be even more generic
[20:28:03] hahaha, the code comments are in russian. sold
[20:29:28] milimetric: man ...
[20:58:05] nuria_: "Pivot can connect to multiple clusters, also Postgres and MySQL" is a recent improvement
[20:58:15] a bit of a downside is it uses TypeScript
[21:09:38] milimetric, hey! do you know where the namespaces CSV file is in hadoop?
[21:09:54] yes, one sec
[21:09:57] but you should be sleeping!
[21:10:09] xD
[21:10:34] mforns: /user/milimetric/mediawiki/namespace_db_mapping/namespace.dictionary.csv
[21:10:40] thanks milimetric!
[21:10:41] that's the latest with the db names in it
[21:10:44] ok
[21:11:16] the schema is here: https://gerrit.wikimedia.org/r/#/c/306292/1/hive/mediawiki/edit-history/create-project-namespace-map-table.hql
[21:28:52] nuria_: yeah, so pivot already supports loading files and querying them
[21:29:03] it's not super elegant, and it loads the whole file in memory
[21:30:20] so we could send a pull request with a proper file query module, but it would be very simple work from what I can tell. It's just a matter of transforming the file and maybe querying it efficiently without loading it in memory
[21:46:35] wow... pivot looks like about 80k lines of code
[22:38:59] milimetric: ok, let's talk tomorrow about whether that looks like something we want to tackle