[00:15:59] Phabricator, Analytics-Tech-community-metrics: SQL user/grant for phabricator statistics script - https://phabricator.wikimedia.org/T78311#941010 (Springle) phstats now has access to both phabricator_maniphest and phabricator_user.
[01:49:22] (PS4) Nuria: [WIP] Mobile apps oozie jobs [analytics/refinery] - https://gerrit.wikimedia.org/r/181017
[03:12:44] (PS5) Nuria: [WIP] Mobile apps oozie jobs [analytics/refinery] - https://gerrit.wikimedia.org/r/181017
[13:02:32] Engineering-Community, Analytics-Tech-community-metrics, Phabricator: Monthly report of total / active Phabricator users - https://phabricator.wikimedia.org/T1003#941542 (Dzahn)
[13:02:35] Analytics-Tech-community-metrics, Phabricator: SQL user/grant for phabricator statistics script - https://phabricator.wikimedia.org/T78311#941540 (Dzahn) Open>Resolved thanks :)
[13:39:58] Engineering-Community, Analytics-Tech-community-metrics, Phabricator: Monthly report of total / active Phabricator users - https://phabricator.wikimedia.org/T1003#941556 (Dzahn) Quim said: ( for some reason I can't hit "quote" on his comment) >Is it possible to add these to the report? >Number of accounts cr...
[13:42:43] Analytics-Tech-community-metrics, Phabricator: Metrics for key Wikimedia projects software in Maniphest - https://phabricator.wikimedia.org/T28#941559 (Dzahn)
[13:42:46] Engineering-Community, Analytics-Tech-community-metrics, Phabricator: Monthly report of total / active Phabricator users - https://phabricator.wikimedia.org/T1003#941557 (Dzahn) Open>Resolved resolving per: "This task can be closed as soon as we can publish monthly metrics for total amount of users and...
[13:43:25] Engineering-Community, Analytics-Tech-community-metrics, Phabricator: Monthly report of total / active Phabricator users - https://phabricator.wikimedia.org/T1003#941560 (Dzahn) a:Dzahn>None
[14:00:02] Engineering-Community, Analytics-Tech-community-metrics, Phabricator: Monthly report of total / active Phabricator users - https://phabricator.wikimedia.org/T1003#941592 (Aklapper) ♥! Thanks Dzahn!
[14:15:44] we salt the EventLogging IPs, right?
[14:17:32] I mean, I'm hoping not
[14:17:35] but it seems improbable ;p
[14:33:49] Ironholds: yes, they are salted.
[14:33:50] https://git.wikimedia.org/blob/mediawiki%2Fextensions%2FEventLogging/9522a747e665ef8d5229ccfb71363f7a1aff9597/server%2Feventlogging%2Fparse.py#L111
[14:34:59] neat!
[14:35:05] ...any chance I could get my hands on the key? :P
[14:58:38] (CR) OliverKeyes: [WIP] UDF for classifying pageviews according to https://meta.wikimedia.org/wiki/Research:Page_view/Generalised_filters (1 comment) [analytics/refinery/source] - https://gerrit.wikimedia.org/r/180023 (owner: Ottomata)
[15:00:14] (CR) OliverKeyes: [WIP] UDF for classifying pageviews according to https://meta.wikimedia.org/wiki/Research:Page_view/Generalised_filters (1 comment) [analytics/refinery/source] - https://gerrit.wikimedia.org/r/180023 (owner: Ottomata)
[15:06:49] (PS19) Ottomata: [WIP] UDF for classifying pageviews according to https://meta.wikimedia.org/wiki/Research:Page_view/Generalised_filters [analytics/refinery/source] - https://gerrit.wikimedia.org/r/180023
[15:07:32] ottomata, is the #19 change just the tolower() moving into Java?
[15:08:14] no, just a comment change
[15:08:18] ahh
[15:09:11] ah you think we should lower in java?
[15:09:30] ok
[15:10:53] Ironholds: is there any worry with what toLowerCase mentions here? http://docs.oracle.com/javase/7/docs/api/java/lang/String.html#toLowerCase()
[15:11:20] uri hosts are not ever in weird locales, i assume?
[15:12:56] I mean, they shouldn't be?
[15:13:06] k
[15:13:10] just double checking
[15:13:15] hm, what about other fields?
[15:13:18] uriPath, for example?
[15:13:21] Other fields, yes :/
[15:13:24] particularly path
[15:13:35] content type?
[15:13:45] * Ironholds thinks
[15:13:49] no, I've always seen that be sane
[15:13:56] generally speaking path, query and userAgent are the bugbears
[15:14:12] you want to lowercase userAgent?
[15:21:13] kevinator, milimetric, when do you want to try to install vagrant?
[15:21:34] milimetric, I suppose you'd like to deploy annotations first
[15:21:59] I have it installed, should I delete it and download it again?
[15:22:41] mforns: before the standup I did a vagrant git-pull and many "missing files" errors came up
[15:23:45] kevinator, I don't know exactly, I think we should have a look in the batcave
[15:23:54] maybe after staff?
[15:24:01] ok, after stagg
[15:24:04] er staff
[15:24:15] although, milimetric wants to deploy annotations today...
[15:24:25] so maybe we should prioritize that?
[15:24:34] what do you thin
[15:24:37] think?
[15:25:26] yes, deploying annotations first is more better
[15:25:44] ottomata, oh, no, we should leave that alone
[15:25:52] (sorry, got distracted by looking in horror at my emails)
[15:26:46] Ironholds:
[15:26:54] which fields should be lower cased?
[15:28:08] just host, I think? As your patch had
[15:29:15] ah
[15:29:41] ah, Ironholds, no path?
[15:29:46] Argh. I have been trying to join the batcave for 5 mins.
[15:29:46] you are giving me mixed messages!
[15:29:52] no path!
[15:30:05] It's hanging at "Trying to join the call. Please wait ...." can others join?
[15:30:07] well you kept asking what fields contained weird encoding so I assumed you wanted them for something! :D
[15:30:09] Ironholds: Other fields, yes
[15:30:09] Ironholds: particularly path
[15:30:26] I don't think the others need lower-casing, naw
[15:30:28] kevinator: Can you join the hangout?
[15:30:30] oh, no i was asking what other fields should be to-lowered
[15:30:30] ok
[15:30:31] in fact, some are probably case-sensitive
[15:31:40] qchris_meeting: we're all in, I invited you again
[15:31:41] does that work?
[15:31:41] qchris_meeting: yes, I just joined
[15:37:15] (PS20) Ottomata: [WIP] UDF for classifying pageviews according to https://meta.wikimedia.org/wiki/Research:Page_view/Generalised_filters [analytics/refinery/source] - https://gerrit.wikimedia.org/r/180023
[16:20:55] Analytics-Cluster, Ops-Access-Requests: Access to Hadoop Cluster for Ananth Ramakrishnan (new contractor - https://phabricator.wikimedia.org/T85229#941898 (Ottomata) NEW a:Ottomata
[16:21:01] Analytics-Cluster, Ops-Access-Requests: Access to Hadoop Cluster for Ananth Ramakrishnan (new contractor) - https://phabricator.wikimedia.org/T85229#941898 (Ottomata)
[16:33:11] (PS1) Mforns: Add annotations to graphs [analytics/dashiki] - https://gerrit.wikimedia.org/r/181591
[16:39:02] (CR) Mforns: [C: -1] "Still WIP" (1 comment) [analytics/dashiki] - https://gerrit.wikimedia.org/r/181591 (owner: Mforns)
[16:49:08] hey nuria 1:1?
[16:56:41] tnegrin: SORRY
[16:56:46] np
[16:57:00] tnegrin: getting there, with kids out of school I am losing track of time
[16:57:12] heh -- I feel your pain
[16:57:24] tnegrin: on hangout
[17:08:15] mforns_lunch: sorry I couldn't get to the dashiki review
[17:08:18] let's do it when I get back
[17:08:22] i gotta run to a flight
[17:08:32] happy holidays! bybybybye
[17:12:37] milimetric: happy holidays!
[17:52:47] Analytics-Dashiki: Failure to retrieve a metric json file should not break the UI - https://phabricator.wikimedia.org/T85233#942020 (Nuria) NEW
[18:04:04] (PS1) Nuria: Fix reference to responsive.css file in index.html [analytics/dashiki] - https://gerrit.wikimedia.org/r/181599
[18:26:57] (CR) Nuria: "Most of my comments deal with code loading. I think a small refactor is in order to make sure that annotation component is loaded lazily " (4 comments) [analytics/dashiki] - https://gerrit.wikimedia.org/r/181591 (owner: Mforns)
[18:28:10] (CR) Nuria: "Please merge this change and rebase to fix 404s with responsive.css: https://gerrit.wikimedia.org/r/181599" (2 comments) [analytics/dashiki] - https://gerrit.wikimedia.org/r/181424 (owner: Mforns)
[18:50:00] (CR) Mforns: Add Annotations API (1 comment) [analytics/dashiki] - https://gerrit.wikimedia.org/r/181424 (owner: Mforns)
[18:50:07] (PS2) Mforns: Add Annotations API [analytics/dashiki] - https://gerrit.wikimedia.org/r/181424
[19:13:39] warning
[19:13:45] I'm using hive and I have a question
[19:14:46] my computer fan is on a lot -- could this be connected?
[19:23:51] tnegrin: Typically, those two things would not be connected.
[19:24:13] I suppose you're running hive through ssh to stat1002?
[19:24:21] hah -- got you! I actually have a question about joins :)
[19:24:46] yes -- I'm on stat1002
[19:25:02] Then your fan and hive usage should not be connected.
[19:25:04] I'm trying to join two tables and I can't get the syntax right
[19:25:11] I know -- I was just being silly
[19:25:27] I never tried joins in hive, but let's figure it out together.
[19:25:50] hello from da buuuus
[19:25:51] thanks qchris -- I think it's pretty straightforward, it's just been a while
[19:25:55] ohai
[19:25:57] What parts are you trying to join?
[19:26:06] ottomata: Hello ottomata on the bus :-D
[19:26:27] ellery has made 2 tables -- ellery.oozie_mc and ellery.en_revisions
[19:26:46] one with page views, one with editors and pages they have edited
[19:27:15] I'd like to create a dataset with the users joined with the stats about the pages they have edited
[19:27:34] here's the instructions: prev (referer title), curr (current page title), n (count)
[19:27:34] The table ellery.en_revisions contains a sample of 2000 users and the pages they have edited. The schema is:
[19:27:35] user_id, page_id, page_title
[19:27:36] The two tables can be joined on page_title = curr.
[19:28:55] um -- do we have a local gist?
[19:29:01] yup
[19:29:07] https://phabricator.wikimedia.org/paste/create/
[19:29:40] thanks
[19:29:48] so here's the query I'm trying https://phabricator.wikimedia.org/P177
[19:30:15] I'm running it via hive -f