[04:20:32] (PS3) Milimetric: Add pretty symlink for WikimetricsBot [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/143040 (https://bugzilla.wikimedia.org/66087)
[04:22:09] (CR) Milimetric: Add pretty symlink for WikimetricsBot (2 comments) [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/143040 (https://bugzilla.wikimedia.org/66087) (owner: Milimetric)
[04:22:12] (PS4) Milimetric: Add pretty symlink for WikimetricsBot [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/143040 (https://bugzilla.wikimedia.org/66087)
[04:32:49] (CR) Milimetric: [C: -1] "Cool, I like in general. A couple of pointers and some nitpicks." (7 comments) [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/142514 (owner: Milimetric)
[08:21:05] (CR) QChris: [C: 2 V: 2] Ignore temporary vim files [analytics/refinery] - https://gerrit.wikimedia.org/r/143485 (owner: Milimetric)
[09:29:52] qchris: Gerrit Code Review 2.8.1-4-ga1048ce ready !! Thank you very much
[09:30:06] Wooohooo! \o/
[09:31:02] (And that war also sneaks in the fix for the stream-events that we did, but never got deployed)
[09:32:22] that works on integration-dev.eqiad.wmflabs at least
[09:32:23] \O/
[09:32:34] OpenStack has been hit by that bug as well -:(
[09:32:47] So glad to finally see a chance of closing that bug.
[09:33:59] I have hunted for it a bit
[09:34:09] I am very happy Krinkle found the bug report
[09:34:17] Yup. Me too.
[09:34:57] I tried replicating the bug for several days when it first got reported, but failed consistently (although trying several different jdks)
[09:35:06] Now I know why I failed to reproduce :-)
[09:38:24] my contact at openstack-infra (pelix) asked: any idea as to why the bug in mina-sshd wouldn't be ticked if using a host key generated with openssl?
[09:38:28] who knows :-]
[09:38:34] mina sshd patch fix it hehe
[09:39:40] An OpenSSL generated key would not have fixed it.
[09:39:41] But!
[09:40:07] Extracting the (gerrit generated, or generated by whatever means) key and using BouncyCastle
[09:40:22] from within gerrit to serve it to the user would have fixed it.
[09:40:36] Because the error is in mina-sshd's DH implementation.
[09:40:54] And using BouncyCastle from within gerrit, would not use that code.
[09:41:31] And the error is only triggered for a certain kind of JDKs, because they changed how they represent keys.
[09:41:41] (stripping vs. no stripping of leading zeros)
[09:41:51] ohhh
[09:42:03] sounds like a snowball of tiny bugs
[09:42:07] Yup.
[09:42:11] would you mind copy pasting the above explanation to https://bugzilla.wikimedia.org/show_bug.cgi?id=53895 ?
[09:42:20] Openstack folks are lurking the bug, I feel too lazy to copy paste it :)]
[09:42:31] Ok. Will do.
[09:42:33] (afraid of adding nonsense in between and thus forcing you to correct me later on hehe)
[09:42:36] But first: lunch :-D
[09:42:50] it is not even noon! :-D
[09:42:53] have a good lunch
[09:43:06] and thank you for your help with Gerrit related issues. It is much appreciated
[09:43:23] especially since I have exactly no clue how it works and i am java illiterate
[09:43:27] hashar: Noon or not ... my wife is starving since 11:00 ... and my wife is the boss :-D
[09:43:38] definitely
[09:43:40] same deal here
[09:43:51] ;-)
[13:11:59] (CR) Nuria: Fix wiki cohort display for report cohorts (4 comments) [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/142514 (owner: Milimetric)
[13:13:23] (CR) Milimetric: Fix wiki cohort display for report cohorts (1 comment) [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/142514 (owner: Milimetric)
[13:13:34] * Ironholds awakes
[13:41:17] (PS2) Milimetric: [WIP] Oozify sequence_stats hive script [analytics/refinery] - https://gerrit.wikimedia.org/r/143486 (https://bugzilla.wikimedia.org/67128)
[14:48:07] oh qchris_away: you know the hive trick you did to make it unnecessary to add the serde jar manually each time people ran hive?
[14:48:25] is that deep enough in the bowels of the machine that it means we don't have to add the jar from anywhere?
[14:48:40] as in: from coordinators, workflows, etc.
[14:56:19] milimetric: i think the only way to find out is to test it! :/ :)
[15:27:23] milimetric: I totally do not know. I haven't tried.
[15:29:28] k qchris thanks
[15:34:05] I just looked over Oozie's HiveMain (which is used by HiveAction to do the work), and it looks like it should work, if you provide the system's Hive configuration file.
[15:40:30] qchris: thanks for your super long detailed description of gerrit hash mismatch issue ( https://bugzilla.wikimedia.org/show_bug.cgi?id=53895#c28 )
[15:40:34] that is very nice
[15:40:55] Ha. Not sure if anyone is gonna read it ... :-)
[15:41:09] But it should give the OpenStack people some idea of how/why.
[15:41:34] I hope they can solve it on their end in a similar way.
[15:42:02] It certainly cost us much time to find a solution. Would be a pity if they'd have to spend as much time as we did.
[15:54:58] ottomata: sequence_stats needs webrequest_source right, as a parameter?
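Editor's note on the 09:41 explanation ("stripping vs. no stripping of leading zeros"): the snippet below is only a minimal illustration of why two byte representations of the same number produce different hashes. It is not mina-sshd's or the JDK's code; the helper names, the sample value and the use of SHA-1 are all assumptions made for the example.

```python
import hashlib


def fixed_width(value: int, width: int) -> bytes:
    """Encode value big-endian, padded with leading zero bytes to `width`."""
    return value.to_bytes(width, "big")


def stripped(value: int) -> bytes:
    """Encode value big-endian with no leading zero bytes."""
    length = max(1, (value.bit_length() + 7) // 8)
    return value.to_bytes(length, "big")


def digest(data: bytes) -> str:
    """Hash the raw bytes (SHA-1 chosen only for illustration)."""
    return hashlib.sha1(data).hexdigest()


# Hypothetical 4-byte secret whose top byte happens to be zero.
SECRET = 0x00ABCDEF

padded_bytes = fixed_width(SECRET, 4)   # keeps the leading zero byte
stripped_bytes = stripped(SECRET)       # drops the leading zero byte

# Same integer, different byte strings, therefore different hashes.
same_number = int.from_bytes(padded_bytes, "big") == int.from_bytes(stripped_bytes, "big")
hashes_differ = digest(padded_bytes) != digest(stripped_bytes)
```

A mismatch like this only shows up when the value happens to start with a zero byte, which would fit a bug that is hard to reproduce on demand.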
[15:58:22] yes, i believe so
[16:01:24] qchris: openstack infra folks are tracking the bug :-]
[16:01:42] qchris: they have the issue hitting them for some integration tests they are doing. That is what resurrected the bug on our side
[16:02:01] qchris: guess they will "just" upgrade
[16:02:09] :-D
[16:04:24] oh, ottomata, while you're here, 2 things
[16:04:43] yup?
[16:04:51] (1) it looks like RHive is now being actively supported by its parent company. woo! I'll be testing it when I get back and flinging bugs/patches upstream, and hopefully we'll have a connector solution there.
[16:05:25] (2) when's the CDH5 conversion expected? I want to be sure I can adequately warn consumers that we will/won't have data in [period]
[16:09:15] 1 cool!
[16:09:31] 2: ahhhh, still not sure, i was hoping this week but the cdh5 labs testing is taking longer than expected
[16:11:32] tis okay :)
[16:11:45] hey qchris, you fancy writing a parser for puppet? ;p
[16:12:20] trying to work out how to avoid manual tasks with updating the IP ranges for internal traffic in..whatever we call the pageviews solution. Unfortunately the only machine-readable, consistently updated list is in a puppet manifest.
[16:12:39] This may be an area where I go "this sort of looks like a tremendous pain"
[16:12:46] but, implementation details
[16:12:51] Ironholds: Sad to see that your email to Ops has not seen official responses.
[16:13:00] But the issue is even more involved :-/
[16:13:18] Like X-Forwarded-For handling ...
[16:13:36] And different parties in wmf trusting different IPs to set X-Forwarded-For correctly.
[16:13:45] actually I got a great reply from manybubbles but it was reply, not reply-all
[16:13:46] yep
[16:13:54] I mean, we have the XFF extension, which has one set of rules
[16:14:11] that's used by checkuser really effectively
[16:14:21] but I'm skeptical that we don't have 3 or 4 competing standards there ;p
[16:14:49] Like ... wp-zero trusting some special proxies :-)
[16:15:06] yep :D
[16:15:18] Aaaaand different legacy implementations treating different parts as "internal"
[16:15:21] the opera ranges are actually in the XFF setup for MW, which was interesting to discover, along with a lot of other ones
[16:15:26] yep :(
[16:15:29] Fun fun fun.
[16:15:49] from my point of view, I think we may not need internal tracking/exclusion for the initial setup.
[16:15:58] I'll respond to your Ops thread if nothing happens there. Just to get discussion started.
[16:16:00] I mean, if we start talking about multimedia image hits and such, absolutely.
[16:16:27] but I can't imagine there are text/html requests there, and the filters on API requests are positive, not negative (it looks for specified user agents and excludes anything without those user agents)
[16:16:34] I will test this and see what I can see
[16:17:05] Be sure to have a look at requests the ssl terminators make on behalf of users.
[16:17:22] ohh, yes.
[16:17:27] that's a good point. Oh dear.
[16:17:45] * Ironholds headscratches. Ugh.
[16:17:47] I am sure Ops know more edge cases.
[16:17:52] yep.
[16:18:02] That's why I had hoped they chime in more to your question.
[16:18:18] I am really pointing towards "we should not exclude this". I'll make the argument and see what people in ops/community/you and erikZ say when I propose it.
[16:20:12] k
[16:24:23] doh, the script does it
[16:24:38] I wish I knew why my brain turned against me when I do this stuff...
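Editor's note on the internal-range / X-Forwarded-For discussion (16:12 to 16:18): a rough sketch of why the point about SSL terminators matters. A request relayed by a trusted proxy arrives from an internal address even though it is made on behalf of an external user, so checking only the connecting IP against the "internal" ranges would misclassify it. Everything here is hypothetical: the ranges are RFC 5737 documentation blocks rather than any real Wikimedia ranges, and the function names are made up for the example. It does not reflect the XFF extension's actual rules.

```python
import ipaddress

# Hypothetical placeholders (RFC 5737 documentation blocks), NOT real ranges.
TRUSTED_PROXIES = [ipaddress.ip_network("192.0.2.0/24")]
INTERNAL_RANGES = [ipaddress.ip_network("198.51.100.0/24")]


def in_ranges(addr, ranges):
    """True if addr falls into any of the given networks."""
    ip = ipaddress.ip_address(addr)  # raises ValueError on malformed input
    return any(ip in net for net in ranges)


def effective_client_ip(remote_addr, xff_header=""):
    """Resolve the client IP, believing X-Forwarded-For only from trusted hops.

    Walks the header right to left: as long as the current hop is a trusted
    proxy, the next entry to its left is accepted as the real sender.
    """
    hops = [h.strip() for h in xff_header.split(",") if h.strip()]
    current = remote_addr
    while hops and in_ranges(current, TRUSTED_PROXIES):
        current = hops.pop()
    return current


def is_internal(remote_addr, xff_header=""):
    """Classify a request as internal based on the resolved client IP."""
    return in_ranges(effective_client_ip(remote_addr, xff_header), INTERNAL_RANGES)
```

For example, `effective_client_ip("192.0.2.10", "203.0.113.7")` resolves to the external user behind the proxy, while the same header sent by an untrusted address is ignored. Which hops deserve trust is exactly the part the conversation flags as inconsistent between systems, so the two lists above are the assumption that matters most.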
[16:33:48] (PS3) Milimetric: [WIP] Oozify sequence_stats hive script [analytics/refinery] - https://gerrit.wikimedia.org/r/143486 (https://bugzilla.wikimedia.org/67128)
[16:52:57] (PS1) QChris: Use index when querying for editors in a date range [analytics/geowiki] - https://gerrit.wikimedia.org/r/143636
[16:54:21] (CR) QChris: "Thanks go to springle for finding the issue, alerting" [analytics/geowiki] - https://gerrit.wikimedia.org/r/143636 (owner: QChris)
[17:26:31] qchris: if you're around, what bug number is the gerrit problem?
[17:26:41] never mind, found it
[17:26:42] Let me dig it up.
[17:26:42] sorry
[17:26:45] ok.
[17:51:18] (PS4) Nuria: Fix wiki cohort display for report cohorts [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/142514 (owner: Milimetric)
[17:53:01] qchris, you're absolutely right
[17:53:06] just conducted a test using the IP ranges we use
[17:53:11] (as 'internal ranges')
[17:53:58] definitely including a lot of requests through terminators; or, at least, the requests have the structure of normal requests and a lot of variation in UA
[17:54:34] I'd prefer something like "qchris, I found it. It's simple, you overlooked $X which solves the issue completely"
[17:54:44] Cannot we restart using that?
[17:55:27] hahaha
[17:55:30] if it would make you feel better ;p
[17:55:47] but..hmn. This would suggest we may have been undercounting for quite a while.
[17:55:48] Yes ... if you can provide a value for $X.
[17:56:12] Under/Overcounting depends on where/what.
[17:56:14] :D
[17:56:15] yep
[17:56:33] udp2log for example also has the requests arriving at the ssl terminators.
[17:56:44] yep
[17:56:50] That saves our neck for large parts of our infrastructure.
[17:56:54] I will further filter against other filters and hope there are 0 left at the end
[17:57:06] k.
[17:58:52] hmn, okay. there are 6 left at the end. And they are all odd things.
[17:59:23] 5 embedded commons requests, and..an API request? Of type text/html?!
[17:59:58] http://commons.m.wikimedia.org/w/api.php?useformat=mobile&r=[redacted]&origin=https://en.m.wikipedia.org
[18:02:41] I should write this up.
[18:47:37] (CR) Nuria: "Permits wise code works." (1 comment) [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/143040 (https://bugzilla.wikimedia.org/66087) (owner: Milimetric)
[19:09:01] (PS5) Milimetric: Fix wiki cohort display for report cohorts [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/142514
[20:28:20] (PS6) Milimetric: Fix wiki cohort display for report cohorts [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/142514
[20:33:02] (CR) Milimetric: "Nuria, I factored out the mess of logic from the html template, sorry you had to deal with that in the first place. That flow is actually" [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/142514 (owner: Milimetric)
[20:48:14] (CR) Milimetric: "I'm just starting out testing, but I think I see the problem. So, the links are ONLY created for WikimetricsBot. There was no Wikimetric" [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/143040 (https://bugzilla.wikimedia.org/66087) (owner: Milimetric)
[20:58:46] (CR) Milimetric: Add pretty symlink for WikimetricsBot (1 comment) [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/143040 (https://bugzilla.wikimedia.org/66087) (owner: Milimetric)