[01:07:05] done for the night, see y'all tomorrow [12:53:54] yoyoyo [12:54:01] drdee: hello [13:09:38] hey [14:14:46] hey milimetric [14:14:53] can you have a quick glance at https://gist.github.com/5293b54d9df63aa3fa5e [14:14:54] morning drdee [14:14:56] sure [14:14:59] like max 10minutes [14:15:04] and tell me what you think it does [14:15:38] heh, you're not gonna tell me what it's trying to do? [14:16:47] k, i'll read it now [14:18:20] i have a suspicion [14:19:57] phew! [14:20:02] another annoying morning [14:20:04] goodmorning! [14:20:09] 1. internet wasn't working at apartment [14:20:11] morning ottoamta [14:20:11] it is raining [14:20:12] biked to cafe [14:20:16] forgot computer [14:20:17] flat tire? [14:20:19] :( [14:20:24] biked back to apt., got computer, biked to cafe [14:20:30] and IRC does not like the internet in cafe [14:20:37] set up socks proxy... [14:20:40] now I am looking good! [14:20:43] well it's almost weekend [14:20:51] yup [14:22:32] drdee, did you say you were adding filtering based on X-Forwarded for? [14:22:38] or did I make that up? [14:23:45] yep, i will add that [14:23:51] and respond at that emial [14:24:06] k i'm responding too [14:26:03] oh [14:26:45] drdee, huh, weird [14:27:04] i saw some localization code trying to figure out what general country you're in [14:27:20] i saw some CDN loading code and loading code to make sure scripts are loaded [14:27:48] and i saw some flash embedding code to inject an swf into the DOM once some of those other things are done [14:28:06] not really sure what all those codes are though [14:28:26] could it be browser fingerprinting? [14:28:29] all the objects with "6472d" in them labeled by country [14:29:19] well to be honest I don't know the specifics of browser fingerprinting but there doesn't seem to be much more than some basic IE detection in terms of browser type [14:29:42] ok [14:29:59] somehow this gets loaded on some very rare wikipages [14:30:34] milimetric: browser fingerprinting is basically getting a lot of data from your browser, hashing it, so it can identify you as a unique visitor.. something like that [14:30:43] right, no I know that [14:30:51] but I'm not sure the specifics [14:31:11] drdee what do you mean by rare? [14:31:17] not often visited? [14:31:39] sorry i mean some obscure wiki page with very little traffic, maybe a user page [14:31:59] hm... by us? [14:34:06] either volunteer / or malicious user, not wmf [14:43:40] ok, so after looking at it a little more, this looks like some type of cross site scripting attack maybe [14:43:58] it has all kinds of insertJS, insert SWF type functions [14:44:30] I'd let that guy (Steve? Mark?) who presented on security take a look [14:45:20] that was my suspicion as well [14:45:29] i think Chris Steipp [14:45:31] I just can't see a legitimate reason why someone would need to augment a page with stuff [14:45:34] right, Chris [14:45:38] no [15:57:03] Saper is having a git+gerrit workshop on 3rd October at 07:00 UTC+3:00 (more info on #git-gerrit) [15:57:07] I'm gonna be there then [15:57:14] thought you guys might be interested [15:57:22] I found out this morning [16:01:29] k [16:37:05] drdee: train woes, so I'll be a bit late for the scrum [16:37:13] is this a show-and-tell day too? [16:37:22] i didn't see an invite, i don [16:37:24] t think [16:52:04] average_drifter: any progress? [16:54:58] https://plus.google.com/hangouts/_/2e8127ccf7baae1df74153f25553c443bd351e90 [16:55:11] morning dschoon [16:55:28] ohai [16:55:32] one sec [16:55:40] i imagine being dressed is a good idea [17:03:56] drdee: yes, but we still need to make a test suite [17:04:10] drdee: I mean even if I extract the code from filter and put it in udp-filters [17:04:54] drdee: so first question would be: Do the logs of squid in /a/squid/archive/sampled have a 1-to-1 correspondence with processed counts here http://dumps.wikimedia.org/other/pagecounts-raw/2012/2012-09/ ? [17:06:20] test suite is nice idea, i like it, but let's firs focus on functionality [17:06:53] drdee: are there outstanding unfixed bugs in filter right now ? [17:07:03] drdee: I remember yesterday we talked about a leading slash in teh title [17:07:07] yeah [17:07:22] so for wiki's if the title ends with a '/' then drop the '/' [17:07:34] drdee: but I need to talk with you to make a list of those, and identify which squid line caused those bugs to surface [17:07:46] ok call me on skype [17:07:53] in 3 minutes [17:07:55] yes [17:10:49] you can Skype me now [18:15:11] drdee: heh. Not that MediaWiki drops the /: https://www.mediawiki.org/wiki/Analytics/ [18:15:26] which means that technically https://www.mediawiki.org/wiki/Analytics/ and https://www.mediawiki.org/wiki/Analytics are different pages [18:15:28] (norly) [18:15:45] oh crap [18:15:57] okay average_drifter: ignore my remarks regarding '/' [18:16:00] just leave it as-is [18:16:13] average_drifter ^^ [18:18:34] other valid but ridiculous pages [18:18:36] https://www.mediawiki.org/wiki// [18:19:33] that's a great page to hide something nasty, because it's so easy to overlook [18:23:49] brb coffee time [18:25:29] drdee: so I won't remove the trailing slashes ? [18:29:36] average_drifter: no, they're valid pages. [18:29:49] everything after /wiki/ is part of the page title [18:29:57] which is why https://www.mediawiki.org/wiki/Analytics/ and https://www.mediawiki.org/wiki/Analytics are different pages [18:31:13] hm ok [18:31:18] so I'll just remove what I wrote [18:32:14] drdee: so just the widechar bug [18:35:02] db1047 aka s1-analytics is going down for maintenance (in case you missed Asher's announcement) [18:35:26] thanks! [18:35:40] (i saw -- i think erosen is the only one who might be using) [18:36:58] not using it either actually' [18:40:57] an oldie, but such a good read: http://www.kalzumeus.com/2011/10/28/dont-call-yourself-a-programmer/ [19:23:14] average_drifter shoot [19:58:38] hm, I'm wondering if the google hangout I'm on is wrong [19:58:46] clicking on the one in the invite never works... [19:59:09] https://plus.google.com/hangouts/_/4d17c1bee0c30e050921f0c1d83773c393267237 [20:01:54] dschoon? [20:01:58] ottomata? [20:02:00] erosen [20:02:00] ? [20:02:02] ^^ [20:02:08] haii [20:02:15] https://plus.google.com/hangouts/_/4d17c1bee0c30e050921f0c1d83773c393267237 [20:12:25] that is so cool [20:12:28] hangout by phone! [20:14:59] https://gist.github.com/3801810 [21:03:01] yawn [21:08:55] pawn [21:12:08] dawn [21:20:01] fawn [21:20:14] prawn [21:20:43] "I am not a shrimp! I am a KING PRAWN!" [21:21:22] http://www.youtube.com/watch?v=ZmCp8rvXbdU [21:28:37] seems pretty brawny [21:36:25] average_drifter: All good? [21:46:42] dschoon: heya.... CT and I are in R32 talking about the Dell boxes [21:46:57] yep. [21:46:58] omw [22:02:54] i, for one, am extremely excited about our machines getting replaced [22:03:14] you see that, drdee drdee_ ottomata [22:04:23] Yep but not super surprised but it is good news [22:04:33] Same model or different model? [22:07:01] R series, I believe [22:07:11] we are not accepting any further C2100s. [22:10:22] Cool [23:10:24] btw, http://nexus.wmflabs.org/nexus/index.html#welcome