[13:28:34] morning all
[15:09:26] halfak :o
[15:10:58] hi ToAruShiroiNeko
[15:11:09] hey :)
[15:16:09] #wikimedia-research-ORES
[18:28:03] morning halfak :)
[18:50:00] o/ Ironholds
[18:50:38] see you halfak
[18:51:51] halfak, how goes?
[18:52:14] Not bad. Just got done with a hacking session with the revscores team :)
[18:52:24] Lots of language bits and some dependency injection.
[18:52:27] How you doin?
[18:52:50] p. good! Writing a MediaWiki island parser :)
[18:53:03] heh. What's on your island?
[18:53:05] so far I've got template and link parsing/extraction/removal/manipulation down pat
[18:53:14] next, mako asked for a MW-table-to-df parser
[18:53:18] ...this is going to be all sorts of pain
[18:53:22] but it sounds fun, SO.
[18:54:33] heh. I can see why that'd be useful.
[18:54:45] yeah, but so damn difficult to do efficiently
[18:54:50] it's fine until colspan and rowspan get involved
[18:55:08] oh yeah. bah!
[18:56:06] I wrote a super-small-subset-of-SQL parser earlier
[18:56:09] works for my usecase
[18:56:11] but is rather slow
[18:56:21] should go back and understand the theory better at some point
[20:31:47] YuviPanda|zzz, cool!
[20:31:51] can you go patch Hive's parser? :D
[20:32:07] it doesn't recognise to exclude strings from parsing