[05:45:36] I'm working on the content of wikipedia articles,(mainly concerned with the text part of a subset of articles) and calculate tfidf scores to look for specific word occurences in that subset [05:46:24] Would using the Wikitext of articles work or should I first parse them and strip everything wiki-specific to bring them down to bare text? [05:47:05] If I can work with Wikitext, any suggestions on how can I ignore wiki-syntax, which only adds noise to my text processing? [12:33:21] 10Revision-Scoring-As-A-Service-Backlog, 10Research Ideas, 10Wikimedia-Developer-Summit (2017): Building an AI wishlist & working groups for Wikimedia Projects - https://phabricator.wikimedia.org/T147710#2701365 (10Qgil) Do you expect this top be a discussion session with many people or more of a hands-on se... [15:04:51] o/ [15:42:45] 10Revision-Scoring-As-A-Service-Backlog, 10Research Ideas, 10Wikimedia-Developer-Summit (2017): Building an AI wishlist & working groups for Wikimedia Projects - https://phabricator.wikimedia.org/T147710#2703864 (10Halfak) Most likely, this will not be hands-on anything. Attendance is intended to be broad.... [15:57:25] halfak: Do you have/want to review a rather big patch in ORES extension? It's just CI tests and they all passed by jenkins [15:57:26] https://gerrit.wikimedia.org/r/#/c/314845/ [15:57:37] *have time/want [15:58:24] We should try to get someone with more relevant experience in this testing environment than me. [15:58:27] Maybe RoanKattouw [15:59:02] I added him as reviewer, He hasn't came back here yet [15:59:08] maybe he is busy [15:59:48] Could be. [16:00:15] I think he might have gone to WikiCon USA [16:00:36] Oh that [16:00:37] I'll join the meeting in 2min [16:00:56] Holy crap! Today is a holiday! [16:01:08] I'm not even supposed to be here today [16:01:08] lol [16:01:14] It's a regional holiday [16:44:35] 06Revision-Scoring-As-A-Service, 10ORES, 13Patch-For-Review, 15User-Ladsgroup: Quiet result.get Warning in tasks - https://phabricator.wikimedia.org/T146680#2704004 (10Halfak) Here's an example of the full warning ``` [2016-10-10 11:43:07,262: WARNING/Worker-8] /home/halfak/venv/3.5/lib/python3.5/site-pack... [17:52:16] (03PS1) 10Ladsgroup: Fixup maintenance/CleanDuplicateScores.php [extensions/ORES] (wmf/1.28.0-wmf.21) - 10https://gerrit.wikimedia.org/r/315141 [18:01:07] halfak: Amir1: Yes I'm at WCNA, and yes today is a holiday in the US and Canada :) [18:01:31] RoanKattouw: have fun! [18:01:37] Amir1: Please send me an email with the link and I'll review your patch tomorrow or at the airport this evening [18:01:54] (I often miss Gerrit CCs because I get a lot of Gerrit email) [18:02:04] kk [18:02:06] thanks! [18:02:45] RoanKattouw: https://gerrit.wikimedia.org/r/#/c/314449/ and https://gerrit.wikimedia.org/r/#/c/314845/ [18:05:55] (03CR) 10Dereckson: [C: 032] "SWAT" [extensions/ORES] (wmf/1.28.0-wmf.21) - 10https://gerrit.wikimedia.org/r/315141 (owner: 10Ladsgroup) [18:13:05] (03Merged) 10jenkins-bot: Fixup maintenance/CleanDuplicateScores.php [extensions/ORES] (wmf/1.28.0-wmf.21) - 10https://gerrit.wikimedia.org/r/315141 (owner: 10Ladsgroup) [18:29:14] halfak: BTW I've been talking to User:mxn and he's interested in running a labels campaign on viwiki [18:29:35] From the WikiLabels API it looks like one exists already, but I've forgotten where the human-readable information on campaigns is located [18:31:48] RoanKattouw: https://en.wikipedia.org/wiki/Wikipedia:Labels [20:04:45] 06Revision-Scoring-As-A-Service, 10ORES, 13Patch-For-Review, 15User-Ladsgroup, 07Wikimedia-log-errors: Quiet result.get Warning in tasks - https://phabricator.wikimedia.org/T146680#2704336 (10hashar)