[04:11:18] okay, wikilabels in staging seems happy
[04:18:08] deploying to the main instance
[05:14:12] 06Revision-Scoring-As-A-Service, 10wikilabels: Deploy updates for Wikilabels - https://phabricator.wikimedia.org/T134032#2253106 (10Ladsgroup)
[05:14:21] 06Revision-Scoring-As-A-Service, 10wikilabels: Review staging protocol for Wikilabels - https://phabricator.wikimedia.org/T133557#2253120 (10Ladsgroup)
[05:14:29] 06Revision-Scoring-As-A-Service, 10wikilabels: WikiLabels doesn't handle well revdeleted edits - https://phabricator.wikimedia.org/T130234#2253121 (10Ladsgroup)
[07:24:41] 06Revision-Scoring-As-A-Service, 10rsaas-editquality, 10wikilabels: Complete wikidatawiki edit quality campaign - https://phabricator.wikimedia.org/T130274#2253180 (10Ladsgroup) a:03Ladsgroup
[07:25:40] 06Revision-Scoring-As-A-Service, 10rsaas-editquality, 10wikilabels: Complete wikidatawiki edit quality campaign - https://phabricator.wikimedia.org/T130274#2131460 (10Ladsgroup) Working on it. I just labeled 450 edits, I'm going to do 263 more and then we are done!
[12:17:05] 06Revision-Scoring-As-A-Service, 10rsaas-editquality, 10wikilabels: Complete wikidatawiki edit quality campaign - https://phabricator.wikimedia.org/T130274#2253432 (10Ladsgroup)
[12:18:22] 06Revision-Scoring-As-A-Service, 10Wikidata, 10rsaas-editquality: Train / Test wikidata damaging model - https://phabricator.wikimedia.org/T134047#2253433 (10Ladsgroup)
[12:18:38] 06Revision-Scoring-As-A-Service, 10Wikidata, 10rsaas-editquality: Train / Test wikidata damaging model - https://phabricator.wikimedia.org/T134047#2253446 (10Ladsgroup)
[13:35:50] 06Revision-Scoring-As-A-Service, 10Wikidata, 10rsaas-editquality: Train / Test wikidata damaging model - https://phabricator.wikimedia.org/T134047#2253593 (10Ladsgroup) Damaging: ``` (p3)ladsgroup@ores-compute-01:~/editquality$ make models/wikidatawiki.damaging.gradient_boosting.model cat datasets/wikidataw...
[14:11:41] 06Revision-Scoring-As-A-Service, 10Wikidata, 10rsaas-editquality: Train / Test wikidata damaging model - https://phabricator.wikimedia.org/T134047#2253642 (10Ladsgroup) Good faith: ``` (p3)ladsgroup@ores-compute-01:~/editquality$ make models/wikidatawiki.goodfaith.gradient_boosting.model cat datasets/wikida...
[14:15:33] 06Revision-Scoring-As-A-Service, 10Wikidata, 10rsaas-editquality: Train / Test wikidata damaging model - https://phabricator.wikimedia.org/T134047#2253644 (10Ladsgroup) https://github.com/wiki-ai/editquality/pull/30
[14:26:35] o/
[14:29:54] Hey Amir1
[14:30:05] Just sat down and go through my messages from yesterday.
[14:30:19] halfak: hey
[14:30:29] Digging into the paper review I need to do for the next couple of hours and then I'll be working on the ORES paper. Hopefully, I'll have an agenda for our first meeting.
[14:30:42] https://github.com/wiki-ai/editquality/pull/30
[14:30:54] awesome
[14:31:04] I want to answer the email very soon
[14:31:15] I do it once I'm done with the precaching
[14:31:24] ROC-AUC of 99 O.O
[14:31:28] Woah
[14:31:38] this is so cool
[14:31:40] :D
[14:31:49] WTF. Better than awesome. Amazing!
[14:31:58] Will be very interesting to see this in practice.
[14:32:06] BTW, any word on the KDD reviews yet?
[14:32:19] not yet
[14:32:38] I guess it will start from May 10th or so
[14:33:25] kk
[14:33:30] Not unreasonable.
[14:33:46] * halfak hopes we can cite that in future work :)
[14:35:44] Filter rates in the 90s! \o/ I bet the effective filter rate will match our 99% figure.
[14:35:52] Some damage just isn't all that damaging.
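(Editor's note: the "filter rate" discussed above can be read as the share of edits a patroller could skip at a chosen threshold while still catching most damage. The sketch below is illustrative only — the labels and scores are made up and this is not the editquality tuning code; the real figures come from the model's own test statistics.)

```python
# Illustrative only: ROC-AUC and a "filter rate" for a damaging model.
# Hypothetical labels/scores; not the actual wikidatawiki test set.
import numpy as np
from sklearn.metrics import roc_auc_score

y_true = np.array([0, 0, 0, 0, 0, 0, 0, 1, 1, 0])   # 1 = damaging (made up)
scores = np.array([0.02, 0.10, 0.05, 0.20, 0.01,
                   0.03, 0.15, 0.90, 0.75, 0.08])    # model P(damaging) (made up)

print("ROC-AUC:", roc_auc_score(y_true, scores))

# Pick the lowest threshold that still catches the desired share of damaging
# edits, then see what fraction of all edits fall below it (the filter rate).
recall_target = 0.75
damaging = np.sort(scores[y_true == 1])
threshold = damaging[int(np.floor((1 - recall_target) * len(damaging)))]
filter_rate = np.mean(scores < threshold)
print("threshold:", threshold, "filter rate:", filter_rate)
```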
[14:36:09] We should be able to flip the class of interest for goodfaith.
[14:36:19] So that we can also report our filter rate of vandalism.
[14:36:42] We can revert 70% of damage and expect < 0.1 false-positive rate!
[14:36:48] Holy crap is this model effective.
[14:36:56] Oh wait... these stats are weird.
[14:37:03] Because the sample isn't balanced.
[14:37:04] Hmm.
[14:37:25] it's 2.4 K against 20K
[14:37:25] Or rather because it *is* balanced. We need a natural test set in order to know for sure.
[14:37:40] Regardless, these all suggest really strong signal.
[14:37:48] I checked the numbers very carefully
[14:38:02] the prelabel we loaded to wikilabels was like this
[14:38:18] 20K prelabeled and 4K needs review
[14:38:50] But we prelabeled the balanced reverted/not-reverted set
[14:39:01] So it's biased towards damage at that point.
[14:39:08] Good for signal. Bad for interpretation
[14:45:16] (p3)ladsgroup@ores-compute-01:~/editquality/datasets$ grep "True" wikidatawiki.rev_damaging.5k_2016.tsv | wc -l
[14:45:17] 2697
[14:45:46] halfak: this is the number of damaging cases in the 4283 that were loaded into wikilabels
[14:46:05] 24471 wikidatawiki.prelabeled_revisions.20k_balanced_2015.tsv
[14:46:12] We'd need ~10 million edits to have 2700 instances of damage show up!
[14:46:38] I think we did it with 500K of human edits
[14:46:39] Anyway, this is all good. Just hard to know exactly what the tradeoffs will be in practice with regards to false positive *rates*
[14:46:40] AFAIK
[14:46:42] IIRC
[14:46:51] We did that for the paper.
[14:47:02] yeah you're right
[14:47:05] But for the modeling work, I think we worked with your balanced set from an XML processing job.
[14:47:22] yeah
[14:47:25] I did that
[14:47:33] It was I think about 10M
[15:23:44] * halfak works a little bit on text vectorization for sabya.
[15:23:48] Check this out Amir1 https://gist.github.com/halfak/f1c334690e846309fd4d8c272aca12a8
[15:24:09] Not related to what we're working on, but it was nice to have an idea turn into a piece of code in minutes.
[15:24:14] It works, it's fast and it's simple
[15:24:34] This demonstrates a simple and flexible way to create #grams.
[15:24:52] The last line generates unigrams, bigrams, trigrams and single-skipgrams for a sequence of numbers.
[15:25:02] Will work great for sequences of tokens in a revision :)
[15:30:04] \o/
[15:30:48] I'm trying to understand what's going on
[15:32:34] halfak: https://grafana.wikimedia.org/dashboard/db/ores
[15:32:43] so our precaching is up now
[15:33:11] I think the reason for the overload was that I was running precache from two instances
[15:33:23] one was the daemon and the other one was --verbose
[15:33:27] running directly
[15:41:16] Looks good now.
[15:41:35] So, one more thought: we should be able to run two precachers in parallel.
[15:41:41] We should be able to run 10!
[15:41:56] I think that the work you did to discover the worker queues is key here.
[15:42:35] Right now, the workers have a queue that gets populated by jobs. The jobs aren't "known" to the system until processing starts.
[15:43:25] I think that delay is messing with our ability to associate requests to score an *in-process* revision with the *in-process* celery job.
[15:43:40] I should do a writeup about this.
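(Editor's note: the gist linked above is not reproduced here, but a generator along the lines halfak describes — unigrams, bigrams, trigrams and single-skipgrams from one pass over a sequence — could look roughly like this. This is a sketch of the idea, not the gist's actual code; the function name and offset-pattern convention are assumptions.)

```python
# Sketch of a flexible n-gram / skip-gram generator over any sequence.
def grams(sequence, patterns):
    """Yield tuples of items selected by relative-offset patterns.

    `patterns` is a list of offset tuples, e.g. (0,) for unigrams,
    (0, 1) for bigrams, (0, 1, 2) for trigrams, (0, 2) for single-skipgrams.
    """
    items = list(sequence)
    for i in range(len(items)):
        for pattern in patterns:
            if i + max(pattern) < len(items):
                yield tuple(items[i + offset] for offset in pattern)

# Unigrams, bigrams, trigrams and single-skipgrams for a sequence of numbers:
print(list(grams(range(5), [(0,), (0, 1), (0, 1, 2), (0, 2)])))
```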
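(Editor's note: one pattern for making queued or in-process scoring work "known" outside the worker is to give each job a deterministic Celery task id, so a second request for the same (wiki, model, rev_id) can attach to the existing result instead of re-scoring. The sketch below is an assumption about how this could be approached — the task name, broker URL, and id scheme are hypothetical, and this is not ORES's actual implementation.)

```python
# Hypothetical sketch: deduplicate scoring requests via deterministic task ids.
from celery import Celery

app = Celery("scoring",
             broker="redis://localhost:6379/0",
             backend="redis://localhost:6379/1")
# Assumes `task_track_started = True` so in-process jobs report STARTED.

@app.task
def score_revision(wiki, model, rev_id):
    ...  # feature extraction + model scoring would happen here

def request_score(wiki, model, rev_id):
    task_id = f"{wiki}:{model}:{rev_id}"
    result = app.AsyncResult(task_id)
    if result.state in ("STARTED", "SUCCESS"):
        # Attach to the in-process (or finished) job instead of re-queueing.
        return result
    # A job that is queued but not yet picked up still reports PENDING --
    # the "jobs aren't known until processing starts" problem described above.
    return score_revision.apply_async(args=(wiki, model, rev_id), task_id=task_id)
```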
[15:45:47] 10Revision-Scoring-As-A-Service-Backlog: [Spike] Explore ORES hanging of *in-process* scorings - https://phabricator.wikimedia.org/T134064#2253759 (10Halfak)
[15:45:55] 10Revision-Scoring-As-A-Service-Backlog, 10ores: [Spike] Explore ORES hanging of *in-process* scorings - https://phabricator.wikimedia.org/T134064#2253772 (10Halfak)
[15:45:57] https://phabricator.wikimedia.org/T134064
[15:46:17] 10Revision-Scoring-As-A-Service-Backlog, 10ores: [Spike] Explore ORES handling of *in-process* scorings - https://phabricator.wikimedia.org/T134064#2253759 (10Halfak)
[15:46:52] halfak: we can have a simple workaround for that by using more selective precaching and saying this daemon precaches enwiki, this one precaches wikidata, and so on
[15:47:01] it should be easy to implement that
[15:47:22] (we give the wiki and models via argument)
[15:47:33] Not sure if that will help the problem.
[15:47:51] From ORES point of view, the request pattern will look identical.
[15:48:21] hmm
[15:48:23] yeah
[15:48:27] you're right
[15:49:35] Hmm... Looks like I have wasted my entire paper review time writing emails >:(
[15:51:01] :(((
[15:51:12] What can I do to help?
[15:52:43] Heh. Invent cloning and/or time travel.
[15:52:51] Actually, if I'm going to have time travel, I need cloning too.
[15:52:58] Otherwise, I'll just end up getting old really fast.
[15:53:13] Since I'll stop time/go back in time and work on things.
[15:53:40] halfak: Also is it okay to deploy damaging and goodfaith for wikidata?
[15:53:44] * halfak suddenly releases the most complete software system ever seen and promptly dies at the ripe age of 95.
[15:54:04] Yeah! Do you want to do the staging shuffle?
[15:54:11] I wish at least some parts of Harry Potter was real, specially the magic watch
[15:54:26] halfak: yeah
[15:54:39] I just want to get permission first
[15:54:49] What's your pypi username?
[15:55:04] (No pypi stuff should be necessary, but I'll get it set up anyway.)
[15:55:27] Amir_Sarabadani
[15:55:33] everything else was taken
[15:55:42] Couldn't get Ladsgroup!?
[15:55:44] https://pypi.python.org/pypi/pywikibase
[15:55:50] yeah, I tried
[15:55:58] WTF
[15:56:31] in instagram, I tried Ladsgroup, was taken, then I tried Amir Sarabadani and it was taken so I tried "amirsarabadanitafreshi" and it worked!
[15:56:44] Package Index Owner: Amir_Sarabadani
[15:56:49] Yeah. I had a similar experience.
[15:57:05] Tried halfak, EpochFail, and a bunch of other handles I have used in the past.
[15:57:21] halfak: I add you to this repo too
[15:57:25] I hope that's okay
[15:57:53] I added you as a maintainer. Let's see how that works. It seems like that is what the system expects of us.
[15:58:15] cool
[16:02:51] OK. I think I have you added to all the things. revscoring, wikiclass, editquality and ores.
[16:04:54] I really wish that we could get the other WMF staffers working on AI to join this channel
[16:05:04] Lzia is just picking up work on image classification for commons.
[16:05:11] *sigh*
[16:05:59] thanks
[16:06:01] :)
[16:06:09] I hope we can get more people soon
[16:06:20] once we have some publicity
[16:06:29] specially the extension deployed in some big wikis
[16:11:03] https://pypi.python.org/pypi/pywikibase
[16:11:10] halfak: you are an owner now
[16:11:43] Amir1, do you want to push a new version of revscoring and deploy that? It would be nice to have the dict lookup speedup deployed :)
[16:11:52] It would be a good test of pypi and the deploy process.
[16:16:49] sure halfak
[16:17:03] I'm online off and on because of dinner
[16:38:07] halfak: I just pushed new version of revscoring to 1- github 2- pypi
[16:39:17] now it's time to deply
[16:39:22] *deploy
[17:07:32] All looks good on pypi and versioning.
[17:18:18] is the python-mwapi the upgraded version of mwapi ?
[17:18:44] python-mwapi == mwapi
[17:18:51] It's just called python-mwapi in the repo
[17:18:57] pypi knows it as simply "mwapi"
[17:19:58] alright
[17:30:36] working with gerrit is getting harder everyday https://gerrit.wikimedia.org/r/#/c/286283/
[17:30:37] :D
[17:57:56] 06Revision-Scoring-As-A-Service, 10Wikidata, 10rsaas-editquality: Train / Test wikidata damaging model - https://phabricator.wikimedia.org/T134047#2253886 (10Ladsgroup)
[18:48:32] 06Revision-Scoring-As-A-Service, 10rsaas-editquality, 10wikilabels: Complete wikidatawiki edit quality campaign - https://phabricator.wikimedia.org/T130274#2253942 (10Ladsgroup) I encountered {T130872} again today but it was bearable.
[19:07:32] going to sleep
[19:07:33] o/
[22:42:07] 06Revision-Scoring-As-A-Service, 10wikilabels: [Investigate] Intermittent performance issues with wikilabels - https://phabricator.wikimedia.org/T130872#2254132 (10Halfak) 05Resolved>03Open
[22:43:19] 06Revision-Scoring-As-A-Service, 10wikilabels: [Investigate] Intermittent performance issues with wikilabels - https://phabricator.wikimedia.org/T130872#2149118 (10Halfak) In T130274, @Ladsgroup said: > I encountered T130872 [performance issues] again today but it was bearable. So I'm re-opening this. I thin...
[22:43:35] 06Revision-Scoring-As-A-Service, 10wikilabels: [Investigate] Intermittent performance issues with wikilabels - https://phabricator.wikimedia.org/T130872#2254136 (10Halfak)
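(Editor's note: as the 17:18 exchange above says, the package published from the python-mwapi repo is simply "mwapi" on PyPI. A minimal usage sketch follows; the target wiki, User-Agent string, and query are arbitrary examples, not anything specific to this deployment.)

```python
# Minimal example of the `mwapi` package (the PyPI name of python-mwapi).
import mwapi

session = mwapi.Session("https://en.wikipedia.org",
                        user_agent="example-script (contact: someone@example.com)")

# Ask the MediaWiki API for basic site information.
response = session.get(action="query", meta="siteinfo")
print(response["query"]["general"]["sitename"])
```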