[09:11:05] <Phantom42>	 Amir1: Hello! You are mentoring my task on GCI (adding feature to wikiclass). May I ask you some questions about my task?
[09:34:05] <Amir1>	 Phantom42: hey, sure. Can you give link to the exact phab card?
[09:34:11] <wikibugs>	 10Scoring-platform-team (Current), 10ORES, 10Operations, 10Graphite, and 2 others: Regularly purge old ores graphite metrics - https://phabricator.wikimedia.org/T169969#3892437 (10fgiunchedi) 05Open>03Resolved All done! Agreed the parameter isn't the best, and naming is hard :(  This task is done from...
[09:37:21] <Phantom42>	 Amir1: Here is the task on Phab: https://phabricator.wikimedia.org/T174384
[09:38:05] <Amir1>	 Amir1: Cool
[09:38:07] <Phantom42>	 So I check wikiclass and revscoring code and made myself familiar with it. I also understand what needs to be done in that task
[09:38:16] <Amir1>	 That would be very interesting 
[09:38:22] <Amir1>	 pinged myself. facepalm
[09:38:57] <Amir1>	 Phantom42: so what you need to do is to add some features that can signal big clumps of unreferenced text 
[09:39:02] <Phantom42>	 I didn't have problems with building models on my machine (with `make models/enwiki.nettrom_wp10.gradient_boosting.model`). But I have problems with generating tunning reports
[09:39:24] <Phantom42>	 Error log for tunning reports: https://dpaste.de/LqOV
[09:39:56] <Amir1>	 it seems the make command is wrong
[09:40:13] <Phantom42>	 Yes. There are some problems with Makefile
[09:41:25] <Amir1>	 Phantom42: can you get make the version of revscoring library (you can get it by pip freeze)
[09:42:07] <Phantom42>	 pip shows 2.0.8
[09:42:15] <Phantom42>	 I had some problems with 2.1.0 so I downgraded it
[09:42:37] <Phantom42>	 Try generating tunning report with 2.1.0?
[09:42:42] <Amir1>	 yup
[09:42:46] <Amir1>	 otherwise it doesn't work
[09:42:53] <Amir1>	 we should fix the upgrade
[09:42:57] <Phantom42>	 Okay, thanks. Will try it now
[09:47:37] <Phantom42>	 Amir1: Still failing after upgrading: https://dpaste.de/L3aZ :(
[09:50:06] <Amir1>	 let me check the changes in the code
[09:51:38] <Phantom42>	 I played a bit with parameters and found out that it stops failing after removing `--label-type=str`, but generates empty output file :(
[09:52:31] <Amir1>	 can you send me the log and what it outputs you?
[09:54:31] <Phantom42>	 One moment...
[09:59:15] <Phantom42>	 Amir1: So here is what happens if I remove `--label-type=str`: https://dpaste.de/uSCy
[10:00:24] <Amir1>	 it seems sklearn library is not installed and called correctly
[10:00:34] <Amir1>	 Model sklearn.svm.SVC does not have a train() method.
[10:04:20] <Phantom42>	 Hm, I think I forgot to install it. Will install and try again now.
[10:11:26] <Phantom42>	 Amir1: Okay, I installed it with (pip3 install -U scikit-learn). The output has changed a bit, but looks like the error is the same: https://dpaste.de/VDbu
[10:13:25] <Amir1>	 Phantom42: I think the version that should be installed is very important 
[10:13:32] <Amir1>	 otherwise, it makes a mess
[10:14:44] <Amir1>	 Phantom42: install 0.17
[10:15:09] <Phantom42>	 Okay, will try now...
[10:37:31] <Phantom42>	 Amir1: Installing 0.17 did not help: https://dpaste.de/3nSq Maybe there are some other dependencies needed?
[10:48:26] <Amir1>	 Phantom42: it can be
[10:48:38] <Amir1>	 go through dependencies in requirements.txt in revscoring
[10:51:37] <Phantom42>	 Amir1: Just tried running "pip3 install -r requirements.txt" for revscoring requirements.txt. Got "Requirement already satisfied" for all of them
[10:54:42] <Amir1>	 Phantom42: give it a try with pip3 install -U -r requirements.txt
[10:54:55] <Amir1>	 and if that doesn't work out, I'm running out of ideas :(
[11:00:59] <Phantom42>	 I just tried it. Some dependencies were updated. But still the same problem. I am doing everything in parallel on 2 machines - same problem on both. Let me try rebuilding the model. Hopefully I will get that error I was previously getting there and it gives us some clues. If it doesn't, I will try again in virtualenv
[11:06:57] <Phantom42>	 Hm, I didn't get error building model with 2.1.0 revscoring this time. Okay, let's wait for model to rebuild and while I am waiting, I will try in virtualenv other machine
[11:15:07] <Phantom42>	 Unluckily no difference with virtualenv
[11:16:34] <Phantom42>	 I will also try doing things on machine with different OS. Both previous machines were running Ubuntu, but I also have laptop running windows. Will try there too...
[14:48:23] <halfak>	 o/
[14:49:16] <Phantom42>	 Amir1: I rebuilt the model, but it didn't help. And trying things on Windows doesn't seem to be good, as there are even more problems because some dependencies need to be compiled. I better try to get it to work on my Ubuntu machine... 
[14:51:51] <wikibugs>	 (03PS1) 10Awight: Namespace maintenance scripts so they're discoverable from tests [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403663 (https://phabricator.wikimedia.org/T184140)
[14:51:56] <awight>	 halfak: heyo
[14:52:04] <halfak>	 yo!  Good morning
[14:52:32] <halfak>	 Today is super weird -- sandwiched between trips.  
[14:52:47] <halfak>	 Just about to start my first meeting :|
[14:54:34] <awight>	 Fine with me, I’ve been digging into E:ORES tests like a wombat
[14:54:50] <awight>	 Phantom42: Sorry I’m late to the party.  Looking at your error output, I think there’s something wrong with our Makefile.
[14:55:14] <halfak>	 awight, do you have a rough sense for what the progress is on that work?  E.g. do you think we'll be able to call the refactor "done" soon?
[14:55:40] * halfak needs to find time to finish converting ores.wmflabs.org to stretch. 
[14:55:42] <awight>	 Phantom42: The --pop-rate option doesn’t appear in the “tune” tool’s error output.
[14:56:22] <awight>	 halfak: The refactor is happing at blinding speed, IIRC Amir1 is only planning on slicing up one more small class, if any.
[14:56:36] <halfak>	 gotcha.  Cool.  thanks :) 
[14:56:42] <awight>	 The tests are at 56% coverage, out of a self-imposed goal of 60%.
[14:56:50] <awight>	 I’ll hit that in a couple of hours.
[14:59:11] <wikibugs>	 (03CR) 10jerkins-bot: [V: 04-1] Namespace maintenance scripts so they're discoverable from tests [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403663 (https://phabricator.wikimedia.org/T184140) (owner: 10Awight)
[14:59:54] <awight>	 halfak: ah, also I remembered about this great code quality tool to help us find the pain points: https://scrutinizer-ci.com/g/wikimedia/mediawiki-extensions-ORES
[15:03:36] <awight>	 Oh we must have already had this conversation…. https://scrutinizer-ci.com/g/wiki-ai/revscoring/
[15:05:04] <wikibugs>	 10Scoring-platform-team, 10Collaboration-Community-Engagement, 10MediaWiki-extensions-ORES, 10Patch-For-Review, 10User-notice-collaboration: Deploy ORES filters to Simple Wikipedia - https://phabricator.wikimedia.org/T182012#3893520 (10Trizek-WMF) What is the status of that task? I haven't seen the filte...
[15:10:36] <Phantom42>	 awight: Hm, initially the problem was `--label-type` parameter and we removed it from command, so it works. But what's with `--pop-rate`?
[15:13:39] <awight>	 Phantom42: Sorry, this is definitely a problem with our Makefile.
[15:13:44] <codezee>	 o/
[15:14:05] <awight>	 Phantom42: Can I ask why you’re trying to build that particular model?  Have you and Amir1 already determined it’s the closest fit for your project?
[15:14:34] <awight>	 oh hehe I see, you’re just running the tuning reports.
[15:16:59] * awight files a bug
[15:18:22] <Phantom42>	 awight: I am working on this task: https://phabricator.wikimedia.org/T174384  I am working on adding new feature to enwiki and I need running reports to see if model prediction rates improved after new feature was added.
[15:18:41] <Phantom42>	 tuning reports *
[15:22:12] <awight>	 Phantom42: Can you show me the result of “pip freeze | grep revscoring"?
[15:22:34] <awight>	 oh nvm, I see it’s in your paste already!
[15:25:28] <awight>	 Phantom42: I’m curious how you got “datasets/enwiki.labeling_revisions.w_cache.nettrom_30k.json”, which isn’t in the Makefile.
[15:26:17] <awight>	 ah my fault again—I was in editquality, but you’re working in wikiclass.
[15:27:13] * awight pretends to drink some coffee
[15:29:46] <awight>	 Amir1: btw, <3 tqdm, I use it for everything now.
[15:31:36] <Amir1>	 awight: sorry, was afk meeting with Angel
[15:31:38] <Amir1>	 so
[15:31:58] <awight>	 Amir1: that’s right!  Cool, I hope it went well.
[15:32:51] <Amir1>	 Yeah
[15:33:08] <Amir1>	 Okay, Cool. Today is wikidata day, so can't work much
[15:33:17] <Amir1>	 but if there is anything, let me know :)
[15:33:50] <wikibugs>	 10Scoring-platform-team, 10ORES: Makiefile tuning reports broken by deprecated command parameters - https://phabricator.wikimedia.org/T184727#3893621 (10awight)
[15:34:45] <Amir1>	 awight: there is one small thing with your patch: https://integration.wikimedia.org/ci/job/mwext-testextension-hhvm-jessie/28693/console
[15:35:11] <awight>	 Amir1: I’m planning to write tests for maintenance/, do you think that’s worthwhile?
[15:35:12] <Amir1>	 otherwise it should've been done loong time ago 
[15:36:09] <awight>	 Amir1: That’s weird, it totally does use ORESService.
[15:36:21] <awight>	 Lint fail?
[15:40:09] <awight>	 Phantom42: Looks like it was something simple.  You can pull this branch, or make the change locally as you wish: https://github.com/wiki-ai/wikiclass/pull/58/files
[15:42:28] <Phantom42>	 awight: Good! But unluckily it still does not help with the problem that I get empty tuning report.
[15:42:41] <awight>	 hahaha that’s something else, then.
[15:43:01] <awight>	 Can you paste the output?
[15:43:43] <Phantom42>	 awight: https://dpaste.de/3nSq
[15:44:40] <awight>	 Phantom42: I haven’t actually run this tool myself, yet :-/ but this line looks most suspicious:
[15:44:42] <awight>	 > Running gridsearch for 0 model/params pairs
[15:44:48] <awight>	 Not much of a grid!
[15:45:30] <Phantom42>	 But I have a built model... Why doesn't it use it? 
[15:46:10] <awight>	 Hmm, I get a totally different error...
[15:46:30] <Phantom42>	 What do you get? 
[15:47:01] <travis-ci>	 wiki-ai/wikiclass#28 (fix_T184727 - 8066d2b : Adam Roses Wight): The build passed. https://travis-ci.org/wiki-ai/wikiclass/builds/327731953
[15:50:09] <awight>	 Phantom42: I get, https://dpaste.de/ooK6
[15:50:25] <Amir1>	 that is extremely weird 
[15:50:41] <awight>	 I’m copying down a built model to see if that’s the issue and it’s just a misleading error.
[15:51:19] <awight>	 Amir1: phpcs passes locally!
[15:51:31] <Phantom42>	 awight: Hm. I didn't have such error 
[15:52:55] <wikibugs>	 (03CR) 10Awight: "recheck" [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403663 (https://phabricator.wikimedia.org/T184140) (owner: 10Awight)
[15:54:11] <wikibugs>	 (03CR) 10jerkins-bot: [V: 04-1] Namespace maintenance scripts so they're discoverable from tests [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403663 (https://phabricator.wikimedia.org/T184140) (owner: 10Awight)
[15:57:34] <wikibugs>	 (03PS2) 10Awight: Namespace maintenance scripts so they're discoverable from tests [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403663 (https://phabricator.wikimedia.org/T184140)
[15:58:50] <wikibugs>	 (03CR) 10Awight: "php-cs is killing me.  With the change in PS2, php-cs seems to pass, but of course the script crashes at runtime.  With or without the cha" [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403663 (https://phabricator.wikimedia.org/T184140) (owner: 10Awight)
[15:59:22] <wikibugs>	 (03CR) 10Awight: [C: 04-1] Namespace maintenance scripts so they're discoverable from tests [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403663 (https://phabricator.wikimedia.org/T184140) (owner: 10Awight)
[16:02:10] <wikibugs>	 (03PS3) 10Awight: Namespace maintenance scripts so they're discoverable from tests [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403663 (https://phabricator.wikimedia.org/T184140)
[16:02:12] <wikibugs>	 (03CR) 10jerkins-bot: [V: 04-1] Namespace maintenance scripts so they're discoverable from tests [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403663 (https://phabricator.wikimedia.org/T184140) (owner: 10Awight)
[16:02:28] <awight>	 what is happening
[16:04:51] <awight>	 Amir1: :D It was silently rebase, which changed that line to your MediaWikiServices singleton call.  So my use statement was extraneous after rebase.
[16:05:55] <wikibugs>	 (03PS4) 10Awight: Namespace maintenance scripts so they're discoverable from tests [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403663 (https://phabricator.wikimedia.org/T184140)
[16:08:53] <wikibugs>	 (03CR) 10Awight: Namespace maintenance scripts so they're discoverable from tests [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403663 (https://phabricator.wikimedia.org/T184140) (owner: 10Awight)
[16:10:23] <awight>	 halfak: Feel like sharing a clue… https://dpaste.de/ooK6/raw
[16:11:02] <halfak>	 awight, features aren't extracted.
[16:12:57] <awight>	 halfak: I double-checked and datasets/enwiki.labeling_revisions.w_cache.nettrom_30k.json does included the cached features.
[16:13:28] <awight>	 Or at least, a bunch of binary data in the “cache” field...
[16:13:31] <halfak>	 awight, looks like one of the features isn't cached. :/  That's my only explanation
[16:13:46] <halfak>	 Might be different versions of wikiclass/revscoring
[16:13:52] <awight>	 OK cool, that’s helpful!  Yes I think that’s it.
[16:14:13] <awight>	 I grabbed this file out of a random homedir to “save time” :)
[16:14:18] <awight>	 Will rebuild from scratch.
[16:14:24] <halfak>	 :)  godspeed
[16:17:18] <Phantom42>	 And meanwhile I am experimenting with dependencies, versions, etc...
[16:17:48] <awight>	 Phantom42: Probably a good idea.  Thanks for being patient :)
[16:19:52] <awight>	 halfak: Feel free to try to diagnose the latest issue that Phantom42 is running against, if you find time: https://dpaste.de/3nSq
[16:19:58] <awight>	 Note line 34.
[16:20:07] <awight>	 The output is empty.
[16:20:31] <halfak>	 Old version of wikiclass with a new version of revscoring :) 
[16:20:54] <awight>	 Phantom42: :D ^
[16:20:58] <halfak>	 Ack!  No! 
[16:21:03] <awight>	 lol
[16:21:11] <halfak>	 We haven't updated the wikiclass param set since we updated revscoring to 2.0!
[16:21:30] <halfak>	 OK so here's the solution.  Check out the editquality param set here:
[16:21:38] <halfak>	 https://github.com/wiki-ai/editquality/blob/master/config/classifiers.params.yaml
[16:21:40] <awight>	 I did find a —label-type param, which I crudely butchered out.
[16:21:53] <halfak>	 Note that you see lines like "revscoring.scoring.models.GradientBoosting"
[16:22:04] <halfak>	 Rather than "sklearn.ensemble.GradientBoostingClassifier"
[16:22:25] <awight>	 halfak: gotcha.  I’ll see if I can adapt it, always nice to explore new alleyways.
[16:22:29] <halfak>	 This is because we now use our own "scoring.Model" classes rather than sklearn directly. 
[16:22:35] <halfak>	 \o/  Thanks awight 
[16:22:43] <halfak>	 Just got done with mega meeting
[16:22:53] <halfak>	 Will be working on ores.wmflabs.org --> Stretch soon. 
[16:23:01] <awight>	 yuck.  OK great!
[16:24:04] <codezee>	  halfak does revscoring cv_train save the statistics when we save the model?
[16:24:17] <halfak>	 yes
[16:24:24] <halfak>	 The statistics are stored inside of the model. 
[16:24:29] <halfak>	 awight, +1 for yuck!
[16:24:37] <halfak>	 :D  But it's a really good test ^_^
[16:24:54] <halfak>	 It's very exciting for me to be back to normal work
[16:25:00] <halfak>	 Conferences get old after a while :) 
[16:25:06] <halfak>	 I missed y'all
[16:25:09] <awight>	 halfak: Should I comment out SVC?
[16:25:26] <halfak>	 awight, +1  SVC is super slow and bad 
[16:25:33] <halfak>	 It never wins.  just wastes time. 
[16:25:41] <awight>	 {{done}}
[16:25:41] <AsimovBot>	 How efficient, awight!
[16:26:23] <awight>	 n.b. I noticed that the “RandomForestClassifier” key is still inconsistent with the others, which have had the “Classifier” suffix stripped.
[16:28:42] <codezee>	 awight: are you refering to classifier names in revscoring/scoring/models?
[16:29:18] <awight>	 codezee: These are the keys in config/classifiers.params.yaml —I’m not sure what they correspond to, yet.
[16:29:32] <codezee>	 awight: those keys correspond to the classifier to use
[16:30:05] <codezee>	 awight: oh sorry, you're on wikiclass right?
[16:31:11] <awight>	 codezee: I’m updating wikiclass/config, yeah
[16:31:24] <codezee>	 awight: that key is just a key to the config in yaml, the real thing is the "class:" attribute from which it'll pick up classs
[16:31:33] <awight>	 tgr|away: Argh, donno how I missed the meeting notification!  Is there anything I can help with, halfway through?
[16:31:46] <awight>	 codezee: oho, thanks.  So the keys are arbitrary?
[16:32:53] <codezee>	 awight: yes, see - https://github.com/wiki-ai/revscoring/blob/master/revscoring/utilities/tune.py#L253
[16:33:02] <codezee>	 its the "name" part
[16:33:36] <codezee>	 awight: btw i use the latest config for drafttopic , sth like this - https://dpaste.de/19Do
[16:34:01] <awight>	 codezee: Nice, ty
[16:34:06] <codezee>	 np :)
[16:34:52] <awight>	 note to self: draftquality needs an updated config as well.
[16:36:46] <wikibugs>	 10Scoring-platform-team, 10ORES: Wikiclass tuning broken, needs revscoring 2 update - https://phabricator.wikimedia.org/T184727#3893930 (10awight)
[16:37:25] <Phantom42>	 Thank you awight! Will try that now 
[16:37:33] <awight>	 Phantom42: It’s working for me!
[16:38:09] <wikibugs>	 10Scoring-platform-team, 10ORES: Wikiclass tuning broken, needs revscoring 2 update - https://phabricator.wikimedia.org/T184727#3893621 (10awight) https://github.com/wiki-ai/wikiclass/pull/58
[16:43:04] <wikibugs>	 10Scoring-platform-team, 10ORES: Wikiclass tuning broken, needs revscoring 2 update - https://phabricator.wikimedia.org/T184727#3893940 (10awight) https://github.com/wiki-ai/draftquality/pull/18
[16:43:12] <codezee>	 awight: btw, after the fast scoring change merge, the model building should take 5 times less time...I'm hoping
[16:43:18] <wikibugs>	 10Scoring-platform-team, 10ORES: Tuning broken in some repos, needs revscoring 2 update - https://phabricator.wikimedia.org/T184727#3893941 (10awight)
[16:43:39] <awight>	 codezee: wat!  I missed this, is this about reducing the number of estimators for drafttopic?
[16:44:34] <codezee>	 awight: this was a general revscoring change that would improve tuning times for every model by 5 times approx
[16:44:52] <codezee>	 by scoring items in a bunch rather than one by one
[16:44:59] <awight>	 Fantastic, thanks for scaling us!
[16:45:41] <codezee>	 awight: https://github.com/wiki-ai/revscoring/pull/388 there hasn't been any substantial model building after that so i'm hoping you could report on that
[16:45:55] <awight>	 https://img00.deviantart.net/bfed/i/2011/099/6/1/shrinking_or_growing__by_cannibalcupcake-d3dl5hn.jpg
[16:46:39] <awight>	 codezee: Cool.  I’m running the wikiclass enwiki tuning on ores-misc-01, if that’s going to be comparable with any older data points?
[16:47:16] <awight>	 darn!  Our tuning reports don’t include total run time!
[16:49:22] <codezee>	 awight: i don't suppose a quantitative comparison is possible , was just hoping if you could related to by memory.... :D
[16:49:48] <awight>	 codezee: This is the first time I’ve run a tuning report /o\
[16:50:47] <wikibugs>	 10Scoring-platform-team, 10ORES: Tuning broken in some repos, needs revscoring 2 update - https://phabricator.wikimedia.org/T184727#3893974 (10Phantom42) Looks like there is one more minor problem with Makefile. `enwiki_tuning_reports` rule runs `tuning_reports/enwiki.wp10.md` and `tuning_reports/enwiki.nettro...
[16:50:53] <codezee>	 nevermind :P
[16:51:00] <wikibugs>	 10Scoring-platform-team, 10ORES, 10Performance: Tuning reports should give us a rough indication of algorithm performance - https://phabricator.wikimedia.org/T184743#3893975 (10awight)
[16:51:08] <awight>	 codezee: For next time ^
[16:51:15] <paladox>	 awight hi
[16:51:22] <paladox>	 about adding javascript to add a class
[16:51:29] <paladox>	 how would i use that in the css please?
[16:51:29] <awight>	 paladox: -releng?
[16:51:44] <paladox>	 awight for that gerrit change you reviewed :)
[16:51:55] <paladox>	 https://gerrit.wikimedia.org/r/#/c/402665/
[16:51:58] <awight>	 codezee: I’ll record the total time and we can see if anyone else remembers ballpark
[16:52:15] <awight>	 paladox: Totally, I was just suggesting #wikimedia-releng cos other people there might be interested.
[16:52:21] <paladox>	 ah i see
[16:52:36] <awight>	 paladox: My thought was that you could jam the .js line into the init() function
[16:52:48] <paladox>	 yep, it works in as far as it adds the class
[16:52:56] <paladox>	 but im wondering how do i use it?
[16:53:27] <awight>	 Oh great!  So the next change is to rewrite any CSS with the “rootNode” keyword, to instead be a top-level rule addressing .loginParent {
[16:53:59] <paladox>	 ah so
[16:54:08] <paladox>	 .loginParent body {
[16:54:10] <paladox>	 for example?
[16:54:34] <paladox>	 hmm doing this
[16:54:35] <paladox>	 html body .loginParent
[16:54:38] <paladox>	 does
[16:54:39] <paladox>	 https://gerrit.git.wmflabs.org/r/login/
[16:55:01] <awight>	 I’m confused about $root body
[16:55:11] <paladox>	 that's html
[16:55:18] <paladox>	 $root = html
[16:55:28] <paladox>	 so html body ($root body)
[16:55:48] <awight>	 paladox: Are you already using the newer gerrit?  Might want to use the same version as WMF
[16:56:22] <paladox>	 awight i am using gerrit 2.14. But the version dosen't really matter as we use GerritSite.css (gerrit 2.13 uses that too).
[16:56:28] <awight>	 paladox: I thought that $root was a trick to get $(‘.loginForm').parentNode
[16:56:42] <paladox>	 awight that was a eqcss  var
[16:56:52] <awight>	 paladox: ah cos I noticed that the element already has an ID that you can address, so we don’t need the extra JS code
[16:57:00] <paladox>	 yep
[16:57:11] <tgr>	 awight: thanks, it was mostly MediaWiki-related topics
[16:57:49] <awight>	 paladox: confirmed that it exists in WMF gerrit 2.13
[16:58:12] <awight>	 tgr: OK, thanks for handling!
[16:58:16] <Phantom42>	 awight: Looks like it works now! 77 model/param pairs! Thank you so much! 
[16:58:34] <paladox>	 yep
[16:58:34] <awight>	 Phantom42: \o/ any time I can help muddle through things, at your service
[16:59:25] <awight>	 paladox: You’re right, I see “$root which always refers to the HTML document.” on http://elementqueries.com
[16:59:33] <paladox>	 yep
[16:59:33] <awight>	 In that case, I have no idea what we’re doing here.
[16:59:53] <paladox>	 well what we are trying to do is apply this new css only if we are on the login page
[17:00:01] <paladox>	 otherwise it will apply it to the whole of gerrit
[17:00:19] <awight>	 paladox: ty.  OK so the mere existence of a matching element is enough...
[17:00:42] <paladox>	 yep
[17:01:28] <awight>	 So I’m a terrible hack, but perhaps you could use the login init() function to add a class to the body, then qualify all the login-only rules with “body.isLoginPage ..."
[17:02:17] <awight>	 The JS to add the class would look like, “document.body.classList.add"...
[17:03:27] <paladox>	 ah
[17:03:27] <paladox>	 ok
[17:03:33] <paladox>	 thanks will try with that
[17:04:08] <paladox>	 awight yay
[17:04:12] <paladox>	 that works i think
[17:04:12] <paladox>	 https://gerrit.git.wmflabs.org/r/login/
[17:04:23] <awight>	 Looks good!
[17:04:41] <awight>	 Seems like it’s not wrecking non-login pages either?
[17:05:04] <awight>	 Sorry to be a jerk about EQCSS, it just seems like overkill.
[17:05:38] <paladox>	 Heh yeh
[17:52:23] <wikibugs>	 10Scoring-platform-team, 10Analytics, 10Analytics-Wikistats, 10ORES: Discuss Wikistats integration for ORES - https://phabricator.wikimedia.org/T184479#3884392 (10Milimetric) totally, put a meeting on our calendar or let's chat here.
[17:54:54] <wikibugs>	 10Scoring-platform-team (Current), 10Beta-Cluster-Infrastructure, 10Recommendation-API, 10Release-Engineering-Team: What to do with deployment-sca03? - https://phabricator.wikimedia.org/T184501#3894240 (10Nuria)
[17:59:37] <awight>	 biab
[18:09:37] <codezee>	 Nettrom: quick q. - what tool do you use to plot graphs?
[18:11:06] <Nettrom>	 codezee: I use R and ggplot2
[18:31:58] <Nettrom>	 halfak: we chatted about using just the “OK” label from the draft quality model, let me know when you have a few minutes to dig more into that
[18:32:13] <halfak>	 Oh yeah.  So. 
[18:32:23] <halfak>	 You probably want to choose a threshold based on some constraints. 
[18:32:33] <halfak>	 E.g. 90% recall of non-OK drafts 
[18:33:53] * halfak gets a link/api call
[18:34:39] <halfak>	 https://ores.wikimedia.org/v3/scores/enwiki/?models=draftquality&model_info=statistics.thresholds.OK.%22maximum%20!precision%20@%20!recall%20%3E=%200.9%22
[18:34:55] <halfak>	 That is wrong 50% of the time when it flags something as !OK
[18:35:04] <halfak>	 But it catches 90% of !OK stuff. 
[18:35:17] <halfak>	 0.664 probability threshold. 
[18:37:56] <Nettrom>	 ah, so I can use that to figure out the cost/benefit of using different thresholds?
[18:39:13] * Nettrom thinks about this for a bit
[18:42:51] <halfak>	 Right :D 
[18:43:07] <halfak>	 It will also help you figure out the "meaning" of certain prediction "probabilities"
[18:43:28] <halfak>	 You could fit a spline to recall or precision so you can convert "probability" to your desired metric. 
[18:49:20] <codezee>	 halfak: while I was collecting statistics with different hyperparams, I saw that averaged best precision lies around 35% for drafttopic while recall is as high as 82%, does this suggest that its not missing out on topics that are assigned(high recall), and predicting some more(low precision) ?
[18:58:49] <wikibugs>	 10Scoring-platform-team, 10Collaboration-Community-Engagement, 10MediaWiki-extensions-ORES, 10Patch-For-Review, 10User-notice-collaboration: Deploy ORES filters to Simple Wikipedia - https://phabricator.wikimedia.org/T182012#3894528 (10Halfak) I don't think this is blocked from #ORES end.  Is there any i...
[19:03:06] <Nettrom>	 halfak: thanks for the link and the info, I might end up making a plot, but first I need to wrap my head around precision & recall metrics again to make sure I know what I want :)
[19:04:09] <halfak>	 codezee, average precision (aka PR-AUC) is a measure of precision and recall across the set of all thresholds. 
[19:04:28] <halfak>	 While the recall presented is a measure of recall at one specific threshold. 
[19:04:59] <halfak>	 Nettrom, +1 specificity and sensitivity (precision, recall) are weird to think about but really useful concepts. 
[19:05:38] <halfak>	 I usually use 90% recall for the damaging model as a "needs review" threshold because patrolers like it. 
[19:05:51] <halfak>	 "Let's make sure to catch at least 90% of the damage on the first pass" 
[19:06:02] <halfak>	 In practice, any obvious damage is going to get caught by that. 
[19:06:30] <Nettrom>	 yeah, that makes total sense
[19:06:34] <halfak>	 The stuff that doesn't get caught is often a False False-Positive ;) 
[19:07:12] <Nettrom>	 and I’m fairly sure I can make some good progress by figuring out a better threshold for the draft quality model, just need to make sure I understand it
[19:07:46] <halfak>	 Nettrom, see my brief essay http://socio-technologist.blogspot.com/2016/01/notes-on-writing-wikipedia-vandalism.html for some more thoughts 
[19:07:59] <Nettrom>	 excellent, thanks!
[19:08:02] <halfak>	 Should be relevant to the new article reviewing problem (at least WRT draftquality)
[19:08:18] <halfak>	 codezee, did you see my response earlier?
[19:09:50] <wikibugs>	 10Scoring-platform-team, 10Wikilabels, 10editquality-modeling, 10User-Tgr, 10artificial-intelligence: Complete edit quality campaign for Hungarian Wikipedia - https://phabricator.wikimedia.org/T167968#3894584 (10Halfak) Bot edits should not be included in the dataset.  Is it possible that some bots that...
[19:11:19] <codezee>	 halfak: so by averaged precision I meant the metric shown when we pass "precision.macro" to tune as fitness...and I'm assuming thats the precision of all classes averaged?
[19:11:36] <halfak>	 Oh!  Right. 
[19:12:00] <halfak>	 That "average precision" metric that sklearn has is such a confusing name.  I'm happy we're avoiding it :D 
[19:13:16] <codezee>	 halfak: also according to the implementation I suppose we're penalizing a prediction if its NOT in the true set, not the other way round right?
[19:14:14] <halfak>	 Right.  It's a "false positive".  One thing we could do is encourage editors to help us by cleaning up wikiproject tags on our train/test set and re-extracting that data. 
[19:14:41] <halfak>	 E.g. we could give editors work lists of "missing" mid-level categories and given them a list of potentially relevant WikiProject tags. 
[19:15:29] <codezee>	 some papers that I read upon take the set difference of actual and predicted sets, thereby accounting both ways, do you think we should do that here?
[19:15:41] <wikibugs>	 (03CR) 10Catrope: [C: 032] Tentatively re-enable ORES filters on RecentChangesLinked [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403479 (https://phabricator.wikimedia.org/T179718) (owner: 10Sbisson)
[19:15:57] <codezee>	 so a penalization even if we missed on predicting a tag
[19:16:47] <halfak>	 codezee, yeah. that will penalize recall
[19:17:12] <halfak>	 codezee, I don't feel strongly about that.  but I suppose we could add a metric for it if you felt strongly. 
[19:17:27] <halfak>	 I think I like the nuanced metrics for each target class :) 
[19:18:05] <halfak>	 I just remembered that a lot of problems could be solved by fixing the directory hierarchy too. :) 
[19:18:27] <halfak>	 Again, that'd be on Wikipedians. 
[19:18:38] <halfak>	 I'm guessing we'll see a lot of that when Wikipedians first start using the tool :D 
[19:22:44] <wikibugs>	 (03Merged) 10jenkins-bot: Tentatively re-enable ORES filters on RecentChangesLinked [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403479 (https://phabricator.wikimedia.org/T179718) (owner: 10Sbisson)
[19:25:30] <codezee>	 Oh...I got confused, the current implementation already does a kind of set difference and is penalizing both ways so nothing to worry...
[19:27:03] <codezee>	 yes, +1 for directory hierarchy...
[19:30:25] <wikibugs>	 (03CR) 10jenkins-bot: Tentatively re-enable ORES filters on RecentChangesLinked [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403479 (https://phabricator.wikimedia.org/T179718) (owner: 10Sbisson)
[19:34:41] <halfak>	 :) 
[19:34:51] <halfak>	 OK emails done.  Time to start working on ores.wmflabs.org
[19:35:15] <halfak>	 I have 1.5 hours before I must wrap up and leave. :/
[19:37:48] <wikibugs>	 10Scoring-platform-team (Current), 10ORES: Back up ores-misc-01 to ores-staging-01 - https://phabricator.wikimedia.org/T184765#3894658 (10Halfak)
[19:38:07] <wikibugs>	 10Scoring-platform-team (Current), 10ORES: Convert ores-misc-01 to stretch - https://phabricator.wikimedia.org/T184766#3894669 (10Halfak)
[19:38:38] <wikibugs>	 10Scoring-platform-team (Current), 10ORES: Convert ores-misc-01 to stretch - https://phabricator.wikimedia.org/T184766#3894682 (10Halfak)
[19:38:40] <wikibugs>	 10Scoring-platform-team (Current), 10ORES: Back up ores-misc-01 to ores-staging-01 - https://phabricator.wikimedia.org/T184765#3894681 (10Halfak)
[19:38:56] <wikibugs>	 10Scoring-platform-team (Current), 10ORES: Convert CloudVPS instances to stretch. - https://phabricator.wikimedia.org/T184296#3894684 (10Halfak)
[19:38:58] <wikibugs>	 10Scoring-platform-team (Current), 10ORES: Convert ores-misc-01 to stretch - https://phabricator.wikimedia.org/T184766#3894669 (10Halfak)
[19:39:15] <halfak>	 Anyone see if awight said he'd be back soon?
[19:39:43] <halfak>	 Last I saw was "biab" 1.5 hours ago. 
[19:43:03] <codezee>	 halfak: not able to scp to ores-staging from ores-misc, that not possible?
[19:47:46] <Zppix>	 @seen awight
[19:47:46] <AsimovBot>	 04Error: Command “seen” not recognized. Please review and correct what you’ve written.
[19:48:06] <halfak>	 It's a pain to do that.  You'll need to rsync from remote to remote.  It passes the data through your computer so it's not exactly performant. 
[19:48:07] <Zppix>	 wm-bot4: your supposed to answer :/
[19:48:32] <Zppix>	 Is @seenon not enable here?
[19:50:37] <codezee>	 :/ means i'll have to rebuild models.... then
[19:50:50] <codezee>	 i'll copy the important reports in that case
[20:02:17] <halfak>	 codezee, important reports should be committed to the repo 
[20:02:21] <halfak>	 Same with important models ^_^
[20:02:39] <halfak>	 Usually the only think I need to copy is datasets that are painful (slow) to re-produce. 
[20:03:53] <Zppix>	 Like enwiki's?
[20:04:39] <halfak>	 Depends on the dataset
[20:05:06] <halfak>	 E.g. working from a random sample for enwiki is pretty quick :) 
[20:08:43] <codezee>	 halfak: yes I'll copy them eventually, I have these reports of varying one hyperparameter keeping others constant for reporting them
[20:08:54] <codezee>	 I'll commit the main statistics report and model in a commit
[20:09:19] <halfak>	 kk.  Others could be copied then. 
[20:09:24] * halfak looks for a better way
[20:27:30] <halfak>	 OK.  I got ores-web-01 set up with new code.  My plan is to switch that node in.  It should be able to talk to the old celery nodes.  We'll find out.
[21:00:07] <awight>	 codezee: Looks like the tuning run took 2.5hr
[21:04:54] <codezee>	 awight: how many models x params did it run?
[21:05:34] <awight>	 codezee: You can see results in ores-misc-01:/srv/awight/wikiclass/tuning_reports/enwiki.nettrom_wp10.md, if u want more detail.  Lemme see...
[21:05:54] <codezee>	 awight: nevermind if its from config file i can see
[21:06:20] <awight>	 Really?  I’m having a hard time interpreting.
[21:07:05] <awight>	 argh scrollback limit
[21:08:09] <awight>	 I guess it’s just the product of options in each line?  So 4^3 + 2*3 + 1 + 1 + 7*5*2
[21:08:26] <codezee>	 awight: its simple, yes
[21:08:27] <awight>	 142
[21:08:28] <codezee>	 that one
[21:09:06] <codezee>	 awight: just can you also tell the number of entries in enwiki.nettrom.wp10?
[21:09:33] <awight>	 I could wc -l, but there’s lots of header junk
[21:09:56] <awight>	 186 lines, so I think my 142 guess is on point.
[21:10:12] <codezee>	 awight: sorry, i meants the dataset...
[21:10:15] <codezee>	 :/
[21:10:19] <awight>	 32k or so
[21:10:21] <codezee>	 *meant
[21:10:32] <awight>	 32,450
[21:10:37] <codezee>	 oh, then it seems to be pretty fast scoring 142 classifier runs
[21:10:51] <awight>	 Does that account for CV?
[21:10:55] <awight>	 5-fold.
[21:11:06] <awight>	 so 32k * 142 * 5?  or 6?
[21:11:08] <codezee>	 awight: tune does a 5 fold CV
[21:11:14] <codezee>	 so yes
[21:11:30] <awight>	 I think training does the CV folds, then a final run over all the data
[21:11:45] <awight>	 oh it’s tricker, cos each fold is actually 4/5 of the data.
[21:12:18] <awight>	 4/5 * 5 = 4 :)
[21:16:09] <wikibugs>	 10Scoring-platform-team (Current), 10ORES: Convert CloudVPS instances to stretch. - https://phabricator.wikimedia.org/T184296#3895041 (10Halfak) ores-web-01 is created and configured, but not pooled.  It seems to work fine to request scores from this node and it is able to talk to celery as planned.
[21:16:32] <awight>	 halfak: How is the wheels thing not a problem?
[21:17:40] <halfak>	 Not sure I'd wee how it could be a problem
[21:17:58] <halfak>	 The Jessie instances are running Jessie wheels and the one Stretch instance is running Stretch wheels. :) 
[21:18:01] <awight>	 halfak: We were talking about the wheel versioning stuff...
[21:18:02] <awight>	 AH
[21:18:04] <awight>	 great.
[21:18:07] <halfak>	 Same exact version of celery :) 
[21:18:08] <awight>	 thanks for doing all that, then :)
[21:18:14] <awight>	 yep that should be fine.
[21:18:19] <halfak>	 Which means we can switch out the web node first and then the celery nodes second. A
[21:18:28] <halfak>	 Also we can probably switch them out one half at a time :) 
[21:18:37] <halfak>	 Regretfully I need to go get on an airplane. 
[21:18:43] <awight>	 o/
[21:18:46] <halfak>	 So it'll need to wait until i have more time. 
[21:18:53] <halfak>	 I'll be AFK tomorrow and Saturday. 
[21:18:54] <Nettrom>	 halfak: safe travels!
[21:18:56] <halfak>	 Should be around on Sunday. 
[21:19:05] <awight>	 hahahaahahahahaha
[21:19:10] <halfak>	 Looks like Monday is a holiday.  I plan to show up for our sync meetings and take the rest of the day off. 
[21:19:11] <awight>	 you and whose army.
[21:19:25] <halfak>	 Thanks Nettrom 
[21:19:26] <halfak>	 :) 
[21:19:41] <awight>	 OK good warning.  I’ll… try to make the Monday meeting.
[22:36:35] <wikibugs>	 10Scoring-platform-team, 10MediaWiki-extensions-ORES, 10Release-Engineering-Team: How do I test my extension's maintenance scripts? - https://phabricator.wikimedia.org/T184775#3895786 (10awight)
[22:56:23] <wikibugs>	 (03CR) 10jenkins-bot: Localisation updates from https://translatewiki.net. [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403815 (owner: 10L10n-bot)
[23:18:51] <wikibugs>	 (03PS1) 10Awight: Steal model fixtures for TestHelper; add dirty tricks [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403838 (https://phabricator.wikimedia.org/T184140)
[23:18:53] <wikibugs>	 (03PS1) 10Awight: Add a maintenance script test [extensions/ORES] - 10https://gerrit.wikimedia.org/r/403839 (https://phabricator.wikimedia.org/T184140)