[00:19:32] <awight>	 halfak: fyi I’ve gathered the things we did and threw them into rough categories, https://phabricator.wikimedia.org/phame/post/view/77/status_update_october_6_2017/
[00:19:44] <awight>	 I’ll add human language by early next week...
[00:19:54] <awight>	 *have thrown
[00:20:46] <wikibugs_>	 (03PS1) 10Sbisson: RCFilters: default highlight according to preference [extensions/ORES] - 10https://gerrit.wikimedia.org/r/382632 (https://phabricator.wikimedia.org/T172757)
[01:25:54] <wikibugs_>	 10Scoring-platform-team, 10MediaWiki-Vagrant, 10MediaWiki-extensions-ORES: Can't enable ores role in vagrant - https://phabricator.wikimedia.org/T177555#3663220 (10Mooeypoo) I manually updated composer afterwards in vagrant-ssh, and then ran `foreachwiki update.php --quick` and added the wg variables for ORE...
[01:44:00] <wikibugs_>	 10Scoring-platform-team (Current), 10Collaboration-Team-Triage, 10ORES, 10Patch-For-Review: Make RCFilters compatible with both the old and new thresholds APIs - https://phabricator.wikimedia.org/T175053#3663229 (10awight) The code is in good shape and ready for another review.
[12:49:11] <wikibugs_>	 (03CR) 10Ladsgroup: "I think this helps in the performance of the queries, is there any chance of looking into this?" [extensions/ORES] - 10https://gerrit.wikimedia.org/r/349235 (https://phabricator.wikimedia.org/T163337) (owner: 10Ladsgroup)
[12:50:57] <wikibugs_>	 (03Abandoned) 10Ladsgroup: Use DISTINCT option on ChangesList select [extensions/ORES] - 10https://gerrit.wikimedia.org/r/349235 (https://phabricator.wikimedia.org/T163337) (owner: 10Ladsgroup)
[14:09:29] <halfak>	 o/
[14:56:47] <wikibugs_>	 10Scoring-platform-team (Current), 10ORES, 10revscoring, 10artificial-intelligence: Revscoring 2.0 takes up too much memory - https://phabricator.wikimedia.org/T177544#3664639 (10Halfak) Just thought I should check on the draft quality model.  Loading just that into memory and got the biggest boost to RES...
[15:03:02] <awight>	 halfak: o/
[15:03:21] <awight>	 The extension code would benefit from eyeballs if you have them to spare.
[15:03:39] <halfak>	 will do.  I'm very worried about memory usage today.  Will do a bit of both :) 
[15:03:50] <halfak>	 FYI: https://phabricator.wikimedia.org/T177544
[15:04:05] <awight>	 I’ll stand up for my new thresholds logic, but the fallback stuff is kludgey as hell
[15:04:17] <awight>	 halfak: ah ok.  That’s equally important IMO
[15:06:02] <awight>	 34k per model would be outstanding.
[15:06:09] <awight>	 aren’t those MB, though?
[15:07:07] <awight>	 yeah.
[15:08:55] <awight>	 halfak: Could we store thresholds in a database rather than explicitly in-memory?
[15:09:02] <awight>	 It’s accessed infrequently.
[15:09:28] <awight>	 indexes would be ideal, come to think of it.
[15:09:49] <awight>	 O(1) get
[15:18:22] <halfak>	 hey!  just got in a meeting done in 40 mins
[15:21:55] <awight>	 It would be really slick if the threshold lookup code tried a database first, and if unconfigured falls back to more granular stats merged into the model file.
[15:37:44] <awight>	 So this hypothetical thresholds state db would be in MySQL.
[15:38:51] <awight>	 How about, it has a basic schema with the indexed columns, then the remainder of statistics for each threshold are a json blob?
[15:54:18] <awight>	 > Differences between the current environment and the environment in which the model was constructed ...
[15:54:22] <awight>	 that’s awesome.
[15:57:31] <awight>	 I love that it’s just a warning, too 8D
[16:00:08] <travis-ci>	 wiki-ai/revscoring#1257 (oneliners - a8092bc : Adam Roses Wight): The build passed. https://travis-ci.org/wiki-ai/revscoring/builds/284273726
[16:08:22] <Amir1>	 halfak: did you add pronunciation statement to ORES item?
[16:08:33] <Amir1>	 we have a bet here
[16:13:26] <wikibugs_>	 10Scoring-platform-team (Current), 10ORES, 10revscoring, 10artificial-intelligence: Revscoring 2.0 takes up too much memory - https://phabricator.wikimedia.org/T177544#3662554 (10awight) Just playing around, I dumped the thresholds table to json: ``` m = Model.load(open("models/enwiki.damaging.gradient_boo...
[16:14:30] <awight>	 halfak: ^ Am I forgetting some orthogonal dimension, or does m.info['statistics'].label_thresholds have the right order of magnitude size for what we’re looking to store?
[16:19:04] <awight>	 Yeah it seems to have the thresholds for each label.  LOL so the hydrated form is 20,000x bigger?
[16:46:58] <halfak>	 OMG dog problems
[16:47:15] <halfak>	 I have moles in my yard and Isla is trying to get them -- ruining all my hard work in the process
[16:47:34] <halfak>	 Amir1, I didn't add the pronunciation statement
[16:47:36] * halfak searches
[16:48:31] <Amir1>	 Lydia_WMDE: I won
[16:48:35] <halfak>	 Wait.  is there a wikidata item for ORES?
[16:49:26] <halfak>	 awight, not sure what you mean 20,000x bigger
[16:49:56] <Amir1>	 There are some but not for ORES as far as I checked
[16:50:07] <Amir1>	 we have for the pages
[16:50:31] <halfak>	 Oh gotcha.  Does ORES need an item?  That would be cool but I'm not sure what it would be used for. 
[16:50:46] <Amir1>	 Definitely it will be super cool
[16:54:08] * halfak has a COI ;0 
[16:54:25] <awight>	 halfak: It seems that we can dump the entire info[‘statistics’].label_thresholds to json in just 3.5MB
[16:55:09] <halfak>	 awight, for which model?
[16:55:36] <awight>	 enwiki.damaging
[16:56:26] <halfak>	 That's a lot of space still. 
[16:56:28] * awight checks draftquality
[16:56:31] <halfak>	 And one of the smaller ones
[16:56:37] <halfak>	 yeah.  draftquality will be a lot bigger, I think
[16:56:40] <halfak>	 like 30MB
[16:56:45] <halfak>	 or maybe 100MB
[16:57:55] <halfak>	 "In [6] used 824.5898 MiB RAM"  Just loaded the draftquality model into memory 
[16:57:55] <AsimovBot>	 El búfer 6 está vacío.
[16:57:59] <halfak>	 O_O
[16:58:17] <awight>	 hahaha
[17:00:28] <awight>	 True that 3.5MB is a lot, but only 20% of a 16MB serialized model.  Actually, is the model storing a json or a serialized ModelInfo of the threshold stats?
[17:00:41] <halfak>	 pickle serialized. 
[17:01:15] <halfak>	 According to my estimate, the draft quality model's model_info formatted as JSON is 129 MB
[17:01:34] <halfak>	 Which is about the size of the serialized model. 
[17:01:52] <awight>	 I was off by 20k about that 20,000x thing, sorry.  For some reason I was thinking your numbers applied to each *row*
[17:02:21] <halfak>	 We need to stop storing threshold information for *literally ever threshold* it seems. 
[17:02:38] <awight>	 As dirty as the database suggestion is, I’m attracted to the fact that we use indexes for exactly what they’re meant to do.
[17:02:57] <halfak>	 ^?
[17:03:20] <awight>	 I was floating a crazy idea an hour ago
[17:03:31] <halfak>	 Oh I must have missed that while in meeting. 
[17:03:36] <awight>	 that the threshold stats could optionally be stored in mysql
[17:03:50] <halfak>	 " thresholds state db"
[17:03:51] <halfak>	 ?
[17:04:04] <awight>	 yes.  max(recall) above precision X is really elegant in a db
[17:04:17] <halfak>	 Oh!  Yeah.  I see what you are saying there. 
[17:04:25] <awight>	 and we only need to call it once per day, per stat per model.
[17:04:42] <halfak>	 The JSON is 80.5MB when pickled rather than JSON dump'd
[17:05:07] <awight>	 It messes up the whole thing you had going with the self-contained models, but I was saying we could provide a few granular stats in the model.
[17:05:19] <halfak>	 And 90.9MB when I just pickle the whole object without JSON formatting. 
[17:05:20] <awight>	 as a fallback.
[17:05:54] <halfak>	 awight, yeah, that's a bummer.  I do really like keeping them together.  But we could have parallel outputs I think
[17:06:01] <halfak>	 foo.model, foo.model_info
[17:06:18] <halfak>	 And we could provide a checksum in foo.model to make sure foo.model_info is right. 
[17:07:47] <awight>	 .model can still be loaded as if it’s a single thing, and it is responsible for loading its own model_info either from db or sister file.
[17:08:19] <awight>	 akin to loading its dictionary
[17:08:42] <halfak>	 awight, we wouldn't want to load it by default every time we load the model though. 
[17:08:47] <awight>	 +1
[17:08:50] <awight>	 that’s the win
[17:08:54] <halfak>	 Because celery doesn't need the info and uwsgi doesn't need the model. 
[17:09:26] <awight>	 & if the model_info is available via db, we never have to load the detailed info into memory, that’s all done on an external db
[17:10:05] <awight>	 How do you think this wonky scheme compares to just cutting down on granularity...
[17:11:39] <halfak>	 awight, short term, cut down on granularity.  Long term, consider alternative schemes. 
[17:11:51] <awight>	 yup.
[17:12:18] <halfak>	 OK I have confirmed: A tuple of 20 ints * 200k rows = ~160MB in python memory
[17:12:51] <halfak>	 So that says we're doing an OK job of efficiently storing our data. 
[17:13:00] <halfak>	 In classes that is.  It's about the same as a tuple
[17:14:22] <awight>	 Are you prepared to cut granularity by 100-500x?
[17:16:56] <halfak>	 Yes.  I think so.  
[17:17:06] <halfak>	 The hard part is identifying the *right* place to cut granularity. 
[17:17:18] <halfak>	 I need some information theoretic measure of "important" thresholds. 
[17:17:29] <halfak>	 OR we can just round to 4 decimals. 
[17:18:00] <halfak>	 That's the difference between 20MB and 129MB
[17:18:01] <wikibugs_>	 10Scoring-platform-team (Current), 10ORES, 10revscoring, 10artificial-intelligence: Reduce label_thresholds granularity - https://phabricator.wikimedia.org/T177636#3665251 (10awight)
[17:18:35] <halfak>	 If we go down to 3 decimals, we'll be down to 700MB
[17:19:02] <awight>	 halfak: My initial thought (recorded in that task) is that we could have the segments between each data point stay within a certain distance from the “real” function, and then interpolate when we do the calculations.
[17:19:27] <awight>	 I don’t think it would be too hard to do over multiple functions, assuming that algorithm doesn’t already exist.
[17:20:18] <halfak>	 awight, the real problem is having the thresholds reported represent the variance in statistics at threshold optimally. 
[17:20:23] <awight>	 Just iterate and keep track of the deviation between the quantized and actual line, then write a point before it exceeds your epsilon
[17:20:37] <halfak>	 E.g. we should have a lot of datapoints around high confidence and few around low confidence for uncommon classes. 
[17:20:37] <awight>	 halfak: I think my proposal solves that.
[17:21:06] <halfak>	 awight, OK yeah.  I like that idea.  Let's file it though and do something dumb :) 
[17:21:08] <awight>	 we carefully map any curves, and straight lines are just kept within a tolerance
[17:21:12] <awight>	 lolol
[17:21:13] <awight>	 agreed
[17:22:03] <halfak>	 I think we can set the default threshold digits for sklearn probability classifiers to 4 digits, retrain the models, and be done pretty soon. 
[17:22:37] <awight>	  \o/
[17:23:09] <halfak>	 I'll get a PR together.  We'll need to retrain the models overnight.  I think it'll go smooth because I just cleaned up the Makefile. 
[17:23:30] <awight>	 So this was only revscoring 2.0, true?
[17:23:44] <halfak>	 Somehow we 32 million thresholds in draftquality "OK" 
[17:23:46] <halfak>	 WTF
[17:23:48] <halfak>	 right
[17:23:54] <awight>	 LOL
[17:24:04] <halfak>	 Only information about a small set of thresholds in revscoring 1.3
[17:24:09] <halfak>	 like 5-10 thresholds. 
[17:24:18] <awight>	 gotcha
[17:24:38] <awight>	 We’re about to be blocked on file handles again.
[17:24:40] <halfak>	 Oh woops. 
[17:24:49] <halfak>	 32million chars in draftquality "OK
[17:24:56] <awight>	 ah
[17:25:05] * halfak re-checks rows
[17:25:26] <halfak>	 152k for no rounding
[17:25:40] <halfak>	 9131 for rounding at 4 digits
[17:25:51] <halfak>	 994 for rounding at 3 digits
[17:26:10] <halfak>	 I wonder if we should round at 3.  
[17:26:13] <halfak>	 Hmm.  
[17:26:42] <halfak>	 The difference between 3.7 MB and 470K
[17:26:55] <halfak>	 I like 3 digits. 
[17:27:04] * halfak works. 
[17:28:10] <awight>	 +1 the --
[17:28:32] * halfak races to get this together so he can go eat lunch
[17:40:00] <awight>	 I’ll be around to CR any time.
[17:47:38] <halfak>	 awight, https://github.com/wiki-ai/revscoring/pull/365
[17:49:13] <awight>	 looking
[17:50:13] <travis-ci>	 wiki-ai/revscoring#1258 (memory_usage - 09766f4 : halfak): The build passed. https://travis-ci.org/wiki-ai/revscoring/builds/284319896
[17:50:56] <wikibugs_>	 10Scoring-platform-team (Current), 10ORES, 10revscoring, 10artificial-intelligence: Revscoring 2.0 takes up too much memory - https://phabricator.wikimedia.org/T177544#3665375 (10Halfak) When formatting json, the thresholds are arounded and limited.  In this case, the default is 4 decimal places.  You can...
[17:51:06] <wikibugs_>	 10Scoring-platform-team (Current), 10ORES, 10revscoring, 10artificial-intelligence: Revscoring 2.0 takes up too much memory - https://phabricator.wikimedia.org/T177544#3665376 (10Halfak) https://github.com/wiki-ai/revscoring/pull/365
[17:53:19] <halfak>	 Running to lunch/next meeting 
[17:53:22] <halfak>	 Back in 30-45
[17:53:37] <awight>	 halfak: util.round doesn’t eliminate any values from the list.  does the grouping do that?
[17:53:41] <awight>	 k see you
[17:55:23] <awight>	 ok confirmed that itertools.groupby does exactly that.
[17:59:54] * awight scratches head trying to figure out how to regenerate just the stats on a model
[18:05:41] <awight>	 Something about > self.info['statistics'].fit(score_labels)
[18:07:49] <awight>	 revscoring test_model
[18:17:47] <awight>	 revscoring test_model models/enwiki.damaging.gradient_boosting.model damaging --model-file=models/enwiki.da
[18:17:48] <awight>	 maging.gradient_boosting-round3.model --observations=datasets/enwiki.labeled_revisions.w_cache.20k_2015.json
[18:22:24] <awight>	 halfak: oops, something I didn’t catch before merging.
[18:22:25] <awight>	   File "/media/sf_work/revscoring/revscoring/scoring/statistics/classification/classification.py", line 112, in fit
[18:22:26] <awight>	     threshold_ndigits=self.threshold_ndigits,
[18:22:27] <awight>	 AttributeError: 'Classification' object has no attribute 'threshold_ndigits'
[18:22:35] <awight>	 Probably simple...
[18:24:12] <awight>	 Weird.  Maybe it’s initializing .info with the serialized statistics
[18:30:47] <awight>	 meh I’ll just retrain with 100 observations
[18:34:21] <awight>	 1000.
[18:34:23] <awight>	 love the makefile
[18:35:22] <awight>	 fwiw, I got about 370 thresholds, and they’re all unique to 3 decimals.
[18:35:39] <awight>	 I’ll dial that down to 2 decimals just to feel like I’ve done my smoke test due diligence.
[18:37:30] <awight>	 Success.  thresholds are rounded to 2 places now, and there are c. 130 now.
[18:39:54] <wikibugs_>	 10Scoring-platform-team: revscoring model_info display should include target prediction value - https://phabricator.wikimedia.org/T177649#3665605 (10awight)
[18:40:10] <wikibugs_>	 10Scoring-platform-team: revscoring model_info display should include target prediction value - https://phabricator.wikimedia.org/T177649#3665617 (10awight) p:05Triage>03Lowest
[18:41:54] <awight>	 halfak: Helpful if I deploy that to labs?
[18:42:15] <awight>	 Or should that wait until you retrain a few models to test those?
[19:10:14] <halfak>	 awight, need to retrain
[19:20:52] <halfak>	 awight, https://gerrit.wikimedia.org/r/382765
[19:33:39] * halfak begins the process of rebuilding the files on our new big beefy stats machine. 
[19:35:21] <wikibugs_>	 10Scoring-platform-team (Current), 10ORES, 10revscoring, 10artificial-intelligence: Revscoring 2.0 takes up too much memory - https://phabricator.wikimedia.org/T177544#3665826 (10Halfak) OK Just released revscoring 2.0.8.  Now I'm going to rebuild all of the models -- starting with the big set of editquali...
[19:37:46] <awight>	 halfak: for that you just “rm models/*” and make?
[19:37:57] <awight>	 thinking about how -j should work...
[19:38:31] <halfak>	 Yup. 
[19:38:51] <awight>	 I should have pushed to packagist…
[19:39:02] <awight>	 thanks for deploying!
[19:39:18] <awight>	 so, the file handles…
[19:39:22] <halfak>	 Looks like stats machine is broken.  Moving to ores-misc-01
[19:39:26] <awight>	 oof
[19:40:28] <awight>	 I’ve read that the null handles come from files that are opened and later deleted.
[19:41:01] <halfak>	 No idea what that would be. 
[19:41:09] <halfak>	 I can look at that now though.  Lotsa waiting ahead. 
[19:41:17] <halfak>	 Oh!  I should review your extension work. 
[19:41:18] <awight>	 ty
[19:41:22] <halfak>	 Maybe Amir1 can get that in beta tomorrow. 
[19:42:40] <awight>	 Just realized that the celery worker recovers thanks to a kick from puppet.
[19:45:14] <awight>	 What do you think about a “pending deployment” column?
[19:46:33] <wikibugs_>	 10Scoring-platform-team (Current), 10ORES, 10Operations, 10Patch-For-Review, and 2 others: Stress/capacity test new ores* cluster - https://phabricator.wikimedia.org/T169246#3665899 (10awight)
[19:46:36] <wikibugs_>	 10Scoring-platform-team (Current), 10ORES, 10Operations, 10Patch-For-Review: Give ores admins read access to /srv/log/ores/main.log* - https://phabricator.wikimedia.org/T175736#3601826 (10awight) 05Open>03Resolved
[19:49:52] <halfak>	 awight, seems like a good idea to me
[19:50:10] <awight>	 We can try for a while, at least.
[19:50:17] <halfak>	 awight, not sure what I can do in this review.  I can't comment on any good practices with PHP/MW
[19:50:20] <halfak>	 It all looks bad to me :) 
[19:51:07] <halfak>	 I like that you have a function for the explicit formula conversion
[19:51:26] <awight>	 lol
[19:51:43] <halfak>	 It's hacky and contained
[19:51:49] <awight>	 Well, I can walk you through the fallback if you want.  It’s not wholesome.
[19:52:22] <awight>	 The scariest maneuver is when I put garbage into the new-thresholds cache as a reminder to not try fetching for another minute.
[19:54:04] <awight>	 Pretty sure the ripcords are easy to find when we dump old-thresholds support
[19:54:39] <halfak>	 Is that the empty array you put in memcached?
[19:54:42] <awight>	 yup
[19:55:37] <awight>	 I might actually try a git headstand for fun, to split this and make the fallback revertable.
[19:56:37] <halfak>	 I'm a little unnerved to see "damaging" in a plane string and not "goodfaith"
[19:57:57] <halfak>	 plain
[19:58:57] <halfak>	 Is that just to check if we're v1 or v2?
[19:59:07] <halfak>	 *1.3 or 2.0?
[20:01:34] <halfak>	 Serialized model files are smaller by about 66%
[20:01:39] <halfak>	 with revscoring 2.0.8
[20:05:41] <halfak>	 Ok other than that one Q I think I'm ready to +1 the extension work
[20:05:53] <halfak>	 I'm going to look at filedecriptors and redis. 
[20:07:11] <halfak>	 I'm doing so much engineering work today. This is awesome!
[20:13:37] <wikibugs_>	 (03PS18) 10Awight: Support new thresholds API [extensions/ORES] - 10https://gerrit.wikimedia.org/r/380893 (https://phabricator.wikimedia.org/T175053)
[20:13:40] <wikibugs_>	 (03PS1) 10Awight: Fallback to old thresholds API as necessary [extensions/ORES] - 10https://gerrit.wikimedia.org/r/382778 (https://phabricator.wikimedia.org/T175053)
[20:14:59] <wikibugs_>	 (03CR) 10jerkins-bot: [V: 04-1] Support new thresholds API [extensions/ORES] - 10https://gerrit.wikimedia.org/r/380893 (https://phabricator.wikimedia.org/T175053) (owner: 10Awight)
[20:15:01] <wikibugs_>	 (03CR) 10jerkins-bot: [V: 04-1] Fallback to old thresholds API as necessary [extensions/ORES] - 10https://gerrit.wikimedia.org/r/382778 (https://phabricator.wikimedia.org/T175053) (owner: 10Awight)
[20:15:20] <halfak>	 awight, do you know if celery workers are using a lot of file handles? 
[20:15:38] <halfak>	 And I'm wondering how I could check file handles used on my own machine. 
[20:32:38] <Keegan>	 halfak: awight: I went back and spammed the contact list with the link to sign up for the JADE feedback group themselves. Strategy worked, picked up a bunch more. I'll follow up on Monday or so.
[20:37:25] <Zppix>	 o/
[20:37:44] <Zppix>	 Keegan:  link? and what would the group do exactly?
[20:38:56] <Keegan>	 Zppix: Not like a formal thing. It's just a mass message list to send a notice when there's something needing feedback. As you follow in this channel, you'll likely already know about whatever is being sent out. But you're welcome to sign up. 
[20:38:58] <Keegan>	 https://meta.wikimedia.org/wiki/Global_message_delivery/Targets/JADE
[20:39:21] <Keegan>	 Every so often a "Hi, this thing about JADE needs some attention <link>"
[20:39:35] <Keegan>	 This way the team knows they're not just communicating into the dark
[20:40:01] <Zppix>	 Ah i dont need any more mass message messages (say that 10x fast) im in here everytime im on irc and i get notifs from the scoring platform team project on phab
[20:40:58] <Keegan>	 right
[20:42:25] <Zppix>	 Im kinda on pause for dev with ores and such until I find time to setup my ubuntu vm with vagrant
[20:57:39] <halfak>	 o/  was in meeting. 
[20:57:42] <halfak>	 reading scrollback
[20:57:49] <halfak>	 Nice, Keegan :) 
[20:58:00] <Zppix>	 halfak:  do you ever not have a meeting lol
[20:58:00] <halfak>	 I responded to Baba Tabita on the talk page. 
[20:58:07] <halfak>	 GOOD QUESTION 
[20:58:10] <halfak>	 Seriously though
[20:58:43] * Zppix hands halfak  a nice cold pint
[21:00:10] <halfak>	 :D 
[21:00:22] <halfak>	 I could use it.  Almost to EOD on Friday!  Wooo
[21:00:42] <Zppix>	 end of day?
[21:00:45] <Zppix>	 (eod)?
[21:00:55] <halfak>	 OK so I've confirmed that the redis connection in celery workers is *not* accounting for the bunch of file handles. 
[21:00:56] <halfak>	 yeah
[21:01:00] <halfak>	 eod = end of day
[21:09:20] <Zppix>	 halfak:  legoktm just mentioned a huge possible legal issue on the post from Keegan  on wikitech-l
[21:09:46] <halfak>	 Oh?
[21:10:11] <halfak>	 What legal issue?
[21:13:54] <Zppix>	 halfak:  https://lists.wikimedia.org/pipermail/wikitech-l/2017-October/088975.html
[21:14:21] <halfak>	 Oh they can go to hell
[21:14:23] <halfak>	 :) 
[21:14:44] <Zppix>	 halfak:  you should be a lawyer :P
[21:15:02] <Zppix>	 we're going to sue you, "well you can go to hell" :P
[21:15:14] <Zppix>	 instant case drop right there
[21:15:16] <halfak>	 Case closed
[21:17:18] <legoktm>	 I don't think it's a huge problem
[21:17:22] <legoktm>	 it was more of a "fyi"
[21:17:36] <halfak>	 thanks legoktm 
[21:17:58] <halfak>	 Will keep this in mind.  Might talk to legal about it. 
[21:18:14] <Zppix>	 halfak:  it sounds like you had the legal thing figured out though xD
[21:18:48] <halfak>	 I'll just confirm the complete effectiveness of the "got to hell" defense. 
[21:20:30] <legoktm>	 I think a trademark on term "jade" is pretty silly and it seemed like the nodejs person didn't want to fight it, but we have a pretty solid legal team
[21:21:55] <Zppix>	 halfak:  the visit to legal is just a formality, they just need to enter into the paperwork halfak  says "go to hell" 
[21:25:27] <halfak>	 That's where all paperwork goes eventually. 
[21:25:37] <halfak>	 OK.  I give up on the filehandle stuff.  But let me first record my notes. 
[21:27:23] <wikibugs_>	 10Scoring-platform-team, 10ORES: Clean up file handle and Redis connection management in ORES worker and celery processes - https://phabricator.wikimedia.org/T177036#3645111 (10Halfak) OK so I checked on this and it looks there's no effect at all on the file-handle count by dropping the connection to redis in...
[21:35:57] <awight>	 halfak: 66% savings, act today to pre-order your model!
[21:36:19] <halfak>	 :D 
[21:36:22] <awight>	 Your point wrt. “damaging” is important, yeah
[21:36:33] <awight>	 We need to make an API call that doesn’t assume either damaging or reverted.
[21:36:39] <awight>	 Or we have to hardcode enwiki.
[21:37:18] <halfak>	 Make it to the v3 version of the API
[21:37:37] <halfak>	  /v3/scores/enwiki/?model_info
[21:38:21] <Zppix>	 awight:  hardcode enwiki... ill make sure to file that for halfak  into the your crazy :P
[21:39:00] <halfak>	 oh... well that'd be the wikiId
[21:39:00] <awight>	 lol @ owning the english word for a type of rock
[21:39:06] <halfak>	 awight, right 
[21:39:13] <halfak>	 they should go to hell
[21:39:17] <halfak>	 :) 
[21:39:31] <awight>	 I’m already using the v3 route
[21:40:12] <Zppix>	 halfak:  I wonder if hell is trademarked?
[21:40:20] <awight>	 is that enough to tell me that revscoring 2.0 is available?
[21:40:35] <awight>	 Zppix: lol is most certainly is
[21:40:40] <halfak>	 awight, yes.  Output format will be 2.0-ish
[21:41:03] <awight>	 aha.  So {wikiId} then
[21:41:04] <awight>	 great
[21:41:34] <halfak>	 https://ores.wikimedia.org/versions should be machine readable
[21:41:54] <halfak>	 Also, it's bad that it's kind of hard-coded for which libraries matter. 
[21:42:26] <wikibugs_>	 (03CR) 10Awight: [C: 04-1] "TODO: hit /v3/scores/{wikiId}/?model_info to test API compatibility, rather than assuming the "damaging" model is present." [extensions/ORES] - 10https://gerrit.wikimedia.org/r/382778 (https://phabricator.wikimedia.org/T175053) (owner: 10Awight)
[21:42:54] <awight>	 halfak: whoa, neat!
[21:44:20] <awight>	 Don’t forget to have an EOD
[21:45:38] <halfak>	 Yeah.  Just about to do that :) 
[22:21:27] <wikibugs_>	 (03CR) 10Catrope: [C: 032] WLFilters: Temporarily stop respecting hideNonDamaging on WL with beta feature [extensions/ORES] - 10https://gerrit.wikimedia.org/r/382627 (owner: 10Sbisson)
[22:29:15] <wikibugs_>	 (03Merged) 10jenkins-bot: WLFilters: Temporarily stop respecting hideNonDamaging on WL with beta feature [extensions/ORES] - 10https://gerrit.wikimedia.org/r/382627 (owner: 10Sbisson)