[08:18:54] 10Quarry, 06Discovery, 10Labs-project-other, 10Wikidata, and 2 others: Setup sparqly service at https://sparqly.wmflabs.org/ (like Quarry) - https://phabricator.wikimedia.org/T104762#2635939 (10Multichill) With the current SPARQL setup it's easy to share queries either by full url or by short url. I think...
[10:49:24] 10Quarry, 06Discovery, 10Labs-project-other, 10Wikidata, and 2 others: Setup sparqly service at https://sparqly.wmflabs.org/ (like Quarry) - https://phabricator.wikimedia.org/T104762#2636433 (10Base) Do I get it right that now a query cannot be longer than URL length limit? How much exactly is that number...
[10:55:56] 10Quarry, 06Discovery, 10Labs-project-other, 10Wikidata, and 2 others: Setup sparqly service at https://sparqly.wmflabs.org/ (like Quarry) - https://phabricator.wikimedia.org/T104762#1426314 (10jcrespo) @Base, your questions are very interesting, and you seem to have really nice suggestions, but I would s...
[15:03:56] o/
[18:50:57] halfak: not sure if you saw this, but I missed it during the wishlist process last year: https://meta.wikimedia.org/wiki/2015_Community_Wishlist_Survey/Miscellaneous#Editor_Stats_API.2FGUI
[18:51:06] they're basically asking for wikicredit
[18:51:18] which I can't wait to help with :)
[18:51:32] do yall have thoughts about a timeline for that?
[18:52:11] milimetric, ORES success --> WikiCredit backburner :(
[18:52:28] it's all good, we'll get there :)
[18:52:31] But really, if we could design a system that could generate the productivity stats in semi-realtime.
[18:52:36] o/ halfak
[18:52:42] That's the major barrier to progress now.
[18:52:44] o/ sabya
[18:53:33] regarding the model performance, I think it is performing better if we don't pass sample_weight
[18:54:25] generating another graph for proving this.
[18:55:11] so, roc is around 92 without sample weight, and I think it will be around 90 with weight.
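(A minimal sketch of the comparison sabya describes above: fit the same classifier with and without `sample_weight` and compare ROC AUC on held-out data. The dataset, class ratio, and up-weighting factor here are illustrative assumptions, not the real ORES training setup. Since AUC is a ranking metric, re-weighting usually moves it less than threshold-dependent scores like accuracy or precision, which matches halfak's surprise below.)

```python
# Compare ROC AUC for a model fit with vs. without sample_weight.
# Toy imbalanced data standing in for the edit-quality corpus (assumption).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

# Up-weight the rare class 9:1, as if passing sample_weight during fit.
weights = np.where(y_tr == 1, 9.0, 1.0)

unweighted = GradientBoostingClassifier(random_state=0).fit(X_tr, y_tr)
weighted = GradientBoostingClassifier(random_state=0).fit(
    X_tr, y_tr, sample_weight=weights)

for name, model in [("unweighted", unweighted), ("weighted", weighted)]:
    auc = roc_auc_score(y_te, model.predict_proba(X_te)[:, 1])
    print(f"{name}: ROC AUC = {auc:.3f}")
```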
[18:55:38] I'm surprised that roc auc changes
[18:55:50] I know that accuracy/precision/etc. should change.
[18:56:06] Since essentially we're saying, "Pretend that this one case isn't all that uncommon"
[18:56:09] When really it is.
[18:57:19] ok, will confirm once I see the plot.
[18:57:33] it is currently generating the grid.
[19:00:58] also, since in the last plot, the lines were still going upwards till the last value of n_estimators, I'm generating roc for estimators till 2100. Want to see where it becomes flat.
[19:02:04] makes sense?
[19:09:52] Sounds good. Thanks sabya
[19:10:36] milimetric, do you think that anything analytics has in the pipeline would help us generate the massively CPU-intensive content persistence measures in easier ways?
[19:10:46] I suppose the stream processing will be key.
[19:11:03] We need to not miss an event (kafka) and we need to distribute processing with flexible memory
[19:11:23] yeah, stream processing
[19:11:40] also though, we have to find a good way to store/retrieve/operate on text chunks
[19:12:36] nothing turned up on my past searches but when we do this "for real", I intend on searching and thinking for at least a week, because if there's nothing out there, there should be
[20:50:23] milimetric, +1 to that. Seems like a clear technical proposal is a good next step
[21:37:40] I'm just going to leave this here: https://en.wikipedia.org/wiki/Ore
[21:37:56] Better link: https://en.wikipedia.org/wiki/ores
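(A sketch of the `n_estimators` sweep discussed above. Rather than refitting the whole grid per value, scikit-learn's `staged_predict_proba` scores every boosting stage from a single fit, which makes it cheap to see where the ROC curve flattens. The data is a toy stand-in, and the sweep here stops at 600 trees for brevity where the chat goes to 2100; both are assumptions about the real experiment.)

```python
# Fit once with a large n_estimators, then score each stage on held-out
# data to find where ROC AUC stops improving.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

model = GradientBoostingClassifier(n_estimators=600, random_state=0)
model.fit(X_tr, y_tr)

# ROC AUC after each boosting stage, without retraining per grid point.
aucs = [roc_auc_score(y_te, proba[:, 1])
        for proba in model.staged_predict_proba(X_te)]

# Print a few checkpoints to eyeball where the curve goes flat.
for n in (50, 100, 300, 600):
    print(f"n_estimators={n}: ROC AUC = {aucs[n - 1]:.3f}")
```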