[13:58:22] o/ [14:55:48] halfak, there [14:56:16] Hi GhassanMas [14:57:01] Hi, Regarding the able of pages ids and class [14:57:10] table* [14:57:41] have you extract the features yet? [14:58:38] Ahh yes. So we have a dataset that has predicted scores for the most recent version of articles in English Wikipedia. [14:58:48] I'm currently generating a table of monthly article quality assessments. [14:59:54] ...which looks like it errored out over the weekend :( [15:00:56] Looks like a file system issue prevented me from writing to the output file. [15:01:01] Well... that's frustrating. [15:01:06] I don't think I can resume here. [15:01:10] Hmm... maybe I can./ [15:01:14] * halfak looks into that. [15:03:22] the (, <month>, <total views>) dataset? [15:04:16] <halfak> Oh! Woops. I forgot that we were talking about views. You said "features" :) [15:04:32] <GhassanMas> yeah I mean the features first [15:04:34] <halfak> So, yeah. No real progress there yet unless sabya has looked into the mwviews library [15:04:46] <halfak> Which I haven't heard about [15:04:52] <GhassanMas> but I thought you were working on the data set you mention that you need on Saturday [15:05:14] <GhassanMas> I have looked into mwvies [15:05:42] <halfak> Na. Not working on the views one. [15:05:48] <halfak> Just on the predicted quality one [15:06:48] <GhassanMas> this one enwiki.rev_wp10.nettrom_30k.tsv? [15:07:46] <halfak> GhassanMas, ahh. Yeah. So I was talking about extended in that prediction problem with hash vector features. [15:10:27] <GhassanMas> close to this http://scikit-learn.org/stable/modules/feature_extraction.html#hashing-vectorizer? [15:11:12] <GhassanMas> http://scikit-learn.org/stable/modules/feature_extraction.html#hashing-vectorizer [15:13:32] <halfak> yeah. We're literally using that in testing and I've recently developed a strategy for doing it inside of ORES' dependency injection framework. [15:14:17] <halfak> See https://github.com/wiki-ai/revscoring/blob/master/revscoring/datasources/meta/hashing.py [15:14:21] <halfak> https://github.com/wiki-ai/revscoring/blob/master/revscoring/datasources/meta/gramming.py [15:14:25] <halfak> And https://github.com/wiki-ai/revscoring/blob/master/revscoring/datasources/meta/frequencies.py [16:23:56] <wikibugs> 06Revision-Scoring-As-A-Service, 10MediaWiki-extensions-ORES, 15User-Ladsgroup: Embed machine readable ores scores as data on pages where ORES scores things - https://phabricator.wikimedia.org/T143611#2648929 (10Halfak) @Legoktm, it looks like this is going to be very painful. We're going to have to make a... [16:28:04] <wikibugs> 06Revision-Scoring-As-A-Service, 10MediaWiki-extensions-ORES, 15User-Ladsgroup: Embed machine readable ores scores as data on pages where ORES scores things - https://phabricator.wikimedia.org/T143611#2648953 (10Halfak) Note, the wbEntity strategy will not require us to think hard about serialization. [16:36:03] <wikibugs> 10Revision-Scoring-As-A-Service-Backlog, 10ORES, 06Operations, 07Puppet: Clean up puppet & configs for ORES - https://phabricator.wikimedia.org/T142002#2648990 (10Halfak) [16:36:57] <wikibugs> 06Revision-Scoring-As-A-Service, 10rsaas-articlequality , 15User-Ladsgroup: Setup a db on labsdb for article quality that is publicly accessible - https://phabricator.wikimedia.org/T106278#2648993 (10Halfak) As discussed in the meeting. Review https://wikitech.wikimedia.org/wiki/Help:Tool_Labs/Database and... [16:40:50] <wikibugs> 10Revision-Scoring-As-A-Service-Backlog, 10rsaas-articlequality : [Explore] Spam and Vandalism new page creation - https://phabricator.wikimedia.org/T135644#2649001 (10Halfak) [18:24:18] <wikibugs> 10Revision-Scoring-As-A-Service-Backlog, 10Wikilabels, 10rsaas-edittypes: Train edit types model on labeled data for English Wikipedia - https://phabricator.wikimedia.org/T121715#2649691 (10ggellerman) [18:31:11] <wikibugs> 10Revision-Scoring-As-A-Service-Backlog, 10MediaWiki-extensions-ORES: Expose ores_model data in API using meta=ores - https://phabricator.wikimedia.org/T143617#2649745 (10Anomie) [18:31:14] <wikibugs> 10Revision-Scoring-As-A-Service-Backlog, 10MediaWiki-extensions-ORES: Introduce rcshow=oresreview and similar ones - https://phabricator.wikimedia.org/T143616#2649746 (10Anomie) [18:31:18] <wikibugs> 10Revision-Scoring-As-A-Service-Backlog, 10MediaWiki-extensions-ORES: Introduce ORES rvprop - https://phabricator.wikimedia.org/T143614#2649750 (10Anomie) [18:33:17] <wikibugs> 06Revision-Scoring-As-A-Service, 10RESTBase-API, 06Services, 15User-mobrovac: Public API endpoints for new services - https://phabricator.wikimedia.org/T103811#2649754 (10ggellerman) [18:33:42] <wikibugs> 06Revision-Scoring-As-A-Service, 10RESTBase-API, 06Services, 15User-mobrovac: Public API endpoints for new services - https://phabricator.wikimedia.org/T103811#1399718 (10ggellerman) removing Research and Data backlog. @DarTar is still subscribed [18:56:28] <wikibugs> 10Revision-Scoring-As-A-Service-Backlog, 10MediaWiki-extensions-ORES: Expose ores_model data in API using meta=ores - https://phabricator.wikimedia.org/T143617#2649883 (10Ladsgroup) I missed this ping, sorry. >>! In T143617#2612529, @Anomie wrote: > A few questions: > > * Should non-current models in `ores_m... [18:59:49] <wikibugs> 10Revision-Scoring-As-A-Service-Backlog, 10MediaWiki-extensions-ORES: Introduce rcshow=oresreview and similar ones - https://phabricator.wikimedia.org/T143616#2649894 (10Anomie) `list=watchlist` might be a bit tricky, since it's using WatchedItemQueryService instead of doing the queries directly so it won't be... [19:36:00] <wikibugs> 06Revision-Scoring-As-A-Service, 10RESTBase-API, 06Services, 15User-mobrovac: Public API endpoints for new services - https://phabricator.wikimedia.org/T103811#2649979 (10GWicke) 05Open>03Resolved The concrete issue discussed on this task has been resolved, so there is nothing actionable left. Lets clo... [20:22:17] <grrrit-wm> (03CR) 10Catrope: [WIP] Only make hidenondamaging available if damaging is enabled (031 comment) [extensions/ORES] - 10https://gerrit.wikimedia.org/r/310475 (owner: 10Catrope) [20:51:51] <halfak> Amir1, around? [22:53:36] <grrrit-wm> (03PS2) 10Catrope: Only make hidenondamaging available if damaging is enabled [extensions/ORES] - 10https://gerrit.wikimedia.org/r/310475 [22:54:24] <grrrit-wm> (03CR) 10jenkins-bot: [V: 04-1] Only make hidenondamaging available if damaging is enabled [extensions/ORES] - 10https://gerrit.wikimedia.org/r/310475 (owner: 10Catrope)