[00:48:43] 10Revision-Scoring-As-A-Service-Backlog, 10Edit-Review-Improvements-RC-Page, 10ORES, 06Collaboration-Team-Triage (Collab-Team-Q3-Jan-Mar-2017): Damaging levels on Polish Wikipedia overlap too much - https://phabricator.wikimedia.org/T161655#3165579 (10Catrope) [00:48:45] 10Revision-Scoring-As-A-Service-Backlog, 10Edit-Review-Improvements-RC-Page, 10ORES, 06Collaboration-Team-Triage (Collab-Team-Q3-Jan-Mar-2017): Add more values to test_stats - https://phabricator.wikimedia.org/T161767#3165577 (10Catrope) 05Open>03Resolved a:03Catrope [15:29:45] o/ [15:29:54] glorian_wd, hey dude. I'm happy to work with you on that script today [15:30:08] I finished the script [15:30:33] halfak: I guess I need your help to modify it, such that we can pass argument into it [15:30:51] should I send you the script? [15:31:38] Yeah. Gist would be fine. I imagine it's a single file, right? [15:36:38] halfak: yeah [15:36:45] but it's a bit long [15:36:56] Perhaps, email would be better [15:37:02] one moment, I'll send you via email [15:37:17] a gist should handle the same size as an email. [15:37:26] hmm ok [15:37:28] Also this should be like 250 lines of code 0.O [15:38:49] https://gist.github.com/anonymous/13b40133ebb8f9c2a9946111a48c3996 [15:39:26] halfak: it's not OO yet. [15:41:04] I'm really confused about how your code flow continues in the 'except:' of a try block. [15:41:36] Does this work? [15:41:42] halfak: hehe.. yeah [15:42:02] :P OK time to turn this into a utility script. [15:42:48] basically, it reads the query result file, check if it's redirect page. If not a redirect page, check if it has unwanted page. If it's not the case, write the content (rev_id, item_id, page_len) into a new CSV [15:42:55] glorian_wd, want to get on a call with me and I'll walk you through what I'm doing? [15:43:05] halfak: sure [15:43:20] one moment, I ping you when I'm ready [15:46:00] halfak: hey, let's do this now [18:23:20] 06Revision-Scoring-As-A-Service, 10Wikidata, 10rsaas-articlequality : Implement "unwanted pages" filtering strategy for Wikidata - https://phabricator.wikimedia.org/T162530#3166263 (10Halfak) [18:23:48] 10Revision-Scoring-As-A-Service-Backlog, 10Wikidata, 10Wikilabels: Deploy Wikidata item quality campaign - https://phabricator.wikimedia.org/T157493#3166288 (10Halfak) [18:23:50] 06Revision-Scoring-As-A-Service, 10Wikidata, 10rsaas-articlequality : Implement "unwanted pages" filtering strategy for Wikidata - https://phabricator.wikimedia.org/T162530#3166287 (10Halfak) [18:24:50] 10Revision-Scoring-As-A-Service-Backlog, 10Wikidata, 10Wikilabels: Deploy Wikidata item quality campaign - https://phabricator.wikimedia.org/T157493#3007227 (10Halfak) http://labels.wmflabs.org/campaigns/wikidatawiki/51/?campaign=stats [18:25:00] 10Revision-Scoring-As-A-Service-Backlog, 10Wikidata, 10Wikilabels: Deploy Wikidata item quality campaign - https://phabricator.wikimedia.org/T157493#3166290 (10Halfak) [18:25:29] 06Revision-Scoring-As-A-Service, 10Wikidata, 10rsaas-articlequality : Implement "unwanted pages" filtering strategy for Wikidata - https://phabricator.wikimedia.org/T162530#3166263 (10Halfak) https://github.com/wiki-ai/wikiclass/pull/32 [18:26:05] 06Revision-Scoring-As-A-Service, 10Wikidata, 10rsaas-articlequality : Implement "unwanted pages" filtering strategy for Wikidata - https://phabricator.wikimedia.org/T162530#3166292 (10Halfak) Filters out disambiguation, category, list and wikinews pages from the items sample. Results in a 5k stratified samp... [18:26:18] 06Revision-Scoring-As-A-Service, 10Wikidata, 10Wikilabels: Deploy Wikidata item quality campaign - https://phabricator.wikimedia.org/T157493#3007227 (10Halfak) [18:27:32] 10Revision-Scoring-As-A-Service-Backlog, 10Wikidata, 10Wikilabels: Complete Wikidata item quality campaign - https://phabricator.wikimedia.org/T157495#3166295 (10Halfak) [18:27:42] 06Revision-Scoring-As-A-Service, 10Wikidata, 10Wikilabels: Complete Wikidata item quality campaign - https://phabricator.wikimedia.org/T157495#3007265 (10Halfak) [18:28:10] 06Revision-Scoring-As-A-Service, 10Wikidata, 10rsaas-articlequality : Implement "unwanted pages" filtering strategy for Wikidata - https://phabricator.wikimedia.org/T162530#3166298 (10Halfak) See https://www.wikidata.org/wiki/Wikidata_talk:Item_quality#Proposed_Changes_from_the_Pilot_Campaign_Analysis_Result [18:30:53] 06Revision-Scoring-As-A-Service, 10Wikidata, 10Wikilabels: Complete Wikidata item quality campaign - https://phabricator.wikimedia.org/T157495#3166301 (10Halfak) See https://quarry.wmflabs.org/query/17885 for my query that removes redirects. [18:31:19] 06Revision-Scoring-As-A-Service, 10Wikidata, 10Wikilabels: Complete Wikidata item quality campaign - https://phabricator.wikimedia.org/T157495#3166302 (10Halfak) Also, BTW, this is deployed. http://labels.wmflabs.org/campaigns/wikidatawiki/51/?campaign=stats [18:33:00] 06Revision-Scoring-As-A-Service, 10Wikidata, 10Wikilabels: Deploy Wikidata item quality campaign - https://phabricator.wikimedia.org/T157493#3166303 (10Halfak) ``` halfak@wikilabels-01:~/datasets$ sudo -u www-data /srv/wikilabels/venv/bin/wikilabels new_campaign wikidatawiki "Item quality (5k stratified)" it... [18:59:07] Alright, I think all of the things are updated and cleaned. [18:59:10] Heading out for the day [18:59:12] o/