[02:38:43] Scoring-platform-team-Backlog, Bad-Words-Detection-System, revscoring, Malayalam-Sites, artificial-intelligence: Add language support for Malayalam - https://phabricator.wikimedia.org/T173193#3521815 (Mahir256)
[02:39:35] Scoring-platform-team-Backlog, Bad-Words-Detection-System, revscoring, Tamil-Sites, artificial-intelligence: Add language support for Tamil - https://phabricator.wikimedia.org/T173192#3521818 (Mahir256)
[05:16:39] Scoring-platform-team-Backlog, Bad-Words-Detection-System, revscoring, Tamil-Sites, artificial-intelligence: Add language support for Tamil - https://phabricator.wikimedia.org/T173192#3521887 (awight) Open>Invalid I realized after creating this task that we already have a Tamil "rever...
[05:17:18] Scoring-platform-team-Backlog, Bad-Words-Detection-System, revscoring, Malayalam-Sites, artificial-intelligence: Add language support for Malayalam - https://phabricator.wikimedia.org/T173193#3521889 (awight) @Mahir256: Thank you for the correction!
[05:45:53] Scoring-platform-team-Backlog, Bad-Words-Detection-System, revscoring, Hindi-Sites, artificial-intelligence: Add language support for Hindi - https://phabricator.wikimedia.org/T173122#3521914 (awight) I left a note on the User talk page for @hindustanilanguage, this discussion is currently at...
[13:40:01] https://en.wikipedia.org/wiki/Predictive_Model_Markup_Language
[13:45:31] Scoring-platform-team-Backlog, Analytics, revscoring, artificial-intelligence: [Investigate] Use PMML for prediction model serialization - https://phabricator.wikimedia.org/T173244#3522058 (awight)
[14:06:18] Scoring-platform-team-Backlog, Bad-Words-Detection-System, revscoring, Hindi-Sites, artificial-intelligence: Add language support for Hindi - https://phabricator.wikimedia.org/T173122#3522140 (awight) @Halfak Found a Hindi word list which is ready for review: https://meta.wikimedia.org/wiki/R...
[14:08:58] Scoring-platform-team-Backlog, Continuous-Integration-Config: Have CI merge research/ores/wheels changes - https://phabricator.wikimedia.org/T173251#3522157 (awight)
[14:12:36] Scoring-platform-team-Backlog: Automation and intermediate storage for population rates - https://phabricator.wikimedia.org/T173252#3522172 (awight)
[15:24:34] Scoring-platform-team-Backlog, revscoring, artificial-intelligence: '!' doesn't work for threshold optimizations - https://phabricator.wikimedia.org/T173261#3522382 (Halfak)
[16:16:06] Scoring-platform-team, revscoring, artificial-intelligence: '!' doesn't work for threshold optimizations - https://phabricator.wikimedia.org/T173261#3522531 (Halfak)
[16:17:18] Scoring-platform-team, revscoring, artificial-intelligence: '!' doesn't work for threshold optimizations - https://phabricator.wikimedia.org/T173261#3522382 (Halfak) https://github.com/wiki-ai/revscoring/pull/349
[16:18:48] wiki-ai/revscoring#1182 (fix_thresh_opt_pattern - d093ef7 : Aaron Halfaker): The build failed. https://travis-ci.org/wiki-ai/revscoring/builds/264102721
[16:22:25] Hi, do you know if there is a way to download the ORES database(s) in CSV or a similar format?
[16:23:10] Kelson: hey, it depends on the model you want to download
[16:23:57] Amir1: I need two pieces of information for all Wikipedia articles in all languages: the rating (A, B, C... etc) and the recommended revision id
[16:23:58] if it's about edit quality (fighting vandalism), it's not possible (and you can have access to some of the scores in the ores_classification table in mediawiki)
[16:24:10] what you want is article quality
[16:24:15] it's possible but the dump is huge
[16:24:16] Amir1: yes
[16:24:25] Amir1: ok, where is that located?
[16:24:47] it's in labsdb, let me take a look
[16:24:55] Kelson: but one thing, this is for enwiki only
[16:25:14] other languages either: 1- don't have the model or 2- the data is not in the database
[16:25:19] Amir1: ah ok, my bad, thought ORES works for all Wikipedias
[16:25:35] Amir1: anyway, that is a start
[16:25:44] it does but with different levels of support
[16:26:14] Kelson, there's a nice publication from nettrom for what you want
[16:26:19] woops
[16:26:21] forgot I was a goat
[16:26:35] https://figshare.com/articles/English_Wikipedia_Quality_Asssessment_Dataset/1375406
[16:26:50] Kelson: https://datasets.wikimedia.org/public-datasets/enwiki/article_quality/enwiki-20160801.wp10.monthly.tsv.bz2
[16:27:05] halfak: I was looking for you :D
[16:27:14] o/
[16:27:21] Sorry. been very busy today
[16:27:24] I did get your ping :)
[16:27:41] Amir1: the link you gave me seems to be the WP10 rating
[16:28:12] it's not the wp10 rating
[16:28:14] Amir1: I thought ORES has its own rating (based on other inputs)
[16:28:20] Amir1: ok.
[16:28:22] it's the prediction we make based on those ratings
[16:28:47] we train our AI based on the data and we project it to all revisions/articles
[16:28:58] Amir1: ok, got it
[16:29:28] Amir1: thx for your help, I will have a look at the files.
[16:29:48] Thank you for working on it
[18:00:19] Scoring-platform-team-Backlog, revscoring, artificial-intelligence: ThresholdOptimizations fail when a stat is null-ed - https://phabricator.wikimedia.org/T173268#3522646 (Halfak)