[00:00:50] 10Scoring-platform-team: Article Quality tokenizer error - https://phabricator.wikimedia.org/T256085 (10ACraze) [00:01:17] thanks danilo for the bug report, just added it to phabricator [03:36:21] mediawiki-utilities/python-mwtext#119 (galtay-bug-fix-2 - 7fe4456 : Gabriel): The build passed. https://travis-ci.org/mediawiki-utilities/python-mwtext/builds/701126131 [09:57:34] In the context of Constraint Satisfaction Problems, a _support_ is formally defined as follows: [09:57:48] A support for a value assignment V=d in a constraint C is an assignment A to all of the other variables in scope(C) s.t. A ∪ {V=d} satisfies C. [09:58:00] Is it a gross mutilation of the term "support" if I said: [09:58:25] "The assignment V=d is a support for A that satisfies C"? [15:15:46] Hey danilo! Do you know what revision you were processing when you got that error? [15:16:42] no, I don't know [15:18:16] Dang. I can report this to mwparserfromhell upstream, but I'm not sure how they'll be able to take it on. I wonder if you could add some output in your try/catch to output some details about the thing being parsed. [15:21:31] yes, I will put a print in the try/except to get the page in next time that occurs [15:55:36] 10Scoring-platform-team, 10articlequality-modeling, 10artificial-intelligence: Convert to nettrom style WP 1.0 label extraction process - https://phabricator.wikimedia.org/T130312 (10Halfak) 05Open→03Resolved a:03Halfak This is done. [15:55:38] 10Scoring-platform-team (Current), 10articlequality-modeling, 10Epic, 10artificial-intelligence: [Epic] Article quality models (wp10) - https://phabricator.wikimedia.org/T130259 (10Halfak) [16:27:52] 10Scoring-platform-team, 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for Dutch Wikipedia - https://phabricator.wikimedia.org/T223782 (10Halfak) p:05Lowest→03Medium [17:00:48] Hey folks. Won't make it to BS meeting today :( [17:00:53] I am doing a media training thing. [17:02:10] Okay sounds good, I'm happy to cancel it, frankly I need a little break in my day-o-meetings. [17:02:19] ^ yeah same [17:04:08] chtnnh, ^ [17:04:27] That's fine [18:38:59] lunch! [19:38:49] 10Jade, 10AbuseFilter, 10Analytics-Radar, 10Cognate, and 28 others: Replace PageContent(Insert|Save)Complete hooks - https://phabricator.wikimedia.org/T250566 (10DannyS712) [19:40:26] (03PS1) 10Halfak: Updates for revscoring-2.8.2 [services/ores/deploy] - 10https://gerrit.wikimedia.org/r/607354 (https://phabricator.wikimedia.org/T254505) [19:43:53] finally got the ORES change in place. I spent hours fighting with submitting to gerrit from stat7 before I finally gave up. [19:46:23] accraze, do you have some time to look at https://gerrit.wikimedia.org/r/607354 ? [19:47:02] (03PS2) 10Halfak: Updates for revscoring-2.8.2 [services/ores/deploy] - 10https://gerrit.wikimedia.org/r/607354 (https://phabricator.wikimedia.org/T254505) [19:51:00] yeah taking a quick look [19:51:53] thanks! [19:56:35] (03CR) 10Accraze: [C: 03+2] "LGTM!" (031 comment) [services/ores/deploy] - 10https://gerrit.wikimedia.org/r/607354 (https://phabricator.wikimedia.org/T254505) (owner: 10Halfak) [19:57:35] oh is this the repo where we also need to do verified +2? [19:58:29] ah yeah no jenkins-bot integration on this one yet [19:58:37] (03CR) 10Accraze: [V: 03+2 C: 03+2] Updates for revscoring-2.8.2 [services/ores/deploy] - 10https://gerrit.wikimedia.org/r/607354 (https://phabricator.wikimedia.org/T254505) (owner: 10Halfak) [20:28:03] Right. Thanks accraze [20:29:53] Going to beta! [20:30:06] And this concludes my 30 minutes of meeting free work for the day :) [20:41:19] haksoat1, https://ores-beta.wmflabs.org/v3/scores/ukwiki/7239990 [20:41:22] your work is up on beta! [20:41:27] It's making predictions. [20:42:26] I blocked off some time tomorrow to do the prod deployment. [20:44:39] halfak, there are some AttributeErrors at https://ores-beta.wmflabs.org/v3/scores/ptwiki/58585551 [20:44:53] is it expected? [20:45:11] Hmm. No. Looks like this might be a version mismatch in our wheels. I'm looking into it now. [20:47:14] weird. sklearn version matches. [20:51:23] Confirmed. [20:51:33] Looks like I can use the model locally. Testing on beta. [20:54:46] what the heck!? [20:57:37] 10ORES, 10Scoring-platform-team (Current), 10artificial-intelligence: Review model performance for ptwiki 'articlequality' and 'draftquality' - https://phabricator.wikimedia.org/T250809 (10He7d3r) @Danilo generated the following table comparing articlequality scores for the latest version of all articles to... [20:58:33] I can load the model on beta. [20:58:35] Can I score iwth it? [20:58:59] I can even generate a score. [20:59:58] Aha! Wrong model. [21:00:05] Looks like one is failing and it's not the damaging model. [21:00:23] It looks like draftquality [21:00:41] [offtopic] halfak, I posted at T250809 some data from danilo's investigation with the articlequality model (I mentioned on today's meeting) [21:00:42] T250809: Review model performance for ptwiki 'articlequality' and 'draftquality' - https://phabricator.wikimedia.org/T250809 [21:08:03] Hello halfak, stuck with an error from building the drafttopic model. [21:08:08] ValueError: Cannot have number of splits n_splits=5 greater than the number of samples: n_samples=0. [21:08:09] Makefile:126: recipe for target 'models/arwiki.articletopic.gradient_boosting.model' failed [21:08:09] make: *** [models/arwiki.articletopic.gradient_boosting.model] Error 1 [21:08:09] make: *** Deleting file 'models/arwiki.articletopic.gradient_boosting.model' [21:08:24] Looks like one of the input files is empty. [21:08:28] Might have failed. [21:09:02] Okay. I'll take a look [21:53:40] Confirmed that I'm able to run the model on the meta server. [21:54:09] Woah... it just started working? [21:54:30] Hmm. No something is still failing. [21:56:03] aha! The ptwiki articlequality model was built with an old version of revscoring. [21:57:14] Aha! The ptwiki models weren't added to the common rules. I'll fix that and get a PR up/. [21:59:36] I'm starting from scratch because we updated the extractor and I'm not sure I have a fresh label extraction. This will probably take a while. So I'll be sitting on this until tomorrow. [21:59:56] Helder, I don't know how you found that so fast, but I'm really glad you were looking! [22:13:53] mediawiki-utilities/python-mwtext#122 (wikidata2words_pids - 1d75e9b : halfak): The build passed. https://travis-ci.org/mediawiki-utilities/python-mwtext/builds/701443125 [22:19:09] OK I think I have everything in a good state. With any luck, we'll still be able to make the deployment window tomorrow. [22:19:16] I'm going to head out soon. [22:19:23] Any questions before I run haksoat1? [22:23:50] OK I'm off. Take care folks!