[13:30:47] Scoring-platform-team, Bad-Words-Detection-System, revscoring, artificial-intelligence: Language assets for Azerbaijani - https://phabricator.wikimedia.org/T162014 (Aklapper) a:Wertuose→None I am resetting the assignee of this task because there has not been progress lately (please correc...
[13:32:40] Scoring-platform-team, drafttopic-modeling: Refactor scripts fetching text and other metadata - https://phabricator.wikimedia.org/T181074 (Aklapper) @Sumit: Hi! This task has been assigned to you a while ago. Could you maybe share an update? Do you still plan to work on this, or do you need any help? Tha...
[14:33:58] brb need to run to the post office to do some passport things.
[15:34:23] Scoring-platform-team (Current), articlequality-modeling, artificial-intelligence: Add features for English Language idioms to articlequality models - https://phabricator.wikimedia.org/T247000 (Halfak) I ran a test with this and discussed it yesterday with @Haksoat. In https://gist.github.com/halfak...
[15:42:40] Back! Forgot to say.
[18:51:29] Hello @hald
[18:51:35] Hello halfak
[18:53:20] You posted the wrong gist link to Phabricator
[18:53:28] https://gist.github.com/halfak/b9ce3f174a066e4851d04a2de7d2437d
[18:53:36] Should be this ^
[18:56:49] Hey! Woops!
[18:57:17] fixed
[18:57:22] BRB. Need to change locations.
[18:58:03] Alright
[20:02:53] haksoat, back.
[20:03:06] Sorry got a bunch of questions when I sat down ^_^
[20:09:57] Hehe
[20:10:00] Alright
[20:20:39] Scoring-platform-team, drafttopic-modeling: Follow-up cleanup to topic models - https://phabricator.wikimedia.org/T246909 (MMiller_WMF)
[20:20:41] Scoring-platform-team, Discovery-Search, Epic, Growth-Team (Current Sprint): [EPIC] Growth: Newcomer tasks 1.1.1 (ORES topics) - https://phabricator.wikimedia.org/T240517 (MMiller_WMF)
[20:27:37] Scoring-platform-team, drafttopic-modeling: Filter out disambiguation pages in topic labels - https://phabricator.wikimedia.org/T246910 (MMiller_WMF) @Halfak -- are you saying that this would exclude disambiguation pages from model training? Or from model scoring? I still think that disambiguation page...
[20:29:14] Scoring-platform-team, drafttopic-modeling: Follow-up cleanup to topic models - https://phabricator.wikimedia.org/T246909 (MMiller_WMF) Thanks for creating this task, @Halfak. I'll respond to T245368#5941808 here (also @Tgr and @Isaac, who were participating). I see what you mean about women-related to...
[20:34:03] MediaWiki-extensions-ORES, Scoring-platform-team, Discovery-Search, NewcomerTasks 1.1, and 3 others: Expose ORES drafttopic data in ElasticSearch via a custom CirrusSearch keyword - https://phabricator.wikimedia.org/T240559 (MMiller_WMF) @kostajh @Johan @EBernhardson -- I don't think it's quite r...
[20:50:14] Scoring-platform-team, Discovery-Search, Growth Design, Growth-Team (Current Sprint), MW-1.35-notes (1.35.0-wmf.22; 2020-03-03): Newcomer tasks: UX changes for ORES topics - https://phabricator.wikimedia.org/T244421 (MMiller_WMF) Open→Resolved Great, thank you!
[20:50:16] Scoring-platform-team, Discovery-Search, Epic, Growth-Team (Current Sprint): [EPIC] Growth: Newcomer tasks 1.1.1 (ORES topics) - https://phabricator.wikimedia.org/T240517 (MMiller_WMF)
[21:01:08] MediaWiki-extensions-ORES, Scoring-platform-team, Discovery-Search, NewcomerTasks 1.1, and 3 others: Expose ORES drafttopic data in ElasticSearch via a custom CirrusSearch keyword - https://phabricator.wikimedia.org/T240559 (EBernhardson) @Mmiller_WMF The complication is that the process that mak...
[21:53:53] halfak regarding the articles
[21:54:42] How do you suggest we go about picking them? so we can fetch the non-idioms
[21:54:56] I could prolly do that over the weekend in my free time
[21:56:36] Or perhaps we remove all the two-worded phrases. We may lose out some idioms though.
[21:58:12] Or maybe I manually remove the phrases I can easily find... Will take a while but should be worth it
[23:16:40] halfak
[23:16:51] Still there?
[23:17:24] Yup
[23:17:39] Oh. Re, articles... (Sorry I missed the ping)
[23:17:49] * halfak thinks.
[23:18:12] Great
[23:18:38] So I was thinking that I could manually take out some of the entries
[23:19:09] I saw a list of phrases on wikitionary too I think, so I could do some comparisons.
[23:19:28] We probably want a sample of article in https://en.wikipedia.org/wiki/Category:Wikipedia_featured_articles
[23:20:31] We can filter phrases that are common in these articles. What do you think?
[23:20:41] I can help get a sample
[23:22:13] https://quarry.wmflabs.org/query/42787
[23:23:21] I'm also interested in the manual run. If you want to do that, it would be good to keep that list of phrases you remove handy for any refresh.
[23:28:15] ORES, Scoring-platform-team, Goatification: Goat detection and evaluation - https://phabricator.wikimedia.org/T173126 (Quiddity)