[00:37:53] 10Scoring-platform-team (Research), 10Outreach-Programs-Projects, 10Google-Summer-of-Code (2020), 10artificial-intelligence: Proposal (GSoC 2020): Implement an NSFW image classifier with open_nsfw - https://phabricator.wikimedia.org/T247614 (10MusikAnimal) >>! In T247614#5988813, @Chtnnh wrote: > What do y... [11:12:06] 10Scoring-platform-team (Research), 10Outreach-Programs-Projects, 10Google-Summer-of-Code (2020), 10artificial-intelligence: Proposal (GSoC 2020): Implement an NSFW image classifier with open_nsfw - https://phabricator.wikimedia.org/T247614 (10Lazy-restless) Best wishes and cordial prayers to the project a... [14:03:54] o/ [16:10:52] Hey folks! Async time. [16:10:54] Last week: I had a lot of meetings. They were mostly related to annual planning and hiring. But I still managed to hand off some code to a new volunteer (article quality gadget), I made a lot of different types of mock up for Jade and talked to the team about them. I did some Jade outreach, got us some CR support, and I got light/initial approval to reach out to simplewiki for a pilot. I also did some work on revising the ORES systems [16:10:54] paper in response to reviewers critiques. [16:10:54] Today: I wrote up some interview questions and I have an interview for the VP position. I'm also working through some documentation to help Grant and our future VP make sense of what we're doing with advanced algorithms @ Wikimedia. [16:14:25] Last week: Designed Jade UI ( JS + CSS ) based on Wireframes and pushed a couple of patchsets. I also reviewed Andy's patchsets. [16:14:25] [16:14:26] T: Added check for missing user Talk Page - When endorsement author's user Talk Page is missing, their talk page link becomes red. [16:17:14] Scoring staff meeting! [16:17:18] kevinbazira, ^ [16:18:10] Joining now, halfak o/ [16:19:19] is this different from the one on april 3? [16:40:15] 10Scoring-platform-team (Research), 10Outreach-Programs-Projects, 10Google-Summer-of-Code (2020), 10artificial-intelligence: Proposal (GSoC 2020): Implement an NSFW image classifier with open_nsfw - https://phabricator.wikimedia.org/T247614 (10Chtnnh) >The idea we had was for a image_filter_scoring table t... [16:48:46] 10Scoring-platform-team (Research), 10MediaWiki-extensions-General, 10articlequality-modeling, 10WorkType-NewFunctionality, 10artificial-intelligence: Create Quality Bias Alert for Cited Information Sources - https://phabricator.wikimedia.org/T28426 (10Halfak) [16:49:08] 10Jade, 10Scoring-platform-team (Current), 10CommRel-Specialists-Support (Jan-Mar-2020): Design Jade pilot deployment plan with the Scoring Platform team - https://phabricator.wikimedia.org/T246486 (10Halfak) a:03Halfak [16:49:29] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10Halfak) [16:50:07] 10Scoring-platform-team, 10Wikilabels, 10editquality-modeling, 10artificial-intelligence: Build draft quality model for ptwikipedia - https://phabricator.wikimedia.org/T246667 (10Halfak) p:05Triage→03Medium [16:52:05] 10Scoring-platform-team, 10Wikilabels, 10editquality-modeling, 10artificial-intelligence: Create follow-up edit quality campaign for ptwikipedia - https://phabricator.wikimedia.org/T246668 (10Halfak) p:05Triage→03Medium [16:52:59] 10Scoring-platform-team (Current), 10Wikilabels, 10editquality-modeling, 10artificial-intelligence: Create follow-up edit quality campaign for ptwikipedia - https://phabricator.wikimedia.org/T246668 (10Halfak) a:03Halfak [16:55:33] 10Scoring-platform-team, 10drafttopic-modeling: Follow-up cleanup to topic models - https://phabricator.wikimedia.org/T246909 (10Halfak) p:05Triage→03High [16:56:33] 10Scoring-platform-team, 10drafttopic-modeling: Filter out disambiguation pages in topic labels - https://phabricator.wikimedia.org/T246910 (10Halfak) p:05Triage→03High [17:02:24] fyi will be a few minutes late [18:29:54] 10Scoring-platform-team (Research), 10Structured-Data-Backlog, 10artificial-intelligence: Implement NSFW image classifier using Open NSFW - https://phabricator.wikimedia.org/T214201 (10Chtnnh) In T225664 @Mholloway developed an API implementation of `open_nsfw` based on `open_nsfw--`. Utilizing this implemen... [18:32:41] 10Scoring-platform-team (Research), 10Outreach-Programs-Projects, 10Google-Summer-of-Code (2020), 10artificial-intelligence: Proposal (GSoC 2020): Implement an NSFW image classifier with open_nsfw - https://phabricator.wikimedia.org/T247614 (10Chtnnh) https://www.mediawiki.org/wiki/Manual:Image_table @Mus... [18:47:26] 10Scoring-platform-team, 10Wikilabels, 10editquality-modeling, 10artificial-intelligence: Build draft quality model for ptwikipedia - https://phabricator.wikimedia.org/T246667 (10Chtnnh) Some follow up questions: 1. How many of the above categories are we targeting? 2. Do we require the model to classify... [18:58:43] 10Scoring-platform-team, 10Wikilabels, 10editquality-modeling, 10artificial-intelligence: Build draft quality model for ptwikipedia - https://phabricator.wikimedia.org/T246667 (10Halfak) @chtnnh, it looks like we want to target a few of these deletion reasons. Essentially, we want to product vandalism, sp... [19:00:15] Hello halfak It's been a while. I was a bit ill. [19:00:28] 10Scoring-platform-team, 10Wikilabels, 10editquality-modeling, 10artificial-intelligence: Build draft quality model for ptwikipedia - https://phabricator.wikimedia.org/T246667 (10Chtnnh) Understood. That will be our training and testing data and then upon satisfactory performance we can try deploying the m... [19:00:46] haksoat are you better now? [19:00:56] Yes, I am. Thank you. [19:19:07] Hey haksoat! [19:19:13] I'm glad you are feeling better! [19:19:42] I was wondering how you were. [19:20:58] Yeah. Thanks. [19:21:08] Any progress on the idioms? [19:22:21] I haven't made any performance improvements. I was hoping to find time to propose ways that we could further narrow the list. Were you going to try to read idioms from that table? [19:22:32] * halfak tries to remember. [19:22:45] XP [19:23:10] Yeah. I already tried from the table. [19:23:47] No reasonable improvements. Then.... [19:24:00] From this gist you shared: https://gist.github.com/halfak/656d4370b4583c2bd2bbb6836c4008b2 [19:24:06] Oh! I didn't know we tried that. Did it produce a similar number of idioms? [19:24:33] No. The list is way smaller. [19:24:52] I think that gist was from the full idiom set. Aha! Did you have a PR with the way smaller set? I'm sorry I'm trying to remember where we left off :) [19:25:11] From the gist you shared, can you take a look at the time it takes to run the features without idioms? [19:25:21] Similar to: [19:25:24] [f for f in enwiki.wp10 if not "idiom" in str(f)] [19:25:27] But: [19:25:33] [f for f in enwiki.wp10 if "idiom" in str(f)] [19:26:29] Let's see how long it takes to run that [19:27:00] And compare to the time it takes to run idioms extraction using "english.idioms.revision.matches" directly [19:29:01] To your question, no I have not opened a PR with the smaller set yet. [19:30:33] Oh! Yes. I can do that. I'll update the script and run it. [19:30:51] Great. [19:34:10] haksoat, I updated https://gist.github.com/halfak/656d4370b4583c2bd2bbb6836c4008b2 [19:34:34] Note that this is with the limited list we got from the idioms category. I'd be really curious what performance looks like if we work with idioms from the table. [19:40:53] Okay. I'll send a gist with the list from the table. [19:41:02] To be clear, you are referring to this: https://en.wiktionary.org/wiki/Appendix:Glossary_of_idioms_%E2%80%93_A [19:41:04] Right? [19:41:57] If that is the one you are referring to, here's the list: https://gist.github.com/HAKSOAT/bc49d2cb84f56d1163b54cae5f5ff9ce [19:42:07] Yes. [19:42:09] Oh cool. [19:42:18] A lot less! [19:42:33] Looks like we might want to remove the parens. [19:42:40] E.g. "bee in one's bonnet (to have)" [19:43:30] Seems like we want to replace "one's" with (his|her|their) [19:44:08] True. I'll do that. [19:50:53] I've updated it halfak [19:51:21] chtnnh, I got a start on adapting this to ptwiki: https://quarry.wmflabs.org/query/43197 [19:51:29] haksoat, the PR or the gist? [19:51:44] Aha! I see in the gist [19:51:52] let me just check that out quickly halfak [19:52:34] The gist [19:53:34] I've not opened a PR. Since there's nothing concrete yet. [20:05:01] I will need to brush up on my SQL XD [20:05:20] Lol [20:22:40] 10Jade, 10Scoring-platform-team (Current), 10MW-1.35-notes (1.35.0-wmf.25; 2020-03-24): Clicking "Endorsements (#)" should expand/contract the set of endorsements. - https://phabricator.wikimedia.org/T247452 (10ACraze) [21:33:49] 10Jade, 10Scoring-platform-team (Current): Edit comment text is missing from "setpreference" actions in Jade - https://phabricator.wikimedia.org/T247456 (10ACraze) a:03ACraze [22:21:13] 10Jade, 10Scoring-platform-team (Current), 10Patch-For-Review: Edit comment text is missing from "setpreference" actions in Jade - https://phabricator.wikimedia.org/T247456 (10ACraze) Looks like the `SetPreference` api module was missing the comments param. I fixed that and also exposed the comment in the re...