[07:53:08] 10Scoring-platform-team (Research), 10Outreach-Programs-Projects, 10Google-Summer-of-Code (2020), 10artificial-intelligence: Proposal (GSoC 2020): Implement an NSFW image classifier with open_nsfw - https://phabricator.wikimedia.org/T247614 (10Chtnnh) > Brion Vibber certainly does, and he designed the sch... [09:26:56] 10Scoring-platform-team, 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for bswiki - https://phabricator.wikimedia.org/T194509 (10Aklapper) @Srdjan_m: Hi! This task has been assigned to you a while ago. Could you maybe share an update? Do you still plan to w... [12:15:19] o/ [12:33:53] 10Scoring-platform-team (Current), 10Wikilabels, 10editquality-modeling, 10artificial-intelligence: Create follow-up edit quality campaign for ptwikipedia - https://phabricator.wikimedia.org/T246668 (10Halfak) I think it would be valuable to update our dataset with new labels. This will let us check on th... [12:36:53] 10Scoring-platform-team (Current), 10Wikilabels, 10editquality-modeling, 10artificial-intelligence: Create follow-up edit quality campaign for ptwikipedia - https://phabricator.wikimedia.org/T246668 (10GoEThe) I think it sounds reasonable. I can recruit some Wikipedians to do this on the Village Pump and o... [12:42:24] 10Scoring-platform-team (Current), 10Wikilabels, 10editquality-modeling, 10artificial-intelligence: Create follow-up edit quality campaign for ptwikipedia - https://phabricator.wikimedia.org/T246668 (10Halfak) OK I'll get a campaign loaded to wiki labels and will ping here when it is ready. Oh! In the me... [12:48:14] 10Scoring-platform-team (Current), 10Wikilabels, 10editquality-modeling, 10artificial-intelligence: Create follow-up edit quality campaign for ptwikipedia - https://phabricator.wikimedia.org/T246668 (10GoEThe) "Amostragem da qualidade de 4 mil edições (2020)" seems a reasonable translation to Portuguese. [13:26:49] 10Jade, 10Scoring-platform-team (Current): Label notes field should autosize with a max of 10 rows - https://phabricator.wikimedia.org/T247463 (10Halfak) a:03kevinbazira [13:28:23] 10Jade, 10Scoring-platform-team: Using the mouse doesn't for setting cursor position in label notes/edit comment editing - https://phabricator.wikimedia.org/T248470 (10Halfak) [13:34:53] 10Jade, 10Scoring-platform-team: Render usernames in Jade edit comments. - https://phabricator.wikimedia.org/T248135 (10Halfak) Right now, we have "id 10" stored as text in the `comment` table. We'd like to be rendered on the history page like: [EpochFail](https://en.wikipedia.org/wiki/User:EpochFail). Tha... [13:53:27] 10Jade, 10Scoring-platform-team (Current), 10Patch-For-Review: Label notes field should autosize with a max of 10 rows - https://phabricator.wikimedia.org/T247463 (10kevinbazira) I have added autosize and maxRows to label notes [14:39:40] 10Scoring-platform-team, 10Discovery-Search, 10Elasticsearch, 10revscoring, 10artificial-intelligence: Improve the performance and quality of tokenization in revscoring - https://phabricator.wikimedia.org/T248480 (10Halfak) [14:48:07] 10Scoring-platform-team, 10Discovery-Search, 10Elasticsearch, 10revscoring, 10artificial-intelligence: Improve the performance and quality of tokenization in revscoring - https://phabricator.wikimedia.org/T248480 (10Halfak) [14:48:13] 10Scoring-platform-team, 10Discovery-Search, 10Elasticsearch, 10revscoring, 10artificial-intelligence: Improve the performance and quality of tokenization in revscoring - https://phabricator.wikimedia.org/T248480 (10Halfak) In the past, @TJones gave a second-hand recommendation for TextBlob. https://tex... [14:48:17] 10Scoring-platform-team, 10Discovery-Search, 10Elasticsearch, 10revscoring, 10artificial-intelligence: Improve the performance and quality of tokenization in revscoring - https://phabricator.wikimedia.org/T248480 (10Halfak) Also here's some notes from our past conversations around NLP and ORES: https://e... [14:48:19] 10Scoring-platform-team, 10Discovery-Search, 10Elasticsearch, 10revscoring, 10artificial-intelligence: Improve the performance and quality of tokenization in revscoring - https://phabricator.wikimedia.org/T248480 (10Halfak) I'm currently running a profiling script against feature extraction for ORES Engl... [15:55:15] 10Scoring-platform-team, 10Outreach-Programs-Projects, 10Google-Summer-of-Code (2020), 10artificial-intelligence: Proposal (GSoC 2020): Implement articlequality model for ptwiki - https://phabricator.wikimedia.org/T247847 (10Chtnnh) @Darwinius Hello! Do you think you would be able to help us out with this... [18:02:23] 10Scoring-platform-team, 10Outreach-Programs-Projects, 10Google-Summer-of-Code (2020), 10artificial-intelligence: Proposal (GSoC 2020): Implement articlequality model for ptwiki - https://phabricator.wikimedia.org/T247847 (10Halfak) Answering question with a 1-2 day lag would be perfectly acceptable. I th... [18:39:07] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10Halfak) When I ran this code, I got the following label counts: ` $ cat ptwiki.labelings.20200301.json | json2ts... [18:58:54] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10Halfak) I just grabbed https://pt.wikipedia.org/w/index.php?title=SS_Edmund_Fitzgerald from the home page and it... [18:59:12] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10Halfak) Aha! I just checked https://pt.wikipedia.org/w/index.php?title=Discuss%C3%A3o:Anarquismo and it has the... [19:00:22] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10GoEThe) No, it is not automatically generated. I believe only when there is a '?' is the classification automatic... [19:18:09] 10Scoring-platform-team, 10Wikilabels, 10editquality-modeling, 10artificial-intelligence: Build draft quality model for ptwikipedia - https://phabricator.wikimedia.org/T246667 (10Halfak) From https://quarry.wmflabs.org/query/43261 we can see that we're able to track about 19k deleted articles from the past... [21:23:33] Hey folks! I'm posting our async updates on the way out the door. [21:23:53] kevinbazira: [21:23:54] Y: Moved "alternatives V" to the top of the list of alternative labels [21:23:54] - I used the preferredProposal / nonPreferredProposals strategy - similar to Andy's proposal but using a single ProposalListWidget instead of two separate ones. [21:23:54] T: Reviewed Andy's patchset [21:23:54] 583174 (Update FacetWidget & FacetListWidget docs) [21:23:55] 583173 (Update jade.widgets module doc) [21:23:57] Added autosize and maxRows to label notes [21:23:59] - Since a user can't resize the textarea, at least the textarea should autosize when the user is adding new lines so that they can be able to see their text without having to scroll much. [21:24:08] accraze: [21:24:09] Y: Worked on improving Jade UI docs using JSDoc syntax, reviewed kevinbazira's patchset for moving the alternative toggle button and continue fixing naming conflicts [21:24:09] T: More of the same (JSDocs, code review, naming conflicts) [21:24:20] And me! [21:24:21] Y: Did an interview for the VP of DS. I did some writing on the ORES systems paper. I did some work with chtnnh and haksoat. I'm mostly applying their code and reporting on results. But chtnnh and I also did some work on querying ptwiki. [21:24:22] T: Continued work on the ptwiki queries. I started work on a new contract request. I talked to Johan about the Jade pilot and I'll need to consult with releng before we can settle on a deployment plan. I also did some documentation work for out upcoming tuning session. I'm on vacation until Tuesday. [21:25:19] Oh I forgot that I also wrote a project description for exploring ElasticSearch/Lucene's tokenization strategy and figuring out where we can use it or otherwise share code. [21:25:30] Now I'm on vacation until Tuesday. Take care all! [21:32:49] later halfak!