[02:58:48] 10ORES, 10Scoring-platform-team, 10Analytics, 10Analytics-Cluster, 10Research: Desired packages to be installed/upgraded on the PySpark cluster (jupyterhub) - https://phabricator.wikimedia.org/T249078 (10diego) [05:40:10] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10Chtnnh) @Halfak Why is the sum of all articles in all classes so low? Doesn't the ptwiki have more than a million... [06:07:28] 10ORES, 10Scoring-platform-team, 10Analytics, 10Analytics-Cluster, 10Research: Desired packages to be installed/upgraded on the PySpark cluster (jupyterhub) - https://phabricator.wikimedia.org/T249078 (10elukey) Hey Diego, not sure if you have seen https://docs.google.com/document/d/1r-oqMXViWvQCqsYz0qze... [06:45:34] 10ORES, 10Scoring-platform-team, 10Analytics, 10Analytics-Cluster, 10Research: Desired packages to be installed/upgraded on the PySpark cluster (jupyterhub) - https://phabricator.wikimedia.org/T249078 (10diego) Hey @elukey . Thanks for sharing, @Ottomata has talked about the general idea, but I was not... [09:17:39] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10GoEThe) >>! In T246663#6016245, @Halfak wrote: > OK I think I got lost a bit earlier. Let me try checking my ass... [09:19:25] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10GoEThe) >>! In T246663#6017423, @Chtnnh wrote: > @Halfak Why is the sum of all articles in all classes so low? Do... [09:25:41] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10GoEThe) >>! In T246663#6016890, @Halfak wrote: > I re-ran the extractor with these new assumptions and got: > >... [09:54:23] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10Chtnnh) >>! In T246663#6017841, @GoEThe wrote: >>>! In T246663#6017423, @Chtnnh wrote: >> @Halfak Why is the sum... [11:05:52] 10ORES, 10Scoring-platform-team, 10Growth-Team, 10PageCuration: Add ORES topic prediction to the NewPagesFeed - https://phabricator.wikimedia.org/T218132 (10kostajh) > would it be possible for me to work on this now? It seems straightforward @Chtnnh sure, please go ahead if you would like to work on it. [11:18:14] 10MediaWiki-extensions-ORES, 10Scoring-platform-team, 10Edit-Review-Improvements-RC-Page, 10Growth-Team, 10MediaWiki-Recent-changes: Expose ORES topics in recent changes filters - https://phabricator.wikimedia.org/T245906 (10kostajh) @Chtnnh if you're interested to work on this and you have questions, le... [12:03:15] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10He7d3r) See also: {https://phabricator.wikimedia.org/T138726} [12:23:26] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10Chtnnh) Thank you @He7d3r! That clarifies the template situation completely [12:38:22] 10ORES, 10Scoring-platform-team, 10Analytics, 10Analytics-Cluster, 10Research: Desired packages to be installed/upgraded on the PySpark cluster (jupyterhub) - https://phabricator.wikimedia.org/T249078 (10Ottomata) [13:09:58] 10MediaWiki-extensions-ORES, 10Scoring-platform-team, 10Edit-Review-Improvements-RC-Page, 10Growth-Team, 10MediaWiki-Recent-changes: Expose ORES topics in recent changes filters - https://phabricator.wikimedia.org/T245906 (10Chtnnh) Great! T240558 is not yet done so how do you suggest I go about this t... [15:04:18] hello halfak [15:04:35] im currently working on the Makefile for draftquality [15:04:45] trying to build the pipeline for ptwiki [15:05:28] I was wondering if you need me to build the *big dataset* section of the makefile for ptwiki too? [15:05:58] and what are the factors I should consider while deciding what the datasets should look like? [15:12:17] I think the pipeline can start with the output of the query we have on quarry. [15:12:40] If you read the enwiki part of the makefile from top to bottom, it should roughly show you what you need to do. [15:12:54] We'll need to replicate the feature_list for ptwiki [15:19:13] yes i see that [15:19:34] I think we dont need the big dataset part as is because that builds the sql query for enwiki [15:19:55] what I am currently doing is building the part that will ready the datasets and tuning reports [15:20:02] let me show you a little bit [15:21:46] https://gist.github.com/chtnnh/c41256a35afaa430e1af2312c1e5f086 [15:26:32] I don't think you need to convert the tsv to json. [15:26:40] So like 266 doesn't make too much sense. [15:27:06] the second rule (line 271) looks right. [15:27:13] It down-samples "OK" examples. [15:27:38] We'll want to use wget to get the ptwiki sample. [15:27:49] yes [15:28:03] See an example of wget here: https://github.com/wikimedia/editquality/blob/master/Makefile#L118 [15:28:05] is there an API for quarry? [15:28:16] Can you link me to the query you are pulling from? [15:28:33] https://quarry.wmflabs.org/query/43197 [15:29:13] okay i see what line 266 should look like [15:34:14] https://quarry.wmflabs.org/run/43197/output/0/json [15:34:23] this link doesnt seem to be the data we want to pull [15:34:29] can you please check this out [15:38:45] Download this link: https://quarry.wmflabs.org/run/444292/output/0/json-lines [15:39:12] You can find this under the "Download data" dropdown just above the output list. [15:46:20] let me check that out [15:46:24] halfak: this may be interesting to you: https://wikiconference.org/wiki/2019/Grants/Classifying_Actors_on_Talk_Pages [15:48:00] Hi hare! I've. Been talking to Carlin. Didn't realize he'd posted a proposal. [15:49:18] Thanks for the pointer. [15:55:42] okay senpai i have downloaded the json file [16:16:10] https://gist.github.com/chtnnh/c41256a35afaa430e1af2312c1e5f086 [16:18:43] typo: https://gist.github.com/chtnnh/c41256a35afaa430e1af2312c1e5f086#file-makefile-L276 [16:18:55] Otherwise it looks great! [16:19:04] chtnnh, ^ [16:19:13] I need to run an errand. Back in ~45. [17:00:03] back! [17:01:53] hare, dropped a question on the talk page on that proposal. Thanks for bringing it to my attention. [17:12:04] async update! [17:12:16] Y: I mostly caught up on emails and worked a bit with chtnnh. I did some finalizing of haksoats' plan to work with us starting in a couple of weeks and I cleaned up some tasks in the phab board. I also offboarded Nikhil as it looks like he won't be able to help us explore Jade for now. [17:12:30] halfak: [17:12:40] T: I've got a few outstanding research tasks that I want to handle. The ORES Systems paper needs some attention and I have an outstanding review for Transactions on Social Computing. Otherwise, I'll be supporting chtnnh's work on ptwiki. [17:12:56] accraze: [17:13:07] Y: Continued work on fixing naming conflicts for Jade 2ndary integrations [17:13:17] T: More of the same, got 4 classes finished, have 4 more to go [17:13:32] me: [17:13:48] Y: [17:13:50] Reviewed Andy's patchsets [17:13:50] - 584670 (Rename ProposalLinkTable -> EntityLinkTable) [17:13:50] - 584678 (Rename ProposalLinkTableHelper -> EntityLinkTableHelper) [17:13:50] Pivoted the proposed Jade toast to stop relying on OOUI Dialog and rely on OOUI Element instead. [17:13:50] - Problem: [17:13:52] > I noticed the OOUI Dialog displays elements (like the dialog header and footer) on the page that we shall not need. [17:13:55] > When activated, the Dialog content frame also covers the content below it which would be intrusive to user actions. [17:13:58] - Solution: [17:14:00] > The OOUI Element is light weight on the page, doesn't cover content and is much easier to position. [17:14:03] T: [17:14:05] Reviewed Andy's patchsets [17:14:08] - 585037 (Rename ProposalPageWikitextRenderer -> EntityPageWikitextRenderer) [17:14:09] - 585048 (Rename ProposalSummarizer -> EntitySummarizer) [17:14:11] I have been digging into mw-notification-area toast that Aaron shared with me comparing it with the Jade toast element I had created. [17:14:14] https://github.com/wikimedia/mediawiki-debian/blob/025bd8113a281ec6052ce81dccbf3c6038f45f5e/resources/src/mediawiki/mediawiki.notification.js [17:14:17] I'm yet to understand how extensions can utilize mw-notification-area toast before we roll our own Jade toast element. [17:14:29] Have a great day everyone 👋 [17:50:07] 10Jade, 10Scoring-platform-team (Current), 10MW-1.35-notes (1.35.0-wmf.27; 2020-04-07): Clean up naming conflicts around writing secondary schema data for Jade - https://phabricator.wikimedia.org/T235003 (10ACraze) [18:44:32] 10MediaWiki-extensions-ORES, 10Scoring-platform-team, 10Edit-Review-Improvements-RC-Page, 10Growth-Team, 10MediaWiki-Recent-changes: Expose ORES topics in recent changes filters - https://phabricator.wikimedia.org/T245906 (10Chtnnh) a:03Chtnnh Claiming the task to clear any confusion regarding work. I... [19:13:26] 10Scoring-platform-team (Current), 10Wikilabels, 10articlequality-modeling, 10artificial-intelligence: Build article quality model for ptwikipedia - https://phabricator.wikimedia.org/T246663 (10Halfak) Here's some new results just looking at the first number in the templates: 19 "0" 147170 "1"... [20:52:57] o/ xinbenlv [20:53:00] No meeting today?