[13:35:25] Jade, Scoring-platform-team (Current): [Spike] What facilities are available to us when rendering edit comments? - https://phabricator.wikimedia.org/T250723 (kevinbazira) I ended up using these two hooks to [[ https://phabricator.wikimedia.org/T247457 | render edit comments in Jade ]].
[13:35:26] [1] https://meta.wikimedia.org/wiki/https://phabricator.wikimedia.org/T247457
[13:36:51] o/
[13:36:58] Jade, Scoring-platform-team (Current), Patch-For-Review: Render edit comments in Jade - https://phabricator.wikimedia.org/T247457 (kevinbazira)
[13:39:09] Jade, Scoring-platform-team: Apply i18n to rendered edit comments in Jade - https://phabricator.wikimedia.org/T250721 (kevinbazira) a:kevinbazira
[13:39:37] Jade, Scoring-platform-team (Current): Apply i18n to rendered edit comments in Jade - https://phabricator.wikimedia.org/T250721 (kevinbazira)
[13:47:37] Jade, Scoring-platform-team (Current), CommRel-Specialists-Support (Jan-Mar-2020): Design Jade pilot deployment plan with the Scoring Platform team - https://phabricator.wikimedia.org/T246486 (Johan) (The deployment schedule for this has been pushed back, as [[ https://wikitech.wikimedia.org/wiki/Dep...
[13:49:07] Jade, Scoring-platform-team (Current), CommRel-Specialists-Support (Apr-Jun-2020): Design Jade pilot deployment plan with the Scoring Platform team - https://phabricator.wikimedia.org/T246486 (Elitre)
[13:54:11] * halfak picks up some papers on software platform design.
[14:20:33] Jade, Scoring-platform-team (Current): [Spike] What facilities are available to us when rendering edit comments? - https://phabricator.wikimedia.org/T250723 (Halfak) Open→Resolved It looks like this spike is complete so I'm resolving. Please revert if I'm jumping the gun.
[14:20:38] Jade, Scoring-platform-team (Current), Patch-For-Review: Render edit comments in Jade - https://phabricator.wikimedia.org/T247457 (Halfak)
[14:44:17] Scoring-platform-team, CommRel-Specialists-Support: Outreach campaign to raise awareness of Scoring Platform - https://phabricator.wikimedia.org/T217232 (Elitre)
[15:38:07] hello halfak
[15:38:13] 1489 1
[15:38:13] 1497 2
[15:38:13] 1494 3
[15:38:13] 1489 4
[15:38:13] 1210 5
[15:38:14] 1395 6
[15:38:34] these are the numbers in the labeled_revision with text
[15:41:10] seems weird that 5 and 6 are significantly lower than the rest
[15:41:47] What about the dataset before that?
[15:41:57] chtnnh, ^
[15:42:07] let me check
[15:43:43] 1500 1
[15:43:44] 1500 2
[15:43:44] 1500 3
[15:43:44] 1500 4
[15:43:44] 1224 5
[15:43:44] 1416 6
[15:44:12] this is the balanced dataset
[15:44:49] halfak ^
[15:45:03] Aha! Something went wrong there, I think.
[15:45:09] Maybe the dataset before that?
[15:47:35] 271064 ?
[15:47:35] 143916 1
[15:47:35] 31885 2
[15:47:35] 5016 3
[15:47:35] 1731 4
[15:47:36] 1224 5
[15:47:38] 1416 6
[15:47:42] looks like nothing's wrong here then
[15:47:45] halfak ^
[15:47:47] yep.
[15:47:52] Looks wrong to me. That's old data!
[15:48:06] Note that there are few observations for 5 and 6
[15:48:11] Let me help get you an updated dataset.
[15:48:20] it collects the data from an xml dump if I'm correct
[15:48:26] sure!
[15:50:40] Copy over /home/halfak/projects/articlequality/datasets/ptwiki.labelings.20200301.json
[15:50:44] It is updated.
[15:52:36] 145585 1
[15:52:36] 32694 2
[15:52:36] 6088 3
[15:52:36] 2229 4
[15:52:36] 1553 5
[15:52:37] 1484 6
[15:52:42] new numbers halfak ^
[15:53:24] should I rebuild the model?
[15:53:49] Yes.
[15:53:50] Looks good.
[15:53:57] I need to go offline for a reboot. BRB
[16:18:49] Hey! Hopefully I can keep my connection for a bit.
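(Editor's sketch, not from the channel.) The per-label counts pasted above look like `sort | uniq -c` output, and the balanced numbers (1500 for classes 1 to 4, but 1224 and 1416 for classes 5 and 6) are what you would get from capping each class at 1500 when the rare classes have fewer observations than the cap. A minimal Python sketch of both steps follows. It assumes the labelings file holds one JSON object per line with the class stored under a field named `label`; that field name and both function names are hypothetical, and this is not the actual articlequality pipeline.

```python
import json
import random
from collections import Counter, defaultdict


def count_labels(lines, field="label"):
    """Count observations per class in JSON-lines input.

    `field` is a hypothetical name; adjust it to the real dataset schema.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines
        counts[str(json.loads(line)[field])] += 1
    return counts


def balance(observations, n_per_class, field="label", seed=0):
    """Sample at most `n_per_class` observations from each class.

    Classes with fewer than `n_per_class` observations are kept whole,
    which is why rare classes stay below the cap after balancing.
    """
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for obs in observations:
        by_label[str(obs[field])].append(obs)
    sample = []
    for label in sorted(by_label):
        group = by_label[label]
        sample.extend(rng.sample(group, min(n_per_class, len(group))))
    return sample


# Toy data standing in for the real labelings file.
toy_lines = [json.dumps({"label": c}) for c in (1, 1, 1, 2, 2, 3)]
toy_counts = count_labels(toy_lines)
for label in sorted(toy_counts):
    # Same "count label" layout as the numbers pasted in the channel.
    print(toy_counts[label], label)

toy_balanced = balance([json.loads(line) for line in toy_lines], n_per_class=2)
print(len(toy_balanced), "observations after balancing")
```

Comparing the counts before and after balancing is a quick way to spot a stale input: a rare class can never gain observations from balancing, so a class that sits below the cap reflects the size of the input dataset rather than a sampling bug.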
[16:53:26] Scoring-platform-team, articlequality-modeling, artificial-intelligence: Add text complexity scoring to article quality models - https://phabricator.wikimedia.org/T246438 (Halfak)
[16:53:56] ORES, Scoring-platform-team (Current), artificial-intelligence: Review model performance for ptwiki 'articlequality' and 'draftquality' - https://phabricator.wikimedia.org/T250809 (Halfak)
[16:54:00] ORES, Scoring-platform-team (Current), artificial-intelligence: Review model performance for ptwiki 'articlequality' and 'draftquality' - https://phabricator.wikimedia.org/T250809 (Halfak) a:Chtnnh
[16:54:36] Scoring-platform-team (Current), artificial-intelligence: Add `words_to_watch` to articlequality and draftquality models in ptwiki - https://phabricator.wikimedia.org/T251171 (Halfak)
[16:57:33] Scoring-platform-team, articlequality-modeling, artificial-intelligence: Text fetched by articlequality's `fetch_text` might not match the talk page label (for moved pages) - https://phabricator.wikimedia.org/T251608 (Halfak) I'm really not sure how we could track this better. Do you have some sugge...
[16:58:26] Scoring-platform-team, articlequality-modeling, artificial-intelligence: Text fetched by articlequality's `fetch_text` might not match the talk page label (for moved pages) - https://phabricator.wikimedia.org/T251608 (Halfak) p:Triage→Low
[17:00:27] Scoring-platform-team, Wikilabels, articlequality-modeling, artificial-intelligence: Build article quality model for Ukrainian Wikipedia - https://phabricator.wikimedia.org/T251571 (Halfak) p:Triage→Medium
[17:03:03] ORES, Scoring-platform-team: ORES Beta startup errors not being routed to our app logging. - https://phabricator.wikimedia.org/T250712 (Halfak) p:Triage→Low
[17:03:33] Scoring-platform-team, articlequality-modeling, artificial-intelligence: Add text complexity scoring to article quality models - https://phabricator.wikimedia.org/T246438 (Halfak) p:Triage→Low
[17:04:22] ORES, Scoring-platform-team: Allow browser caching of ORES responses - https://phabricator.wikimedia.org/T251004 (Halfak)
[17:06:04] ORES, Scoring-platform-team: Allow browser caching of ORES responses - https://phabricator.wikimedia.org/T251004 (Halfak) p:Triage→Lowest
[17:07:39] Scoring-platform-team, Continuous-Integration-Config: CI should check to see if our wheels are good - https://phabricator.wikimedia.org/T250746 (Halfak) I think we'd just need to run the `pip install wheels/*.whl --no-deps` command and see if it exits successfully.
[17:08:33] Scoring-platform-team, Continuous-Integration-Config: CI should check to see if our wheels are good - https://phabricator.wikimedia.org/T250746 (Halfak) p:Triage→Medium
[17:12:58] Jade, Scoring-platform-team, Documentation: Write data consumer documentation for Jade - https://phabricator.wikimedia.org/T235280 (Halfak) p:Medium→High
[17:13:10] Jade, Scoring-platform-team, Documentation, User-srodlund: Review and improve mw:Jade - https://phabricator.wikimedia.org/T206150 (Halfak) @srodlund, still looking at this? I've done a few editing passes since we filed this. Could resolve. Could use your review and iteration if you see fit. L...
[17:14:21] Scoring-platform-team, drafttopic-modeling: Follow-up cleanup to topic models - https://phabricator.wikimedia.org/T246909 (Halfak) p:High→Medium Moving this to "Medium" because it seems getting more topic models has higher priority than this.
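(Editor's sketch, not from the channel.) The wheel check proposed in T250746 above amounts to running `pip install wheels/*.whl --no-deps` and looking at the exit status. One hedged way to wrap that in Python is shown below. The function names and the default `wheels` directory argument are hypothetical, and treating an empty directory as a failure is an extra assumption that the task comment does not state.

```python
import glob
import os
import subprocess
import sys


def build_install_command(wheel_dir="wheels"):
    """Return (command, wheels) for installing every wheel in `wheel_dir`.

    The glob is expanded here because no shell is involved when the
    command is later passed to subprocess as a list.
    """
    wheels = sorted(glob.glob(os.path.join(wheel_dir, "*.whl")))
    command = [sys.executable, "-m", "pip", "install", *wheels, "--no-deps"]
    return command, wheels


def check_wheels(wheel_dir="wheels"):
    """Return True if pip installs all wheels and exits with status 0."""
    command, wheels = build_install_command(wheel_dir)
    if not wheels:
        # Assumption: a directory without wheels counts as a failed check.
        return False
    return subprocess.run(command).returncode == 0
```

A CI job could call `check_wheels()` inside a throwaway virtualenv and fail the build when it returns `False`. Running it in a fresh environment matters, since pip would otherwise install into whatever environment the job happens to use.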
[17:18:32] ORES, Scoring-platform-team, Patch-For-Review: Review prometheus ORES rules for completeness - https://phabricator.wikimedia.org/T233448 (Halfak) a:Halfak
[17:18:52] ORES, Scoring-platform-team (Current), Patch-For-Review: Review prometheus ORES rules for completeness - https://phabricator.wikimedia.org/T233448 (Halfak)
[17:20:26] Scoring-platform-team, ORES-Support-Checklist: Document and share operational details of ores-support-checklist - https://phabricator.wikimedia.org/T222271 (Halfak) Essentially, we need to write docs on the sequence of commands to run when we want to re-deploy with new code or restart the system.
[17:23:17] Scoring-platform-team (Current), articlequality-modeling, artificial-intelligence: Create new labeling campaign for Basque Wikipedia articlequality - https://phabricator.wikimedia.org/T215351 (Halfak) Open→Resolved a:Halfak
[17:25:15] ORES, Scoring-platform-team, Scap: Investigate: why are we getting overload errors during ORES deployments? - https://phabricator.wikimedia.org/T213116 (Halfak) We are getting overload errors because the celery workers stop consuming tasks for a moment and it takes celery about 2 minutes to go from o...
[17:25:38] ORES, Scoring-platform-team, Scap: Document delay between canary and continued deployment to minimize overload during restarts - https://phabricator.wikimedia.org/T213116 (Halfak)
[17:26:27] so groomed
[20:56:56] Scoring-platform-team, drafttopic-modeling: Compress Gensim models with term hashing - https://phabricator.wikimedia.org/T247523 (Pavol86) according to Medium article (but this issue is common on forums) : Main RAM issues of fasttext are: 1. "binary model carries not only weights for words and n-grams,...
[21:45:38] Scoring-platform-team, drafttopic-modeling: Compress Gensim models with term hashing - https://phabricator.wikimedia.org/T247523 (Halfak) I'm not sure if we can use bigrams with our current pipeline. We could extend our pipeline to produce bigrams though. I'm not sure how this makes the model smaller t...
[22:15:00] posting our async update notes --
[22:15:11] halfak-
[22:15:13] Last week: I did a lot of work to set up volunteers for ptwiki work. Helder and chtnnh are experimenting with building models. I also spent a bunch of time working with my weird audio issues. Seems to be OK now. I did some work on the ORES paper. I'm trying to become more articulate about product platform decoupling and brush up on the software architecture lit. I also did some design work for
[22:15:15] Jade and I did some work for the SWE panel.
[22:15:17] T: Reviewing SWE tasks and doing an interview. I am helping chtnnh with a data pipeline issue that appeared out of nowhere (seemingly). I'll otherwise continue my reading on software architecture strategies and hopefully get some of my own writing together today.
[22:15:30] kevinbazira-
[22:15:32] Last Week:
[22:15:34] Focused on the spike task about what facilities are available to us when rendering edit comments: https://phabricator.wikimedia.org/T250723
[22:15:36] Worked on localizing the first and second part of the edit comments on the Jade history page: https://phabricator.wikimedia.org/T247457
[22:15:38] T:
[22:15:40] Made comment prefixes bold
[22:15:42] Used '''wikitext''' to make comment prefixes on the Jade history page bold.
[22:15:44] This has been done for comment prefixes of all the 8 Jade actions that create, update or delete.
[22:15:53] haksoat-
[22:15:55] Last week:
[22:15:57] Worked extensively on understanding regex engines, especially the NFA which Python and Java use. I got to play around with the existing regex and got performance improvements.
[22:15:59] Today:
[22:16:01] Start reading up tips on writing efficient regex from the material I am studying from, work on the English Idioms task on revscoring.
[22:16:03] and me-
[22:16:16] Last week: mostly cleaning up the WIP patchset for re-enabling the db hooks for Jade in hopes to deploy to beta next week.
[22:16:18] T: I've got Jenkins passing, although I have some test failures locally, will continue to debug and get the Jade db hooks patchset ready to go