[02:29:03] Data-Engineering, Data-Platform-SRE: Conda-Analytics environments are prone to dependency conflicts and installation errors - https://phabricator.wikimedia.org/T423067 (nshahquinn-wmf) NEW
[02:30:44] Data-Engineering, Data-Platform-SRE: Conda-Analytics environments stuck with very outdated packages - https://phabricator.wikimedia.org/T423052#11812520 (nshahquinn-wmf)
[02:40:00] Data-Engineering, Data-Platform-SRE: Conda-Analytics environments are prone to dependency conflicts and installation errors - https://phabricator.wikimedia.org/T423067#11812524 (nshahquinn-wmf)
[07:28:37] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Dumps-Generation: when analyzing a Wikifunctions dump, parent_id in page creation revisions is sometimes 0 and sometimes None - https://phabricator.wikimedia.org/T420974#11812777 (APizzata-WMF) >1. Emit 0 instead of null on page_change events for rev_p...
[08:42:33] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Dumps-Generation: Data missing from en.wiktionary.org February 2026 "MediaWiki Content File Exports" compared to "XML Database dump" - https://phabricator.wikimedia.org/T417596#11813084 (APizzata-WMF) The query: ` spark.sql(f""" select page...
[08:43:02] !log Test Kitchen edge-unique experiments (poll 105456) - adds: logged-out-retention-round7; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD
[08:43:04] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[08:43:43] !log Test Kitchen edge-unique experiments (poll 105458) - adds: none; removes: none; fields: logged-out-retention-round7 - xLab/MPIC/TK tips at https://w.wiki/FwuD
[08:43:44] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[09:21:02] Data-Engineering (Q4 FS25/26 April 1st - June 30st), AQS2.0: Consider updating our heuristics for media type classification in AQS / wikistats - https://phabricator.wikimedia.org/T419882#11813437 (GGoncalves-WMF) Thanks! If I understand correctly, we have 3 possible approaches: 1. Overwrite the `media_c...
[09:49:33] Data-Engineering, Data-Engineering-Radar, Data-Platform, Growth-Team, and 5 others: Image Suggestions uses AI-generated images from Commons when adding images on English Wikipedia - https://phabricator.wikimedia.org/T422513#11813538 (Michael) This is something that needs to happen in the pipeline...
[10:07:30] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Essential-Work: Perform a one-time clean up of retained data sets in event_sanitize - https://phabricator.wikimedia.org/T417694#11813575 (phuedx) Re. the tables in the "From other prior cleanup tasks" table: **General** IIRC I tidied those instrument...
[11:47:07] Data-Engineering, Data-Engineering-Radar, Data-Platform, Growth-Team, and 5 others: Image Suggestions uses AI-generated images from Commons when adding images on English Wikipedia - https://phabricator.wikimedia.org/T422513#11813945 (mfossati) @Ahoelzl TL;DR: not easy. Besides a tag that my form...
[11:52:30] Data-Engineering, Data-Engineering-Radar, Data-Platform, Growth-Team, and 5 others: Image Suggestions uses AI-generated images from Commons when adding images on English Wikipedia - https://phabricator.wikimedia.org/T422513#11813956 (mfossati) >>! In T422513#11813943, @mfossati wrote: > IIRC thos...
[12:36:42] !log Test Kitchen edge-unique experiments (poll 106150) - adds: we-1-8-account-creation-form-v1; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD
[12:36:44] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[12:42:03] Data-Engineering, Data-Platform-SRE (2026-03-27 - 2026-04-17), Event-Platform, good first task: Flink base image should not install into system python environment - https://phabricator.wikimedia.org/T418525#11814102 (atsuko) a:atsuko
[13:39:29] Data-Engineering, DBA, Schema-change-in-production: Drop SecurePoll tables from closed wikis - https://phabricator.wikimedia.org/T423128 (Dreamy_Jazz) NEW
[13:40:07] Data-Engineering, DBA, Schema-change-in-production: Drop SecurePoll tables from closed wikis - https://phabricator.wikimedia.org/T423128#11814311 (Dreamy_Jazz)
[13:41:52] Data-Engineering, DBA, Schema-change-in-production: Drop SecurePoll tables from closed wikis - https://phabricator.wikimedia.org/T423128#11814317 (Marostegui) p:Triage→Medium a:Marostegui
[13:47:44] Data-Engineering, DBA, Schema-change-in-production: Drop SecurePoll tables from closed wikis - https://phabricator.wikimedia.org/T423128#11814359 (Dreamy_Jazz)
[13:49:06] Data-Engineering, DBA, Schema-change-in-production: Drop SecurePoll tables from closed wikis - https://phabricator.wikimedia.org/T423128#11814366 (Dreamy_Jazz)
[13:51:54] Data-Engineering, DBA, Schema-change-in-production: Drop SecurePoll tables from closed wikis - https://phabricator.wikimedia.org/T423128#11814378 (Dreamy_Jazz)
[13:56:35] Data-Engineering, Data-Platform-SRE, Java-Scala-Standardization, Discovery-Search (2026.04.06 - 2026.05.01), and 2 others: Migrate existing Java packages to deploying to Gitlab, including new version of parent pom, validation that all dependencies ... - https://phabricator.wikimedia.org/T367405#11814415
[14:30:13] !log Test Kitchen edge-unique experiments (poll 106488) - adds: none; removes: synth-aa-test-traffic-impact-2, synth-aa-test-traffic-impact-1, synth-aa-test-traffic-impact-3; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD
[14:30:15] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[15:17:34] !log Test Kitchen mw-user experiment (poll 106628) - adds: none; removes: email_confirmation_banner_ab_test; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD
[15:17:35] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[15:46:36] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Essential-Work: Perform a one-time clean up of retained data sets in event_sanitize - https://phabricator.wikimedia.org/T417694#11815173 (MNeisler) @xcollazo > searchsatisfaction — ~2.3M files, 6,426 GB total, actively written, data goes back to 2021....
[15:47:30] (CR) Lucas Werkmeister (WMDE): "I confess I’m a bit annoyed at this new “permanent” terminology, considering we’ve spent the past few years [having `isNamed()` drilled in" [analytics/wmde/scripts] - https://gerrit.wikimedia.org/r/1269033 (https://phabricator.wikimedia.org/T422500) (owner: Andrew McAllister (WMDE))
[15:47:34] (CR) Lucas Werkmeister (WMDE): [C:+1] Split user changes by namespace to perm/tmp users [analytics/wmde/scripts] - https://gerrit.wikimedia.org/r/1269033 (https://phabricator.wikimedia.org/T422500) (owner: Andrew McAllister (WMDE))
[17:15:20] Data-Engineering (Q4 FS25/26 April 1st - June 30st), MediaWiki-Core-Revision-backend, MediaWiki-DomainEvents, MW-Interfaces-Team, and 5 others: MediaWiki\Revision\RevisionAccessException: Unable to load fresh row for rev_id: {rev_id} - https://phabricator.wikimedia.org/T400380#11815891 (Ottomata)...
[17:30:46] !log Test Kitchen edge-unique experiments (poll 107024) - adds: this-is-just-a-test; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD
[17:30:48] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[17:33:47] !log Test Kitchen edge-unique experiments (poll 107033) - adds: none; removes: this-is-just-a-test; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD
[17:33:49] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[18:25:01] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Essential-Work: Perform a one-time clean up of retained data sets in event_sanitize - https://phabricator.wikimedia.org/T417694#11816374 (dcausse) >>! In T417694#11815172, @MNeisler wrote: > @xcollazo >> searchsatisfaction — ~2.3M files, 6,426 GB tota...
[19:00:13] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform: HTML Pipeline - Performance improvements - https://phabricator.wikimedia.org/T422928#11816550 (Ottomata)
[20:16:32] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform, Patch-For-Review: HTML Enrichment - Tuning & Backfilling configuration - https://phabricator.wikimedia.org/T421216#11816771 (Ottomata) Alright, I merged the patch for {T421965}, and bumped MWEE to use it. I merged your patch to use...
[20:18:04] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform, Patch-For-Review: HTML Enrichment - Tuning & Backfilling configuration - https://phabricator.wikimedia.org/T421216#11816777 (Ottomata) Hm, async_enabled=False EventProcessFunction metrics look busted though: https://grafana.wikimedi...
[20:49:46] (PS2) Andrew McAllister (WMDE): Split user changes by namespace to perm/tmp users [analytics/wmde/scripts] - https://gerrit.wikimedia.org/r/1269033 (https://phabricator.wikimedia.org/T422500)