[00:35:20] 06Data-Engineering, 06Data-Engineering-Radar, 06Research: Request for Hourly Pageview Data for multiple articles– July 18 to September 8, 2025 - https://phabricator.wikimedia.org/T409676#11863782 (10Madocmadofmadog) Hi Ottomata. Thank you for the support. Is there a pathway in which I could work towards coll... [02:39:06] 06Data-Engineering, 10Commons-Impact-Metrics, 10Commons-Impact-Metrics-Requests: Update Commons Impact Metrics allow-list April 2026 - https://phabricator.wikimedia.org/T424607 (10GFontenelle_WMF) 03NEW [09:25:42] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Event-Platform: HTML enrichment: Fix memory leak - https://phabricator.wikimedia.org/T424624 (10JMonton-WMF) 03NEW [09:29:08] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Event-Platform, 13Patch-For-Review: HTML Enrichment - Alerting - https://phabricator.wikimedia.org/T423996#11864689 (10JMonton-WMF) [09:49:23] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Image-Suggestions, 06Discovery-Search (2026.04.06 - 2026.05.01), 13Patch-For-Review: ALIS data pipeline produced too many suggestions - https://phabricator.wikimedia.org/T423238#11864911 (10APizzata-WMF) a:03APizzata-WMF [10:53:30] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Image-Suggestions, 06Discovery-Search (2026.04.06 - 2026.05.01), 13Patch-For-Review: ALIS data pipeline produced too many suggestions - https://phabricator.wikimedia.org/T423238#11865191 (10APizzata-WMF) tested the results with the changes in the air... [12:27:17] 06Data-Engineering, 06Data-Platform-SRE (2026-04-24 - 2026-05-15): Requesting Kerberos access for Daniel Kinzler - https://phabricator.wikimedia.org/T422947#11865567 (10Gehel) [12:27:30] 06Data-Engineering, 06Data-Platform-SRE (2026-04-24 - 2026-05-15): Requesting Kerberos access for Daniel Kinzler - https://phabricator.wikimedia.org/T422947#11865570 (10Gehel) p:05Triage→03High [12:31:00] 06Data-Engineering, 06Data-Platform-SRE (2026-04-24 - 2026-05-15): Requesting Kerberos access for Daniel Kinzler - https://phabricator.wikimedia.org/T422947#11865602 (10atsuko) 05Open→03In progress a:03atsuko [12:53:21] 06Data-Engineering, 06Data-Platform-SRE (2026-04-24 - 2026-05-15): Requesting Kerberos access for Daniel Kinzler - https://phabricator.wikimedia.org/T422947#11865703 (10atsuko) 05In progress→03Resolved Merged, principal created. ` atsuko@krb1002:~$ sudo manage_principals.py create daniel --email=dkinzl... [13:07:08] 06Data-Engineering, 06Data-Engineering-Radar, 10Event-Platform, 06Machine-Learning-Team (Q4 FY2025-26), 13Patch-For-Review: Add Multilingual RevertRisk predictions to mediawiki.page_revert_risk_prediction_change - https://phabricator.wikimedia.org/T415892#11865808 (10gkyziridis) === Update === After some... [13:21:51] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 07Essential-Work: Blunderbuss: Move Hadoop/HDFS XML configuration into Helm deployment chart - https://phabricator.wikimedia.org/T402323#11865911 (10Gehel) [13:22:25] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 06Research, 10Event-Platform, 13Patch-For-Review: Analyze size distribution of wiki page html - https://phabricator.wikimedia.org/T419495#11865913 (10Miriam) @Ottomata hello! Do you need more research support here or is the analysis from @cscott enough... [13:36:10] 06Data-Engineering, 10Event-Platform, 13Patch-For-Review: [Event Platform] Declare webrequest as an Event Platform stream - https://phabricator.wikimedia.org/T314956#11865995 (10Ottomata) [14:14:17] (03CR) 10Tchanders: [C:03+1] Add event.wiki in EditAttemptStep to allowlist [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1277760 (owner: 10Conniecc1) [14:43:07] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st): Incremental MediaWiki History - https://phabricator.wikimedia.org/T424350#11866257 (10xcollazo) [14:51:28] 06Data-Engineering, 06Data-Platform-SRE (2026-04-24 - 2026-05-15): Requesting Kerberos access for Daniel Kinzler - https://phabricator.wikimedia.org/T422947#11866290 (10daniel) >>! In T422947#11865703, @atsuko wrote: > @daniel Hi, you should receive the email with your temporary kerberos password and the i... [14:52:55] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st): Draft: Architectural design agreement: Incremental MediaWiki History - https://phabricator.wikimedia.org/T424359#11866313 (10xcollazo) >>! In T424359#11856247, @Ahoelzl wrote: > I think I need AI to review this :-) Haha, I know, it is verbose. I do plan to... [14:56:20] 06Data-Engineering, 06Data-Engineering-Radar, 10MediaWiki-extensions-EventLogging, 07Essential-Work, and 2 others: Migrate "WikiLambda API" instrument to use the Test Kitchen SDK - https://phabricator.wikimedia.org/T415254#11866327 (10Sfaci) [15:17:07] 06Data-Engineering, 06Data-Engineering-Radar, 10DPE-Mediawiki-Content, 10GitLab (Upstream pit of despair 🕳️): Gitlab bug makes us have spurious artifact mismatch errors on YARN when running mw_content_merge_events_to_mw_content_history_daily - https://phabricator.wikimedia.org/T391123#11866444 (10xcollazo)... [15:26:46] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Event-Platform, 13Patch-For-Review: Streaming HTML & Edit Types - productionization checklist - https://phabricator.wikimedia.org/T423920#11866518 (10Ottomata) [15:27:03] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Event-Platform, 13Patch-For-Review: Streaming HTML & Edit Types - productionization checklist - https://phabricator.wikimedia.org/T423920#11866519 (10Ottomata) I deployed [EventStreamConfig - Declare .v1 streams for html content and feature counts (127... [16:07:14] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Event-Platform, 13Patch-For-Review: Analyze size distribution of wiki page html - https://phabricator.wikimedia.org/T419495#11866799 (10Ottomata) Hi! No, no more Research support needed. I got some good data from @awight that I haven't had time to pro... [16:30:17] 06Data-Engineering, 06Data-Engineering-Radar, 10DPE-Mediawiki-Content, 10GitLab (Upstream pit of despair 🕳️): Gitlab bug makes us have spurious artifact mismatch errors on YARN when running mw_content_merge_events_to_mw_content_history_daily - https://phabricator.wikimedia.org/T391123#11866954 (10amastilovi... [16:52:00] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10DPE-Mediawiki-Content, 10GitLab (Upstream pit of despair 🕳️): Gitlab bug makes us have spurious artifact mismatch errors on YARN when running mw_content_merge_events_to_mw_content_history_daily - https://phabricator.wikimedia.org/T391123#11867072 (1... [16:52:10] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10DPE-Mediawiki-Content, 10GitLab (Upstream pit of despair 🕳️): Gitlab bug makes us have spurious artifact mismatch errors on YARN when running mw_content_merge_events_to_mw_content_history_daily - https://phabricator.wikimedia.org/T391123#11867074 (1... [17:19:34] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Event-Platform: Emit comprehensive mediawiki user block change information in an event stream - https://phabricator.wikimedia.org/T424685 (10Ottomata) 03NEW [17:20:28] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st): Incremental MediaWiki History - https://phabricator.wikimedia.org/T424350#11867227 (10Ottomata) [17:20:31] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Event-Platform: Weekly core contributor metrics - MediaWiki event data source improvements for incremental MWH - https://phabricator.wikimedia.org/T423935#11867226 (10Ottomata) [17:20:34] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st): Weekly delivery cadence of core contributor metrics - https://phabricator.wikimedia.org/T418032#11867228 (10Ottomata) [17:23:43] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Event-Platform: Incremental MWH - MediaWiki event data source improvements - https://phabricator.wikimedia.org/T423935#11867239 (10Ottomata) [17:24:07] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Event-Platform: Emit comprehensive mediawiki user block change information in an event stream - https://phabricator.wikimedia.org/T424685#11867246 (10Ottomata) I don't think this task blocks any immediate work for {T424350}, as that can probably use the... [18:09:14] 06Data-Engineering, 06Data-Platform-SRE (2026-04-24 - 2026-05-15): Requesting Kerberos access for Daniel Kinzler - https://phabricator.wikimedia.org/T422947#11868056 (10daniel) @atsuko I still can't log in. The only thing I can think of is that I messed up copying the password from my password manager into... [18:22:53] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st): Airflow REST API calls failing with 403s - https://phabricator.wikimedia.org/T424761 (10xcollazo) 03NEW [18:23:05] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st): Airflow REST API calls failing with 403s - https://phabricator.wikimedia.org/T424761#11868203 (10xcollazo) 05Open→03In progress p:05Triage→03High [18:35:01] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10DPE-Mediawiki-Content: Missing/inconsistent page_redirect_target field for redirects in Mediawiki content current v1 dumps - https://phabricator.wikimedia.org/T400632#11868264 (10xcollazo) >>! In T400632#11856295, @Isaac wrote: > Just an FYI that @MGerla... [18:35:40] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 13Patch-For-Review: Airflow REST API calls failing with 403s - https://phabricator.wikimedia.org/T424761#11868266 (10xcollazo) a:03xcollazo [18:51:17] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 13Patch-For-Review: Airflow REST API calls failing with 403s - https://phabricator.wikimedia.org/T424761#11868317 (10xcollazo) >>! In T424761#11868270, @CodeReviewBot wrote: > xcollazo **merged** https://gitlab.wikimedia.org/repos/data-engineering/airflow-... [18:57:29] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Event-Platform: EventBus - consider schema versions when serializing entities - https://phabricator.wikimedia.org/T424767 (10Ottomata) 03NEW [19:32:28] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st): Draft: Architectural design agreement: Incremental MediaWiki History - https://phabricator.wikimedia.org/T424359#11868417 (10xcollazo) [19:40:43] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st): Draft: Architectural design agreement: Incremental MediaWiki History - https://phabricator.wikimedia.org/T424359#11868443 (10xcollazo) >>! In T424359#11866313, @xcollazo wrote: >>>! In T424359#11856247, @Ahoelzl wrote: >> I think I need AI to review this :-)... [19:51:56] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st): Airflow REST API calls failing with 403s - https://phabricator.wikimedia.org/T424761#11868482 (10xcollazo) >>! In T424761#11868317, @xcollazo wrote: >>>! In T424761#11868270, @CodeReviewBot wrote: >> xcollazo **merged** https://gitlab.wikimedia.org/repos/dat... [20:07:27] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10Event-Platform: Edit type enrichment: Alerting - https://phabricator.wikimedia.org/T424224#11868513 (10AKhatun_WMF) [20:41:49] (03CR) 10Xcollazo: [V:03+1 C:03+1] Remove DesktopWebUIActionsTracking, MobileWebUIActionsTracking, ReadingDepth from sanitization allowlist [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1275982 (https://phabricator.wikimedia.org/T417694) (owner: 10Xcollazo) [20:42:52] (03CR) 10Xcollazo: [V:03+1 C:03+1] "As per T417694#11852760 and T417694#11853072 we are good to remove these from sanitization allowlist." [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1275982 (https://phabricator.wikimedia.org/T417694) (owner: 10Xcollazo) [20:44:00] !log Test Kitchen edge-unique experiments (poll 171557) - adds: logged-out-retention-round9; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD [20:44:02] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [21:29:06] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 07Essential-Work, 13Patch-For-Review: Perform a one-time clean up of retained data sets in event_sanitize - https://phabricator.wikimedia.org/T417694#11868769 (10xcollazo) >>! In T417694#11846081, @xcollazo wrote: > Moving on to next set: > > | Schema |... [21:29:41] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 07Essential-Work, 13Patch-For-Review: Perform a one-time clean up of retained data sets in event_sanitize - https://phabricator.wikimedia.org/T417694#11868772 (10xcollazo) [21:36:33] (03CR) 10Mforns: [C:03+2] Remove DesktopWebUIActionsTracking, MobileWebUIActionsTracking, ReadingDepth from sanitization allowlist [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1275982 (https://phabricator.wikimedia.org/T417694) (owner: 10Xcollazo) [22:58:49] (03CR) 10Xcollazo: [V:03+2 C:03+2] Remove DesktopWebUIActionsTracking, MobileWebUIActionsTracking, ReadingDepth from sanitization allowlist [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1275982 (https://phabricator.wikimedia.org/T417694) (owner: 10Xcollazo) [23:03:14] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 07Essential-Work, 13Patch-For-Review: Perform a one-time clean up of retained data sets in event_sanitize - https://phabricator.wikimedia.org/T417694#11869069 (10xcollazo) [23:39:34] 06Data-Engineering, 06Data-Engineering-Icebox, 06cloud-services-team, 10Data-Services, 07Epic: Plan a replacement for wiki replicas that is better suited to typical OLAP use cases than the MediaWiki OLTP schema - https://phabricator.wikimedia.org/T215858#11869177 (10ttaylor) One day.....