[00:23:25] (03CR) 10Ottomata: [C:03+1] Upgrade graphframes to 0.11.0 from Maven Central, drop Archiva repos [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286481 (https://phabricator.wikimedia.org/T426114) (owner: 10Xcollazo) [00:29:20] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10MediaWiki-Incremental-History, 10Event-Platform, 05MW-1.47-notes (1.47.0-wmf.2; 2026-05-12): Create mediawiki.user_change event stream - https://phabricator.wikimedia.org/T423952#11915434 (10Ottomata) We have Hive `event.mediawiki_user_change_dev0` r... [00:51:49] (03PS4) 10Xcollazo: Upgrade graphframes to 0.11.0 from Maven Central, drop Archiva repos [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286481 (https://phabricator.wikimedia.org/T424350) [00:54:21] (03PS1) 10Xcollazo: Upgrade graphframes to 0.11.0 from Maven Central, drop Archiva repos [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286530 (https://phabricator.wikimedia.org/T426114) [03:24:19] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st): Load Google Search Console data into the Data Lake - https://phabricator.wikimedia.org/T420996#11915600 (10nshahquinn-wmf) Also, the reasoning for doing a full BigQuery export makes total sense to me. But, if it's possible to reshape the table a bit during t... [05:12:42] 06Data-Engineering, 06SRE, 10SRE-Access-Requests, 13Patch-For-Review: Requesting access to analytics_privatedata_users & Kerberos & SQL Lab for catherinekelsey - https://phabricator.wikimedia.org/T425565#11915644 (10Marostegui) 05In progress→03Resolved a:03Marostegui The change has been deployed.... [05:12:53] 06Data-Engineering, 06SRE, 10SRE-Access-Requests, 13Patch-For-Review: Requesting access to analytics_privatedata_users & Kerberos & SQL Lab for catherinekelsey - https://phabricator.wikimedia.org/T425565#11915647 (10Marostegui) [06:16:14] 06Data-Engineering, 06Data-Engineering-Radar, 06DBA, 07Schema-change-in-production: Drop il_to column from imagelinks table in wmf production - https://phabricator.wikimedia.org/T419635#11915704 (10Marostegui) @FCeratto-WMF you probably want to do the codfw masters switches before the eqiad ones [07:46:35] 06Data-Engineering, 06Data-Engineering-Radar, 10Event-Platform, 06Machine-Learning-Team (Q4 FY2025-26): Add Multilingual RevertRisk predictions to mediawiki.page_revert_risk_prediction_change - https://phabricator.wikimedia.org/T415892#11915843 (10gkyziridis) Configuration of 46 wikis + testwiki on changep... [08:48:52] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10DPE-Mediawiki-Content: Missing/inconsistent page_redirect_target field for redirects in Mediawiki content current v1 dumps - https://phabricator.wikimedia.org/T400632#11916144 (10APizzata-WMF) After some checks here are my results. Total number of `con... [10:56:45] !log Test Kitchen edge-unique experiments (poll 233546) - adds: this-is-just-a-test; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD [10:56:46] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [10:58:10] !log Test Kitchen edge-unique experiments (poll 233550) - adds: none; removes: this-is-just-a-test; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD [10:58:12] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [11:08:25] !log Test Kitchen edge-unique experiments (poll 233580) - adds: synth-aa-ncs-1; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD [11:08:26] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [11:09:07] ^ That last one is me [11:09:33] (03CR) 10A-pizzata: change create table for mediawiki_content to become private (031 comment) [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1285337 (https://phabricator.wikimedia.org/T424355) (owner: 10A-pizzata) [11:15:40] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10MediaWiki-Incremental-History: mediawiki_history_incremental_v1: schema specification for stakeholder review - https://phabricator.wikimedia.org/T425573#11916888 (10Tchanders) Thanks for the detailed explanations! My comments concern the metric for time-... [11:41:50] !log Test Kitchen edge-unique experiments (poll 233679) - adds: logged-out-retention-round11; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD [11:41:51] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [11:55:06] something's broken with an-druid1007, the server is unreachable via SSH and a root login over the serial console also stalls [11:56:46] 06Data-Engineering, 06Product Safety and Integrity (Sprint lily-of-the-valley (May 4 - May 22)), 07Wikimedia-production-error: TypeError: MediaWiki\Extension\EventBus\HookHandlers\MediaWiki\UserChangeHooks::calculateUserEffectiveGroups(): Argument #1 ($user) must be ... - https://phabricator.wikimedia.org/T426185 [11:57:02] 06Data-Engineering, 10MediaWiki-User-management, 10Event-Platform, 07Wikimedia-production-error: TypeError: EventBus\HookHandlers\MediaWiki\UserChangeHooks::calculateUserEffectiveGroups(): Argument #1 ($user) must be of type MediaWiki\User\User, MediaWiki\User\User... - https://phabricator.wikimedia.org/T426186 [12:01:17] 06Data-Engineering, 06Product Safety and Integrity (Sprint lily-of-the-valley (May 4 - May 22)), 07Wikimedia-production-error: TypeError: MediaWiki\Extension\EventBus\HookHandlers\MediaWiki\UserChangeHooks::calculateUserEffectiveGroups(): Argument #1 ($user)... - https://phabricator.wikimedia.org/T426185#11917043 [12:03:19] 06Data-Engineering, 10Event-Platform, 13Patch-For-Review, 06Product Safety and Integrity (Sprint lily-of-the-valley (May 4 - May 22)), 07Wikimedia-production-error: TypeError: MediaWiki\Extension\EventBus\HookHandlers\MediaWiki\UserChangeHooks::calculate... - https://phabricator.wikimedia.org/T426185#11917049 [12:03:23] 06Data-Engineering, 10Event-Platform, 13Patch-For-Review, 06Product Safety and Integrity (Sprint lily-of-the-valley (May 4 - May 22)), 07Wikimedia-production-error: TypeError: MediaWiki\Extension\EventBus\HookHandlers\MediaWiki\UserChangeHooks::calculate... - https://phabricator.wikimedia.org/T426185#11917052 [12:07:14] 06Data-Engineering, 06Stewards-and-global-tools, 10Event-Platform, 13Patch-For-Review, and 2 others: TypeError: MediaWiki\Extension\EventBus\HookHandlers\MediaWiki\UserChangeHooks::calculateUserEffectiveGroups(): Argument #1 ($user) must be of type MediaWi... - https://phabricator.wikimedia.org/T426185#11917056 [12:08:43] 06Data-Engineering, 10MediaWiki-User-management, 10Event-Platform, 07Wikimedia-production-error: TypeError: EventBus\HookHandlers\MediaWiki\UserChangeHooks::calculateUserEffectiveGroups(): Argument #1 ($user) must be of type MediaWiki\User\User, MediaWiki... - https://phabricator.wikimedia.org/T426186#11917061 [12:08:55] 06Data-Engineering, 06Stewards-and-global-tools, 10Event-Platform, 13Patch-For-Review, and 2 others: TypeError: MediaWiki\Extension\EventBus\HookHandlers\MediaWiki\UserChangeHooks::calculateUserEffectiveGroups(): Argument #1 ($user) must be of type MediaWi... - https://phabricator.wikimedia.org/T426185#11917063 [12:15:54] something's broken with an-druid1007, the server is unreachable via SSH and a root login over the serial console also stalls [12:37:29] (03PS5) 10Xcollazo: Upgrade graphframes to 0.11.0 from Maven Central, drop Archiva repos [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286481 (https://phabricator.wikimedia.org/T426114) [12:53:13] (03CR) 10Xcollazo: [C:03+2] Upgrade graphframes to 0.11.0 from Maven Central, drop Archiva repos [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286481 (https://phabricator.wikimedia.org/T426114) (owner: 10Xcollazo) [13:07:04] (03Merged) 10jenkins-bot: Upgrade graphframes to 0.11.0 from Maven Central, drop Archiva repos [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286481 (https://phabricator.wikimedia.org/T426114) (owner: 10Xcollazo) [13:19:32] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10DPE-Mediawiki-Content: Missing/inconsistent page_redirect_target field for redirects in Mediawiki content current v1 dumps - https://phabricator.wikimedia.org/T400632#11917311 (10xcollazo) > So we are detecting the redirects, but the output of the enrich... [13:22:37] (03PS4) 10Xcollazo: Add event_user_is_cross_wiki to wmf.mediawiki_history [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286385 (https://phabricator.wikimedia.org/T425735) [13:23:54] (03CR) 10Xcollazo: [C:03+2] "Latest patchset adds tests." [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286385 (https://phabricator.wikimedia.org/T425735) (owner: 10Xcollazo) [13:29:24] 06Data-Engineering, 06Stewards-and-global-tools, 10Event-Platform, 05MW-1.47-notes (1.47.0-wmf.2; 2026-05-12), and 3 others: TypeError: MediaWiki\Extension\EventBus\HookHandlers\MediaWiki\UserChangeHooks::calculateUserEffectiveGroups(): Argument #1 ($user... - https://phabricator.wikimedia.org/T426185#11917402 [13:30:42] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10MediaWiki-Incremental-History, 10Event-Platform, 05MW-1.47-notes (1.47.0-wmf.2; 2026-05-12): Event schemas - mediawiki user entity should be wiki aware - https://phabricator.wikimedia.org/T426198 (10Ottomata) 03NEW [13:30:59] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10MediaWiki-Incremental-History, 10Event-Platform, 05MW-1.47-notes (1.47.0-wmf.2; 2026-05-12): Event schemas - mediawiki user entity should be wiki aware - https://phabricator.wikimedia.org/T426198#11917444 (10Ottomata) [13:31:02] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10MediaWiki-Incremental-History, 10Event-Platform, 05MW-1.47-notes (1.47.0-wmf.3; 2026-05-19), 13Patch-For-Review: EventBus - consider schema versions when serializing entities - https://phabricator.wikimedia.org/T424767#11917445 (10Ottomata) [13:31:12] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10MediaWiki-Incremental-History, 10Event-Platform, 05MW-1.47-notes (1.47.0-wmf.2; 2026-05-12): Event schemas - mediawiki user entity should be wiki aware - https://phabricator.wikimedia.org/T426198#11917448 (10Ottomata) We will need {T424767} to do thi... [13:35:32] (03Merged) 10jenkins-bot: Add event_user_is_cross_wiki to wmf.mediawiki_history [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286385 (https://phabricator.wikimedia.org/T425735) (owner: 10Xcollazo) [14:03:14] (03Abandoned) 10Xcollazo: Upgrade graphframes to 0.11.0 from Maven Central, drop Archiva repos [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286530 (https://phabricator.wikimedia.org/T426114) (owner: 10Xcollazo) [14:04:29] (03CR) 10Xcollazo: [C:03+2] Add event_user_is_cross_wiki to mediawiki_history DDL [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1286383 (https://phabricator.wikimedia.org/T425735) (owner: 10Xcollazo) [14:04:35] (03CR) 10Xcollazo: [V:03+2 C:03+2] Add event_user_is_cross_wiki to mediawiki_history DDL [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1286383 (https://phabricator.wikimedia.org/T425735) (owner: 10Xcollazo) [14:05:03] 06Data-Engineering, 10ChangeProp, 10EventStreams, 06MediaWiki-Engineering, and 15 others: Migrate node-based services in production to node22 - https://phabricator.wikimedia.org/T393434#11917643 (10Jdforrester-WMF) [14:14:16] (03PS5) 10Xcollazo: Add event_log_id to wmf.mediawiki_history [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1285904 (https://phabricator.wikimedia.org/T425986) [14:14:49] (03CR) 10Xcollazo: [C:03+2] Add event_log_id to wmf.mediawiki_history [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1285904 (https://phabricator.wikimedia.org/T425986) (owner: 10Xcollazo) [14:15:40] (03CR) 10Xcollazo: [V:03+2 C:03+2] Add event_log_id to mediawiki_history DDL [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1285903 (https://phabricator.wikimedia.org/T425986) (owner: 10Xcollazo) [14:16:37] 06Data-Engineering, 10MediaWiki-extensions-WikimediaEvents, 06Product Safety and Integrity, 10Event-Platform, and 3 others: ArgumentCountError: Too few arguments to function MediaWiki\Extension\EventBus\Serializers\MediaWiki\UserEntitySerializer::__constr... - https://phabricator.wikimedia.org/T426026#11917706 [14:24:21] 06Data-Engineering, 10MediaWiki-extensions-WikimediaEvents, 06Product Safety and Integrity, 10Event-Platform, and 3 others: ArgumentCountError: Too few arguments to function MediaWiki\Extension\EventBus\Serializers\MediaWiki\UserEntitySerializer::__constr... - https://phabricator.wikimedia.org/T426026#11917788 [14:27:01] (03Merged) 10jenkins-bot: Add event_log_id to wmf.mediawiki_history [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1285904 (https://phabricator.wikimedia.org/T425986) (owner: 10Xcollazo) [14:30:58] 06Data-Engineering, 06Stewards-and-global-tools, 10Event-Platform, 05MW-1.47-notes (1.47.0-wmf.2; 2026-05-12), and 2 others: TypeError: MediaWiki\Extension\EventBus\HookHandlers\MediaWiki\UserChangeHooks::calculateUserEffectiveGroups(): Argument #1 ($user... - https://phabricator.wikimedia.org/T426185#11917853 [14:39:28] 06Data-Engineering, 06Data-Platform-SRE (2026-04-24 - 2026-05-15): implement script to move data from P&T data lake to FR Tech data lake - https://phabricator.wikimedia.org/T425133#11917922 (10BTullis) [14:58:29] 06Data-Engineering, 06Stewards-and-global-tools, 10Event-Platform, 05MW-1.47-notes (1.47.0-wmf.2; 2026-05-12), and 2 others: TypeError: MediaWiki\Extension\EventBus\HookHandlers\MediaWiki\UserChangeHooks::calculateUserEffectiveGroups(): Argument #1 ($user... - https://phabricator.wikimedia.org/T426185#11918025 [15:03:42] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 06Movement-Insights: interwiki imports and its effects on revision data - https://phabricator.wikimedia.org/T425735#11918043 (10Ottomata) BTW, we are dealing with a similar issue in {T426198}. For events, I will be adding a user.wiki_id field. [17:11:36] (03PS1) 10Xcollazo: Remove wmf-analytics-old-uploads Archiva repository [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286989 (https://phabricator.wikimedia.org/T426114) [17:57:44] (03CR) 10Ottomata: [C:03+1] Remove wmf-analytics-old-uploads Archiva repository [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286989 (https://phabricator.wikimedia.org/T426114) (owner: 10Xcollazo) [17:58:29] (03CR) 10Xcollazo: [C:03+2] Remove wmf-analytics-old-uploads Archiva repository [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286989 (https://phabricator.wikimedia.org/T426114) (owner: 10Xcollazo) [18:09:26] (03Merged) 10jenkins-bot: Remove wmf-analytics-old-uploads Archiva repository [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1286989 (https://phabricator.wikimedia.org/T426114) (owner: 10Xcollazo) [18:11:55] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10MediaWiki-Incremental-History: wmf.mediawiki_history contains spurious event_type = 'create-page' for page entity rows - https://phabricator.wikimedia.org/T426242 (10xcollazo) 03NEW [18:40:29] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10MediaWiki-Incremental-History: mediawiki_history_incremental_v1: schema specification for stakeholder review - https://phabricator.wikimedia.org/T425573#11919037 (10Isaac) Ok, I had started some responses to the above but modifying a bit to summarize my... [19:26:11] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10MediaWiki-Incremental-History: Draft: Architectural design agreement: Incremental MediaWiki History - https://phabricator.wikimedia.org/T424359#11919258 (10xcollazo) ## Summary of changes — 2026-05-13 **Session focus:** investigation of `wmf.mediawiki_h... [19:29:52] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 10MediaWiki-Incremental-History: wmf.mediawiki_history contains spurious event_type = 'create-page' for page entity rows - https://phabricator.wikimedia.org/T426242#11919276 (10xcollazo) ## Investigation findings After digging into the pipeline code and q... [19:43:48] 06Data-Engineering (Q4 FS25/26 April 1st - June 30st), 06Movement-Insights: interwiki imports and its effects on revision data - https://phabricator.wikimedia.org/T425735#11919331 (10nshahquinn-wmf) Somewhat related: {T221482}. [20:31:45] 06Data-Engineering, 10Dumps-Generation: imported pages for which there is no local user are no longer dumped under 1.34.0-wmf.1 - https://phabricator.wikimedia.org/T221399#11919614 (10Pppery) [20:31:57] 06Data-Engineering, 10Dumps-Generation, 10MediaWiki-Core-Snapshots: XmlDUmpWriter::writeRevision sometimes broken by duplicate keys in Link Cache - https://phabricator.wikimedia.org/T220424#11919617 (10Pppery) [20:32:01] 06Data-Engineering, 10Dumps-Generation, 10MediaWiki-Core-Snapshots: XmlDumpWriter::openPage handles main namespace articles with prefixes that are namespace names AND are redirects incorrectly - https://phabricator.wikimedia.org/T220316#11919619 (10Pppery) [20:47:22] re my message in this channel yesterday, i've boldly renamed the Phab project to `#DPE-MediaWiki-Incremental-History` (to match the tag `#DPE-Mediawiki-Content`), in order to avoid the potential ambiguity with MW core itself :) feel free to further rename if you'd like [21:15:10] (03PS4) 10Xcollazo: Add MWHistoryDeltaWriter and MWHistorySnapshotMerger to refinery-job-35 [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1284858 (https://phabricator.wikimedia.org/T425729) [21:21:11] 06Data-Engineering, 10EventStreams: Support namespace_prefix and page_title_noprefix - https://phabricator.wikimedia.org/T426268 (10dr0ptp4kt) 03NEW [21:29:00] (03PS5) 10Xcollazo: Add MWHistoryDeltaWriter and MWHistorySnapshotMerger to refinery-job-35 [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1284858 (https://phabricator.wikimedia.org/T425729) [21:31:16] 06Data-Engineering, 10Pageviews-Anomaly: "Nahui Ollin" is enwiki's top-viewed article in April 2026 - https://phabricator.wikimedia.org/T425600#11920029 (10Hghani) I noticed that most of the pageviews to Nahui Ollin are referred to us from `www.vakarta.com` and on that website there are some links on `https://... [22:06:30] 06Data-Engineering, 10EventStreams: Support namespace_prefix and page_title_noprefix - https://phabricator.wikimedia.org/T426268#11920062 (10dr0ptp4kt)