[05:36:18] 06Data-Engineering: refine_webrequest_hourly_text.refine_webrequest probably needs more memory - https://phabricator.wikimedia.org/T418552 (10dr0ptp4kt) 03NEW [05:52:06] 06Data-Engineering, 06Data-Engineering-Radar, 06DBA, 07Schema-change-in-production: Update imagelinks primary key on wmf production - https://phabricator.wikimedia.org/T415786#11657175 (10Marostegui) [05:58:22] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop cuc_agent & cuc_ip from cu_changes, cule_agent & cule_ip from cu_log_event, and cupe_agent & cupe_ip from cu_private_event on WMF wikis - https://phabricator.wikimedia.org/T418465#11657188 (10Marostegui) [06:17:53] 06Data-Engineering, 06Data-Engineering-Radar, 06DBA, 07Schema-change-in-production: Update imagelinks primary key on wmf production - https://phabricator.wikimedia.org/T415786#11657255 (10Marostegui) [07:04:26] 06Data-Engineering, 06SRE, 10SRE-Access-Requests: Requesting access to analytics-private-users for maxbinderWMF - https://phabricator.wikimedia.org/T417655#11657276 (10MoritzMuehlenhoff) You're using the wrong account: You shell access is for mbinder, but you requested access for the "wmf" group with "ma... [08:08:57] 06Data-Engineering, 06SRE, 10SRE-Access-Requests: Requesting access to analytics-private-users for maxbinderWMF - https://phabricator.wikimedia.org/T417655#11657347 (10MoritzMuehlenhoff) 05Resolved→03Open a:05MatthewVernon→03None @MBinder_WMF I did a little digging in account history: Your original m... [08:11:56] 06Data-Engineering, 06SRE, 10SRE-Access-Requests: Requesting access to analytics-private-users for maxbinderWMF - https://phabricator.wikimedia.org/T417655#11657352 (10MoritzMuehlenhoff) a:03MoritzMuehlenhoff [09:21:10] 06Data-Engineering, 06Data-Engineering-Radar, 06SRE, 10SRE-Access-Requests: Requesting access to Superset for mikez - https://phabricator.wikimedia.org/T418098#11657417 (10mikez-WMF) Thank you very much! [09:28:09] 06Data-Engineering: Create a data product of IP range to owner/provenance label - https://phabricator.wikimedia.org/T418466#11657428 (10JAllemandou) More on the above: * Autonomous system data: the MaxMind DB used by HAProxy is the same the one we use on the cluster, so it's fair to assume the data should be the... [10:45:44] 06Data-Engineering: Create a data product of IP range to owner/provenance label - https://phabricator.wikimedia.org/T418466#11657752 (10KCVelaga_WMF) Thanks @GGoncalves-WMF for creating this. @JAllemandou Ideally, it would be better to use the existing data sources, but I fear there are some challenges with bot... [10:45:50] 06Data-Engineering, 10Event-Platform: EventBus: Invalid mediawiki signature error caused by meta.dt field - https://phabricator.wikimedia.org/T418573 (10EloiFerrer) 03NEW [11:03:45] 06Data-Engineering: refine_webrequest_hourly_text.refine_webrequest probably needs more memory - https://phabricator.wikimedia.org/T418552#11657838 (10JAllemandou) From [[ https://wikimedia.slack.com/archives/C02291Z9YQY/p1772167933462929 | this slack thread ]]: The memory bump is needed when there is a signifi... [11:31:17] 06Data-Engineering, 06Data-Engineering-Radar, 06Data-Platform-SRE, 07Essential-Work: Do performance testing of a big Hadoop Table hosted by Ceph - https://phabricator.wikimedia.org/T381416#11657964 (10BTullis) [11:32:20] 06Data-Engineering, 06Data-Engineering-Radar, 06Data-Platform-SRE (2026-02-13 - 2026-03-06), 07Essential-Work: Do performance testing of a big Hadoop Table hosted by Ceph - https://phabricator.wikimedia.org/T381416#11657965 (10BTullis) Brining into the current milestone, since we plan to move forward with... [12:09:06] 06Data-Engineering, 06Data-Engineering-Radar, 06Data-Platform-SRE (2026-02-13 - 2026-03-06), 07Essential-Work: Provide an access to MaxMind GeoIP in DSE K8S pods - https://phabricator.wikimedia.org/T405509#11658056 (10BTullis) a:03BTullis [12:44:57] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop cuc_agent & cuc_ip from cu_changes, cule_agent & cule_ip from cu_log_event, and cupe_agent & cupe_ip from cu_private_event on WMF wikis - https://phabricator.wikimedia.org/T418465#11658123 (10Marostegui) [12:48:07] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop cuc_agent & cuc_ip from cu_changes, cule_agent & cule_ip from cu_log_event, and cupe_agent & cupe_ip from cu_private_event on WMF wikis - https://phabricator.wikimedia.org/T418465#11658127 (10Marostegui) [13:45:25] 06Data-Engineering: Create a data product of IP range to owner/provenance label - https://phabricator.wikimedia.org/T418466#11658314 (10GGoncalves-WMF) Thank you so much for the detailed requirements, KC! Having discussed this with @JAllemandou some more, I think it makes sense to skip the MaxMind and Spur data... [13:54:03] 06Data-Engineering: Create a data product of IP range to owner/provenance label - https://phabricator.wikimedia.org/T418466#11658343 (10GGoncalves-WMF) [14:42:20] 06Data-Engineering: Create a data product of IP range to owner/provenance label - https://phabricator.wikimedia.org/T418466#11658577 (10KCVelaga_WMF) @GGoncalves-WMF the updated task description captures all the details well, thank you! I will share the list on Monday, and yes, a UDF would be great- much better... [15:07:53] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop cuc_agent & cuc_ip from cu_changes, cule_agent & cule_ip from cu_log_event, and cupe_agent & cupe_ip from cu_private_event on WMF wikis - https://phabricator.wikimedia.org/T418465#11658656 (10Marostegui) [17:49:34] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th), 06MW-Interfaces-Team, 10Event-Platform: mediawiki.page_change.v1 event stream - Investigate mistmatched meta.dt and dt (and rev_dt) fields - https://phabricator.wikimedia.org/T409105#11659205 (10Ahoelzl) @Ottomata can you help unblock this? [18:21:11] 06Data-Engineering, 10Dumps-Generation: Some wikimedia 20260101 dump files missing - https://phabricator.wikimedia.org/T413767#11659278 (10xcollazo) I synced up with @brouberol via Slack. We speculate that the behavior is coming from interactions between the Airflow DAGs that we use to run the dump tasks ([[... [18:40:44] (03CR) 10Michael Große: [C:03+1] Add BSD 3-Clause License to the repo backdated to first commit year [analytics/wmde/scripts] - 10https://gerrit.wikimedia.org/r/1243749 (owner: 10Andrew McAllister (WMDE)) [18:50:58] (03CR) 10Addshore: [C:03+1] Add BSD 3-Clause License to the repo backdated to first commit year [analytics/wmde/scripts] - 10https://gerrit.wikimedia.org/r/1243749 (owner: 10Andrew McAllister (WMDE)) [18:53:53] !log Test Kitchen edge-unique experiments (poll 189382) - adds: none; removes: none; fields: mobile-toc-abc - xLab/MPIC/TK tips at https://w.wiki/FwuD [18:53:55] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [19:50:21] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th), 10MediaWiki-Core-Revision-backend, 10MediaWiki-DomainEvents, 06MW-Interfaces-Team, and 4 others: MediaWiki\Revision\RevisionAccessException: Unable to load fresh row for rev_id: {rev_id} - https://phabricator.wikimedia.org/T400380#11659514 (10Ottomat... [19:54:18] 06Data-Engineering, 06MW-Interfaces-Team, 10Event-Platform: EventBus: Invalid mediawiki signature error caused by meta.dt field - https://phabricator.wikimedia.org/T418573#11659525 (10Ottomata) [19:57:22] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th), 13Patch-For-Review: Adapt Sqoop for imagelinks schema changes - https://phabricator.wikimedia.org/T416481#11659532 (10Zabe) >>! In T416481#11654804, @Snwachukwu wrote: > [...] > > How should we proceed given that the `il_to` column is expected to be de... [19:57:52] 06Data-Engineering, 06MW-Interfaces-Team, 10Event-Platform: EventBus: Invalid mediawiki signature error caused by meta.dt field - https://phabricator.wikimedia.org/T418573#11659534 (10Ottomata) Interesting! Adding #MW-Interfaces-Team, they manage JobQueue, and might be able say more about why this isn't a p... [20:00:35] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th), 06MW-Interfaces-Team, 10Event-Platform: mediawiki.page_change.v1 event stream - Investigate mistmatched meta.dt and dt (and rev_dt) fields - https://phabricator.wikimedia.org/T409105#11659540 (10Ottomata) I will put it in my queue! :)