[05:11:30] Data-Engineering: Request for Hourly Pageview Data for multiple articles– July 18 to September 8, 2025 - https://phabricator.wikimedia.org/T409676#11357153 (BhumanSharma) Sorry for removing Aklapper. It was an accident
[05:18:52] Data-Engineering, Data-Engineering-Wikistats: Create report for "articles with most contributors" in Wikistats2 - https://phabricator.wikimedia.org/T204965#11357181 (Pppery)
[05:19:14] Data-Engineering, Data-Engineering-Wikistats, Patch-Needs-Improvement: Improve scoping of CSS - https://phabricator.wikimedia.org/T190915#11357183 (Pppery)
[08:29:46] Data-Engineering: Request for Hourly Pageview Data for multiple articles– July 18 to September 8, 2025 - https://phabricator.wikimedia.org/T409676#11357360 (A_smart_kitten) Hello @bhumansharma & welcome to Wikimedia Phabricator! A few notes: - You've claimed this task, which would usually mean that you plan...
[08:51:30] Data-Engineering (Q2 FY25/26 October 1st - December 31th): Migrate Sqoop jobs to Airflow - https://phabricator.wikimedia.org/T409514#11357384 (JAllemandou) > Airflow DAGs that would handle each table-Sqoop job independently I don't think this approach will work out of the box. The python script handles conc...
[08:56:18] (CR) Joal: [V:+2 C:+2] "Merging" [analytics/refinery] - https://gerrit.wikimedia.org/r/1199521 (https://phabricator.wikimedia.org/T309738) (owner: Zabe)
[09:48:40] (PS1) Joal: Make referer classification more robust [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1203389 (https://phabricator.wikimedia.org/T406531)
[09:53:11] Data-Engineering (Q2 FY25/26 October 1st - December 31th): Iceberg Merge strategies with dbt - https://phabricator.wikimedia.org/T409099#11357694 (JMonton-WMF) a:JMonton-WMF
[09:56:26] (CR) JavierMonton: [C:+1] "Looks good to me." [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1203389 (https://phabricator.wikimedia.org/T406531) (owner: Joal)
[10:05:10] Data-Engineering (Q2 FY25/26 October 1st - December 31th): Iceberg Merge strategies with dbt - https://phabricator.wikimedia.org/T409099#11357772 (JMonton-WMF) @JAllemandou suggested that we could try to develop in `dbt` one of the HQL pipelines we have, so we try it with a real case that might need a custom...
[10:50:39] Data-Engineering, CheckUser, Essential-Work, Product Safety and Integrity (PSI (Sprint Nov 10 - Nov 28)), and 2 others: Drop the cupe_private column in the cu_private_event table - https://phabricator.wikimedia.org/T409710 (Dreamy_Jazz) NEW
[10:50:47] Data-Engineering, CheckUser, Essential-Work, Product Safety and Integrity (PSI (Sprint Nov 10 - Nov 28)), and 2 others: Drop the cupe_private column in the cu_private_event table - https://phabricator.wikimedia.org/T409710#11357922 (Dreamy_Jazz)
[10:51:08] Data-Engineering, CheckUser, Product Safety and Integrity, Essential-Work, and 2 others: Drop the cupe_private column in the cu_private_event table - https://phabricator.wikimedia.org/T409710#11357924 (Dreamy_Jazz)
[10:53:12] (CR) Joal: [C:+2] "Merging for later deploy" [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1203389 (https://phabricator.wikimedia.org/T406531) (owner: Joal)
[10:58:16] Data-Engineering (Q2 FY25/26 October 1st - December 31th), Data-Platform, Essential-Work, Movement-Insights (FY25-26 H1), Patch-For-Review: NEWFEATURE REQUEST: Add new referral sources to pageview data - https://phabricator.wikimedia.org/T406531#11357930 (JAllemandou) While backfilling I disc...
[11:09:13] (Merged) jenkins-bot: Make referer classification more robust [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1203389 (https://phabricator.wikimedia.org/T406531) (owner: Joal)
[12:09:22] Data-Engineering, CheckUser-SuggestedInvestigations, ConfirmEdit (CAPTCHA extension), Data-Persistence, and 4 others: Add columns to store associated log ID or revision ID that caused a signal to match - https://phabricator.wikimedia.org/T409093#11358219 (kostajh) >>! In T409093#11354744, @Dreamy...
[12:14:10] Data-Engineering, CheckUser-SuggestedInvestigations, ConfirmEdit (CAPTCHA extension), Data-Persistence, and 4 others: Add columns to store associated log ID or revision ID that caused a signal to match - https://phabricator.wikimedia.org/T409093#11358245 (OKryva-WMF)
[12:22:59] Data-Engineering, CheckUser-SuggestedInvestigations, ConfirmEdit (CAPTCHA extension), Data-Persistence, and 4 others: Add columns to store associated log ID or revision ID that caused a signal to match - https://phabricator.wikimedia.org/T409093#11358285 (Ladsgroup) LGTM
[14:08:00] Data-Engineering, CheckUser-SuggestedInvestigations, Product Safety and Integrity, Schema-change-in-production, WE4.2 Bot detection (WE4.2 hCaptcha editing trial): Add the sis_trigger_id and sis_trigger_type columns to the cusi_signal table on WMF wikis - https://phabricator.wikimedia.org/T409733...
[14:08:06] Data-Engineering, CheckUser-SuggestedInvestigations, Product Safety and Integrity, Schema-change-in-production, WE4.2 Bot detection (WE4.2 hCaptcha editing trial): Add the sis_trigger_id and sis_trigger_type columns to the cusi_signal table on ... - https://phabricator.wikimedia.org/T409733#11358716
[14:08:32] Data-Engineering, CheckUser-SuggestedInvestigations, DBA, Product Safety and Integrity, and 2 others: Add the sis_trigger_id and sis_trigger_type columns to the cusi_signal table on WMF wikis - https://phabricator.wikimedia.org/T409733#11358718 (Dreamy_Jazz)
[14:11:08] Data-Engineering, CheckUser-SuggestedInvestigations, DBA, Product Safety and Integrity, and 2 others: Add the sis_trigger_id and sis_trigger_type columns to the cusi_signal table on WMF wikis - https://phabricator.wikimedia.org/T409733#11358727 (Dreamy_Jazz) Like with {T409539}, due to the very s...
[14:14:24] Data-Engineering, CheckUser-SuggestedInvestigations, Product Safety and Integrity, WE4.2 Bot detection (WE4.2 hCaptcha editing trial): Add the sis_trigger_id and sis_trigger_type columns to the cusi_signal table on WMF wikis - https://phabricator.wikimedia.org/T409733#11358732 (Dreamy_Jazz) I nee...
[14:24:37] Data-Engineering, CheckUser-SuggestedInvestigations, Product Safety and Integrity, Patch-For-Review, WE4.2 Bot detection (WE4.2 hCaptcha editing trial): Add the sis_trigger_id and sis_trigger_type columns to the cusi_signal table on WMF wikis - https://phabricator.wikimedia.org/T409733#11358761 (...
[15:08:16] Data-Engineering (Q2 FY25/26 October 1st - December 31th), Data-Platform, Essential-Work, Movement-Insights (FY25-26 H1), Patch-For-Review: NEWFEATURE REQUEST: Add new referral sources to pageview data - https://phabricator.wikimedia.org/T406531#11359077 (JAllemandou) And the druid datasource...
[15:21:12] Data-Engineering, Data-Engineering-Radar, Observability-Logging, serviceops, and 2 others: Fix Kafka replicas skew - https://phabricator.wikimedia.org/T407185#11359142 (herron) a:herron→None FWIW T326419 has some details about the last rebalance on kafka-logging
[16:09:41] Data-Engineering, CheckUser-SuggestedInvestigations, Product Safety and Integrity, MW-1.46-notes (1.46.0-wmf.2; 2025-11-12), WE4.2 Bot detection (WE4.2 hCaptcha editing trial): Add the sis_trigger_id and sis_trigger_type columns to the cusi_sig... - https://phabricator.wikimedia.org/T409733#11359361
[16:09:59] (PS2) Snwachukwu: Fix Duplicate Pageview metrics records in data quality tables. [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1203129
[16:10:19] Data-Engineering, Data-Engineering-Radar, Observability-Logging, serviceops, and 2 others: Fix Kafka replicas skew - https://phabricator.wikimedia.org/T407185#11359365 (brouberol) a:brouberol
[16:12:08] Data-Engineering, CheckUser-SuggestedInvestigations, DBA, Product Safety and Integrity, and 3 others: Add the sis_trigger_id and sis_trigger_type columns to the cusi_signal table on WMF wikis - https://phabricator.wikimedia.org/T409733#11359375 (Dreamy_Jazz) >>! In T409733#11358761, @Marostegui w...
[16:12:26] Data-Engineering, CheckUser-SuggestedInvestigations, DBA, Product Safety and Integrity, and 3 others: Add the sis_trigger_id and sis_trigger_type columns to the cusi_signal table on WMF wikis - https://phabricator.wikimedia.org/T409733#11359379 (Dreamy_Jazz) a:Marostegui
[16:12:32] Data-Engineering, CheckUser-SuggestedInvestigations, DBA, Product Safety and Integrity, and 3 others: Add the sis_trigger_id and sis_trigger_type columns to the cusi_signal table on WMF wikis - https://phabricator.wikimedia.org/T409733#11359380 (Dreamy_Jazz)
[16:28:02] (CR) Aleksandar Mastilovic: [V:+2 C:+2] "LGTM!" [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1203129 (owner: Snwachukwu)
[16:30:00] (CR) Snwachukwu: [V:+2] Fix Duplicate Pageview metrics records in data quality tables. (1 comment) [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1203129 (owner: Snwachukwu)
[16:30:38] (CR) Snwachukwu: [V:+2] Fix Duplicate Pageview metrics records in data quality tables. (1 comment) [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1203129 (owner: Snwachukwu)
[16:36:33] Data-Engineering, Data-Engineering-Radar, Observability-Logging, serviceops, and 2 others: Fix Kafka replicas skew - https://phabricator.wikimedia.org/T407185#11359541 (brouberol) The leadership skew is expected, as it is a result of //storage// rebalancing. Some brokers are leaders of few large...
[16:36:56] Data-Engineering, Data-Engineering-Radar, Observability-Logging, serviceops, and 2 others: Fix Kafka replicas skew - https://phabricator.wikimedia.org/T407185#11359542 (brouberol) Which I'll run on wednesday, as tomorrow is a bank holiday in France.
[17:06:58] (CR) Joal: [C:+1] "Good catch. Can be merged" [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1203124 (https://phabricator.wikimedia.org/T407649) (owner: Xcollazo)
[17:39:19] Data-Engineering, Data-Engineering-Radar, Discovery-Search, Infrastructure-Foundations, and 2 others: Elasticsearch dependency upgrade in spicerack - https://phabricator.wikimedia.org/T390860#11359914 (MoritzMuehlenhoff) Resolved→Open sre.elasticsearch.ban still uses python-elasticsearch:...
[18:02:10] (PS2) Xcollazo: Fix bug MW Dumper in which vertical bars ( `|` ) were not being honored. [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1203124 (https://phabricator.wikimedia.org/T407649)
[18:04:36] Data-Engineering, Epic, OKR-Work, Patch-For-Review: SDS 1.3.2 [EPIC] Automated alerting for changes in automated traffic behavior - https://phabricator.wikimedia.org/T407235#11360055 (Ahoelzl)
[18:10:37] Data-Engineering, CheckUser-SuggestedInvestigations, ConfirmEdit (CAPTCHA extension), Data-Persistence, and 4 others: Add columns to store associated log ID or revision ID that caused a signal to match - https://phabricator.wikimedia.org/T409093#11360096 (Dreamy_Jazz)
[18:10:48] Data-Engineering, CheckUser-SuggestedInvestigations, ConfirmEdit (CAPTCHA extension), Data-Persistence, and 4 others: Add columns to store associated log ID or revision ID that caused a signal to match - https://phabricator.wikimedia.org/T409093#11360098 (Dreamy_Jazz) Open→Resolved
[19:14:53] Data-Engineering: Request for Hourly Pageview Data for multiple articles– July 18 to September 8, 2025 - https://phabricator.wikimedia.org/T409676#11360414 (Madocmadofmadog) >>! In T409676#11357360, @A_smart_kitten wrote: > Hello @bhumansharma & welcome to Wikimedia Phabricator! A few notes: > - You've clai...
[19:27:48] Data-Engineering (Q2 FY25/26 October 1st - December 31th), DPE-Mediawiki-Content, Patch-For-Review: Compare the exported content between File Export and DumpV1 - https://phabricator.wikimedia.org/T407649#11360455 (xcollazo) Since the fix from https://gerrit.wikimedia.org/r/1203124 can affect all dat...
[19:47:38] Data-Engineering, Research: Request for Hourly Pageview Data for multiple articles– July 18 to September 8, 2025 - https://phabricator.wikimedia.org/T409676#11360544 (A_smart_kitten) a:BhumanSharma→None >>! In T409676#11360414, @Madocmadofmadog wrote: > Yes its the same project and we're collabor...
[19:47:54] Data-Engineering, Research: Request for Hourly Pageview Data for multiple articles– July 18 to September 8, 2025 - https://phabricator.wikimedia.org/T409676#11360550 (A_smart_kitten)
[19:47:56] Analytics-Clusters, LDAP-Access-Requests, Research, Research-collaborations, SRE: Hourly pageview data request — Splitsville (2025) and related indie-film Wikipedia pages - https://phabricator.wikimedia.org/T409639#11360548 (A_smart_kitten) →Duplicate dup:T409676
[19:50:24] (CR) Xcollazo: [C:+2] "Last patch moves all XML producing code to a `StringBuilder` approach." [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1203124 (https://phabricator.wikimedia.org/T407649) (owner: Xcollazo)
[20:03:10] (Merged) jenkins-bot: Fix bug MW Dumper in which vertical bars ( `|` ) were not being honored. [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1203124 (https://phabricator.wikimedia.org/T407649) (owner: Xcollazo)
[20:44:40] Data-Engineering (Q2 FY25/26 October 1st - December 31th): Update thresholds configuration for MediaWiki History Reduced error checks - https://phabricator.wikimedia.org/T409782 (amastilovic) NEW
[22:35:57] Data-Engineering, Research: Request for Hourly Pageview Data for multiple articles– July 18 to September 8, 2025 - https://phabricator.wikimedia.org/T409676#11361273 (Madocmadofmadog)
[22:36:55] Data-Engineering, Research: Request for Hourly Pageview Data for multiple articles– July 18 to September 8, 2025 - https://phabricator.wikimedia.org/T409676#11361277 (Madocmadofmadog) >>! In T409676#11360544, @A_smart_kitten wrote: >>>! In T409676#11360414, @Madocmadofmadog wrote: >> Yes its the same pro...
[22:43:47] Data-Engineering (Q2 FY25/26 October 1st - December 31th), Data-Persistence, Data-Persistence-Design-Review, Growth-Team, and 3 others: Data Persistence Design Review: Improve Tone Suggested Edits newcomer task - https://phabricator.wikimedia.org/T401021#11361284 (Eevans)
[22:45:17] Data-Engineering (Q2 FY25/26 October 1st - December 31th), Data-Persistence, Data-Persistence-Design-Review, Growth-Team, and 3 others: Data Persistence Design Review: Improve Tone Suggested Edits newcomer task - https://phabricator.wikimedia.org/T401021#11361288 (Eevans)
[23:41:52] Data-Engineering, Reader Growth Team, Wikipedia-Android-App-Backlog, Wikipedia-iOS-App-Backlog, and 2 others: Add page_id and namespace to X-Analytics header in Mobile App requests (2025 remake) - https://phabricator.wikimedia.org/T409358#11361517 (Jdlrobson-WMF)
[23:42:01] Data-Engineering, Reader Growth Team, Wikipedia-Android-App-Backlog, Wikipedia-iOS-App-Backlog, and 2 others: Add page_id and namespace to X-Analytics header in Mobile App requests (2025 remake) - https://phabricator.wikimedia.org/T409358#11361518 (Jdlrobson-WMF)