[00:25:22] Data-Engineering (Q3 FY25/26 January 1st - March 31th), Research, Event-Platform, Patch-For-Review: Event stream with latest revision HTML & parent revision HTML diff - https://phabricator.wikimedia.org/T360794#11784786 (Ottomata) @JMonton-WMF something we should keep an eye on: kafka topic size...
[00:32:35] Data-Engineering (Q3 FY25/26 January 1st - March 31th): Add support for variables to DbtSkeinOperator - https://phabricator.wikimedia.org/T421789#11784793 (amastilovic) @Mayakp.wiki yes this is for the backfill functionality, among other stuff!
[01:08:17] Data-Engineering, Data-Engineering-Icebox, DBA, Patch-For-Review: Move Mostcategories computation to Hadoop - https://phabricator.wikimedia.org/T413362#11784863 (Zabe) Taking a look at https://analytics.wikimedia.org/published/datasets/querypage/MostCategories/commonswiki.json and comparing it to...
[01:13:55] Data-Engineering, Data-Engineering-Icebox, DBA, Patch-For-Review: Move Mostcategories computation to Hadoop - https://phabricator.wikimedia.org/T413362#11784867 (Zabe) Ok, the difference is that the MediaWiki implementation filters for pages in `$wgContentNamespaces` and this includes files on co...
[02:29:03] FIRING: [3x] MediawikiContentHistoryReconcileEnrichHighKafkaConsumerLag: ...
[02:29:09] High Kafka consumer lag for mw_content_history_reconcile_enrich in eqiad - TODO - https://grafana.wikimedia.org/d/K9x0c4aVk/flink-app?orgId=1&var-datasource=eqiad%20prometheus/k8s-dse&var-namespace=mw-content-history-reconcile-enrich&var-helm_release=production&var-operator_name=All&var-flink_job_name=mw_content_history_reconcile_enrich - https://alerts.wikimedia.org/?q=alertname%3DMediawikiContentHistoryReconcileEnrichHighKafkaConsumerLag
[05:00:06] FIRING: [3x] MediawikiContentHistoryReconcileEnrichHighKafkaConsumerLag: ...
[05:00:06] High Kafka consumer lag for mw_content_history_reconcile_enrich in eqiad - TODO - https://grafana.wikimedia.org/d/K9x0c4aVk/flink-app?orgId=1&var-datasource=eqiad%20prometheus/k8s-dse&var-namespace=mw-content-history-reconcile-enrich&var-helm_release=production&var-operator_name=All&var-flink_job_name=mw_content_history_reconcile_enrich - https://alerts.wikimedia.org/?q=alertname%3DMediawikiContentHistoryReconcileEnrichHighKafkaConsumerLag
[05:18:51] FIRING: [2x] MediawikiContentHistoryReconcileEnrichHighKafkaConsumerLag: ...
[05:18:51] High Kafka consumer lag for mw_content_history_reconcile_enrich in eqiad - TODO - https://grafana.wikimedia.org/d/K9x0c4aVk/flink-app?orgId=1&var-datasource=eqiad%20prometheus/k8s-dse&var-namespace=mw-content-history-reconcile-enrich&var-helm_release=production&var-operator_name=All&var-flink_job_name=mw_content_history_reconcile_enrich - https://alerts.wikimedia.org/?q=alertname%3DMediawikiContentHistoryReconcileEnrichHighKafkaConsumerLag
[05:23:51] RESOLVED: [2x] MediawikiContentHistoryReconcileEnrichHighKafkaConsumerLag: ...
[05:23:51] High Kafka consumer lag for mw_content_history_reconcile_enrich in eqiad - TODO - https://grafana.wikimedia.org/d/K9x0c4aVk/flink-app?orgId=1&var-datasource=eqiad%20prometheus/k8s-dse&var-namespace=mw-content-history-reconcile-enrich&var-helm_release=production&var-operator_name=All&var-flink_job_name=mw_content_history_reconcile_enrich - https://alerts.wikimedia.org/?q=alertname%3DMediawikiContentHistoryReconcileEnrichHighKafkaConsumerLag
[06:42:05] Data-Engineering (Q3 FY25/26 January 1st - March 31th), Research, Event-Platform, Patch-For-Review: Event stream with latest revision HTML & parent revision HTML diff - https://phabricator.wikimedia.org/T360794#11785032 (brouberol) @Ottomata Assuming you mean 290GB and not 290TB, we should be all...
[11:16:58] Data-Engineering, Data-Platform-SRE (2026-03-27 - 2026-04-17): Requesting Kerberos access for matmarex - https://phabricator.wikimedia.org/T421783#11785408 (brouberol) a:brouberol
[11:17:01] Data-Engineering, Data-Platform-SRE (2026-03-27 - 2026-04-17): Requesting Kerberos access for matmarex - https://phabricator.wikimedia.org/T421783#11785410 (brouberol) Open→In progress
[11:17:40] Data-Engineering, Data-Platform-SRE (2026-03-27 - 2026-04-17): Requesting Kerberos access for matmarex - https://phabricator.wikimedia.org/T421783#11785411 (brouberol) ` brouberol@krb1002:~$ sudo manage_principals.py create matmarex --email=bdziewonski@wikimedia.org Principal already created (or an erro...
[11:18:02] Data-Engineering, Data-Platform-SRE (2026-03-27 - 2026-04-17): Requesting Kerberos access for matmarex - https://phabricator.wikimedia.org/T421783#11785412 (brouberol) It appears as though you already have a kerberos principal created @matmarex
[11:18:14] Data-Engineering, Data-Platform-SRE (2026-03-27 - 2026-04-17): Requesting Kerberos access for matmarex - https://phabricator.wikimedia.org/T421783#11785415 (brouberol) In progress→Resolved
[13:28:33] !log Test Kitchen edge-unique experiments (poll 63615) - adds: logged-out-retention-round6; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD
[13:28:35] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[14:44:53] Data-Engineering (Q3 FY25/26 January 1st - March 31th): Implement list of JA3N-JA4H pairs to be tagged as automated into the bot detection pipeline - https://phabricator.wikimedia.org/T420412#11785675 (mforns) I've finished the tests. Hamid and I have checked that both actor counts and pageview counts for di...
[15:24:14] Data-Engineering, Data-Platform-SRE (2026-03-27 - 2026-04-17): Requesting Kerberos access for matmarex - https://phabricator.wikimedia.org/T421783#11785782 (matmarex) Resolved→Open That's a surprise. In that case, may I ask for its password to be reset? (per
Data-Engineering, Data-Platform-SRE (2026-03-27 - 2026-04-17): Requesting Kerberos access for matmarex - https://phabricator.wikimedia.org/T421783#11785827 (brouberol) Sure thing! ` brouberol@krb1002:~$ sudo manage_principals.py reset-password matmarex --email=bdziewonski@wikimedia.org Password reset su...
[16:42:18] Data-Engineering (Q3 FY25/26 January 1st - March 31th), Growth-Team, Image-Suggestions, Patch-For-Review: Add an Image: filtering by suggestion "kind" or "confidence" - https://phabricator.wikimedia.org/T368987#11786005 (Ahoelzl) @dcausse can you also confirm that the latest April run / data was...
[17:51:20] Data-Engineering (Q3 FY25/26 January 1st - March 31th), Commons, Data-Persistence, Epic, and 2 others: FY2025-26 WE 6.4.1: Move links tables of commons to a dedicated cluster - https://phabricator.wikimedia.org/T398709#11786217 (Ahoelzl)
[17:52:01] Data-Engineering (Q3 FY25/26 January 1st - March 31th): Traffic referrer analysis - https://phabricator.wikimedia.org/T421516#11786223 (Ahoelzl)
[18:31:38] Data-Engineering (Q3 FY25/26 January 1st - March 31th), Reader Experience Team, Test Kitchen, MW-1.46-notes (1.46.0-wmf.22; 2026-03-31): Logged in reader retention logging - https://phabricator.wikimedia.org/T420621#11786329 (tchin) Data is now available in the data lake under `wmf_readership.act...
[18:37:51] Data-Engineering (Q3 FY25/26 January 1st - March 31th), Observability-Metrics: [Data Quality] Sending Apache Spark metrics to PushGateway - https://phabricator.wikimedia.org/T297231#11786340 (Ottomata) Cool! Seems easy enough, except last commit is 7 years ago?
:D
[18:40:16] Data-Engineering (Q3 FY25/26 January 1st - March 31th), Event-Platform, Patch-For-Review: HTML Enrichment - Tuning & Backfilling configuration - https://phabricator.wikimedia.org/T421216#11786345 (Ottomata) Looks like prod died, and is now backfilling! Ah, but if there is no checkpointed offsets, fl...
[18:48:26] Data-Engineering, Data-Platform-SRE (2026-03-27 - 2026-04-17): Requesting Kerberos access for matmarex - https://phabricator.wikimedia.org/T421783#11786351 (matmarex) Open→Resolved Thanks!
[19:23:16] Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review: Implement list of JA3N-JA4H pairs to be tagged as automated into the bot detection pipeline - https://phabricator.wikimedia.org/T420412#11786393 (mforns) I prepared a deployment plan for the automated traffic detection changes: 1....
[20:09:31] Data-Engineering (Q3 FY25/26 January 1st - March 31th), Event-Platform: HTML Enrichment - Tuning & Backfilling configuration - https://phabricator.wikimedia.org/T421216#11786474 (Ottomata) Well! Staging is failing with message too large in kafka sink again: > Caused by: org.apache.kafka.common.errors.R...
[20:13:27] Data-Engineering (Q3 FY25/26 January 1st - March 31th), Event-Platform: HTML Enrichment - Tuning & Backfilling configuration - https://phabricator.wikimedia.org/T421216#11786485 (Ottomata) For now, in staging, I'm going to reduce `enrich.max_content_size` to 15MB, giving us a 5MB margin. ` helmfile app...
[21:33:29] (PS1) Zabe: querypage: mostcategories: Include NS_FILE if running on commons [analytics/refinery] - https://gerrit.wikimedia.org/r/1267966 (https://phabricator.wikimedia.org/T413362)
[22:18:28] Data-Engineering (Q3 FY25/26 January 1st - March 31th), Event-Platform: HTML Enrichment - Tuning & Backfilling configuration - https://phabricator.wikimedia.org/T421216#11786725 (Ottomata) Well, whatever I changed didn't work.
staging still dying due to the same size too large in kafka sink error.
[22:22:43] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Remove the test DBT DAG from test_k8s Airflow - https://phabricator.wikimedia.org/T422080#11786731 (Ahoelzl)
[22:22:50] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Traffic: Investigate raise in Invalid HAProxyKafka messages in esams - https://phabricator.wikimedia.org/T422033#11786732 (Ahoelzl)
[22:22:53] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Commons-Impact-Metrics, Commons-Impact-Metrics-Requests: Update Commons Impact Metrics allow-list March 2026 - https://phabricator.wikimedia.org/T421982#11786734 (Ahoelzl)
[22:22:54] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Traffic: Surge in webrequest validation check - https://phabricator.wikimedia.org/T422030#11786733 (Ahoelzl)
[22:22:56] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Backfill newly productionized edit types dataset - https://phabricator.wikimedia.org/T421919#11786735 (Ahoelzl)
[22:22:57] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Observability-Metrics: [Data Quality] Sending Apache Spark metrics to PushGateway - https://phabricator.wikimedia.org/T297231#11786737 (Ahoelzl)
[22:22:58] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Add support for variables to DbtSkeinOperator - https://phabricator.wikimedia.org/T421789#11786738 (Ahoelzl)
[22:23:01] Data-Engineering (Q4 FS25/26 April 1st - June 30st), AQS2.0: Use transcoding signal to resolve ambiguous extensions - https://phabricator.wikimedia.org/T421743#11786739 (Ahoelzl)
[22:23:05] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Research, Event-Platform, Patch-For-Review: eventutilties-python - support synchronous Flink process function mode - https://phabricator.wikimedia.org/T421965#11786736 (Ahoelzl)
[22:23:11] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Traffic referrer analysis -
https://phabricator.wikimedia.org/T421516#11786740 (Ahoelzl)
[22:23:15] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Move all currently scheduled DBT DAGs to the `dbt_scheduled` Airflow DAGs - https://phabricator.wikimedia.org/T421434#11786741 (Ahoelzl)
[22:23:19] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Fix iceberg table location in hive metastore - https://phabricator.wikimedia.org/T408939#11786744 (Ahoelzl)
[22:23:23] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform: Debug edit type pipeline for production readiness - https://phabricator.wikimedia.org/T421026#11786742 (Ahoelzl)
[22:23:27] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform: Unbalanced partitions in eqiad.mediawiki.content_history_reconcile.v1 topic - https://phabricator.wikimedia.org/T420359#11786746 (Ahoelzl)
[22:23:31] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Reconcile Flink Job: Too many warnings - https://phabricator.wikimedia.org/T420353#11786747 (Ahoelzl)
[22:23:35] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Research-engineering, Research-Freezer, Event-Platform, Patch-For-Review: Update edit-type flink job with new schema - https://phabricator.wikimedia.org/T421005#11786743 (Ahoelzl)
[22:23:42] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Engineering-Wikistats, Wikidata: Wikidata unique devices statistics are obviously wrong - https://phabricator.wikimedia.org/T420210#11786745 (Ahoelzl)
[22:23:45] Data-Engineering (Q4 FS25/26 April 1st - June 30st), PageViewInfo, ServiceOps new, ServiceOps-SharedInfra: Migrate PageViewInfo calls away from rest-gateway - https://phabricator.wikimedia.org/T411771#11786748 (Ahoelzl)
[22:23:52] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Add new API rate limiting fields from webrequest_logs to Turnilo view - https://phabricator.wikimedia.org/T419736#11786749 (Ahoelzl)
[22:23:56] Data-Engineering (Q4
FS25/26 April 1st - June 30st): The revision_seconds_to_identity_revert field in wmf.mediawiki_history has sometimes negative values - https://phabricator.wikimedia.org/T419267#11786751 (Ahoelzl)
[22:24:00] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Create an MVP data product of API requests - https://phabricator.wikimedia.org/T419522#11786750 (Ahoelzl)
[22:24:04] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Platform-SRE (2026-03-27 - 2026-04-17): Task Tries and Logs for Airflow DAGs sometimes unavailable - https://phabricator.wikimedia.org/T419162#11786752 (Ahoelzl)
[22:24:10] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Epic: Attribution Research: Instrument Donation Attempts - https://phabricator.wikimedia.org/T419569#11786753 (Ahoelzl)
[22:24:14] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Create a custom DBT materialization macro - https://phabricator.wikimedia.org/T419310#11786755 (Ahoelzl)
[22:24:18] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Platform-SRE (2026-03-27 - 2026-04-17): Optimize enqueueing of refine_webrequest_hourly pipeline - https://phabricator.wikimedia.org/T419050#11786756 (Ahoelzl)
[22:24:24] Data-Engineering (Q4 FS25/26 April 1st - June 30st), OKR-Work (WE1 FY2025-26): [Spike] Adding access_method metadata to moderator action event streams - https://phabricator.wikimedia.org/T419019#11786757 (Ahoelzl)
[22:24:28] Data-Engineering (Q4 FS25/26 April 1st - June 30st), MW-Interfaces-Team, Event-Platform: EventBus: Invalid mediawiki signature error caused by meta.dt field - https://phabricator.wikimedia.org/T418573#11786758 (Ahoelzl)
[22:24:32] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Research, Event-Platform, Patch-For-Review: Analyze size distribution of wiki page html - https://phabricator.wikimedia.org/T419495#11786754 (Ahoelzl)
[22:24:36] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Create a data product of IP
range to owner/provenance label - https://phabricator.wikimedia.org/T418466#11786760 (Ahoelzl)
[22:24:40] Data-Engineering (Q4 FS25/26 April 1st - June 30st): haproxykafka and varnishkafka sent different uri_paths - https://phabricator.wikimedia.org/T418767#11786761 (Ahoelzl)
[22:24:44] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Refine: persist resolved stream configurations to make reruns deterministic - https://phabricator.wikimedia.org/T418151#11786763 (Ahoelzl)
[22:24:48] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data Pipelines, Data-Platform-SRE (2026-03-27 - 2026-04-17), Essential-Work: Airflow dynamic task mapping logs mix up when, on rerun, an id is mapped to a different map_index_template - https://phabricator.wikimedia.org/T408802#11786762 (Ahoelzl)
[22:24:54] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Epic, MW-1.46-notes (1.46.0-wmf.21; 2026-03-24): Attribution Research: Instrument pageviews - https://phabricator.wikimedia.org/T417050#11786764 (Ahoelzl)
[22:24:59] Data-Engineering (Q4 FS25/26 April 1st - June 30st): [OpsWeek] Testing on airflow-devenvs can generate false alerts such as SLO misses - https://phabricator.wikimedia.org/T416596#11786766 (Ahoelzl)
[22:25:03] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform: Library for unified_diffs - https://phabricator.wikimedia.org/T419969#11786765 (Ahoelzl)
[22:25:07] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Do not set WMF-Last-Access cookie when Sec-Fetch-Dest is not 'document' - https://phabricator.wikimedia.org/T403897#11786770 (Ahoelzl)
[22:25:11] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Airflow main performance instance optimization - https://phabricator.wikimedia.org/T411988#11786768 (Ahoelzl)
[22:25:15] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Essential-Work: Refine generates very large XCOM values - https://phabricator.wikimedia.org/T414953#11786767 (Ahoelzl)
[22:25:19] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Engineering-Wikistats: The stat site show INVALID on Cantonese Wikipedia. - https://phabricator.wikimedia.org/T411938#11786769 (Ahoelzl)
[22:25:23] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Platform-SRE (2026-03-27 - 2026-04-17), Essential-Work: Blunderbuss: Move Hadoop/HDFS XML configuration into Helm deployment chart - https://phabricator.wikimedia.org/T402323#11786771 (Ahoelzl)
[22:25:29] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Update Blunderbuss wikitech documentation - https://phabricator.wikimedia.org/T402290#11786772 (Ahoelzl)
[22:25:33] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Secret management on airflow for the automated transfer of (public) datasets from stats infra --> WME AWS - https://phabricator.wikimedia.org/T415208#11786774 (Ahoelzl)
[22:25:37] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Product Safety and Integrity, Product-Analytics (Kanban): Add mediawiki_product_metrics_incident_reporting_system_interaction to the sanitization allowlist - https://phabricator.wikimedia.org/T384650#11786773 (Ahoelzl)
[22:25:41] Data-Engineering (Q4 FS25/26 April 1st - June 30st): mw_content_history_reconcile_enrich api call returned 503 - https://phabricator.wikimedia.org/T415264#11786775 (Ahoelzl)
[22:25:45] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Java-Scala-Standardization, Essential-Work: Ignore MacOS .DS_Store in parent pom - https://phabricator.wikimedia.org/T407514#11786777 (Ahoelzl)
[22:25:49] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Platform-SRE, SRE: Move Druid realtime configuration out of Refinery into standalone repo on GitLab - https://phabricator.wikimedia.org/T407994#11786776 (Ahoelzl)
[22:25:58] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Wikimedia Enterprise, Wikimedia Enterprise - Content Integrity, Data-Platform-SRE (2026-03-27 -
2026-04-17), Essential-Work: Implement an Airflow operator for moving data from point A to B - https://phabricator.wikimedia.org/T405360#11786778 (...
[22:26:04] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review: Optimize metrics computation for the MW Content Pipeline - https://phabricator.wikimedia.org/T401010#11786779 (Ahoelzl)
[22:26:08] Data-Engineering (Q4 FS25/26 April 1st - June 30st), DPE-Mediawiki-Content: Missing/inconsistent page_redirect_target field for redirects in Mediawiki content current v1 dumps - https://phabricator.wikimedia.org/T400632#11786780 (Ahoelzl)
[22:26:12] Data-Engineering (Q4 FS25/26 April 1st - June 30st): refine_to_hive dag optimizations - https://phabricator.wikimedia.org/T392668#11786782 (Ahoelzl)
[22:26:16] Data-Engineering (Q4 FS25/26 April 1st - June 30st): [Refine Simplification] Remove Schema Merging in Refine Process by Enforcing Backward Compatibility - https://phabricator.wikimedia.org/T381072#11786783 (Ahoelzl)
[22:26:20] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Analyze and optimize Airflow Postgres backend performance - https://phabricator.wikimedia.org/T411990#11786784 (Ahoelzl)
[22:26:25] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Movement-Insights (FY25-26 H2): dbt repository structure (Milestone 3) - https://phabricator.wikimedia.org/T416672#11786787 (Ahoelzl)
[22:26:29] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Engineering-Wikistats, Movement-Insights: NEW FEATURE REQUEST: Temp Accounts on Wikistats - https://phabricator.wikimedia.org/T410796#11786785 (Ahoelzl)
[22:26:33] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review: Support for Java 25 and Flink 2 - https://phabricator.wikimedia.org/T412978#11786786 (Ahoelzl)
[22:26:37] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Migrate Sqoop jobs to Airflow - https://phabricator.wikimedia.org/T409514#11786789 (Ahoelzl)
[22:26:41]
Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform, Patch-For-Review, Spike: Explore how to migrate PyFlink to Java/Scala - https://phabricator.wikimedia.org/T410266#11786788 (Ahoelzl)
[22:26:45] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Platform-SRE (2026-03-27 - 2026-04-17), Essential-Work: Create alert on Airflow scheduler slow down - https://phabricator.wikimedia.org/T411405#11786794 (Ahoelzl)
[22:26:51] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Research: MediaWiki content history dataset issues - https://phabricator.wikimedia.org/T415311#11786790 (Ahoelzl)
[22:26:55] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Commons, Data-Persistence, Epic, and 3 others: FY2025-26 WE 6.4.1: Move links tables of commons to a dedicated cluster - https://phabricator.wikimedia.org/T398709#11786792 (Ahoelzl)
[22:27:03] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Platform-SRE (2026-03-27 - 2026-04-17): Investigate Gobblin failures - https://phabricator.wikimedia.org/T419436#11786798 (Ahoelzl)
[22:27:09] Data-Engineering (Q4 FS25/26 April 1st - June 30st), AQS2.0: Consider updating our heuristics for media type classification in AQS / wikistats - https://phabricator.wikimedia.org/T419882#11786800 (Ahoelzl)
[22:27:13] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Dumps-Generation: when analyzing a Wikifunctions dump, parent_id in page creation revisions is sometimes 0 and sometimes None - https://phabricator.wikimedia.org/T420974#11786796 (Ahoelzl)
[22:27:17] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Platform-SRE (2026-03-27 - 2026-04-17): Analyze SQL queries generating metrics - https://phabricator.wikimedia.org/T420434#11786802 (Ahoelzl)
[22:27:23] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Epic: Streaming job transforms and saves Experiment data - https://phabricator.wikimedia.org/T420428#11786804 (Ahoelzl)
[22:27:27]
Data-Engineering (Q4 FS25/26 April 1st - June 30st): Backfill datasets affected by Nov 2025 automated traffic incident - https://phabricator.wikimedia.org/T421735#11786806 (Ahoelzl)
[22:27:31] Data-Engineering (Q4 FS25/26 April 1st - June 30st): WE1.5 Consult on monthly active moderators data lake pipeline - https://phabricator.wikimedia.org/T419584#11786812 (Ahoelzl)
[22:27:35] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Refactor pingback reports pipelines using dbt - https://phabricator.wikimedia.org/T418190#11786816 (Ahoelzl)
[22:27:39] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review: DagProperties don't automatically update Airflow variables - https://phabricator.wikimedia.org/T348963#11786808 (Ahoelzl)
[22:27:43] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Engineering-Radar, ServiceOps new, ServiceOps-Services-Oids, and 2 others: Make eventstreams-internal available to WMF staff without an ssh tunnel - https://phabricator.wikimedia.org/T348763#11786814 (Ahoelzl)
[22:27:51] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Refactor pingback analytics pipeline - https://phabricator.wikimedia.org/T415283#11786818 (Ahoelzl)
[22:27:55] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Weekly delivery cadence of core contributor metrics - https://phabricator.wikimedia.org/T418032#11786820 (Ahoelzl)
[22:27:59] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Epic: [EPIC] Instrumentation for FY25/26 WE 5.3.4 "Qualified Traffic & Downstream Outcomes" - https://phabricator.wikimedia.org/T417049#11786822 (Ahoelzl)
[22:28:03] Data-Engineering (Q4 FS25/26 April 1st - June 30st), OKR-Work (WE1 FY2025-26): WE1.5.3 Productize Data for Monthly Active Moderator Actions - https://phabricator.wikimedia.org/T410940#11786826 (Ahoelzl)
[22:28:07] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Reader Experience Team, Test Kitchen, MW-1.46-notes (1.46.0-wmf.22;
2026-03-31): Logged in reader retention logging - https://phabricator.wikimedia.org/T420621#11786828 (Ahoelzl)
[22:28:12] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Migrate cleanup jobs for snapshot datasets from systemd timers to Airflow - https://phabricator.wikimedia.org/T411999#11786832 (Ahoelzl)
[22:28:16] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform, Patch-For-Review: Instance-level EventGate configuration to enable/disable functionality - https://phabricator.wikimedia.org/T415549#11786830 (Ahoelzl)
[22:28:20] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Engineering-Roadmap, Epic, OKR-Work: Analyze JA3N data and generate JA3N-UA table - https://phabricator.wikimedia.org/T409577#11786834 (Ahoelzl)
[22:28:24] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform: Audit and fix observability (logging and metrics) for pyflink jobs - https://phabricator.wikimedia.org/T418996#11786836 (Ahoelzl)
[22:28:32] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform: Fix PyFlink log levels - https://phabricator.wikimedia.org/T419997#11786838 (Ahoelzl)
[22:28:40] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform: HTML Enrichment - Tuning & Backfilling configuration - https://phabricator.wikimedia.org/T421216#11786840 (Ahoelzl)
[22:28:50] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Platform-SRE (2026-03-27 - 2026-04-17), Essential-Work: Carry out end-user testing of spark on kubernetes - https://phabricator.wikimedia.org/T412925#11786842 (Ahoelzl)
[22:28:56] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Essential-Work: Revive data engineering alert metrics dashboard - https://phabricator.wikimedia.org/T399518#11786848 (Ahoelzl)
[22:29:00] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Review SLIS image suggestion pipeline - https://phabricator.wikimedia.org/T415195#11786846 (Ahoelzl)
[22:29:08]
Data-Engineering (Q4 FS25/26 April 1st - June 30st): Analyze hourly webrequest traffic loss - July 2025 - https://phabricator.wikimedia.org/T399312#11786850 (Ahoelzl)
[22:29:12] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Implement full parity between HiveSensor and RESTExternalTaskSensor - https://phabricator.wikimedia.org/T384726#11786852 (Ahoelzl)
[22:29:16] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review: NEW BUG REPORT wmf.interlanguage_navigation missing mobile data - https://phabricator.wikimedia.org/T396514#11786844 (Ahoelzl)
[22:29:20] Data-Engineering (Q4 FS25/26 April 1st - June 30st), DPE-Mediawiki-Content: MediaWiki Content History alerts too much for minor reconcile issues - https://phabricator.wikimedia.org/T395139#11786856 (Ahoelzl)
[22:29:24] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review: Monthly reconcile continues to emit a really large amount of events after user_id changes - https://phabricator.wikimedia.org/T419055#11786858 (Ahoelzl)
[22:29:28] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform, Patch-For-Review: [Event Platform] eventutilites-python: improve consistency guarantees of async process functions - https://phabricator.wikimedia.org/T347282#11786860 (Ahoelzl)
[22:29:32] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Platform-SRE, OKR-Work, Patch-For-Review: Provide a Spark-on-k8s access for sql tools (dbt) - https://phabricator.wikimedia.org/T410017#11786862 (Ahoelzl)
[22:29:38] Data-Engineering (Q4 FS25/26 April 1st - June 30st), MW-Interfaces-Team, Event-Platform: mediawiki.page_change.v1 event stream - Investigate mismatched meta.dt and dt (and rev_dt) fields - https://phabricator.wikimedia.org/T409105#11786868 (Ahoelzl)
[22:29:46] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review: Add overwrite to check_bad_parsing - https://phabricator.wikimedia.org/T421677#11786870
(Ahoelzl)
[22:29:50] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Visualizing inconsistencies and reconciles via Superset - https://phabricator.wikimedia.org/T420787#11786876 (Ahoelzl)
[22:29:58] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review: Implement list of JA3N-JA4H pairs to be tagged as automated into the bot detection pipeline - https://phabricator.wikimedia.org/T420412#11786872 (Ahoelzl)
[22:30:02] Data-Engineering (Q4 FS25/26 April 1st - June 30st), AQS2.0: Introduce a new AQS endpoint to expose video plays - https://phabricator.wikimedia.org/T415202#11786880 (Ahoelzl)
[22:30:06] Data-Engineering (Q4 FS25/26 April 1st - June 30st), MediaWiki-Core-Revision-backend, MediaWiki-DomainEvents, MW-Interfaces-Team, and 5 others: MediaWiki\Revision\RevisionAccessException: Unable to load fresh row for rev_id: {rev_id} - https://phabricator.wikimedia.org/T400380#11786884 (Ahoelzl)
[22:30:15] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Wikidata, Wikidata-Query-Service, Patch-For-Review: Add a --output-dir argument to wikibase rdf and json dumps - https://phabricator.wikimedia.org/T401296#11786878 (Ahoelzl)
[22:30:23] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Research, Event-Platform, Patch-For-Review: Event stream with latest revision HTML & parent revision HTML diff - https://phabricator.wikimedia.org/T360794#11786882 (Ahoelzl)
[22:30:35] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Dumps-Generation: Data missing from en.wiktionary.org February 2026 "MediaWiki Content File Exports" compared to "XML Database dump" - https://phabricator.wikimedia.org/T417596#11786888 (Ahoelzl)
[22:30:39] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Content-Transform-Team, MW-Interfaces-Team, Event-Platform: Common event data model for data derived from parsed page revision html (and more!)
- https://phabricator.wikimedia.org/T415158#11786886 (Ahoelzl)
[22:30:43] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Attribution Research First Experiment - https://phabricator.wikimedia.org/T416200#11786894 (Ahoelzl)
[22:30:47] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Growth-Team, Image-Suggestions, Patch-For-Review: Add an Image: filtering by suggestion "kind" or "confidence" - https://phabricator.wikimedia.org/T368987#11786890 (Ahoelzl)
[22:30:51] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Research-engineering, Research-Freezer, Event-Platform, Patch-For-Review: Productionized Edit Types - https://phabricator.wikimedia.org/T351225#11786892 (Ahoelzl)
[22:31:03] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform: eventutilities-python - make Flink Source and Sink parallelism configurable - https://phabricator.wikimedia.org/T421951#11786897 (Ahoelzl)
[22:31:07] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data Pipelines: Airflow skein hook shouldn't fail when not managing to gather yarn logs - https://phabricator.wikimedia.org/T332215#11786899 (Ahoelzl)
[22:31:11] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Build a set of configurable pre-scheduled DBT Airflow DAGs executing dbt-jobs models - https://phabricator.wikimedia.org/T419925#11786901 (Ahoelzl)
[22:31:15] Data-Engineering (Q4 FS25/26 April 1st - June 30st): refine_webrequest_hourly_text.refine_webrequest probably needs more memory, executors - https://phabricator.wikimedia.org/T418552#11786905 (Ahoelzl)
[22:31:23] Data-Engineering (Q4 FS25/26 April 1st - June 30st), DPE-Mediawiki-Content: Use wmf.mediawiki_history as baseline for slo completeness - https://phabricator.wikimedia.org/T416312#11786907 (Ahoelzl)
[22:31:27] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review: Improvements to local dev environment for Airflow -
https://phabricator.wikimedia.org/T420752#11786903 (Ahoelzl)
[22:31:31] Data-Engineering (Q4 FS25/26 April 1st - June 30st), DPE-Mediawiki-Content: On reconcile, consider what happens when a restore and a delete happen on the same revision - https://phabricator.wikimedia.org/T412461#11786909 (Ahoelzl)
[22:31:35] Data-Engineering (Q4 FS25/26 April 1st - June 30st): update image suggestion readme - https://phabricator.wikimedia.org/T421128#11786911 (Ahoelzl)
[22:31:40] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Growth-Team, Image-Suggestions: Section Image Suggestions no longer available? - https://phabricator.wikimedia.org/T420244#11786915 (Ahoelzl)
[22:31:46] Data-Engineering (Q4 FS25/26 April 1st - June 30st), MediaWiki-extensions-CentralAuth, MediaWiki-Platform-Team: CentralAuth's localuser table contains many nulls and duplicate mappings - https://phabricator.wikimedia.org/T411116#11786913 (Ahoelzl)
[22:31:52] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Growth-Team, Image-Suggestions: Fix Image suggestion DagProperty values - https://phabricator.wikimedia.org/T419204#11786917 (Ahoelzl)
[22:31:56] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform: Update HTML pipeline schema - rendering_content_change - https://phabricator.wikimedia.org/T421341#11786921 (Ahoelzl)
[22:32:00] Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform, Patch-For-Review: Fix mediawiki event enrichment to work with newest version of Blubber - https://phabricator.wikimedia.org/T406872#11786919 (Ahoelzl)
[22:32:04] Data-Engineering (Q4 FS25/26 April 1st - June 30st): Add Human-Bot Alert Runbook Link to Alert Email. - https://phabricator.wikimedia.org/T420046#11786927 (Ahoelzl)