[08:31:07] Data-Engineering (Q2 2024 October 1st - December 31th), Data Products, DBA, Schema-change-in-production: Cleanup revision table schema - https://phabricator.wikimedia.org/T367856#10396027 (Marostegui) Schema change finished on both clouddb hosts and they are now catching up. They went from 8 days...
[09:18:33] Data-Engineering (Q2 2024 October 1st - December 31th), Dumps 2.0 (Kanban Board): Identify indicators to inform an SLO for event emission and intake - https://phabricator.wikimedia.org/T345195#10396124 (gmodena) The doc has been published on wikitech: https://wikitech.wikimedia.org/wiki/SLO/Event_Platform
[09:32:57] (PS16) Gehel: Extraction of RefineHelper [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1080706
[09:33:29] (PS17) Gehel: Extraction of RefineHelper [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1080706
[09:48:08] (PS2) Peter Fischer: Rewrite MediawikiDumper partitioning implementation [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1101892
[09:56:47] (CR) CI reject: [V:-1] Rewrite MediawikiDumper partitioning implementation [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1101892 (owner: Peter Fischer)
[10:01:21] (PS18) Gehel: Extraction of RefineHelper [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1080706
[10:17:07] (PS19) Gehel: Extraction of RawRefineDataReader [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1080706
[10:23:34] Data-Engineering, Research, Data-Platform-SRE (2024.11.30 - 2024.12.20), Discovery-Search (Current work): Low available space on Hadoop / HDFS - https://phabricator.wikimedia.org/T381707#10396395 (BTullis)
[11:12:24] (PS3) Peter Fischer: Rewrite MediawikiDumper partitioning implementation [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1101892
[12:07:58] Data-Engineering, Data Products, Documentation, Event-Platform: Render human-readable schemas on schema.wikimedia.org - https://phabricator.wikimedia.org/T376841#10396759 (Milimetric) p:Medium→Low
[12:57:01] Data-Engineering, CampaignEvents, Data Products, DBA, and 3 others: Add "event_is_test_event" field to "campaign_events" table - https://phabricator.wikimedia.org/T381759#10396956 (MHorsey-WMF)
[13:02:37] Data-Engineering, CampaignEvents, Data Products, DBA, and 3 others: Add "event_is_test_event" field to "campaign_events" table - https://phabricator.wikimedia.org/T381759#10396985 (MHorsey-WMF) **Context** This functionality will allow event organizers to specify whether an event is a test event...
[14:10:47] Heads-up, there will be some brief periods (~minutes) of downtime for Archiva today, as we add another disk to the host. If you experience any issues with a deploy that you feel may be related to Archiva, please wait a few minutes and try again. If it still doesn't work, ping me.
[14:12:00] For: T381961
[14:12:08] T381961: Increase the capacity of /var/lib/archiva on archiva1002.wikimedia.org - https://phabricator.wikimedia.org/T381961
[14:44:38] (CR) Xcollazo: "(Adding Dan to reviewers.)" [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1101892 (owner: Peter Fischer)
[15:02:22] Data-Engineering: Create Kerberos identity for Jimmy Ly - https://phabricator.wikimedia.org/T381986 (Jly) NEW
[15:15:07] Data-Engineering, CampaignEvents, Data Products, DBA, and 3 others: Add "event_is_test_event" field to "campaign_events" table - https://phabricator.wikimedia.org/T381759#10397443 (Marostegui) @MHorsey-WMF please follow the procedure for the schema change creation described at https://wikitech.wi...
[15:15:22] Data-Engineering, CampaignEvents, Data Products, Data-Persistence, and 3 others: Add "event_is_test_event" field to "campaign_events" table - https://phabricator.wikimedia.org/T381759#10397444 (Marostegui)
[15:19:33] Data-Engineering, CampaignEvents, Data Products, Data-Persistence, and 3 others: Add "event_is_test_event" field to "campaign_events" table - https://phabricator.wikimedia.org/T381759#10397461 (MHorsey-WMF) Apologies @Marostegui, I thought I was by adding the "schema_change" tag, but then somebod...
[15:20:31] Data-Engineering, CampaignEvents, Data Products, Data-Persistence, and 3 others: Add "event_is_test_event" field to "campaign_events" table - https://phabricator.wikimedia.org/T381759#10397465 (Marostegui) >>! In T381759#10397461, @MHorsey-WMF wrote: > Apologies @Marostegui, I thought I was by ad...
[15:23:04] (PS4) Peter Fischer: Rewrite MediawikiDumper partitioning implementation [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1101892
[15:25:18] Data-Engineering (Q2 2024 October 1st - December 31th), Data Products, DBA, Schema-change-in-production: Cleanup revision table schema - https://phabricator.wikimedia.org/T367856#10397495 (Marostegui) Clouddb* hosts are back in sync with the master.
[15:27:33] Data-Engineering (Q2 2024 October 1st - December 31th), Data Products, DBA, Schema-change-in-production: Cleanup revision table schema - https://phabricator.wikimedia.org/T367856#10397517 (Marostegui)
[15:38:26] Data-Engineering (Q2 2024 October 1st - December 31th), Data Products, Data-Platform, Movement-Insights, Patch-For-Review: Backfill and recalculate unique devices data from July 2024 to present - https://phabricator.wikimedia.org/T378852#10397546 (JAllemandou) >>! In T378852#10395498, @Ahoelz...
[15:58:56] (CR) Milimetric: Modify MediaWiki History queries to support Temp Accounts (1 comment) [analytics/refinery] - https://gerrit.wikimedia.org/r/1088342 (https://phabricator.wikimedia.org/T379230) (owner: Mforns)
[16:08:46] Data-Engineering, CampaignEvents, Data Products, Data-Persistence, and 3 others: Add "event_is_test_event" field to "campaign_events" table - https://phabricator.wikimedia.org/T381759#10397637 (Daimona) Hi @Marostegui! To clarify, this is not a request to //perform// the schema change just yet. W...
[16:11:43] Data-Engineering, CampaignEvents, Data Products, Data-Persistence, and 3 others: Add "event_is_test_event" field to "campaign_events" table - https://phabricator.wikimedia.org/T381759#10397645 (Marostegui) @Daimona ah! As the syntax was included there I thought it was. I think it does make sense
[18:02:16] Data-Engineering (Q2 2024 October 1st - December 31th), Data Products, Data-Platform, Movement-Insights, Patch-For-Review: Backfill and recalculate unique devices data from July 2024 to present - https://phabricator.wikimedia.org/T378852#10398385 (JAllemandou) Writing updates here as well as...
[18:44:38] Data-Engineering (Q2 2024 October 1st - December 31th), Product-Analytics, Patch-For-Review: [SPIKE] Experiment with approaches for a incremental updates of MediaWiki data in the Data Lake - https://phabricator.wikimedia.org/T370354#10398574 (Ottomata)
[18:44:39] Data-Engineering (Q2 2024 October 1st - December 31th): [SPIKE] Learn and document how to use Flink-CDC from MediaWiki MariaDB locally - https://phabricator.wikimedia.org/T373144#10398578 (Ottomata) https://www.decodable.co/blog/exploring-flink-cdc :)
[19:00:14] Data-Engineering, Data Products, Documentation, Event-Platform: Render human-readable schemas on schema.wikimedia.org - https://phabricator.wikimedia.org/T376841#10398627 (Ottomata) FWIW, I just toyed with https://github.com/coveooss/json-schema-for-humans and https://github.com/tomcollins/json-s...
[19:01:51] Data-Engineering, Data Products, Documentation, Event-Platform: Render human-readable schemas on schema.wikimedia.org - https://phabricator.wikimedia.org/T376841#10398631 (Ottomata) > FWIW, I think this would not be too hard to do with jsonschema-tools. You'd have to add support for a new html co...
[19:52:02] Data-Engineering (Q2 2024 October 1st - December 31th), Product-Analytics, Patch-For-Review: [SPIKE] Experiment with approaches for a incremental updates of MediaWiki data in the Data Lake - https://phabricator.wikimedia.org/T370354#10398795 (Ottomata) FTR, I ran ` GRANT SELECT, SHOW DATABASES, REP...
[20:15:37] (PS1) Gehel: refactoring(RefineHelper): extract SparkSchemaLoader from RefineHelper [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1102386
[20:20:40] (CR) CI reject: [V:-1] refactoring(RefineHelper): extract SparkSchemaLoader from RefineHelper [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1102386 (owner: Gehel)
[20:48:02] Data-Engineering (Q2 2024 October 1st - December 31th): [SPIKE] Learn and document how to use Flink-CDC from MediaWiki MariaDB locally - https://phabricator.wikimedia.org/T373144#10399028 (NoZeroDay) This is looking nice!
[21:10:00] (CR) Xcollazo: "Left some comments below." [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1101892 (owner: Peter Fischer)
[21:35:09] Data-Engineering, MediaWiki-General, Event-Platform, MediaWiki-Platform-Team (Radar), Patch-For-Review: Create legacy EventLogging proxy HTTP intake (for MediaWikiPingback) endpoint to EventGate - https://phabricator.wikimedia.org/T353817#10399153 (Ottomata) > mediawiki-config - mediawiki.org...