[02:26:00] Analytics-Kanban, EventBus, Services, Wikimedia-Stream, User-mobrovac: Public Event Streams - https://phabricator.wikimedia.org/T130651#2665062 (MZMcBride) Skimming this discussion makes me think IRC isn't so bad. The task description currently says "deprecate RCStream python/redis based ser... [14:00:04] PROBLEM - Difference between raw and validated EventLogging overall message rates on graphite1001 is CRITICAL: CRITICAL: 26.67% of data above the critical threshold [30.0] [14:05:27] RECOVERY - Difference between raw and validated EventLogging overall message rates on graphite1001 is OK: OK: Less than 20.00% above the threshold [20.0] [14:39:10] PROBLEM - Difference between raw and validated EventLogging overall message rates on graphite1001 is CRITICAL: CRITICAL: 20.00% of data above the critical threshold [30.0] [15:07:34] RECOVERY - Difference between raw and validated EventLogging overall message rates on graphite1001 is OK: OK: Less than 20.00% above the threshold [20.0] [15:26:47] super weird, it is still flapping near the 20 value [15:30:56] I don't know how to check invalid events, not sure if those are in logs [15:32:26] ah I can see "MediaWiki/1.28.0-wmf.20" (Invalid \escape: line 1 column 1322 (char 1321))" [15:33:17] but not a lot to justify this [15:34:34] can find more with sudo grep -rni "Unable to process" * but not sure if it is the right strategy [15:35:14] anyhow, not super urgent, let me know how to debug these things that it could be a good documentation add-on