[00:27:27] Traffic, Operations, Patch-For-Review, Prometheus-metrics-monitoring: Port gdnsd statistics from ganglia to prometheus - https://phabricator.wikimedia.org/T147426#2692643 (Dzahn) removed the ganglia stats for this today
[03:48:46] Traffic, Operations, Patch-For-Review: Content purges are unreliable - https://phabricator.wikimedia.org/T133821#3681880 (Tbayer) For the record: One manifestation of this issue (video and audio files remaining available on upload.wikimedia.org for many hours after being deleted by an admin, T69559)...
[07:09:36] Traffic, DC-Ops, Operations, ops-esams: Multiple systems in esams OE10 showing PSU failures - https://phabricator.wikimedia.org/T177228#3682015 (ema) p:Triage>Normal
[08:10:08] Traffic, Operations, Wikimedia-Logstash, Services (watching): RESTBase logs disappeared from logstash - https://phabricator.wikimedia.org/T178078#3682104 (fgiunchedi)
[09:57:49] Traffic, Operations, Pybal: RunCommandMonitoringProtocol throws an exception if runcommand.arguments is not specified - https://phabricator.wikimedia.org/T178149#3682399 (ema)
[09:58:05] Traffic, Operations, Pybal: RunCommandMonitoringProtocol throws an exception if runcommand.arguments is not specified - https://phabricator.wikimedia.org/T178149#3682411 (ema) p:Triage>Normal
[10:15:45] Traffic, Operations, Pybal: Add UDP monitor for pybal - https://phabricator.wikimedia.org/T178151#3682485 (ema) p:Triage>Normal
[14:19:55] chasemp: hey, I was looking into T133791 and I've tried to reproduce it on labs-ns0.wikimedia.org
[14:19:55] T133791: check_dns needs to be rewritten - https://phabricator.wikimedia.org/T133791
[14:20:06] unless I've misunderstood the issue it seems unreproducible now
[14:20:25] $ /usr/lib/nagios/plugins/check_dns -H foo.eqiad.wmflabs -s labs-ns0.wikimedia.org -v
[14:20:28] /usr/bin/nslookup -sil foo.eqiad.wmflabs labs-ns0.wikimedia.org
[14:20:31] Server: labs-ns0.wikimedia.org
[14:20:31] $ echo $?
[14:20:34] Address: 208.80.155.117#53
[14:20:36] ** server can't find foo.eqiad.wmflabs: NXDOMAIN
[14:20:39] Domain foo.eqiad.wmflabs was not found by the server
[14:20:40] pretty awesome of me not to note the version for the check_dns in use there
[14:20:41] 2
[14:21:23] :)
[14:23:02] ema: nothing mysterious in that report I think, and being from well over a year ago I think icinga has been rebuilt since then even?
[14:23:23] seems closeable as legacy to me now
[14:24:51] chasemp: ok, thanks! I'll close it then
[14:26:38] Traffic, Cloud-Services, Operations: check_dns needs to be rewritten - https://phabricator.wikimedia.org/T133791#3683244 (ema) Open>Resolved a:ema check_dns v1.5 (nagios-plugins 1.5) seems to be doing the right thing currently: ``` 14:25:09 ema@labservices1001.wikimedia.org:~ $ /usr/lib/...
[15:06:53] Traffic, Operations: Renew unified certificates 2017 - https://phabricator.wikimedia.org/T178173#3683312 (BBlack)
[19:14:41] netops, Operations, fundraising-tech-ops: bonded/redundant network connections for fundraising hosts - https://phabricator.wikimedia.org/T171962#3684051 (Jgreen) p:Triage>Normal
[19:15:01] netops, Operations, fundraising-tech-ops, ops-codfw: connect second ethernet interface for fundraising codfw hosts - https://phabricator.wikimedia.org/T176175#3684053 (Jgreen) p:Triage>Normal
[20:36:01] netops, Operations, Patch-For-Review: Merge AS14907 with AS43821 - https://phabricator.wikimedia.org/T167840#3346480 (Krinkle) It's not often that one of our primary cache PoPs ends up depooled for multiple hours. While obviously unintended, this was an interesting opportunity to measure the differen...
[20:36:15] netops, Operations, Performance-Team, Patch-For-Review, Performance-Team-notice: Merge AS14907 with AS43821 - https://phabricator.wikimedia.org/T167840#3684258 (Krinkle)
[20:36:27] netops, Operations, Patch-For-Review, Performance-Team (Radar), Performance-Team-notice: Merge AS14907 with AS43821 - https://phabricator.wikimedia.org/T167840#3346480 (Krinkle)
[22:55:44] Traffic, Discovery, Operations, WMDE-Analytics-Engineering, and 3 others: Allow access to wdqs.svc.eqiad.wmnet on port 8888 - https://phabricator.wikimedia.org/T176875#3684693 (Dzahn)
[23:03:36] Traffic, Discovery, Operations, WMDE-Analytics-Engineering, and 3 others: Allow access to wdqs.svc.eqiad.wmnet on port 8888 - https://phabricator.wikimedia.org/T176875#3684701 (Smalyshev) I wonder if it may be more beneficial to use codfw ones for longer tasks, since they are getting less routine...