[09:03:24] netops, DC-Ops, Infrastructure-Foundations: Take advantage of 10Gb NICs in the new network stack - https://phabricator.wikimedia.org/T360297#10279917 (ayounsi) @Papaul 54 but that only included rows A and B, now C and D are also eligible to a free 10G upgrade when available. @Volans I tried to repro...
[09:35:29] netops, DC-Ops, Infrastructure-Foundations: Take advantage of 10Gb NICs in the new network stack - https://phabricator.wikimedia.org/T360297#10279998 (ayounsi) Had a chat with Riccardo on IRC, here is the new list I came up with: ` db[2136,2139-2182,2185-2189,2191-2195,2206-2220].codfw.wmnet es[2020...
[11:25:25] FIRING: SystemdUnitFailed: haproxykafka.service on cp3066:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed
[11:35:25] RESOLVED: SystemdUnitFailed: haproxykafka.service on cp3066:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed
[13:24:51] netops, DC-Ops, Infrastructure-Foundations: Take advantage of 10Gb NICs in the new network stack - https://phabricator.wikimedia.org/T360297#10280778 (ayounsi)
[13:30:51] netops, DC-Ops, Infrastructure-Foundations: Take advantage of 10Gb NICs in the new network stack - https://phabricator.wikimedia.org/T360297#10280820 (ayounsi)
[14:57:39] Traffic, SRE: Create provisioning and post-provisioning checks for Traffic hosts to confirm validity of varying hardware configurations - https://phabricator.wikimedia.org/T378724 (ssingh) NEW
[14:58:58] Traffic, SRE: Create provisioning and post-provisioning checks for Traffic hosts to confirm validity of varying hardware configurations - https://phabricator.wikimedia.org/T378724#10281202 (ssingh)
[15:02:11] Traffic, SRE: Create provisioning and post-provisioning checks for Traffic hosts to confirm validity of varying hardware configurations - https://phabricator.wikimedia.org/T378724#10281206 (ssingh) p:Triage→Medium
[15:22:25] FIRING: SystemdUnitFailed: haproxykafka.service on cp3066:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed
[15:27:25] RESOLVED: SystemdUnitFailed: haproxykafka.service on cp3066:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed
[15:59:10] Traffic: Upgrade Varnish from 6.0 to 7.1 - https://phabricator.wikimedia.org/T378737 (BCornwall) NEW
[15:59:24] Traffic: Upgrade Varnish from 6.0 to 7.1 - https://phabricator.wikimedia.org/T378737#10281546 (BCornwall) p:Triage→Medium
[16:04:24] Traffic: Upgrade Varnish from 6.0 to 7.1 - https://phabricator.wikimedia.org/T378737#10281594 (BCornwall)
[16:05:16] Traffic: Upgrade Varnish from 6.0 to 7.1 - https://phabricator.wikimedia.org/T378737#10281600 (BCornwall)
[16:06:05] Traffic: Upgrade Varnish from 6.0 to 7.1 - https://phabricator.wikimedia.org/T378737#10281610 (BCornwall)
[16:42:41] Traffic: GeoDNS: consider sending CN to eqsin - https://phabricator.wikimedia.org/T378744 (ayounsi) NEW
[16:42:56] Traffic: GeoDNS: consider sending CN to eqsin - https://phabricator.wikimedia.org/T378744#10281805 (ayounsi)
[16:45:25] FIRING: [2x] SystemdUnitFailed: haproxykafka.service on cp3066:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed
[16:55:25] FIRING: [2x] SystemdUnitFailed: haproxykafka.service on cp3066:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed
[17:09:24] Traffic, conftool: HIDDENPARMA should display the contents of patterns on the action's page - https://phabricator.wikimedia.org/T378355#10281928 (kamila) Open→Resolved
[17:35:25] RESOLVED: SystemdUnitFailed: haproxykafka.service on cp3066:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed
[18:06:13] Traffic: GeoDNS: consider sending CN to eqsin - https://phabricator.wikimedia.org/T378744#10282257 (ssingh) Hi, thanks for the task. There are no objections from Traffic since the need for moving it is clear and so is the data around latency. (Hopefully we become aware of such improvements automatically wit...
[18:43:17] I know you know this already but 😭 https://wikis.world/@feistyduck@infosec.exchange/113403462365337081
[19:10:38] Traffic, SRE, Patch-For-Review: Create provisioning and post-provisioning checks for Traffic hosts to confirm validity of varying hardware configurations - https://phabricator.wikimedia.org/T378724#10282438 (ssingh) a:CDobbins
[19:31:04] Traffic: Set CPU affinity for haproxykafka process - https://phabricator.wikimedia.org/T378758 (Fabfur) NEW
[20:26:12] Amir1: Apple apparently wanted to push even lower :)
[20:26:52] there was some reporting they were pushing for 10 days
[20:27:52] 😭
[20:28:20] it'll be ok, we're on track to be ready for fully-automated and short-days-ok by the time of even the proposed drop to 200 days
[20:28:54] (for all the acme-chief issued stuff, the unified and others)
[20:29:26] but yeah, it's time to start thinking twice about any remaining old-school manual-issue cases for which there isn't a transition plan
[20:33:31] anyways, there's been multiple proposals like this recently, and I'm generally a fan, and we can do it for our big stuff by any reasonable timeline they propose, I think.
[20:33:52] but there's a lot of little edge cases, and I don't know if they'll really get this through on the currently-aggressive schedule or not
[23:43:38] Traffic: Fix Varnish tests - https://phabricator.wikimedia.org/T370202#10283208 (BCornwall) Open→In progress a:BCornwall
[23:47:01] Traffic: Fix Varnish tests - https://phabricator.wikimedia.org/T370202#10283213 (BCornwall)
[23:47:31] Traffic: Fix Varnish tests - https://phabricator.wikimedia.org/T370202#10283214 (BCornwall)
[23:50:29] Traffic: Fix Varnish tests - https://phabricator.wikimedia.org/T370202#10283215 (BCornwall) I tried switching the `bad_ip` value to `198.51.100.2` (a different subnet reserved for docs) in case the `192.0.2.255` address was somehow responding differently. No dice.