[00:02:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 29.29, 23.86, 21.29 [00:04:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 23.07, 22.94, 21.24 [00:08:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 19.43, 20.06, 20.40 [00:09:28] PROBLEM - mw152 MediaWiki Rendering on mw152 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:11:32] RECOVERY - mw152 MediaWiki Rendering on mw152 is OK: HTTP OK: HTTP/1.1 200 OK - 8191 bytes in 4.540 second response time [00:13:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [00:18:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 22.23, 21.75, 20.74 [00:20:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 27.40, 23.62, 21.52 [00:29:26] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 26.33, 23.28, 20.20 [00:31:22] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 18.18, 21.27, 19.83 [00:33:18] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 17.25, 19.99, 19.53 [00:53:18] PROBLEM - cp26 Varnish Backends on cp26 is CRITICAL: 1 backends are down. mw152 [00:55:15] RECOVERY - cp26 Varnish Backends on cp26 is OK: All 19 backends are healthy [01:28:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 19.59, 20.25, 23.42 [01:34:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 25.07, 21.31, 22.68 [01:36:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 23.36, 22.70, 23.08 [01:38:30] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [01:44:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 27.09, 23.22, 22.68 [01:46:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 23.17, 22.97, 22.65 [02:04:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 28.08, 22.31, 21.36 [02:06:00] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [02:06:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 18.67, 20.87, 20.97 [02:08:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 14.78, 19.03, 20.31 [02:08:48] PROBLEM - wiki.pulsus.cc - LetsEncrypt on sslhost is WARNING: WARNING - Certificate 'wiki.pulsus.cc' expires in 15 day(s) (Tue 13 Aug 2024 01:37:49 AM GMT +0000). [02:09:01] [02ssl] 07WikiTideSSLBot pushed 031 commit to 03master [+0/-0/±1] 13https://github.com/miraheze/ssl/compare/f05db672148d...c23a31f7f7e9 [02:09:03] [02ssl] 07WikiTideSSLBot 03c23a31f - Bot: Update SSL cert for wiki.pulsus.cc [02:14:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 23.23, 21.11, 20.74 [02:16:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.69, 22.05, 21.11 [02:17:44] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.37, 20.72, 17.92 [02:19:43] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 18.28, 19.69, 17.90 [02:20:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 23.47, 23.78, 22.13 [02:27:47] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 22.32, 21.31, 19.48 [02:29:43] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 20.34, 20.24, 19.27 [02:32:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.76, 23.83, 22.88 [02:38:06] RECOVERY - wiki.pulsus.cc - LetsEncrypt on sslhost is OK: OK - Certificate 'wiki.pulsus.cc' will expire on Sat 26 Oct 2024 01:08:54 AM GMT +0000. [02:38:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 18.82, 23.81, 23.40 [02:48:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 28.74, 23.87, 23.03 [02:49:04] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 21.84, 19.83, 18.93 [02:50:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 19.10, 22.32, 22.61 [02:51:00] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 16.49, 18.70, 18.63 [03:02:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 26.00, 23.85, 22.95 [03:02:38] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 22.78, 20.17, 18.95 [03:04:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 19.45, 22.78, 22.72 [03:04:38] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 15.75, 18.16, 18.35 [03:06:00] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [03:09:18] PROBLEM - wiki.andreijiroh.uk.eu.org - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.andreijiroh.uk.eu.org All nameservers failed to answer the query. [03:12:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 25.06, 20.84, 21.38 [03:14:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 19.00, 19.91, 20.97 [03:26:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 27.27, 23.75, 22.05 [03:28:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 19.72, 22.32, 21.76 [03:32:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 15.36, 18.58, 20.34 [03:33:00] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [03:36:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.41, 21.92, 21.29 [03:38:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 20.03, 20.80, 20.94 [03:42:48] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 24.51, 20.25, 18.23 [03:44:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 17.48, 19.25, 20.24 [03:44:44] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 18.36, 19.19, 18.07 [03:54:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 19.87, 20.72, 20.55 [03:56:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 17.35, 19.66, 20.20 [04:06:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 21.75, 20.07, 19.78 [04:08:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 19.59, 19.15, 19.44 [04:09:05] RECOVERY - wiki.andreijiroh.uk.eu.org - reverse DNS on sslhost is OK: SSL OK - wiki.andreijiroh.uk.eu.org reverse DNS resolves to cp36.wikitide.net - CNAME OK [04:41:08] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 20.99, 20.47, 18.85 [04:43:03] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 16.07, 19.28, 18.64 [04:46:52] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 24.00, 22.18, 19.98 [04:50:41] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 18.91, 20.35, 19.69 [04:53:00] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [04:58:06] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.00, 21.16, 18.44 [05:02:06] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 14.18, 18.31, 18.00 [05:02:25] [Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:07:25] [Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:08:50] [Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:10:21] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [05:10:25] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.36, 3.80, 2.10 [05:12:19] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 1.83, 3.08, 2.04 [05:12:22] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 6.188 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [05:13:50] [Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:17:05] PROBLEM - ns2 Puppet on ns2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [05:22:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:36:37] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 19.56, 21.71, 20.26 [05:42:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 17.83, 20.17, 20.15 [05:45:03] RECOVERY - ns2 Puppet on ns2 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [05:47:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:52:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.54, 22.57, 20.95 [05:52:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:57:30] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:58:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 18.44, 22.11, 21.46 [05:59:50] [Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:59:51] PROBLEM - db161 Current Load on db161 is CRITICAL: LOAD CRITICAL - total load average: 37.91, 16.53, 6.92 [06:03:51] RECOVERY - db161 Current Load on db161 is OK: LOAD OK - total load average: 1.43, 8.55, 5.92 [06:04:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 27.71, 23.99, 22.38 [06:04:50] [Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [06:07:03] [Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [06:13:50] PROBLEM - swiftac171 Current Load on swiftac171 is WARNING: LOAD WARNING - total load average: 11.26, 9.73, 5.30 [06:14:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 21.16, 23.57, 23.23 [06:15:46] RECOVERY - swiftac171 Current Load on swiftac171 is OK: LOAD OK - total load average: 3.76, 7.42, 4.97 [06:16:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.81, 24.05, 23.44 [06:17:03] [Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [06:17:26] PROBLEM - ns2 Puppet on ns2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [06:18:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 22.60, 23.49, 23.31 [06:23:20] [Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [06:24:13] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.66, 4.10, 3.24 [06:24:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 27.90, 23.78, 23.16 [06:26:07] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.51, 3.65, 3.19 [06:28:01] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 2.15, 3.28, 3.12 [06:28:20] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [06:28:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 19.59, 22.22, 22.72 [06:33:20] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [06:33:44] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.45, 4.49, 3.57 [06:35:38] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.52, 3.75, 3.39 [06:36:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 15.01, 17.93, 20.40 [06:37:37] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.34, 4.38, 3.67 [06:38:20] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [06:39:36] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.31, 3.43, 3.40 [06:40:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 18.22, 20.01, 20.86 [06:41:36] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 1.60, 2.86, 3.20 [06:44:25] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 28.84, 23.82, 22.09 [06:45:14] RECOVERY - ns2 Puppet on ns2 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [06:48:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 21.04, 23.26, 22.35 [06:53:15] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.55, 3.50, 3.37 [06:53:20] [Grafana] !tech FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [06:55:09] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.27, 3.73, 3.45 [06:57:03] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 2.82, 3.13, 3.25 [07:04:39] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.23, 3.53, 3.33 [07:06:34] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.24, 3.55, 3.35 [07:08:20] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [07:08:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 16.95, 19.03, 20.29 [07:08:28] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.98, 3.68, 3.44 [07:10:21] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 0.63, 2.56, 3.06 [07:13:20] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech RESOLVED: High Job Queue Backlog https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [07:15:35] PROBLEM - wiki.moores.tech - reverse DNS on sslhost is WARNING: rDNS WARNING - reverse DNS entry for wiki.moores.tech could not be found [07:23:20] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [07:36:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 18.84, 21.29, 20.39 [07:38:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 15.16, 19.07, 19.67 [08:01:54] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 25.03, 21.09, 19.65 [08:07:38] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 22.20, 23.31, 21.21 [08:09:32] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 26.10, 23.64, 21.52 [08:11:27] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 20.21, 22.20, 21.25 [08:15:15] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.07, 22.80, 21.65 [08:17:10] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 23.33, 22.41, 21.61 [08:18:00] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [08:19:04] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.41, 22.32, 21.62 [08:22:53] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 21.91, 22.97, 22.13 [08:28:36] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.92, 21.53, 21.47 [08:30:31] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 19.05, 21.11, 21.37 [08:44:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 16.34, 18.38, 19.93 [09:15:33] PROBLEM - ns2 Puppet on ns2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [09:17:14] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 21.01, 19.89, 19.34 [09:19:08] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 18.75, 19.86, 19.42 [09:24:38] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.62, 20.71, 18.37 [09:24:52] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 22.46, 21.32, 20.14 [09:26:38] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 19.09, 20.75, 18.73 [09:28:41] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 28.40, 23.82, 21.30 [09:32:29] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 17.75, 22.01, 21.27 [09:32:38] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 17.18, 19.47, 18.98 [09:38:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 12.81, 18.48, 20.14 [09:45:04] RECOVERY - ns2 Puppet on ns2 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [10:04:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 23.88, 20.31, 18.58 [10:06:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 18.99, 19.66, 18.55 [10:17:12] PROBLEM - ns2 Puppet on ns2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [10:22:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 21.65, 20.24, 19.00 [10:28:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 18.24, 20.09, 19.48 [10:36:25] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 21.03, 21.69, 20.24 [10:44:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 18.86, 19.60, 19.87 [10:47:09] RECOVERY - ns2 Puppet on ns2 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [11:03:00] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [11:03:09] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [11:03:36] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.42, 2.92, 1.23 [11:05:12] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 9.083 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [11:05:36] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 3.31, 2.70, 1.34 [11:07:44] PROBLEM - db171 Backups SQL on db171 is WARNING: FILE_AGE WARNING: /var/log/sql-backup.log is 864197 seconds old and 139855 bytes [11:08:00] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [11:18:00] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [11:25:20] [Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [11:25:36] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 12.88, 19.62, 23.89 [11:26:59] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.64, 4.06, 2.84 [11:27:06] PROBLEM - prometheus151 SSH on prometheus151 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [11:29:05] RECOVERY - prometheus151 SSH on prometheus151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u3 (protocol 2.0) [11:30:20] [Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [11:32:20] [Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [11:34:43] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 1.63, 3.88, 3.46 [11:35:35] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 28.48, 21.46, 22.31 [11:36:37] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 1.46, 3.35, 3.33 [11:37:20] [Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [11:39:35] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 16.53, 22.05, 22.64 [11:43:20] [Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [11:45:12] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.14, 4.17, 3.65 [11:47:35] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 27.19, 21.53, 21.77 [11:48:20] [Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [11:49:00] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.65, 3.97, 3.70 [11:49:35] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 15.80, 19.10, 20.86 [11:50:55] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.68, 4.36, 3.85 [11:51:35] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 13.91, 17.04, 19.87 [11:52:49] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.96, 3.54, 3.60 [11:54:44] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 2.14, 3.00, 3.39 [11:56:20] [Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [11:57:33] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [11:58:12] PROBLEM - prometheus151 SSH on prometheus151 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [11:58:36] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.29, 4.83, 4.06 [11:59:27] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.074 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [12:01:20] [Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [12:01:50] [Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [12:03:48] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [12:04:19] RECOVERY - prometheus151 SSH on prometheus151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u3 (protocol 2.0) [12:05:42] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.087 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [12:06:18] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.98, 20.55, 18.14 [12:08:18] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 19.94, 20.24, 18.33 [12:09:12] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 20.70, 21.73, 19.97 [12:11:50] [Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [12:12:16] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 19.69, 21.02, 19.18 [12:12:20] [Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [12:13:51] PROBLEM - ns2 NTP time on ns2 is UNKNOWN: check_ntp_time: Invalid hostname/address - time.cloudflare.comUsage: check_ntp_time -H [-4|-6] [-w ] [-c ] [-v verbose] [-o