[00:01:47] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.45, 3.23, 3.54 [00:03:47] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.67, 4.27, 3.89 [00:05:47] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.69, 3.54, 3.67 [00:10:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 18.81, 21.61, 23.76 [00:12:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 20.82, 22.42, 23.81 [00:13:46] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.32, 3.29, 3.46 [00:18:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.18, 22.38, 23.22 [00:19:48] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.13, 3.70, 3.66 [00:20:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 22.18, 22.10, 23.01 [00:25:46] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 1.87, 2.73, 3.28 [00:28:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.07, 22.46, 22.60 [00:30:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 23.64, 22.05, 22.38 [00:35:41] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.11, 3.99, 3.63 [00:37:40] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.04, 4.43, 3.82 [00:44:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 18.22, 19.20, 20.34 [00:45:35] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.52, 3.87, 3.91 [00:47:34] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.58, 4.66, 4.20 [00:48:23] PROBLEM - prometheus151 SSH on prometheus151 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:48:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 24.20, 22.18, 22.08 [00:50:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.72, 22.58, 21.41 [00:51:32] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 20.51, 19.15, 17.74 [00:52:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 18.35, 20.88, 20.94 [00:52:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 23.53, 23.10, 22.52 [00:53:32] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 15.93, 17.77, 17.41 [00:56:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 17.71, 19.34, 20.30 [00:58:50] RECOVERY - prometheus151 SSH on prometheus151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u3 (protocol 2.0) [00:59:25] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.56, 3.58, 3.96 [01:00:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 20.98, 21.28, 20.92 [01:01:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [01:02:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 27.04, 23.61, 22.80 [01:03:24] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.96, 4.18, 4.10 [01:04:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 21.61, 22.68, 22.57 [01:05:23] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.16, 3.67, 3.94 [01:08:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 15.93, 18.78, 19.98 [01:09:21] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.51, 4.26, 4.05 [01:12:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 24.36, 22.43, 22.45 [01:13:18] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.18, 3.82, 4.00 [01:14:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.28, 21.00, 20.38 [01:14:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 22.61, 22.15, 22.33 [01:16:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.80, 21.98, 20.82 [01:16:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.20, 23.32, 22.74 [01:18:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 23.52, 23.20, 22.77 [01:19:14] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.43, 3.50, 3.75 [01:23:10] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.68, 3.69, 3.81 [01:24:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 17.22, 19.78, 20.39 [01:27:09] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.32, 3.25, 3.54 [01:28:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 19.31, 20.22, 20.48 [01:29:08] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.19, 3.45, 3.60 [01:31:08] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.05, 3.97, 3.77 [01:36:53] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [01:41:01] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.70, 3.51, 3.80 [01:42:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 16.24, 19.03, 19.98 [01:43:01] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.121 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [01:46:58] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.45, 4.39, 4.02 [01:50:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.75, 21.36, 20.57 [01:50:54] PROBLEM - prometheus151 APT on prometheus151 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 60 seconds. [01:51:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [01:52:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 17.74, 19.69, 20.04 [01:53:16] RECOVERY - prometheus151 APT on prometheus151 is OK: APT OK: 51 packages available for upgrade (0 critical updates). [02:01:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [02:12:59] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 19.13, 20.00, 20.34 [02:16:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.83, 21.30, 20.16 [02:16:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 26.30, 22.46, 21.21 [02:18:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.41, 22.45, 20.75 [02:18:57] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 20.88, 18.26, 16.40 [02:18:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 21.56, 22.10, 21.24 [02:20:55] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 16.06, 17.65, 16.42 [02:23:05] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [02:24:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.92, 22.99, 21.74 [02:25:23] PROBLEM - prometheus151 APT on prometheus151 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 60 seconds. [02:25:29] [02mediawiki-repos] 07OAuthority pushed 031 commit to 03master [+0/-0/±1] 13https://github.com/miraheze/mediawiki-repos/compare/0f3d78436495...3d65b9b1d804 [02:25:32] [02mediawiki-repos] 07AverageHelper 033d65b9b - T12447: Add GoogleForms (#30) [02:25:32] PROBLEM - prometheus151 Puppet on prometheus151 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 60 seconds. [02:25:33] [02mediawiki-repos] 07OAuthority closed pull request 03#30: T12447: Add GoogleForms - 13https://github.com/miraheze/mediawiki-repos/pull/30 [02:26:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [02:26:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 23.28, 23.17, 21.98 [02:27:07] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.071 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [02:27:19] RECOVERY - prometheus151 APT on prometheus151 is OK: APT OK: 51 packages available for upgrade (0 critical updates). [02:27:33] RECOVERY - prometheus151 Puppet on prometheus151 is OK: OK: Puppet is currently enabled, last run 9 minutes ago with 0 failures [02:31:30] [Grafana] RESOLVED: High Job Queue Backlog https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [02:33:59] [02mw-config] 07OAuthority pushed 031 commit to 03master [+0/-0/±1] 13https://github.com/miraheze/mw-config/compare/8f23490b9596...b194aea19ac3 [02:34:02] [02mw-config] 07anpang54 03b194aea - Remove spacewiki code in LocalWiki.php (#5637) [02:34:03] [02mw-config] 07OAuthority closed pull request 03#5637: Remove spacewiki code in LocalWiki.php - 13https://github.com/miraheze/mw-config/pull/5637 [02:35:00] miraheze/mw-config - OAuthority the build passed. [02:35:12] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 21.55, 19.85, 18.01 [02:37:12] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 18.23, 19.08, 17.95 [02:37:40] !log [@mwtask171] starting deploy of {'config': True} to all [02:37:47] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [02:37:55] !log [@mwtask171] finished deploy of {'config': True} to all - SUCCESS in 15s [02:38:05] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [02:42:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.01, 22.15, 21.90 [02:44:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 22.21, 21.23, 21.54 [02:46:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 18.40, 19.52, 20.25 [02:50:19] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 1.31, 2.83, 3.97 [02:52:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.19, 20.73, 20.52 [02:52:19] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.61, 3.90, 4.22 [02:52:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 24.11, 21.46, 21.22 [02:53:19] !log [@test151] starting deploy of {'config': True} to test151 [02:53:19] !log [@test151] finished deploy of {'config': True} to test151 - SUCCESS in 0s [02:53:30] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [02:53:37] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [02:53:46] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [02:54:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 21.95, 21.63, 21.32 [02:57:31] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.073 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [02:58:17] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.23, 3.29, 3.90 [03:00:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.03, 21.75, 20.93 [03:00:05] RECOVERY - db171 Backups SQL on db171 is OK: FILE_AGE OK: /var/log/sql-backup.log is 4 seconds old and 0 bytes [03:02:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 19.44, 21.13, 20.83 [03:02:03] !log [@mwtask181] starting deploy of {'config': True} to all [03:02:10] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [03:02:27] !log [@mwtask181] finished deploy of {'config': True} to all - SUCCESS in 23s [03:02:43] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [03:06:08] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [03:06:14] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.95, 4.23, 4.08 [03:06:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [03:08:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 17.23, 18.76, 19.98 [03:08:06] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 1.612 second response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [03:11:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [03:16:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.00, 22.50, 21.38 [03:18:07] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 1.84, 3.23, 3.90 [03:18:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 20.61, 21.90, 21.31 [03:22:05] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.16, 3.79, 3.94 [03:22:41] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 21.26, 18.88, 17.19 [03:24:41] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 16.42, 17.70, 16.95 [03:26:01] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.93, 3.54, 3.83 [03:28:01] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 7.62, 4.38, 4.06 [03:29:34] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.21, 21.58, 20.48 [03:31:57] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.61, 3.85, 3.97 [03:32:30] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 21.03, 19.40, 17.65 [03:32:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 27.08, 23.89, 22.43 [03:33:56] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.30, 3.67, 3.86 [03:34:30] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 18.12, 18.83, 17.65 [03:34:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 23.22, 23.49, 22.46 [03:35:54] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.03, 3.12, 3.62 [03:36:30] [Grafana] RESOLVED: High Job Queue Backlog https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [03:37:54] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [03:37:55] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.42, 4.30, 3.99 [03:39:54] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 3.603 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [03:39:54] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.25, 3.74, 3.82 [03:43:11] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 19.91, 19.77, 20.23 [03:49:49] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.81, 3.64, 3.67 [03:50:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 24.64, 22.62, 22.16 [03:52:59] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 20.53, 21.56, 21.82 [03:54:12] PROBLEM - prometheus151 APT on prometheus151 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 60 seconds. [03:54:22] PROBLEM - prometheus151 Puppet on prometheus151 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 60 seconds. [03:56:08] RECOVERY - prometheus151 APT on prometheus151 is OK: APT OK: 51 packages available for upgrade (0 critical updates). [03:56:23] RECOVERY - prometheus151 Puppet on prometheus151 is OK: OK: Puppet is currently enabled, last run 8 minutes ago with 0 failures [03:56:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [04:00:12] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [04:01:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [04:04:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.74, 22.94, 22.13 [04:06:26] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 19.97, 20.55, 19.84 [04:06:27] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 8.377 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [04:16:10] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.52, 23.22, 21.60 [04:16:17] RECOVERY - os162 Current Load on os162 is OK: LOAD OK - total load average: 5.77, 6.36, 6.79 [04:17:46] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 1.44, 2.90, 3.94 [04:18:06] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 22.59, 22.70, 21.59 [04:19:47] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.49, 4.21, 4.28 [04:23:46] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 1.46, 3.19, 3.89 [04:24:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.32, 22.66, 21.83 [04:24:17] PROBLEM - os162 Current Load on os162 is WARNING: LOAD WARNING - total load average: 6.84, 6.75, 6.73 [04:25:47] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.48, 4.04, 4.11 [04:28:17] RECOVERY - os162 Current Load on os162 is OK: LOAD OK - total load average: 5.63, 6.28, 6.55 [04:30:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 18.72, 22.55, 22.37 [04:32:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 27.30, 24.62, 23.15 [04:32:09] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [04:34:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.67, 23.58, 22.96 [04:34:04] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.211 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [04:45:11] PROBLEM - os162 Current Load on os162 is WARNING: LOAD WARNING - total load average: 7.14, 6.76, 6.54 [04:49:09] RECOVERY - os162 Current Load on os162 is OK: LOAD OK - total load average: 6.25, 6.59, 6.54 [04:50:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 26.03, 21.87, 21.35 [04:51:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [04:52:17] PROBLEM - prometheus151 APT on prometheus151 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 60 seconds. [04:52:26] PROBLEM - prometheus151 SSH on prometheus151 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:54:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 20.13, 22.00, 21.63 [04:55:08] RECOVERY - prometheus151 APT on prometheus151 is OK: APT OK: 51 packages available for upgrade (0 critical updates). [04:56:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [04:56:38] RECOVERY - prometheus151 SSH on prometheus151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u3 (protocol 2.0) [05:05:58] PROBLEM - os162 Current Load on os162 is WARNING: LOAD WARNING - total load average: 7.23, 6.72, 6.55 [05:07:46] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 0.34, 2.46, 3.93 [05:07:57] RECOVERY - os162 Current Load on os162 is OK: LOAD OK - total load average: 5.97, 6.39, 6.45 [05:10:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 17.86, 19.08, 20.38 [05:11:46] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 0.05, 1.14, 3.05 [05:14:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 26.41, 23.52, 21.95 [05:18:49] PROBLEM - os162 Current Load on os162 is WARNING: LOAD WARNING - total load average: 7.55, 6.99, 6.73 [05:21:29] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 26.65, 21.25, 18.07 [05:22:26] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 25.34, 21.37, 18.61 [05:23:12] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 21.86, 20.58, 18.34 [05:23:27] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 20.36, 21.14, 18.45 [05:24:30] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 22.45, 20.88, 17.92 [05:24:46] RECOVERY - os162 Current Load on os162 is OK: LOAD OK - total load average: 5.91, 6.75, 6.76 [05:25:12] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 18.67, 20.31, 18.55 [05:26:17] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 20.92, 21.40, 19.23 [05:28:13] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 19.49, 20.13, 18.98 [05:28:30] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 16.22, 19.22, 18.00 [05:31:12] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 21.39, 21.46, 19.63 [05:31:22] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 19.49, 20.16, 19.16 [05:33:12] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 28.46, 24.02, 20.79 [05:33:28] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 27.43, 22.17, 18.68 [05:34:01] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 27.63, 23.74, 20.71 [05:34:30] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 20.57, 21.48, 19.36 [05:35:12] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 18.04, 20.91, 20.00 [05:35:17] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 22.32, 22.11, 20.32 [05:35:24] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 18.88, 20.85, 18.62 [05:35:57] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 19.79, 22.00, 20.43 [05:37:20] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 14.58, 18.65, 18.09 [05:40:30] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 17.77, 20.11, 19.51 [05:41:12] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 16.20, 19.40, 19.87 [05:41:13] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 15.40, 18.75, 19.55 [05:41:36] PROBLEM - os162 Current Load on os162 is WARNING: LOAD WARNING - total load average: 7.24, 6.94, 6.63 [05:41:44] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 16.95, 19.68, 20.02 [05:45:34] RECOVERY - os162 Current Load on os162 is OK: LOAD OK - total load average: 6.70, 6.80, 6.65 [05:47:07] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 26.64, 22.70, 20.87 [05:47:12] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 25.48, 22.80, 21.03 [05:47:19] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 27.88, 23.98, 21.16 [05:47:32] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 27.07, 23.64, 21.49 [05:49:05] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 21.50, 22.33, 20.99 [05:49:12] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 19.96, 20.95, 20.53 [05:49:13] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 17.93, 21.57, 20.62 [05:49:32] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 20.59, 22.15, 21.19 [05:51:07] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 17.12, 20.22, 20.25 [05:51:12] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 15.55, 19.12, 19.93 [05:52:00] PROBLEM - ns2 NTP time on ns2 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:53:03] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 17.37, 20.09, 20.38 [05:53:29] PROBLEM - os162 Current Load on os162 is WARNING: LOAD WARNING - total load average: 7.05, 6.94, 6.75 [05:53:32] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 15.78, 18.58, 19.93 [05:54:01] RECOVERY - ns2 NTP time on ns2 is OK: NTP OK: Offset -0.001596301794 secs [05:59:26] RECOVERY - os162 Current Load on os162 is OK: LOAD OK - total load average: 5.04, 6.55, 6.72 [06:11:19] PROBLEM - os162 Current Load on os162 is WARNING: LOAD WARNING - total load average: 7.87, 6.72, 6.50 [06:15:17] RECOVERY - os162 Current Load on os162 is OK: LOAD OK - total load average: 5.79, 6.45, 6.47 [06:17:42] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 21.59, 19.77, 19.05 [06:17:47] PROBLEM - ns2 Puppet on ns2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [06:19:41] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 17.27, 18.87, 18.82 [06:24:44] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 23.66, 19.86, 17.37 [06:25:11] PROBLEM - os162 Current Load on os162 is WARNING: LOAD WARNING - total load average: 7.44, 6.71, 6.50 [06:26:44] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 16.93, 18.69, 17.25 [06:29:09] RECOVERY - os162 Current Load on os162 is OK: LOAD OK - total load average: 5.07, 6.36, 6.45 [06:34:12] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 22.89, 20.78, 19.43 [06:34:22] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 22.98, 20.74, 19.40 [06:34:44] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 23.04, 20.76, 18.36 [06:35:06] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 20.65, 21.24, 19.74 [06:36:07] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 20.17, 20.31, 19.40 [06:36:44] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 17.39, 19.81, 18.33 [06:37:00] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 16.40, 19.95, 19.48 [06:38:11] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 16.20, 18.85, 18.96 [06:45:05] PROBLEM - ns2 NTP time on ns2 is UNKNOWN: check_ntp_time: Invalid hostname/address - time.cloudflare.comUsage: check_ntp_time -H [-4|-6] [-w ] [-c ] [-v verbose] [-o