[00:00:05] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.90, 4.11, 3.47 [00:02:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.39, 22.59, 22.40 [00:02:04] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.22, 3.54, 3.33 [00:03:36] [02python-functions] 07dependabot[bot] created branch 03dependabot/pip/dot-github/setuptools-73.0.1 - 13https://github.com/miraheze/python-functions [00:03:38] [02python-functions] 07dependabot[bot] pushed 031 commit to 03dependabot/pip/dot-github/setuptools-73.0.1 [+0/-0/±1] 13https://github.com/miraheze/python-functions/commit/d5c03cbc202b [00:03:40] [02python-functions] 07dependabot[bot] 03d5c03cb - Bump setuptools from 71.0.1 to 73.0.1 in /.github [00:03:41] [02python-functions] 07dependabot[bot] labeled pull request 03#56: Bump setuptools from 71.0.1 to 73.0.1 in /.github - 13https://github.com/miraheze/python-functions/pull/56 [00:03:42] [02python-functions] 07dependabot[bot] opened pull request 03#56: Bump setuptools from 71.0.1 to 73.0.1 in /.github - 13https://github.com/miraheze/python-functions/pull/56 [00:03:45] [02python-functions] 07dependabot[bot] labeled pull request 03#56: Bump setuptools from 71.0.1 to 73.0.1 in /.github - 13https://github.com/miraheze/python-functions/pull/56 [00:03:46] [02python-functions] 07dependabot[bot] closed pull request 03#55: Bump setuptools from 71.0.1 to 73.0.0 in /.github - 13https://github.com/miraheze/python-functions/pull/55 [00:03:48] [02python-functions] 07dependabot[bot] commented on pull request 03#55: Bump setuptools from 71.0.1 to 73.0.0 in /.github - 13https://github.com/miraheze/python-functions/pull/55#issuecomment-2299962552 [00:03:50] [02python-functions] 07dependabot[bot] deleted branch 03dependabot/pip/dot-github/setuptools-73.0.0 [00:03:51] [02python-functions] 07dependabot[bot] deleted branch 03dependabot/pip/dot-github/setuptools-73.0.0 - 13https://github.com/miraheze/python-functions [00:03:53] [02python-functions] 07coderabbitai[bot] commented on pull request 03#56: Bump setuptools from 71.0.1 to 73.0.1 in /.github - 13https://github.com/miraheze/python-functions/pull/56#issuecomment-2299962698 [00:04:02] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.80, 4.52, 3.73 [00:04:45] [02python-functions] 07coderabbitai[bot] edited pull request 03#56: Bump setuptools from 71.0.1 to 73.0.1 in /.github - 13https://github.com/miraheze/python-functions/pull/56 [00:06:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.18, 23.50, 22.87 [00:06:21] [02python-functions] 07coderabbitai[bot] edited a comment on pull request 03#56: Bump setuptools from 71.0.1 to 73.0.1 in /.github - 13https://github.com/miraheze/python-functions/pull/56#issuecomment-2299962698 [00:09:31] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [00:09:34] PROBLEM - prometheus151 SSH on prometheus151 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:11:33] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 6.467 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [00:11:34] RECOVERY - prometheus151 SSH on prometheus151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u3 (protocol 2.0) [00:12:25] [Grafana] RESOLVED: High Job Queue Backlog https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [00:16:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [00:17:56] miraheze/python-functions - dependabot[bot] the build passed. [00:18:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.70, 22.83, 22.33 [00:22:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.54, 23.80, 22.91 [00:23:52] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 1.29, 3.20, 3.99 [00:24:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.95, 24.50, 23.27 [00:25:52] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.11, 4.15, 4.22 [00:39:47] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.40, 3.29, 3.96 [00:41:47] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.65, 4.24, 4.23 [00:47:47] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.59, 3.12, 3.77 [00:49:48] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.04, 4.29, 4.12 [00:50:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 18.52, 20.78, 23.31 [00:50:11] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [00:51:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [00:51:46] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.44, 3.83, 4.00 [00:52:07] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.089 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [00:56:30] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [00:58:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.27, 22.79, 23.14 [00:59:48] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.42, 3.50, 3.66 [01:00:30] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 27.56, 20.61, 17.73 [01:00:31] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.078 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [01:02:30] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 20.90, 20.46, 18.04 [01:02:41] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 19.84, 20.51, 18.09 [01:04:30] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 13.96, 18.08, 17.47 [01:04:41] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 18.11, 19.56, 18.02 [01:05:47] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 1.60, 3.65, 3.76 [01:06:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.25, 23.57, 23.70 [01:06:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [01:07:46] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 0.28, 2.46, 3.31 [01:10:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 28.23, 24.69, 23.96 [01:12:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 22.03, 23.36, 23.56 [01:18:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.49, 24.23, 23.73 [01:20:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 17.61, 21.74, 22.89 [01:26:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.40, 22.34, 22.49 [01:32:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.51, 23.81, 23.36 [01:41:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [01:42:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 28.52, 23.82, 23.11 [01:43:14] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 24.15, 20.56, 17.53 [01:43:25] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 27.87, 21.65, 17.90 [01:43:31] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 28.22, 22.07, 18.63 [01:44:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.49, 23.74, 23.23 [01:45:13] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 16.25, 18.80, 17.24 [01:45:19] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 15.36, 19.02, 17.38 [01:45:26] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 13.99, 18.92, 17.90 [02:11:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [02:18:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 18.78, 19.09, 20.29 [02:28:53] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.06, 21.35, 20.62 [02:30:50] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.61, 22.35, 21.07 [02:32:47] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 17.20, 20.85, 20.70 [02:34:43] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 16.64, 19.56, 20.25 [02:46:18] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 29.72, 24.34, 21.83 [02:46:44] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 28.33, 20.82, 17.15 [02:47:26] PROBLEM - cp27 Varnish Backends on cp27 is CRITICAL: 3 backends are down. mw151 mw161 mw182 [02:48:30] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 24.89, 22.19, 18.01 [02:48:41] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 21.81, 20.75, 17.50 [02:48:44] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 23.49, 22.22, 18.15 [02:49:12] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 20.57, 21.55, 18.52 [02:49:25] RECOVERY - cp27 Varnish Backends on cp27 is OK: All 19 backends are healthy [02:50:11] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 20.34, 23.93, 22.41 [02:50:30] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 16.33, 20.63, 18.01 [02:50:41] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 17.59, 19.77, 17.56 [02:50:44] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 16.16, 19.93, 17.82 [02:52:08] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 27.35, 25.10, 22.99 [02:52:30] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 25.76, 22.31, 18.90 [02:53:12] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 28.79, 23.65, 19.85 [02:54:05] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 18.61, 23.46, 22.70 [02:54:30] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 19.81, 22.08, 19.29 [02:55:12] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 19.08, 22.23, 19.84 [02:56:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.84, 23.77, 22.87 [02:56:30] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 13.59, 19.57, 18.75 [02:57:12] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 15.29, 19.76, 19.22 [03:00:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 22.46, 23.01, 22.78 [03:01:12] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 21.11, 20.32, 19.54 [03:02:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 29.05, 25.42, 23.70 [03:03:12] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 17.44, 18.80, 19.05 [03:04:07] PROBLEM - mon181 Backups Grafana on mon181 is WARNING: FILE_AGE WARNING: /var/log/grafana-backup.log is 864225 seconds old and 92 bytes [03:05:47] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.66, 3.48, 1.52 [03:06:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 22.30, 23.75, 23.40 [03:07:47] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 2.99, 2.93, 1.54 [03:16:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.03, 22.11, 22.45 [03:24:30] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 21.42, 20.66, 18.72 [03:28:30] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 18.02, 19.84, 18.88 [03:30:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 16.85, 22.79, 23.82 [03:36:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.79, 23.22, 23.57 [03:36:35] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 7.33, 4.76, 3.33 [03:38:34] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.67, 3.89, 3.19 [03:38:40] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 27.83, 21.20, 18.86 [03:40:30] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 22.71, 21.25, 19.30 [03:40:32] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 1.51, 3.33, 3.09 [03:41:28] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 20.01, 21.02, 18.93 [03:42:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 22.47, 23.61, 23.85 [03:42:28] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 15.70, 19.81, 18.96 [03:42:30] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 16.30, 19.74, 19.01 [03:43:24] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 15.00, 19.02, 18.45 [03:43:53] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [03:44:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.08, 23.97, 23.96 [03:44:35] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 8.99, 5.78, 4.10 [03:45:49] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.063 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [03:46:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 22.37, 23.26, 23.70 [03:48:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 28.69, 25.22, 24.35 [03:48:32] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 1.92, 3.93, 3.75 [03:49:13] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 27.87, 22.65, 19.95 [03:50:05] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 21.60, 21.08, 19.59 [03:51:08] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 18.30, 21.53, 19.92 [03:51:59] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 19.24, 20.12, 19.39 [03:54:27] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.09, 3.43, 3.56 [03:55:00] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 17.28, 19.52, 19.48 [03:56:26] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 2.91, 2.93, 3.35 [04:00:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 16.32, 21.70, 23.43 [04:03:20] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.34, 3.40, 3.38 [04:04:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 26.75, 23.78, 23.83 [04:05:53] PROBLEM - prometheus151 SSH on prometheus151 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:06:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 20.29, 22.71, 23.46 [04:11:30] [Grafana] RESOLVED: High Job Queue Backlog https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [04:16:29] RECOVERY - prometheus151 SSH on prometheus151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u3 (protocol 2.0) [04:16:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [04:19:13] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.01, 3.63, 3.94 [04:21:12] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.69, 4.80, 4.34 [04:24:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 15.81, 17.99, 19.84 [04:29:07] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 1.37, 3.49, 3.99 [04:31:06] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.14, 4.18, 4.18 [04:36:39] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [04:38:10] PROBLEM - prometheus151 SSH on prometheus151 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:38:34] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.235 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [04:42:15] RECOVERY - prometheus151 SSH on prometheus151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u3 (protocol 2.0) [04:43:34] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 28.51, 22.60, 20.57 [04:45:12] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 28.45, 20.58, 17.00 [04:45:32] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 28.82, 19.76, 15.34 [04:46:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [04:47:12] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 19.45, 20.85, 17.61 [04:47:27] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.23, 22.92, 21.22 [04:47:32] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 15.63, 17.91, 15.22 [04:48:57] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.66, 3.53, 3.95 [04:49:12] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 12.80, 17.66, 16.82 [04:50:55] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.63, 4.55, 4.29 [04:51:21] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 27.50, 23.54, 21.68 [05:01:05] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 19.29, 22.92, 22.79 [05:01:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:06:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:11:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:14:41] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 1.87, 3.16, 3.99 [05:16:39] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.77, 22.75, 21.94 [05:16:39] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 3.66, 3.71, 4.12 [05:18:36] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.01, 21.83, 21.70 [05:18:38] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.59, 3.53, 3.99 [05:20:37] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.95, 3.89, 4.05 [05:26:36] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.68, 3.52, 3.88 [05:28:19] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 15.49, 18.62, 20.05 [05:34:31] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.95, 3.86, 3.73 [05:36:30] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.80, 3.27, 3.52 [05:38:29] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.35, 4.08, 3.79 [05:41:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:42:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 28.86, 22.98, 20.68 [05:42:26] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 2.37, 2.98, 3.38 [05:42:30] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 21.06, 18.97, 16.06 [05:44:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 20.69, 21.95, 20.60 [05:44:30] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 15.30, 17.78, 15.99 [05:46:22] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.90, 3.61, 3.60 [05:50:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.93, 23.54, 21.56 [05:50:20] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.25, 3.58, 3.57 [05:52:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 18.52, 21.31, 20.96 [05:53:37] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [05:54:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 26.72, 23.68, 21.88 [05:56:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 16.81, 20.78, 21.02 [05:56:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:57:38] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.068 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [05:58:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.78, 22.49, 21.60 [06:00:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 20.42, 21.38, 21.29 [06:06:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 14.06, 17.43, 19.69 [06:08:12] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.12, 3.53, 3.94 [06:12:09] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.03, 3.65, 3.88 [06:14:08] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.43, 3.26, 3.69 [06:18:07] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.09, 4.17, 3.93 [06:20:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 27.54, 22.64, 20.65 [06:22:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.16, 22.32, 20.76 [06:26:02] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.85, 3.83, 3.99 [06:26:03] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.06, 22.63, 21.21 [06:28:01] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.93, 4.22, 4.11 [06:30:00] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.29, 3.73, 3.94 [06:30:03] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.28, 23.92, 22.12 [06:33:57] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.03, 3.21, 3.62 [06:37:35] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [06:39:00] PROBLEM - prometheus151 APT on prometheus151 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 60 seconds. [06:41:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [06:41:38] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 1.323 second response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [06:41:49] RECOVERY - prometheus151 APT on prometheus151 is OK: APT OK: 51 packages available for upgrade (0 critical updates). [06:44:03] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 14.38, 17.84, 19.94 [06:46:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [06:47:30] PROBLEM - ns2 Puppet on ns2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [06:49:10] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [06:54:00] PROBLEM - prometheus151 SSH on prometheus151 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:55:17] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.075 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [06:56:04] RECOVERY - prometheus151 SSH on prometheus151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u3 (protocol 2.0) [06:56:44] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 28.61, 20.44, 16.78 [06:56:47] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 28.87, 22.94, 20.69 [06:57:04] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 27.91, 19.92, 15.78 [06:57:12] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 28.91, 21.70, 17.47 [06:57:32] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 30.05, 21.45, 16.72 [06:58:14] PROBLEM - cp36 Varnish Backends on cp36 is CRITICAL: 2 backends are down. mw171 mw181 [06:58:30] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 28.88, 22.87, 17.61 [06:58:44] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 20.30, 20.80, 17.40 [06:59:32] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 23.09, 22.50, 17.74 [07:00:14] RECOVERY - cp36 Varnish Backends on cp36 is OK: All 19 backends are healthy [07:00:30] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 19.45, 22.14, 18.04 [07:00:44] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 16.76, 19.14, 17.19 [07:01:01] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 18.91, 20.26, 17.01 [07:01:12] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 14.90, 19.83, 17.87 [07:01:32] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 16.52, 20.17, 17.46 [07:02:26] PROBLEM - ns2 NTP time on ns2 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:02:30] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 14.10, 19.67, 17.67 [07:04:32] RECOVERY - ns2 NTP time on ns2 is OK: NTP OK: Offset -0.001920163631 secs [07:04:34] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 20.08, 23.82, 22.41 [07:13:56] RECOVERY - ns2 Puppet on ns2 is OK: OK: Puppet is currently enabled, last run 30 seconds ago with 0 failures [07:14:18] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 15.54, 18.08, 20.09 [07:19:38] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [07:21:24] PROBLEM - prometheus151 APT on prometheus151 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 60 seconds. [07:21:33] PROBLEM - prometheus151 Puppet on prometheus151 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 60 seconds. [07:23:01] PROBLEM - prometheus151 SSH on prometheus151 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:23:20] RECOVERY - prometheus151 APT on prometheus151 is OK: APT OK: 51 packages available for upgrade (0 critical updates). [07:23:34] RECOVERY - prometheus151 Puppet on prometheus151 is OK: OK: Puppet is currently enabled, last run 7 minutes ago with 0 failures [07:23:39] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.070 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [07:29:13] RECOVERY - prometheus151 SSH on prometheus151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u3 (protocol 2.0) [07:41:30] [Grafana] FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [07:41:57] PROBLEM - ns2 NTP time on ns2 is UNKNOWN: check_ntp_time: Invalid hostname/address - time.cloudflare.comUsage: check_ntp_time -H [-4|-6] [-w ] [-c ] [-v verbose] [-o