[00:11:49] RECOVERY Current Load is now: OK on bots-3 i-000000e5 output: OK - load average: 3.25, 3.87, 4.85 [00:14:09] PROBLEM Puppet freshness is now: CRITICAL on precise-test i-00000231 output: Puppet has not run in last 20 hours [00:18:39] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [00:20:39] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [00:20:39] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [00:20:39] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [00:46:29] PROBLEM Current Load is now: WARNING on bots-sql2 i-000000af output: WARNING - load average: 5.10, 5.24, 5.09 [00:48:39] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [00:50:39] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [00:50:39] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [00:50:39] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [00:51:29] RECOVERY Current Load is now: OK on bots-sql2 i-000000af output: OK - load average: 4.83, 4.86, 4.97 [01:18:39] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [01:20:39] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [01:20:39] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [01:20:39] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [01:48:39] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [01:50:39] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [01:50:39] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [01:50:39] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [02:18:39] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [02:20:39] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [02:20:39] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [02:20:39] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [02:48:42] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [02:50:42] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [02:50:42] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [02:50:42] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [02:53:20] 06/04/2012 - 02:53:20 - Updating keys for laner at /export/home/deployment-prep/laner [02:56:22] RECOVERY Puppet freshness is now: OK on pybal-precise i-00000289 output: puppet ran at Mon Jun 4 02:56:12 UTC 2012 [03:18:42] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [03:20:42] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [03:20:42] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [03:20:42] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [03:37:32] PROBLEM Free ram is now: WARNING on test-oneiric i-00000187 output: Warning: 16% free memory [03:40:42] PROBLEM Free ram is now: WARNING on nova-daas-1 i-000000e7 output: Warning: 12% free memory [03:43:02] @add #wm-bot [03:45:42] PROBLEM Free ram is now: WARNING on utils-abogott i-00000131 output: Warning: 15% free memory [03:48:42] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [03:50:42] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [03:50:42] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [03:50:42] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [03:52:12] PROBLEM Free ram is now: WARNING on orgcharts-dev i-0000018f output: Warning: 15% free memory [03:57:32] PROBLEM Free ram is now: CRITICAL on test-oneiric i-00000187 output: Critical: 3% free memory [04:00:42] PROBLEM Free ram is now: CRITICAL on nova-daas-1 i-000000e7 output: Critical: 5% free memory [04:00:42] PROBLEM Free ram is now: CRITICAL on utils-abogott i-00000131 output: Critical: 5% free memory [04:02:32] RECOVERY Free ram is now: OK on test-oneiric i-00000187 output: OK: 96% free memory [04:10:42] RECOVERY Free ram is now: OK on nova-daas-1 i-000000e7 output: OK: 94% free memory [04:10:42] RECOVERY Free ram is now: OK on utils-abogott i-00000131 output: OK: 97% free memory [04:12:12] PROBLEM Free ram is now: CRITICAL on orgcharts-dev i-0000018f output: Critical: 4% free memory [04:17:12] RECOVERY Free ram is now: OK on orgcharts-dev i-0000018f output: OK: 96% free memory [04:18:42] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [04:20:42] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [04:20:42] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [04:20:42] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [04:48:42] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [04:50:42] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [04:50:42] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [04:50:42] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [05:18:42] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [05:20:42] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [05:20:42] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [05:20:42] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [05:26:52] RECOVERY Disk Space is now: OK on ipv6test1 i-00000282 output: DISK OK [05:34:52] PROBLEM Disk Space is now: WARNING on ipv6test1 i-00000282 output: DISK WARNING - free space: / 71 MB (5% inode=58%): [05:42:12] PROBLEM Puppet freshness is now: CRITICAL on deployment-apache23 i-00000270 output: Puppet has not run in last 20 hours [05:48:42] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [05:50:42] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [05:50:42] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [05:50:42] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [06:18:42] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [06:20:42] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [06:20:42] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [06:20:42] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [06:35:52] PROBLEM Disk Space is now: CRITICAL on ganglia-test4 i-000002a2 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:35:52] PROBLEM Current Users is now: CRITICAL on ganglia-test4 i-000002a2 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:37:24] PROBLEM Total Processes is now: CRITICAL on maps-test2 i-00000253 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:37:49] PROBLEM Current Load is now: CRITICAL on maps-test2 i-00000253 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:37:49] PROBLEM Current Users is now: CRITICAL on maps-test2 i-00000253 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:37:49] PROBLEM Current Load is now: CRITICAL on ganglia-test4 i-000002a2 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:37:49] PROBLEM Total Processes is now: CRITICAL on ganglia-test4 i-000002a2 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:39:14] PROBLEM Disk Space is now: CRITICAL on maps-test2 i-00000253 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:39:14] PROBLEM Free ram is now: CRITICAL on maps-test2 i-00000253 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:39:14] PROBLEM Current Users is now: CRITICAL on e3 i-00000291 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:39:14] PROBLEM dpkg-check is now: CRITICAL on maps-test2 i-00000253 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:39:51] PROBLEM dpkg-check is now: CRITICAL on incubator-bot0 i-00000296 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:39:51] PROBLEM Current Users is now: CRITICAL on incubator-bot0 i-00000296 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:39:51] PROBLEM Disk Space is now: CRITICAL on incubator-bot0 i-00000296 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:40:01] PROBLEM Current Load is now: CRITICAL on incubator-bot0 i-00000296 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:40:01] PROBLEM Free ram is now: CRITICAL on incubator-bot0 i-00000296 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:40:01] PROBLEM Total Processes is now: CRITICAL on incubator-bot0 i-00000296 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:40:44] RECOVERY Disk Space is now: OK on ganglia-test4 i-000002a2 output: DISK OK [06:40:44] RECOVERY Current Users is now: OK on ganglia-test4 i-000002a2 output: USERS OK - 0 users currently logged in [06:41:09] PROBLEM Free ram is now: CRITICAL on e3 i-00000291 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:41:15] PROBLEM dpkg-check is now: CRITICAL on e3 i-00000291 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:41:35] RECOVERY Total Processes is now: OK on maps-test2 i-00000253 output: PROCS OK: 90 processes [06:42:05] PROBLEM Total Processes is now: CRITICAL on ve-nodejs i-00000245 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:42:35] RECOVERY Current Load is now: OK on maps-test2 i-00000253 output: OK - load average: 0.94, 4.48, 3.02 [06:42:36] RECOVERY Current Users is now: OK on maps-test2 i-00000253 output: USERS OK - 0 users currently logged in [06:42:36] RECOVERY Current Load is now: OK on ganglia-test4 i-000002a2 output: OK - load average: 2.31, 5.42, 3.31 [06:42:36] RECOVERY Total Processes is now: OK on ganglia-test4 i-000002a2 output: PROCS OK: 193 processes [06:43:18] PROBLEM Current Load is now: CRITICAL on deployment-jobrunner05 i-0000028c output: CHECK_NRPE: Socket timeout after 10 seconds. [06:43:46] RECOVERY Current Users is now: OK on e3 i-00000291 output: USERS OK - 0 users currently logged in [06:43:46] RECOVERY Disk Space is now: OK on maps-test2 i-00000253 output: DISK OK [06:43:46] RECOVERY Free ram is now: OK on maps-test2 i-00000253 output: OK: 94% free memory [06:43:46] RECOVERY dpkg-check is now: OK on maps-test2 i-00000253 output: All packages OK [06:44:39] PROBLEM Disk Space is now: CRITICAL on deployment-jobrunner05 i-0000028c output: CHECK_NRPE: Socket timeout after 10 seconds. [06:44:39] PROBLEM dpkg-check is now: CRITICAL on deployment-jobrunner05 i-0000028c output: CHECK_NRPE: Socket timeout after 10 seconds. [06:44:39] PROBLEM Current Users is now: CRITICAL on deployment-jobrunner05 i-0000028c output: CHECK_NRPE: Socket timeout after 10 seconds. [06:44:39] PROBLEM Free ram is now: CRITICAL on deployment-jobrunner05 i-0000028c output: CHECK_NRPE: Socket timeout after 10 seconds. [06:44:39] PROBLEM Current Load is now: WARNING on bots-sql2 i-000000af output: WARNING - load average: 7.40, 6.50, 5.50 [06:46:14] PROBLEM Total Processes is now: CRITICAL on zeromq1 i-000002b7 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:46:39] PROBLEM Disk Space is now: CRITICAL on pediapress-ocg2 i-00000234 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:46:55] PROBLEM Free ram is now: CRITICAL on zeromq1 i-000002b7 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:46:56] PROBLEM Free ram is now: CRITICAL on build-precise1 i-00000273 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:46:56] PROBLEM Current Load is now: CRITICAL on zeromq1 i-000002b7 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:49:14] PROBLEM dpkg-check is now: CRITICAL on zeromq1 i-000002b7 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:49:14] PROBLEM Disk Space is now: CRITICAL on zeromq1 i-000002b7 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:49:28] PROBLEM Total Processes is now: CRITICAL on e3 i-00000291 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:49:34] PROBLEM Disk Space is now: CRITICAL on e3 i-00000291 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:49:34] PROBLEM Disk Space is now: CRITICAL on fr-wiki-db-precise i-0000023e output: CHECK_NRPE: Socket timeout after 10 seconds. [06:49:34] PROBLEM Current Load is now: CRITICAL on fr-wiki-db-precise i-0000023e output: CHECK_NRPE: Socket timeout after 10 seconds. [06:49:34] PROBLEM Current Users is now: CRITICAL on fr-wiki-db-precise i-0000023e output: CHECK_NRPE: Socket timeout after 10 seconds. [06:49:34] PROBLEM Free ram is now: CRITICAL on fr-wiki-db-precise i-0000023e output: CHECK_NRPE: Socket timeout after 10 seconds. [06:49:34] PROBLEM dpkg-check is now: CRITICAL on fr-wiki-db-precise i-0000023e output: CHECK_NRPE: Socket timeout after 10 seconds. [06:49:41] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [06:50:18] PROBLEM Total Processes is now: CRITICAL on fr-wiki-db-precise i-0000023e output: CHECK_NRPE: Socket timeout after 10 seconds. [06:50:38] RECOVERY Total Processes is now: OK on zeromq1 i-000002b7 output: PROCS OK: 87 processes [06:50:58] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [06:50:58] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [06:50:58] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [06:50:58] RECOVERY Free ram is now: OK on e3 i-00000291 output: OK: 94% free memory [06:50:58] RECOVERY dpkg-check is now: OK on e3 i-00000291 output: All packages OK [06:51:19] PROBLEM Current Load is now: CRITICAL on ganglia-test4 i-000002a2 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:51:19] PROBLEM Total Processes is now: CRITICAL on ganglia-test4 i-000002a2 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:53:33] RECOVERY Total Processes is now: OK on e3 i-00000291 output: PROCS OK: 91 processes [06:53:41] RECOVERY Disk Space is now: OK on e3 i-00000291 output: DISK OK [06:53:54] PROBLEM dpkg-check is now: CRITICAL on ganglia-test4 i-000002a2 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:54:33] PROBLEM Current Load is now: CRITICAL on build-precise1 i-00000273 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:54:33] PROBLEM Current Users is now: CRITICAL on build-precise1 i-00000273 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:54:33] PROBLEM Current Users is now: CRITICAL on ganglia-test4 i-000002a2 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:54:33] PROBLEM Disk Space is now: CRITICAL on ganglia-test4 i-000002a2 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:54:33] PROBLEM Disk Space is now: CRITICAL on build-precise1 i-00000273 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:54:33] PROBLEM dpkg-check is now: CRITICAL on build-precise1 i-00000273 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:54:34] PROBLEM Total Processes is now: CRITICAL on build-precise1 i-00000273 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:54:40] PROBLEM Disk Space is now: CRITICAL on nagios 127.0.0.1 output: (Service Check Timed Out) [06:55:00] PROBLEM Free ram is now: CRITICAL on ganglia-test4 i-000002a2 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:56:10] PROBLEM Current Load is now: CRITICAL on mwreview-test6 i-000002b9 output: CHECK_NRPE: Socket timeout after 10 seconds. [06:56:10] PROBLEM Current Load is now: CRITICAL on deployment-bastion i-000002bd output: CHECK_NRPE: Socket timeout after 10 seconds. [06:56:10] PROBLEM Current Users is now: CRITICAL on deployment-bastion i-000002bd output: CHECK_NRPE: Socket timeout after 10 seconds. [06:56:52] RECOVERY Disk Space is now: OK on pediapress-ocg2 i-00000234 output: DISK OK [06:57:21] PROBLEM SSH is now: CRITICAL on mobile-wlm i-000002bc output: No route to host [06:57:33] PROBLEM Disk Space is now: CRITICAL on tw-next i-0000027e output: CHECK_NRPE: Socket timeout after 10 seconds. [06:57:33] PROBLEM Current Load is now: CRITICAL on tw-next i-0000027e output: CHECK_NRPE: Socket timeout after 10 seconds. [06:57:33] PROBLEM Current Users is now: CRITICAL on tw-next i-0000027e output: CHECK_NRPE: Socket timeout after 10 seconds. [06:57:33] PROBLEM Total Processes is now: CRITICAL on tw-next i-0000027e output: CHECK_NRPE: Socket timeout after 10 seconds. [06:57:41] PROBLEM Free ram is now: CRITICAL on tw-next i-0000027e output: CHECK_NRPE: Socket timeout after 10 seconds. [06:58:29] PROBLEM Disk Space is now: WARNING on nagios 127.0.0.1 output: DISK WARNING - free space: /home/dzahn 3589 MB (20% inode=77%): [06:58:29] RECOVERY dpkg-check is now: OK on ganglia-test4 i-000002a2 output: All packages OK [06:58:48] RECOVERY Disk Space is now: OK on fr-wiki-db-precise i-0000023e output: DISK OK [06:58:49] RECOVERY Current Load is now: OK on fr-wiki-db-precise i-0000023e output: OK - load average: 6.06, 6.74, 4.67 [06:58:49] RECOVERY Current Users is now: OK on fr-wiki-db-precise i-0000023e output: USERS OK - 0 users currently logged in [06:58:49] RECOVERY Free ram is now: OK on fr-wiki-db-precise i-0000023e output: OK: 83% free memory [06:58:49] RECOVERY dpkg-check is now: OK on fr-wiki-db-precise i-0000023e output: All packages OK [06:59:42] RECOVERY Free ram is now: OK on ganglia-test4 i-000002a2 output: OK: 79% free memory [06:59:52] RECOVERY Disk Space is now: OK on incubator-bot0 i-00000296 output: DISK OK [06:59:52] RECOVERY Current Load is now: OK on incubator-bot0 i-00000296 output: OK - load average: 2.41, 3.65, 4.07 [06:59:52] RECOVERY Current Users is now: OK on incubator-bot0 i-00000296 output: USERS OK - 0 users currently logged in [06:59:52] RECOVERY Free ram is now: OK on incubator-bot0 i-00000296 output: OK: 87% free memory [06:59:52] RECOVERY Total Processes is now: OK on incubator-bot0 i-00000296 output: PROCS OK: 102 processes [06:59:57] RECOVERY dpkg-check is now: OK on incubator-bot0 i-00000296 output: All packages OK [07:00:08] PROBLEM Total Processes is now: CRITICAL on maps-test2 i-00000253 output: CHECK_NRPE: Socket timeout after 10 seconds. [07:01:01] RECOVERY Current Load is now: OK on deployment-bastion i-000002bd output: OK - load average: 1.33, 3.11, 2.10 [07:01:01] RECOVERY Current Users is now: OK on deployment-bastion i-000002bd output: USERS OK - 0 users currently logged in [07:01:01] RECOVERY Current Load is now: OK on mwreview-test6 i-000002b9 output: OK - load average: 0.11, 1.27, 0.94 [07:01:14] PROBLEM dpkg-check is now: CRITICAL on en-wiki-db-precise i-0000023c output: CHECK_NRPE: Socket timeout after 10 seconds. [07:01:14] PROBLEM Current Load is now: CRITICAL on maps-test2 i-00000253 output: CHECK_NRPE: Socket timeout after 10 seconds. [07:01:14] PROBLEM Current Users is now: CRITICAL on maps-test2 i-00000253 output: CHECK_NRPE: Socket timeout after 10 seconds. [07:01:14] PROBLEM Disk Space is now: CRITICAL on en-wiki-db-precise i-0000023c output: CHECK_NRPE: Socket timeout after 10 seconds. [07:01:14] PROBLEM Current Users is now: CRITICAL on en-wiki-db-precise i-0000023c output: CHECK_NRPE: Socket timeout after 10 seconds. [07:02:16] PROBLEM Disk Space is now: CRITICAL on maps-test2 i-00000253 output: CHECK_NRPE: Socket timeout after 10 seconds. [07:02:16] PROBLEM Free ram is now: CRITICAL on maps-test2 i-00000253 output: CHECK_NRPE: Socket timeout after 10 seconds. [07:02:16] PROBLEM dpkg-check is now: CRITICAL on maps-test2 i-00000253 output: CHECK_NRPE: Socket timeout after 10 seconds. [07:02:27] PROBLEM dpkg-check is now: CRITICAL on pediapress-ocg2 i-00000234 output: CHECK_NRPE: Socket timeout after 10 seconds. [07:02:47] PROBLEM Current Users is now: CRITICAL on pediapress-ocg2 i-00000234 output: CHECK_NRPE: Socket timeout after 10 seconds. [07:02:47] PROBLEM Free ram is now: CRITICAL on pediapress-ocg2 i-00000234 output: CHECK_NRPE: Socket timeout after 10 seconds. [07:02:57] RECOVERY Total Processes is now: OK on ve-nodejs i-00000245 output: PROCS OK: 87 processes [07:03:35] PROBLEM Current Load is now: WARNING on deployment-jobrunner05 i-0000028c output: WARNING - load average: 6.89, 7.24, 7.46 [07:04:25] RECOVERY Current Users is now: OK on build-precise1 i-00000273 output: USERS OK - 0 users currently logged in [07:04:25] RECOVERY Current Load is now: OK on build-precise1 i-00000273 output: OK - load average: 5.62, 5.89, 4.65 [07:04:25] RECOVERY Disk Space is now: OK on build-precise1 i-00000273 output: DISK OK [07:04:25] RECOVERY Total Processes is now: OK on build-precise1 i-00000273 output: PROCS OK: 92 processes [07:04:31] RECOVERY dpkg-check is now: OK on build-precise1 i-00000273 output: All packages OK [07:04:45] RECOVERY Disk Space is now: OK on deployment-jobrunner05 i-0000028c output: DISK OK [07:04:45] RECOVERY Current Users is now: OK on deployment-jobrunner05 i-0000028c output: USERS OK - 0 users currently logged in [07:04:45] RECOVERY dpkg-check is now: OK on deployment-jobrunner05 i-0000028c output: All packages OK [07:04:45] RECOVERY Free ram is now: OK on deployment-jobrunner05 i-0000028c output: OK: 87% free memory [07:06:02] RECOVERY dpkg-check is now: OK on en-wiki-db-precise i-0000023c output: All packages OK [07:06:11] RECOVERY Disk Space is now: OK on en-wiki-db-precise i-0000023c output: DISK OK [07:06:11] RECOVERY Current Users is now: OK on en-wiki-db-precise i-0000023c output: USERS OK - 0 users currently logged in [07:06:31] RECOVERY Disk Space is now: OK on zeromq1 i-000002b7 output: DISK OK [07:06:31] RECOVERY dpkg-check is now: OK on zeromq1 i-000002b7 output: All packages OK [07:06:41] PROBLEM Current Load is now: WARNING on fr-wiki-db-precise i-0000023e output: WARNING - load average: 6.76, 6.40, 5.24 [07:06:51] RECOVERY SSH is now: OK on mobile-wlm i-000002bc output: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1 (protocol 2.0) [07:06:51] RECOVERY Free ram is now: OK on build-precise1 i-00000273 output: OK: 91% free memory [07:06:51] RECOVERY Disk Space is now: OK on tw-next i-0000027e output: DISK OK [07:06:51] RECOVERY Current Users is now: OK on tw-next i-0000027e output: USERS OK - 0 users currently logged in [07:06:51] RECOVERY Current Load is now: OK on tw-next i-0000027e output: OK - load average: 2.44, 4.49, 3.41 [07:06:52] RECOVERY Total Processes is now: OK on tw-next i-0000027e output: PROCS OK: 76 processes [07:06:58] RECOVERY Free ram is now: OK on tw-next i-0000027e output: OK: 84% free memory [07:07:21] RECOVERY dpkg-check is now: OK on pediapress-ocg2 i-00000234 output: All packages OK [07:09:51] RECOVERY Total Processes is now: OK on fr-wiki-db-precise i-0000023e output: PROCS OK: 80 processes [07:11:51] RECOVERY Current Load is now: OK on zeromq1 i-000002b7 output: OK - load average: 0.17, 2.43, 3.78 [07:11:51] RECOVERY Free ram is now: OK on zeromq1 i-000002b7 output: OK: 88% free memory [07:12:31] RECOVERY Current Users is now: OK on pediapress-ocg2 i-00000234 output: USERS OK - 0 users currently logged in [07:12:31] RECOVERY Free ram is now: OK on pediapress-ocg2 i-00000234 output: OK: 91% free memory [07:13:01] PROBLEM Current Load is now: WARNING on mobile-wlm i-000002bc output: WARNING - load average: 0.03, 2.79, 5.25 [07:18:01] RECOVERY Current Load is now: OK on mobile-wlm i-000002bc output: OK - load average: 0.00, 1.02, 3.80 [07:20:51] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [07:21:01] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [07:21:01] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [07:21:01] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [07:23:31] RECOVERY Current Load is now: OK on deployment-jobrunner05 i-0000028c output: OK - load average: 3.26, 3.61, 4.79 [07:24:51] RECOVERY Current Load is now: OK on bots-sql2 i-000000af output: OK - load average: 2.38, 3.53, 4.82 [07:51:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [07:51:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [07:51:05] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [07:51:05] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [08:06:54] PROBLEM Current Load is now: WARNING on bots-3 i-000000e5 output: WARNING - load average: 4.66, 5.10, 5.03 [08:11:54] RECOVERY Current Load is now: OK on bots-3 i-000000e5 output: OK - load average: 3.77, 4.36, 4.74 [08:21:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [08:21:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [08:21:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [08:21:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [08:32:54] RECOVERY Disk Space is now: OK on deployment-transcoding i-00000105 output: DISK OK [08:40:54] PROBLEM Disk Space is now: WARNING on deployment-transcoding i-00000105 output: DISK WARNING - free space: / 76 MB (5% inode=52%): [08:51:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [08:51:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [08:51:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [08:51:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [09:21:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [09:21:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [09:21:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [09:21:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [09:42:54] PROBLEM Current Load is now: WARNING on bots-sql2 i-000000af output: WARNING - load average: 6.07, 5.99, 5.41 [09:47:14] PROBLEM Puppet freshness is now: CRITICAL on blamemaps-m1small i-000002a1 output: Puppet has not run in last 20 hours [09:51:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [09:51:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [09:51:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [09:51:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [10:15:14] PROBLEM Puppet freshness is now: CRITICAL on precise-test i-00000231 output: Puppet has not run in last 20 hours [10:21:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [10:21:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [10:21:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [10:21:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [10:27:54] RECOVERY Current Load is now: OK on bots-sql2 i-000000af output: OK - load average: 3.34, 4.19, 4.97 [10:40:54] PROBLEM Current Load is now: WARNING on bots-sql2 i-000000af output: WARNING - load average: 6.50, 5.74, 5.24 [10:51:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [10:51:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [10:51:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [10:51:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [11:07:54] PROBLEM Free ram is now: WARNING on bots-3 i-000000e5 output: Warning: 6% free memory [11:17:54] PROBLEM Free ram is now: CRITICAL on bots-3 i-000000e5 output: Critical: 2% free memory [11:21:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [11:21:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [11:21:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [11:21:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [11:22:54] PROBLEM Free ram is now: WARNING on bots-3 i-000000e5 output: Warning: 6% free memory [11:30:54] RECOVERY Current Load is now: OK on bots-sql2 i-000000af output: OK - load average: 4.07, 4.37, 4.82 [11:51:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [11:51:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [11:51:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [11:51:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [12:02:34] hey Ryan_Lane [12:02:38] howdy [12:02:54] are you still in Germany? :D [12:02:59] yep [12:03:01] I never saw you so early here [12:03:02] till the 29th [12:03:03] ah right [12:03:07] wow [12:03:09] why? [12:03:14] why not? :) [12:03:25] you have vacation or is there some huge wikimedia thing? [12:03:38] just working from here [12:03:41] for the hell of it [12:03:41] ah ok [12:03:52] I though you are like merging toolserver and labs :D [12:03:56] PROBLEM Current Load is now: WARNING on bots-sql2 i-000000af output: WARNING - load average: 5.80, 5.75, 5.33 [12:04:04] or something like that [12:05:01] well, labs will be an additional environment to toolserver [12:05:12] and likely toolserver users will eventually switch [12:11:36] btw Ryan_Lane is there any update on ipv6 on production? [12:11:51] ask in -operations [12:11:55] Erik sent a message that you are going to enable it soon, but I have no idea if anything happened or not [12:12:02] k [12:12:09] I haven't been working on it much [12:21:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [12:21:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [12:21:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [12:21:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [12:21:36] Ryan_Lane: why are these instances down? [12:21:40] did you suspend them? [12:21:50] because I don't think it's possible to shutdown -h on labs [12:21:53] no. ganglia is down because it has bad mount options [12:22:00] aha [12:22:16] the others likely have the same ones [12:22:19] so there boxes are running but somewhere in a boot process waiting for someone [12:22:30] these [12:22:49] I thought they are "down" like powered off [12:23:51] well, someone needs to mount the disks on the virtual host and fix them manually [12:23:59] I think sara will do that next time she is on [12:24:40] ok [12:32:59] is bots also down then Ryan_Lane? [12:33:10] my bot hasn't edited since 1 June [12:33:54] RECOVERY Current Load is now: OK on bots-sql2 i-000000af output: OK - load average: 4.64, 4.62, 4.97 [12:34:43] it should be up [12:35:14] hmm that's odd I can't SSH [12:35:21] no supported authentication methods available.. [12:35:34] RECOVERY Disk Space is now: OK on ipv6test1 i-00000282 output: DISK OK [12:43:34] PROBLEM Disk Space is now: WARNING on ipv6test1 i-00000282 output: DISK WARNING - free space: / 70 MB (5% inode=57%): [12:51:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [12:51:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [12:51:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [12:51:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [12:55:44] petan|wk: http://labs.wikimedia.beta.wmflabs.org/ should redirect to deployment.wikimedia... [12:56:10] Thehelpfulone: I know [12:56:15] hashar did something with that [12:56:25] Thehelpfulone: your bot is down because cluster was down [12:57:01] ok, how can I get it to restart by itself, or is it a manual task? [12:58:52] Ryan_Lane: can you add labsconsole into the interwiki map for wikitech.wikimedia.org or does that go off the general one on Meta? [13:00:39] umm [13:00:41] I dunno [13:00:57] Thehelpfulone: just ssh then start it [13:01:04] why does it need to be in the interwikimap? [13:03:07] it's in the normal interwiki map already [13:03:19] I just can't [[labsconsole:foo|]] it on wikitech [13:03:26] wikitech is being merged, though [13:03:48] labsconsole and wikitech will eventually just be wikitech [13:04:12] can you allow importing from wikitech to labsconsole? [13:04:14] Thehelpfulone: you already have access there? [13:04:18] yes [13:04:21] ah [13:04:34] I created a couple of templates that you can use to tag, derived from the ones on meta [13:05:39] Ryan_Lane: http://wikitech.wikimedia.org/edit/Wikitech:Project_to_transfer_content_to_Labs?redlink=1 can you start on something like that if you have an idea from an ops point of view as to what will be moved and what will stay (use http://meta.wikimedia.org/wiki/Meta:MetaProject_to_transfer_content_to_MediaWiki.org for some inspiration if you like) [13:07:42] * Ryan_Lane nods [13:07:52] may take a little bit for us to get to that [13:08:00] wikitech migration wasn't a really high priority [13:15:54] PROBLEM Current Load is now: WARNING on bots-3 i-000000e5 output: WARNING - load average: 5.93, 6.14, 5.32 [13:21:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [13:21:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [13:21:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [13:21:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [13:51:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [13:51:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [13:51:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [13:51:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [14:10:54] RECOVERY Current Load is now: OK on bots-3 i-000000e5 output: OK - load average: 4.18, 4.57, 4.93 [14:21:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [14:21:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [14:21:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [14:21:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [14:44:45] chrismcmahon: hey [14:44:49] omtsh is that guy [14:44:58] he is now in this chan [14:45:05] I've been in this chan for ages [14:45:06] Why? [14:45:30] chrismcmahon wanted to ask regarding some block you made on some guy from QA... [14:45:36] huh? [14:46:04] Just unblock it if it's a legit use [14:46:06] * user [14:46:10] I've only blocked spammers [14:46:11] hi omtsh, there is a candidate for the QA Engineer position at WMF, Alister Scott, who was experimenting with some browser automation on beta labs cluster, you blocked his account for editing with nonsense: http://en.wikipedia.beta.wmflabs.org/wiki/Special:RecentChanges [14:46:19] Ahhh, yes [14:46:43] btw chrismcmahon if you tell me your account name I give you some global rights, ok? [14:46:51] unblocked [14:46:53] I've spoken with him, he understands the situation, I'd like to get that account unblocked. [14:47:04] Yeah, I just unblocked it, my aplogies [14:47:07] omtsh: I'm Cmcmahon everywhere [14:47:09] * apologies [14:47:13] chrismcmahon: in fact, he should feel free to do any similar tests, that's what the site is for [14:47:15] omtsh: not a problem, thanks [14:48:01] petan|wk: yes, that's something we'll have to figure out, distinguishing legitimate users who might be posting nonsense for a purpose [14:49:02] PROBLEM Current Load is now: WARNING on bots-3 i-000000e5 output: WARNING - load average: 6.07, 5.92, 5.38 [14:49:06] chrismcmahon: ok, I gave you steward and some other bits, so you should be able to unblock anyone in future [14:49:17] thanks petan|wk [14:49:19] yw [14:50:36] petan|wk: what's the search for all projects? [14:50:43] on labs I mean, how do I see the full ist [14:51:04] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [14:51:04] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [14:51:04] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [14:51:04] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [14:51:28] Thehelpfulone: there was a link in bz [14:52:40] https://bugzilla.wikimedia.org/show_bug.cgi?id=37298 [14:54:20] !projectlist is https://labsconsole.wikimedia.org/w/index.php?title=Special%3AAsk&q=[[Resource+Type%3A%3Aproject]]&po=%3F%0D%0A%3FMember%0D%0A%3FDescription&sort_num=&order_num=ASC&eq=yes&p[format]=broadtable&p[limit]=500&p[sort]=&p[order]=&p[offset]=&p[headers]=show&p[mainlabel]=&p[link]=all&p[searchlabel]=%E2%80%A6+further+results&p[intro]=&p[outro]=&p[default]=&p[class]=sortable+wikitable+smwtable&eq=yes [14:54:20] Key was added [14:54:26] :) [14:54:30] short url would be nice [14:56:09] ok [14:56:19] !pl is https://labsconsole.wikimedia.org/w/index.php?title=Special:Ask&q=[[Resource+Type%3A%3Aproject]]&p=format%3Dbroadtable%2Fheaders%3Dshow%2Flink%3Dall%2Fsearchlabel%3D%E2%80%A6-20further-20results%2Fclass%3Dsortable-20wikitable-20smwtable&po=%3FMember%0A%3FDescription%0A&limit=500&eq=no [14:56:19] Key was added [14:56:25] !projects alias pl [14:56:25] Created new alias for this key [14:56:29] !projects [14:56:29] https://labsconsole.wikimedia.org/wiki/Special:Ask/-5B-5BResource-20Type::project-5D-5D/-3F/-3FMember/-3FDescription/mainlabel%3D-2D [14:56:40] aha [14:56:43] we already have it [14:56:50] !projects unalias [14:56:50] Alias removed! [14:56:56] !projectlist del [14:56:56] Successfully removed projectlist [14:57:03] !projectlist alias projects [14:57:03] Created new alias for this key [14:57:37] !projects del [14:57:38] Successfully removed projects [14:57:41] !pl [14:57:41] https://labsconsole.wikimedia.org/w/index.php?title=Special:Ask&q=[[Resource+Type%3A%3Aproject]]&p=format%3Dbroadtable%2Fheaders%3Dshow%2Flink%3Dall%2Fsearchlabel%3D%E2%80%A6-20further-20results%2Fclass%3Dsortable-20wikitable-20smwtable&po=%3FMember%0A%3FDescription%0A&limit=500&eq=no [14:57:46] !projects is https://labsconsole.wikimedia.org/w/index.php?title=Special:Ask&q=[[Resource+Type%3A%3Aproject]]&p=format%3Dbroadtable%2Fheaders%3Dshow%2Flink%3Dall%2Fsearchlabel%3D%E2%80%A6-20further-20results%2Fclass%3Dsortable-20wikitable-20smwtable&po=%3FMember%0A%3FDescription%0A&limit=500&eq=no [14:57:47] Key was added [14:57:54] here we go [14:58:24] fixed it? [14:58:30] ah [14:58:31] Ryan_Lane: fixed what [14:58:36] not from the main page :( [14:58:38] SMW is broken [14:58:42] it's not [14:58:57] it require parameter which it didn't [14:59:25] from the main page? [14:59:27] which one? [14:59:34] limit [14:59:40] that's needed [14:59:42] on the main page [14:59:47] but it wasn't there [14:59:57] it totally is on the main page [14:59:59] limit=0 [15:00:22] I don't think so, when you click edit query it doesn't show 0 [15:00:27] I know [15:00:28] maybe they changed name of parameter [15:00:30] but it's needed [15:00:31] nope [15:00:40] it was just fixed for me by JeroenDeDauw [15:03:12] aha [15:06:52] lemme try to apply that change [15:11:54] btw Ryan_Lane how do you like the Czech bear?:P [15:12:02] budweiser? ;) [15:12:03] * beer [15:12:12] Pilsner, we had it on hackaton [15:12:23] the Czech one is great [15:12:28] that fridge was full of that [15:12:34] yeah. it wasn't bad [15:12:50] the american budweiser is like sex in a canoe [15:12:51] heh, it's considered one of best we have :D [15:12:56] haha [15:13:08] (fucking close to water) [15:13:22] I heard that American version of czech beer is something else, just packed in similar bottles :D [15:13:44] maybe not even that [15:13:56] czechvar or something like that [15:14:00] heh [15:14:15] Czech beers are pretty good though [15:14:25] there isn't enough variety of beer in this area, though [15:14:47] hm... I guess there must be many pubs in Berlin [15:14:54] are you still in Berlin Ryan_Lane? [15:15:06] surprisingly, he is [15:16:19] there was still a lot of food at hackaton we didn't eat, so Ryan was assigned to stay there and finish it all [15:16:26] hehe [15:16:46] + fridge with 200 bottles of beer [15:21:09] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [15:21:09] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [15:21:09] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [15:21:09] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [15:31:23] \o/ searches from main page are fixed [15:43:09] PROBLEM Puppet freshness is now: CRITICAL on deployment-apache23 i-00000270 output: Puppet has not run in last 20 hours [15:51:09] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [15:51:09] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [15:51:09] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [15:51:09] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [15:55:24] Good morning! [15:55:47] I'm getting funny errors on a labs instance that (apparently) had crashed and restarted, all having to do with locale settings [15:56:32] It appears LC_ALL isn't set [15:57:07] Consequently, mongod dies with the error " what(): locale::facet::_S_create_c_locale name not valid [15:57:07] " [15:59:00] RECOVERY Current Load is now: OK on bots-3 i-000000e5 output: OK - load average: 4.90, 4.69, 4.94 [16:08:15] Turns out the only valid locale on this labs instance is currently "C" [16:21:36] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [16:21:36] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [16:21:36] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [16:21:36] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [16:22:30] Hm, now it's giving me the "transport endpoint not connected" error again [16:37:39] RECOVERY host: ganglia-test2 is UP address: i-00000250 PING OK - Packet loss = 0%, RTA = 8.89 ms [16:38:59] RECOVERY SSH is now: OK on ganglia-test2 i-00000250 output: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [16:38:59] RECOVERY Free ram is now: OK on ganglia-test2 i-00000250 output: OK: 89% free memory [16:41:09] RECOVERY Current Users is now: OK on ganglia-test2 i-00000250 output: USERS OK - 1 users currently logged in [16:41:09] RECOVERY Disk Space is now: OK on ganglia-test2 i-00000250 output: DISK OK [16:41:09] RECOVERY Current Load is now: OK on ganglia-test2 i-00000250 output: OK - load average: 0.43, 0.72, 0.33 [16:41:09] RECOVERY dpkg-check is now: OK on ganglia-test2 i-00000250 output: All packages OK [16:41:09] RECOVERY Total Processes is now: OK on ganglia-test2 i-00000250 output: PROCS OK: 198 processes [16:51:09] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [16:51:09] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [16:51:09] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [16:56:59] PROBLEM Current Load is now: WARNING on bots-3 i-000000e5 output: WARNING - load average: 6.27, 5.74, 5.18 [16:57:59] PROBLEM Free ram is now: CRITICAL on bots-3 i-000000e5 output: Critical: 5% free memory [17:02:59] PROBLEM Free ram is now: WARNING on bots-3 i-000000e5 output: Warning: 6% free memory [17:11:09] PROBLEM host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [17:16:19] RECOVERY Puppet freshness is now: OK on precise-test i-00000231 output: puppet ran at Mon Jun 4 17:16:03 UTC 2012 [17:20:59] PROBLEM Free ram is now: WARNING on orgcharts-dev i-0000018f output: Warning: 17% free memory [17:21:09] PROBLEM host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [17:21:09] PROBLEM host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [17:21:09] PROBLEM host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [17:26:59] RECOVERY Current Load is now: OK on bots-3 i-000000e5 output: OK - load average: 4.31, 4.78, 5.00 [17:37:59] PROBLEM Free ram is now: CRITICAL on bots-3 i-000000e5 output: Critical: 5% free memory [17:40:03] ACKNOWLEDGEMENT host: aggregator-test is DOWN address: i-0000024d CRITICAL - Host Unreachable (i-0000024d) [17:40:03] ACKNOWLEDGEMENT host: aggregator-test3 is DOWN address: i-00000293 CRITICAL - Host Unreachable (i-00000293) [17:40:18] ACKNOWLEDGEMENT host: aggregator1 is DOWN address: i-0000010c CRITICAL - Host Unreachable (i-0000010c) [17:40:48] ACKNOWLEDGEMENT host: ganglia-test2 is DOWN address: i-00000250 CRITICAL - Host Unreachable (i-00000250) [17:40:59] PROBLEM Free ram is now: CRITICAL on orgcharts-dev i-0000018f output: Critical: 4% free memory [17:45:59] RECOVERY Free ram is now: OK on orgcharts-dev i-0000018f output: OK: 96% free memory [17:51:03] ACKNOWLEDGEMENT Puppet freshness is now: CRITICAL on blamemaps-m1small i-000002a1 output: Puppet has not run in last 20 hours [17:51:18] ACKNOWLEDGEMENT Puppet freshness is now: CRITICAL on deployment-apache23 i-00000270 output: Puppet has not run in last 20 hours [17:57:39] RECOVERY host: ganglia-test2 is UP address: i-00000250 PING OK - Packet loss = 0%, RTA = 0.67 ms [18:23:56] PROBLEM Current Load is now: CRITICAL on aggregator-test1 i-000002bf output: Connection refused by host [18:24:36] PROBLEM Current Users is now: CRITICAL on aggregator-test1 i-000002bf output: Connection refused by host [18:25:16] PROBLEM Disk Space is now: CRITICAL on aggregator-test1 i-000002bf output: Connection refused by host [18:25:56] PROBLEM Free ram is now: CRITICAL on aggregator-test1 i-000002bf output: Connection refused by host [18:27:06] PROBLEM Total Processes is now: CRITICAL on aggregator-test1 i-000002bf output: Connection refused by host [18:27:46] PROBLEM dpkg-check is now: CRITICAL on aggregator-test1 i-000002bf output: CHECK_NRPE: Error - Could not complete SSL handshake. [18:30:42] Folks, I could use a bit of help, getting this error when starting mongodb: [18:30:51] Transport endpoint is not connected: "/data/db/", terminating [18:33:44] PROBLEM Current Load is now: CRITICAL on aggregator2 i-000002c0 output: CHECK_NRPE: Error - Could not complete SSL handshake. [18:33:48] I'm also getting locale errors, but I've managed to circumvent them for now [18:34:24] PROBLEM Current Users is now: CRITICAL on aggregator2 i-000002c0 output: CHECK_NRPE: Error - Could not complete SSL handshake. [18:35:04] PROBLEM Disk Space is now: CRITICAL on aggregator2 i-000002c0 output: CHECK_NRPE: Error - Could not complete SSL handshake. [18:35:44] PROBLEM Free ram is now: CRITICAL on aggregator2 i-000002c0 output: CHECK_NRPE: Error - Could not complete SSL handshake. [18:36:04] PROBLEM host: ganglia-test3 is DOWN address: i-0000025b check_ping: Invalid hostname/address - i-0000025b [18:36:54] PROBLEM Total Processes is now: CRITICAL on aggregator2 i-000002c0 output: CHECK_NRPE: Error - Could not complete SSL handshake. [18:37:34] PROBLEM dpkg-check is now: CRITICAL on aggregator2 i-000002c0 output: CHECK_NRPE: Error - Could not complete SSL handshake. [18:46:15] RECOVERY host: aggregator1 is UP address: i-0000010c PING OK - Packet loss = 0%, RTA = 0.49 ms [19:01:16] Ryan_Lane: just reminding you that wikitech [19:15:29] New patchset: Sara; "Fix bug that was causing gmetad servers to hang on boot (on lucid): Mount and populate /mnt/ganglia_tmp/rrds.pmtpa when gmetad starts." [operations/puppet] (test) - https://gerrit.wikimedia.org/r/10135 [19:15:45] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/10135 [19:20:37] New review: Sara; "(no comment)" [operations/puppet] (test); V: 0 C: 2; - https://gerrit.wikimedia.org/r/10135 [19:20:39] Change merged: Sara; [operations/puppet] (test) - https://gerrit.wikimedia.org/r/10135 [19:24:22] petan|wk: right [19:24:44] user/email? [19:27:37] ah. was in -ops [19:27:42] petan|wk: done [19:32:35] RECOVERY dpkg-check is now: OK on aggregator2 i-000002c0 output: All packages OK [19:33:45] RECOVERY Current Load is now: OK on aggregator2 i-000002c0 output: OK - load average: 0.44, 0.28, 0.15 [19:34:04] Ryan_Lane: while logging in to instance I have: Permission denied (publickey). [19:34:13] which instance? [19:34:17] i-000002a1.pmtpa.wmflabs [19:34:25] RECOVERY Current Users is now: OK on aggregator2 i-000002c0 output: USERS OK - 1 users currently logged in [19:34:53] I was able to log in before to another instance, but after I deleted old one and created new one, I cannot do it anymore [19:35:05] RECOVERY Disk Space is now: OK on aggregator2 i-000002c0 output: DISK OK [19:35:35] RECOVERY Free ram is now: OK on aggregator2 i-000002c0 output: OK: 95% free memory [19:35:55] RECOVERY Free ram is now: OK on aggregator-test1 i-000002bf output: OK: 95% free memory [19:36:01] what's the instance name, rather than the id? [19:36:26] blamemaps-m1small [19:36:55] RECOVERY Total Processes is now: OK on aggregator2 i-000002c0 output: PROCS OK: 103 processes [19:36:58] ok [19:37:01] gimme a sec [19:37:06] RECOVERY Total Processes is now: OK on aggregator-test1 i-000002bf output: PROCS OK: 104 processes [19:37:09] sure [19:38:55] RECOVERY Current Load is now: OK on aggregator-test1 i-000002bf output: OK - load average: 1.70, 0.78, 0.30 [19:39:05] PROBLEM host: ganglia-test6 is DOWN address: i-000002af check_ping: Invalid hostname/address - i-000002af [19:39:19] hm [19:39:33] were you ever able to log into this instance? [19:39:35] RECOVERY Current Users is now: OK on aggregator-test1 i-000002bf output: USERS OK - 1 users currently logged in [19:39:49] not to this, to previous one which I delete [19:39:50] d [19:39:58] it seems it didn't build properly [19:40:04] please delete/recreate [19:40:09] ok [19:40:16] RECOVERY Disk Space is now: OK on aggregator-test1 i-000002bf output: DISK OK [19:40:56] PROBLEM Current Load is now: WARNING on bots-3 i-000000e5 output: WARNING - load average: 6.04, 5.17, 5.11 [19:42:56] PROBLEM dpkg-check is now: UNKNOWN on blamemaps-m1small i-000002a1 output: Invalid host name i-000002a1 [19:44:06] ganglia.wmflabs.org is back! [19:45:56] RECOVERY Current Load is now: OK on bots-3 i-000000e5 output: OK - load average: 4.13, 4.74, 4.96 [19:46:30] I created another instance i-000002c1 with name blamemaps-small [19:46:38] have the same problem [19:47:46] PROBLEM host: blamemaps-m1small is DOWN address: i-000002a1 check_ping: Invalid hostname/address - i-000002a1 [19:47:46] RECOVERY dpkg-check is now: OK on aggregator-test1 i-000002bf output: All packages OK [19:53:45] PROBLEM dpkg-check is now: CRITICAL on blamemaps-small i-000002c1 output: Connection refused by host [19:54:25] PROBLEM Current Load is now: CRITICAL on blamemaps-small i-000002c1 output: Connection refused by host [19:55:05] PROBLEM Current Users is now: CRITICAL on blamemaps-small i-000002c1 output: Connection refused by host [19:55:45] PROBLEM Disk Space is now: CRITICAL on blamemaps-small i-000002c1 output: Connection refused by host [19:56:20] PROBLEM Free ram is now: CRITICAL on blamemaps-small i-000002c1 output: Connection refused by host [19:57:05] PROBLEM HTTP is now: CRITICAL on blamemaps-small i-000002c1 output: CRITICAL - Socket timeout after 10 seconds [19:58:15] PROBLEM Total Processes is now: CRITICAL on blamemaps-small i-000002c1 output: Connection refused by host [20:23:35] RECOVERY Disk Space is now: OK on nagios 127.0.0.1 output: DISK OK [20:34:43] "Transport endpoint is not connected: "/data/db/", terminating" <- after running mongodb. What's happening? [20:40:21] Ryan_Lane: I have the same problem with new instance, name is blamemaps-small, instance id is i-000002c1 [20:41:13] you sure that's the instance'd id? [20:42:04] bah [20:42:06] ignore me [20:42:08] !log bots wmib: updating wm-bot [20:42:12] Logged the message, Master [20:45:02] you need to wait till it's totally done [20:45:06] the instance build is failing because you are choosing classes that aren't compatible [20:45:07] oh [20:45:07] and doing so before puppet runs [20:45:12] which classes? [20:46:47] don't add any until the instance it totally fully built and you can login [20:47:02] you added some webserver classes [20:47:15] okay [20:50:31] !log bots wmib: inserting wikimania to bot config and reloading it [20:50:33] Logged the message, Master [20:55:44] PROBLEM host: blamemaps-dontblame is DOWN address: i-000002c2 CRITICAL - Host Unreachable (i-000002c2) [21:02:05] New review: Demon; "Adding things like this that will never be merged to production just makes the difference between te..." [operations/puppet] (test); V: 0 C: -2; - https://gerrit.wikimedia.org/r/6467 [21:03:44] PROBLEM Current Load is now: CRITICAL on blamemaps-s1 i-000002c3 output: CHECK_NRPE: Error - Could not complete SSL handshake. [21:04:24] PROBLEM Current Users is now: CRITICAL on blamemaps-s1 i-000002c3 output: CHECK_NRPE: Error - Could not complete SSL handshake. [21:05:04] PROBLEM Disk Space is now: CRITICAL on blamemaps-s1 i-000002c3 output: CHECK_NRPE: Error - Could not complete SSL handshake. [21:05:44] PROBLEM Free ram is now: CRITICAL on blamemaps-s1 i-000002c3 output: CHECK_NRPE: Error - Could not complete SSL handshake. [21:06:54] Ryan_Lane: Thank you, now I can log in. While clicking on create instance I had page "instance is created". But it wasn't. I went to console output and waited till message 'puppet finish run'. Thank you again! [21:06:54] PROBLEM Total Processes is now: CRITICAL on blamemaps-s1 i-000002c3 output: CHECK_NRPE: Error - Could not complete SSL handshake. [21:07:34] PROBLEM dpkg-check is now: CRITICAL on blamemaps-s1 i-000002c3 output: CHECK_NRPE: Error - Could not complete SSL handshake. [21:08:16] yeah, it's a misleading thing it tells people [21:08:32] the instance is created, but that doesn't mean it's available for use, yet :( [21:08:57] yes, I understand it now ) [21:57:53] !log deployment-prep Reenabled the CheckUser extension on beta labs so we can actually use the checkuser audit function ;-) [21:57:54] Logged the message, Master [22:04:46] !log deployment-prep Made myself a steward using a database query on `labswiki` : insert into user_groups VALUES (183,'steward'); [22:04:48] Logged the message, Master [22:05:16] New patchset: Sara; "Templatize ganglia gmetad.conf." [operations/puppet] (test) - https://gerrit.wikimedia.org/r/10190 [22:05:33] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/10190 [22:08:24] New review: Sara; "(no comment)" [operations/puppet] (test); V: 0 C: 2; - https://gerrit.wikimedia.org/r/10190 [22:08:27] Change merged: Sara; [operations/puppet] (test) - https://gerrit.wikimedia.org/r/10190 [22:08:59] <^demon> hashar: There aren't elections? [22:09:31] for stewards ? [22:09:36] in production for sure [22:09:47] <^demon> We should have elections in labs ;-) [22:09:54] <^demon> And RfA !votes [22:09:54] for labs, since I am not aware of any process. I am just electing myself as a gentle dictator [22:10:33] <^demon> Are you subscribed to repo-discuss? [22:11:37] hashar: me too, petan|wk gave me some privs earlier today [22:12:36] ^demon: nop to repo-discuss [22:12:39] <^demon> Hehe, https://groups.google.com/forum/#!msg/repo-discuss/vV-WQlXC6a0/qWLtlBUJgX0J - doing ctrl+c during a `git clone` has the potential to leave gerrit in a broken state :p [22:12:47] <^demon> Silly gerrit. [22:12:49] ^demon: have you managed to fix up the operations/debs/testswarm repo ? :-( [22:13:01] ^demon: haha that is a good one [22:13:16] chrismcmahon: yup I think we can just give each other those rights [22:13:37] chrismcmahon: there is a potential issue with checkuser though cause it can let people see other people IP address and other private date [22:13:39] hashar: where are you anyway? isn't it really late? [22:13:46] midnight [22:13:51] New patchset: Sara; "Fix path to ganglia gmetad.conf template." [operations/puppet] (test) - https://gerrit.wikimedia.org/r/10191 [22:13:56] but I had to fix something for Philippe Beaudette so ... [22:14:04] also I am on vacation for the next 4 days [22:14:08] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/10191 [22:14:09] or 3 days [22:14:24] yeah 3 days break that is it [22:14:43] New review: Sara; "(no comment)" [operations/puppet] (test); V: 0 C: 2; - https://gerrit.wikimedia.org/r/10191 [22:14:44] chrismcmahon: I have met petan in real life during the hackaton :-]]] [22:14:46] Change merged: Sara; [operations/puppet] (test) - https://gerrit.wikimedia.org/r/10191 [22:14:58] hashar: so yeah, we're avoiding using checkuser/oversight on labs because of the privacy policy and how it's not too clear for labs [22:15:00] nice! [22:15:09] chrismcmahon: faidon and the ops are doing good progress on migrating packages for the TMH [22:15:15] but if Philippe told you to do something with it, then by all means :) [22:15:47] <^demon> hashar: Ran set-project-parent for you. [22:15:54] hashar: too bad you're out for a few days, it sounds like the beta cluster is nearly getting useful now, there are some things I want to ask you about [22:16:06] hashar: but they can wait :-) enjoy your time off! [22:16:34] Thehelpfulone: I somehow asked internally that we start writing a basic policy. My opinion is something like: this is beta, your data (password, name submitted, preferences, email) are going to be leaked to the public. [22:16:51] ^demon: yup I forgot that switch unfortunately :-( [22:17:08] password can be leaked publically? [22:17:12] I thought it was hashed or something [22:17:15] <^demon> hashar: No worries, easily fixed. I need to write a guide on how to make projects properly :) [22:18:14] Thehelpfulone: yeah everything can be leaked publicly. [22:18:21] Thehelpfulone: so you definitely want to use another password on labs [22:18:26] Thehelpfulone: (I am serious) [22:18:43] yeah I am using another password, but aren't passwords stored securely? [22:18:55] somehow yes ;-] [22:19:23] it is something like md5( $username . '-' . md5( $password ) ); [22:19:30] so kind of encrypted [22:19:56] but there are other possibilities to get your plain text password [22:20:12] <^demon> Plus the salt. [22:20:32] hashar: I presume IPs *won't* be leaked publically? [22:20:53] New patchset: Sara; "Attempt to set gmetad.conf data_sources from template." [operations/puppet] (test) - https://gerrit.wikimedia.org/r/10192 [22:20:57] so both the IPs of people who are looking at labs and those of users (obviously CUs shouldn't be leaking data..) [22:21:09] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/10192 [22:21:38] New review: Sara; "(no comment)" [operations/puppet] (test); V: 0 C: 2; - https://gerrit.wikimedia.org/r/10192 [22:21:41] Change merged: Sara; [operations/puppet] (test) - https://gerrit.wikimedia.org/r/10192 [22:22:37] ^demon: any idea how I can do an initial import of a git repo ? I can't grant my self git push force on operations/debs/testswarm :D [22:23:00] (or I could just rebase -i and add Change-Id hehe [22:23:25] <^demon> How about you do that so I can avoid having to tweak permissions for you :p [22:25:47] New patchset: Sara; "Fix has_variable? in ganglia gmetad template." [operations/puppet] (test) - https://gerrit.wikimedia.org/r/10193 [22:26:02] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/10193 [22:26:31] New review: Sara; "(no comment)" [operations/puppet] (test); V: 0 C: 2; - https://gerrit.wikimedia.org/r/10193 [22:26:35] Change merged: Sara; [operations/puppet] (test) - https://gerrit.wikimedia.org/r/10193 [22:28:27] ^demon: well I cannot figure out how to edit the first commit :-( [22:28:49] <^demon> Do you have more than one commit? [22:28:56] yup [22:29:02] <^demon> :( [22:29:03] that is an import from the svn repository [22:29:08] I made using git svn [22:29:13] then some git filter-branch tricks [22:29:50] http://serverfault.com/questions/12918/how-do-i-fix-the-metainformation-on-the-first-commit-in-git [22:30:30] <^demon> Ok, I granted you create reference and push+force on operations/debs/testswarm. [22:30:34] <^demon> Go ahead and push -f [22:30:48] oh [22:31:00] <^demon> Easier than playing with git-filter-branch ;-) [22:35:47] bah [22:35:53] they all have change-ID now :-D [22:38:18] is anyone else seeing ldap problems in labs? [22:38:42] <^demon> What sort of problems? [22:39:10] among others: sudo: ldap_start_tls_s(): Can't contact LDAP server [22:42:18] <^demon> No problems here. Everything I use that points at LDAP is functioning ok. [22:44:06] ^demon: thanks! I have git push -f operations/debs/testswarm , you can get ride of my rights [22:44:16] the rest will follow up as Gerrit changes now ;-) [22:44:48] <^demon> mmk [22:47:03] !log deployment-prep Made [[user:Philippe (WMF)|]] a steward and checkuser user. [23:42:10] !log restarted opendj on virt0 [23:42:35] dammit, the log bot hates me [23:44:23] <^demon> log bot died a little while ago. [23:45:00] I take it personally, as with all deaths. [23:52:27] New patchset: Sara; "Add authority_url to ganglia gmetad.conf template." [operations/puppet] (test) - https://gerrit.wikimedia.org/r/10204 [23:52:43] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/10204 [23:55:34] New review: Sara; "(no comment)" [operations/puppet] (test); V: 0 C: 2; - https://gerrit.wikimedia.org/r/10204 [23:55:37] Change merged: Sara; [operations/puppet] (test) - https://gerrit.wikimedia.org/r/10204