[01:10:57] New review: Dzahn; "ok, going to do this (expect NRPE breakage, will stop nagios-wm temp.)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/3144
[01:11:00] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3144
[01:50:04] New patchset: Dzahn; "add the nagios-nrpe-server init file" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3233
[01:50:16] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3233
[01:52:44] New patchset: Dzahn; "add the nagios-nrpe-server init file" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3233
[01:52:57] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3233
[02:02:17] PROBLEM Free ram is now: CRITICAL on mobile-enwp mobile-enwp output: Critical: 3% free memory
[02:22:17] PROBLEM Free ram is now: WARNING on mobile-enwp mobile-enwp output: Warning: 15% free memory
[02:45:01] RECOVERY dpkg-check is now: OK on pediapress-ocg2 pediapress-ocg2 output: All packages OK
[02:45:01] RECOVERY Current Users is now: OK on pediapress-ocg2 pediapress-ocg2 output: USERS OK - 0 users currently logged in
[02:45:01] RECOVERY Disk Space is now: OK on pediapress-ocg2 pediapress-ocg2 output: DISK OK
[02:45:01] RECOVERY Free ram is now: OK on pediapress-ocg2 pediapress-ocg2 output: OK: 84% free memory
[02:47:17] RECOVERY Total Processes is now: OK on pediapress-ocg2 pediapress-ocg2 output: PROCS OK: 81 processes
[02:47:22] RECOVERY Current Load is now: OK on pediapress-ocg2 pediapress-ocg2 output: OK - load average: 0.02, 0.11, 0.06
[02:47:27] PROBLEM Free ram is now: CRITICAL on mobile-enwp mobile-enwp output: CHECK_NRPE: Socket timeout after 10 seconds.
[02:47:27] PROBLEM Current Load is now: CRITICAL on mobile-enwp mobile-enwp output: CHECK_NRPE: Socket timeout after 10 seconds.
[02:52:27] PROBLEM Current Load is now: WARNING on mobile-enwp mobile-enwp output: WARNING - load average: 7.30, 8.02, 8.33
[02:52:27] PROBLEM Free ram is now: WARNING on mobile-enwp mobile-enwp output: Warning: 10% free memory
[03:02:17] RECOVERY dpkg-check is now: OK on pediapress-ocg3 pediapress-ocg3 output: All packages OK
[03:02:17] RECOVERY Current Load is now: OK on pediapress-ocg3 pediapress-ocg3 output: OK - load average: 0.64, 1.74, 1.15
[03:02:17] RECOVERY Current Users is now: OK on pediapress-ocg3 pediapress-ocg3 output: USERS OK - 0 users currently logged in
[03:04:57] RECOVERY Disk Space is now: OK on pediapress-ocg3 pediapress-ocg3 output: DISK OK
[03:04:57] RECOVERY Free ram is now: OK on pediapress-ocg3 pediapress-ocg3 output: OK: 84% free memory
[03:04:57] RECOVERY Total Processes is now: OK on pediapress-ocg3 pediapress-ocg3 output: PROCS OK: 81 processes
[03:14:35] New review: Dzahn; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/3233
[03:14:40] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3233
[03:27:17] RECOVERY dpkg-check is now: OK on kant1 kant1 output: All packages OK
[03:27:17] RECOVERY Current Load is now: OK on kant1 kant1 output: OK - load average: 0.11, 0.08, 0.05
[03:28:21] New patchset: Dzahn; "sleep 10 before cleaning up PID file, it seemed like sometimes it deletes the pid before the service has started and that might cause the failures" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3234
[03:28:33] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3234
[03:29:57] RECOVERY Current Users is now: OK on kant1 kant1 output: USERS OK - 0 users currently logged in
[03:29:57] RECOVERY Disk Space is now: OK on kant1 kant1 output: DISK OK
[03:29:57] RECOVERY Free ram is now: OK on kant1 kant1 output: OK: 96% free memory
[03:30:07] RECOVERY Total Processes is now: OK on kant1 kant1 output: PROCS OK: 172 processes
[03:30:57] New patchset: Dzahn; "sleep 10 before cleaning up PID file, it seemed like sometimes it deletes the pid before the service has stopped and that might cause the failures" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3234
[03:31:09] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3234
[03:32:24] New review: Dzahn; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/3234
[03:32:27] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3234
[03:45:25] New patchset: Dzahn; "limit swift process monitoring to ms-be hosts because the testing machinges do not have nrpe installed" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3235
[03:45:37] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3235
[03:46:38] New review: Dzahn; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/3235
[03:46:41] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3235
[04:32:27] PROBLEM Free ram is now: CRITICAL on mobile-enwp mobile-enwp output: Critical: 2% free memory
[04:42:27] PROBLEM Free ram is now: WARNING on mobile-enwp mobile-enwp output: Warning: 10% free memory
[04:47:27] PROBLEM Free ram is now: CRITICAL on mobile-enwp mobile-enwp output: Critical: 4% free memory
[05:05:36] is anyone with access on labs up? stews seem to be afk and I want something done and...
[05:05:48] I mean the beta cluster
[05:06:32] * Hazard-SJ wonders if developers like pings
[05:07:27] PROBLEM Free ram is now: WARNING on mobile-enwp mobile-enwp output: Warning: 10% free memory
[05:47:27] RECOVERY Current Load is now: OK on mobile-enwp mobile-enwp output: OK - load average: 5.06, 4.56, 4.92
[06:13:37] New review: Dzahn; "looks good, i would just have called public-services-2 "208.80.153.192/26" = labs => public, but sam..." [operations/puppet] (production); V: 1 C: 1; - https://gerrit.wikimedia.org/r/3115
[06:52:48] New patchset: Dzahn; "decommission dataset1, as it's dead per RT-1345" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3236
[06:53:00] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3236
[07:12:25] RECOVERY Free ram is now: OK on mobile-enwp mobile-enwp output: OK: 24% free memory
[07:38:14] New patchset: Dzahn; "decommission project2 per RT-2637" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3237
[07:38:27] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3237
[07:39:10] New patchset: Dzahn; "decommission project2 per RT-2637" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3237
[07:39:22] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3237
[07:48:25] PROBLEM Free ram is now: WARNING on mobile-enwp mobile-enwp output: Warning: 18% free memory
[08:13:35] PROBLEM Free ram is now: CRITICAL on mobile-enwp mobile-enwp output: CHECK_NRPE: Socket timeout after 10 seconds.
[08:23:10] New patchset: Dzahn; "replace all occurences of "*.wikimedia.org" with "star.wikimedia.org" per RT-2512" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3238
[08:23:22] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3238
[09:00:45] New patchset: Dzahn; "do not permit root logins on bastion hosts" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3239
[09:00:53] New review: gerrit2; "Change did not pass lint check. You will need to send an amended patchset for this (see: https://lab..." [operations/puppet] (production); V: -1 - https://gerrit.wikimedia.org/r/3239
[09:02:37] New patchset: Dzahn; "do not permit root logins on bastion hosts" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3239
[09:02:45] New review: gerrit2; "Change did not pass lint check. You will need to send an amended patchset for this (see: https://lab..." [operations/puppet] (production); V: -1 - https://gerrit.wikimedia.org/r/3239
[09:05:31] New patchset: Dzahn; "do not permit root logins on bastion hosts" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3239
[09:05:43] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3239
[10:55:44] New patchset: ArielGlenn; "vanilla stanza for ms1001 for a start" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3240
[10:55:57] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3240
[10:56:40] New review: ArielGlenn; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/3240
[10:56:43] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3240
[14:32:37] petan: petan|wk: Do you know how to delete global groups?
[14:36:15] hexmode: yes
[14:36:42] petan|wk: good, Let me see if I can get you to delete this group
[14:36:45] 1s
[14:37:25] petan|wk: "global sysop" and "global_sysop"
[14:37:50] http://labs.wikimedia.beta.wmflabs.org/wiki/Special:GlobalGroupPermissions
[14:38:21] ok
[14:39:49] New patchset: Mark Bergsma; "Fix dependencies of varnish::logging" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3242
[14:40:02] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3242
[14:44:38] New review: Mark Bergsma; "First of all, why are root logins being disallowed? Was there a discussion about this somewhere?" [operations/puppet] (production); V: 0 C: -2; - https://gerrit.wikimedia.org/r/3239
[14:45:30] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3242
[14:45:33] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3242
[14:53:32] zzz deployment-sql died again
[14:55:17] I wonder if its actually a bug in CentralAuth regarding the global groups issue
[14:57:24] hexmode: done
[14:57:33] Hydriz: I think it's gluster
[14:57:47] process on -sql server died
[14:57:50] oh
[14:57:54] the one which handle connection to gluster
[14:57:57] nevermind, its up anyway
[14:58:01] I know
[14:58:08] I was on the server in that time
[14:58:14] \o/ its deleted
[14:59:12] petan|wk: If possible, do excute steward stuff on metawiki, so that its a central place which we do things (of course labswiki, if really needed)
[14:59:56] I don't really do steward stuff
[15:00:03] that's what I have you for
[15:00:07] yeah
[15:00:15] I need to inform THO on that
[15:00:19] but I think he knows :)
[15:00:20] hm...
[15:00:46] and if possible, just spam give global sysop or something
[15:01:00] ?
[15:01:22] I mean
[15:01:50] stewardship, isn't too big of an issue, but give global sysop first before assigning stewardship afterwards
[15:02:14] but anyway I think its only up to me and THO to worry anyway
[15:02:25] or hexmode
[15:02:27] it's his site :P
[15:02:35] he rules there
[15:02:35] :D
[15:02:45] hexmode the second
[15:02:51] ruler of beta
[15:03:06] petan|wk: you are my equal, surely!
[15:03:06] :P
[15:03:10] lol
[15:03:13] hehe
[15:03:25] * Hydriz placed you people as staff or developers
[15:03:38] and not stewards anyway
[15:03:50] I don't really care
[15:03:58] yeah, just for fun :)
[15:04:01] but all devs with svn should be able to get developer right
[15:04:13] which basicaly allows them to do anything they need
[15:04:19] for development
[15:04:29] yep
[15:04:42] petan|wk: I need to prod Ryan_Lane to help us get shell users on there
[15:04:47] but if extensions are enabled, it doesn't show the right on metawiki (I think)
[15:05:20] hexmode: I think Reedy already does
[15:05:41] petan|wk: really?
[15:05:50] btw Hydriz in case you ever want to check whether someone's on the WMF staff, http://wikimediafoundation.org/wiki/Staff
[15:05:51] yes he has access to beta shell
[15:05:52] Reedy: shell users are on beta?
[15:05:53] is fairly useful
[15:05:56] yep
[15:06:20] actually I don't use that list anyway :P
[15:07:52] hexmode: also werdna has access to there
[15:08:01] is it what you mean
[15:08:17] petan|wk: no, that is root access
[15:08:23] ah
[15:08:28] I know what you mean now
[15:08:34] I'm talking about ability to deploy things w/o sudo
[15:08:44] ok
[15:08:49] that's not done
[15:10:00] btw hexmode I think we should handle that privacy policy stuff
[15:10:15] petan|wk: ??
[15:10:52] Ryan told us that sites like beta doesn't require people to be identified to wmf in order to have access to sensitive data if they have some own privacy policies or what
[15:11:15] I don't really know how does it work, I am no lawyer heh
[15:11:39] but in past it wasn't even clear if users who aren't identified can get steward on beta
[15:11:40] hexmode: sorry, what?
[15:11:56] Reedy: nm, it is cleared up now
[15:11:58] Reedy: probably nothing, I didn't get what he meant
[15:12:52] hexmode: Ryan said he had a meeting with legals and he was told that labs is exception but all sites must follow something... I check logs, sec
[15:13:28] @search meeting
[15:13:28] No results found! :|
[15:13:35] @regsearch conf
[15:13:36] Results (found 1): pathconflict,
[15:13:40] damn
[15:16:55] http://bots.wmflabs.org/~petrb/logs/%23wikimedia-labs/20120218.txt
[15:16:56] here
[15:17:21] [18:26:29] labs project will be required to display the privacy policy and terms of use, and they must show a warning to users anywhere information can be collected
[15:17:25] [18:26:47] also, privacy information should simply not be kept where possible
[15:17:28] hexmode: ^
[15:17:51] I don't really know what he means
[15:18:00] but we probably need to create some own policies
[15:18:02] or so
[15:18:36] me either
[15:18:46] right
[15:18:46] hrm.... have to think about this
[15:18:51] {{resolved}}
[15:18:54] :D
[15:18:59] INVALID
[15:19:10] :P
[15:20:00] SCREW THIS
[15:20:11] edits that never gets saved
[15:20:14] grrr
[15:21:06] petan|wk: we should just rollback once a week
[15:21:11] problem solved
[15:22:37] heh
[15:22:44] I don't know if people and devs would like it
[15:23:14] but it's quite easy solution though
[15:23:21] it could be even cronned
[15:23:28] however rollback would take site down
[15:23:38] for time of 30 min or so
[15:23:43] maybe even more
[15:23:57] also creation of new wikis would be hard
[15:24:32] user accounts would be lost
[15:24:33] etc
[15:24:36] bans too
[15:24:42] spambots would like it
[15:24:45] :P
[15:25:37] The server has something against me or something
[15:25:48] which
[15:25:53] deployment
[15:26:00] there are 10+
[15:26:08] I am trying to assign permissions for Global sysops
[15:26:19] and?
[15:26:20] then it just exits without anything saved
[15:26:28] that's weird
[15:26:40] and it just passed
[15:26:49] grr, always happens
[15:28:23] chrismcmahon: could you work with Ryan_Lane on the privacy policy thing petan|wk was talking about?
[15:29:00] chrismcmahon: actually, that might be legal that needs consulting
[15:29:25] chrismcmahon: I can email them and CC you if that works
[15:29:37] * hexmode pings incessently
[15:29:43] again it happened
[15:29:46] grrr
[15:29:56] works to me
[15:30:04] http://meta.wikimedia.beta.wmflabs.org/wiki/Special:GlobalGroupPermissions/global_sysop
[15:30:33] but it magically fails everytime
[15:30:41] making usergroup with space is asking for troubles though
[15:30:44] only when I change 1 thing, then it works
[15:30:54] try now
[15:31:08] ok, you right
[15:31:10] there is problem
[15:31:37] * Hydriz is holding down his temper :P
[15:32:27] 2 is passable
[15:32:32] let me try 3
[15:33:11] boom
[15:33:13] failed
[15:34:41] now it's because of memc
[15:34:52] it doesn't matter how many u edit
[15:35:03] finally
[16:07:09] PROBLEM Free ram is now: WARNING on bots-3 bots-3 output: Warning: 19% free memory
[16:14:59] New patchset: Mark Bergsma; "Package varnish is now installed everywhere instead of varnish3" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3247
[16:15:12] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3247
[16:15:43] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3247
[16:15:46] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3247
[16:20:48] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3237
[16:21:29] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3236
[16:21:32] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3237
[16:21:33] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3236
[16:43:21] New patchset: Mark Bergsma; "Rename role/seach.pp to role/search.pp" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3249
[16:43:34] New patchset: Mark Bergsma; "Retab search.pp" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3250
[16:43:48] New patchset: Mark Bergsma; "The role classes are actually called role::lucene instead" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3251
[16:44:01] New patchset: Mark Bergsma; "Add review comments" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3252
[16:44:14] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3249
[16:44:14] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3250
[16:44:14] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3251
[16:44:14] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3252
[16:48:07] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/2682
[16:48:10] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/2682
[16:49:30] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3024
[16:52:05] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3115
[16:52:08] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3115
[16:54:11] New patchset: Mark Bergsma; "Move labs hosts subnet to private" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3253
[16:54:23] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3253
[16:54:42] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3253
[16:54:45] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3253
[16:57:00] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/2788
[16:57:03] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/2788
[16:58:40] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3131
[16:58:43] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3131
[17:00:07] New review: Mark Bergsma; "Set -1 until git switchover" [operations/puppet] (production); V: -1 C: 0; - https://gerrit.wikimedia.org/r/2786
[17:13:58] petan|wk: so, it seems that legal and community may have changed their mind
[17:14:43] they want to follow the admin policy for projects, and as such we need to identify for things like beta like we do for other projects
[17:15:14] New review: Mark Bergsma; "That hostname check is not appropriate in swift.pp! Give the class a parameter, and call it as appro..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3072
[17:16:46] Ryan_Lane: I'm looking at this line: { "nova-core-release": repo_string => "nova-core/release", apt_key => "2A2356C9", dist => "lucid", ensure => "absent" }
[17:17:05] Doesn't ensure => absent mean 'remove this repo from apt sources if it is present'
[17:17:06] ?
[17:17:09] yep
[17:17:28] Then what does it mean that many other classes depend on 'nova-core-release'?
[17:17:40] lemme look
[17:18:01] hm
[17:18:03] The next line is apt::pparepo { "nova-core-release-diablo": repo_string => "openstack-release/2011.3", apt_key => "3D1B4472", dist => "lucid", ensure => "present" } which, presumably, is the repo you want everyone to be using.
[17:18:04] they should rely on nova-core-release-diablo
[17:18:14] But then, diablo is only referred to in /one/ other place.
[17:19:07] Are those other deps on nova-core-release just leftovers that happen not to break anything?
[17:20:34] New review: Mark Bergsma; "Please fix the indentation in iron's node entry" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3121
[17:21:39] Ryan_Lane: does it mean I need to remove all users from the user groups?
[17:22:02] btw I need to go now
[17:23:13] petan|wk: I'll find out
[17:23:16] I hope not
[17:23:25] andrewbogott: yep
[17:23:36] andrewbogott: because it's still there, even though it's set as absent
[17:24:10] it's a bug, though. hejh
[17:24:20] *heh
[17:24:43] OK -- that's encouraging, actually :)
[17:25:27] New review: Mark Bergsma; "Rob:" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3122
[17:27:26] New review: Pyoungmeister; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3249
[17:27:29] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3249
[17:28:20] New review: Pyoungmeister; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3250
[17:28:22] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3250
[17:29:08] hm. why is the bot sending production branch changes in here?
[17:29:13] that's not right
[17:29:13] New review: Pyoungmeister; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3251
[17:29:16] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3251
[17:29:29] the bot spam is evil and must be purged with fire
[17:30:44] New review: Pyoungmeister; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3252
[17:31:35] Are apt::pparepo lines traversed even if nothing depends on the repo?
[17:31:42] yep
[17:32:26] New patchset: Ryan Lane; "Fixing labs/production split in gerrit reporting" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3254
[17:32:38] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/3254
[17:33:10] New review: Ryan Lane; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/3254
[17:33:12] Change merged: Ryan Lane; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3254
[17:34:12] Are any of y'all using git-review on OSX?
[17:35:08] oh, nm, found a howto by hashar.
[17:37:44] which fails, alas!
[17:46:37] andrewbogott: yeah, we're using it
[17:46:40] why's that?
[17:46:54] I'm running Leopard, probably too old of a python.
[17:47:01] ah
[17:47:03] s'alright, I can do what I want from a linux box.
[17:47:06] for git-review?
[17:47:53] Yeah, I get a syntax error in the installer.
[17:48:12] I think 'with' is a newish keyword.
[17:56:46] git-review uses the same key as the one I submitted for svn access?
[17:58:13] New review: Mark Bergsma; "Why is this done through a gazillion checkcommands instead of one which takes a parameter?!" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3144
[18:04:03] Ryan_Lane: Who is the/a boss of gerrit access? My ssh key is getting rejected /and/ I seem to not know my password to the web interface.
[18:04:14] * andrewbogott suffers from chronic noob syndrome
[18:04:17] heh
[18:04:29] it's your wiki username/password
[18:04:45] Gerrit is its own boss :D
[18:04:52] if you don't remember it, you can get the wiki to send you a new password
[18:23:52] Hm... nope, I can log into en.wikipedia.org but not gerrit.wikimedia.org. Is that what you meant?
[18:26:56] gerrit is ldap
[18:27:00] so use your labsconsole login
[18:27:12] or get labsconsole to send you a new pass
[18:27:24] andrewbogott: labsconsole :)
[18:27:33] they both share the same credentials
[18:28:49] Ah! Ok, now I'm in. Let's see if I can get git-review to work now...
[18:32:03] ssmollett: so, we should likely make a new project for salt
[18:32:13] hm. I should really work on adding that new cisco node
[18:32:25] we're really hitting a wall with instance creation
[18:33:58] grrrrr....
[18:34:12] OK, so when talking to gerrit via ssh is my username my shell name or my wiki username?
[18:35:17] nm, key confusion
[18:37:04] shell
[18:41:58] andrewbogott: shell
[18:42:02] also, it uses a different key
[18:42:12] if you want it to be the same key as labs, you need to upload that same key
[18:42:18] it's an open bug in gerrit
[18:42:24] to pull from LDAP, that is
[18:42:35] yeah, I figured it out. Why does the gerrit-bot identify me as 'anonymous coward'?
[18:42:40] o.O
[18:42:46] it definitely should not
[18:42:53] lemme take a look at your account
[18:43:09] log out and back in?
[18:43:39] your account looks fine
[18:45:46] andrewbogott: go to Settings
[18:45:51] what does "Profile" show?
[18:46:14] My name and email
[18:46:30] and in the top of the page it shows "Anonymous Coward"?
[18:47:04] No, but I think I see what's happening... have two emails registered and only one has a name attached.
[18:47:09] ahhhhh
[18:47:09] ok
[18:47:23] that's likely the reason
[18:48:20] andrewbogott: so, ma rk will probably end up updating pybal for ipv6 using twisted, or he'll probably handle converting to another framework
[18:48:27] but, we can still write a driver for pybal :)
[18:48:31] sure
[18:48:43] it's configured via text files
[18:48:47] we just need to manage the text files
[18:49:11] which actually makes things a million times easier
[19:05:33] New patchset: Jgreen; "sigh, pure puppet trial and error, waste of revision control and notification packets" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/3264
[19:05:43] New review: gerrit2; "Change did not pass lint check. You will need to send an amended patchset for this (see: https://lab..." [operations/puppet] (test); V: -1 - https://gerrit.wikimedia.org/r/3264
[19:08:54] New patchset: Jgreen; "sigh, pure puppet trial and error, waste of revision control and notification packets more rcs . . . wasted." [operations/puppet] (test) - https://gerrit.wikimedia.org/r/3264
[19:09:05] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/3264
[19:09:35] New review: Jgreen; "(no comment)" [operations/puppet] (test); V: 1 C: 2; - https://gerrit.wikimedia.org/r/3264
[19:09:35] Change merged: Jgreen; [operations/puppet] (test) - https://gerrit.wikimedia.org/r/3264
[19:23:54] hm... Jeff_Green, did your patch merge automatically when you approved and verified, or is there another step to prompt a merge?
[19:24:18] it merged automatically
[19:24:33] 'k, probably I just need to be patient.
[19:24:37] I just followed the directions to amend and it worked
[19:25:13] of course half the time I intend to do that I botch it and end up blasting my git depot and starting fresh :-)
[19:25:58] My change looks the same as yours did right before it merged itself... https://gerrit.wikimedia.org/r/#change,3262
[19:26:49] Ah! I clicked 'publish' instead of 'publish and submit'
[19:26:52] ah
[19:27:33] i would really like a way to do rapid testing of puppet tweaks without the gerrit overhead
[19:27:43] yeah, that's in the works.
[19:27:53] good!
[19:27:58] Although I haven't made much progress on my part of the job :(
[19:28:11] Doing a little bit of puppet development will motivate me!
[19:28:28] puppet is such an unfriendly convoluted beast that it becomes excruciating to hack on it this way
[19:29:27] I wonder how much of a disaster it would be to bring up a puppet instance within a project and point my instances at it
[19:30:21] Or just clone the repo onto an instance and hack on it there pointing puppet at it locally.
[19:30:28] yeah exactly
[19:30:49] i'm thinking about the missing use of site.pp
[19:31:38] I don't really understand how we get from the puppet class checkboxes in labsconsole to instances fetching classes
[19:32:52] the node is assigned puppet classes in ldap with puppet accepting ldap as an external source I believe
[19:32:58] then the classes just relate to the classes in the repo
[19:33:58] ahh, so if I clobber the local puppet config on an instance I can bypass that?
[19:34:52] puppetmaster does that, you could create a node definition with the classes you want and use that for developing locally.
[19:35:17] In my work puppet dev setup I actually just dump nodes out into configs so I don't rely on the ldap cluster being up.
[19:35:32] i see
[19:39:25] Damianz: by 'puppetmaster' you mean the class listed in labsconsole?
[19:40:15] Nope I mean the puppetmaster, it's probably configured by that class though
[19:40:34] Currently there is a central node everything connects to and pulls their stuff from which runs the puppetmaster
[19:41:00] There was talk about using git and having each instance pull then do puppet stuff locally so we can ditch that box.
[19:41:01] oh i see
[19:41:18] Doing it that way would also allow us to run seperate branches for testing stuff before getting involved with gerrit and production stuff.
[19:41:28] right
[19:41:55] i'm just going to hand configure a puppetmaster within my project and decouple it from git
[19:42:04] since we're cherry picking everything anyway
[19:42:46] You could do that or if you're only wanting to dev stuff to then be picked into the test branch just run puppet directly on your working repo.
[19:42:56] Configuring a puppetmaster would be overkill for dev
[19:43:17] eh, I did it for payments--there's not much to it
[19:54:06] PROBLEM dpkg-check is now: CRITICAL on pediapress-puppetmaster pediapress-puppetmaster output: CHECK_NRPE: Error - Could not complete SSL handshake.
[19:54:46] PROBLEM Current Load is now: CRITICAL on pediapress-puppetmaster pediapress-puppetmaster output: CHECK_NRPE: Error - Could not complete SSL handshake.
[19:54:54] Ooh, new instance?
[19:55:07] muwhahahahahha
[19:55:26] Oh no! My IRC logs will be filled with spam!
[19:55:36] PROBLEM Current Users is now: CRITICAL on pediapress-puppetmaster pediapress-puppetmaster output: CHECK_NRPE: Error - Could not complete SSL handshake.
[19:55:45] sorry about that, if I knew how to unmonitor all the pediapress labs instances I would
[19:56:04] Nah. D'you know the command to stop it?
[19:56:10] nope
[19:56:16] PROBLEM Disk Space is now: CRITICAL on pediapress-puppetmaster pediapress-puppetmaster output: CHECK_NRPE: Error - Could not complete SSL handshake.
[19:56:34] sudo su -c "puppetd -tv"
[19:56:49] oh, yeah that won't work
[19:56:52] Ah.
[19:56:56] PROBLEM Free ram is now: CRITICAL on pediapress-puppetmaster pediapress-puppetmaster output: CHECK_NRPE: Error - Could not complete SSL handshake.
[19:56:56] I just decoupled the host from site puppet
[19:57:09] Well, fair enough
[19:58:26] PROBLEM Total Processes is now: CRITICAL on pediapress-puppetmaster pediapress-puppetmaster output: CHECK_NRPE: Error - Could not complete SSL handshake.
[20:20:30] Why I can't change the full name on gerrit?
[20:21:30] Ryan_Lane: I see some failing tests that look like this: /usr/bin/mysql -uroot ${openstack::nova_config::nova_db_name} -e 'exit'
[20:21:48] Notably, no password is specified in those tests... is there some reason why they should work anyway?
[20:41:47] andrewbogott: hm.
[20:41:52] no password is needed
[20:42:11] puppet should install the password before using that command
[20:42:41] in set_root?
[20:43:46] The same command fails in the shell; is there something in puppet's env that handles the password?
[20:44:21] it should stick it into .my.cf
[20:44:39] err .my.cnf
[20:44:41] in /root
[20:45:13] IWorld: which full name?
[20:45:18] IWorld: it's pulling from LDAP [20:45:20] ok, I see it in .my.cnf. [20:45:23] it's your wikiname [20:45:25] ah [20:45:27] ok [20:45:38] When does .my.cnf take effect? [20:47:02] should be immediate [20:47:08] can you type mysql -uroot [20:47:09] ? [20:47:16] it should give you a shell without a password prompt [20:47:29] * Jeff_Green and down the rabbit hole we go "Execution of '/usr/local/bin/position-of-the-moon' returned 1" [20:47:38] yeah [20:47:47] you are trying to run them locally? [20:48:01] it's amazingly hard to do [20:48:04] Ryan_Lane: me? [20:48:07] yes [20:48:25] $ mysql -uroot [20:48:25] ERROR 1045 (28000): Access denied for user 'root'@'localhost' (using password: YES) [20:48:26] yeah I'm just going to knife anything that gets in my way so I can work on the trivial cherrypicking stuff I need to work on [20:48:33] hm [20:48:40] But if I specify the password on the commandline (the same one that's in .my.cnf) it works. [20:48:46] that's really weird [20:49:20] Actually, if I specify it with -p it works, and if with --password it gives me a password prompt. [20:49:44] Oh, nm, that last statement is false. [20:49:50] this is a very long shot and I haven't read much backscroll, but have you guys confirmed that the grants are subnet based and not hostname based? [20:49:56] It works with --password too if I get the syntax right. [20:51:28] Jeff_Green: it's localhost [20:51:51] Unix sockets ftl. [20:52:07] well, it's a problem with the .my.cnf, likely [20:52:18] Permissions on the .my.cnf or syntax related? [20:52:18] which instance is this? [20:52:27] Ryan_Lane: yeah I know, just saying--it's something I've seen cause all manner of auth higgledypiggldy [20:52:39] yep [20:52:42] ssmollett, hi there [20:52:52] andrewbogott: which instance is this? [20:53:05] essex-test-l still. [20:53:34] that's an 'l' as 'lucid' not a 1. 
[20:53:39] ok [20:53:55] PROBLEM Current Load is now: CRITICAL on essex-1 essex-1 output: CHECK_NRPE: Error - Could not complete SSL handshake. [20:54:16] What does the bots project? [20:54:16] things are so much faster now that I unloaded virt2 [20:54:23] I really need to pay more attention to its load [20:54:35] IWorld: the bots project is for running bots [20:54:36] PROBLEM Current Users is now: CRITICAL on essex-1 essex-1 output: CHECK_NRPE: Error - Could not complete SSL handshake. [20:54:46] andrewbogott: it works for me [20:55:00] Ryan_Lane: for wikis, irc and more? [20:55:08] maybe you need to exit and go back into root? [20:55:15] PROBLEM Disk Space is now: CRITICAL on essex-1 essex-1 output: CHECK_NRPE: Error - Could not complete SSL handshake. [20:55:54] IWorld: yep [20:56:05] PROBLEM Free ram is now: CRITICAL on essex-1 essex-1 output: CHECK_NRPE: Error - Could not complete SSL handshake. [20:56:06] aah [20:56:15] Ryan_Lane: a toolserver alternate? [20:56:39] yes [20:56:54] we don't have replicated copies of the databases yet, though [20:57:07] ah [20:57:09] but starting today we'll have public datasets available :) [20:57:17] cool [20:57:24] :o [20:57:25] PROBLEM Total Processes is now: CRITICAL on essex-1 essex-1 output: CHECK_NRPE: Error - Could not complete SSL handshake. [20:57:40] also, I need to change the directory scheme for project data [20:57:55] to be /data/pmtpa/ rather than /data/ [20:57:58] Replicated dbs would be so nice but sigh. Might as well just rewrite bits of the bot for now to make it less bizarre. [20:58:01] Ryan_Lane: Are you becoming root via 'sudo bash'? [20:58:01] since we'll have multiple zones [20:58:04] andrewbogott: no [20:58:10] that doesn't source your environment [20:58:11] o.0 [20:58:14] sudo su - ftw [20:58:15] PROBLEM dpkg-check is now: CRITICAL on essex-1 essex-1 output: CHECK_NRPE: Error - Could not complete SSL handshake. 
[20:58:16] sudo bash ftl [20:58:18] sudo su -s, or sudo su - [20:58:21] Ah, of course, it's looking at my local env... [20:58:24] err [20:58:30] sudo -s, or sudo su - [20:58:45] sudo ryan sleep; [20:58:56] you only want me to sleep for 1 second? [20:59:05] PROBLEM Total Processes is now: CRITICAL on pediapress-ocg3 pediapress-ocg3 output: CHECK_NRPE: Error - Could not complete SSL handshake. [20:59:16] OK, well, this explains why it was failing on the commandline, although not why it was failing for puppet. Hm. [20:59:19] 1second is enough for anyone. [21:00:25] PROBLEM Current Load is now: CRITICAL on pediapress-ocg3 pediapress-ocg3 output: CHECK_NRPE: Error - Could not complete SSL handshake. [21:00:25] PROBLEM Current Users is now: CRITICAL on pediapress-ocg3 pediapress-ocg3 output: CHECK_NRPE: Error - Could not complete SSL handshake. [21:00:25] PROBLEM dpkg-check is now: CRITICAL on pediapress-ocg3 pediapress-ocg3 output: CHECK_NRPE: Error - Could not complete SSL handshake. [21:01:55] PROBLEM Disk Space is now: CRITICAL on pediapress-ocg3 pediapress-ocg3 output: CHECK_NRPE: Error - Could not complete SSL handshake. [21:02:45] PROBLEM Free ram is now: CRITICAL on pediapress-ocg3 pediapress-ocg3 output: CHECK_NRPE: Error - Could not complete SSL handshake. [21:14:40] New patchset: Andrew Bogott; "First stab at making the openstack version configurable." [operations/puppet] (test) - https://gerrit.wikimedia.org/r/3268 [21:14:51] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/3268 [21:14:57] Is labs the better toolserver? 
[21:15:57] Good night [21:25:07] New review: Andrew Bogott; "(no comment)" [operations/puppet] (test); V: 0 C: 0; - https://gerrit.wikimedia.org/r/3268 [21:25:24] New review: Andrew Bogott; "(no comment)" [operations/puppet] (test); V: 1 C: 2; - https://gerrit.wikimedia.org/r/3268 [21:25:27] Change merged: Andrew Bogott; [operations/puppet] (test) - https://gerrit.wikimedia.org/r/3268 [21:37:25] RECOVERY Total Processes is now: OK on essex-1 essex-1 output: PROCS OK: 103 processes [21:38:55] RECOVERY Current Load is now: OK on essex-1 essex-1 output: OK - load average: 1.94, 1.39, 0.65 [21:39:35] RECOVERY Current Users is now: OK on essex-1 essex-1 output: USERS OK - 1 users currently logged in [21:40:16] RECOVERY Disk Space is now: OK on essex-1 essex-1 output: DISK OK [21:41:06] RECOVERY Free ram is now: OK on essex-1 essex-1 output: OK: 79% free memory [21:43:16] RECOVERY dpkg-check is now: OK on essex-1 essex-1 output: All packages OK [21:53:39] New patchset: Andrew Bogott; "Added a (possibly incorrect) apt-key for nova-core/trunk" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/3271 [21:53:50] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/3271 [21:54:45] New review: Andrew Bogott; "(no comment)" [operations/puppet] (test); V: 1 C: 2; - https://gerrit.wikimedia.org/r/3271 [21:54:47] Change merged: Andrew Bogott; [operations/puppet] (test) - https://gerrit.wikimedia.org/r/3271 [22:01:16] PROBLEM dpkg-check is now: CRITICAL on essex-1 essex-1 output: DPKG CRITICAL dpkg reports broken packages [22:49:18] 03/16/2012 - 22:49:18 - Updating keys for pgehres [22:50:12] 03/16/2012 - 22:50:12 - Updating keys for pgehres [23:18:24] rlane: here [23:18:57] !account-questions [23:18:57] I need the following info from you: 1. Your preferred wiki user name. This will also be your git username, so if you'd prefer this to be your real name, then provide your real name. 2. Your preferred email address. 3. 
Your SVN account name, or your preferred shell account name, if you do not have SVN access. [23:19:55] rfaulk: ^^ [23:22:12] 1. rfaulk 2. rfaulkner@wikimedia.org 3. rfaulk [23:22:57] !initial-login | rfaulk [23:22:58] rfaulk: https://labsconsole.wikimedia.org/wiki/Access#Initial_log_in [23:28:37] thanks, im all set [23:35:23] cool