[00:00:05] that's the link to the exe, best to just use that as there's 3 things there that you will need, and there's a 4th that could potentially be useful [00:00:30] Command line. Oh balls. [00:00:32] the download page is http://www.chiark.greenend.org.uk/~sgtatham/putty/download.html - where you can download the individual programs, but just use the all in one installer [00:01:03] Thehelpfulone: once I get everything set up though, it'll all work by itself? [00:01:29] your bots? yes they can be configured to work by themselves [00:02:26] Alright. I have to go, I'll be back in an hour. If you're still around then, we can continue. Sorry [00:03:46] Thehelpfulone: yes, that's the problem [00:03:55] no problem [00:03:59] Is problem [00:04:00] (that was to Sven_Manguard ) [00:06:59] How do I get mysql access on the bots-sql servers? [00:07:13] Ask nicely [00:07:20] please? :) [00:09:08] What for, how much space, required username etc? [00:25:51] sorry, one sec... [00:26:34] just for some cimple bots, 100MB, username 'kaldari' [00:26:59] cimple/simple [00:28:29] the first bots I'm transferring over is HotArticlesBot which only stores subscriptions in the DB, so it needs virtually no space, but I might move some of my Commons bots over as well, which need more db storage, thus the 100MB. [01:02:51] Thehelpfulone: I'm back, right when I said I would be. [01:05:43] Okay, wow. Is this *legal* ? [01:06:06] wait, no, encryption isn't illegal in the US I don't think [01:09:21] Thehelpfulone or Damianz or Ryan_Lane (although didn't work end for you all) so do I need to download the individual files from the second link or just click the first link [01:09:49] Sven_Manguard: the first link [01:09:57] I'm off to bed soon but I'll start you of [01:10:00] run the installer [01:10:01] And you trust this to be sage [01:10:07] safe* [01:10:08] yes of course [01:11:02] then run putty gen to generate a private/public key pair [01:11:09] Alright, it's installed [01:11:34] Wait, Thehelpfulone I've heard about Public Keys before (from XKCD) but what are they? [01:12:07] see http://www.unixwiz.net/techtips/putty-openssh.html [01:12:22] that will explain everything [01:13:06] Sven_Manguard: I'll help you tomorrow evening my time (UTC) set this up, but in the mean tie [01:13:07] time [01:13:11] skip out Insure the installation directory is in the command path [01:13:28] do the section that says Create and install a public/private keypair [01:13:45] wa wa wha? [01:14:06] Okay, go to bed and then tomorrow you can help. [01:14:12] Sven_Manguard: can you wait until tomorrow for this? I can do it step by step with you as I've set it up myself recently [01:14:17] mhmm [01:14:23] night night [01:14:25] :) [01:14:38] * Sven_Manguard wanders off in a daze and finds himself at his Xbox [01:14:47] ah, there we go, familiarity [01:18:09] * Damianz is bored [01:22:10] Damianz: you know Java? [01:22:40] I can give you a pile of bots and let you mull over them if you know Java [01:23:42] * Sven_Manguard holds out my hand to engage in thumb war with Damianz [01:23:49] * Sven_Manguard realizes that there's no way that could work [01:23:53] I don't particularly know java apart from hacking on it and bordem != lots of free time [01:24:07] bordem == non entertaining lots of work to do [01:24:45] you're working? at 9:30 PM east coast 6:30 PM west coast? [01:32:28] 2.30 gmt [01:33:00] And yes, I pretty much work in one capacity or another for like 20hours a day :P [02:36:59] paravoid: /srv/org/wikimedia/controller/wikis/1.18wmf1/extensions/OATHAuth [02:40:07] 05/03/2012 - 02:40:07 - Creating a home directory for faidon at /export/home/openstack/faidon [02:41:07] 05/03/2012 - 02:41:07 - Updating keys for faidon [02:49:56] no credentials found for your account. can we add rotating BZ quips to that message? [04:44:57] !log deployment-prep [deployment-dbdump] did a checkpoint `git commit -a` on deploymentprep-conf (/usr/local/apache/common) (locally not pushed anywhere) because there were lots of changes on disk but not in the repo. but didn't add any new files to the repo. (so there's still stuff reported uncommited by `git status`) [04:44:58] Logged the message, Master [04:47:14] !log deployment-prep [deployment-dbdump] changed all refs to IPs of prod hosts nfs-home and ekrem to be deployment-feed instead. and commited that to the local repo. (again not pushed anywhere yet) [04:47:15] Logged the message, Master [04:48:09] !log deployment-prep [deployment-dbdump] (that was to address per complaints about beta simplewiki appearing in #cvn-simplewikis on freenode) [04:48:10] Logged the message, Master [04:59:10] !log deployment-prep [deployment-dbdump] apt-get installed 'ack' [04:59:11] Logged the message, Master [05:02:28] ack > grep ? [05:02:38] read about it a few times, never tried it [05:02:55] btw, i found otu that you did that by reading my enotif ;P [05:02:58] out* [05:03:45] !log deployment-prep [deployment-dbdump] apt-get purge ack; - On ubuntu ack is "ack-grep" which was already installed [05:03:46] Logged the message, Master [05:04:11] what's ack then? [05:04:39] haha, kanji code converter/checker [05:12:39] jeremyb: eh.. so.. how am I supposed to make a change to CommonSettings.php on labs? I've done it in the past on prototype.wikimedia.org but permissions are different here [05:12:45] I can't as myself, and I can't under sudo [05:12:48] both denied access [05:19:07] Krinkle-away: where did you try? [05:19:09] jeremyb: btw, https://labsconsole.wikimedia.org/w/index.php?title=Nova_Resource%3ADeployment-prep%2FSAL&action=historysubmit&diff=3396&oldid=3395 [05:19:39] the docs say don't do anything on the apaches, do everything on -dbdump [05:19:42] jeremyb: deployment-dbdump, /usr/local/apache/common-local/wmf-config/CommonSettings.php [05:19:43] so that's where i did it [05:19:49] yep [05:19:57] oh, i did on common/ not common-local/ [05:20:02] same thing [05:20:03] alias [05:20:13] * jeremyb logs back in [05:20:38] Do you change user or something? [05:20:44] didn't [05:20:56] just straight vim/emacs/nano w/e ? [05:21:02] vim [05:22:06] Whatever editor I use I get [ Error writing CommonSettings.php: Permission denied ] [05:22:22] also tried cp to my ~/ , edit there (works), and cp back [05:22:29] on the cp back, same error: [ Error writing CommonSettings.php: Permission denied ] [05:23:03] I can sudo and change the chmod, but that doesn't make sense [05:23:43] give me more context. your cd/pushd cmds, pwd, vim cmd, etc. [05:24:14] krinkle$ ssh deployment-dbdump.pmtpa.wmflabs [05:24:37] cd /usr/local/apache/common-local/wmf-config/ [05:24:53] nano CommonSettings.php [05:24:55] (edit) [05:25:02] ^X [05:25:06] [ Error writing CommonSettings.php: Permission denied ] [05:25:50] i just wrote. tried nano even [05:27:47] $ id jeremyb [05:27:47] uid=2127(jeremyb) gid=550(svn) groups=1042(project-bots),1054(project-bastion),1070(project-search),1072(project-deployment-prep),1106(project-upload-wizard),1122(project-otrs),1146(project-swift),1156(project-swift3),550(svn) [05:27:54] $ id krinkle [05:27:55] uid=2008(krinkle) gid=550(svn) groups=1038(project-testswarm),1042(project-bots),1054(project-bastion),1066(project-gerrit),1072(project-deployment-prep),1152(project-resourceloader2),550(svn) [05:28:13] no obvious issues [05:28:41] same group (svn0 [05:31:22] Krinkle: stat teh thing that you're trying to write [05:32:05] ? [05:32:30] Krinkle: `stat path/to/file/i/passed/to/nano` [05:32:38] oh [05:32:58] ok [05:33:10] jeremyb: Anything in particular? [05:33:16] Access: (0664/-rw-rw-r--) Uid: ( 1002/mwdeploy) Gid: ( 1075/ depops) [05:33:17] pastebin? or /msg [05:33:25] yeah, that's the same thing i have [05:33:42] i have an idea [05:34:11] I'm all ears :0 [05:34:32] btw, this change has no rush. just an E_NOTICE that has been bugging me for a bit [05:34:45] thanks for the help :) [05:35:08] if/when I get it to work I'll be able to help out more on labs (which I intend to) [05:35:18] bingo [05:35:20] figured it out [05:35:41] extract(bingo) [05:36:51] $ for box in deployment-{nfs-memc,dbdump}.pmtpa.wmflabs; do echo -n "$box: "; ssh $box getent group depops; done 2>/dev/null [05:36:54] deployment-nfs-memc.pmtpa.wmflabs: depops:x:1075:petrb,jeremyb,mah,hashar,johnduhart,catrope [05:36:57] deployment-dbdump.pmtpa.wmflabs: depops:x:1075:petrb,hashar,mah,johnduhart,platonides,werdna [05:37:11] oh [05:37:14] hm [05:37:25] since when is that managed per instance? [05:37:31] I thought it was per group / project on labs [05:37:35] I mean.. I can ssh into it [05:37:45] so, what matters here is your uid on the nfs client and your group on the nfs server [05:38:17] that particular group happens to be local. but the gid might overlap with nfs [05:38:27] ugh. [05:38:32] can you or I fix it? [05:38:42] sure [05:45:01] aight, will check out tomorrow [05:45:10] where are you going? ;) [06:02:08] !log deployment-prep [deployment-prep, deployment-nfs-memc] ran `for u in catrope hashar jeremyb johnduhart krinkle mah petrb platonides werdna; do sudo usermod -a -G depops $u; done`; krinkle was unable to modify files in wmf-config and I thought i saw why he couldn't but couldn't see why '''I''' could. turned out the groups on nfs-memc were the important ones and I was there. synced the 2 boxes with eachother and added krinkle to the list. som [06:02:10] Logged the message, Master [06:03:52] !log deployment-prep [deployment-prep, deployment-nfs-memc] …some other deployment-prep boxes have different depops groups. (one empty with a different gid than the rest. one is same gid but just has petrb) [06:04:19] labs-morebots: ping? [06:04:31] damnit. is it the ellipsis? [06:05:01] !log deployment-prep ping [06:05:02] Logged the message, Master [06:05:15] !log deployment-prep [deployment-prep, deployment-nfs-memc] some other deployment-prep boxes have different depops groups. (one empty with a different gid than the rest. one is same gid but just has petrb) [06:05:15] Logged the message, Master [06:27:00] Change on 12mediawiki a page Wikimedia Labs/Terms of use was modified, changed by Ryan lane link https://www.mediawiki.org/w/index.php?diff=531986 edit summary: [06:43:35] !log deployment-prep hashar: hashar: bug 36441, added ErrorDocument 404 [06:43:37] Logged the message, Master [06:46:22] !log dumps Mounted goodies on every host. [06:46:23] Logged the message, Master [07:12:29] jeremyb: hey [07:16:48] hey hashar Additionally, a 404 Not Found error was encountered while trying to use an ErrorDocument [07:16:57] it doesn't work [07:17:07] Ryan_Lane: is there a way to make sudo doesn't need pw [07:17:25] we badly need it on beta [07:17:49] hey :) [07:18:04] petan|wk: for? [07:18:04] had to commute from home to coworking place [07:18:23] Ryan_Lane: we need to be able to execute commands on all instances from one [07:18:29] I do plan on switching sudo to nopasswd, but I'm also changing ALL to the project group [07:18:30] ssh to all and run sudo [07:18:50] petan|wk: works for me : http://en.wikipedia.beta.wmflabs.org/IDONOTEXIST [07:19:03] I need to make some changes for this to work safely [07:19:04] if we put it to for xxx do ssh blah sudo blah it crashes [07:19:08] petan|wk: that gives me a nice 404 error page and redirects me to /wiki/IDONOTEXIST [07:19:20] if you give nopasswd to ALL, you give system services sudo too [07:19:29] which is the same as running all services as root [07:19:38] Ryan_Lane: why system services are in sudoers [07:19:48] they shouldn't be able to run sudo [07:19:52] on my servers they can't [07:19:59] why they can here [07:20:06] ALL means all users [07:20:24] that's why I'm going to switch it to the project group [07:20:49] aha [07:20:59] Ryan_Lane: for example on deployment not ALL users have access [07:21:03] we specified roles [07:21:27] so, for that, you can set NOPASSWD: ALL in the command line [07:21:28] ALL have access only on dbdump [07:21:33] rather than just ALL [07:21:39] that's what I did result was that sudo didn't work at all [07:21:44] just don't do that on any role that has ALL set for users [07:22:03] whatever I did it told me I am not allowed to run that [07:22:11] probably some syntax [07:22:13] problem [07:22:17] dunno [07:22:17] could be [07:22:21] NOPASSWD: ALL [07:22:25] ^^ that should work [07:22:26] I did NOPASSWD: ALL to command line [07:22:28] that didn't work [07:22:29] hm [07:22:34] I'll need to see how it works [07:23:23] I didn't check how the produced file looked [07:23:32] produced file? [07:23:33] I would need to know local root pw [07:23:39] puppet create /etc/sudoers [07:23:39] what produced file? [07:23:42] no [07:23:46] not anymore [07:23:48] I didn't know how did it look [07:23:50] it's in ldap [07:23:59] right but there must be some /etc/sudoers [07:24:00] no [07:24:05] sudo pulls from ldap [07:24:07] that's config file of sudo [07:24:22] ok but how did you tell to sudo it should pull it from ldap [07:24:30] where is the NOPASSWD part [07:24:36] if not in sudoers file [07:24:40] in ldap? [07:24:40] it's all in ldap [07:24:43] eh [07:24:45] right [07:24:54] actually I tried to change it by hand in sudoers and it worked [07:25:12] yeah. it could be that you need to specify it differently in ldap [07:25:17] hm [07:25:20] I'll figure it out and let you know [07:25:29] are we fine using the name of « betalabs » ? :-D [07:25:34] yes [07:26:04] will amend my changes and send them to review so :-] [07:28:27] should I rename it then [07:28:44] meh [07:28:56] I don't see why the links need to change [07:29:02] err [07:29:02] urls [07:29:28] petan|wk: you know it isn't possible to have a project not renamed at minimum 3 times, right? [07:29:40] welcome to the inside joke of projects at WMF [07:29:44] project not [07:29:53] I mean the dns [07:30:00] we didn't rename project so far [07:30:03] I'd prefer to not mess with the dns [07:30:12] we already changed deployment to beta [07:30:20] yeah. let's stick with beta [07:30:22] ok [07:30:28] so hashar what did you want to change then [07:30:39] betalabs is new name of what [07:30:40] it has wmflabs in the domain name. what's the point in adding labs in there again? [07:30:48] I don't know [07:30:48] The two hardest problems in computer science: 1. naming things; 2. cache invalidation; 3. off-by-one errors. [07:30:55] hashar wanted to change it ask him [07:31:09] when we refer to it, i think we should refer to it as beta in labs [07:31:14] I wanted to have a short and useful name to refers to the beta project [07:31:21] jerith: I usualy name variables a b x etc [07:31:23] and got it as betalabs :-] [07:31:24] :D [07:31:25] it's currently beta ;) [07:31:42] so now I can give a useful name to my puppet classes [07:31:58] why do the puppet classes need to be specific to beta? [07:32:01] they should be generic [07:32:02] hashar: ok so I changed the category in bugzilla [07:32:19] a *lot* of the puppet work is already done, it just needs to be applied [07:32:23] and maybe tweaked a little [07:32:35] New patchset: Hashar; "classes for deployment preparation project (beta)" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/5790 [07:32:50] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/5790 [07:33:12] New review: Hashar; "patchset 3 uses « betalabs » instead of « beta »" [operations/puppet] (test); V: 0 C: 0; - https://gerrit.wikimedia.org/r/5790 [07:33:26] I'm really confused by this [07:33:36] wmf deployment-prep (betalabs) [07:33:40] hashar: hm? [07:33:47] we shouldn't be using labs project specific puppet configuration [07:33:58] Ryan_Lane: what you mean [07:34:09] Ryan_Lane: we need to be able to approve changes to puppet for projects [07:34:10] this is a clone of production [07:34:18] or puppet is not going to be useable [07:34:20] use the classes from production [07:34:26] those classes are just about installing a few specifics packages (joe, ack-grep) and an attempt to document the 'devops' group [07:34:28] make changes to them when they aren't generic enough [07:34:52] group which is not that useful btw since its not our primary group :-D [07:35:01] hashar: is that name ok [07:35:05] hashar: wmf deployment-prep (betalabs) [07:35:09] why not use project-deployment-prep, rather than devops/ [07:35:10] that's what is now in bugzilla [07:35:26] petan|wk: whatever name :-]  that one sounds good yes [07:35:31] if you are in the project, you're assumed to be able to deploy, right? [07:35:36] I don't know Ryan, devops was there already :( [07:35:37] I don't know who inserted wmf there, but I like using both [07:35:42] deployment-prep betalabs [07:35:55] so that it's clear which project in console it is [07:36:00] either way, don't make any puppet config specific to the project [07:36:08] puppet config is supposed to be generic [07:36:23] !realm | hashar [07:36:23] hashar: $realm is a variable used in puppet to determine which cluster a system is in. See also $site. [07:36:25] :o [07:36:26] if you need to do something labs specific, use $realm [07:36:31] the problem is that we will probably end up having to write a lot of production classes which will take time to validate [07:36:52] whereas adding specific labs classes let us quickly document our changes and implements the labs faster [07:36:53] !site [07:36:53] $site is a variable used in puppet to determine which datacenter a system is in. See also $realm. [07:36:56] it's bad enough that beta isn't using the production puppet classes [07:36:58] yay [07:37:02] datacenter? [07:37:10] if we start writing all new classes, they'll *never* be similar [07:37:25] Ryan_Lane: we need to make a new variable to $PROJECTNAME [07:37:37] so that I can get a project name in shell [07:37:41] hashar: what I'm saying is, nearly everything you need is already done [07:37:49] hashar: it just needs to be tweaked [07:38:02] see how it's done in production, and apply that to beta [07:38:11] petan|wk: yeah [07:38:18] I saw the bug :) [07:38:21] that is what I did for the syslog server. I tweaked the production class so we can choose the remote syslog IP [07:38:48] but I know for sure that ack-grep joe or tree will not make it in production since I already tried to get them installed on fenari :-] [07:38:59] and I still find them useful, hence the new class [07:39:03] wtf is joe [07:39:09] petan|wk: an editor [07:39:23] New review: Ryan Lane; "Please don't make classes specific to a labs project." [operations/puppet] (test); V: 0 C: -2; - https://gerrit.wikimedia.org/r/5790 [07:39:25] is it better than vim? [07:39:29] as for the "devops" groups, we could most probably use the default wmflabs one [07:39:44] what I am missing though is the ability to change the users primary group [07:39:54] we are all in svn(550) right now :-] [07:40:03] still, it shouldn't be specific to the project [07:40:20] make them a class, and import them where needed [07:40:26] but generic classes [07:40:48] stuff such as: package::ack-grep package::joe etc... [07:40:59] then I "just": include package::ack-grep [07:41:00] ? [07:41:43] yes [07:42:25] I think we do it in generic.pp for the most part right now? [07:42:38] indeed [07:43:07] !log nagios root: aptitude upgrade [07:43:08] Logged the message, Master [07:44:55] New review: Hashar; "Following a discussion with Ryan, we should use the default group project-deployment-prep instead of..." [operations/puppet] (test); V: 0 C: 0; - https://gerrit.wikimedia.org/r/5790 [07:45:04] Change abandoned: Hashar; "(no reason)" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/5790 [07:45:19] Ryan_Lane: can we upgrade dist on instances? [07:45:28] is it supported? [07:45:31] ubuntu 12 [07:45:41] I think you use it on prod [07:45:49] umm [07:45:51] I'd wait on that [07:45:56] I haven't tested if upgrades work [07:45:58] ok [07:46:03] and definitely don't do it yet on beta [07:46:08] unless you want everything to break [07:46:22] we haven't started upgrading most things in production [07:46:26] right [07:48:53] \o/ nagious [07:48:57] *nagios [07:57:27] Ryan_Lane: before you get to sleep, we will probably have to merge the production branch back in test :-D [07:57:28] probably not something you want to do right now ;-] [07:57:28] hahahahaha [07:57:28] good luck on that [07:57:28] it's so insanely diverged right now [07:59:51] New patchset: Hashar; "class to install the Joe editor" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6466 [08:00:05] New patchset: Hashar; "class to install the ack-grep utility" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6467 [08:00:20] New patchset: Hashar; "class to install the 'tree' utility" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6468 [08:00:30] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6466 [08:00:30] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6467 [08:00:30] New review: gerrit2; "Change did not pass lint check. You will need to send an amended patchset for this (see: https://lab..." [operations/puppet] (test); V: -1 - https://gerrit.wikimedia.org/r/6468 [08:06:28] New patchset: Hashar; "packages specifics to labs bastions" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6469 [08:06:42] New patchset: Hashar; "class to install the 'tree' utility" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6468 [08:06:56] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6469 [08:06:56] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6468 [08:13:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [08:13:35] New patchset: Hashar; "git ignore /private/" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6471 [08:13:49] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6471 [08:14:46] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [08:16:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [08:27:46] PROBLEM Free ram is now: WARNING on mobile-feeds i-000000c1 output: Warning: 17% free memory [08:37:30] New review: Dzahn; "Joe's Own Editor. http://joe-editor.sourceforge.net/" [operations/puppet] (test); V: 1 C: 2; - https://gerrit.wikimedia.org/r/6466 [08:37:33] Change merged: Dzahn; [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6466 [08:42:19] New review: Dzahn; "https://launchpad.net/ubuntu/+source/tree" [operations/puppet] (test); V: 1 C: 2; - https://gerrit.wikimedia.org/r/6468 [08:43:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [08:43:33] New review: Dzahn; "just saying. did you also know "ytree"?" [operations/puppet] (test); V: 0 C: 0; - https://gerrit.wikimedia.org/r/6468 [08:44:46] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [08:46:25] New review: Dzahn; "(no comment)" [operations/puppet] (test); V: 0 C: -1; - https://gerrit.wikimedia.org/r/6467 [08:46:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [08:54:26] New review: Dzahn; "yeah, i guess if $::realm == "labs" { is preferred, as used in lvs.pp. Just on a note, i also see va..." [operations/puppet] (test); V: 0 C: 1; - https://gerrit.wikimedia.org/r/6469 [09:07:46] PROBLEM Free ram is now: CRITICAL on mobile-feeds i-000000c1 output: Critical: 5% free memory [09:12:23] mutante: there is something wrong with puppet checks [09:12:48] I give up merging production in test :-( [09:12:55] toooooo many conflicts to solve [09:13:05] hm [09:13:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [09:14:46] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [09:15:27] petan|wk: ok, checking, i see [09:15:57] I think it's problem in nagios [09:16:12] what is the service which listen to passive checks [09:16:16] PROBLEM Disk Space is now: WARNING on nagios 127.0.0.1 output: DISK WARNING - free space: /home/petrb 3353 MB (19% inode=81%): [09:16:17] I think it's down [09:16:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [09:22:46] RECOVERY Puppet freshness is now: OK on labs-build1 i-0000006b output: puppet ran at Thu May 3 09:22:43 UTC 2012 [09:23:06] RECOVERY Puppet freshness is now: OK on deployment-backup i-000000f8 output: puppet ran at Thu May 3 09:22:58 UTC 2012 [09:23:16] RECOVERY Puppet freshness is now: OK on dev-solr i-00000152 output: puppet ran at Thu May 3 09:23:09 UTC 2012 [09:23:16] !log nagios starting snmptrapd [09:23:17] Logged the message, Master [09:23:46] RECOVERY Puppet freshness is now: OK on bots-dev i-00000190 output: puppet ran at Thu May 3 09:23:36 UTC 2012 [09:23:48] petan|wk: that is that part i mentioned back then that may not be reboot safe. [09:24:16] RECOVERY Puppet freshness is now: OK on dumps-4 i-0000017d output: puppet ran at Thu May 3 09:24:02 UTC 2012 [09:24:16] RECOVERY Puppet freshness is now: OK on p-b i-000000ae output: puppet ran at Thu May 3 09:24:02 UTC 2012 [09:24:16] RECOVERY Puppet freshness is now: OK on hugglewa-db i-00000188 output: puppet ran at Thu May 3 09:24:07 UTC 2012 [09:24:36] RECOVERY Puppet freshness is now: OK on reportcard2 i-000001ea output: puppet ran at Thu May 3 09:24:25 UTC 2012 [09:25:16] RECOVERY Puppet freshness is now: OK on wikisource-web i-000000fe output: puppet ran at Thu May 3 09:25:14 UTC 2012 [09:25:36] RECOVERY Puppet freshness is now: OK on incubator-bots2 i-00000119 output: puppet ran at Thu May 3 09:25:19 UTC 2012 [09:25:36] RECOVERY Puppet freshness is now: OK on pediapress-ocg1 i-00000233 output: puppet ran at Thu May 3 09:25:22 UTC 2012 [09:25:36] RECOVERY Puppet freshness is now: OK on labs-relay i-00000103 output: puppet ran at Thu May 3 09:25:26 UTC 2012 [09:25:46] RECOVERY Puppet freshness is now: OK on bots-nfs i-000000b1 output: puppet ran at Thu May 3 09:25:38 UTC 2012 [09:25:46] RECOVERY Puppet freshness is now: OK on bz-dev i-000001db output: puppet ran at Thu May 3 09:25:43 UTC 2012 [09:26:06] RECOVERY Puppet freshness is now: OK on building i-0000014d output: puppet ran at Thu May 3 09:25:53 UTC 2012 [09:26:06] RECOVERY Puppet freshness is now: OK on simplewikt i-00000149 output: puppet ran at Thu May 3 09:25:57 UTC 2012 [09:26:16] RECOVERY Puppet freshness is now: OK on mobile-feeds i-000000c1 output: puppet ran at Thu May 3 09:26:14 UTC 2012 [09:26:36] RECOVERY Puppet freshness is now: OK on firstinstance i-0000013e output: puppet ran at Thu May 3 09:26:17 UTC 2012 [09:26:36] RECOVERY Puppet freshness is now: OK on worker1 i-00000208 output: puppet ran at Thu May 3 09:26:28 UTC 2012 [09:27:06] RECOVERY Puppet freshness is now: OK on ganglia-collector i-000000b7 output: puppet ran at Thu May 3 09:26:55 UTC 2012 [09:27:16] RECOVERY Puppet freshness is now: OK on deployment-sql i-000000d0 output: puppet ran at Thu May 3 09:27:15 UTC 2012 [09:27:46] RECOVERY Puppet freshness is now: OK on wikidata-dev-3 i-00000225 output: puppet ran at Thu May 3 09:27:35 UTC 2012 [09:27:46] RECOVERY Puppet freshness is now: OK on bots-4 i-000000e8 output: puppet ran at Thu May 3 09:27:39 UTC 2012 [09:27:46] RECOVERY Puppet freshness is now: OK on aggregator1 i-0000010c output: puppet ran at Thu May 3 09:27:40 UTC 2012 [09:27:46] RECOVERY Puppet freshness is now: OK on webserver-lcarr i-00000134 output: puppet ran at Thu May 3 09:27:42 UTC 2012 [09:28:03] :O [09:28:16] RECOVERY Puppet freshness is now: OK on deployment-nfs-memc i-000000d7 output: puppet ran at Thu May 3 09:28:01 UTC 2012 [09:28:33] mutante: ok [09:28:36] RECOVERY Puppet freshness is now: OK on swift-be1 i-000001c7 output: puppet ran at Thu May 3 09:28:23 UTC 2012 [09:28:46] RECOVERY Puppet freshness is now: OK on dumps-2 i-00000174 output: puppet ran at Thu May 3 09:28:31 UTC 2012 [09:28:57] mutante: so you only did sudo service snmptrap restart? [09:29:06] RECOVERY Puppet freshness is now: OK on wikidata-dev-1 i-0000020c output: puppet ran at Thu May 3 09:28:47 UTC 2012 [09:29:06] RECOVERY Puppet freshness is now: OK on embed-sandbox i-000000d1 output: puppet ran at Thu May 3 09:28:47 UTC 2012 [09:29:06] RECOVERY Puppet freshness is now: OK on bots-sql3 i-000000b4 output: puppet ran at Thu May 3 09:28:50 UTC 2012 [09:29:06] RECOVERY Puppet freshness is now: OK on deployment-squid i-000000dc output: puppet ran at Thu May 3 09:28:50 UTC 2012 [09:29:16] RECOVERY Puppet freshness is now: OK on translation-memory-1 i-0000013a output: puppet ran at Thu May 3 09:29:14 UTC 2012 [09:29:36] RECOVERY Puppet freshness is now: OK on deployment-web i-00000217 output: puppet ran at Thu May 3 09:29:22 UTC 2012 [09:29:36] RECOVERY Puppet freshness is now: OK on swift-fe1 i-000001d2 output: puppet ran at Thu May 3 09:29:28 UTC 2012 [09:29:46] RECOVERY Puppet freshness is now: OK on incubator-apache i-00000211 output: puppet ran at Thu May 3 09:29:34 UTC 2012 [09:29:57] :D [09:31:05] petan|wk: no, i did this: [09:31:14] /usr/sbin/snmptrapd -On -Lsd -p /var/run/snmptrapd.pid [09:31:16] RECOVERY Puppet freshness is now: OK on fundraising-civicrm i-00000169 output: puppet ran at Thu May 3 09:31:14 UTC 2012 [09:31:20] put it in a bug for docs [09:31:32] https://bugzilla.wikimedia.org/show_bug.cgi?id=36470 [09:31:39] mutante: ok [09:31:55] !nagios-fix is !b 36470 [09:31:55] Key was added! [09:31:56] yea, there is no init script [09:32:00] thats the bug [10:29:09] lunch time will be back :-D [10:44:46] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [10:46:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [11:13:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [11:13:40] !log deployment-prep petrb: removing logrotate from all apaches it broke central log [11:13:42] Logged the message, Master [11:14:46] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [11:16:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [11:18:39] hashar: site is pretty fast now, how did we fix it [11:43:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [11:44:46] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [11:46:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [12:02:46] RECOVERY Free ram is now: OK on mobile-feeds i-000000c1 output: OK: 74% free memory [12:13:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [12:14:16] CACHE_ACCEL requested but no suitable object cache is present. You may want to install APC. [12:14:16] Backtrace: [12:14:16] #0 [internal function]: ObjectCache::newAccelerator(Array) [12:14:16] #1 [12:14:18] hashar: ^ [12:14:24] why is that [12:14:27] how to fix it [12:14:46] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [12:16:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [12:16:40] meh [12:16:57] hashar: where are you [12:17:05] :| [12:27:34] petan|wk: apt-get install php-apc fixed it on the transcoding node [12:27:40] ok [12:41:07] 05/03/2012 - 12:41:07 - Updating keys for diederik [12:41:13] 05/03/2012 - 12:41:13 - Updating keys for diederik [12:41:18] 05/03/2012 - 12:41:18 - Updating keys for diederik [12:41:21] 05/03/2012 - 12:41:21 - Updating keys for diederik [12:41:22] 05/03/2012 - 12:41:22 - Updating keys for diederik [12:41:41] 05/03/2012 - 12:41:41 - Updating keys for diederik [12:43:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [12:44:46] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [12:46:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [12:46:48] hey folks, i was fooling around with my ssh keys but i can no longer access bastion [12:46:56] i did register my public key on labs [12:47:05] any tips? [13:13:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [13:14:46] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [13:16:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [13:18:50] drdee_: are you loading the matching private key to the one ending in "MacBook-Pro.local"? [13:19:36] i guess you had to wait for next puppet run, i cant check on bastion myself though [13:19:53] i can tell you though the one above is the one you have in LDAP [13:34:44] New review: Dzahn; "think you can do it like here:" [operations/puppet] (test); V: 0 C: 1; - https://gerrit.wikimedia.org/r/6469 [13:39:22] yes [13:39:44] mutante: access still denied [13:43:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [13:44:10] drdee_: maybe whitespace at beginning or end? or a line break? do you use this key elsewhere or just created a new one [13:44:56] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [13:46:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [13:47:40] mutante: when i visit the Manage SSH page on Labsconsole I get this message: There were no Nova credentials found for your user account. Please ask a Nova administrator to create credentials for you. [13:52:46] drdee_: that's weird. i can still see your credentials with the nova command line tool [13:53:36] drdee_: "diederik" is still a nova / LDAP user.. uhmpf [13:54:06] something went wrong when it was updated several times so quickly it seems [13:55:13] mutante: what to do? [13:55:25] Special:NovaKey , right? [13:55:44] you are logged in but it does not let you paste a key anymore? [13:56:04] do you still see a key there now? [13:56:18] exactly [13:56:24] i don't see a key [13:56:53] logging out / logging in fixes it [13:56:55] hmm.. almost sounds like some kind of flood protection [13:57:04] ah [13:57:36] 05/03/2012 - 13:57:36 - Updating keys for diederik [13:57:47] let's try this [13:58:07] 05/03/2012 - 13:58:07 - Updating keys for diederik [13:58:13] 05/03/2012 - 13:58:12 - Updating keys for diederik [13:58:17] 05/03/2012 - 13:58:17 - Updating keys for diederik [13:58:21] 05/03/2012 - 13:58:21 - Updating keys for diederik [13:58:22] 05/03/2012 - 13:58:22 - Updating keys for diederik [14:04:46] drdee_: better? [14:07:26] hi all, i'm trying to upload my extention to the mediawiki git repository, but i get really lost in all the walkthroughs in how to do it. [14:07:38] can anyone help me with this? [14:13:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [14:14:15] HV_: do you already have a user? [14:14:28] yes [14:15:26] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [14:15:32] HV_: general git setup and instructions for cloning are here: https://labsconsole.wikimedia.org/wiki/Git#Git.2FGerrit_and_the_repositories [14:16:07] for the part "what is the right place for my extension" you might want to ask in #mediawiki-dev as well [14:16:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [14:18:41] well the way i see it i need to create a project in gerrit in /mediawiki/extentions/something [14:18:59] and then upload the extention in it [14:19:16] mutante: no [14:19:27] still being kicked out from bastion [14:20:08] HV_: sounds about right, i'm ops though not dev. uploading = commit in git and then push for review into gerrit , yes [14:21:05] drdee_: sigh .. how did this start? just replaced your key with a new one? [14:21:45] i tried to use proxycommand as described on wiki [14:21:58] that wouldn't work even though i could ssh directly into bastion [14:22:06] then i thought i should start clean [14:22:08] ah, can you try going back to the state before? [14:22:19] and removed all files from bastion:~/.ssh/ [14:22:27] no i am locked out :) [14:22:43] even with redeploying my public key [14:22:46] i mean in the local setup re: proxy and stuff [14:23:23] ah, you removed files from .ssh manually? [14:23:31] possible you changed permissions somehow? [14:23:39] on the directory or files in it [14:24:03] just removed the files manually, did not fiddle with permissions at all [14:24:05] i cant't check :p [14:25:42] hold on.. gotta try add myself then [14:31:05] !log bastion added myself to admin roles to see if that lets me login to debug ssh key for diederik, but it doesnt [14:31:06] Logged the message, Master [14:31:19] drdee_: sorry, running out of ideas, gotta draw the ryan card [14:31:33] ok thx [14:31:36] it's cause i can't check on the host itself [14:31:54] and what i could do, (LDAP, nova-manage) i still see you as normal [14:34:29] drdee_: go back to old key that worked before and revert all changes you made in client setup, just in case [14:34:44] because it does not look like it is NOT updating your key, see bot output [14:35:24] i didnt't make any local changes, the only change i made was removing some files from bastion [14:36:27] ok, i guess puppet fails to recreate them now for some permission issue or because .ssh itself is also gone and it doesnt create it [14:43:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [14:45:26] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [14:46:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [14:53:35] New patchset: Hashar; "(bug 36246) rsyslog: send logs to a custom server" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/5813 [14:53:50] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/5813 [15:13:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [15:15:26] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [15:16:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [15:18:47] New patchset: Hashar; "rsyslog: ability to send logs to a custom server" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/5813 [15:19:02] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/5813 [15:22:37] New review: Mark Bergsma; "(no comment)" [operations/puppet] (test); V: 0 C: 2; - https://gerrit.wikimedia.org/r/5813 [15:22:41] Change merged: Mark Bergsma; [operations/puppet] (test) - https://gerrit.wikimedia.org/r/5813 [15:27:33] petan|wk: looks like we are going to get syslog messages again :D [15:27:34] -*.info;mail.none;authpriv.none;cron.none @syslog.pmtpa.wmnet [15:27:34] +*.info;mail.none;authpriv.none;cron.none @deployment-dbdump.pmtpa.wmflabs [15:27:39] https://bugzilla.wikimedia.org/show_bug.cgi?id=36246 [15:27:43] running puppet on apaches [15:29:21] !log deployment-prep hashar: running puppet on apaches to have them send their syslog to deployment-dbdump (bug 36246) [15:29:22] Logged the message, Master [15:34:33] Change abandoned: Mark Bergsma; "(no reason)" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/2012 [15:34:48] Change abandoned: Mark Bergsma; "(no reason)" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/1464 [15:43:26] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [15:45:26] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [15:46:26] New patchset: Hashar; "avoid syslog loop in labs" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6528 [15:46:36] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [15:46:40] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6528 [15:48:33] New review: Mark Bergsma; "(no comment)" [operations/puppet] (test); V: 0 C: 2; - https://gerrit.wikimedia.org/r/6528 [15:48:36] Change merged: Mark Bergsma; [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6528 [15:53:45] !log deployment-prep hashar: adding misc::mediawiki-logger and misc::scripts classes to deployment-dbdump [15:53:46] Logged the message, Master [16:02:23] New patchset: Hashar; "class to install the ack-grep utility" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6467 [16:02:37] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6467 [16:02:47] New review: Hashar; "(no comment)" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6467 [16:14:04] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [16:16:14] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [16:16:44] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [16:20:13] * jeremyb waves petan|wk petan_ [16:41:13] 05/03/2012 - 16:41:13 - Updating keys for diederik [16:42:12] very weird, the final deployment of my ssh key took 2 hours [16:42:18] but now i have access again to bastion [16:44:04] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [16:45:43] drdee_: ryan just fixed that for you, we talked on another channel [16:45:51] thx ryan_lane and mutante [16:45:53] there was a bug [16:46:07] the stupid script has a bug :( [16:46:14] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [16:46:22] it's solved! that's all that matter [16:46:23] s [16:46:27] :) [16:46:31] kind of [16:46:44] it'll break again if you remove the entire directory, or just the authorized_keys file [16:46:44] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [16:46:48] until I fix the script [16:58:08] hey ryan (howdy) [16:58:26] 03 16:51:28 -!- Ryan_Lane [~Ryan_Lane@wikimedia/Ryan-lane] has quit [Quit: Leaving.] [17:00:06] so i cannot get the proxy access to work, i can ssh directly to bastion and to my instance but through the proxy I get access denied [17:00:26] which method are you using? [17:00:37] the one from labs wiki [17:00:48] so... which method are you using? [17:00:57] there is no *one* from the labs wiki [17:01:01] there's like 3 at least [17:01:25] my reccomendation is ProxyCommand [17:01:42] the one that says Using ProxyCommand ssh option [17:02:07] ok. so what are you typing? which box are you trying to get into? [17:02:17] what does your local ssh config say? [17:02:56] ssh reportcard2.pmtpa.wmflabs [17:03:07] ssh config is exactly copied from the wiki [17:03:40] and what does it say? [17:04:15] permission denied [17:04:29] but if i access through bastion myself then it does work [17:04:40] oh now it said: "ssh_exchange_identification: Connection closed by remote host" [17:05:48] you should try ssh -vvv [17:07:31] jeremyb: http://pastebin.com/SRiudkBm [17:08:50] huh [17:09:19] ohhh [17:09:35] drdee_: do you use bastion or bastion-restricted? [17:09:50] i suspect bastion, let me check [17:10:14] bastion1 i see [17:10:16] i mean when you say you can ssh to bastion and it works from there. which one are you connecting to? [17:10:19] huh [17:10:29] bastion.wmflabs.org [17:12:38] Host yourinstance [17:13:02] ProxyCommand ssh -W %h:%p -A user@bastion.wmflabs.org [17:13:17] should do it [17:13:20] oh, i see a problem [17:13:28] drdee_: pastebin your whole local config [17:13:31] ok [17:14:04] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [17:14:21] jeremyb: http://pastebin.com/ay97WqJJ [17:14:51] yeah, first 3 lines go away [17:14:55] * jeremyb keeps reading [17:15:39] ok, that should fix it? [17:15:48] trying... [17:15:48] try again with those gone [17:16:14] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [17:16:18] key_type_from_name: unknown key type '-----BEGIN' [17:16:22] AMAZING! [17:16:26] it works [17:16:27] thx [17:16:30] sure ;) [17:16:44] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [17:16:46] i wonder why there wasn't a more useful error [17:18:02] mutante: that's fairly normal. compare to your own output for `ssh -vvv instancethatsnotabastion.pmtpa.wmflabs echo foo 2>&1` [17:31:06] New review: Ryan Lane; "(no comment)" [labs/private] (master); V: 0 C: 2; - https://gerrit.wikimedia.org/r/5792 [17:31:15] New review: Ryan Lane; "(no comment)" [labs/private] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/5792 [17:31:17] Change merged: Ryan Lane; [labs/private] (master) - https://gerrit.wikimedia.org/r/5792 [17:36:36] hmm [17:36:46] when I try git push-for-review-test [17:36:48] I get: [17:37:51] To ssh://thehelpfulone@gerrit.wikimedia.org:29418/operations/puppet ! [remote rejected] HEAD -> refs/for/test error: failed to push some refs to 'ssh://thehelpfulone@gerrit.wikimedia.org:29418/operations/puppet' [17:38:51] oh I missed a step I think [17:38:52] * Thehelpfulone tries again [17:41:36] Thehelpfulone: fyi, there are generally 2 different ways, the one without "git-review" but WITH the push-for-review-test alias (as described on that page i showed you and what i said) and the one using git-review [17:41:50] just to avoid some possible confusion. really need to run now though. afk [17:42:00] ok thanks [17:42:20] <^demon> Ryan_Lane: Stupid question, how do I tell if a user is in the 'wmf' group in ldap? And how do I add someone to it, if I needed to? [17:42:34] ^demon: ldaplist -l group wmf [17:42:46] ^demon: modify-ldap-group —addmembers= wmf [17:43:07] <^demon> Awesome, thanks. [17:43:10] * ^demon writes that down [17:43:24] hm, is that not documented anywhere? [17:43:50] <^demon> I didn't look to be honest. [17:44:04] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [17:44:42] Wikitech "Ldaplist" vs. just "LDAP" .. /out [17:45:08] <^demon> Duh, it's right there. [17:45:09] <^demon> Thanks [17:45:46] <^demon> Ryan_Lane: And other than the stuff in gerrit, that doesn't give any extra access, right? [17:46:14] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [17:46:34] for some reason I deleted a .crt [17:46:41] when I try to revert it in git GUI i get error: unable to create file files/ssl/*.wikimedia.org.crt (Invalid argument) [17:46:44] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [17:46:54] oh I didn't delete it [17:46:59] it's just windows doesn't like it [17:47:18] ^demon: nope [17:47:31] ^demon: but that is only for people who are wmf staff or contractors [17:47:38] <^demon> Right, that's what I was using it for. [17:47:41] * Ryan_Lane nods [17:52:28] Thehelpfulone: i think the solution is don't use windows [17:52:40] :P [17:53:51] same problem :( [17:54:10] same problem when you use notwindows?! [17:54:25] I don't have an alternate to windowz :P [17:54:33] run a live cd or a VM [17:54:50] ooh I think it did something [17:55:09] if it had done something it would have spoken here i think [17:55:13] heh nothing on gerrit :( [17:55:47] so it's the same error message, any ideas how to fix it? [17:56:44] <^demon> Does `git log` show your commit at the top? [17:56:48] <^demon> (just checking the obvious...) [17:56:53] Ryan_Lane: so, last night we had UDP recent changes going to prod and ending up in #cvn-simplewikis (and probably other places too). i fixed it on the labs side but this is now at least the 2nd time it's happened. should that be blocked for good in nova or in leslie's magic or in the per instance firewall stuff? (or at all?). the problem with the per instance stuff is it's immutable [17:56:57] New patchset: Thehelpfulone; "Making the htaccess production only" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6541 [17:57:12] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6541 [17:57:18] hmm that was a mixture of the git GUI [17:57:19] maybe [17:57:22] and git bash [17:58:44] can you review that ^demon please? and yeah feel free to check the obvious with me [18:00:08] New review: Demon; "(no comment)" [operations/puppet] (test); V: 0 C: 2; - https://gerrit.wikimedia.org/r/6541 [18:00:40] <^demon> Hrm, why don't I have Submit on operations/puppet on test? [18:01:19] <^demon> I have +2 but not submit :\ [18:01:39] heh [18:02:00] ^demon: yeah we have to get someone from ops to merge the changes :-(( [18:02:17] on the + side, that force a review! [18:03:34] New review: Demon; "Didn't this happen before? I seem to remember mark saying no." [operations/puppet] (test); V: 0 C: 0; - https://gerrit.wikimedia.org/r/6468 [18:04:42] thanks ^demon [18:04:47] <^demon> yw. [18:04:57] PROBLEM dpkg-check is now: CRITICAL on deployment-dbdump i-000000d2 output: DPKG CRITICAL dpkg reports broken packages [18:05:05] * Thehelpfulone points Ryan_Lane to https://gerrit.wikimedia.org/r/6541 to press that nice submit button [18:05:23] canvassing [18:05:40] it needs indention [18:05:53] i was going to say the same thing [18:05:58] that concept does not exist on labs yet :P [18:06:01] * ^demon indents Ryan [18:06:01] also could be merged with the block above it? [18:06:04] canvassing I mean [18:06:57] indentation? where abouts? [18:09:18] you nested a block inside another block. so the logical level of a block increased but it's indentation didn't [18:11:16] New patchset: Thehelpfulone; "Merging a block, trying to fix indentation." [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6543 [18:11:29] umm [18:11:30] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6543 [18:11:58] you should have amended the other change [18:12:07] rather than making a new change that's dependent on the old one [18:12:20] I did git push-for-review-test [18:12:23] New review: Demon; "You should amend the previous commit rather than adding a new one." [operations/puppet] (test); V: 0 C: -2; - https://gerrit.wikimedia.org/r/6543 [18:12:24] what would you do instead? [18:12:33] on the old commit, git commit -a --amend [18:12:37] then push [18:13:25] now I'll try to figure out how to get to the old commit [18:13:57] you can rebase to remove the new one, then amend the old one [18:14:13] rebase -i HEAD^2 [18:14:19] delete the line with the new commit [18:14:20] save [18:14:24] or just squash [18:14:31] squash works too [18:14:33] <^demon> Ryan_Lane: Mind taking a look at 5810? [18:14:48] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [18:14:58] RECOVERY dpkg-check is now: OK on deployment-dbdump i-000000d2 output: All packages OK [18:15:00] how do I squash? [18:16:18] Thehelpfulone: same command Ryan_Lane gave you [18:16:28] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [18:16:39] one of the things you can do with commits in the editor that appears is squash [18:16:48] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [18:16:48] ^demon: reviewed [18:16:50] it tells me that it's an unknown option --i [18:16:59] -- i* [18:17:02] huh? [18:17:09] did you prefix with 'git '? [18:17:11] so I'm in git bash in my repo [18:17:26] Thehelpfulone: just -i not --i [18:17:27] duh [18:17:40] it says fatal: Needed a single revision now [18:17:47] invalid upstream HEAD^2 [18:17:52] <^demon> Ryan_Lane: Are you saying make the file owned by root:gerrit2 or root:root? [18:17:57] root root [18:18:02] all the files in etc should be [18:18:06] if they aren't [18:18:12] gerrit shouldn't be able to change its own config [18:19:01] Cannot rebase: You have unstaged changes. Please commit or stash them. [18:19:03] I really don't like that everything is under the user's home directory [18:19:09] HEAD~2 not ^2 [18:19:16] Thehelpfulone: you probably have a failed rebase [18:19:20] git rebase —abort [18:19:26] that's -- [18:19:39] "No rebase in progress?" [18:19:54] <^demon> Ryan_Lane: About half are root:root and half are gerrit2:gerrit2. [18:20:00] :( [18:20:14] Thehelpfulone: then you have uncommitted changes [18:20:20] git reset --hard [18:20:23] yes, I can't commit it though [18:20:52] fatal: Unable to create .git/index.lock File exists [18:21:04] should I delete that file? [18:21:08] um [18:21:11] I have no clue [18:21:16] RoanKattouw: any clue here? [18:21:22] you have the gui and shell simultaneously open? [18:21:31] nope the GUI is closed now [18:21:36] but it was at one point, yes [18:21:47] you want to get something like this to show up: [18:21:48] pick b274da8 Making the htaccess production only [18:21:48] pick 796580a Merging a block, trying to fix indentation. [18:21:58] <^demon> Actually, most are gerrit2:gerrit2. Only ones that are root:root are /etc/gerrit2/*, gerrit.war and gerrit.sh. [18:22:06] and then change the second one from "pick" to "squash" (or "s") [18:22:09] we should fix that [18:22:31] how do I get that to show up jeremyb? [18:22:43] damn I hate not being able to copy these error messages [18:22:50] I deleted the index.lock file [18:22:57] <^demon> /etc/gerrit2 just seems to be where you stashed some .sql files. Everything else gerrit is in /var/lib/gerrit2/ [18:23:05] now it's Fatal: Could not reset index file to revision 'HEAD' [18:23:57] in the same error message, error: unable to create file files/ssl/*.wikimedia.org.crt (Invalid argument) [18:24:01] because windows doesn't like * [18:24:06] !log deployment-prep installed on dbdump misc::syslog-server [18:24:07] Logged the message, Master [18:25:41] Thehelpfulone: clone from scratch [18:25:57] the whole thing? [18:26:06] hmm okay, let me figure out how I did that before [18:27:06] <^demon> Before recloning, try `git reset --hard origin/test` (or production or whatever branch you're trying to regain) [18:27:24] it's the labs one so test I think [18:28:08] nope still can't set the index file [18:29:35] <^demon> How about `git reset --hard ORIG_HEAD` [18:29:59] unable to reset to revision "ORIG_HEAD" [18:30:01] ^demon: you know this is windows? [18:30:09] windows is evil [18:30:37] `git reset --hard 796580ad99775cf673182f1a525492fdf6cb7401` [18:31:23] how do I copy/paste into git bash? [18:31:26] it's really annoying me [18:31:43] right click? or right click the menu bar? [18:31:47] should be there somewhere [18:32:02] anyway, your 2nd patchset wasn't even the right indentation [18:32:16] yeah I don't quite know what I'm doing with indentation [18:32:36] still can't reset the bloody index file [18:32:47] i repeat, clone from scratch ;) [18:33:01] <^demon> Yeah, at this point it'll just be faster. [18:33:06] so should I delete everything so far? or reclone in a new place? [18:33:14] new place [18:33:27] `git remote -v` will tell you what your current origin is [18:33:34] and you can reclone from the same place [18:34:01] when you've done that then: [18:34:04] git fetch origin refs/changes/*:refs/changes/* [18:34:14] git checkout refs/changes/43/6543/1 [18:34:27] git rebase -i HEAD^^ [18:34:37] so I want git clone ssh://@gerrit.wikimedia.org:29418/operations/puppet.git to begin with? [18:34:48] I think I'm going to do this all through bash, that GUI is probably causing problems too [18:34:51] whatever it says in `git remote -v` [18:34:57] yes, good idea [18:37:57] ok I'll be back in a bit then - hopefully this clone works ;) [18:39:06] Thehelpfulone: say my name [18:39:12] Thehelpfulone: you made a dartar? [18:39:24] (when you return) [18:43:37] Ryan_Lane: Could I have a labs project for VisualEditor please? [18:43:45] yep [18:43:50] <^demon> Ryan_Lane: When do you think we could do the gerrit upgrade? [18:44:05] we were thinking friday, right? [18:44:27] RoanKattouw: done [18:44:31] <^demon> Yeah, nothing's on the calendar for tomorrow and my schedule's wide open. [18:44:54] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [18:45:10] Ryan_Lane: thanks [18:45:39] 05/03/2012 - 18:45:39 - Creating a project directory for visualeditor [18:45:39] 05/03/2012 - 18:45:39 - Creating a home directory for catrope at /export/home/visualeditor/catrope [18:45:43] yw [18:46:43] 05/03/2012 - 18:46:43 - Updating keys for catrope [18:46:44] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [18:47:04] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [18:51:10] New patchset: Hashar; "nfs::home::wikipedia describe /home/wikipedia" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6545 [18:51:17] ^^^--- probably sure this one is wrong [18:51:24] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6545 [18:51:30] but I do need to have a /home/wikipedia directory :-] [18:52:31] Ryan_Lane: I know he's on a different floor, but do you have an intra-office phone system, and if so could you ask Jamesofur to come online? [18:52:38] no [18:52:42] I'm not a paging system, sorry [18:52:52] we're also in the middle of an outage [18:53:01] oh, sorry, didn't know [18:54:26] New patchset: Hashar; "syslog-server requires /home/wikipedia" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6546 [18:54:41] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6546 [19:09:44] PROBLEM Free ram is now: CRITICAL on mobile-enwp i-000000ce output: Critical: 5% free memory [19:09:53] mkdir: cannot create directory `/home/wikipedia': No such file or directory [19:09:57] I love Unix :-] [19:10:09] as cryptic as MS Windows [19:13:54] PROBLEM Current Load is now: CRITICAL on mobile-enwp i-000000ce output: CHECK_NRPE: Socket timeout after 10 seconds. [19:15:24] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [19:16:44] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [19:17:04] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [19:17:48] ok back jeremyb - it seems like git crashed [19:18:07] or it could have been this outage [19:20:44] PROBLEM host: mobile-enwp is DOWN address: i-000000ce CRITICAL - Host Unreachable (i-000000ce) [19:21:05] <^demon> Gerrit is up for me. [19:22:30] I'll try again ;) [19:23:25] jeremyb: and a dartar? [19:23:30] New review: Rich Smith; "(no comment)" [operations/puppet] (test) C: 1; - https://gerrit.wikimedia.org/r/6546 [19:23:39] oh I see :P [19:23:52] yes but you weren't supposed to figure that out :P [19:24:00] New patchset: Hashar; "syslog-server requires /home/wikipedia/syslog" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6546 [19:24:15] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6546 [19:24:27] Sven_Manguard: I might be able to get hold of him, anything you need him for in particular? [19:24:56] PM cha [19:24:58] New review: Rich Smith; "(no comment)" [operations/puppet] (test) C: 1; - https://gerrit.wikimedia.org/r/6546 [19:25:00] chat* [19:27:03] New review: Rich Smith; "(no comment)" [operations/puppet] (test) C: 1; - https://gerrit.wikimedia.org/r/6545 [19:28:23] Thehelpfulone: you were making a clone and then DarTar_clone showed up [19:28:38] Thehelpfulone: anyway, did the clone work? [19:28:41] yep I figured that out [19:28:45] so it didn't work the first time [19:28:50] running it again with fingers crossed [19:28:51] did you run the other stuff i said to make? [19:28:55] s/make/run/ [19:29:06] I mean the actual clone in the first place didn't work [19:29:09] oh, clone still going then [19:29:13] yeah [19:29:13] ok, back in a bit [19:30:35] New review: Hashar; "I honestly don't care. I originally just did it the puppet way aka:" [operations/puppet] (test); V: 0 C: 0; - https://gerrit.wikimedia.org/r/6469 [19:30:40] sure thanks [19:41:21] Sven_Manguard: I was told you were looking for me :) [19:41:46] (apologies if I have to leave relatively quickly, I'm out of the office for a workshop but waiting for it to start ;) ) [19:43:44] PROBLEM dpkg-check is now: CRITICAL on log1 i-00000239 output: CHECK_NRPE: Error - Could not complete SSL handshake. [19:45:04] PROBLEM Current Load is now: CRITICAL on log1 i-00000239 output: CHECK_NRPE: Error - Could not complete SSL handshake. [19:45:24] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [19:45:44] PROBLEM Current Users is now: CRITICAL on log1 i-00000239 output: CHECK_NRPE: Error - Could not complete SSL handshake. [19:46:19] PROBLEM Disk Space is now: CRITICAL on log1 i-00000239 output: CHECK_NRPE: Error - Could not complete SSL handshake. [19:46:43] jeremyb: okay so I followed your instructions, same cannot rebase you have unstaged changes [19:46:44] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [19:46:54] PROBLEM Free ram is now: CRITICAL on log1 i-00000239 output: CHECK_NRPE: Error - Could not complete SSL handshake. [19:47:04] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [19:48:14] PROBLEM Total Processes is now: CRITICAL on log1 i-00000239 output: CHECK_NRPE: Error - Could not complete SSL handshake. [19:51:04] PROBLEM host: mobile-enwp is DOWN address: i-000000ce CRITICAL - Host Unreachable (i-000000ce) [19:55:04] RECOVERY Current Load is now: OK on log1 i-00000239 output: OK - load average: 0.30, 0.25, 0.22 [19:55:44] RECOVERY Current Users is now: OK on log1 i-00000239 output: USERS OK - 1 users currently logged in [19:56:14] RECOVERY Disk Space is now: OK on log1 i-00000239 output: DISK OK [19:56:54] RECOVERY Free ram is now: OK on log1 i-00000239 output: OK: 87% free memory [19:58:14] RECOVERY Total Processes is now: OK on log1 i-00000239 output: PROCS OK: 86 processes [19:58:44] RECOVERY dpkg-check is now: OK on log1 i-00000239 output: All packages OK [20:00:48] any ideas ^demon? ([20:46:44] jeremyb: okay so I followed your instructions, same cannot rebase you have unstaged changes) [20:01:47] <^demon> If you've got stuff you haven't committed yet, you can't do a rebase. [20:01:54] <^demon> Either get rid of the changes or stash them [20:03:00] how do I get rid of them? [20:04:13] <^demon> git reset --hard [20:06:04] Thehelpfulone: git reset --hard refs/changes/43/6543/1 [20:06:21] I did a git stash save --keep-index [20:06:25] then git stash drop [20:06:31] seems to have done something [20:06:41] Dropped refs/stash@{0} (e4328e02336391c2993638cfac092ac7620124f0) [20:06:49] eww, no stash required [20:06:57] should I try the rebase or do your git reset --hard? [20:06:59] just do that above [20:07:03] ok [20:07:54] $ git reset --hard refs/changes/43/6543/1 [20:07:54] error: unable to create file files/ssl/*.wikimedia.org.crt (Invalid argument) [20:07:55] fatal: Could not reset index file to revision 'refs/changes/43/6543/1'. [20:08:25] windows doesn't do * in file names [20:08:41] <^demon> That was already fixed in production. Somebody should cherry-pick that over to test. [20:09:07] what? you think it's a problem in a hook? [20:10:27] <^demon> Windows doesn't like *, so who knows in what fun ways git might explode. [20:12:00] puppet test branch question: [20:12:00] i want to set up some minimal configs on my labs instance [20:12:00] but these configs will never be used in production [20:12:00] i want to set up nginx, varnish + ncsa, squid, udp2log, etc. [20:12:00] but I want to do it just for testing changes [20:12:00] much like I've done on my local VM now [20:12:01] but more officially in labs [20:12:01] i could skip puppet altogether [20:12:02] but i thought it'd be nice to have the labs instance puppetized [20:12:02] i think I need to make new classes in order to install this stuff without all the production files [20:12:03] For example, I just need squid doing a simple reverse proxy to localhost [20:12:03] there is no way for me to just install squid and set up my custom conf file [20:12:04] there's no squid base class, as far as I can tell [20:12:04] so I guess i'll create a class for my machine? [20:12:05] which is kinda dumb [20:12:05] or a class for logging_tests [20:12:51] ottomata: no flooding here... [20:13:04] drats [20:13:09] but I typed so much over in ops [20:13:10] ^demon: well that's probably a git bug? [20:13:14] just wanted to copy paste! [20:13:35] ottomata: you could have merged some short lines into single lines? pastebin? [20:13:39] ottomata: summary? [20:13:41] does that mean you will not get my messages? or that they will come in slowly? [20:13:42] just saying [20:13:44] haa, yeah [20:13:48] um [20:13:51] tl;dr [20:13:52] <^demon> jeremyb: Probably, but easily worked around by just renaming the file :) [20:14:01] want to test logging setups on a simple single labs machine [20:14:06] cannot include classes as they are [20:14:08] ^demon: i think it's named based on the cert CN, no? [20:14:12] too much irrelevant production stuff [20:14:19] should I even bother puppetizing? [20:14:30] <^demon> jeremyb: In production, we renamed it to star.whatever instead of *.whatever [20:14:48] so can someone rename it in labs please? [20:14:54] ^demon: sure. so, again a git bug i think [20:14:55] i need to have, squid, nginx, varnish, udp2log, filters, etc. [20:15:03] with very minimal setups [20:15:06] and my own custom configs [20:15:20] ohhhh [20:15:24] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [20:15:25] i'm confused. ignore me [20:15:31] Thehelpfulone: sure, doing it now [20:15:36] thanks [20:16:03] still you'll have trouble checking out the version you're trying to change... [20:16:21] :( [20:16:44] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [20:17:04] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [20:17:35] ottomata: so, we don't have squid puppetized [20:17:43] and may never have it puppetized [20:18:01] ottomata: don't make specific puppet classes! [20:18:04] always generic [20:18:17] yeah i know, but for labs test branch? [20:18:19] does it matter? [20:18:27] it'll eventually be merged into production [20:18:28] i mean, to install nginx right now [20:18:34] i have to include protoproxy stuff [20:18:43] make that more generic [20:18:44] test will be merged into production? [20:18:47] ideally, turn things into modules [20:19:00] did i hear modules? [20:19:03] blasphemy [20:19:05] haha [20:19:18] test originally was meant to be constantly merged with production [20:19:24] somehow I doubt test will be merged into production :p [20:19:24] but it's totally fucked right now and hasn't been merged in ages [20:19:39] we have plans on getting rid of the test branch [20:19:45] yay [20:19:49] paravoid is working on per-project feature branches [20:19:55] we'll likely run default off the production branch [20:19:59] aye [20:20:01] that's cool [20:20:07] and changes will be done in short-lived(ish) feature branches [20:20:14] this is kind of the idea right now [20:20:18] i like that [20:20:21] in the meantime [20:20:22] it may change depending on his implementation [20:20:26] i want to install squid [20:20:31] with a very minimal reverse proxy/cache [20:20:35] actually no cache [20:20:37] just reverse proxy [20:20:42] why use squid, then? [20:20:46] for testing logs [20:20:48] that's it [20:20:48] ah [20:20:55] apt-get install squid-frontend [20:21:02] so no puppetize? [20:21:04] PROBLEM host: mobile-enwp is DOWN address: i-000000ce CRITICAL - Host Unreachable (i-000000ce) [20:21:06] i'm fine with that :) [20:21:08] for labs [20:21:12] why bother, if it's just for testing? [20:21:17] right [20:21:22] because the branch is called test? [20:21:26] puppetization is for long-lives things [20:21:37] again, I *hate* the test branch [20:21:37] well, we'd like this to be a place to test logging changes [20:21:38] before they are made [20:21:43] well this is a long-lives thing [20:21:43] * Ryan_Lane nods [20:21:48] you can puppetize it, if you'd like [20:21:56] haha, not really , in the test branch it does not sound like fun [20:22:01] i'd have to refactor lots of stuff [20:22:06] or do a custom class just for me [20:22:10] it would be nice to have a basic squid config for labs, based on the production config [20:22:22] maybe i will wait [20:22:25] * Ryan_Lane nods [20:22:28] until the per-branch stuff is done [20:22:38] and just setup manually for now [20:23:03] mmk, thankyouuu [20:24:55] jeremyb: what should I try next? ;) [20:25:11] Thehelpfulone: what happens when you get that error? [20:25:25] I go back to the input box [20:25:33] Thehelpfulone: what does `git status` say? [20:26:30] # Not currently on any branch. [20:26:31] # Changes not staged for commit: [20:26:32] # deleted: files/ssl/*.wikimedia.org.crt [20:26:33] no changes added to commit (use "git add" and/or "git commit -a") [20:26:43] i cut out some instructions from there [20:27:46] oh maybe it's not loaded the test branch for some reason? [20:28:18] ok, that's find [20:28:21] fine* [20:28:33] now `git rebase -i HEAD^^` [20:29:06] cannot rebase - you have unstaged changes [20:29:12] please commit or stash them [20:29:24] it's to do with that .crt I think have you renamed it so that I can pull? [20:31:06] ok, we can do this still ;P [20:31:09] git commit -a [20:31:16] then `git rebase -i HEAD^^` [20:31:27] (write any garbage msg) [20:31:31] for -a [20:31:39] just so you can get to the rebase [20:31:51] err [20:32:01] make that: git rebase -i HEAD^^^ [20:32:09] one extra commit now that you have the garbage on top [20:32:19] one I'm in git commit -a [20:32:27] do I type that git rebase -i HEAD^^^ ? [20:32:34] then hitting enter just takes me to a new line [20:33:31] no, git commit -a takes you to an editor [20:33:36] yeah [20:33:38] throw in a garbage msg and save [20:33:45] then git rebase -i HEAD^^^ [20:33:46] how do I save? [20:33:52] idk, depends what editor [20:34:38] I think it's vim [20:34:56] so, save with ZZ [20:35:42] just type that? [20:36:02] yes [20:36:11] (capitals) [20:36:21] damn [20:36:29] I had to close git because it was stuck [20:36:35] but now I've got that index.lock error [20:38:03] anyway, not dealing with * right now because it's also still there in prod [20:38:24] i'm tempted to give up and just say i won't help with windows. period [20:38:34] you really need to use something else [20:38:43] cygwin even [20:38:55] but as i said before, live cd or a VM [20:39:24] ok so I managed to fix the index lock error [20:39:28] I'll try that git commit -a again [20:39:33] wait [20:39:42] just do git commit -a -m '.' [20:39:53] then you don't have to worry about getting an editor again [20:40:28] gah I'm already in the editor again [20:40:51] try this: :wq [20:40:58] (with the colon on the front) [20:41:41] ok yeah [20:41:47] I had to press ctrl +c first [20:41:54] I found this http://www.tuxfiles.org/linuxhelp/vimcheat.html [20:41:58] whatever, it's done now? [20:42:01] yep [20:42:05] trying the git rebase now [20:42:27] when the rebase is up, copy in the first 3 lines from there to here [20:42:38] brb [20:42:41] ok [20:43:06] pick b274da8 Making the htaccess production only [20:43:07] pick 796580a Merging a block, trying to fix indentation. [20:43:08] pick f894389 thanks jeremy b [20:44:03] replace the last two picks with squash? [20:44:07] no [20:44:33] delete the last line. replace the first word of the middle line with squash [20:44:35] it tells me I can delete them too, "if you remove a line here THAT COMMIT WILL BE LOST." [20:44:48] ok [20:44:51] now save? [20:44:54] yes [20:45:08] ok [20:45:12] done that [20:45:24] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [20:45:49] phew [20:45:51] ok, bye [20:46:00] that's it? [20:46:00] see you in ~3-5 hrs [20:46:07] idk, have to run [20:46:14] lol ok [20:46:44] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [20:47:04] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [20:51:04] PROBLEM host: mobile-enwp is DOWN address: i-000000ce CRITICAL - Host Unreachable (i-000000ce) [21:07:19] ^demon: now that I've fixed it on my end, I'm still not sure what the identation is, can you do it so that I can see the view in gerrit? [21:09:09] it's at https://gerrit.wikimedia.org/r/#change,6541 [21:09:13] <^demon> I'm looking [21:09:36] thanks [21:12:25] New patchset: Demon; "Making the htaccess production only" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6541 [21:12:38] New review: gerrit2; "Change did not pass lint check. You will need to send an amended patchset for this (see: https://lab..." [operations/puppet] (test); V: -1 - https://gerrit.wikimedia.org/r/6541 [21:13:47] New patchset: Demon; "Making the htaccess production only" [operations/puppet] (test) - https://gerrit.wikimedia.org/r/6541 [21:13:48] <^demon> Whoops, syntax error :) [21:14:02] New review: gerrit2; "Lint check passed." [operations/puppet] (test); V: 1 - https://gerrit.wikimedia.org/r/6541 [21:14:03] any idea who Rich Smith is ? :) [21:14:09] <^demon> Thehelpfulone: Ok, patch 3 should be the right one now :) [21:14:17] great [21:14:50] New review: Thehelpfulone; "Thank you!" [operations/puppet] (test) C: 1; - https://gerrit.wikimedia.org/r/6541 [21:15:10] * Thehelpfulone asks Ryan_Lane to review it, https://gerrit.wikimedia.org/r/#change,6541 [21:15:24] PROBLEM host: analytics is DOWN address: i-000000e2 CRITICAL - Host Unreachable (i-000000e2) [21:15:41] enough for today [21:15:46] have a good day and see you tomorrow [21:16:44] PROBLEM host: ganglia-test is DOWN address: i-00000202 CRITICAL - Host Unreachable (i-00000202) [21:17:04] PROBLEM host: salt is DOWN address: i-000001c1 CRITICAL - Host Unreachable (i-000001c1) [21:21:04] PROBLEM host: mobile-enwp is DOWN address: i-000000ce CRITICAL - Host Unreachable (i-000000ce) [21:35:33] welcome back maplebed [21:35:51] I think someone told me that you can review puppet stuff? [21:36:08] umm... as well as any opsen... [21:36:08] https://gerrit.wikimedia.org/r/#change,6541 please [21:44:40] i think some networking stuff, from reading ops chats [21:44:40] not sure though [21:44:41] ottomata: yes, network problems, be patient :) [21:44:46] will do, was responding to TheOpenFreeFan abou that [21:44:49] actually, this is a great stopping point [21:44:52] gnight folks! [22:03:40] 05/03/2012 - 22:03:38 - Updating keys for orion [23:04:08] New patchset: Bhartshorne; "adding an empty placeholder file for lighttpd/htdigest so that mailman will install in labs" [labs/private] (master) - https://gerrit.wikimedia.org/r/6566 [23:04:31] Ryan_Lane: would you tell me if that's right? ^^^^ [23:07:06] New review: Bhartshorne; "(no comment)" [labs/private] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/6566 [23:07:08] Change merged: Bhartshorne; [labs/private] (master) - https://gerrit.wikimedia.org/r/6566 [23:07:24] Thehelpfulone: ^^^ it's merged now. [23:07:30] in a minute it should be avaialble to puppet for you. [23:07:35] ok great [23:07:39] (if I did my job right) [23:07:41] :P [23:07:49] let's hope you did ;) [23:10:29] ok so the puppetd -tv ran without any problems maplebed [23:10:37] \o/ [23:10:47] but I was under the impression that running that would make everything else magically work - like the web interface? [23:11:04] when you need to put content in that file send one of us a patch and we'll get it in np. [23:11:16] ok [23:11:30] about starting up the web interface... err... idunno?! [23:11:32] ;) [23:11:33] http://mailman.wmflabs.org/ still shows the "It works" web server default though, so maybe I'm missing something? [23:12:15] it's got the same classes as production so I think Damianz was saying that it should work [23:18:08] yeah ,I haven't looked at the puppet configs for mailman at all. [23:18:31] at this point, my suggestions are more mundane (i.e. see who's listening on the port, make sure it's lighttpd, look at the config, etc.) [23:19:09] there's probably some easy switch that needs to be flicked to make it work, I'll see what Damianz or mutante says tomorrow :) [23:19:18] well relatively easy ;) [23:20:33] !nagios [23:20:33] http://nagios.wmflabs.org/nagios3 [23:20:44] is deployment-prep down? [23:33:18] actually maplebed how good is your lighttpd knowledge? it's telling me that Problem: All Mailman web pages give a 404 File not found error. Answer: Your web server has not been set up properly for handling Mailman's CGI programs. Make sure you have: 1) configured the web server to give permissions to $prefix/cgi-bin 2) restarted the web server properly [23:33:27] nonexistent. [23:33:31] I've never actually run the server. [23:33:32] :P [23:33:43] heh [23:33:54] maybe you need a chmod +x somewhere? [23:34:42] If you want me to log in to the thing I can try and see what I see, but if you have the fortitude to keep digging, that's cool too. [23:35:07] can I view /etc/lighttpd/lighttpd.conf on the production server? [23:35:16] server = cluster [23:35:22] if it's in puppet, you should be able to. [23:35:32] * maplebed checks if it's in puppet [23:36:25] I see https://gerrit.wikimedia.org/r/gitweb?p=operations/puppet.git;a=blob;f=files/download/lighttpd.conf;h=e629ac8118000b72ce1d8dc19d5edaad8d064d8c;hb=HEAD [23:36:41] but that looks like it's for a different service. [23:37:03] yeah I found that one [23:37:20] I'll happily give you access or if you can give it to yourself, feel free [23:38:14] if it's not in puppet would it be somewhere else? [23:38:50] there's also https://gerrit.wikimedia.org/r/gitweb?p=operations/puppet.git;a=tree;f=files/lighttpd;h=2214ceabe097c0e08f44f1b74c76739b4757080a;hb=HEAD [23:39:02] I can see that the production config pulls in 50-mailman.conf from there [23:40:43] yeah that looks like it could be it [23:41:56] but I imagine a straight copy and page wouldn't quite work [23:43:17] you're right; it has redirects to lists hardcoded in there. [23:43:49] yeah those could be tweaked to mailman.wmflabs.org but should using puppet mean all the lists are automatically configured? [23:44:09] but that's a prime candidate for using pupet .erb templates and passing the canonical name of the list server as a variable. [23:44:11] :D [23:44:51] * Damianz adds Thehelpfulone to his unhelpful list of people [23:45:06] Damianz! <3 [23:45:11] :( [23:45:19] * Thehelpfulone watches Damianz do some magic [23:45:41] anyone around to take a look at beta labs? [23:45:55] http://en.wikipedia.beta.wmflabs.org/wiki/Main_Page <-not working [23:46:02] yeah it seems to be down for me too robla, but nagios doesn't show any problems [23:47:03] odd. petan|wk, Reedy, either of the two of you still up? [23:47:20] It's only 00:47 ;) [23:48:08] Reedy: and you're still up ;) [23:48:14] whee! [23:48:54] Reedy: are you in any spot yet to beat beta labs into submission? :) [23:49:32] Thehelpfulone: just fyi, I've stopped looking at puppet and lighttpd; I have some deletion-script babysitting I need to get through before I leave in 30m. [23:49:57] ok no problem [23:50:11] Damianz's here now ;) [23:51:19] Ryan_Lane: does it look like this PHP Notice: unserialize() [function.unserialize]: Error at offset 6 of 10 bytes [23:51:27] yes [23:51:55] I'm storing *way* more than 255 bytes [23:52:25] Ryan_Lane: what does strlen say about it? [23:52:27] I'm base32 encoding 5 512 byte string [23:52:48] that's definitely not going to fit into 255 bytes ;) [23:53:28] Ryan_Lane: you also might want to do a base64_encode base64_decode too [23:53:40] why's that? [23:53:47] Ryan_Lane: e.g., $safe_string_to_store = base64_encode(serialize($multidimensional_array)); [23:53:55] Ryan_Lane: and $array_restored_from_db = unserialize(base64_decode($encoded_serialized_string)); [23:54:02] is a base32 not safe to store? [23:54:09] heh [23:54:16] It turns out that if there's a ", ', :, or ; in any of the array values the serialization may get corrupted. [23:56:09] * Damianz gives Ryan_Lane a cookie [23:57:12] http://www.jaccuzzi.ch/html/affichjacc_264_e.html That's some serious jaccuzzing