[00:11:16] restarted glusterd services. short public key outage [00:11:38] I hate how glusterd manages that service [00:13:30] <^demon> s/(how|manages that service)// [00:13:36] indeed [00:14:19] RAWR [00:16:31] god damn it [00:17:51] if a single glusterd service is down, none of them can export nfs [00:18:01] what a giant piece of shit gluster is [00:25:52] :< [00:25:58] this is really pissing me off [00:28:35] Ryan_Lane: just to check, will this impact instances that are still on lucid? [00:28:56] well, this is specifically the ssh keys [00:29:00] it only affects login [00:29:16] everything else should continue to work as expected [00:29:24] Ryan_Lane: When will it be resolved? I was in the middle of a deploy. [00:29:30] a deploy? [00:29:32] in labs? [00:29:34] To a labs instance. [00:29:37] ah [00:29:40] hopefully soon [00:29:48] no fucking clue why nfs won't mount [00:29:51] it's showing in showmount -e [00:30:00] and services are running in rpcinfi -p [00:30:03] rpcinfo -p [00:30:25] this is what I get for trying to work on gluster [00:30:29] it only ever fucks me [00:41:49] hi, how do I log into bastion? (I see the issue has been discussed on labs-l, but there was no fix described) [00:42:35] ssh bastion2.wmflabs.org [00:43:03] .. oh, nope. I swear that was working earlier. [00:43:09] hm [00:43:12] *now* it's a gluster issue [00:43:20] Krenair: permission denied for me [00:43:25] at least it's responding :p [00:43:27] same. I realised after I sent the message :p [00:43:38] I restarted glusted services to unlock the daemon and now it's nfs services are fucked [00:45:06] so tl;dr no way into labs for now? [00:45:10] correct [00:45:13] until I fix this [00:45:24] do we get paid time off? [00:45:29] :) [00:57:51] ah ha [00:57:54] ok [00:57:58] keys are back up [00:58:06] had to kill all nfs daemons and restart glusterd again [00:58:07] however.... [00:58:11] bastion1 is hung [00:58:18] I think lucid may have an issue with nfs [00:58:37] bastion2/3 will work? [00:58:39] yep [00:58:48] yay [00:59:05] dschoon: login should work again [00:59:12] gribeco, ^ [00:59:22] nope [00:59:22] I think I'm going to rebuild bastion as precise [00:59:25] not for me :( [00:59:25] ah, I'll give it a try [00:59:30] dschoon: which instance? [00:59:35] legoktm@bastion2:~$ ssh legoktm@bots-3 [00:59:35] If you are having access problems, please see: https://wikitech.wikimedia.org/wiki/Access#Accessing_public_and_private_instances [00:59:35] Permission denied (publickey). [00:59:35] kripke [00:59:45] boxofjuice, did you connect to bastion with -A? [00:59:48] I got into bastion2! [00:59:59] Also you won't need the legoktm@ bit when you're already legoktm :) [01:00:02] bots-3 has public/keys [01:00:03] Krenair: no, lemme try that [01:00:09] it should definitely work [01:00:17] Krenair: thanks that worked [01:00:24] dschoon: try again? [01:00:24] Ryan_Lane: seems i needed to set -A [01:00:28] boxofjuice: yeah [01:00:29] trying [01:00:40] dschoon: i just ls'd in /public/keys and it's there [01:00:46] hm didn't know you could forget '.pmtpa.wmflabs' and still have it work [01:01:15] bastion1.pmtpa.wmflabs gives me conn closed [01:01:27] oh [01:01:27] which bastion should i use? [01:01:28] yeah, as mentioned bastion1 is broken [01:01:31] one sec [01:01:33] I'll reboot it [01:01:37] I'm going to rebuild it as precise [01:01:40] As I said bastion2 works [01:01:44] ...while you do that [01:01:47] is there another? [01:02:15] ......... Oh well I sort of said that. I realised I was wrong afterwards ;) [01:02:46] dschoon: bastion2.wmflabs.org and bastion3.wmflabs.org [01:02:49] okay. [01:04:11] so, as a positive, it looks like glusterfs isn't crapping itself every few minutes in the dmesg [01:04:35] and when I rebuild bastion as precise it should recover itself from an nfs outage [01:08:30] Ryan_Lane: looks good with bastion2 [01:08:30] ty [01:08:35] yw [01:08:39] now to solve the issue on bastion1 [01:09:40] Ryan_Lane: is there a reason adding a new hostname to an instance wouldn't be updating DNS right now? [01:10:01] did you check for the name before you added it? [01:10:48] what's the hostname/ip? [01:11:15] usually this ends up being negative cache, but I can check [01:11:20] dschoon: ^^ [01:12:45] ee-dashboard [01:12:48] ^^ Ryan_Lane [01:12:54] .wmflabs.org? [01:13:12] ;; ANSWER SECTION: [01:13:12] ee-dashboard.wmflabs.org. 3600 IN A 208.80.153.237 [01:13:53] I'd imagine you're hitting negative cache [01:14:28] yeah, I'm hitting the metrics dashboard when visiting it [01:15:21] dschoon: ^^ [01:15:44] weird. [01:15:48] maybe it's my local DNS caceh. [01:15:49] *cache [01:15:51] http://kripke.wmflabs.org/ wfm [01:15:53] yep [01:16:02] ty [01:16:06] either local, or your resolver [01:16:08] yw [01:16:22] it's a 1 hour negative [01:59:35] Die Gluster Die! [02:01:11] Well, actually, that's what it keeps doing. :-/ [02:03:52] heh [02:03:55] yep [02:04:12] !log bastion rebooting bastion-restricted [02:04:13] Logged the message, Master [02:04:52] On the positive side, I got a fairly performing DB up. Mariadb >> plain mysql [02:05:04] * Coren still prefers postgres. [02:08:29] Ryan_Lane: bastion2 doesn't do forwarding? [02:08:43] liangent: it does. are you having an issue? [02:09:34] yes. http://dpaste.com/1014641/ [02:10:14] oh. port forwarding [02:10:59] debug1: getpeername failed: Bad file descriptor [02:11:05] liangent: what are you trying to do? [02:11:39] ssh -a -W instance-proxy.eqiad.wmflabs:22 bastion1.eqiad.wmflabs -vvv [02:12:11] wait. is .eqiad.wmflabs valid? [02:12:12] are you trying to do proxycommand? [02:12:15] nope :) [02:12:18] it's pmtpa [02:12:27] we don't have the eqiad zone up yet [02:12:44] its listed in https://wikitech.wikimedia.org/wiki/Access#Using_ProxyCommand_ssh_option [02:12:48] yeah [02:12:52] and listed as bastion2 section [02:12:55] that will eventually be the bastion there [02:13:02] ah. whoops [02:13:13] let me update that [02:13:13] so I imagine bastion2 is for eqiad [02:13:45] well, it was going to be [02:14:00] I guess we'll name it something else [02:16:17] ok it works now [02:17:08] so bastion[1-3] are identical currently? [02:27:18] liangent: yep [02:47:05] any docs about Special:OATH? [02:52:35] liangent: not really. I need to add some docs and such [05:52:35] Ryan_Lane: ping : you have a PM [06:07:18] petan: can you install "libmysqlclient-dev" on bots-bnr1? [06:07:31] i need it to compile python-oursql [06:41:23] Ryan_Lane: you there? [06:44:31] Krinkle: yep [06:44:41] aaron and I fixed the search, btw [06:45:02] Ryan_Lane: apergos is telling my my cron on a vm is spamming all of ops every 15 minutes [06:45:12] how awesome is that [06:45:19] heh [06:45:29] any idea which instance, or what cron? [06:45:52] most labs instances send a ton of cron spam [06:45:55] Ryan_Lane: So, aside from the issue that the default for root crons in labs shouldn't go to all of ops (lol!), I've got an idea, based on how things go with MMP at toolserver. [06:46:04] Ryan_Lane: etc/crontab on cvn-app1 [06:46:10] I just edited it to go to /dev/null instead [06:46:29] we can't do anything about mail until we have a exim relay for labs [06:46:33] apparently the default for stdout in cron there from root is to go to ops [06:46:42] The idea is as follows [06:46:43] it's been on mark/faidon's radar for ages [06:46:50] set up an email alias on wmflabs for nova projects [06:47:02] e.g. cvn@mail.wmflabs.org would go to members of https://wikitech.wikimedia.org/wiki/Nova_Resource:Cvn [06:47:16] yep. this is basically how we're going to handle it [06:47:19] (their LDAP mailaddress, assuming that's mandatory) [06:47:25] yep [06:47:33] and secondly, aside from that being a useful thing. set it as default for things like cron spam. [06:47:36] we're going to send cronspam to project admins [06:47:41] great [06:47:42] not regular users [06:47:57] we'll likely make an alias for regular users too [06:48:06] Hm.. is project admins new? [06:48:09] I haven't see that before [06:48:18] since it would be useful to be able to send email to all members of a project [06:48:24] we merged netadmin and sysadmin [06:48:29] I created the cvn project in labs, I'd consider myself an admin (create/delete instances, assign ips etc.) [06:48:39] but I see that section is empty [06:48:48] yeah. that makes you a "projectadmin" [06:48:55] and novaadmin is in there, what's that doing there? [06:49:14] essex and above of openstack have no concept of global admin [06:49:31] I think havana, or maybe grizzly will add that back in [06:49:42] so, we have to have a single user that's in every project [06:49:46] as projectadmin as well [06:50:10] we should likely be hiding it from the interface, but it's not a major issue [06:50:19] ok, I don't mind the "novaadmin" in the list, but I mean Project admins. [06:50:22] Why is that empty for Cvn? [06:50:30] it's empty? [06:50:32] let me see [06:51:06] it's not empty [06:51:16] it has four members [06:51:53] Azariv, Krinkle, novaadmin, Sactage [06:51:55] oohhhhhhh [06:52:00] you mean on: [06:52:03] !resource cvn [06:52:04] https://labsconsole.wikimedia.org/wiki/Nova_Resource:cvn [06:52:10] members, yes. [06:52:14] but the "admins" list is empty? [06:52:27] yes, because that's currently broken [06:52:27] I know I'm a project admin (or so I guess) [06:52:41] k [06:52:42] I should remove it from the interface till I have it working again [06:53:41] I wonder if I have an open bug on that [06:53:46] I'm pretty sure I do [06:54:14] http://www.mediawiki.org/wiki/Wikimedia_Labs/Account_creation_improvement_project#Current_account_creation_process [06:54:20] https://bugzilla.wikimedia.org/show_bug.cgi?id=43515 [06:54:29] @notify petan [06:54:29] This user is now online in #huggle so I will let you know when they show some activity (talk etc) [06:54:32] it's probably not a ton of work [06:55:32] and it's even easier now that it's just one group [06:56:23] Change on 12mediawiki a page Wikimedia Labs/Account creation improvement project was modified, changed by Ryan lane link https://www.mediawiki.org/w/index.php?diff=656319 edit summary: [-17] /* Current account creation process */ [06:56:48] Change on 12mediawiki a page Wikimedia Labs/Account creation improvement project was modified, changed by Ryan lane link https://www.mediawiki.org/w/index.php?diff=656320 edit summary: [-257] [06:56:58] Hooray, crossed an entire section out! [06:57:26] lol [06:58:27] :) [06:58:41] Ryan_Lane: any chance you can install a package on bots-bnr1 for me? [06:58:47] Change on 12mediawiki a page Wikimedia Labs/Account creation improvement project was modified, changed by Ryan lane link https://www.mediawiki.org/w/index.php?diff=656321 edit summary: [-182] /* OpenID as a provider */ [06:58:53] and a bunch of bugs crossed off too! [06:59:11] boxofjuice: which package? [06:59:23] "libmysqlclient-dev" [06:59:44] i need it to build oursql [06:59:54] oursql? [07:00:10] its a python package [07:00:20] library for mysql, replacement for MySQLdb [07:00:29] * Ryan_Lane nods [07:00:29] ok [07:00:49] !log bots installed libmysqlclient-dev on bots-bnr1 [07:00:51] Logged the message, Master [07:01:10] ty :) [07:01:30] yw [07:02:41] I really need to finish puppet stuff for bots for this [07:02:45] * Damianz sigh [07:03:36] Ryan_Lane: regarding the mail alias to nova project group, open bug for that or want me to create one? and the cron going to root/ops bug, and the enhancement to make it go to project (once the first thing is implemented) [07:04:36] cron going to roots is just funny... though we need a relay to direct that to project admins :( [07:04:40] I think there's already an RT for it [07:04:45] a really, really old one :( [07:05:11] ok. removed admins for now from the project pages [07:06:41] [bz] (NEW - created by: Krinkle, priority: Unprioritized - major) [Bug 45827] Labs: Mails from cron should not go to ops-wikimedia - https://bugzilla.wikimedia.org/show_bug.cgi?id=45827 [07:07:57] [bz] (NEW - created by: Krinkle, priority: Unprioritized - normal) [Bug 45828] Labs: Implement mail aliases for projects (@wmflabs.org) - https://bugzilla.wikimedia.org/show_bug.cgi?id=45828 [07:09:21] [bz] (NEW - created by: Krinkle, priority: Unprioritized - enhancement) [Bug 45829] Labs: Let mail (from cron and perhaps other defaults) go to project forwarder by default - https://bugzilla.wikimedia.org/show_bug.cgi?id=45829 [07:09:50] Ryan_Lane: Merge https://gerrit.wikimedia.org/r/#/c/52350/ please :p [07:10:03] heh [07:10:05] it scares me [07:10:08] let me look at it tomorrow [07:10:43] some things are poorly named Nova right now [07:10:48] controller, for instance [07:11:00] though really I probably just need to pull the keystone stuff out of it [07:11:05] and the rest call stuff [07:11:17] haha [07:11:37] I'll probably merge it through tomorrow, then refactor afterwards [07:12:56] Krinkle: thanks for adding in the bugs [07:13:04] yw [07:13:04] this is quite unique right now this moment [07:13:11] what is? [07:13:28] so, Ryan in SF, 11:12PM. Reedy 7.12AM (night, morning? not sure how you feel), Krinkle, 8.12AM (end of night) [07:13:40] reedy is in SF ;) [07:13:46] reeeeeally [07:13:50] yep [07:13:50] My laptop says it's tomorrow though [07:13:54] heh [07:13:56] So does my IRC client [07:13:58] I found it unusually late indeed, even for Reedy [07:14:01] ;) [07:14:08] late? Meh I just got out of bed :P [07:14:30] Change on 12mediawiki a page Wikimedia Labs was modified, changed by Ryan lane link https://www.mediawiki.org/w/index.php?diff=656324 edit summary: [+75] /* Proposals */ [07:16:52] Change on 12mediawiki a page Wikimedia Labs/Interface usability improvement project was modified, changed by Ryan lane link https://www.mediawiki.org/w/index.php?diff=656326 edit summary: [-609] [07:17:13] Change on 12mediawiki a page Wikimedia Labs/Communication improvement project was modified, changed by Ryan lane link https://www.mediawiki.org/w/index.php?diff=656327 edit summary: [+609] [07:23:30] Ryan_Lane: So it's final that there isn't going to be a separate "Test/Dev Labs" and "Tool Labs", right? [07:23:46] I'm about to update stuff to reflect that [07:23:51] tools will be a labs project [07:24:01] and "Test/Dev Labs" is a term no longer to be used. [07:24:28] well, those terms were meant more as projects [07:24:36] they were always meant to be in the same infrastructure [07:24:51] sure [07:25:06] phase 1 was test/dev labs (which is what's currently running) and tool labs is phase 2 [07:25:18] ok [07:25:19] which Cor en is so unlucky to be doing :) [07:25:32] I'll leave it as is for now then [07:25:35] kidding it'll be fun :) [07:26:00] thanks [07:26:50] btw, I got 2 small random bots I'd like to migrate from my krinkle@willow.toolserver crontab to labs. Is bots still the place to go or is tools ready for action already? (I know you don't manage that, but Coren is asleep and I figured you are aware of how it progresses) [07:27:16] hm. may want to ask coren on progress [07:27:20] k [07:27:22] I have a feeling bots project is still the spot [07:28:15] Ryan_Lane: interesting, looks like does properly display projectadmin [07:28:25] what does? [07:28:54] http://cl.ly/image/3Z0t1v1N340H [07:28:58] Special:NovaProject * [07:29:20] yeah, it displays properly in manage projects [07:29:29] but not on project pages [07:29:48] I didn't want to make project pages dynamic pull from ldap [07:29:57] as that's just waiting to be abused [07:30:12] doesn't it already do that for members? [07:30:17] so, when projects are modified, OpenStackManage updates the project page [07:30:27] oh, I see. [07:30:27] it's doing it for members, but not for projectadmins [07:30:29] it is hardcoded in wikitext [07:30:32] yeah [07:30:47] the only thing that updates the memberships is wikitech, so it's not an issue [07:31:17] if that ever moves to keystone, we'll make a keystone plugin that updates mediawiki with the info [07:31:22] like we do for nova [07:34:45] hell, let me see if I can just push this in now [11:22:40] !log bots enforcing new security rules for sql and passwords storage [11:22:42] Logged the message, Master [11:44:31] Damianz ping [11:44:43] petan: pong [11:44:55] Damianz when you want to move cluebot to bsql? [11:45:13] petan: Erm... tomorrow night maybe, I've got to work late tonight [11:45:16] that would actually allow us to turn off sql2 :P [11:45:18] mhm ok [12:26:05] Silke_WMDE: about that tool list... am I expected to list all my bots one by one? [12:29:53] * addshore dislikes interwiki bots [12:30:45] liangent: that would be best. how many bots so you run? [12:33:21] Silke_WMDE: eight. different invocations (with different arguments) of the same script are counted as one [12:33:44] I would say so, yes [12:35:30] Silke_WMDE: so one entry per script is ok? [12:35:48] yes [12:35:53] Silke_WMDE: hmm I also have a webtool, which consists of some smaller apps. one entry or multiple? [12:36:10] I already added it as one -- am I expected to expand it now? https://www.mediawiki.org/w/index.php?title=Toolserver%2FList_of_Tools&diff=656432&oldid=656425 [12:36:40] no, i think it's ok [12:57:32] Silke_WMDE: and among those bots, one does nothing itself, and it depends on all configured "plugins". should I list those plugins? [12:58:15] yes, please list them as dependencies [12:58:48] Silke_WMDE: eh the "framework" and "plugins" are all written by myself. [12:58:57] it's coded in this way for easier maintenance [12:59:01] ah [12:59:21] then please list them separately [12:59:23] each "plugin" does something completely different from others [14:21:36] Top 10 bad starts to a day: (3) drive failure on your primary desktop [14:22:43] what's (2) and (1) [14:22:43] ? [14:23:34] Not sure, but I expect "being woken up by a phone call announcing the death of a family member" should be up there. :-) [14:34:14] * Coren fetches caffeine. [16:07:11] addshore we should restart sqld [16:07:18] why? [16:07:23] addshore I found out we really need to restart it for it to read config file [16:07:32] so as we changed the sizes it never took effect [16:07:49] I did a sneaky restart the other day :) but if you have made more changes go for it :) [16:07:51] it eats around 7gb out of 16 [16:07:55] aha [16:08:02] I configured it to use 10GB for innodb [16:08:03] I did today increased the buffer to 12gb [16:08:06] oh lol [16:08:09] I changed that to 12 [16:08:10] :P [16:08:12] kk :) [16:08:13] mhm [16:08:26] I think I was going to have it at 12 also [16:08:26] for some reason it uses very little ram right now [16:08:31] I don't know whty [16:08:42] no requetss running on it atm [16:10:13] ok but it should be full anyway [16:10:20] so that it doesn't need to read from disk [16:13:27] gribeco generates lot of errors in sql :/ [16:13:31] dunno what's up [16:15:12] !log tools can haz database (support for user/tool databases in place) [16:15:13] Logged the message, Master [16:15:36] petan: if you want to restart it, restart it now :) [16:15:40] Coren how do you maintain db creation [16:15:52] addshore idk wouldn't it crash some bots? [16:15:56] more people use it [16:16:01] * addshore checks [16:16:20] petan: At this time, this is part of my create-a-tool script. Eventually (read: real soon) it will be done via the wikitech interface. [16:16:24] Threads: 8 Questions: 1527517 Slow queries: 3 Opens: 274 Flush tables: 1 Open tables: 165 Queries per second avg: 16.048 [16:16:35] addshore let's wait [16:16:47] looks like we will have to schedule a restart in a week or so :) [16:16:48] we will restart it one day... I believe labs will crash soon again :D [16:16:50] then it will restart [16:16:53] yee xD [16:17:07] just about to run my bot now :) [16:17:15] 16 queries per second :D [16:17:20] addshore how you get these stats [16:17:25] from information schema [16:17:28] loginto mysql [16:17:28] or you got a tool [16:17:33] and type "status" ;p [16:17:41] yay [16:19:25] mhm Coren what about that procedure I posted :P or some self-service script [16:19:31] so that users could create db's using terminal [16:19:41] * petan hates web interfaces [16:20:10] also, what if I wanted to create a DB and give permissions to 5 more people to it [16:20:38] petan: 'with grant option' is given. [16:20:50] ok, how do you handle database dropping then? [16:20:57] because when you drop a db, grants remain [16:21:19] so, if someone recreated a db which existed in past, they might have a security problem [16:21:23] petan: said interface will also clean up the grants. [16:21:39] ok but if I have grants [16:21:43] I don't need to use your interface [16:21:51] one could just execute drop database [16:22:19] petan: Yes, you get to shoot yourself in the foot. If you do so, you are welcome to use the hole for any purpose you like. [16:22:40] petan: The only one you're going to affect is yourself. [16:22:49] right [16:24:06] I'm not trying to protect the maintainers from themselves; I'm trying to protect their tools from being broken by others. If you break your own tool, you get to keep the pieces. :-) [16:24:23] ok [16:26:04] But there's not going to be a way around having to use the wikitech interface sometimes; it alone holds the right to play in the labs-wide LDAP [16:26:24] :/ [16:26:32] that's sad [16:26:35] Although, I would expect ~5 minutes of use over the lifetime of a tool. [16:26:57] poor people with no graphical interfaces [16:27:15] I heard about people who don't even run X server [16:27:16] :D [16:27:22] on their home computers lol [16:27:38] I believe Ryan is one of them [16:27:47] and Damianz [16:27:47] elinks works very well. [16:29:14] Honestly, I don't believe any weakening of reliability or security is justified to avoid a few minutes of incovenience for .03% of users. :-) [16:29:23] Coren: WE NEED AN API [16:29:26] * Damianz just saying [16:29:28] +1 [16:29:37] we need a terminal based labs console [16:29:47] so that you don't need to move your hands from keyboard [16:29:59] because that is annoying :D [16:30:23] well, it doesn't need to be spartan [16:30:23] also labs console is terribly slow because all that html shit [16:30:23] it could have a ncurses interface :) [16:30:37] + mediawiki itself is horribly slow [16:31:13] petan: Now come. Rule number one of optimization: do not spend more time optimizing than could be saved by the optimization. [16:31:19] it can even crash your browser :P [16:31:45] if it was up to me I would take all that 98% of crap in mediawiki, ripped it out and made mediawiki only VERY minimal lightweight and fast core. And all these 98% of unneeded stuff could be converted into extensions [16:32:08] we don't even need any skins in core :P [16:32:11] You've already spent more time arguing about wanting a text-based inteface than you will spend /in/ the interface. :-P [16:32:11] that all can be extensions :D [16:32:18] heh [16:32:20] maybe [16:32:49] but still it would be more hackish to have labs console being actually a console [16:33:20] it would be so much faster to operate labs [16:33:25] actually I put some effort into that [16:33:31] like I can do this [16:33:37] @labs-project bots [16:33:43] HAHA petan http://ganglia.wmflabs.org/latest/?r=20min&cs=&ce=&c=bots&h=bots-bsql01&tab=m&vn=&mc=2&z=medium&metric_group=ALLGROUPS [16:33:43] @labs-project-info bots [16:33:43] The project Bots has 19 instances and 61 members, description: A project for creating and running both wiki bots for use on Wikimedia Foundation sites and IRC bots. [16:33:52] bsql01 seems to handle everything perfectly [16:33:56] instead of searching through web :D [16:34:04] which would take 20 times more of my expensive time [16:34:04] see the impact on bnr1 ;p http://ganglia.wmflabs.org/latest/?r=20min&cs=&ce=&c=bots&h=bots-bnr1&tab=m&vn=&mc=2&z=medium&metric_group=ALLGROUPS [16:34:11] addshore yes I know [16:34:55] admitadly that is with about 500 less queries than a regular run :P [16:34:57] petan: Well, you could always write a command-line tool that uses http to talk to Wikitech. :-) [16:35:09] Coren if only there were apis [16:35:14] which is what I am talking about all the time [16:35:27] console based labs console could just be interface to wikitech [16:35:32] using api [16:36:02] https://gerrit.wikimedia.org/r/#/admin/projects/mediawiki/extensions/OpenStackManager [16:36:12] mhm :D [16:36:18] I wouldn't touch Ryan's baby [16:36:37] You know the wiki way. {{sofixit}} :-) [16:36:54] but Ryan way: {{touchiturdead}} [16:37:11] lol [16:37:19] I once commited something in that :D it got reverted quickly [16:37:25] Oh, I doubt that. It may take some doing for him to +2 it though. :-) [16:43:30] Change on 12mediawiki a page Wikimedia Labs/Tool Labs/TODO was modified, changed by MPelletier (WMF) link https://www.mediawiki.org/w/index.php?diff=656513 edit summary: [+71] progress [16:44:44] Change on 12mediawiki a page Wikimedia Labs/Tool Labs/TODO was modified, changed by MPelletier (WMF) link https://www.mediawiki.org/w/index.php?diff=656515 edit summary: [-63] also progress [16:44:55] whois Silke_WMDE_ [16:45:01] * Coren grumbles. [16:45:48] * Platonides returns "Silke Meyer" [16:47:52] * Coren is a bit surprised she doesn't have a cloak. [16:50:13] Change on 12mediawiki a page Wikimedia Labs/Tool Labs/TODO was modified, changed by MPelletier (WMF) link https://www.mediawiki.org/w/index.php?diff=656520 edit summary: [+94] mo' progress [16:57:28] Coren: that's a pretty good point, mail her about it [16:57:42] she should get one [16:58:02] I think she has one [16:58:23] yes, she has: Silke_WMDE is ~quassel@wikidata/Silke-WMDE * Silke Meyer [16:59:07] it's probably that as she logged in as Silke_WMDE_, she didn't identify to services [17:03:50] ^^ exactly [17:04:11] just a second :) [17:06:34] * Silke_WMDE has put her cloak on [17:17:50] Coren: Could you run the DB part of your create-a-tool script for wikilint, please? Just curious :-). [17:18:02] scfc_de: 'tis done already [17:19:18] Silke_WMDE, you can also use /msg nickserv identify Silke_WMDE [17:19:29] no need to reconnect [17:19:53] Coren: Ah, okay, simple sudo isn't enough, sudo -i does the trick. Thanks! [17:35:54] Change on 12mediawiki a page Wikimedia Labs/Tool Labs/TODO was modified, changed by MPelletier (WMF) link https://www.mediawiki.org/w/index.php?diff=656534 edit summary: [+225] No legal obstacles [17:45:43] scfc_de: Working fine for you? [17:45:55] hi Coren - happy Thursday [17:46:09] Coren: I hope you will be able to watch the monthly metrics meeting today [17:46:11] sumanah: Hi! Err... happy Thursday to you to. :-) [17:46:34] sumanah: What's a monthly metrics meeting? :-) [17:46:50] Coren: Only tested the login so far, and that worked :-). Will do some actual tests later. [17:47:41] Coren: https://meta.wikimedia.org/wiki/Metrics_and_activities_meetings/2013-03-07 7 March 2013 starting at 19:00 UTC (11:00 Pacific Standard Time) [17:48:39] sumanah: I'll keep an eye on it to be sure. [17:48:51] Eew. Youtube and not hangout? [17:49:01] ok! the channel #wikimedia-office [17:49:06] Coren: Not sure what you're saying. [17:49:23] It is a Google+ Hangout and people who want to present can get invites. [17:49:35] And then it streams live (maybe a 5-second delay) onto YouTube. [17:49:38] Why is that "ew"? [17:50:14] The page you linked to didn't say anything about hangout. I thought it was Youtube /instead/ of a G+ hangout [17:52:18] OK. Why should it say anything about a Hangout when that only affects the people presenting? [17:52:23] No, it's a hangout syndicated to Youtube [17:52:42] I mean, if it's important to you that it mention "here is the equipment we use" then you can edit and add that. [17:53:07] Maybe I'm misreading you. [17:54:35] Coren: You may want to present at the April or May metrics meetings to talk about your work [17:54:56] sumanah: Sure, to keep everyone up to date. :-) [17:55:06] right [17:55:15] and to solicit any particular input you need [17:55:19] from the WMF and community as a whole [18:00:58] Change on 12mediawiki a page Wikimedia Labs/Tool Labs/TODO was modified, changed by Tim.landscheidt link https://www.mediawiki.org/w/index.php?diff=656541 edit summary: [+0] Fix typo. [18:39:57] Coren: have you been able to follow up with Asher &/or Peter re the current status of db replication? [18:40:05] if you have a question I can try to ask Peter in person [18:40:27] sumanah: I've been on communication with asher, yes. :-) [18:40:37] sumanah: in* [18:40:51] ok. if you could send an update to labs-l if there's new info / delivery ETA that would be great [18:41:56] sumanah: It'll certainly be part of my weekly status update if I get news. :-) [18:42:36] ok [18:42:41] matanya: howdy [18:43:22] hi [18:44:05] so, where can I help, in particular? I'm strong at the fields of monitoring and server level [18:44:21] a bit weaker on network [18:46:55] so Ryan_Lane what would you suggest to start with? [18:47:19] let's see.... [18:48:02] I have the following things tracked well, right now: [18:48:02] https://www.mediawiki.org/wiki/Wikimedia_Labs/Account_creation_improvement_project [18:48:08] https://www.mediawiki.org/wiki/Wikimedia_Labs/Instance_creation_improvement_project [18:48:16] https://www.mediawiki.org/wiki/Wikimedia_Labs/Interface_usability_improvement_project [18:48:22] https://www.mediawiki.org/wiki/Wikimedia_Labs/Stability_improvement_project [18:48:28] https://www.mediawiki.org/wiki/Wikimedia_Labs/Communication_improvement_project [18:48:39] some of them involve mediawiki development [18:48:54] others involve adding saltstack support [18:49:00] monitoring right now is likely easiest [18:49:12] hashar: anything you need in beta? [18:49:19] ok, lets go in that path [18:49:43] https://www.mediawiki.org/wiki/Wikimedia_Labs/Stability_improvement_project#Monitoring [18:50:03] [bz] (NEW - created by: Ryan Lane, priority: Unprioritized - normal) [Bug 43371] Allow non-admins to give out shell access - https://bugzilla.wikimedia.org/show_bug.cgi?id=43371 [18:50:16] Thank you matanya! I appreciate your help [18:50:22] adding ganglia metrics for openstack/kvm would also be very helpful [18:50:37] * sumanah goes upstairs to metrics meeting [18:50:52] sumanah: I do it all day long, would be glad to use some of the knowledge back [18:51:20] Change on 12mediawiki a page Wikimedia Labs/Stability improvement project was modified, changed by Ryan lane link https://www.mediawiki.org/w/index.php?diff=656557 edit summary: [+147] /* Monitoring */ [18:51:31] I added a few more to that list [18:51:39] I think I'll start with disk space stuff, really easiest for a new-comer [18:51:39] we're missing checks for all kinds of stuff [18:51:42] cool [18:51:54] done on puppet? [18:51:56] it may be as easy as just enabling the check [18:52:06] yes/no [18:52:07] :) [18:52:11] means? [18:52:14] in production it's done via puppet [18:52:26] in labs it's done some other kind of strange way, based on puppet classes [18:52:48] can you point me to the right places? :) [18:52:52] yep [18:53:09] the check should be the same for both, so it should be doable [18:53:42] our puppet repo is at: https://gerrit.wikimedia.org/r/operations/puppet [18:53:51] you can git clone that [18:53:59] have it already [18:54:02] ah. cool [18:54:07] I have pushed one thing [18:54:10] was rejected [18:54:17] heh [18:54:26] we'll get some things through [18:54:42] nagios stuff is in templates/nagios [18:54:50] and files/nagios [18:55:16] why do we have duplicate nagios/icinga? [18:55:20] checks are added at the service level [18:55:20] oh [18:55:21] crap [18:55:22] right [18:55:25] we switched to icinga [18:55:45] so, things are actually at files/icinga and templates/icinga [18:55:46] not completely I see [18:55:56] we switched last week [18:56:16] checks are added at the service level via: monitor_service { "ldap": description => "LDAP", check_command => "check_tcp!389" } [18:56:20] for instance [18:56:32] Change on 12mediawiki a page Wikimedia Labs/Tool Labs/Needed Toolserver features was modified, changed by Jarekt link https://www.mediawiki.org/w/index.php?diff=656559 edit summary: [+547] /* Database */ [18:56:43] some checks are done for all services [18:56:49] so the file would be ..puppet/files/icinga [18:56:55] yep [18:57:02] what file exactly there? [18:57:41] well, it's a combination of files in files/icinga and templates/icinga [18:57:46] our repo is kind of a mess [18:57:58] hm. it seems we have checks for disk space and raid.... [18:58:07] I've never seen them trigger [18:58:08] ever [18:58:35] those checks are under: base::monitoring::host [18:58:58] I thought so when first saw the file. to admit, I'm a bit confused. where are you looking now? [18:59:06] manifests/base.pp [18:59:20] our repo is a giant mess of spaghetti code, as I mentioned at my talk ;) [18:59:42] paravoid: ^^ [18:59:53] paravoid: any specific monitoring things you want? [18:59:59] you did, though I did imagine this is the case [19:00:04] heh [19:00:16] need to move all to modules [19:00:19] paravoid wanted a lot of monitoring things [19:00:27] matanya: yep, very much need to [19:00:48] paravoid may be asleep right now [19:01:13] I have at my work place 560 checks for ~100 devices. (all done by me, sadly) [19:01:25] hey [19:01:32] hi there paravoid [19:01:34] have a look at that for ideas https://nagios.debian.org/icinga/ [19:01:38] all the checks are public [19:02:34] user/password? [19:02:36] memory, swap usage, df, load, uptime, hardware RAID, software RAID, more process checks [19:02:44] dsa-guest/dsa-guest [19:03:19] on the writing part, no problem. on the right place to right - big problem [19:03:26] where are the checks in puppet? [19:03:43] quite of lost in the woods [19:03:46] cleaning up puppet regarding monitoring would be fine with us too [19:04:21] 560 checks for 100 devices isn't much btw [19:04:28] I'd be glad to do that, if I knew were to start [19:04:44] paravoid: better than we have, isn't it? ?) [19:04:46] err :) [19:04:56] I know, only basic checks on hardware, an some services [19:05:06] *and [19:05:08] it is :) [19:06:54] for example on a random server : [19:06:58] [19:06:58] [19:06:58] Bonding-status [19:06:58] [19:06:58] OK 07-03-2013 18:57:58 29d 5h 13m 58s 1/3 Interface bond0 is up: mode=4 (802.3ad), 2 slaves: p2p1, p2p2 [19:07:00] [19:07:02] CPU-Load [19:07:04] [19:07:06] OK 07-03-2013 18:57:58 29d 5h 25m 28s 1/3 OK - load average: 0.02, 0.08, 0.07 [19:07:10] [19:07:12] Cpu-usage [19:07:14] [19:07:16] OK 07-03-2013 18:58:05 29d 5h 24m 15s 1/3 CPU OK : idle 90.84% [19:07:18] [19:07:20] Current-Users [19:07:22] [19:07:23] use pastebin ;) [19:07:24] OK 07-03-2013 18:57:58 29d 5h 23m 2s 1/3 USERS OK - 0 users currently logged in [19:07:26] [19:07:28] Dell-Hardware-report [19:07:30] [19:07:32] OK 07-03-2013 18:58:04 0d 3h 14m 27s 1/3 OK - System: 'PowerEdge R710', SN: 'H85RQL1', 16 GB ram (4 dimms), 1 logical drives, 3 physical drives [19:07:35] [19:07:37] current users is a bit scary [19:07:37] Disk [19:07:41] [19:07:43] OK 07-03-2013 18:57:58 67d 21h 54m 48s 1/3 DISK OK - free space: / 151865 MB (79% inode=98%): /dev/shm 7967 MB (100% inode=99%): /boot 395 MB (86% inode=99%): /home 91496 MB (95% inode=99%): /var 505497 MB (88% inode=99%): [19:07:47] [19:07:49] Disk-usage [19:07:51] [19:07:53] OK 07-03-2013 19:04:11 0d 0h 33m 10s 1/3 DISK OK io : [19:07:55] [19:07:57] HTTP [19:07:59] [19:08:01] OK 07-03-2013 18:57:58 165d 0h 38m 31s 1/3 HTTP OK: HTTP/1.1 200 OK - 1546 bytes in 0.001 second response time [19:08:04] [19:08:06] Mem-usage [19:08:10] [19:08:12] OK 07-03-2013 18:58:00 8d 6h 17m 5s 1/3 MEMORY OK : Mem used: 96.84%, Swap used: 31.28% [19:08:14] [19:08:16] SSH [19:08:18] [19:08:20] OK 07-03-2013 18:57:58 193d 12h 36m 52s 1/3 SSH OK - OpenSSH_5.3 (protocol 2.0) [19:08:23] [19:08:24] Slow-Subscribres [19:08:26] [19:08:28] OK 07-03-2013 18:57:58 67d 21h 54m 48s 1/3 OK: No Slow subscribers [19:08:30] [19:08:32] Swap [19:08:34] [19:08:36] OK 07-03-2013 18:57:58 67d 21h 54m 48s 1/3 SWAP OK - 69% free (22551 MB out of 32767 MB) [19:08:40] oops [19:08:42] meant to paste the pastbin [19:08:44] night at the office, paravoid [19:08:53] ? [19:09:20] this is a login server, for devs, not a production one [19:09:46] no devs at office, no users logged in. [19:10:06] anyway, to the point, where do I start? [19:10:23] what file would be the best? [19:11:50] paravoid: ^ [19:12:09] Ryan_Lane: we can talk about beta during lunch if you want :-] [19:19:04] petan: around? [19:19:38] I am a bit concerend that ever since my bot has been editing the disk space availible on bsql01 has been going down :/ [19:22:12] addshore: are you logging everything? [19:22:26] * addshore is unsure [19:22:29] as in mysql logs? [19:24:23] no, on the disk. logs of your edits, like in pywikibot package in ../logs [19:25:17] Ryan_Lane: can you point me to the correct file? [19:25:33] matanya: nope [19:25:45] the bot isnt actually running on bsql01 it is on bnr1 [19:26:01] so the reason for bsql01 disk usage going up has to be something to do with the database :/ [19:26:17] are you writing any type of log? [19:26:33] nope [19:26:47] Damianz: are you moving cluebots mysql atm? [19:28:18] hashar: well, I was asking because matanya was looking for ops volunteer work [19:28:34] we could use an HTTPS proxy :-] [19:28:40] hashar: so, in irc would be better, and specifically, talking with matanya about it [19:28:42] but I am currently busy setting up search [19:28:45] yeah sure [19:28:45] * Ryan_Lane nods [19:29:17] Ryan_Lane: I'm lost in the puppet tree :) [19:29:24] :D [19:29:31] yeah, it's terrible, right? [19:29:32] grep is your friend hehe [19:29:51] one day we would have to write a 30'000 feet overview about our puppet repository [19:30:06] well, it would be less of a problem if everything was modules [19:30:25] grep is my best friend, but if the results come from 6 different files, it doesn't help much [19:31:02] matanya: which timezone are you in ? I am in SF right now but will be back in Europe next week so GMT+1 for me (GMT+2 in a few weeks when DST kicks) [19:31:03] !log tools access.log now split by tools (in tool homedir) [19:31:05] Logged the message, Master [19:31:17] Ryan_Lane: can you point me to the exact place to add a simple disk space check [19:31:27] hashar: i'm in UTC+2 [19:31:31] \O/ [19:31:53] as I said I am working on lucene right now [19:31:56] moving to UTC + 3 in a few weeks as well [19:32:02] no rush [19:32:11] but one could try setting up a HTTPS proxy [19:32:27] matanya: templates/icinga/checkcommands.cfg.erb [19:32:28] need an instance, a wildcard certificate and then adapt the puppet class form production to fit in labs [19:32:44] if it doesn't already exist [19:32:45] ok [19:32:54] Ryan_Lane: it does [19:33:02] disk space checking may just be broken [19:33:03] getting more monitoring would be nice too. Damianz can help on that, he is maintaining a python script that generate the nagios configuration files for all labs instances. [19:33:17] or may only alert when disks are full, which isn't helpful [19:34:10] Ryan_Lane: regarding splitting our puppet files to several modules, I am all for it :-] That would also be a nice way for you guys to level up on different areas. [19:36:34] Ryan_Lane: it is configured to warn on 5% and 6% [19:37:19] that's way too low [19:39:14] I agree [19:40:57] Ryan_Lane: on /puppet/templates/icinga/nrpe_local.cfg.erb command[check_disk_6_4]=/usr/lib/nagios/plugins/check_disk -w 6% -c 4% -l -e [19:41:55] warn on 25% critic on 10% ? :D [19:43:08] what hashar ? [19:44:30] Change on 12mediawiki a page Wikimedia Labs/Account creation improvement project was modified, changed by Ryan lane link https://www.mediawiki.org/w/index.php?diff=656574 edit summary: [-113] /* Current account creation process */ [19:45:08] * matanya is clueless  [19:45:09] hashar: not all systems should page [19:45:26] matanya: alerts set as critical will page us [19:45:44] dbs should definitely page us if they get too low [19:45:55] you have a pager? [19:46:11] phones [19:46:15] oh [19:46:19] SMS [19:46:33] and how come such a low range didn't trigger yet? [19:47:03] as Ryan said they are probably not defined as a critical service [19:47:12] the critical part makes them send SMS as opposed to just email [19:47:31] mutante: where is that configured? [19:48:33] matanya: search operations/puppet repo for "monitor_service" definitions [19:48:48] monitor_service can have critical => true [19:51:29] mutante: manifests/base.pp: @monitor_service { "disk space": description => "Disk space", check_command => "nrpe_check_disk_6_3", tag => nrpe, contact_group => $contact_group } [19:53:59] matanya: yep, default is false, the definition is in nagios.pp line 59 [19:54:04] !mail [19:54:05] we have a mailing list labs-l@lists.wikimedia.org feel free to send a message there, don't forget to subscribe [19:54:19] and ehm, makes me realize that is not in icinga.pp ..hrm [19:58:07] thanks mutante [19:58:22] I see I need to copy the entire class to icinga [19:58:28] matanya: we are in the process of cleaning that up ..so it should be less confusing soonish (icinga vs. nagios) [20:06:46] * Coren says mean things about php [20:26:29] How do I get puppet to run to make sure this change takes effect in the apache config? https://wikitech.wikimedia.org/w/index.php?title=Nova_Resource:I-00000553&diff=prev&oldid=62275 [20:28:13] Krenair: On a given labs instance you can just do 'sudo puppetd -tv' to force a puppet run. [20:28:19] Otherwise, it will run on its own within 30 minutes. [20:28:28] (Is that what you were asking?) [20:28:52] Hm I thought that's what it was. [20:29:07] Except I'm getting err: Could not retrieve catalog from remote server: Connection refused - connect(2) [20:29:11] Of course if puppet is failing partway through then it might not get around to applying your changes :( [20:29:34] I'm expecting /etc/apache2/sites-available/nova-precise2.pmtpa.wmflabs to be changed by this [20:29:34] Oh -- that could just mean that the puppet server is swamped. Give it a few minutes and try again? [20:29:52] When I last logged in it said The last Puppet run was at Wed Mar 6 17:50:38 UTC 2013 (1586 minutes ago). [20:29:57] when I logged in* [20:30:34] hm [20:30:40] What instance is this? [20:31:07] nova-precise2 [20:31:39] lemme poke around. Might be that we just need a reboot to catch up with recent security changes. [20:31:39] Is Labs' hardware available somewhere? [20:31:56] That guy asking about replicating enwiki/dewiki on wikitech-l would probably be interested. [20:32:06] Susan: You mean, a description of the platform we're running on? [20:32:33] Well, this guy is asking what kind of hardware he'd need to fully replicated enwiki and dewiki. [20:32:41] I figure this is pretty much what Labs is doing right now. [20:32:51] So there must be recent hardware provisioning math somewhere. [20:32:59] Or specs. [20:33:32] replicate * [20:34:07] Susan: Sorry, I'm behind on my email so don't know the context for this. [20:34:53] Aren't we all. [20:34:53] The 'deployment-prep' project in labs is an attempt to replicate much of the production infrastructure. But it doesn't replicate the actual content. [20:34:57] I just happened to notice today. [20:34:57] http://lists.wikimedia.org/pipermail/wikitech-l/2013-March/067214.html [20:35:12] Right. But Wikimedia Labs is about to set up database replication of all projects. [20:35:15] Similar to the Toolserver. [20:35:55] Ah, yes, that's right. I don't know immediately what the hardware setup is for the db replication. An email to labs-l might dig that up. [20:37:18] K. :-) [20:40:38] Krenair, if you don't mind, I'm going to reboot nova-precise2. [20:40:49] okay [20:41:04] !log tools tools' php errors now sent to ~/php_errors.log [20:41:06] Logged the message, Master [20:42:41] !log openstack rebooting nova-precise2 in hopes of getting puppet runs to work again [20:42:43] Logged the message, dummy [20:43:47] * Coren chuckles at andrewbogott [20:46:21] Krenair: I'm none the wiser, but puppet seems to be running now. You can log in and see if you got the update you wanted. [20:47:21] hm, nope :/ [20:47:54] nova-precise2 is only barely puppetized, which I'm gradually working on. I have a bunch of related puppet patches pending atm [20:48:46] It keeps trying to recreate existing databases even though the test for existence works any time I do it on the commandline [20:49:14] Krenair, do you want me to look at your patch or are you still debugging things yourself? [20:49:34] Ohey, I note what I presume to be icinga hitting my webproxy at interval for /server-status. Enable? [20:50:54] Coren: I'm ignorant, sorry. LeslieCarr might know something [20:50:59] although she is not in this channel it seems [20:52:37] Coren labs icinga? [20:52:41] or production? [20:52:46] if it's labs me is the person to blame [20:52:56] petan: Not sure atm. :-) [20:53:09] o_O [20:53:13] !icinga [20:53:13] http://208.80.153.210/icinga http://icinga.wmflabs.org/ [20:53:42] Actually, I saw the hits /from/ the proxy, but now I see they are coming from 127.0.0.1 on the proxy. [20:54:24] Coren if you enabled webserver using puppet icinga created a web server check [20:54:27] for your instance [20:54:30] it's automated [20:54:37] Which makes sense. [20:54:45] what instance it is [20:54:49] tools-webproxy [20:55:07] I don't mind the check, but that means I have to turn on /server-status I'm guessing. [20:55:14] no there is no check for httpd for that instance [20:55:20] o_O [20:55:43] it might be bug in icinga actually, we didn't check if that puppet class works recently [20:55:53] it was like half a year ago [20:55:57] or more [20:56:40] Krenair: OK, having actually looked at your link now… I don't know that that change can easily by made on a running instance. Puppet only uses that variable during the original mediawiki setup. [20:56:51] So, probably you'll have to dig into the mw config on the instance and change it to correspond. [20:57:05] Pretty sure this is an apache config thing, not mediawiki [20:57:11] petan: No great worry atm. I'll file a mental note to look into it. [20:57:25] or bugzilla ;) [20:57:33] andrewbogott: are the changes in OpenStackManager on nova-precise2 still needed? [20:57:44] it looks like something that's been merged, but I'm not sure [20:57:45] not to me [20:58:19] Ryan_Lane: I'm sure you can do a reset there. Any changes that are live there are just mirroring local stuff I have in a different git repo. [20:58:49] ah. cool [20:58:49] I've made the change to /etc/apache2/sites-available/nova-precise2.pmtpa.wmflabs manually and restarted apache, it's working [20:58:50] ok [20:58:57] I don't want puppet to come along and revert my change later [20:58:59] Krenair: Nope, that particular var is passed into install.php [20:59:36] andrewbogott, are you sure you're talking about controller_hostname, not labs_mediawiki_hostname? [20:59:43] The diff is misleading [21:00:31] Oh, um, I was talking about labs_mediawiki_hostname [21:01:01] So, you're right about what controller_hostname does. and I don't know why it wasn't working :( [21:03:51] yeah, I looked at that the other day and couldn't figure that out either [21:08:32] Krenair, Ryan_Lane, if you remain curious then I can investigate, but it sounds like the actual problem is now solved... [21:08:59] I guess we'll find out next time puppet runs [21:09:29] Well, most likely puppet was simply doing nothing. In which case the status quo will be preserved! [21:09:29] But I [21:09:35] I will force a run right now and we'll see. [21:09:36] Anyway, nova-precise2 is now accessible via the web (https-only, with an invalid cert, but it works) [21:09:53] https://wikitech-test.wmflabs.org/wiki/Main_Page [21:10:02] Ah, you gave it a public IP? [21:10:11] It already had one [21:10:40] I think Ryan did that. I doubt I can give out public IPs [21:10:52] hm, ok. [21:11:17] well, puppet just reverted it. [21:11:19] So… hm. [21:11:36] !log deployment-prep updating mediawiki-config fc22500..71e689a [21:11:38] Logged the message, Master [21:12:14] MaxSem: I have deployed your new mobile template on beta ( https://gerrit.wikimedia.org/r/#/c/52667/ ) [21:12:59] hashar, thanks! [21:13:24] MaxSem: I have no idea what is supposed to fix though, but since it harmless to the production, I have just merged it :-] [21:24:22] Ryan_Lane: Teh oddz: https://wikitech.wikimedia.org/wiki/Shell_Request/Arun289 [21:24:29] Ryan_Lane: Doesn't actually have an account. [21:31:54] !log deployment-prep Creating OAI repositories on sql and sql02 master databases [21:31:55] Logged the message, Master [21:39:08] ahh [21:39:26] and I have no idea where is the root password for the beta sql database :-D [21:39:29] I love that project [21:39:50] hashar do sudo su [21:39:53] mysql [21:39:54] :P [21:40:05] it's in password file I think and if not try password puppet [21:40:14] ah puppet [21:40:19] oh wait [21:40:25] it's not the mysql server I installed? [21:40:31] if it's the one I made password is puppet [21:40:41] but it will work from localhost only [21:40:55] petan: "sudo su"? [21:41:01] petan: Baad petan! [21:41:05] Coren yes there is password file in /root [21:41:12] petan: sudo -s [21:41:13] Coren that's why :P [21:41:29] petan: or sudo -i if you need a login shell [21:41:35] petan: been changed somehow :( [21:41:50] bad guy petan ;-) [21:41:54] okay then either get credentials from php [21:42:03] or --skip-grant-tables [21:42:18] not sure whether I will be able to create a new database with the user credentials. I can try touhg [21:42:34] yes youi will [21:43:02] but I can grant :] [21:43:02] > GRANT ALL PRIVILEGES ON oai.* TO 'mw'@'localhost'; [21:43:03] ERROR 1044 (42000): Access denied for user 'mw'@'%' to database 'oai' [21:48:39] Ryan_Lane: 1x PM, please [21:53:31] petan: all attempts failed :( [21:53:38] petan: I guess I will have to recover the root pass [21:54:06] hashar did you check if there is password file? [21:54:09] in /root [21:54:18] yeah there is one [21:54:25] but that does not let me log in [21:54:35] wtf [21:54:41] then someone had to change the root pw [21:54:47] btw are you trying it from localhost? [21:54:53] because root is local only [21:54:56] root@deployment-sql:/root# mysql -u root -p [21:55:15] ERROR 1045 (28000): Access denied for user 'root'@'localhost' (using password: YES) [21:55:25] try just mysql [21:55:29] with no parameters [21:55:48] or try turning off mysql [21:55:57] then run mysqld --skip-grant-tables [21:56:18] that will allow you to login without pw [21:56:28] then you need to change pw in mysql.users [21:56:38] flush privileges; [21:56:41] restart server [21:57:43] !log deployment-prep stopping mysql server on -sql [21:57:45] Logged the message, Master [22:00:20] petan: yeaahhh I am in [22:00:36] problem is you can;t change passwords in this mode [22:00:39] using alter user [22:00:45] so that you need to directly change table [22:00:49] with passwords [22:02:18] Ryan_Lane, https://wikitech-test.wmflabs.org/w/index.php?title=Special:NovaInstance&action=create&project=testing®ion=pmtpa is broken [22:02:35] no instance types/image types [22:04:39] !log deployment-prep Restored mysql admin password on deployment-sql [22:04:41] Logged the message, Master [22:04:44] petan: thanks :-] [22:04:54] np :) [22:15:03] !log deployment-prep Set up an OAI repository user for lucene search. Password in puppet. [22:15:04] Logged the message, Master [22:15:55] hey andre__ I heard you are located in Prague, is that true? :o [22:23:20] hashar: when you are done, I'd love some help. [22:24:19] matanya: already multitasking on three tasks, can't take anymore :D [22:24:44] classic admins problem [22:25:01] but it says you are on duty! :P [22:25:32] lol [22:25:42] matanya, can I help? [22:25:55] I think so Platonides [22:26:14] I tried to add support for disk space check on icinga.pp [22:26:45] Ryan_Lane: You might want to add Babel to wikitech [22:26:49] but it seems to be there via checkcommands.cfg [22:27:22] Reedy: ok [22:28:02] From what you and qgil_ were discussing it would probably be useful in some of those cases [22:28:09] on the other hand, I see it is also checked via nrpe, but also already there - via nrpe_local.cfg.erb [22:28:09] for userboxes etc [22:29:11] so I'm puzzled here Platonides [22:31:06] you probably want someone with more puppet experience [22:32:17] I guess so, thanks anyway! [22:32:51] sorry [22:38:51] uh, stupid question for a long-time developer: how to get the computed ArticlePath without $1 - do we have that available ? [22:39:34] Wikinaut: What do you mean? Computed article path for a given title? [22:40:49] i need the fully specified url for something like [http://www.example.org/wiki/index.php/]Special:Pagename -- the part in [... ] [22:41:04] this isn't $wgArticlePath [22:41:16] becuase $wgArticlePath has $1 in it [22:41:33] [bz] (ASSIGNED - created by: Antoine "hashar" Musso, priority: Normal - enhancement) [Bug 45814] setup OAI repository on beta - https://bugzilla.wikimedia.org/show_bug.cgi?id=45814 [22:41:33] Wikinaut, for what? [22:41:36] can use str_replace [22:41:48] to set up a default value [22:41:51] are you in php? [22:41:56] yes [22:41:56] in javascript? [22:42:00] php [22:42:02] in a mediawiki extension? [22:42:10] in my defaultsettings.php [22:42:20] uh? [22:42:27] after require_once( myextension) [22:42:40] let me start again [22:42:56] $wgArticlePath has $1 in it [22:43:06] is there already something like it without the $1 [22:43:13] or do I have to str_replace that [22:43:44] and here's what happens when you run dist-upgrade on 118 systems at once: https://ganglia.wikimedia.org/latest/?r=hour&cs=&ce=&s=by+name&c=Virtualization%2520cluster%2520pmtpa&tab=m&vn= [22:43:47] you str_replace it [22:44:37] Ryan_Lane, the host were very impacient to be updated :) [22:44:43] heh [22:44:58] well, one thing we know is the network bonding we did the other day is working [22:45:01] 300MB/s [22:45:25] Ryan_Lane: I'm about to give up for today [22:45:32] I might try some other day [22:45:38] matanya: ok. I know it's kind of a pain at first [22:45:59] thanks for starting on this :) [22:46:44] np, I wish to find a guide to the file hierarchy [23:07:56] Ryan_Lane: 1 PM [23:09:53] Ryan_Lane, poke [23:10:21] do you know why we use a a template on https://wikitech.wikimedia.org/w/index.php?title=Main_Page&action=edit ? [23:14:12] Thehelpfulone: because someone made it a template ;) [23:16:42] heh, what are your thoughts on opening that home page up to autoconfirmed editing? it might get some improvements made to it and if there is any vandalism then it can fairly easily be reverted and protection restored - I'm thinking of all the wikitech stuff that's been moved across stuff on the wikitech side probably needs updating [23:24:33] Thehelpfulone, Same thing is done on mediawikiwiki (main page template) [23:24:43] yeah I saw that [23:33:22] Thehelpfulone: yeah. likely fine [23:33:25] let me change it [23:33:57] thanks, I was making some changes to it the other day but might as well let some other people work on it ;) [23:34:50] Ryan_Lane, can we also get wmflabs.org redirect to wikitech.wikimedia.org? [23:35:12] hm. probably. [23:35:17] redirected* [23:35:20] is there a ticket open for this? [23:35:39] I don't think so, just thought of it now [23:35:46] should I create one in the domains queue? [23:40:59] !log deployment-prep setup squid to pass the WWW-Authorization headers to the Apache. Done by configuring login=PASSTHRU for each cache_peer (*crosses fingers*) [23:41:01] Logged the message, Master [23:41:12] !log deployment-prep reloading squid [23:41:14] Logged the message, Master [23:42:19] !log deployment-prep for squid login=PASSTHRU replaced by login=PASS. Reloaded squid. [23:42:21] Logged the message, Master [23:43:56] !log deployment-prep OAI repository set up on beta !!! {{bug|45814}} [23:43:58] Logged the message, Master [23:50:10] hashar: <3 [23:50:22] addshore: no [23:51:17] Krenair: hm. I wonder what I broke there [23:51:26] Krinkle: for instance names/types [23:52:16] Ryan_Lane, autocomplete fail? :P [23:52:29] which one [23:52:32] heh [23:52:34] yes [23:52:39] Krenair: [23:53:05] It took me 3 attempts to realise both Krenair and Krinkle where talking then :P [23:53:09] heh [23:53:57] * Damianz goes to defrost his toes in style [23:55:04] I HATE NAT [23:55:25] hashar: You need to try out PAT then [23:55:26] hashar: bridging? [23:55:36] yeah all of that [23:55:38] I hate it [23:56:01] so I got an instance that tries to access the public IP and thus I get a "could not connect to host" [23:56:02] damn [23:56:09] need to go poke the ops about it :-] [23:56:24] I am sick of that project to setup lucene search :D [23:56:50] Yeah that sucks ;( [23:57:04] Would like split horizon dns to get rid of the confusion for names but for ips you're screwed [23:57:42] indeed