[00:00:02] andrewbogott: everything should be working now [00:00:08] Still - the error message is insane [00:00:15] And shouldn't break everyone's stuff [00:00:34] And if you "rev_user = (SELECT user_id FROM user WHERE user_name = ?)"? [00:00:36] Damianz: Maybe the request /does/ time out and the php script gets wedged in a crazy mode? [00:01:06] andrewbogott, Damianz: quick code review? https://gerrit.wikimedia.org/r/#/c/64868/ [00:01:07] scfc_de: comparing user isn't all that much faster from user_text [00:01:15] I'd expect it to die, but it doesn't seem to [00:01:37] Damianz: Lemme try to figure out why the query is slow. It probably shouldn't be. [00:01:49] Ryan_Lane: Looks ok to me [00:01:56] +1? :) [00:01:58] Coren: I was trying to do an explain, but I can't lock the table [00:02:01] I'll +2/merge [00:02:14] (I've already tested it [00:02:15] ) [00:02:21] wait [00:02:22] shit [00:02:25] wrong thing? [00:02:35] https://gerrit.wikimedia.org/r/#/c/64868/ [00:02:44] no. right thing [00:02:53] I have too many tabs open [00:03:05] Damianz: That's normal, and pretty much unavoidable. [00:03:19] 2110 < just a little time [00:04:10] Ryan_Lane: But do you have more than me? :p [00:04:23] a930913: more what? [00:04:29] Ryan_Lane: Tabs. [00:04:33] oh [00:04:33] heh [00:04:36] Damians: select count(distinct rev_page) from revision_userindex where `rev_user_text` = 'Cydebot'; -- takes 11s [00:04:40] I don't know. I only had 40 open [00:05:02] Is 1.7 million pages likely? [00:05:13] Ryan_Lane: Only 40? I have that in just one window. :p [00:05:17] :D [00:05:26] Hm. seems about right. [00:05:27] Coren: Possibly [00:05:42] 3646663 without distinct [00:05:50] and it did seem fast, but there's doezens in show processlist with hundreds of seconds against them [00:06:11] Damianz: Note the slightly distinct query I use. [00:06:28] you're avoiding a sub select [00:06:30] On 'coren' it only takes 0.04 seconds. [00:07:24] andrewbogott: have you tested this yet? https://gerrit.wikimedia.org/r/#/c/64303/ [00:07:29] 1sec, we'll see [00:07:52] Ryan_Lane, I can log in to nova-precise2 but it's still messed up in various ways. But… I need to go soon so don't fix it now on my account :) [00:08:03] heh [00:08:10] Damianz: I think your subquery created a temporary table. :-) [00:08:17] I've tested that sudoers patch enough to make sure that the right stuff is getting into ldap. I haven't tested the function of the actual policies. [00:08:19] Would make sense [00:08:45] well, if you tested the correct stuff is going in, that's fine [00:09:00] hm. I wonder why this is timing out [00:09:09] Damianz: Did you change your query? because it's behaving MUCH better now. [00:09:14] yes [00:09:28] Just going to kill these queries, but it seems ok - hitting it wiht 1k of requests [00:09:46] Damianz: Looks like some evil interaction with php and timeout of the query. [00:10:34] Warning: There is 1 user waiting for shell: Shawnanthony420 (waiting 10 minutes) [00:10:52] andrewbogott: fixed [00:11:00] it was glance [00:11:07] Coren: Yup - it's maxing out at like 4/5 sec now... much better [00:12:46] * Damianz wonders how many more ways he can break Coren's servers in an evening [00:13:04] Damianz: It's good that you are stress testing it. :-) [00:13:25] Damianz: /me is still trying to figure out /why/ apache started going bonkers withe 75% free ram though. [00:13:30] Damianz: Just in time to catch the second vandalism peak :) http://bit.ly/180Xhfo [00:14:52] a930913: =D [00:15:54] Without being scientific and doing the whole graphing thing, I'm pretty sure this db is faster/more up to date than ts... seems more edits are going past the point of bailing out because it can't get the data. [00:16:19] Does that graph suggest that a large percentage of vandalism is done by Americans at work? [00:17:08] hm. I guess I shouldn't deploy this OpenStackManager change right before I leave, eh? [00:17:09] Then dips when workers go home, until they arrive home and vandalise a bit more. [00:17:16] especially seeing as that I'm flying tomorrow.... [00:17:41] Ryan_Lane, you mean the sec group thing, or the sudoer thing, or both? [00:17:41] Damianz: It'd make sense, it's much closer to the masters. [00:17:47] andrewbogott: both :) [00:17:57] I guess neither one will really break things [00:17:59] I can deploy tomorrow when I have a day ahead of me [00:18:10] oh. well, I'll deploy it now, then :) [00:18:13] if you'll be around to revert [00:18:18] Tomorrow I will [00:18:21] cool [00:18:30] Ryan_Lane: You're like american.. you get wireless on planes [00:18:44] Damianz: not to AMS I don't ;) [00:18:46] Damianz: Aha. A quick google sez: You're breaking the PHP memory limit. [00:18:57] Ryan_Lane, btw, the instance status pages should be working again, let me know if you see anything amiss there [00:19:02] Coren: The php memory limit isn't per /process/? [00:19:08] andrewbogott: sweet. thanks [00:19:25] Oh yeah it's AMS this weekend isn't it [00:19:33] Hm, no real instances on nova-precise2 yet :( [00:19:47] Damianz: It is, but maybe something odd goes on in the process when the script wedges on a timed out connection? [00:19:54] andrewbogott: it still errors? [00:20:16] yeah, although it was marked as 'building' briefly. I haven't investigated [00:20:39] * Ryan_Lane nods [00:20:41] Coren: It's like what you'd see with mod_php where it fills up the worker pool and maxes out the apache slap... but it's suphp, so should be pretty much true processes... and I've only ever seen that with ulimit style restrictions before [00:20:46] well, at least memory won't be it now :) [00:20:51] should make it easier to debug [00:21:13] Oh god gerrit, my poor inbox [00:21:29] 15 emails is just not needed [00:21:31] Damianz: I checked; that's not what you're hitting. I suspect a bug with the query timing out. [00:22:01] Mental note: Write custom c++ server to accept xml, return xml and query mysql [00:22:04] * Damianz rofls [00:22:22] It's weird.. but yeah [00:23:01] The stranger question is, why did it break all the users... not jsut the one since they should be forked then setuid... something must be really screwed [00:23:59] Warning: There is 1 user waiting for shell: Shawnanthony420 (waiting 24 minutes) [00:32:20] Damianz: I'll just keep an eye on it. Besides, activating the second backend is overdue. [00:33:29] going to split the apps betwean then with cold failover if required? [00:35:00] Damianz: That's the objective. [00:37:28] Warning: There is 1 user waiting for shell: Shawnanthony420 (waiting 37 minutes) [00:41:34] Coren: PHP Fatal error: Out of memory (allocated 262144) (tried to allocate 523800 bytes) in Unknown on line 0 [00:41:41] Only a few times, but ties up with something weird happening memory wise [00:41:53] * Damianz dislikes php_errors.log [00:42:01] Damianz: Why? [00:42:15] What's wrong with error_log with everything in it :P [00:42:48] Damianz: It discloses too many things atm. I'm working on a new way of doing things. [00:44:10] The problem with using php_errors (assuming it's the error log set as a php option - looks like it) is php has to actually run for it to error :( [00:45:04] It is, and you're correct. Which is why I will look for an alternative. [00:45:52] I may have to please Ryan and go virtual-host-per tool; which has other inconvenients. [00:46:20] dunno, could do .toollabs.org or someting... no ll though, that looks stupid [00:46:56] Oh, no need for dns trickery. [00:47:09] Still, that's for the After-Amsterdam [00:47:45] Hmm... I guess you could do multi host mod_userdir style, never been that mental [00:50:43] Damianz: I just improved error reporting a lot about php not starting. :-) [00:51:11] :D [00:58:34] Warning: There is 1 user waiting for shell: Ilfeld (waiting 0 minutes) [01:00:00] Coren, petan: Could you please 'for JOBID in 81662 81672 81648 81659 81660 81649 81658 81661 81669 81670; do qacct -j "$JOBID" | mail -s "$JOBID" scfc; done' on tools-master? TIA. [01:00:27] scfc_de: You keep forgetting the ';done' :-) [01:01:19] Coren: Hmmm? Do you mind the space in "; done"? :-) [01:01:56] Oh, no. Ignore me. [01:01:58] (I don't care if its one mail per job or everything.) [01:02:08] I just realized that I did the other thing last time. :-) [01:02:28] scfc_de: {{DONE}} [01:02:37] Coren: Thanks :-). [01:03:07] What pager does mail use on Ubuntu? It sucks for people used to less. [01:03:37] scfc_de: I think it's more. If I were you, I'd use alpine [01:03:45] and not mail. :-) [01:04:04] you can usually export PAGER=/usr/bin/less though [01:04:47] Well, / is now on -02 [01:04:59] Seems to work. [01:05:17] Damianz: Gonna move clubot there [01:05:32] If I got a labs account, I'd get a shell to run my bots in, right? [01:05:52] Coren: Yep, works nicely. alpine -- heaven no. Starts with "Creating subdirectory "/home/scfc/mail" where Alpine will store its mail folders." -- I don't want that, just read mail. [01:06:30] a930913: You need to be a member of the Tools project as well. [01:06:49] (Don't ever run anything on Labs bastion :-).) [01:07:05] Bleh, is it worth all the effort? [01:07:51] What effort? What's your Labs username? [01:07:52] Damianz: Tell me if that broke cluebot, but I don't expect it should. [01:08:17] Coren: Looks good here. [01:08:43] scfc_de: I don't have one. [01:09:02] I'm currently running my stuff on my computer. [01:11:02] a930913: Just follow the instructions on https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Help#Getting_Access. You'll get access to replicated databases, a 24/7 platform, and lots of other developers you can collaborate with -- so definitely worth the effort :-). [01:11:59] Warning: There is 1 user waiting for shell: Ilfeld (waiting 13 minutes) [01:14:41] scfc_de: But they want RL info, no? :( [01:15:27] I don't, never concerned me. [01:15:31] *know [01:15:49] scfc_de: Don't you have to give your real name oe? [01:16:32] I don't see a rule that requires this. [01:16:38] http://www.mediawiki.org/wiki/Wikimedia_Labs/Terms_of_use [01:19:06] a930913: First I hear of this. [01:23:37] Clearly I am a bot, because these are some of the worst capchas I've seen >.> [01:25:31] Warning: There is 1 user waiting for shell: A930913 (waiting 0 minutes) [01:30:52] a930913: I've preemptively added you to the tools project as well. [01:35:34] Coren: \o/ [01:35:38] (Doing TAR now.) [01:37:02] As a reminder, in case you haven't read all the doc, you don't want to run tools from your personal account, but create service groups for them. :-) [01:37:49] I think I might have to read the docs :p [01:37:53] Can I mosh in? [01:38:19] Yep [01:38:56] "No Nova credentials found for your account."? [01:39:00] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 1 minutes) [01:39:30] I'm not sure preemption worked :p [01:41:22] Hmm, I don't need to mosh in, I used the key on my server. :) [01:48:07] Coren: Do you need to gerfingerpoken something more so I can make a tool? [01:52:24] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 15 minutes) [01:53:06] a930913: Do you get an error message? [01:54:36] a930913: You may have to log off and back on again. [01:54:48] a930913: In fact, I'm pretty sure you have to. :-) [01:59:17] \o/ [02:00:15] So I need to make the account "local-bracketbot"? [02:01:07] No, "Add service group" ^, right? [02:05:45] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 28 minutes) [02:08:40] a930913: No, the name should be just "bracketbot". [02:09:56] scfc_de: I think it stripped it for me? [02:12:12] So I should just upload my script into the labs and run "jstart script"? [02:14:04] a930913: In the most basic form, that should be enough. [02:14:33] (If it's continuously running and should be restarted if it stops.) [02:15:20] Do I get a high port to open publicly? [02:16:11] Because my bot does its reporting via an irccat. [02:19:06] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 42 minutes) [02:21:12] There bots that report on IRC, you should probably copy their behaviour. Someone still around? Damianz? petan? addshore? [02:22:56] scfc_de: My cat listens on my computer for UDP12345 messages and prints them to IRC. [02:23:26] So if I moved the cat, I'd want to be able to send from my computer too. [02:25:27] Don't you have an IRC client?! [02:26:42] scfc_de: Eh? [02:27:14] scfc_de: As in I can cat reports to the cat. [02:27:40] And it will cat it to IRC. [02:28:46] a930913: Ah, so you want to run the bot on your computer in parallel? Then why don't you add IRC support on it there as well? Sending data from the bot by another protocol to another daemon seems like another potential point of failure to me. [02:30:34] Well, different bot stuff. The lovely thing about UDP is that it's best effort, and failure doesn't break things. [02:31:50] Though actually... If the daemon is run on the grid, how do you find it? [02:32:27] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 55 minutes) [02:33:45] I don't know, that's why I would keep it all in one place. But I don't have experience with bots that report to IRC. [02:38:38] a930913: You don't really; inbound to bots aren't supported (though outbound are) [02:39:05] a930913: So a bot can connect to IRC for instance. [02:39:56] An ugly hack I can think of is to make an HTTP carrier proxy it :p [02:40:36] Coren: I want a bot to connect to IRC on the labs, and listen on a port for incoming messages. [02:41:26] The latter isn't supported; that's too much of a security liability. OTOH, you can place a small script in your public_html that'd allow remote control via an RPC call. [02:41:46] Much more robust scheme, too. [02:45:47] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 68 minutes) [02:46:17] Coren: But how can I find my IRC bot from the public_html script? [02:47:26] a930913: The simplest scheme is to communicate through the local filesystem. You can pass information through files, the home is shared between all the nodes. [02:47:44] Coren: Filesystems, really? [02:48:04] a930913: Hey, it's well proven. Just plop your ip and port in a file for the script to find. [02:48:58] Meh, fine. [02:49:38] If I ":~$ become bracketbot" should it complain about sudo? [02:49:45] Coren: Could you explain the difference between logging and logging_userindex again? [02:49:55] a930913: What's the error message? [02:50:09] scfc_de: "sudo: sorry, a password is required to run sudo" [02:50:12] a930913: Did you create your tool with local- in the name? [02:50:25] Coren: Might have :/ [02:50:51] Known bug. :-( It will only have partially stipped it and the tool account is broken. I'll delete it and recreate it for yu. [02:51:40] * a930913 hands Coren an extra large double clawed hammer to fix it with. [02:52:05] a930913: You'll have to log off and back on (from -login) for the new group to take effect. [02:52:50] Still complaining. [02:53:11] Oh, wait. I might have forgotten a step. [02:53:48] Do I install stuff like pywikibot and twisted locally? [02:58:59] a930913: Try this. [02:59:08] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 82 minutes) [02:59:27] wm-bot: You are insane. [02:59:27] Hi Coren, there is some error, I am a stupid bot and I am not intelligent enough to hold a conversation with you :-) [02:59:32] Coren: Nope. [02:59:43] You did log off? I mean since last time? :-) [03:00:05] Logged off after you said to try it. [03:00:25] * Coren is a moron. [03:00:33] Perhaps, just perhaps, I should put you in the group? :-) [03:01:05] Hmm, maybe.. :-) [03:02:36] Ah, and it's cached locally by the ldap daemon. How... uncouth. [03:06:21] Something odd is going on. [03:07:26] Coren: BTW, is the local- issue tracked somewhere? [03:07:55] Hm. I honestly don't know. I know andrew is aware of it, but I haven't noticed if there's a bz for it [03:08:38] I'll file one just in case so it doesn't get lost. [03:11:53] a930913: Something is /really/ odd about your account in a way I haven't yet figured out. For some reason, it seems unable to actually give you membership in any tool automatically. [03:12:32] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 95 minutes) [03:12:59] Or nslcd is caching the hell out of something. [03:13:23] Coren: I do apologise for the inconvenience. [03:13:41] * a930913 mutters something about knowing it was a bad idea. [03:13:49] a930913: No, no, thanks actually. The faster those odd problems crop up, the faster they get squished. :-) [03:14:11] * a930913 hands Coren a bigger hammer. [03:24:40] a930913: I'd like to try something else. Do you have an alternate name you'd find acceptable for your tool? [03:25:09] For some reason, it's not able to fully build it with that name (and I can't figure out what could possibly be wrong with it) [03:25:21] Looks like it got partly created and broke. [03:25:50] Go create a new group, and I'll move and chown the files. [03:25:57] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 108 minutes) [03:26:09] Coren: Should I make a generic name for running all my private bots? [03:26:14] If that works, I'll have Andrew (the maintainer of that code) look into why it broke so it doesn't break again. [03:26:51] a930913: Well, depends. Some people prefer that, some prefer separating them so they can hand off maintenance or add helpers differently depending on which tool. Your pick. [03:28:37] D: [03:28:51] Doesn't let me choose 930913. [03:29:00] No, has to start with a letter. [03:29:23] * a930913 rages against that being so common. [03:29:40] It avoids ambiguity with user IDs. [03:30:06] Coren: To confirm, the 500 thrown is the confirmation page, right? [03:30:19] ... that shouldn't happen actually. [03:30:32] Lol. [03:30:46] ... and the group you just created is broken in the same way(!) [03:31:00] What in /blazes/ is wrong with your account? [03:31:16] * a930913 is just special. [03:31:22] * a930913 headdesks. [03:32:31] * Coren sees nothing wrong with it. [03:34:50] Hm. Nope, it's not your account. Service groups I create are just as broken. [03:35:02] Wikitech is teh br0ken tonight. [03:35:21] * Coren pokes the right people [03:35:28] Sorry for the inconvenience. [03:38:30] * Coren tries to see if he can find something obvious. [03:38:39] Not my code, though. It'd have to be hella obvious. [03:39:18] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 122 minutes) [03:41:31] Hm. I think I might actually have found it. [03:41:49] :o [03:42:15] * a930913 hands Coren a machete to cut the blighter out with. [03:44:33] Yep. It's a bugfix that introduced a new bug. How fun. [03:46:54] * a930913 hands Coren a printed version of wikipedia to squash the bug with. [03:47:36] Coren, still up? [03:47:51] andrewbogott: I am. Check out https://gerrit.wikimedia.org/r/#/c/64880/1 [03:49:04] andrewbogott: Tried it by hand directly on Wikitech. Fixes it. [03:49:19] Makes sense. [03:49:43] * Coren is just glad the error_log pointed at the right direction. [03:49:44] Might be good to make it case-insensitive. Also, should probably add actual error handling to the bit that was crashing when the lookup failed... [03:49:53] Are you in deep enough to do that or do you want me to take over the patch? [03:50:15] And, does that fix both problems you mentioned in the email? (service groups + sudoers) [03:50:36] Yes, the service group failed to create properly because it broke when trying to add the sudoers. [03:50:57] makes sense. [03:51:07] Feel free to take it over, that was just a panicky hotfix. :-) [03:51:50] ok [03:52:07] Do you need me to stick around? I've been up for ~20h now. :-) [03:52:51] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 135 minutes) [03:52:55] FIY, the break was at getFullyQualifiedDisplayName() that broke because the $hostobj was null. [03:53:19] Coren: So what was special about me that broke it? [03:53:38] Just guarding against /that/ would work too, I suppose, though you'd still want to guard against trying to be smart with 'all' [03:53:51] a930913: Nothing, you're just the first to stumble upon it. :-) [03:53:54] Yep, should do both [03:54:23] Coren: What changed? I was the first since that bugfix? [03:54:46] a930913: Or the first to notice. Dunno when that code was last updated -- I expect Andrew could tell you. [03:55:44] andrewbogott: I can haz sleepz? :-) [03:55:54] Yeah, I'll fix and merge. [03:55:58] Coren: Good night! :-) [03:56:03] Good tight, Andrew, and thanks. [03:56:15] a930913: You should be all set once that patch goes in. Good luck. [03:56:20] a930913, it was caused by a patch that Ryan merged right before I went out for dinner, a couple of hours ago. [03:56:35] (Um, a patch I wrote, I should add :) ) [04:06:12] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 149 minutes) [04:07:55] andrewbogott: Is it patched now? [04:08:01] not yet [04:13:25] a930913, want to try it now? [04:13:47] * a930913 cues the explosions. [04:14:20] Well, it threw the 500 again. [04:14:25] hm, ok. [04:14:44] * andrewbogott returns to the test box [04:15:11] Nope, still the sudo error. [04:15:55] How about now? [04:16:56] andrewbogott: Making another service group, or becomeing? [04:17:10] Um… making a service group. [04:17:16] If that fails then becoming will definitely not work [04:18:02] 500 [04:19:07] curious [04:19:44] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 162 minutes) [04:28:01] a930913: OK, once more? [04:30:56] 200 [04:31:52] that's good, right? [04:31:55] andrewbogott: Works :) [04:32:05] ok, lemme tie this up... [04:32:05] andrewbogott: HTTP 200 is OK. [04:32:20] Yeah, but just 'cause it returns 200 doesn't mean it did anything :) [04:32:39] andrewbogott: Could you remove the tests and bracketbot, so I can make a proper bracketbot. [04:32:53] sure, just a second [04:33:13] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 176 minutes) [04:33:24] andrewbogott: Well, it was a note of improvement on the 500 :p [04:36:39] a930913: OK, cleaned up. [04:40:35] andrewbogott: \o/ [04:46:46] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 189 minutes) [05:00:09] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 203 minutes) [05:13:30] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 216 minutes) [05:26:50] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 229 minutes) [05:40:11] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 243 minutes) [05:53:35] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 256 minutes) [06:07:00] Warning: There is 1 user waiting for shell: Fantasticfears (waiting 4 minutes) [06:07:01] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 269 minutes) [06:20:20] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 283 minutes) [06:33:53] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 296 minutes) [06:47:18] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 310 minutes) [07:00:47] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 323 minutes) [07:14:07] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 337 minutes) [07:27:32] Warning: There is 1 user waiting for shell: Yonjah (waiting 8 minutes) [07:27:33] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 350 minutes) [07:36:37] petan: bots-login doesn't allow outbound network connections? [07:37:06] liangent@bots-login:~$ sftp nightshade.toolserver.org [07:37:06] ssh: connect to host nightshade.toolserver.org port 22: Network is unreachable [07:37:07] Couldn't read packet: Connection reset by peer [07:40:53] Warning: There is 1 user waiting for shell: Yonjah (waiting 21 minutes) [07:40:54] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 363 minutes) [07:54:18] Warning: There is 1 user waiting for shell: Yonjah (waiting 35 minutes) [07:54:19] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 377 minutes) [08:07:51] Warning: There is 1 user waiting for shell: Yonjah (waiting 48 minutes) [08:07:52] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 390 minutes) [08:21:20] Warning: There are 2 users waiting for shell, displaying last 2: Yonjah (waiting 62 minutes) Damzow (waiting 7 minutes) [08:21:21] Warning: There is 1 user waiting for access to tools project: A930913 (waiting 404 minutes) [08:23:30] liangent it should allow it [08:23:50] liangent I often connect to foreign servers, I would rather say toolserver is down [08:25:25] !tr A930913 [08:25:25] request page: https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Access_Request/A930913?action=edit talk page: https://wikitech.wikimedia.org/wiki/User_talk:A930913?action=edit§ion=new&preload=Template:ToolsGranted link: https://wikitech.wikimedia.org/w/index.php?title=Special:NovaProject&action=addmember&projectname=tools [08:25:31] !rt A930913 [08:25:31] http://rt.wikimedia.org/Ticket/Display.html?id=A930913 [08:25:42] lol [08:27:02] Coren|Sleep if you grant someone access to tools project, you need to also flag the request as completed, or the bot will keep complaining... :/ [08:27:25] if you don't like this design you can of course invent a better one [08:28:19] a page with three iframes, corresponding to each of those to-be-edited pages? :p [08:32:01] !petan- [08:32:01] There are multiple keys, refine your input: petan-build, petan-forgot, [08:32:05] !petan-build [08:32:06] make -j `getconf _NPROCESSORS_ONLN` deb-pkg LOCALVERSION=-custom [08:33:37] petan: I can ssh to nightshade at ts from my own computer [08:34:52] Warning: There are 2 users waiting for shell, displaying last 2: Yonjah (waiting 75 minutes) Damzow (waiting 20 minutes) [08:37:51] liangent let me try [08:39:19] yes it is weird but connection to other hosts work [08:39:24] just toolserver is broken :/ [08:39:34] maybe ask Ryan or someone who is responsible for network [08:48:21] Warning: There are 2 users waiting for shell, displaying last 2: Yonjah (waiting 89 minutes) Damzow (waiting 34 minutes) [09:01:46] Warning: There are 2 users waiting for shell, displaying last 2: Yonjah (waiting 102 minutes) Damzow (waiting 47 minutes) [09:10:05] !rq Yonjah [09:10:05] https://wikitech.wikimedia.org/wiki/Shell_Request/Yonjah?action=edit https://wikitech.wikimedia.org/wiki/User_talk:Yonjah?action=edit§ion=new&preload=Template:ShellGranted https://wikitech.wikimedia.org/wiki/Special:UserRights/Yonjah [09:10:10] !rq Damzow [09:10:10] https://wikitech.wikimedia.org/wiki/Shell_Request/Damzow?action=edit https://wikitech.wikimedia.org/wiki/User_talk:Damzow?action=edit§ion=new&preload=Template:ShellGranted https://wikitech.wikimedia.org/wiki/Special:UserRights/Damzow [09:15:11] Warning: There is 1 user waiting for access to tools project: Tb (waiting 5 minutes) [09:16:22] !tr Tb [09:16:22] request page: https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Access_Request/Tb?action=edit talk page: https://wikitech.wikimedia.org/wiki/User_talk:Tb?action=edit§ion=new&preload=Template:ToolsGranted link: https://wikitech.wikimedia.org/w/index.php?title=Special:NovaProject&action=addmember&projectname=tools [09:18:01] Coren|Sleep: how the replica users are being created? [09:18:05] is it automatic? [09:33:15] Do I install stuff like pywikibot and twisted locally? [09:33:44] define locally [09:34:38] In my/the tools space. [09:34:47] you can install it locally to /tmp or you can install it to your home which is shared on all instances, or you can install it to /shared which is shared too. everywhere. [09:35:34] locally == local to instance || locally == local to user? [09:37:28] Okay, so pywikibot is in /shared. [09:38:49] And twisted is installed. [09:38:55] That's me for now. [09:38:59] :o [09:39:19] But let's say that I wanted "pip install txws" [09:39:27] What's the right way to do it? [09:40:03] request it to be converted to .deb package OR convert it yourself to .deb package and request it to be installed project wide [09:40:45] !pypi [09:40:45] https://pypi.python.org/pypi/stdeb#the-commands [09:41:26] Oh cool. [09:42:08] petan: Any restrictions on the internal ports I use? [09:42:18] depends [09:42:25] (High) [09:42:32] instances aren't accessible from outside, except for -dev -login and -proxy [09:43:01] otherwise there are no restrictions I know of as long as you need to access these ports on local instances only [09:50:05] petan: Just so something doesn't blow up, talk me through my first job? (A bot for logging to IRC.) [09:51:12] I just run "jsub -once python bot.py"? [09:52:57] a930913: if it needs to run continuously, use -continuous [09:53:10] a930913: in any case, add -N jobname; otherwise it will use 'python' as job name [09:53:33] e.g. jsub -once -N myawesomeircbot python bot.py [09:53:50] Is the jobname unique to me? [09:56:02] yes [09:57:15] a930913 I recommend you to cron a job that would check if your bot is running and if not it start it [09:57:25] because IMHO the grid engine is very much not reliable in this [09:58:20] petan: So my job can break without warning and not restart? [09:58:46] that is my experience [10:01:07] So the bot is running, but it's not connecting to freenode (it hasn't joined.) Is there some magic in the way? [10:01:23] yes... there are some limits from side of freenode [10:01:34] it probably can't connect because there are too many bots already connected [10:04:01] !mail [10:04:01] we have a mailing list labs-l@lists.wikimedia.org feel free to send a message there, don't forget to subscribe [10:04:12] I just ran it without jsub and it worked. [10:06:17] Is it only the grid that has the overload? [10:11:50] unlikely [10:12:01] is your bot providing any debug output? [10:13:19] Coren|Sleep: Seems to be good ;) [10:13:31] Nope :( [10:14:11] I presume it's debugging to jobname.out/.err? [10:15:04] yes [10:15:38] It's a Heisenbug :( [10:17:04] ? [10:20:41] It's a bug than doesn't appear when observed. [10:20:47] aha [10:20:53] I just tried it on another network and it worked. [10:48:57] So confirmed that it's because there are too many connections already. [10:49:17] Is there another irccat I could use? [10:49:56] Does wm-bot have an API? [11:09:32] a930913 it can be extended... but [11:09:40] not sure if it's suitable for your needs [11:10:23] you would need to describe a bit what you actually need to do, if you just need to relay some text in another channel, that would be easily doable [11:13:44] Warning: There is 1 user waiting for shell: Ronengi (waiting 0 minutes) [11:15:55] petan: At the moment, I have my bot listen on udp12345 for any messages, and it prints the messages to IRC. [11:27:13] Warning: There is 1 user waiting for shell: Ronengi (waiting 13 minutes) [11:40:37] Warning: There is 1 user waiting for shell: Ronengi (waiting 27 minutes) [11:48:29] petan: I had already asked the #freenode staff and they're on it. [11:52:32] petan: Your advice about not trusting -continuous is well intended, but not necessary. Since my grid has been up, tasks have been properly restarted through restarts of the exec notes, switching filesystems, master crashing by human error, and the whole VM hosts being rebooted. :-) [11:54:05] Warning: There is 1 user waiting for shell: Ronengi (waiting 40 minutes) [11:56:57] a930913: for that, you can also use http://n.tkte.ch/ [12:01:30] valhallasw: Yeah, but that's offsite. [12:07:34] Warning: There are 2 users waiting for shell, displaying last 2: Ronengi (waiting 54 minutes) Amirber (waiting 2 minutes) [12:22:11] petan: Are you going to be in Amsterdam? [12:24:16] Would jobname.out grow to infinity? [12:27:44] a930913: It can, if you let it, though I'm going to start giving you hell about it before it reaches TB range. :-) [12:28:16] a930913: I'll probably whip-up a tool by which users can add a logrotate as needed. [12:28:24] In my Copious Free Time™ :-) [12:29:50] Coren: So I should cut down on output until then? (Not that it's particularly much.) [12:30:02] Though I could just delete it once in a while. [12:30:20] a930913: It's not worrisome unless it grows to gigantic size really fast; just a regular cleanup will do. [12:33:13] I wasn't so concerned about it taking up space, I was wondering how long it would take for me to scroll to the bottom :p [12:36:47] man tail [12:36:48] :-) [12:38:42] more | cat | tail s [12:45:24] Coren yes [12:53:50] Coren not for my bot ;) [12:54:06] Coren my bot has been crashing on grid and wasn't recovered in any way [12:55:10] petan: Perhaps your bot /was/ crashing the grid? :p [12:55:27] petan: Are you actually starting it -continuous? [12:55:32] let me check [12:55:47] petan: Also, there is one error that - by design - the grid will not recover from: busting your memory allocation. [12:56:29] Coren jstart -continuous ? [12:57:10] Yay. We're breaking labs. [12:57:25] busting memory allocation is most frequent reason for my bot to crash :x [12:57:29] If you just jstart, the -continuous is implied. [12:57:53] I am doing: jstart -mem 800m /data/project/afcbot/start_bot2.sh >> $logfile [12:57:55] Coren, I'm hungry [12:57:59] :p [12:58:00] Warning: There is 1 user waiting for shell: Tom Morris (waiting 0 minutes) [12:58:19] petan: I'll help you look at it and see if we can help it during the Hackaton. :-) [12:58:20] !rq Tom_Morris [12:58:21] https://wikitech.wikimedia.org/wiki/Shell_Request/Tom_Morris?action=edit https://wikitech.wikimedia.org/wiki/User_talk:Tom_Morris?action=edit§ion=new&preload=Template:ShellGranted https://wikitech.wikimedia.org/wiki/Special:UserRights/Tom_Morris [12:58:37] * a930913 hands Cyberpower678 a cookie. [12:58:40] Coren, sudo make me a sandwhich. [12:58:46] :D [12:59:35] * Cyberpower678 takes the cookie from a930913 and eats it with gratitude. [12:59:55] Cyberpower678: :o It was a tracking cookie. [13:00:17] * a930913 tracks Cyberpower678. *Blip* *Blip* [13:00:40] a930913, it won't be able to track once I digest it. :p [13:00:58] * Cyberpower678 eats a few lemons to speed up the process. [13:02:16] Cyberpower678: I wouldn't do that if I were you, you'll release the laxatives. [13:02:25] :P [13:02:35] a930913, want a Lager? [13:03:14] Sure. [13:03:30] * Cyberpower678 gives a930913 a Lager? [13:03:38] no question mark. [13:04:08] * a930913 filters the lager for poisonous substances. [13:05:03] * a930913 pours into his hydrolic fluid reserve. [13:05:15] ??? [13:05:35] Cyberpower678: What? Was it something I said? [13:05:48] hydrolic fluid reserve? [13:06:32] a930913, oh right. You're an incredibly sophisticated bot. [13:07:05] Cyberpower678: The 930913rd revision, still in alpha though. [13:07:13] * addshore throws u[p [13:07:19] a930913, :o it was a key Lager. [13:07:22] addshore hey [13:07:26] hey petan :) [13:07:31] * a930913 catches whatever addshore threw up. [13:07:50] a930913: its mostly beer, and then weird blue stuff, and burgers ;p [13:08:20] * Cyberpower678 is now tracking a930913 keystrokes. He now knows his Wikipedia password. :p [13:08:22] Cyberpower678: We are talking about the intoxicating liquid, right? [13:08:31] Oh, I just got that... [13:08:35] * a930913 groans. [13:08:56] LOL [13:08:59] :D [13:09:10] a930913: yes :P [13:09:21] addshore, are you in Germany? [13:09:32] O_o [13:09:39] no, I am in bed :D [13:09:55] in about 24 hours ill be in the middle of the uk [13:10:02] Who here isn't in bed out of curiosity? [13:10:10] You had beer and burgers in bed? o.O [13:10:10] addshore: Poor you. [13:10:11] I am in office now [13:10:12] and in 36 hours ill be in amsterdam ;p [13:10:24] 36? [13:10:28] I will be there soon :P [13:10:28] Cyberpower678: beer and burger was last night ;p [13:10:31] petan: Your office needs beds :p [13:10:48] addshore, you need to get drunk more. :p [13:11:01] petan I catch the first train of my epic journey in about 16 hours :) [13:11:11] train? [13:11:11] Cyberpower678: that was not physically possible last night [13:11:12] :D [13:11:15] addshore, train? [13:11:26] petan: Im getting a train to a plane ;p but the plane is on the other side of the country ;p [13:11:40] ???? [13:11:57] That's about the wierdest travel plan I've ever heard. [13:12:12] :D cheapest [13:12:34] ewww. Coach trains. [13:12:52] I prefer first class if I'm sitting in a train for that long. [13:16:36] mmm where did the documentation for the jobs / grid stuff in Tools Labs go? [13:17:40] !tooldocs [13:17:40] https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Help [13:17:50] YuviPanda: ^^ there [13:17:56] ah [13:18:01] thank you :) [13:18:13] I take it that glusterfs has been killed? My Git problems seem to have disappeared automagically :) [13:22:17] so... hows peacy? :) [13:22:29] peachy... [13:22:34] YuviPanda: It's gone. For good. :-) [13:23:18] Coren: hows dbs looking? ;p [13:24:03] addshore: They looking pretty good, I think. People are having fun hammering on them. [13:25:12] * Coren flies to London, takes the Eurostar to brussels, and drives to AMS from there. Making a vacation of it with my SO, we'll linger on the way back in the UK. :-) [13:26:33] Coren: are you on the way to AMS? [13:26:46] YuviPanda: Leaving in a couple hours. [13:26:50] ah nice [13:27:00] dammit, will I never have luck with tool labs? [13:27:09] now my jobs are getting sigkilled with empty .out and .err files :( [13:27:23] Coren: note that parking in amsterdam is hellish [13:27:29] sigkilled? You probably lack memory. What language are they working on? [13:27:38] valhallasw: We have a parking spot at the hotel, so we be good. [13:27:39] python [13:27:49] Coren: perfect! [13:27:55] but there's nothing memory-biggish happening, it's just shelling out to git [13:28:10] YuviPanda: add -m a -M to jsub [13:28:15] YuviPanda: Python is a vmem hog. Try starting at -mem 1g, you can then see how much it really needs and scale down if apropriate. [13:28:16] YuviPanda: then you will get an e-mail on abort [13:28:28] 1g? wah [13:28:34] valhallasw: i'm using -sync [13:28:39] vmem, not hard memory. [13:28:48] will email give me more details? [13:29:00] Coren: 1g works [13:29:01] YuviPanda: ok, then try qstat -j , I think that should be available for a period fo time after the run [13:29:03] let me scale it down [13:29:39] valhallasw: ah, thanks! [13:30:01] YuviPanda: if that doesn't work, add a time.sleep(100) to your python script and monitor qstat -j while it runs [13:30:04] hmm, it doesn't exist right after it died. [13:30:13] oh wait, it getsw killed [13:30:14] never mind [13:30:25] valhallasw: increasing memory fixes it! [13:30:28] works at 512m too [13:31:02] YuviPanda: anyway, while it runs you can monitor qstat -j, and see the actual memory usage [13:31:05] * addshore plans to try and rewrite his php framework on the way to AMS xD [13:31:22] valhallasw: Or look at http://tools.wmflabs.org/?status [13:31:38] wah! [13:31:38] that's neat! [13:32:16] hmm [13:32:16] CPU: 0.3s VMEM: 206M/488M (peak 243M) [13:32:25] 512M should be fine I think [13:32:52] Coren: still no native WSGI, I presume? [13:32:55] Caveat: gridengine uses powers of 10 when parsing -mem, the status page reports in powers of 2. [13:32:59] Coren how do I display mem usage using terminal [13:33:09] petan: qstat -j [13:33:10] YuviPanda: That's on the between-Amsterdam-and-Hong-Kong list. [13:33:14] ah, nice [13:33:32] so I think instead of fighting with CGI and Python I'll just write that part in PHP [13:33:39] petan: or top if you want local info (or probably ps, but no idea how to do that) [13:33:41] Coren: oh, and I'm going to shell out to jsub from PHP [13:33:48] sounds okay to me [13:33:52] is that a Bad Thing? [13:34:05] YuviPanda: That's a good way to do it; the web servers are submit hosts exactly for that reason. [13:34:09] as long as you do your shell escaping right :-) [13:34:10] wonderful [13:34:18] hehe, yes [13:34:29] that was one of the reasons I wanted python, actually. sh is a nice library that does proper escaping... [13:34:34] oh well, PHP then [13:34:57] YuviPanda: simply python CGI is not that bad [13:35:10] I'm told so, but my last experience with that was very traumatic [13:35:10] just slow [13:35:20] that was a few years ago, but still... [13:35:31] besides, I can always poke people if it is in PHP and not working on Tool Labs :P [13:35:36] valhallasw: usage 1: cpu=03:23:47, mem=2471.10034 GBs, io=0.01148, vmem=423.840M, maxvmem=423.840M [13:35:40] and I'll port to Python as soon as Coren does WSGI [13:35:58] how do I read how many memory was eaten and how much remains in vmem quota [13:36:09] * addshore spots replica.my.cnf [13:36:30] addshore production db replica is online on labs [13:36:32] petan: vmem=423.840M, maxvmem=423.840M [13:36:34] petan: vmem is what you are using (maxvmem being how high it peaked at). I think you should see your current h_vmem value there [13:36:38] what hosts? :P [13:36:42] Warning: There is 1 user waiting for shell: Ydanziger (waiting 0 minutes) [13:36:43] I missed all of this ;p [13:36:44] h_vmem is a few lines above [13:37:00] valhallasw ok so that line say the task is using 100% of resources? o.O [13:37:03] Coren ^ [13:37:06] ? no [13:37:07] addshore: https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Help#Database_access [13:37:11] ;p [13:37:16] ok so, how do I count the % [13:37:25] petan: h_vmem is your quota [13:37:31] petan: vmem is your current usage [13:37:31] petan: h_vmem is what you requested; vmem is what you are using, maxvmem is the max of what you have used [13:37:41] petan: maxvmem is how high you've ever gotten [13:37:44] ok, I know ut how do I display how much the task is using NOW [13:37:59] vmem. [13:37:59] the number which I can see on website... [13:38:07] That's vmem [13:38:15] usage 1: cpu=03:23:47, mem=2471.10034 GBs, io=0.01148, vmem=423.840M, maxvmem=423.840M [13:38:26] mmm, this is nice. time to reward myself with a lungful of carcinogens. [13:38:28] vmem=423.840M <-- current usage [13:38:34] thanks for the help, Coren, valhallasw [13:38:36] aha [13:38:52] so this task has 80mb of memory remaining... [13:39:12] Assuming your h_vmem is 500m, yes. :-) [13:39:22] we could probably create a script that would list the tasks that are getting out of memory / send a mail to maintainers before crash [13:39:46] If you do, you probably want to do qsub with -xml for the script to parse. [13:39:59] hmm [13:42:00] Coren you could at least display a "free" memory of each task in % on that web page and display in red if it's bellow 10% etc [13:42:44] !taskinfo is qstat -j "task-id" [13:42:45] Key was added [13:49:45] petan: Can I send stuff to wm-bot to print to a channel? [13:50:07] Warning: There is 1 user waiting for shell: Ydanziger (waiting 13 minutes) [13:50:19] hm... technically possible, let me implement that somehow [13:51:14] How does it do those warnings? [13:51:23] it's a plugin [13:51:25] for wm-bot [13:51:41] http://bots.wmflabs.org/~petrb/db/systemdata.htm [13:52:31] petan: How could I make a cat plugin? [13:52:38] I will do that... sec [13:54:52] petan: As in I've got the stuff I use already if it helps. [13:55:21] I would just open a tcp port you would send your text in format channel: text [13:55:27] like #wikimedia-labs: hi [13:56:32] No semicolon, just "#wikimedia-labs Hi." [13:57:22] And preferably on UDP too, as knowinly unreliable is more reliable than unknowingly reliable. ;) [14:02:00] tcp should be enough... irc runs on tcp as well [14:02:26] petan: Yeah but tcp breaks stuff. [14:02:26] !log tools tools-webserver-02 now live; / and /cluebot/ moved there [14:02:28] Logged the message, Master [14:02:38] tcp breaks stuff? since when? [14:02:57] petan: There are pipes to break that need reconnecting. [14:03:07] UDP you just send and forget. [14:03:11] Coren is 02 using the apache I installed or another one? [14:03:33] Warning: There are 2 users waiting for shell, displaying last 2: Ydanziger (waiting 26 minutes) Alonga2811 (waiting 12 minutes) [14:03:53] petan: Same as -01; the move to 2.4 is a wash for now, too many incompatibilities with modules. I'll look into it again once things settle after Amsterdam. [14:04:39] a930913 what tcp port you like? [14:04:49] 12345 [14:05:30] petan: Are you not doing UDP then? :'( [14:05:56] eh... maybe later when I have a lot of time :P or you can do that yourself this bot is opensource [14:06:13] I am doing tcp 65834 (12345 is too bleh for me) [14:06:44] petan: Where's the sauce? [14:07:23] Type @commands for list of commands. This bot is running http://meta.wikimedia.org/wiki/WM-Bot version wikimedia bot v. 1.10.8.10 source code licensed under GPL and located at https://github.com/benapetr/wikimedia-bot [14:11:16] so in theroy this would work? [14:11:30] Server: enwiki.labsdb Database: enwiki_p details from replica.my.cnf ? [14:12:24] a930913 so are you sure you want to use space as separator? [14:12:29] #channel text bla bla [14:12:54] petan: Because channel names can never contain a space. [14:13:20] petan: #channel :text bla bla ;-) [14:13:29] valhallasw that is what I would prefere [14:13:29] It's how all cats I've worked with have done it, so there must be something in it. [14:13:30] for IRC style points [14:13:37] valhallasw but he wants just space with no colon [14:13:44] idc [14:14:03] a930913 ok no problem [14:14:07] petan: botinstance.say(data.partition(" ")[0],data.partition(" ")[2]) [14:14:09] :p [14:14:19] sounds evil [14:14:23] hmm, I dont seem to be able to see anything except for the infomation schema :/ [14:16:58] Warning: There are 2 users waiting for shell, displaying last 2: Ydanziger (waiting 40 minutes) Alonga2811 (waiting 26 minutes) [14:16:59] addshore: Where are you connecting and how? [14:26:21] LOL [14:26:39] port is definitely not going to be that... [14:26:52] try this 64834 [14:28:06] !log tools petrb: installed telnet to -dev [14:28:07] Logged the message, Master [14:28:32] !log tools petrb: installed netcat as well [14:28:33] Logged the message, Master [14:29:00] petan: What IP is wm-bot? [14:29:08] hold on [14:29:10] testing it [14:31:00] seems to work [14:32:26] File not found modules/modules/NetCat.bin [14:32:32] Loaded module modules/NetCat.bin [14:32:40] fuck [14:32:48] Unable to load module modules/NetCat.bin [14:32:53] Kasplasm? [14:32:59] @system-rm NetCat [14:32:59] Unloaded module NetCat [14:33:02] Loaded module modules/NetCat.bin [14:33:10] here we go [14:33:16] Coren: I just dont seem to be able to connect :/ [14:33:24] @relay-on [14:33:43] OMG [14:33:44] addshore: I need more information before I know what's going on. What are you trying to connect to exactly? What command line? [14:33:46] I am idiot [14:33:48] @@relay-on [14:33:48] Relay was enabled [14:33:55] wait ignore me Coren >.< [14:34:01] hi [14:34:06] o/ [14:34:09] apparently im too hungover to notice anything or do anything properly :P [14:34:14] it works! :D [14:34:17] :D [14:34:24] There's an extra space :p [14:34:33] a930913 so try telnet bots-labs 64834 [14:34:40] really? :/ [14:34:45] damn [14:35:21] Testing. [14:35:40] I see [14:35:55] but it is still better than nothing, or not? :P [14:35:59] But \o/ [14:36:07] So, I wonder... [14:36:29] <3 petan [14:37:03] Damianz stinks [14:37:11] :O [14:37:15] * Damianz sprays deodrant in wm-bot's face [14:37:20] Testing more. [14:37:31] echo "#wikimedia-labs Testing more." > /dev/tcp/bots-labs/64834 [14:37:40] probably works [14:37:51] Just did ^ :) [14:38:06] Testing more. [14:38:29] let's try... echo "#wikimedia-labs Damianz is cool" > /dev/tcp/bots-labs/64834 [14:38:37] Damianz is lamest [14:38:39] oh [14:38:41] I think a bug? [14:38:48] Clearly the bot has issues [14:39:00] I think it's an interface error. [14:39:10] One between chair and keyboard. [14:39:11] * Damianz gets cluebotng to ban wm-bot for being vandalistic [14:39:18] :o [14:39:28] :'( [14:39:36] * Damianz pats wm-bot on the nose [14:40:09] Coren: :D http://tools.wmflabs.org/addshore-dev/addwiki/grep/grep.php?pattern=.pdf&lang=en&wiki=wiki&ns=6 [14:40:38] @add ##930913 [14:40:38] Permission denied [14:40:44] love it :) [14:40:55] a930913 you need to do this in #wm-bot [14:40:56] addshore: .+ loads slowly :( [14:41:14] or any channel you are admin in... [14:41:15] addshore: It is teh workz! [14:41:34] @relay-off [14:41:34] Relay was disabled [14:41:39] Damianz: indeed :O [14:41:53] Is that using sql with REGXP? [14:41:56] a930913 you will also need to enable it by doing @@relay-on which in future will be just @relay-on [14:41:58] :D [14:41:59] * Damianz wonders how he can break it [14:42:20] petan: How does doing it in a channel I'm an admin in help, if that's the one I want to add it to? :p [14:42:32] I mean bot admin [14:42:36] like when you do @whoami [14:42:41] Oh. [14:42:43] join #wm-bot [14:42:45] Coren: PHP seems happy now - not failed overnight [14:42:48] type @add #bla [14:43:15] Done :) [14:43:15] Damianz: That was... odd. It really seems that queries timing out was at the root of the problem. [14:43:36] mhm... I bet it's the way the php<>c api is written [14:44:28] * Damianz goes back to stabbing sophos to death [14:50:54] hi [14:51:37] a930913: The links fail :( [14:51:41] commons.wikimedia.beta.wmflabs.org - pleas enable the Translate extension. [14:51:46] Thank you^^ [14:51:54] Damianz: Links? [14:52:14] en.wiki.org vs en.wikipedia.org [14:52:49] Damianz: Where? [14:53:06] petan: :D [14:53:19] a930913? [14:53:27] petan: Spacefix. [14:53:32] yes it is fixed now [14:53:34] http://pastebin.com/JGBf73si < like all links [14:53:45] actually [14:53:46] ffs [14:53:53] opendns... that explains it [14:54:27] Anyway yeah, invalid domain name it seems [14:55:28] http://pastebin.com/5HDAvYmH < invalid subdomain [14:55:44] I HATE dns services that intercept dns responses and inject their advertising bs [14:56:01] hi petan: commons.wikimedia.beta.wmflabs.org - is possible to install Translate extension.? [14:56:17] hi, yes [14:56:20] @notify hashar [14:56:20] This user is now online in #wikimedia-dev. I'll let you know when they show some activity (talk, etc.) [14:56:22] hashar: ^ [14:56:24] hashar can do that [14:56:29] hashar ^^^ [14:56:36] hashar we all need you! <3 [14:56:42] do what ? [14:56:50] commons.wikimedia.beta.wmflabs.org - is possible to install Translate extension.? [14:56:51] break beta... err install an extension [14:57:08] Translate is already on beta iirc [14:57:22] niklas did it a few days ago [14:57:36] okay :) [14:57:50] thoughts? :) >> http://grab.by/mMxm [14:57:51] thx :D [14:58:04] @relay-on [14:58:04] Relay was enabled [14:58:26] Nikerabbit: haven't you installed Translate on beta ? [14:59:15] Coren, petan, I just wanted to let you know that I ported a script that uses the replicated databases from toolserver to Labs today; not only did it work fine, but a script that typically took 3-4 hours to complete on TS was finished in 3-4 minutes here! good job!! [14:59:17] hashar: nope [14:59:39] only ULS [14:59:50] russblau good job of Coren, I can't really do much in regards with production dbs ;) [15:00:15] the translation memory enviroment is not duplicated either [15:00:50] log test test [15:01:15] log test test [15:01:32] find out the potential optimization at: if ( substr( $filename, 0, 1 ) != '/' && substr( $filename, 1, 1 ) != ':' ) [15:01:34] :D [15:01:59] !log test [15:01:59] Message missing. Nothing logged. [15:04:25] test [15:04:44] !log bots petrb: blah [15:04:47] Logged the message, Master [15:04:48] yay [15:04:51] I fixed bottie [15:04:57] actually I replaced it with wm-bot [15:05:42] !log nagios petrb: restarted nagios irc bot [15:05:44] Logged the message, Master [15:11:13] russblau: \o/ [15:28:44] How do I run piped commands on the grid? [15:29:27] You mean run the whole pipe? [15:29:44] Coren: As in "foo | bar" [15:29:46] Easiest way is to shellscript it. Or push a bash -c "your|pipe" [15:30:05] The latter is a little trickier because of quoting. [15:30:15] But should work just as well. [15:44:10] Coren: do you know about networking on lab instances? [15:44:48] liangent: In general, yes. [15:47:33] Coren: so: [15:47:35] liangent@bots-login:~$ ssh nightshade.toolserver.org [15:47:35] ssh: connect to host nightshade.toolserver.org port 22: Network is unreachable [15:48:29] There's no reason that shouldn't work... [15:48:44] Oh! I'm betting it's trying to use IPv6 by default! [15:48:53] Try with -4 [15:49:05] liangent@bots-login:~$ ssh -4 nightshade.toolserver.org [15:49:05] ssh: connect to host nightshade.toolserver.org port 22: No route to host [15:49:23] o_O [15:50:18] That's downright bizarre. I can ssh anywhere /else/ [15:50:54] Oh... DaB. You didn't. :-( [15:51:35] Packets get all the way to Nightshade. Something /there/ is blocking ssh traffic. [15:51:41] Coren, could it be because toolserver.org is using IPv6 addresses now? [15:51:54] russblau: It has both, and we tried -4 explicitly [15:52:05] oh well [15:52:08] toolserver doesn't like labs now [15:52:24] I can connect to nightshade from everywhere else, and I can connect everywhere else from the tools project. [15:52:37] * Coren sighs. [15:53:11] btw ssh to labs from toolserver works [15:53:43] liangent: Yeah, clearly only Labs -> TS is blocked somehow. [15:54:17] And I'm pretty damn sure it's not on our end. [15:56:56] Yarrow is the same. [15:58:15] Coren: ping works [15:58:28] why do you even need to connect to TS now? [15:58:47] valhallasw: Yeah, I know. I can tracepath to it too, so it's clearly not a routing issue. [15:59:18] TS was always highly unreliable [15:59:27] 'Network is unreachable' implies a routing issue, though [15:59:53] Coren: Got it :) (Only half an hour to debug D: ) [16:00:25] valhallasw: Not necessarily; firewall rules generally let you pick the reject cause. [16:00:42] petan: to transfer data between them [16:00:44] Coren: ah, ok [16:00:45] of course this is the general reason to do networking [16:01:06] anyway any user migrating from toolserver need to move/copy data from TS to labs [16:04:28] What's the correct way to cron? Cron the jsub? [16:05:39] a930913: Right. You might want to -once though, to make sure you don't start something that's already running. [16:09:38] "dc: command not found" :o [16:13:25] Coren: How do I get that installed? [16:14:28] dc? Heh. Odd request. You ask one of the sysadmins, who'll deploy it to the environment. Like me. :-) [16:14:42] Nominally, opening a bugzilla for package requests is the best way. [16:15:47] dc seemed the easiest way to do 6/5 in bash :) [16:16:05] $[6/5] [16:16:19] Coren: 1 [16:16:30] Ah, you want floaties. :-0 [16:16:35] It only does integers. [16:16:36] Yeah. [16:16:50] Plus, because it's RPN, it's cool :D [16:18:06] Coren: So how much of your Copious Free Time™ will it take? :p [16:18:17] I'm pushing the update now. [16:19:09] {{done}} [16:19:12] Coren: Would you consider yourself a sysadmin, a devop or a developer? [16:19:24] Oh, huh. I should have done it on -login too. :-0 [16:19:26] Coren 1 + [16:20:37] I'm a sysadmin that was a dev in his distant past, I rarely have time to code more than glue and integration, but I can still code at need. :-) [16:20:47] Coren: so should I contact toolserver people too? [16:20:58] or you'll do it [16:20:59] liangent: Probably. [16:21:18] maybe direct talk between ops resolves the issue faster [16:21:29] liangent: It'd weigh more coming for a user than from "the competitor" (I really wish Daniel didn't take Labs so personally) [16:21:39] a930913: {{done}} on -login too [16:24:31] Coren: dab is not around [16:24:33] so what information should I provide? [16:24:46] Coren: It works :D [16:26:05] liangent: Just that ssh from tools-login to nightshade fails with Network Unreachable. [16:50:24] !taskinfo [16:50:24] qstat -j "task-id" [16:51:47] Is "mem=0.28129 GBs" really GB? [16:52:43] that's like 281MB if I'm reading it right [16:55:00] Damianz: I'll admit my irssi is bigger than that, but somehow I don't think a bot listening to IRC would hog that much. [16:55:20] Actaul memory or virtual memory though. [16:55:42] usage 1: cpu=00:00:02, mem=0.30664 GBs, io=0.00531, vmem=180.191M, maxvmem=180.191M [16:56:04] I like how vmem is in M and mem in GBs [16:56:27] That's why I don't think it's right. [16:57:07] Above - 14:38 < petan> usage 1: cpu=03:23:47, mem=2471.10034 GBs, io=0.01148, vmem=423.840M, maxvmem=423.840M [16:57:17] lol [16:57:27] 2.4T of ram would be awesome [16:57:46] Damianz: What would you do with it? :p [16:57:58] Run kernel builds in it [16:58:12] Wouldn't believe how fast cross compiling a kernel for arm is in a ramfs vs on disk [17:16:10] Coren, are you skilled in the ways of the apt? [17:16:31] Hm, Coren is probably on an airplane [17:27:51] Coren|Travel: http://www.onlamp.com/pub/a/databases/2006/08/10/mysql-federated-tables.html?page=4 [17:28:06] the bitr about aggregate functions worries me [17:38:35] petan: Could you run 'for JOBID in 82218 82284 82291 82292 82293 82294 82295 82304 82305 82306 82307 82308 82309 82310 82311 82318 82319 82320 82323 82325 82326 82327 82335 82336 82337 82338 82339 82347 82348 82349 82940 82941 82942 82943; do qacct -j "$JOBID" | mail -s "$JOBID" scfc; done' on tools-master, please? TIA. [17:39:29] petan: I had requested freenode allow more connections a few weeks ago and didn't get any response [17:40:30] Ryan_Lane: Apparently there's a ticket. [17:40:40] I have added replication lag stats to Ganglia (http://ganglia.wmflabs.org/latest/?r=hour&cs=&ce=&m=load_one&s=by+name&c=tools&h=tools-login&host_regex=&max_graphs=0&tab=m&vn=&sh=1&z=small&hc=4) -- single digit -- seconds! That feels cozy. [17:41:35] ah. neat. thanks [17:43:59] Hope it never reaches "1.6e+06" as on Toolserver :-) (cf. http://toolserver.org/~bryan/stats/replag/). [17:46:01] :D [18:01:30] @notify petan [18:01:30] This user is now online in #huggle. I'll let you know when they show some activity (talk, etc.) [18:31:40] Ryan_Lane, can you explain why puppet installs nova-compute 2012.1.3 on a new instance when 2012.2.3 is available on brewster? [18:31:54] The newer version is on virt0 and nova-precise2… I guess you updated by hand? [18:32:14] if set to folsom, it should use the cloud archive [18:32:26] we shouldn't have anything at all on brewster [18:32:48] maybe the ordering in puppet is screwed up? [18:32:54] ah, right, not brewster, just, available upstream. [18:33:31] oh. I bet it needs an apt-get update after it installs the source [18:34:03] Oh… I see. So the 2012.2.3 versions aren't available on a standard instance, that's something that puppet sets up? [18:34:13] yes [18:34:16] OK, I see it now. [18:34:21] I'll figure out the ordering. Thanks [18:34:25] cool. yw [19:00:03] can someone tell me why I can't shell out to jsub from PHP? [19:00:05] 'sh: 1: /usr/local/bin/jsub: not found' [19:01:33] no coren :( [19:01:36] petan: ^ any idea? [19:01:39] or valhallasw? [19:06:12] do we have safe mode on [19:06:13] ? [19:06:24] is that why I can't call out to jsub? [19:07:56] YuviPanda: No, there's simply no jsub in /usr/local/bin on webserver-01. [19:08:46] well, no perl either? I tried 'perl ' and got back 'sh: 1: perl: not found' [19:08:58] scfc_de: how are we supposed to call out to the Grid Engine from a web server? [19:09:07] scfc_de: Coren|Travel told me earlier today that this is a reccomended pattern [19:09:22] YuviPanda: There's qsub in /usr/bin, or you would have to wait until Coren is back to install jsub (don't know if this is intentional, so I wouldn't want to mess with it without his approval). [19:09:22] hmm, also http://tools.wmflabs.org/suchaserver/info.php tells me that we don't have safe mode on [19:10:07] Probably best to file a bug to install jsub on the webservers so that doesn't get lost. Would you do that, YuviPanda? [19:10:15] bugzilla? [19:10:16] sure [19:10:30] scfc_de: but would that also explain why i got 'sh: 1: perl: not found'? [19:10:34] Yep, component Labs -> tools. [19:10:40] Lemme check. [19:10:45] is that a path? [19:10:50] should I use full path to perl to check? [19:11:35] No, component was for Bugzilla (first item to choose is "Wikimedia Labs" IIRC, then "tools". [19:12:02] yes, I got that :) [19:12:04] asking about perl [19:12:12] Ah, found it: PATH is set to /bin (don't know why), perl is in /usr/bin. So you would have to call "/usr/bin/perl" (or let Coren clean that up -- it doesn't look right). [19:12:13] exec( 'perl /usr/local/bin/jsub -mem 512M ~/code/SuchABot/suchabot/sync.bash ' . escapeshellarg( $repo ) . ' ' . escapeshellarg( $number ) ); [19:12:15] is my php command [19:12:16] aah [19:12:20] right [19:13:18] It doesn't help as long as jsub is not installed :-). You could replicate the whole jsub business in /data/project/$YOURTOOL, but I would wait for Coren to show up if it's not super-urgent. [19:14:06] I'll file a separate bug for PATH. [19:14:22] yeah, the path thing makes sense for PHP [19:16:38] err [19:16:38] perl [19:16:39] i meant [19:16:48] https://bugzilla.wikimedia.org/show_bug.cgi?id=48722 [19:18:02] scfc_de: https://bugzilla.wikimedia.org/show_bug.cgi?id=48723 [19:19:04] Perfect. Perhaps Coren can fix this during the hackathon. [19:19:38] Warning: There is 1 user waiting for shell: Colbeseder (waiting 0 minutes) [19:21:26] scfc_de: :) so, I don't think moving jsub to userspace is goign to work [19:21:30] '/usr/bin/qsub: 1: /usr/bin/qsub: basename: not found' [19:21:37] i'm just going to wait for Coren [19:22:26] thanks for the help, scfc_de [19:23:28] YuviPanda: yw [19:24:31] Yep, you would probably have to set PATH manually as well and lots of other tweaks -- much simpler for Coren to just change two settings :-). [19:25:18] scfc_de: lol that got fixed fast :D [19:25:36] scfc_de: Coren|Travel fixed the missing jsub :) [19:26:01] scfc_de: can you do a 'which jsub' and tell me what the full path is? [19:26:04] or if it is in path/ [19:28:07] YuviPanda: Actually, it was petan :-). You should be able to start it with "/usr/local/bin/jsub", but I'm not sure if the PATH set to "/bin" will not cause problems. Just try it. [19:28:46] ? according to https://bugzilla.wikimedia.org/show_bug.cgi?id=48723#c1 it was Coren [19:28:55] also, thanks :) [19:30:57] scfc_de: lack of PATH causing problems [19:30:58] scfc_de: /usr/bin/qsub: 1: /usr/bin/qsub: basename: not found [19:31:08] (I think?) [19:31:16] Hmmm. He's working under false flag then, cause jsub is owned by petrb.svn :-). [19:31:42] Try calling "PATH=/bin:/usr/bin:/usr/local/bin /usr/local/bin/jsub ...". [19:39:46] AnomieBOT is talented, 0M of memory used for any of its processes. [19:42:55] Well, there comes a time for judgment when sums will be added :-). [19:44:31] bleh, sorry scfc_de got disconnected [19:44:34] not sure if youg ot my messages [19:44:44] setting path myself works [19:45:06] but the jobs themselves don't seem to be getting submitted [19:45:13] I don't see a 'sync.out' on the project home [19:46:41] YuviPanda: What's your tool's name? (local-...) [19:46:47] scfc_de: suchaserver [19:46:53] local-suchaserver, that is [19:47:26] Jobs 89887 and 89916 are in state error. Moment. [19:47:42] error: can't open output file "/sync.out": Permission denied [19:47:53] (qstat -j 89887,89916) [19:48:03] How did you specify your output file? [19:48:18] I didn't [19:48:22] should I explicitly do that? [19:48:24] it defaulted fine... [19:48:31] let me do that [19:49:33] hmm, scfc_de according to doc at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Help I don't have to explicitly do that [19:49:40] > By default, jsub will append stdout and stderr to the files jobname.out and jobname.err in the tool account's home directory, and will not have standard input. If a directory is given for -o or -e, new files jobname.ojobid and jobname.ejobid are created there for each job. [19:50:21] Yes. $HOME seems to be not set for PHP scripts, so when jsub uses it, $HOME/sync.out becomes /sync.out and / is write-protected. Lemme check. [19:51:03] Okay, jsub is broken in this point :-). Could you explicitely set an output file to try? [19:51:19] ah [19:51:30] just doing [19:52:08] scfc_de: okay, that seems to work! [19:57:18] https://bugzilla.wikimedia.org/show_bug.cgi?id=48726 [19:58:10] Warning: There is 1 user waiting for shell: Vladk (waiting 0 minutes) [20:07:23] stupid question of the day: how do I redirect my old web-tool URL on toolserver.org (using a cgi-bin, if that makes any difference) to my new shiny tools.wmflabs.org tool? [20:11:18] russblau: It should be possible with .htaccess, but I'm not sure about the correct syntax ATM. [20:11:35] Warning: There is 1 user waiting for shell: Vladk (waiting 13 minutes) [20:14:57] russblau: "Redirect /~timl/cgi-bin/wikilint http://tools.wmflabs.org/wikilint/cgi-bin/wikilint" works for me for a single CGI, and it also redirects the parameters ("?a=b") as well. [20:18:05] scfc_de: thanks [20:20:09] Does my user get a public_html/? [20:21:41] a930913: No, only tools have web stuff. [20:23:15] scfc_de: So I should make a tool "a930913" for general stuff/testing? [20:25:09] Warning: There is 1 user waiting for shell: Vladk (waiting 27 minutes) [20:25:13] scfc_de? [20:25:13] a930913: You can. Ideally you would make a tool for every separatable tool, so that it can be maintained by more than one developer, but others have created tools (= service groups) for their personal testing as well. [20:25:31] petan: Yes. Did you see my request above? (qacct?) [20:25:48] reading [20:25:50] wasn here [20:25:58] scfc_de: So service groups should be made somewhat liberally? [20:27:42] scfc_de (21:28:07) YuviPanda: Actually, it was petan :-). You should be able to start it with "/usr/local/bin/jsub", but I'm not sure if the PATH set to "/bin" will not cause problems. Just try it. [20:27:48] ?? [20:28:05] a930913: In theory the idea is that you have a task ("bot that does X"), create a service group (tool) for it ("bot-that-does-X"), invite other developers, so that we don't run into Toolserver problems where when people left, their tools rotted. [20:28:09] petan: No. Moment. [20:28:19] petan: Could you run 'for JOBID in 82218 82284 82291 82292 82293 [20:28:19] 82294 82295 82304 82305 82306 82307 82308 82309 82310 82311 82318 [20:28:19] 82319 82320 82323 82325 82326 82327 82335 82336 82337 82338 82339 [20:28:19] 82347 82348 82349 82940 82941 82942 82943; do qacct -j "$JOBID" | [20:28:27] mail -s "$JOBID" scfc; done' on tools-master, please? TIA. [20:28:37] huh. what is that for? :o [20:28:51] Seeing memory consumption of my jobs. [20:29:20] qstat -j doesnt work? [20:30:17] For finished jobs: No. [20:30:22] aha [20:31:18] petrb@tools-login:~$ qacct -j 82295 [20:31:33] "on tools-master". [20:31:33] eh [20:31:38] hold on [20:31:53] BBIAB. [20:34:37] scfc_de sent [20:34:44] error: job id 82325 not found [20:34:45] error: job id 82943 not found [20:34:53] except these 2 [20:38:38] Warning: There is 1 user waiting for shell: Vladk (waiting 40 minutes) [20:38:43] !stats [20:39:17] !stats is http://tools.wmflabs.org/?status [20:39:17] Key was added [20:43:06] "touch: cannot touch `/data/project/bracketbot/.description': Permission denied"? :/ [20:43:24] a930913 you need to switch user [20:43:29] become bracketbot [20:44:23] petan: I tried that too. [20:44:58] who is local-tb-dev [20:45:07] that user own the folder of your bot [20:45:31] Bug? [20:45:41] I don t think [20:45:45] hold on [20:46:05] @labs-user local-tb-dev [20:46:05] That user is not a member of any project [20:46:10] meh [20:46:11] There was a user tb who got an account shortly after me I think. [20:46:31] @labs-user tb [20:46:31] That user is not a member of any project [20:46:36] eh [20:46:40] right [20:46:54] you can probably fix this by doing take on whole folder [20:46:59] or I can fix that if you want [20:48:00] petan: Thanks. Just noticed: The two jobs that didn't have results are still running, so everything's fine :-). [20:48:21] ok [20:49:10] petan: Take as tool or user? [20:49:21] take as tool but that might not work as well [20:49:41] "you must own the containing directory" [20:49:49] Coren|Travel 2 rants: take doesnt correspond to GNU conventions, has no man and no -h nor --help + that thing is creepily insecure [20:50:30] it allows you to overtake any file in any folder as long as a single of subdirectories is owned by you - not just in /data/project o.O [20:50:48] I dont know why but it doesnt make me feel well [20:52:12] Warning: There are 2 users waiting for shell, displaying last 2: Vladk (waiting 54 minutes) Th3w4v3 (waiting 3 minutes) [20:52:37] a930913 should I fix it? [20:52:45] petan: If you can :p [20:52:54] petan: A single *sub*directory? Not the containing directory? [20:53:17] eh, of course... I wanted to say single parent directory :P [20:53:43] but at some even that is... evil [20:53:49] * at some point [20:54:11] How?! :-) As long as you don't chown /home to you again, we're safe :-). [20:54:15] !log tools petrb: changing ownership of /data/project/bracketbot/ to local-bracketbot [20:54:17] Logged the message, Master [20:54:58] a930913 itś fixed [20:55:17] scfc_de I dont know how it is insecure but I will figure out :) [20:55:19] petan: Ta. [20:55:20] BTW, that !log just got displayed on my shell session. Is that intended this way? [20:55:29] at some point yes [20:55:38] Mine too, it was a broadcast. [20:56:12] in past it was created as a tool that help people work together on projects, so they notice the changes even in console / multiple people avoid doing same thing etc... [20:56:21] on tools project it might not be needed... [20:56:44] but on deployment-prep it was quite useful :P I didnt need to switch window to see that someone changed something [20:57:29] it is command log everyone can use it [20:57:37] petrb@tools-login:/data/project$ log changing ownership of /data/project/bracketbot/ to local-bracketbot [20:58:45] oh no, wait [20:58:53] it was never installed project wide... [20:59:03] it is just in my home/bin [20:59:04] :/ [20:59:27] well, I think you can just copy it from /home/petrb/bin if you wanted [21:00:22] -bash: cd: /data/project/rschen7754/: Transport endpoint is not connected [21:00:35] sec [21:00:55] rschen7754 which project? [21:01:00] petan: bots [21:01:13] petan: the obvious symlink/hardlink options are covered, in any case :-) [21:01:13] for some reason my bot just crashed [21:01:13] freaking gluster [21:01:57] valhallasw that is a good point, how do you cover a hardlink? [21:02:07] how can you know which hardlink was the original? [21:02:16] petan: not, but you know the file has >1 link to it [21:02:22] so it denies a take on both [21:02:33] so you can denial-of-service take, but that's not really an issue ;-) [21:03:23] I would be more happy if that thing worked only inside of /data/project/ [21:03:52] @notify Ryan_Lane [21:03:52] I'll let you know when I see Ryan_Lane around here [21:03:57] andrewbogott_afk ping [21:04:01] paravoid can you fix gluster? [21:04:09] @notify andrewbogott_afk [21:04:09] This user is now online in #wikimedia-dev. I'll let you know when they show some activity (talk, etc.) [21:05:05] valhallasw what if the original owner decided to remove it and didnt know there is another hardlink [21:05:19] valhallasw that would let the person overtake and eventually read the secret content [21:05:37] Warning: There are 3 users waiting for shell, displaying last 3: Vladk (waiting 67 minutes) Th3w4v3 (waiting 16 minutes) Dafna (waiting 2 minutes) [21:05:55] !rq Dafna [21:05:56] https://wikitech.wikimedia.org/wiki/Shell_Request/Dafna?action=edit https://wikitech.wikimedia.org/wiki/User_talk:Dafna?action=edit§ion=new&preload=Template:ShellGranted https://wikitech.wikimedia.org/wiki/Special:UserRights/Dafna [21:06:03] petan: yes, that is true [21:06:05] !rq Th3w4v3 [21:06:06] https://wikitech.wikimedia.org/wiki/Shell_Request/Th3w4v3?action=edit https://wikitech.wikimedia.org/wiki/User_talk:Th3w4v3?action=edit§ion=new&preload=Template:ShellGranted https://wikitech.wikimedia.org/wiki/Special:UserRights/Th3w4v3 [21:06:13] good point [21:07:08] !rq Vladk [21:07:08] https://wikitech.wikimedia.org/wiki/Shell_Request/Vladk?action=edit https://wikitech.wikimedia.org/wiki/User_talk:Vladk?action=edit§ion=new&preload=Template:ShellGranted https://wikitech.wikimedia.org/wiki/Special:UserRights/Vladk [21:08:07] valhallasw does it consider file descriptor a regular hard link? [21:09:35] come on people you know you can request that permission to grant shell to others :P yet I am almost only one who deal with these :P [21:10:02] petan: not sure what you mean? /dev/pts is on a different FS, so no hrad links are possible [21:10:08] and I am pretty lazy [21:10:38] file descriptor is in memory, not really on FS [21:11:50] How do I make a helloworld.py on the webservice? I made it print() but it's throwing a 500 [21:12:23] a930913: what are you printing? [21:13:18] Betacommand: Just plaintext. [21:13:28] Does it need to be a properly formed document? [21:13:54] More often than not you need to format the page or at least send the right headers [21:14:41] IE print "Content-Type: text/html" [21:15:03] I need to generate the /whole/ response? [21:15:20] define "whole response" [21:15:26] 200 OK [21:15:29] No [21:16:06] a930913: Have you checked ownership and permissions? The webserver is very pedant with that. [21:16:37] scfc_de: I think it was 755 or something. [21:16:59] But that wouldn't throw a 500, would it? [21:17:26] a930913: I think 500 is no output [21:17:39] a930913: I think it does, but 755 should be okay (if owned by the tool account). You need to ask petan to look in the errors logs on webserver-01. [21:17:42] Betacommand: Internal Server Error. [21:17:48] I.e. a problem with my script. [21:18:29] a930913: more often than not it means something crapped in your program and there was no valid output [21:18:52] Betacommand: Do you have a cgi.py example? [21:19:02] yeah [21:19:45] scfc_de which file / url [21:20:08] or tool name [21:20:32] btw -nagios works again [21:20:35] fixed today [21:21:05] Betacommand: Could you write it to my terminal? [21:21:38] a930913: give me a sec and Ill pastebin it [21:21:56] a930913: what program are you using to edit the file? [21:21:59] Writing is much more 1337 :p [21:22:02] Betacommand: Nano. [21:23:17] a930913: I would recommend using notepad++ on your local computer and then SFTPing it to labs [21:23:31] petan: Whatever URL a930913 is using ... a930913? [21:23:38] Betacommand: Yeah, but keys :p [21:23:45] Betacommand I would recommend vim on localhost ;) [21:23:55] Warning: There is 1 user waiting for shell: Divec (waiting 0 minutes) [21:24:08] http://tools.wmflabs.org/cluestuff/cgi-bin/csv.py [21:24:58] petan: given the mess python can be a standard text editor is recommended. (keyword highlighting, bracket matching, indentation help) [21:25:02] a930913: http://pastebin.com/1msY0q40 [21:25:13] Betacommand that all is included in vim [21:25:28] it can do all of that and much more than notepad++ or anything like that [21:25:36] vim <3 [21:25:42] petan: I doubt it [21:26:24] !vim is best editor on earth [21:26:24] Key was added [21:26:28] !vim [21:26:28] best editor on earth [21:26:38] you see? even wm-bot knows it! so it is true [21:27:07] petan: can vim divide by Zero? [21:27:15] I am even writing c# code in vim these days :P [21:27:30] of course, vim can compute in complex numbers [21:27:31] Thought not :P [21:27:36] Ah, found the error. [21:27:40] It was the shebang. [21:27:50] a930913: ? [21:27:51] shebang - sounds like a porn [21:27:56] I was using /usr/bin/env python [21:28:23] * Betacommand points back to what he said before [21:28:25] Shebang = #! [21:28:42] a930913: more often than not it means something crapped in your program and there was no valid output [21:28:43] I call that a hash :P [21:28:44] Betacommand: The no valid output part? [21:28:56] petan: Yeah, hashbang. [21:29:08] well AfK for about an hour [21:29:37] Betacommand you better go learn some vim :) [21:30:31] http://buruonbrails.blogspot.cz/2009/06/top-5-reasons-why-vim-is-best-text.html [21:30:41] Good for health. [21:30:44] :D :D [21:30:55] think of your health! use vim [21:31:37] mcedit! <3 [21:34:04] addshore are you on train already? XD [21:34:06] lol traveling [21:34:50] Tonight we play the game of how many people on wikipedia can we make hate us :D [21:36:31] andrewbogott gluster is down on bots [21:37:01] petan, home or project? [21:37:01] gluster is up on my instance [21:37:18] andrewbogott on bots-login /data/project [21:37:24] Warning: There are 2 users waiting for shell, displaying last 2: Divec (waiting 13 minutes) Jamesmontalvo3 (waiting 4 minutes) [21:41:03] petan, I think the volume is working and there's a local problem on bots-login. Is it safe to reboot? [21:41:20] I don t know :/ [21:41:38] it is a main login instance for bots project, is there a way to fix this without reboot? [21:41:58] You probably know as much as I do. [21:42:07] If I try to unmount /data/project it tells me it's busy [21:42:15] which precludes remounting [21:42:30] Damianz: If you want hate, I know a server that aliased nano to vim. [21:42:36] That's love [21:43:08] If I want hate, I just find some wikipedia admins and grumpy users :P [21:43:50] What's the permissions I need to read ~/dir/foo.txt from a tool? [21:46:16] openstack-role-dev14 < I wonder when andrewbogott will have enough servers :P [21:48:29] Do I have to a+x on ~? [21:49:07] Assuming it's not in the group - yes [21:50:49] Warning: There are 2 users waiting for shell, displaying last 2: Divec (waiting 27 minutes) Jamesmontalvo3 (waiting 17 minutes) [21:53:49] petan's pussy freaks me out everytime I go to his talk page [21:54:10] i got a pussy on my page? [21:54:21] Stripy pussy [21:57:16] Damianz: Because I'm testing initial server setup, I keep making new ones and killing the old ones. So, as you can see, I've had 14 failed attempts at this task :( [21:57:47] andrewbogott: Thinking I might patch notifications to enable ignoring specific projects ;) [21:58:09] Yeah, or I should make my own project when I'm going to be like that [22:01:01] petan, any idea how to remount that volume? [22:01:07] damn... I really came around here in 2011? time flies [22:03:08] anyone know how many public IPs we have left? andrewbogott? [22:03:24] Not off hand. Why do you ask? [22:04:10] Warning: There are 2 users waiting for shell, displaying last 2: Divec (waiting 40 minutes) Jamesmontalvo3 (waiting 31 minutes) [22:04:21] I remember someone saying a couple of weeks ago that we're running out or something and I just thought that I'm not really using the mailman project atm (which has a public IP) - so that can be reallocated if it's needed [22:07:25] I don't think we're literally running out -- if it's easier for you to keep it you should keep it. [22:07:27] Thanks for offering though [22:09:14] Hmmm ips... ripe has even stopped people using Pi space now :( [22:17:44] Warning: There are 3 users waiting for shell, displaying last 3: Divec (waiting 54 minutes) Jamesmontalvo3 (waiting 44 minutes) Martijn Hoekstra (waiting 7 minutes) [23:26:31] Anyone in here a software developer as the day job? [23:44:33] Damianz: That is what I have done and probably will do. [23:45:19] Is getting paid to write code as fun as writing it for free? [23:47:28] It depends on what you're writing I guess, and how long you've been writing it for. [23:49:12] Considering switching from enterprise engineering to software development on embeded systems doing crazy things with networking and audio, so I get to play with kernels all day... but I've never actually worked in a software department heh [23:50:06] Damianz: Is this about your pi? [23:50:45] Pi? [23:51:42] Damianz: Raspi. [23:51:44] is gluster still down on bots? [23:52:22] well, i got in on bots-gs the one that's not supposed to be used :P [23:52:46] Nah - my Pi is for a side project to work.... though we do use Pi's for testing... and arm chips... in the control surfaces at least, got a bunch of powerpc on the DSP boards though, which is fun [23:59:59] Pretty stuff. http://tools.wmflabs.org/cluestuff/cgi-bin/charts.py