[00:01:04] [bz] (NEW - created by: Antoine "hashar" Musso, priority: Normal - normal) [Bug 36748] [OPS] syslog::server (in test) unusuable - https://bugzilla.wikimedia.org/show_bug.cgi?id=36748 [00:28:53] This is not good, is it? v [00:28:54] http://ganglia.wmflabs.org/latest/graph.php?h=cvn-app1&r=week&g=mem_report&c=cvn [00:29:01] I suspect one of the cvn bots has a memory leak :( [00:29:20] when various vms dead earlier this week, it was saved by the bell [00:36:02] Krinkle: yeah, definitely looks like it [00:36:31] should be easy enough to figure out which one [00:47:02] !log deployment-prep Trying to import enwiki database on the lucene search deployment-searchidx01 : sudo -u lsearch /a/search/lucene.jobs.sh import-db enwiki [00:47:05] Logged the message, Master [00:47:15] xyzram: seems I have imported enwiki in the lucene search indexer \O/ [00:48:55] xyzram: https://wikitech.wikimedia.org/wiki/Deployment/Search :-] [00:59:49] hashar: Great. [01:12:18] I need to fill bugs [01:17:55] xyzram: I will fill bug later on :-] [01:20:09] [bz] (NEW - created by: Krinkle, priority: Lowest - enhancement) [Bug 34250] [beta project] Set up search (tracking) - https://bugzilla.wikimedia.org/show_bug.cgi?id=34250 [01:22:20] ? [01:22:30] That was 11 months ago [01:22:51] its status is NEW but it isn't a newly created bug [01:38:12] Did Bastion's SSH host key change? [01:39:01] FastLizard4: no [01:39:06] is it showing as different? [01:39:17] Yes [01:39:29] Or, at least it was on my laptop [01:39:31] Shows the same for me. [01:39:38] Trying it from my server doesn't [01:39:47] hm [01:40:03] I had to restart nova network twice [01:40:14] it's a possibility that the NAT isn't working properly right noew [01:40:19] since it's reapplying the rules [01:40:26] It's also rejecting my public key from my laptop, but working fine from my server [01:40:41] very likely due to NAT rules [01:40:43] one minute [01:42:10] And in my sillyness I already answered the PuTTY prompt [01:42:15] So I can't tell you what it showed as the new host key :P [01:42:16] hm [01:42:18] it lets me in [01:42:24] ah [01:42:32] I think I know the issue [01:42:35] Ahh, now it's showing host key changed again [01:42:37] its key didn't change [01:42:43] Ryan_Lane: I had an account on the older Wikitech, and it seems to exist on the new, but what do I use for "Token" when logging in? [01:42:57] Jasper_Deng: If you didn't set up two-factor auth, nothing [01:43:08] your old wikitech password will not work [01:43:23] I imported content, not users [01:43:47] Ryan_Lane: Now it works [01:43:49] * FastLizard4 shrugs [01:44:03] yeah, it was due to nova network being restarted [01:44:04] hhm it said that my password was wrong, implying that my acccount was imported [01:44:18] when that occurs the NAT rules temporarily go away [01:44:30] Jasper_Deng: do you have a new wikitech account? [01:44:36] it turns out I don't [01:46:24] you can create a new one [01:46:39] does labs now have an automatic query killer? [01:46:40] the "incorrect password" error was misleading in this sense [01:46:54] because my screen session just had a "Terminated" [01:48:41] legoktm: on which instance was it running? [01:48:46] bots-bnr1 [01:49:08] i was doing a dump scan that might have used a lot of memory/cpu [01:49:19] but i had been doing them before for the past few days with no issues [01:49:58] as far as I know it doesn't [01:50:15] and I don't see an OOM in the dmesg [01:51:08] hmm ill ask petan then since thats a no root instance [01:51:36] Well, this day had shitty productivity. I'm calling it. [01:51:38] * Coren waves. [02:09:32] Ryan_Lane: oh i forgot, can you create a mysql user for me on bots-bsql or does petan have to do it? [02:09:45] petan I believe [03:06:58] !log account-creation-assistance Running apt-get upgrade on all server [03:06:58] Logged the message, Master [03:11:10] !log account-creation-assistance All servers requesting reboots, will provide [03:11:11] Logged the message, Master [03:51:03] !log bots addshore: legoktm user created on bsql01 [03:51:05] Logged the message, Master [03:51:09] :> [03:51:49] by the looks of things everyone with an account on bots already has a user on bsql01 [03:51:59] but I am not sure what it sets the passwords too O_o [04:15:31] @notify petan [04:15:31] This user is now online in #huggle so I will let you know when they show some activity (talk etc) [04:15:34] o.o [05:33:04] addsleeep: [05:33:05] mysql> create database persondata; [05:33:05] ERROR 1044 (42000): Access denied for user 'legoktm'@'%' to database 'persondata' [06:26:04] mysql> CREATE TABLE persondata ( page VARCHAR(255), qid INT, param VARCHAR(255), value VARCHAR(255)); [06:26:04] Query OK, 0 rows affected (14.75 sec) [06:26:13] Maybe its just me, or did that take too long? [09:03:10] boxofjuice hi [09:03:15] hi [09:03:18] what's up [09:03:26] well i got most things figured out [09:03:27] but [09:03:35] :o [09:03:38] is there a way i can limit how much memory a program can use? [09:03:42] I killed ur query yesterday [09:03:52] boxofjuice yes if you control source :P [09:03:59] just watch how much memory are you using :D [09:04:06] heh.... [09:04:12] i'm running a dump scan for someone [09:04:13] for example http://bots.wmflabs.org/~wm-bot/db/systemdata.htm [09:04:17] aha [09:04:24] okay... [09:04:29] across all revisions on all wmf projects [09:04:58] and apparently my script is rather greedy :/ [09:05:20] i'm not really sure how to control it [09:05:36] i just pulled it out of the pywikipedia repository (that should say enough) [09:05:37] ok man bash [09:05:40] ulimit [09:05:59] ulimit is a built in command in the shell. It will restrict any processes from exceeding the set criteria. You have the following options, memory included [09:06:11] core file size (blocks) 0 [09:06:12] data seg size (kbytes) unlimited [09:06:12] file size (blocks) unlimited [09:06:13] max locked memory (kbytes) unlimited [09:06:15] max memory size (kbytes) unlimited [09:06:16] open files 1024 [09:06:17] but is it going to kill my script when it goes over, or what? [09:06:18] pipe size (512 bytes) 8 [09:06:19] stack size (kbytes) 8192 [09:06:20] cpu time (seconds) unlimited [09:06:20] max user processes 1024 [09:06:22] virtual memory (kbytes) unlimited [09:06:22] yes [09:06:25] hmmm [09:06:34] i dont want to kill it, just a way to limit it. [09:06:41] im guessing that has to be done in the program itself. [09:06:49] okay in that case you need to do it from source code [09:06:51] yes [09:07:08] maybe that process has a parametere [09:07:13] ? [09:07:15] try --help [09:07:33] lol no its not that advanced [09:07:40] in that case no [09:07:48] ill look into how python does this stuff... [09:10:20] k [09:10:23] !tu [09:10:23] ssh -f user@bastion.wmflabs.org -L :server: -N Example for sftp "ssh chewbacca@bastion.wmflabs.org -L 6000:bots-1:22 -N" will open bots-1:22 as localhost:6000 [10:14:53] [bz] (NEW - created by: Peter Bena, priority: Unprioritized - major) [Bug 45768] console doesn't show proper errors - https://bugzilla.wikimedia.org/show_bug.cgi?id=45768 [10:15:01] Ryan_Lane ping [10:15:06] Add rule [10:15:06] ? [10:15:06] Jump to: navigation, search [10:15:07] Failed to add rule. -- wtf? [10:15:13] can you tell my what it means? [10:15:21] what what means? [10:15:29] ah [10:15:33] failed to add rule? [10:15:34] it would be so cool if console told you more than just "Error happened" [10:15:52] why it fails to add a rule? what is wrong? [10:16:10] are you trying to add a source group and a normal rule? [10:16:29] you need to tell me what you are doing so that I can figure out why it isn't working [10:16:34] I clicked add rule [10:16:44] https://wikitech.wikimedia.org/w/index.php?title=Special:NovaSecurityGroup&showmsg=setfilter [10:16:45] right, with what values set? [10:16:59] port range 3306 - 3306 [10:17:01] protocol tcp [10:17:06] cidr 10.4.0.0/21 [10:17:11] source group default [10:17:29] "Note: Individual rules and group rules are mutually exclusive. Choose one or the other, not both." [10:18:06] so by filling this I don't assign the rule to specific group? [10:18:18] source groups don't do what you think they do [10:18:28] "Instances in added security groups will be allowed ingress of all ports and protocols." [10:19:16] if instance a and b are in security group web, and you have a rule that uses source group web, those two instances will be able to talk to each other on all port [10:19:17] *ports [10:19:50] we allow default, which allows all instances to talk to each other on all ports within a project [10:20:12] ah [10:20:17] normal security group rules define how things outside of a project can talk with instances in a project [10:20:25] anyway it would be nice to have error message that tell you details [10:20:28] it's honestly a confusing concept and it's awkward [10:20:40] in this case the passed through message is just as worthless as ours ;) [10:20:50] because I've been incredibly confused about this [10:21:31] really we should have error checking before it ever sends it to nova [10:21:52] if mutually exclusive rules are added together the interface should just deny and give a proper error message [10:50:55] !log wikidata-dev wikidata-dev-9: Moved devrepo's debug log from /srv/logs/devrepo.log to /var/log/wikidata-devrepo-debug.log. Added cronjob that deletes it once a day. [10:51:00] Logged the message, Master [11:13:35] Is there anyone with puppet merging super powers here right now? [11:21:00] paravoid for example? [11:41:17] ... 36% done... [12:16:20] petan: ping me if your around :) [12:56:12] addshore here [12:56:39] is there some sort of nice list with all of the parameters in for mariadb? :P [12:57:09] as far as I cant tell some system vars a different to regular mysql :/ [13:07:08] nice xD https://kb.askmonty.org/en/sample-mycnf-files/ "Place holder for sample my.cnf files" [13:28:30] * addshore found it [13:52:17] !log bots addshore: optimize bsql01 my.cnf to utilise more server resources [13:52:19] Logged the message, Master [13:52:45] use more server resources? [13:53:27] yes :D [13:53:52] its a big server but the config didnt reflect that [14:08:50] is bastion down? [14:15:51] JasonDC, appears to be [14:15:54] JasonDC, try bastion2 or bastion3 [14:16:01] the keys are broken again :/ [14:20:35] lol [14:20:55] bastion2 works [14:21:18] yep :) [14:21:22] !bastion [14:21:23] http://en.wikipedia.org/wiki/Bastion_host; lab's specific bastion host is: bastion.wmflabs.org which should resolve to 208.80.153.194; see !access [14:21:55] hehe PROBLEM SSH is now: CRITICAL on bastion1.pmtpa.wmflabs 10.4.0.54 output: Server answer: [14:21:57] !bastion del [14:21:57] Successfully removed bastion [14:22:41] !bastion is http://en.wikipedia.org/wiki/Bastion_host; lab's specific bastion host is: bastion.wmflabs.org (208.80.153.194) bastion2.wmflabs.org (208.80.153.202) see !access [14:22:41] Key was added [14:22:50] mhm [14:24:01] petan: and bastion3 ;p [14:24:36] no lol it's gluster this time [14:24:41] I am still on bastion1 [14:24:48] it's not possible to open /public/keys [14:25:14] Ryan_Lane ^^ [14:25:30] labs should have more admins :/ [14:25:40] so bad they only recruit paid employees [14:25:55] this is something what suck on wikimedia most [14:26:02] it is so wmf-centric [14:26:07] xD [14:26:24] * addshore is also on bastion1 still [14:26:30] seriously... it's not open source project anymore, it's owned by a company [14:27:21] petan: these are not mutually exclusive... [14:27:31] I know but it suck anyway [14:27:44] Linux is mostly improved by employees these days [14:27:46] it's not community based project [14:27:59] gribeco linux is completely developed by paid developers these days [14:28:02] that suck as well [14:28:23] it's almost impossible for regular volunteer to contribute to kernel [14:28:41] just as it's more and more impossible for volunteers to contribute to wikimedia project [14:28:59] that's why company based open source projects are bad [14:29:07] community is better and friendlier [14:29:13] and more open :> [14:30:40] :D [14:31:09] that's why I spend more time these days working on my own open source project rather than wikimedia :P [14:31:17] despite I am only member, it's still more open :D [14:31:41] petan, its great, as far as I can tell I increased caches and buffers a bit on bsql01 (one I actually increased 10 fold) and now it is using less ram O_o [14:31:43] project(s) [14:31:59] addshore did you restart sql server after that? [14:32:05] yep [14:32:17] addshore did you check if people were using it? :P [14:32:27] *didnt restart, reloaded* [14:32:28] because it's a very bad idea to kill server when it's being used [14:32:30] ah [14:32:38] I think you need to restart it for these to take effect [14:32:49] you can't change ram size while it's running [14:32:52] hmm, but after the reload I saw a decrease in mem usage :P [14:33:02] mhm... who knows [14:33:29] I bet mysql is using garbage collectors just as other software does which kind of suck [14:33:49] these are things that make it simple for devs but produced binaries mostly suck more [14:34:04] all my programs written in c# eat 200 times more memory than my programs written in c++ [14:34:23] like typical windows application in c++ eats around 2 mb of ram [14:34:30] typical c# 400 mb [14:35:51] right i shall be back later when WD rolls out ;p have a nice day! [14:36:02] addshore how's your query? [14:36:04] how you check it? [14:36:11] I don't see any progress bar :D [14:36:11] it was only at 30% so I killed it [14:36:22] I left it running in a screen [14:36:22] ok how you check it's in 30% [14:36:23] petan, define typical [14:36:32] and it shows me a % of how far through it is [14:36:43] let's say "simple" [14:36:43] almost tempted to go and run it again now I have altered my.cnf [14:36:56] eg application that doesn't need more resources than these to display its windows [14:37:08] like bunch of simple forms [14:37:16] as someone with 10 years of Windows development both in C and C#, I disagree [14:37:29] 10 years of development in c# [14:37:39] I am unsure if c# exist for so long? but maybe it does [14:37:50] anyway from my experience it eats more memory than c++ programs [14:37:57] not only windows application even terminal apps [14:38:02] * applications [14:38:18] if you write 2 programs doing exactly the same thing in both languages, the difference will be nowhere near that [14:38:51] of course, hello world in C# will eat prbably 30 megs due to runtime [14:39:22] but other than that, it's going to be a sane difference [14:39:29] MaxSem that's my point... two application doing the same will have very different need for resources [14:39:39] well, not 200 times more, ofc [14:39:46] but not 1:20; [14:39:51] 0 [14:40:37] but for example very simple tcp server I made in c# is eating about 100mb, the almost same application made in c++ eats about 10mb... I don't know why [14:40:47] wm-bot is eating 300mb+ [14:40:52] it's just a simple irc bot [14:40:55] of course, it's always possible to use a pathological behaviour in GC to achieve 400 megs working set size, but you'll have to try really hard [14:41:05] http://bots.wmflabs.org/~wm-bot/db/systemdata.htm [14:41:14] 306272kb [14:41:24] I can't understand what needs to much memory in that bot [14:41:37] very little number of strings and some classes [14:41:44] basically everything is flushed to disk [14:42:07] dotTrace? [14:42:11] even if it wasn't - all datafiles that are loaded have 3.7mb on disk [14:42:22] even if I loaded them all to memory it should eat 3.7mb of ram [14:42:39] MaxSem no idea what is dottrance [14:42:41] * trace [14:42:52] but I really do miss "delete" [14:42:53] good typo [14:42:55] in c# [14:43:14] it's a profiler [14:43:29] ok I have had one profiler verfy good - ants [14:43:36] I used it on one of my application and discovered nothing [14:43:46] 80% of allocated memory was used by .net internals [14:43:50] I used it mostly for perf stuff, but it also has memory profiling [14:45:07] I don't even think it's technically possible for GC to handle memory better than programmer - it can either handle it same good as programmer, or better in case programmer suck [14:45:22] it's not possible [14:45:30] what is not possible [14:45:52] for GC to manage memory better than human [14:45:55] HOWEVER [14:46:07] to err is human [14:46:16] More to the point, it's not possible for a FC to handle memory better than a human /can/. [14:46:20] GC* [14:46:20] in that case I don't see many advantages in that, other than that it's less error prone [14:46:22] thus, GC is there to prevent memory leakks [14:46:38] yes I know [14:46:59] but it should be possible to opt-out from using it, otherwise... I am afraid c# had a very bad future [14:47:06] * has [14:47:30] perfect code in c++ will be always faster and less memory expensive than code using GC [14:47:41] even if that code was perfect as well [14:47:42] ever wrote a multithreaded program with boost::shared_ptr?:P [14:47:48] nope :P [14:48:01] oooh, what a lucky man [14:48:02] I have made a lot of c++ programs with memory leaks though [14:48:37] but still I believe c# should have possibility to disable GC and let human control memory [14:48:45] MaxSem: If you inflict boost on yourself, you deserve the result. :-) [14:49:14] Coren, the result was me moving to WMF:0 [14:49:31] YES I DESERVE IT [14:49:35] :-) [14:50:01] if used sanely, boost isn't that bad [14:50:07] I like C++ very much, but I use it with the "C with classes" idiom. That generally means staying the fuck away from std. :-) [14:50:22] boost is right out. [14:50:36] and C++0x actually makes a lot of STL/Boost code sane [14:51:23] Meh. People overuse the OO idiom anyways. It only maps to a very limited problem space in practice. [14:51:37] Classes as code encapsulation / interface definition is Good. [14:51:53] C++'s idea of classes is bad [14:52:00] I know:) [14:52:03] MaxSem: iff you try to use it for OO [14:52:34] As syntactic / organization sugar over C, they're very good. [14:53:38] as soon as you need a container class, you either have to store pointers and manage memory yourself, or use that automagical machine gun for your feet called assignment constructor [14:54:25] MaxSem: True; which is why I have my own container idiom handy for reuse when I need it. [14:54:51] using the NIH pattern?:P [14:55:01] am i the only one who cannot login to the bastion right now? [14:55:05] no [14:55:47] Heh. No, using the "there's no reason to have a complicated solution when a simple one will do" pattern. :-) [14:56:09] Silke_WMDE_: Hm. Broken for me too. I shouldn't have closed that ssh I already have opened. [14:56:25] Coren, do you already have powers to fix it? [14:56:37] MaxSem: Nope. [14:56:54] * MaxSem bites the ops team [14:56:57] Coren: So did I when switching to wifi a few minutes ago. m) [15:00:08] MaxSem: I have two generic classes I use for my code; a pointer-to-something and a reference-counted-pointer-to-something, both with threadsafe variants, an a discardable allocation pool class. Between those five, I solved 99.9% of all my needs. [15:00:33] sigh [15:00:39] classic C++ [15:00:49] Why the sigh? [15:01:14] there's a standard library, but it does so little and with so many restrictions that nobody uses it [15:02:11] MaxSem: That's an unavoidable consequence of most people trying to use C++ for the wrong thing, not of the language design. If you need Java, use Java. :-) [15:03:14] and if I need C's performance AND C#/Java's qability to handle huge piles of code? [15:03:50] that was a typical task at my previous work [15:04:20] several epic architects failed at it, some were fired [15:05:18] You need to sacrficice something then. I'd also like a computer with ten times the performance of my current desktop and I have a budget of $100. :-) [15:05:45] FAIL [15:06:53] Coren: I call that powerful computer "WMF labs" ;) [15:07:27] (And I'd dispute the putative quality of C#/Java to "handle huge piles of code" any better than any other tool. Coder discipline cannot be compensated for by the language.) [15:07:43] boxofjuice, and gluster be upon us!:P [15:07:46] boxofjuice: I'm fairly confident the WMF had a budget slightly bigger than mine. :-) [15:09:13] Coren, the problem is that in C/C++ you tend to struggle a lot with language's features while the more scriptish a language is, the more you can dedicate yourself to actually doing the shit done. but everything come with its own price [15:09:29] hmmm [15:09:47] is bastion broken? [15:09:54] aude: yup :/ [15:09:58] MaxSem: Of course. This is why perl remains a weapon of choice in my arsenal. [15:09:58] * aude reads above... [15:09:59] ok [15:10:17] eww, I hate perl [15:10:25] MaxSem / Coren: since you guys were discussing memory earlier, would you like to help me figure out how to make a python dump parse script i'm using not use as much memory as possible? [15:10:52] boxofjuice: My python-fu is weak. [15:10:56] my python is very noobish, unfortunately [15:11:04] :( [15:12:05] MaxSem: Perl is really good at what it does. For the typical use case, I'm usually done with implementation before someone else finished loading their IDE. :-) [15:12:35] it's too schizophrenic for me [15:13:49] MaxSem: The biggest hurdle is that it's /highly/ idiomatic. [15:14:26] @#Y$@#$Y#@$Y@#$^#$^#@$^@# [15:14:41] a typical perl one-liner [15:15:09] MaxSem: Also a very /mean/ one. IOCCC has nothing on perl. [15:15:11] sorry, should've included a couple of ~s [15:15:22] MaxSem: But it /is/ possible to write very readable and maintainable perl. [15:16:15] MaxSem: The biggest difficulty stems from the fact that reading perl requires the same level of still writing perl did; something that is generally not true with other languages. [15:18:13] while(<>) { stuff if m/crap/; } is highly expressive and understandable once you know the idioms. [15:20:18] And you gotta love a language where you can say "open somefile or die;" :-) [15:22:35] !log bots petrb: updated buffer size of sql server, was too big [15:22:38] Logged the message, Master [15:51:08] Coren: MaxSem fwiw, the Socialtext wiki is in Perl, and it is (or at least was) one of the better code bases I've worked in. [15:52:03] some people even write production code in Erlang, so what? [15:53:11] MaxSem: It's just illustrative that code quality doesn't depend on the language but on the coders. :-) [15:56:02] Ryan_Lane: Gimme the bits so I can fix bastion when it is broken! [16:10:06] Coren: Will you go to the Hackathon? [16:10:29] Darkdadaah: Ayup [16:10:55] Ah good, I'll probably go this time :) [16:13:33] labs down? [16:13:38] *bastion down? [16:14:32] ssh_exchange_identification: Connection closed by remote host [16:14:39] bastion1 is [16:15:10] JasonDC: so which one can I use? [16:15:11] oh, and keys are having issues: Permission denied (publickey). [16:15:16] bastion2 or 3 [16:19:15] JasonDC: so you were describing the issue when ssh'ing from bastion2/3 to instances? [16:19:24] ya [16:21:36] ok I'll just wait... [16:24:36] Coren, you can say the same thing in php... [16:25:37] Platonides: So you can. [16:37:13] Coren: Will it be long/difficult to add user databases to Tools? [16:38:29] Darkdadaah: Probably not, though round one will probably be a transitory method. [16:38:37] Darkdadaah: (Mysql on a project instance) [16:40:06] Did you settle for a mean of identification? [16:41:21] Darkdadaah: That's what we'll be doing today, time allowing. [16:45:28] I'll just wait until it's ready then. Thanks for your work. [16:56:31] Damianz I sorted out the self creation of db's but I need you to insert one line to your script [16:56:48] grant select, execute on system.* to user; [16:56:58] where user is a new user you are making [16:57:29] I have that in my current script, but if we are going to switch to yours :P [17:02:59] JasonDC: still having key issues? can i get some details? [17:04:03] oh, its not forwarding the key cause its bastion2 [17:04:18] forgot to set that back up :P [17:06:56] JasonDC: so it works now? [17:07:10] * jeremyb_ tries to summon the strength to ignore petan [17:09:08] i guess one option for the bastion issue is to force it to really be just a bastion. most people will never even be able to get a shell there. and then you can apply for some exception to the no shells rule if you've proven that you understand what not to run [17:12:00] ugh, how is it noon [17:15:56] Can't ssh now. [17:18:58] petan: I'll take a look later - not pushed it for review yet [17:28:46] Damianz: more details? [17:29:12] errr [17:29:23] that was for Darkdadaah [17:29:26] [bz] (NEW - created by: Krinkle, priority: High - enhancement) [Bug 34250] [beta project] Set up search (tracking) - https://bugzilla.wikimedia.org/show_bug.cgi?id=34250 [17:29:28] but he didn't stick around... [17:30:12] jeremyb_: 010101010010101010110101011010001010 [17:30:51] uhuh [17:31:44] [bz] (NEW - created by: Antoine "hashar" Musso, priority: Unprioritized - enhancement) [Bug 45784] search: mediawiki configuration for lucene in beta - https://bugzilla.wikimedia.org/show_bug.cgi?id=45784 [17:34:26] BTW, bastion2 works for me. [17:38:25] Coren: yes. "can't ssh now" is quite broad [17:38:36] [bz] (NEW - created by: Antoine "hashar" Musso, priority: Unprioritized - enhancement) [Bug 45786] search: craft a fake InitialiseSettings.php for lucene-search-2 - https://bugzilla.wikimedia.org/show_bug.cgi?id=45786 [17:39:20] jeremyb_: bastion, otoh, is quite down. [17:39:34] yeah, but i don't think that was expected to change [17:39:40] jeremyb_: sshd closes the socket before any exchange takes place. [17:39:49] Coren: errr... [17:40:12] marc@mordor:~/Downloads$ ssh bastion [17:40:12] ssh_exchange_identification: Connection closed by remote host [17:40:26] Coren: https://rt.wikimedia.org/Ticket/Display.html?id=4631 [17:41:33] jeremyb_: So, you're trying to tell me you're way ahead of me? :-) [17:41:39] kinda [17:41:45] :-) [17:42:32] * Coren doesn't pay much attention to RT tickets since he doesn't have the bits to do anything about them. [17:43:09] Coren: is there some plan/policy for who will have root in tools? i guess most stuff will go in puppet but who fixes stuff day to day and who does stuff like adding or removing users from a shared subproject. (where subproject is like a stable toolserver project. i think. don't remember the TS terms) [17:43:49] Coren! [17:44:18] jeremyb_: The actual tool maintainers will have the rights to manage their userlists and such. I don't expect many roots outside the general ops, certainly no systematic ones. [17:44:31] systematic? [17:44:50] and also is there some todo list? do you have your own BZ component? [17:44:53] I mean no plan to grant root to tools maintainers as a matter of fact. [17:45:02] oh, ok [17:45:16] but you could grant root in the same way TS already grants root [17:45:37] I don't have my own BZ component yet, and the todo is http://www.mediawiki.org/wiki/Wikimedia_Labs/Tool_Labs/TODO [17:46:23] are you on my watchlist? huh, no [17:46:25] jeremyb_: Possibly, but there are different implications given the isolation from PII. There's no reason to not have volunteer roots a priori, but there needs to be some discussion first. [17:46:48] ok, watched :) [17:46:51] ok [17:47:28] I know legal wants to be kept in the loop there. [17:50:09] Coren: so when are we getting you down to NY? we have 2 events. this weekend. also in april: http://fcx2013.org/fcxabout.html [17:50:25] errr, i think there's a stray period there [17:51:45] jeremyb_: I'm not going to have the availability for anything this soon. Dunno about April just yet. :-) [17:52:26] Coren: yeah, i figured not [17:53:05] I know I'll be in ams though [17:54:00] ah, right [17:54:13] that sounds so long from now [17:54:47] I'm also going to try for Wikimania, but I know I fell out of Erik's budget so it has to be on my dime. [17:55:06] so... I see other people are getting ssh_exchange_identification [17:55:07] or i guess scholarship [17:55:10] ? [17:55:17] Vacation9: use an alternate bastion [17:55:28] there are alternate bastions [17:55:33] bastion1? [17:55:41] $ ssh bastion3.wmflabs.org echo foo [17:55:41] foo [17:55:46] Vacation9: No, that's the not-alternate. [17:55:54] ah 3, okay [17:55:55] Vacation9: Try 2 or 3 [17:56:07] okay, that works [17:56:12] thanks [17:58:59] Hm. I need an instance with mo' local storage. [18:05:00] seems the bastion is dead for me :( [18:05:13] $ ssh bastion.wmflabs.org [18:05:14] ssh_exchange_identification: Connection closed by remote host [18:05:15] :( [18:07:58] hashar: Use bastion2 or bastion3 [18:14:24] Coren: thx [18:14:39] Coren: I was expecting bastion.wmflabs.org to points to some kind of failover IP :-] [18:15:07] hashar: Not as far as I know. It could be RR but that brings its own set of problems. [18:17:18] no, can't be RR. that just won't work [18:19:36] I meant a LVS :-] [18:19:39] or something [18:20:25] jeremyb_: It has issues, but why 'just won't work'? [18:25:21] hashar: what do you do in the topic? [18:25:39] giftpflanze: no idea :-) [18:25:54] ^^ [18:26:12] i can't login into labs :( [18:26:21] use bastion2 [18:26:28] or 3. [18:29:40] someone please change the channel notice to say use bastion2 or bastion3 [18:29:55] if you're having problems [18:37:24] !log deployment-prep rebooting search indexer [18:37:27] Logged the message, Master [18:42:29] Coren: because the hsots would have different ssh keys [18:43:06] jeremyb_: There's no reason why they should; and you'd want identical bastions to sport the same keys in a setup like this regardless of the method. [18:43:18] yeah... idk [18:43:23] * jeremyb_ is conflicted [18:43:57] They are Bastion. Their cryptological distinctiveness will be assimilated. :-) [18:44:08] Vacation9: errr, the topic's editable by all [18:44:10] YAY BORG JOKE! [18:46:28] each bastion instance should just auto restart if something goes wrong :/ [18:48:00] are their keys... ? [18:48:01] bastion2 -> RSA key fingerprint is 45:e3:61:b6:e4:0a:69:fd:95:31:89:2c:0b:db:47:3b. [18:48:01] bastion3 -> RSA key fingerprint is 36:f5:d0:3e:90:7a:3e:4e:25:e7:54:3b:7b:28:8b:31. [19:02:16] !log deployment-prep refreshed wikiversions.cdb [19:02:20] Logged the message, Master [19:03:59] Platonides: idk. i'm just not doing anything with an unknown host until i can verify them [19:06:01] so, something is causing processes to hand on bastin [19:06:03] *bastion [19:06:10] Vacation9 everyone can change topic here [19:06:35] Ryan_Lane yes I believe it's gluster or driver of fs - when I accessed /public/keys the ls hung [19:07:06] Ryan_Lane for some reason even bastion2 and 3 didn't work for a while [19:07:07] Ryan_Lane: to "hand"? [19:07:10] oh, hang? [19:07:15] :P [19:07:17] indeed [19:07:25] *hang [19:07:39] if that was the case then every single instance would hang [19:08:29] well, anyway, what's the state of bastion1 now? [19:08:36] hung [19:08:38] I'm going to reboot it [19:08:54] that alone makes me disbelieve this is gluster related [19:09:13] (/public/keys is nfs, btw) [19:10:31] anyway, as I've mentioned a million times, if gluster is having issues password auth isn't going to help you [19:10:31] at all [19:10:37] because it'll hang accessing your home directory [19:11:27] so, looking at labstore1, glusterfs processes are getting killed every 30 minutes or so [19:11:37] and glusterd brings them back up [19:16:15] so, glusterfs is what runs nfs [19:16:28] hi [19:16:41] does anybody know why I can't connect with bastion? [19:16:45] yes [19:16:53] use bastion2 or bastion3 [19:18:02] * jeremyb_ is slow [19:18:10] actually, you can use bastion now [19:18:11] it's back up [19:21:13] Ryan_Lane if gluster can only affect nothing or everything howcome sometimes it happen that it's broken only on one instance? like when we recovered wm-bot [19:21:18] it was broken on bots-1 only [19:21:59] yes, it can be broken if a server is down [19:22:16] btw did you find out what it was this time? [19:22:27] it may be gluster's nfs services [19:22:45] you can't share gluster mounts read-only [19:22:49] when I was on bastion when it happened I tried to access /public/keys and it didn't work so clearly it was problem accessing the disk [19:22:51] not OOM etc [19:22:52] so we have to use gluster's nfs [19:23:13] ok is it possible to mirror that to local disk or something as backup? [19:23:17] not really [19:23:20] why not [19:23:27] you could just create a script to sync it [19:23:29] because it still needs to be in the ssh config path [19:23:36] which means ssh is going to access it [19:23:44] sure, you can sync it with some folder which is going to be in ssh config [19:23:54] nfs and local [19:23:59] local would sync with nfs [19:24:02] and ssh would look into local [19:24:05] it's actually possible that this was related to ldap and not gluster/nfs [19:24:12] the system wasn't dead [19:24:20] I know [19:24:21] I was tehre [19:24:23] I think it was actually having timeouts on ldap [19:24:44] if it was ldap I probably wouldn't be recognized by system [19:24:52] when I was on instance where ldap died [19:25:07] system started telling me "who are you?" and instead of my name I saw id [19:25:08] you're assuming a condition where ldap is completely dead [19:25:27] mhm, anyway 2 questions [19:25:32] opendj has a memory leak [19:25:37] why logs in /var/log on bastion are readable by root only? [19:25:49] if it gets bad enough the ldap server takes ages to return queries [19:25:53] petan: why wouldn't they be? [19:26:08] why do you think it was problem with LDAP when it clearly wasn't possible to open /public/keys [19:26:23] do you think it wasn't possible to open that folder because of LDAP [19:26:24] I think it's a combination of things [19:26:34] is gluster nfs depending on ldap? [19:26:43] yes, of course it is [19:26:57] nfs actually does look up ids [19:27:06] they should be readable by non-root because they contain useful debugin information which isn't secret? [19:27:15] what debug info? [19:27:20] for example kernel.log is not readable [19:27:23] syslog is [19:27:33] same data which go to kernel.log go to syslog [19:27:36] it's totally pointless [19:27:36] use dmesg [19:27:41] yes I know [19:27:48] it's possible to read it so why it's not readable? [19:27:54] doesn't make sense to me [19:28:02] because no one has put in any effort to change that? [19:28:08] mhm [19:28:22] if I actually could read the logs maybe I would figure out what is wrong when I was there [19:28:40] that's my point of logs not being readable [19:28:40] on a system you can't log into? [19:28:45] !log deployment-prep regenerating lucene prefixes [19:28:46] I was there [19:28:47] Logged the message, Master [19:28:54] it broke when I was working on bots project [19:29:06] I was in shell in time it didn't work [19:32:12] Ryan_Lane: How do I go making an instance with moar local storage? [19:32:30] Coren: use a larger instance type? :) [19:32:54] Ryan_Lane: Heh. I'm loathe to allocate all of that ram when all I want is more diskspace. [19:34:18] this is for a local db or something? [19:35:07] Ryan_Lane: Aye. Unless you also planned our work on the extension (ready when you are, btw) to include setting up a labs-wide db [19:36:05] I'm, err, not hot at the idea of putting a DB on gluster. :-) [19:39:47] <^demon> Coren: s/a DB/anything/ [19:40:13] Coren: it doesn't work for that anyway [19:40:16] ^demon: Erm, well, that point has alteady been beaten well into the ground. :-) [19:40:47] Aaaaw. I can't resist. "It doesn't work for storing files, either." [19:41:45] Coren mysql doesn't work on gluster and regarding RAM you need it if you want to create db server [19:41:53] <^demon> Coren: Maybe if we were using MyISAM, because then we obviously wouldn't care about our data ;-) [19:41:56] because labs IO is rather slow - caches will make it usable [19:42:17] Ryan_Lane: You're stuck, dude. Even if you manage to get Gluster to six nines of uptime, and faster than SSD, you'll still have peeps harping on it. :-) [19:42:21] Coren we have a one sql server on bots project with 16gb of ram and pretty much utilized all of it [19:42:43] petan: That bad, huh? Seems like a dreadful waste. [19:43:05] Coren I have been playing with sql servers on labs over a year [19:43:23] they never worked - minimum was 2gb of ram to be even useable [19:43:27] for max 1-2 bots [19:43:39] now we have 16gb box and that's first which seems to be sufficient [19:43:58] I don't know if situation is better but IO was really very slow [19:44:07] Coren: use the large instance with lots of ram right now [19:44:18] we'll move the databases off of it later and delete the instance [19:44:21] we have *lots* of ram [19:44:40] Coren don't forget to change the my.cnf and give it more ram for buffer pool [19:44:56] from my experience it's best to give it 80% of all [19:45:03] and leave the rest to system [19:45:25] Hm. Given to innodb_buffer_pool_size or left to the OS buffers? [19:45:34] buffer_pool [19:45:58] OS buffers only store data, buffer_pool is used for some operations too [19:46:14] which normaly would require some temporary spaces to be created [19:46:33] Hm. Increase log_file_size too to reduce IO then. [19:46:35] at least it's recommended by mysql [19:46:57] Coren I don't know - I don't see a reason how it would [19:47:02] these are binary logs [19:47:10] the only reason it uses all of the ram is because it is set to :P [19:47:11] Reduces the number of flushes from the buffer_pool [19:47:12] like arch log in oracle or that [19:47:25] ok but these need to be written to disk as well [19:47:26] there is no point in having unused ram [19:47:40] addshore ofc [19:47:49] addshore but if we didn't use all of ram it would be horribly slow [19:48:25] btw addshore when are you starting your huge bot [19:48:42] In about 2 hours [19:48:52] When Wikidata goes live. [19:48:52] Coren: If you make a nice cnf show me? :D [19:50:55] !log tools created tools-db instance for a (temporary) mysql install [19:50:56] Logged the message, Master [19:56:02] petan: whats boxofjuice said :P [19:56:20] addshore what will change [19:56:25] wikidata are already live [19:56:30] only on 4 wikis [19:56:33] petan: on the rest of the wikis. [19:56:46] mhm [19:56:58] xD [19:57:58] Coren are you installing maria or mysql? [19:58:06] in mysql there are things to fix [19:58:10] weird defaults [19:58:17] maria is ok [19:59:55] !log deployment-prep rebooting apache33 : gluster mount is corrupted [19:59:57] Logged the message, Master [20:15:09] petan: maria. [20:17:23] petan: Well, that's my objective. I'm sure we have a package around for it, right Ryan_Lane ? :-) [20:17:37] I have no clue [20:17:42] I don't deal with databases [20:17:44] I don't know - I used the official maria package [20:17:53] but binasher might know [20:17:58] Ryan_Lane: do we have any way to raise the amount of memory on an instance ? [20:17:59] there is special maria in production [20:18:02] or should I create a new one? [20:18:03] I seriously doubt you'd want to use the production packages [20:18:08] I failed to understanded its purpose though [20:18:12] hashar: only by creating a new one [20:18:17] Ryan_Lane: :( thx [20:18:18] yes I doubt too Ryan_Lane :D [20:18:33] I just mean there is some special wmf package [20:18:37] Hello world; [20:18:39] Ryan_Lane: Why so. Much included black ops magick? [20:18:42] if that was Coren's question [20:18:46] Coren yes [20:19:01] Coren since I heard that it's compiled with debugging symbols I consider it very black [20:19:44] I don't know if what mark said is true or not, but from my own experience all my code that I compile with max optimization - which requires to disable all debugging - are significantly faster [20:19:45] petan: Meh. Slightly bigger file, a few extra microseconds of startup. That's very light grey magic. :-) [20:19:49] Re passwords vs keys (and insecure auth) pllleeeesssseee can we have forced https for login on wikipedia :P [20:19:55] petan: I promise you what mark said is correct [20:20:04] especially since asher said the exact same thing [20:20:08] then there is something weird with my code [20:20:22] Damianz: yes, that went in yesterday [20:20:25] but it doesn't like debugging :P [20:20:28] oreally? [20:20:28] Damianz: but was yanked due to a bug [20:20:28] :D [20:20:30] petan: That depends. You can -g3 without reducing your -O [20:20:33] :( [20:20:37] Bugs are bad [20:20:47] Coren: there was a thread about why you don't want to use our packages ;) [20:20:57] Anyway, we don't have bugs - it's all features :D [20:21:08] Coren from what I read on stack overflow when you enable debugging some optimizations are disabled, even when you enabled then using parameters [20:21:11] !log deployment-prep creating deployment-searchidx02 wich has 16GB of RAM. deployment-searchidx01 does not have enough RAM :( [20:21:13] Logged the message, Master [20:21:14] they are not employed on binary [20:21:36] petan: I've never seen that in practice. I've seen some debug info become less /useful/ unless you reduce optimization, but no more. [20:21:39] You know if you tell the compiler to optimise to the fullest the binary it produces is not usable? :P [20:21:46] well, the bug was that when people unselected the "keep me logged in with https" it still kept them logged in with https [20:21:52] I consider that a feature and not a bug, but that's just me :) [20:21:55] Ryan_Lane: That's acceptable :D [20:22:11] petan, as I said in that thread, I disagree [20:22:27] present a benchmark showing the speed differences [20:22:36] I would very much like to agree with you, but my code just run slower in debug mode :/ [20:23:00] is that code available somewhere? [20:23:08] maybe some of it... [20:23:18] also are we stricly speaking of c++? [20:23:28] because more languages support debugging symbols [20:23:53] * Damianz runs away for food before sumana declares more wikilove [20:24:05] sumana is here? [20:24:14] no... [20:24:18] petan, is it on Windows? [20:24:23] Platonides yes [20:24:25] I remember MSVC having different libcs [20:24:26] no... [20:24:35] the debug one could be quite slower [20:24:40] Platonides the performance issues were on windows [20:25:02] but were you linking with a debug libc? [20:25:15] at some point it might be becaue of runtime debugger which was likely attaching to process somehow [20:25:16] who knoes [20:25:19] *knows [20:25:26] windows are black box to me [20:25:38] ok. I need coffee and food. dealing with gluster related issues in the morning hasn't exactly done good things with my mood, especially with lack of food/coffee [20:25:42] Platonides yes i think debug configuration does that [20:25:45] !log tools tools-db installed mariadb-server from official repo [20:25:46] Logged the message, Master [20:26:06] btw petan, can you add me to bots project? [20:26:13] since these problems I just have a habit of building all release binaries with debugging disabled [20:26:21] ofc [20:26:30] * Damianz gives Ryan_Lane a top hat and a unicorn [20:26:33] day improved by 400% [20:26:37] xD [20:26:38] :D [20:26:50] !log bots added Platonides [20:26:51] Logged the message, Master [20:27:17] I allways knew Platonides was a bot [20:27:22] yes [20:27:25] same here [20:27:28] :) [20:27:45] actually, what tempted me was that stored procedure [20:27:46] now I need to have a quick defrag, err. shower [20:28:08] I need to have a medium length shower after quick food [20:28:23] Platonides try it out :P [20:28:32] it should work, should you find a bug let me know [20:28:57] platonides@bots-1:~$ mysql -h bots-bsql01 [20:28:57] ERROR 1045 (28000): Access denied for user 'platonides'@'i-000000a9.pmtpa.wmflabs' (using password: NO) [20:29:12] Platonides sync script is cronned hourly :/ [20:29:17] I will run it by hand... [20:29:18] hourly? pussy [20:29:19] Platonides: You're clearly not cool enough. :-) [20:29:33] why can't it use ident auth? [20:29:44] You know what would be fucking awesome? All this shit in salt and a little hook on a project member add in labsconsole or even better, nova [20:29:51] Damianz: +1 [20:30:08] petan: sync script? :P [20:30:48] Platonides done [20:31:14] addshore there is sync script that creates mysql user accounts for all members of bots project [20:31:22] xD [20:31:27] addshore it check for all users which are in project but not in sql and create them [20:31:34] Platonides does it work now? [20:31:37] I see :) [20:32:16] * petan wants to know badly [20:32:19] :o [20:32:47] Platonides you are likely first user who is using these scripts let me knwo :P [20:32:47] petan: where is that cron? :O [20:32:50] lol [20:32:57] on bots-1 I think, it's temporary though [20:33:07] I will replace it with Damianz [20:33:12] he will do it like a monkey [20:34:26] Sure, after the like 120 cbng requests I have to review :P [20:34:45] xD [20:35:13] is there any way to alter the thresholds that nagios reports at? [20:35:20] meh I hate how Platonides never respond on irc :/ [20:35:23] @notify Platonides [20:35:23] This user is now online in #huggle so I will let you know when they show some activity (talk etc) [20:35:48] Tickle him [20:35:56] addshore: Sort-of, not really [20:36:04] btw Platonides what did you mean by ident auth [20:36:05] Within a class - yes. For a specific host - no. [20:36:16] ident auth... go read the pgsql docs [20:36:36] * addshore thinks the general proc count threshold should be shunted up to 300 or something similar [20:37:45] See this is where abstraction becomes an issue... because proc count should be in base, but the proc count also depends on what you're doing... so I might just fudge off the seperation and add some ifs [20:38:28] Damianz if I wanted to read docs I wouldn't ask [20:38:29] seems that placing the CREATE DATABASE before the GRANT made it secure [20:38:52] By varying definitions of secure [20:38:54] Platonides why it shouldn't be secure? :o [20:39:00] petan, I was trying to breek your script [20:39:06] Personally I'd of just been a bastard and forced everyone to _ [20:39:07] Platonides I thought that [20:39:12] Platonides you are known hacker [20:39:17] call system.create_db("*"); [20:39:23] but I am known for making unhackable software [20:39:24] :D [20:39:43] *snort* [20:39:44] Platonides yes that probably crash [20:39:53] of course I knew that XD [20:39:56] wm-bot crashes more than java [20:39:56] :D [20:39:56] :PP [20:40:00] Damianz lOL [20:40:14] wm-bot crashes less than ur calculator Damianz [20:40:39] petan, if it ran the GRANT with that, it would allow unlimited privileges [20:40:40] last time it was down was when labs crashed [20:40:48] but it fails for the create database [20:40:56] Platonides yes it would indeed [20:41:04] that might need a fix [20:42:43] Platonides you actually managed to create "database_name":P [20:43:06] petan, I think that one existed [20:43:12] nah [20:43:18] you have full grants for that anyway [20:43:52] actually using mysql procedure isn't really best idea [20:44:01] lawl [20:44:04] I should have create shell script or something [20:44:17] setuid script gives better validation [20:44:26] yes [20:44:36] actually this mysql procedure works just as that [20:44:44] it has setuid as well [20:44:48] or whatever they call it [20:44:52] "definer" [20:45:12] I think you should use the prepared statements with parameters [20:46:11] Platonides this actually still has security issues [20:46:22] when you create a db, then you drop it, the privileges remains [20:46:41] so if you created a db you know which is about to be created by someone and dropped it [20:46:44] I noticed that [20:46:48] you would be able to use it [20:46:59] but I have no idea how to fix that [20:47:07] only way is not to grant all privileges [20:47:16] which would suck [20:47:24] I want sql server to be self maintained as possible [20:47:39] on other hand bots project was never very restrictive, it's all based on trust between its users [20:48:23] there is a growing very restrictive project called tool-labs which will be used for bots as well [20:48:41] so if people want high security they can use that in future and I suppose they will [20:49:04] Wait, what? 'tool-labs'? [20:49:09] yes [20:49:10] Or did you mean 'tools'? [20:49:18] tools-labs I guess [20:49:22] whatever is it called :P [20:49:28] You taking about mine? [20:49:32] yes [20:49:33] :D [20:49:37] Heh. It's just 'tools' [20:49:42] aha ok [20:49:58] the biggest problem of your project from my view [20:50:03] is that it's Coren-centric [20:50:08] o_O? [20:50:12] one big sysadmin to rule them all XD [20:50:38] Actually, my objective is to make sure that the WMF pays me for not having to do a single thing. :-) [20:50:44] while bots project is community maintained, we can have plenty of admins who can be available any time... tools will have same problem as current bastions [20:50:59] when they aren't working we need to wait for Ryan_Lane whole day [20:51:39] ideally we can allow root there to volunteers who sign agreements [20:51:51] Ryan_Lane: Yes, that'd be great. [20:51:54] Ryan_Lane what kind of agreement? :) [20:52:03] NDA, likely [20:52:06] Coren, that's fits my philoosophy, too [20:52:07] petan: NDA, I would expect, given you'd have access to PII [20:52:08] I already identified myself to foundation and yet I feel pretty untrusted [20:52:12] and be approved by ops, like toolserver currently [20:52:25] huh? [20:52:33] I have access to toolserver and didn't need to sign anything [20:52:40] petan: as root? [20:52:41] Coren, PII? [20:52:44] but still I have no problems with signing of stuff [20:52:49] Ryan_Lane nope [20:52:50] Personally Identifying Information. [20:52:53] don't tell me that the bastion is running on a Pentium II xD [20:52:53] I'm talking about full root access and projectadmin on the project [20:53:11] bastion, like other instances, is running on a cisco with 12 cores and 185GB ram ;) [20:53:34] Ryan_Lane I think that my ID card copy I sent to wmf time ago was kind of PII :D [20:53:48] yeah, identifying isn't the same as signing an NDA ;) [20:53:51] Ryan_Lane: If you make me sign an NDA I want root on prod too :P [20:53:52] * Damianz rofls [20:53:56] heh [20:54:01] LOL [20:54:11] * Platonides smacks his lips, 185GB... [20:54:13] well, an end-goal is to have volunteers with root on production again [20:54:22] that's a very long term goal ;) [20:54:22] but don't tell that to firefox developers [20:54:52] Ryan_Lane, I don't think any volunteer got shell on production for many years [20:54:54] Platonides why? [20:54:57] petan: I think you're very much confusing me with a BOFH. My point isn't to have a restricted environment I rule with an iron fist, it's to have an environment that's 99% self-serve but where nobody can break someone else's tools. :-) [20:54:58] Platonides oh lol [20:55:02] Platonides nvm :D [20:55:06] xDD [20:55:13] Platonides: yeah, that's the case [20:55:38] one of my reasons for starting labs was to give us an environment for building trust in volunteers [20:55:49] Coren yes I know but not being able to break stuff mean being restricted to do some stuff [20:56:04] which might be wanted by some... for example reading of logs etc [20:56:09] Coren: Just call it PaaS and put on your swag [20:56:12] petan: You know the idiom: your right to swing your fists ends just as it reaches my nose. :-) [20:56:15] today I really badly wanted to look into bastion logs and it wasn't so easy [20:56:26] logstash? [20:56:28] done [20:56:44] Coren no i don't know too complicated english for me :P [20:56:46] no one on ops cares about log access [20:56:47] Platonides: 185gb is meh, just ordering some blades with 256gb of ram :D [20:56:56] Coren but I will think of meaning of that [20:56:59] they all have root [20:57:20] which leaves it for me to do, and I'm busy enough doing other things [20:57:32] make a proposal, then push in some gerrit changes [20:57:40] Coren oh lol [20:57:50] Coren I missed a letter which made the sentences confusing [20:57:51] if you rely on me to do everything, you'll be waiting ;) [20:57:52] now I get it [20:58:00] * sentence [20:59:40] Now to figure out a good interim way of creating users and DBs on that thing. [21:00:30] 'that thing' = tools? [21:00:42] Platonides: Yep. [21:00:51] I guess those are the project users [21:01:05] I think I'll just do the easies. Since it's project local, I'll just autocreate a .my.cnf in homes. [21:03:18] Coren: Salt! [21:03:54] Damianz: Heh. Once we have a wikitech interface, sure. Right now, it's ad-hoc [21:04:09] I like the ability of using unix accounts for authentication [21:04:23] hmm coming to the ams hackathon? we'll make a wikitech interface ;) [21:04:23] sadly, that only works when you're connecting from localhost [21:04:35] Damianz: we were going to do a sprint for this [21:04:38] Damianz: soonish [21:04:41] Damianz: want to join? [21:04:44] Ryan_Lane: :D [21:05:02] I suck ass at php but I'm up for hacking up the python bit to make it support our needs [21:06:47] I think the first part of this will be all php [21:06:49] * Damianz wonders if he actually sucks ass at php or if he just hates it too much :D [21:08:45] Tbf I'm still half in the mind of 'fuck it' wikitech is an interface, put everything in nova... but then we've got hacks and hard to get upstream in etc issues so yay lets just abstract and make pretteh... well functional then pretteh [21:09:41] Damianz: Yeah, I'll be in Amsterdam [21:09:56] Damianz: But I plan to be further along than that by that time. :-) [21:10:23] It's only like 2 months away! Though yeah :P [21:11:15] * Damianz waves at Sharihareswara [21:11:34] hi Damianz [21:11:38] :) [21:16:12] Change on 12mediawiki a page Wikimedia Labs/Tool Labs/Notepad was modified, changed by MPelletier (WMF) link https://www.mediawiki.org/w/index.php?diff=656090 edit summary: [+22] [21:20:43] Coren: I should likely ask this more often, but is there anything blocking your work right now? [21:21:50] Ryan_Lane: Not on the critical path; since I'm mostly working out the kinks in the infrastructure, ad-hoc solutions work fine. Eventually, I'll create a tools puppet module and will really need to be able to +2 on it otherwise I'm going to be slow as molasses. :-) [21:22:22] Coren: I have a feeling the ops folks are going to want to do a number of reviews before you get +2 [21:22:26] Ryan_Lane: atm, I'm doing mostly manual installs and config and taking notes for the future pupper classes. :-) [21:22:39] but everyone is expecting you'll need it eventually ;) [21:23:06] that all sounds good, though. just wanted to make sure you're not blocked waiting on me [21:23:13] Ryan_Lane: It's mostly a matter of getting used to your idioms. You're doing things The Wrong Way(tm) so I'll need to get sued to it. :-) [21:23:27] :D [21:23:29] Ryan_Lane: Now that I have a local DB, you're not blocking me. Go work on gluster. :-) [21:23:33] heh [21:23:50] "work on gluster" from my perspective is "work on replacing gluster" [21:23:58] \o/ [21:24:16] I hate it. i've been working around issues with it for over a year and I'm done [21:24:19] Ryan's easy to get stuff off, just ask him, repeatedly, about 50 times :P [21:24:19] that may actually be a good idea [21:24:36] we might be faster by rewriting it from scratch :P [21:24:40] if were a physical thing, I'd love to office space it [21:24:41] Ryan_Lane: It's only been broken for like 2 years :P and it's only gonna get worse with more users [21:24:57] Platonides: have you tried write speed with an instance that has the virtio driver? [21:25:39] if you get "Operation not permitted" when running "mii-tool eth0", then it's using virtio [21:26:12] I'm really looking forward to having a network node per compute node so we get uber speed across the entire network [21:28:34] Ryan_Lane, will the instance/network go down if I tried a write speed benchmark? :) [21:30:03] Platonides: it should be fine [21:30:11] I tested this recently and virtio was *much* faster [21:30:17] like 3x faster [21:30:59] 3.4 MB/s [21:32:36] Platonides: that instance is using virtio? [21:33:10] !search html2wiki [21:33:11] http://bots.wmflabs.org/~wm-bot/searchlog/index.php?action=search&channel=%23wikimedia-labs [21:33:26] wow, blind hit:) [21:33:37] Platonides: here's the tests I ran: http://etherpad.wmflabs.org/pad/p/gluster-tests [21:34:20] hm. that last test was copy/pasted wrong [21:40:43] Ryan_Lane, yes: $ mii-tool eth0 [21:40:43] SIOCGMIIPHY on 'eth0' failed: Operation not permitted [21:40:49] * Ryan_Lane nods [21:40:54] I'm running some tests now [21:41:00] on a non-virtio and a virtio [21:41:02] to see the different [21:41:05] *difference [21:47:10] -_- [21:47:28] seems glusterd is hung on labstore1-4 again [21:47:33] this is likely why the nfs services had issues [21:48:11] so. full. of. hate. [21:48:46] maybe that's for running tests? [21:48:59] I'm betting it's been hung for a bit [21:49:11] glusterd is pretty instable [21:49:25] is it better now? [21:49:38] it's actually the only amazingly unstable service. [21:49:38] nope [21:49:48] it doesn't actually affect filesystem access [21:50:06] what happens when it hungs, then? [21:50:08] but it can affect the NFS service it if crashes and restarts [21:50:13] no more changes [21:50:28] and it can affect nfs [21:50:32] * Ryan_Lane grumbles [21:52:05] * Damianz wonders if the wmf has enough money to fund Ryan_Lane's therapy required after a few years of gluster [21:52:12] probably not [21:52:33] if we only had a single volume gluster would probably be totally fine [22:05:11] looks like the problems are over [22:05:15] thankfully [22:06:21] * Damianz pats Vacation9 [22:10:32] I didn't fix it [22:10:35] Thank the server gods [22:19:35] Amount of RAM in use (in MBs): 1,027,072 [22:19:39] Holy poop. [22:20:01] That's almost a TB of ram! [22:22:18] ikr [22:22:41] I wish I had that much :I [22:23:03] and almost 16TB storage [22:25:24] I got more than that at home. But the ram, woah. And I like my desktop because it has 32G. :-) [22:27:28] time to slowly deploy my bot :P [23:02:16] hehe, usage of bnr1 creeping up :D [23:14:13] what task are you doing now addshore [23:15:32] interwiki [23:15:40] still [23:15:59] im like 1/4 of the way through on en [23:16:04] and now rolling out to all other langs [23:18:05] my internet is being gay :/ [23:19:10] * DanielK_WMDE wonders what that means [23:19:34] how_long_has_it_been_runnign [23:19:39] running* [23:19:45] and you're only 1/4 of the way through [23:20:04] why don't you use all the 1 TB ram on labs xD [23:21:44] * Damianz thinks addshore is a whole level of crazy [23:30:08] Vacation9: like a week [23:30:19] editing pretty fast :P [23:39:42] [bz] (ASSIGNED - created by: Antoine "hashar" Musso, priority: Normal - enhancement) [Bug 45786] search: craft a fake InitialiseSettings.php for lucene-search-2 - https://bugzilla.wikimedia.org/show_bug.cgi?id=45786 [23:42:54] [bz] (NEW - created by: Antoine "hashar" Musso, priority: Unprioritized - enhancement) [Bug 45814] setup OAI repository on beta - https://bugzilla.wikimedia.org/show_bug.cgi?id=45814 [23:43:06] [bz] (NEW - created by: Krinkle, priority: High - enhancement) [Bug 34250] [beta project] Set up search (tracking) - https://bugzilla.wikimedia.org/show_bug.cgi?id=34250