[07:05:07] goood morning [07:06:05] I see some errors for wmf auto restart and mcelog on some analytics nodes, I am wondering if anything changed [07:11:10] ah mcelog on the nodes with 4.19 seems not coming up [07:13:14] if versioncmp($::kernelversion, '4.12') >= 0 { [07:13:14] systemd::mask { 'mcelog.service': } [07:13:15] } [07:13:17] perfect [07:15:11] but then in base::standard_packages we have [07:15:11] if os_version('debian == jessie') or os_version('debian == stretch') { [07:15:14] require_package('mcelog') [07:15:17] base::service_auto_restart { 'mcelog': } [07:15:19] } [07:15:20] and I have 4.19 on stretch [07:25:49] there is also a problem with the Zayo transport between eqiad and codfw [07:26:14] from the zayo correspondence I see that an issue was resolved 18h ago for the same link, but then nothing [07:30:10] optics looks good afaics from both sides [07:32:42] and from their last update there is a suggestion to disable/enable if the service is down, but this looks more another issue [07:36:34] cdanis: I saw all now your last email to zayo full of emojis, 10+ [07:36:43] s/all/only [07:37:23] * elukey sends an email to Zayo [07:46:04] done [08:16:16] thx! [09:13:26] elukey: in relaion to the auto restarti mad an update yesterday to switch them all from using cron to using systemd::timer::job. initially there was an issue with jessie not recognising the timespec interval however that should be resolved now. i can take a look if you let me know a host thats having issues [09:18:17] jbond42: that was related to mcelog, which doesn't work with Linux 4.19 [09:18:32] ahh ok thx [09:18:51] and a few analytics servers are now using Stretch with 4.19, so the mcelog auto restart was failing there and the failure was simply no longer silenced [09:19:00] https://gerrit.wikimedia.org/r/c/operations/puppet/+/636560/ will fix it [09:21:17] ack thanks [09:21:57] FYI all ripe81 starts today https://ripe81.ripe.net/programme/meeting-plan/ [09:39:02] thx! [11:34:29] someone is familiar with Apache config? trying to figure out something easy I think [14:14:25] <_joe_> XioNoX: that's a trick question right? [14:14:35] <_joe_> anyone familiar with apache knows it's never easy [14:15:13] haha yeah, that's what I figured out [17:25:24] andrewbogott: can we merge my change related to restricted_to/from? compiler likes it and yesterday's problem is gone, right [17:34:02] mutante: sure, you can merge that now if you want [17:34:04] or I can [17:34:54] andrewbogott: ok, thanks. doing it [17:36:03] merged on master [17:36:59] all: this should remove noise from compiler output on all compiles. those lines at the top about "restricted_to"/"restricted_from" not being found will be gone [17:37:05] confirming it now [17:38:54] yep, looking good [17:39:47] ppl, my nickname was banned, I don't know why, do you know where/how I can try to solve this problem? I can't join most of the channels (because I-m not registered). That is why I-m asking here. [17:41:07] dsaez_: are you familiar with nickserv? make sure you are identified with it by password [17:41:42] mutante, I know nickserv, but I can't change my nick to dsaez, because I recieve this error: [17:41:54] 435 dsaez #wikimedia-office Cannot change nickname while banned/quieted on channel [17:42:40] dsaez_: ah, ok. try leaving these channels, change your nick, then rejoin them [17:42:41] dsaez_: try doing this first: /msg nickserv identify dsaez [17:42:41] aha! [17:42:43] ah [17:42:45] there you go [17:43:34] a lot of channels don't want you to change nick while you are in them. especially large channels like ##debian etc. the fix is to /part them, identify and /join again [17:44:36] they don't mean "banned" as in "cant come back"..it's a bit strange language [17:52:33] https://rachelbythebay.com/w/2020/10/26/num/ <- sounds cumin-applicable and not too difficult [17:54:02] haha nice idea [17:54:03] volans: ^^ [17:55:42] * volans reading [18:05:14] haha didn't realize you followed her blog bblack :) [18:12:14] bblack, paravoid: https://gerrit.wikimedia.org/r/c/operations/software/cumin/+/636729 [18:13:02] volans: maybe add the link to the commit msg to give credit where credit is due :) [18:13:04] ahah [18:13:51] {done} [18:15:10] nice :) [18:15:29] nice indeed! [18:15:37] thanks for the responsiveness, that's awesome :) [18:15:38] I didn't apply the formatting thing, we rarely go over 1k [18:15:45] and sounded a bit too much [18:15:54] but happy to do if you're strong about it [18:16:00] *you feel strongly about it [18:16:18] yeah my bikeshed thoughts are: [18:16:59] 1) Yes, we could add the comma for the 1k+ case, maybe that's a nice additional circuitbreaker when affecting most of our environment, to avoid the copypaste reflex [18:17:31] 2) For big lists, currently the total is only printed above the list: [18:17:33] 1576 hosts will be targeted: [18:17:37] acmechief2001.codfw.wmnet, [...] [18:17:47] which means it almost always scrolls off for big cases [18:17:50] yes, I didn't move it/repeated it on purpose ;) [18:18:14] so yeah, you could take the view that keeping it up top is the same kind of impediment as formatting, kinda [18:18:44] there's a few different ways you could go with both questions, but the way it is now makes for a very simple and minimal change, and isn't bad :) [18:19:49] now just wait 6~12 months for a release :-P [18:20:02] the upstream maintainer is a slacker [18:20:07] if you wanted to make the whole team curse you for all time, you could make the input string be the english version [18:20:10] please do actually print it below the list too [18:20:20] "One thousand four hundred sixty eight" [18:20:21] a speed bump is a good idea, but that doesn't mean you need to intentionally make it difficult ;) [18:21:13] ^ if you take rzl's position, a good trade is to then add the formatting, too, so as to preempty a copypaste reflex for the few very-large cases [18:21:17] rzl: if your window terminal is 50 lines height it doesn't scroll :-P [18:21:50] "works on my machine" [18:22:00] one of my original ideas back in the day was to randomly pick beteen a list of words like 'yes', 'go', 'proceed', et.... [18:22:07] I thought you would be all mad at me [18:22:26] You Don't Need To Intentionally Make It Difficult [18:22:28] only in english? pfft [18:22:30] you could also do a low-end cutoff and just ask/accept yes like before if it's less than some cutoff lke 25 [18:22:38] write it in languages where it preferrably has non a-z characters [18:22:39] but that's less elegant to implelemt :) [18:23:13] convert it to roman numbers [18:23:21] that's so hard to implement :D [18:25:48] https://pypi.org/project/roman/ [18:25:59] yeah :) [18:26:26] take it to the next level with https://web.eecs.umich.edu/~imarkov/Perligata.html ;) [18:27:23] ... and this is why nobody likes Perl anymore :) [18:28:27] two years later, onboarding a new SRE: "what's cumin?" "oh, we used to use that for orchestration but nobody knows how to use it anymore. here, let me show you how we just ssh in a for-loop, it's quicker" [18:29:08] btw: https://techblog.wikimedia.org/2020/10/26/impact-of-using-http-connection-pooling-for-php-applications-at-scale/ is live [18:32:52] \o/ [18:35:56] PHP is a "loved one", heh :) [18:37:20] "i love php" - About 3,210,000,000 results (0.57 seconds) [18:37:39] "i hate php" - About 307,000,000 results (0.54 seconds) [18:38:15] <_joe_> in two lines - what's wrong with the internet. [18:39:24] hehe, there used to an IRC bot for this, !googlefight (https://en.wikipedia.org/wiki/Googlefight) [18:39:32] 🎉 [18:39:39] _joe_: have you seen Slack's defense of PHP? [18:39:52] he links to it in that post :D [18:39:59] <_joe_> cdanis: RTFA [18:41:33] Slack's defense of s/PHP/Hack/g [18:42:19] :D [20:44:24] I'm looking to publish my gpg key by following the instructions at https://wikitech.wikimedia.org/wiki/PGP_Keys, and I'm getting at `keyserver send failed: General error`. The issue may be with the hkps://hkps.pool.sks-keyservers.net keyserver; has anybody else had trouble with that? [20:46:02] yeah, they are mega unreliable nowadays :( [20:46:16] lots of endemic problems there, but the easiest thing to do is to use hkp://keyserver.ubuntu.com [20:50:37] Cool, that worked :) [20:58:55] I'll ACK the logstash codfw alerts because they have been unhandled CRIT with disabled notifications for > 18 days and it's a lot of noise [21:02:08] linking to https://phabricator.wikimedia.org/T263970 because that's the closest i see [22:30:55] mutante: thanks for pointing out the logstash issue. looks like an unassigned shard was preventing an old index from reaching desired replica count on the new cluster. Reduced replica count to 1 so the check would clear up. [22:33:25] shdubsh: ah! cool, thank you