[07:42:37] hello folks! [07:42:49] I am going to merge https://gerrit.wikimedia.org/r/c/operations/puppet/+/1059899 and test a commit in the private puppet repo [07:42:58] if you need to do any work on it, ping me first please :) [07:53:06] tested a commit and it worked nicely [07:54:09] and now running manually dump_cloud_ip_ranges.service [07:56:12] it doesn't have anything to commit, will wait for the next run [07:56:21] all green from my side, if you encounter any issue with puppet private lemme know [08:09:15] o/ I have a wdqs host (wdqs1023.eqiad.wmnet) that is misbehaving and needs to be depooled (it's causing maxlag errors on wikidata) but I can't access it via ssh, could someone maybe powercycle it so that I can try to depool it? [08:11:30] dcausse: sure on it [08:11:40] elukey: thanks! <3 [08:12:22] depooled as starter [08:14:20] dcausse: attached to the management console, and I see a ton of prints related to wdqs-categories [08:14:30] do you need details about it, or just a reboot? [08:14:51] elukey: just a reboot I think [08:15:10] okok, the prints are all about hewiki btw [08:16:28] rebooting now [08:20:10] elukey: when you depooled, did it print the pools it was attached to? [08:20:41] this host just got re-imaged and is not supposed to serve traffic yet [08:21:20] dcausse: ah okok I see, I tried to confctl get and it doesn't show me anything [08:21:24] I thought it was in service [08:21:30] anyway, ready for you now [08:21:35] elukey: thanks! [08:25:30] ah I think I know why... some monitoring queries are polluting the metrics we use to detect if a host is live... [12:11:59] hello on-callers [12:12:51] as FYI I am going to stop debmonitor-server on debmonitor1003 for ~5 mins, to test if we can use debmonitor2003 instead (on a test node with a modified /etc/hosts) [12:13:15] so far it seems that only one django/debmonitor can be writing to the db, so I want to be sure about it [12:13:37] there may be a couple of failures here and there, nothing dramatic since the next runs of debmonitor clients will solve the issue [12:17:23] done, sigh the test failed, back to square one [12:18:08] :( [14:22:03] for the on-callers - I am going to upgrade debmonitor-server on debmonitor1003, new release version that fixes some issues and that will allow a more precise client to be rolled out [14:22:20] elukey: ok :) [14:28:44] all good! [14:28:55] ping me in case you see anything weird [14:29:08] tomorrow I'll rollout the new version of the debmonitor-client as well