[10:44:15] *drumroll* tomorrow's pad is up :)
[10:46:38] thx!
[10:48:21] from which tree is victorops logo?
[10:49:41] good question, I just checked the KB but can't find anything
[10:51:13] I'll ask our rep at the sync up meeting :)
[11:00:51] I'm looking into adding VO to on/off boarding docs, starting from https://office.wikimedia.org/wiki/SRE/On(Off)boarding
[11:01:34] which checklist(s) you reckon would be best ? i.e. officewiki one or wikitech ?
[11:02:12] godog: https://office.wikimedia.org/wiki/Technology/Onboarding/Checklists/Template
[11:04:28] sounds good, thanks volans
[15:33:48] godog: I assume new hires (specifically Janis starting tomorrow) should only get onboarded to VO, entirely skipping the old stack, right?
[15:34:21] that was the idea we kinda agreed on the observability meeting yes
[15:35:41] ack
[15:35:56] yeah exactly
[15:36:47] good point btw, I'm also changing the other checklists to remove icinga contacts in private.git
[15:39:40] I'm wondering if a contact is required in icinga for the cgi permission settings or the http Authorization username is what counts, I'm assuming the latter
[15:39:54] volans: XioNoX: talk me out of doing something heinous involving the homer keyholder, a shell script that mangles juniper cli output, and a cronjob that writes to prometheus node_exporter textfiles
[15:40:37] cdanis: don't do it!
[15:40:46] don't do it :P
[15:40:52] what do you wanna do?
[15:41:17] godog: i dont think the contacts are needed for icinga login. just need the entries in /etc/icinga/cgi.cfg
[15:41:42] paravoid: track the value of some error counters which are not present in SNMP
[15:42:05] which ones?
[15:42:14] `show services accounting errors inline-jflow | match "Flow Creation Fail"`
[15:42:28] godog: and that is just a flat file in module/icinga/file/cgi.cfg
[15:42:31] jbond42: indeed, looks like it! although https://wikitech.wikimedia.org/wiki/Ops_Onboarding mentions that we're also using the contacts to grant users permissions to act on their "own" hosts, services
[15:42:38] depending on the router you will also need to write `inline-jflow fpc-slot N` for different values of N
[15:43:49] jbond42: I'll bring it up at the meeting tomorrow as well
[15:44:36] godog: like every thing icinga "i could be mistaken because nagios is wiered in how it does things" but i dont imidiatly see a link but yes im sure there could be some historic knowlage around that
[15:45:31] jbond42: heheh indeed! doesn't fail to disappoint and surprise at the same time
[15:46:51] godog: and stright away i see im wrong. i think icinga itsself maps contacts to objects under the hood https://github.com/wikimedia/puppet/blob/production/modules/icinga/files/cgi.cfg#L176-L183
[15:48:03] gasp indeed
[15:48:26] * jbond42 and im not looking under the hood have made that mistake before ;)
[15:51:04] heheh ok gotta run
[15:59:58] cdanis: checkout Juniper streaming telemetry instead
[16:01:00] volans: it's ok, I sent him in a great rabbit hole ^ we're good for a few days
[16:01:01] XioNoX: interesting, thanks
[16:01:16] lol
[16:01:42] Juniper documentation is excellent at being verbose while not telling you what actually matters
[16:02:05] 2h of meetings, but we can discuss it afterwards
[16:02:24] okay there apparently is `/junos/system/linecard/services/inline-jflow/`
[16:02:29] "Packet Forwarding Engine sensor for performance metrics of the inline flow sampling process, such as the number of active flows and the number of exported flows."
[16:02:50] doesn't say whether or not it has the error counters, and it's interesting that "number of active flows" isn't available from the other commands I've used so far
[16:02:59] however there is an issue XioNoX -- "Junos OS Release 16.1R3 and later on MX series and PTX series routers only."
[16:03:04] :P
[16:03:18] cdanis: we're going on 17+ everywhere
[16:03:28] how soon? I thought we were on 15
[16:03:37] https://phabricator.wikimedia.org/T243080 and https://phabricator.wikimedia.org/T242947#5812926
[16:03:41] oh no that's just cr3-esams
[16:03:53] ok interesting
[16:04:53] pmacct is implementing some, and there are many POC on github
[16:06:33] lol, the juniper website just denied me access to download their protobuf definitions
[16:08:46] https://webdownload.juniper.net/swdl/dl/secure/site/1/record/85317.html?pf=MX204 --> "You have encountered this error because your account privileges do not permit access to the information or service requested."
[16:08:54] guess that means it's lunchtime instead :)
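
Note on the 15:39:54 idea (CLI output mangled into node_exporter textfiles): a minimal Python sketch of what that could look like, in place of the shell script mentioned in the channel. Everything specific here is an assumption and not taken from the log: the layout of SAMPLE (the real `show services accounting errors inline-jflow` output may differ), the metric name, the label names, and the router name `cr-example`. Fetching the output from the router (e.g. via the homer keyholder) is deliberately left out.

```python
import os
import re
import tempfile

# Hypothetical sample of the CLI output; the real layout may differ.
SAMPLE = """\
Error information
    FPC Slot: 0
    Flow Creation Failures: 12
    Export Packet Failures: 0
"""

# Same substring as the `match "Flow Creation Fail"` filter from the channel.
COUNTER_RE = re.compile(r"Flow Creation Fail\w*\s*:\s*(\d+)")

# Hypothetical metric name, chosen only for this illustration.
METRIC = "juniper_inline_jflow_flow_creation_failures_total"


def parse_flow_creation_failures(cli_output):
    """Return the counter value as an int, or None if it is not found."""
    m = COUNTER_RE.search(cli_output)
    return int(m.group(1)) if m else None


def render_metric(router, fpc_slot, value):
    """Render one sample in the Prometheus text exposition format."""
    return (
        f"# HELP {METRIC} Inline jflow flow creation failures.\n"
        f"# TYPE {METRIC} counter\n"
        f'{METRIC}{{router="{router}",fpc_slot="{fpc_slot}"}} {value}\n'
    )


def write_textfile(path, content):
    """Write atomically so the collector never reads a half-written file."""
    directory = os.path.dirname(path) or "."
    fd, tmp = tempfile.mkstemp(dir=directory, suffix=".tmp")
    with os.fdopen(fd, "w") as f:
        f.write(content)
    os.chmod(tmp, 0o644)  # mkstemp creates 0600; the exporter may run as another user
    os.replace(tmp, path)


if __name__ == "__main__":
    value = parse_flow_creation_failures(SAMPLE)
    if value is not None:
        print(render_metric("cr-example", 0, value), end="")
```

A cron job would call write_textfile() with a `.prom` path inside the directory the node_exporter textfile collector is configured to read; the temporary file uses a different suffix so it is not picked up mid-write. On routers that need `inline-jflow fpc-slot N`, the same parse would be repeated per slot with a different `fpc_slot` label.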