[00:34:36] 06Labs, 10Labs-Infrastructure: New instance creation broken - https://phabricator.wikimedia.org/T144330#2596502 (10yuvipanda) [01:52:11] 10Quarry: Forking your own query results in a new one owned by YuviPanda - https://phabricator.wikimedia.org/T144309#2596611 (10Huji) Now it works fine. Could you please submit the patch that fixed it here before you close the task? [02:19:05] (03PS4) 10Krinkle: Clean up in preparation for the "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307104 (https://phabricator.wikimedia.org/T64914) [02:19:07] (03PS6) 10Krinkle: [WIP] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [02:21:01] (03PS5) 10Krinkle: Clean up in preparation for the "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307104 (https://phabricator.wikimedia.org/T64914) [02:21:03] (03PS7) 10Krinkle: [WIP] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [02:35:29] 10Tool-Labs-tools-Global-user-contributions, 13Patch-For-Review: Global user contributions: Implement "Recent only" feature - https://phabricator.wikimedia.org/T64914#2596652 (10Krinkle) a:03Krinkle [02:38:56] (03PS1) 10Krinkle: guc: Fix broken display of projects with non-zero edits [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307678 (https://phabricator.wikimedia.org/T118662) [02:39:18] 10Tool-Labs-tools-Global-user-contributions, 13Patch-For-Review, 07Regression: GUC tool always outputs "searched 1 projects" instead of actual count - https://phabricator.wikimedia.org/T118662#2596662 (10Krinkle) a:05Devirk>03Krinkle [02:39:38] (03CR) 10Krinkle: [C: 032] guc: Fix broken display of projects with non-zero edits [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307678 (https://phabricator.wikimedia.org/T118662) (owner: 10Krinkle) [02:40:27] (03CR) 10Krinkle: [V: 032] guc: Fix broken display of projects with non-zero edits [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307678 (https://phabricator.wikimedia.org/T118662) (owner: 10Krinkle) [02:40:48] 10Tool-Labs-tools-Global-user-contributions, 07Regression: GUC tool always outputs "searched 1 projects" instead of actual count - https://phabricator.wikimedia.org/T118662#2596666 (10Krinkle) 05Open>03Resolved [02:41:49] 10Tool-Labs-tools-Global-user-contributions: Global user contributions: Implement Atom/RSS feed output format - https://phabricator.wikimedia.org/T64920#2596671 (10Krinkle) [02:41:58] 10Tool-Labs-tools-Global-user-contributions, 13Patch-For-Review: Global user contributions: Implement "Recent only" feature - https://phabricator.wikimedia.org/T64914#648744 (10Krinkle) [02:42:00] 10Tool-Labs-tools-Global-user-contributions: Global user contributions: Implement Atom/RSS feed output format - https://phabricator.wikimedia.org/T64920#649290 (10Krinkle) [02:43:31] (03PS6) 10Krinkle: Clean up in preparation for the "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307104 (https://phabricator.wikimedia.org/T64914) [02:43:38] (03PS8) 10Krinkle: [WIP] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [02:53:31] (03PS9) 10Krinkle: [WIP] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [02:53:32] !log tools.jouncebot Cherry-picked https://gerrit.wikimedia.org/r/#/c/307333 for testing [02:53:36] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.jouncebot/SAL, Master [02:55:40] (03PS10) 10Krinkle: [WIP] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [02:59:45] (03PS11) 10Krinkle: [WIP] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [03:00:17] (03PS12) 10Krinkle: [WIP] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [03:15:20] 06Labs, 10Labs-Infrastructure: Build new Precise base image - https://phabricator.wikimedia.org/T144335#2596700 (10Andrew) [03:26:06] (03PS7) 10Krinkle: Clean up in preparation for the "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307104 (https://phabricator.wikimedia.org/T64914) [03:26:08] (03PS13) 10Krinkle: [WIP] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [03:30:55] (03PS14) 10Krinkle: [WIP] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [03:31:28] (03PS15) 10Krinkle: [WIP] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [03:35:10] (03PS16) 10Krinkle: [WIP] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [03:36:41] (03PS17) 10Krinkle: [WIP] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [03:48:13] (03PS18) 10Krinkle: Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) [03:48:30] (03CR) 10Krinkle: [C: 032] Clean up in preparation for the "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307104 (https://phabricator.wikimedia.org/T64914) (owner: 10Krinkle) [03:48:35] (03CR) 10Krinkle: [C: 032] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) (owner: 10Krinkle) [03:48:37] (03CR) 10Krinkle: [V: 032] Implement "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307050 (https://phabricator.wikimedia.org/T64914) (owner: 10Krinkle) [03:48:42] (03CR) 10Krinkle: [V: 032] Clean up in preparation for the "Recent only" feature [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307104 (https://phabricator.wikimedia.org/T64914) (owner: 10Krinkle) [03:55:01] (03PS1) 10Krinkle: readme: Add 'composer install' to set up [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307683 [03:55:09] (03CR) 10Krinkle: [C: 032 V: 032] readme: Add 'composer install' to set up [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/307683 (owner: 10Krinkle) [03:57:13] 10Tool-Labs-tools-Global-user-contributions: Global user contributions: Implement Atom/RSS feed output format - https://phabricator.wikimedia.org/T64920#2596717 (10Krinkle) [03:57:15] 10Tool-Labs-tools-Global-user-contributions, 13Patch-For-Review: Global user contributions: Implement "Recent only" feature - https://phabricator.wikimedia.org/T64914#2596715 (10Krinkle) 05Open>03Resolved Deployed to . There is a new dropdown menu featuring options "Recent... [04:46:22] what is the difference between wikimedia labs and tools labs? I would like to program a tool which is not a wiki extension, which should I use? [04:51:49] Gryllida: https://wikitech.wikimedia.org/wiki/Labs_labs_labs [04:54:00] am I correct in understanding that wikimedia labs is a thing for working on wiki software, and tools labs are tools which do not intend to be deployed 'on production'? [04:54:13] and thanks, the wiki page made it clear for me which one I need [04:57:13] Gryllida: not necessarily wiki software (i.e., mediawik) You can set up mediawiki instances on both tool labs and other labs projects [04:57:26] but generally that should not be done on tool labs, I think [05:01:43] Thanks... :) [05:15:52] can someone get this channel out of #wikimedia-bans? [05:16:08] I've no idea how to, but I think requiring registration to be here beats a good chunk of the positives of IRC... [05:16:32] yuvipanda: hi [05:16:49] hi [05:17:43] yuvipanda: what happened -- you were unable to join until you identified to nickserv? [05:20:12] yup [05:20:30] or rather, I was identified, and then I got disconnected, and auto re-connect failed [05:21:12] what error message do you get? [05:22:48] @appservice-irc:matrix.org kicked YuviPanda. Reason: IRC error on #wikimedia-labs: err_bannedfromchan [05:22:50] 10Quarry: Forking your own query results in a new one owned by YuviPanda - https://phabricator.wikimedia.org/T144309#2596738 (10yuvipanda) I, uh, just restarted redis and flushed out the db :| when bringing it back up after an outage earlier I had started the wrong redis instance... I really should move Quarry... [05:23:34] yuvipanda: are you able to join #wikimedia-bans ? [05:23:57] I have to run away right now though, so can't debug :( [05:24:10] I think bd808 told me yesterday that #wikimedia-bans started enforcing registration in an effort to fight spam [05:25:29] I think that restriction is gone now [05:25:41] bd808: it's still active. [05:25:42] it was active for ~15 hours or so [05:25:52] gh [05:25:54] ugh [05:26:34] bd808: the $~a entry at #wikimedia-bans seems to be it. [05:26:52] yeah, if they have $-a then auth is required [05:26:55] frack [05:27:36] yuvipanda: do you want the global restriction gone, or you want this channel to stop inheriting bans from #wikimedia-bans ? [05:27:45] yuvipanda: (whenever you're back) [05:27:59] Ideally, the global restriction should be gone. [05:28:06] ok [05:28:07] (got pulled in due to a possible fire) [05:28:46] yeah. I was promised yesterday that -a was temporary [05:29:57] yuvipanda: You could try to do /mode +e $~a [05:30:34] the way I've my client setup I can't realllly execute IRC commands tho [05:30:56] looks like maybe c put it back actually after lifting it for a while yesterday [05:32:22] the op flag I had here seems to have disappeared too :/ [05:32:57] uh [05:33:00] 20 bd808 +Aiotv (op) [modified 1w 1d 10h ago] [05:38:10] yuvipanda: the global restriction is gone now; ta :) [05:39:38] \o/ thank you, gryllida [05:59:38] 06Labs, 10Labs-Infrastructure: Instance creation and deletion on Labs is funky - https://phabricator.wikimedia.org/T144339#2596823 (10madhuvishy) [10:59:27] 10Tool-Labs-tools-Other: Geohack should be mobile friendly - https://phabricator.wikimedia.org/T103409#2597847 (10JGirault) [11:53:13] 06Labs, 10Mail: failed exim service on labs instances - https://phabricator.wikimedia.org/T135033#2286378 (10AlexMonk-WMF) I thought this had been fixed? [11:55:09] 06Labs, 07Documentation: Document, explain, diagram labs vlans and network setup - https://phabricator.wikimedia.org/T100529#2597947 (10hashar) Mark took time to explain at length the various network and wrote a nice summary on T140257#2597938 [12:44:05] 06Labs, 10Wikimedia-Labs-General, 10DBA, 06Operations, 07Tracking: Database replication services (tracking) - https://phabricator.wikimedia.org/T50930#2597985 (10jcrespo) [12:44:09] 06Labs, 10DBA, 06Editing-Analysis: Replicate editor_month table from analytics-store to Labs - https://phabricator.wikimedia.org/T143955#2597983 (10jcrespo) 05Open>03stalled You are asking an implementation "Replicate editor_month table from analytics-store to Labs", but not what you need to do on labs.... [13:12:07] 06Labs: Request creation of wmwcourse labs project - https://phabricator.wikimedia.org/T144388#2598041 (10Nikerabbit) [14:04:34] 06Labs, 10Labs-Infrastructure: New instance first puppet run is broken - https://phabricator.wikimedia.org/T144330#2598176 (10chasemp) [14:05:39] 06Labs, 10Labs-Infrastructure: New instance first puppet run is broken - https://phabricator.wikimedia.org/T144330#2596502 (10chasemp) ```root@disposable-1014-test-vm-09:~# /root/firstboot.sh + '[' '!' -f /var/lib/cloud/instance/boot-finished ']' + echo 'Enabling console logging for puppet while it does the in... [14:06:23] 06Labs, 10Labs-Infrastructure: New instance first puppet run is broken - https://phabricator.wikimedia.org/T144330#2598198 (10chasemp) [14:08:42] 06Labs, 10Labs-Infrastructure: New instance first puppet run is broken - https://phabricator.wikimedia.org/T144330#2598200 (10chasemp) [14:10:49] + project='{ [14:11:01] chasemp: the curl sed sed is off :( [15:31:42] for CI you can poke #wikimedia-releng [15:31:54] will probably end up stopping nodepool and prevent jenkins from processing jobs [15:31:58] patch will queue up in zuul [15:34:27] PROBLEM - Puppet run on tools-cron-01 is CRITICAL: CRITICAL: 100.00% of data above the critical threshold [0.0] [15:49:36] RECOVERY - Puppet run on tools-webgrid-lighttpd-1403 is OK: OK: Less than 1.00% above the threshold [0.0] [15:50:02] RECOVERY - Puppet run on tools-mail-01 is OK: OK: Less than 1.00% above the threshold [0.0] [15:54:13] RECOVERY - Puppet run on tools-worker-1016 is OK: OK: Less than 1.00% above the threshold [0.0] [15:54:15] RECOVERY - Puppet run on tools-exec-1410 is OK: OK: Less than 1.00% above the threshold [0.0] [15:54:19] RECOVERY - Puppet run on tools-flannel-etcd-02 is OK: OK: Less than 1.00% above the threshold [0.0] [15:54:33] RECOVERY - Puppet run on tools-webgrid-lighttpd-1412 is OK: OK: Less than 1.00% above the threshold [0.0] [15:54:39] RECOVERY - Puppet run on tools-web-static-01 is OK: OK: Less than 1.00% above the threshold [0.0] [15:54:43] RECOVERY - Puppet run on tools-k8s-master-01 is OK: OK: Less than 1.00% above the threshold [0.0] [15:54:51] RECOVERY - Puppet run on tools-mail is OK: OK: Less than 1.00% above the threshold [0.0] [15:54:57] RECOVERY - Puppet run on tools-exec-1409 is OK: OK: Less than 1.00% above the threshold [0.0] [15:55:22] RECOVERY - Puppet run on tools-webgrid-lighttpd-1210 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:08] RECOVERY - Puppet run on tools-exec-1216 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:12] RECOVERY - Puppet run on tools-exec-gift is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:16] RECOVERY - Puppet run on tools-k8s-etcd-02 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:22] RECOVERY - Puppet run on tools-webgrid-lighttpd-1410 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:22] RECOVERY - Puppet run on tools-webgrid-lighttpd-1209 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:24] RECOVERY - Puppet run on tools-cron-01 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:28] RECOVERY - Puppet run on tools-bastion-03 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:34] RECOVERY - Puppet run on tools-k8s-etcd-01 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:36] RECOVERY - Puppet run on tools-exec-1217 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:40] RECOVERY - Puppet run on tools-worker-1021 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:41] RECOVERY - Puppet run on tools-bastion-02 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:50] RECOVERY - Puppet run on tools-elastic-01 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:51] RECOVERY - Puppet run on tools-exec-1208 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:53] RECOVERY - Puppet run on tools-webgrid-lighttpd-1205 is OK: OK: Less than 1.00% above the threshold [0.0] [15:59:59] RECOVERY - Puppet run on tools-webgrid-generic-1402 is OK: OK: Less than 1.00% above the threshold [0.0] [16:00:05] RECOVERY - Puppet run on tools-webgrid-lighttpd-1408 is OK: OK: Less than 1.00% above the threshold [0.0] [16:00:05] RECOVERY - Puppet run on tools-webgrid-lighttpd-1409 is OK: OK: Less than 1.00% above the threshold [0.0] [16:00:15] RECOVERY - Puppet run on tools-exec-1212 is OK: OK: Less than 1.00% above the threshold [0.0] [16:36:51] !log tools Disabling puppet across tools hosts (T134896) [16:38:44] the bots are offline? [16:39:25] bd808: would you happen to know anything about ^? [16:39:50] I was just going to look [16:40:02] thanks [16:43:05] they all think they are running... [16:43:34] sometimes a network interruption will hose them up silently iirc [16:43:42] a restart may be in order [16:43:49] yeah, on it [16:45:45] labs-morebots: hello? [16:45:45] I am a logbot running on tools-exec-1213. [16:45:46] Messages are logged to wikitech.wikimedia.org/wiki/Server_Admin_Log. [16:45:46] To log a message, type !log . [16:50:49] !log Puppet disabling complete (T134896) [16:50:50] Puppet is not a valid project. [16:50:55] !log tools Puppet disabling complete (T134896) [16:51:00] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL, Master [16:51:15] Theres no stashbot [16:51:18] yeah [16:51:20] bd808 ^^ [16:52:44] poor handling of an irc hiccup I think. Should show up in a minute [16:52:53] thanks [16:54:59] !log tools Rsync-ed over latest backup of /srv/scratch from labstore1001 to labstore1003 [16:55:03] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL, Master [17:05:11] PROBLEM - Puppet run on tools-exec-1221 is CRITICAL: CRITICAL: 33.33% of data above the critical threshold [0.0] [17:10:15] PROBLEM - Puppet run on tools-exec-1410 is CRITICAL: CRITICAL: 44.44% of data above the critical threshold [0.0] [17:20:15] RECOVERY - Puppet run on tools-exec-1410 is OK: OK: Less than 1.00% above the threshold [0.0] [17:20:17] PROBLEM - Puppet run on tools-exec-1207 is CRITICAL: CRITICAL: 22.22% of data above the critical threshold [0.0] [17:25:08] PROBLEM - Puppet run on tools-worker-1025 is CRITICAL: CRITICAL: 11.11% of data above the critical threshold [0.0] [17:25:18] RECOVERY - Puppet run on tools-exec-1207 is OK: OK: Less than 1.00% above the threshold [0.0] [17:25:42] PROBLEM - Puppet run on tools-exec-1209 is CRITICAL: CRITICAL: 30.00% of data above the critical threshold [0.0] [17:30:14] RECOVERY - Puppet run on tools-exec-1221 is OK: OK: Less than 1.00% above the threshold [0.0] [17:35:08] RECOVERY - Puppet run on tools-worker-1025 is OK: OK: Less than 1.00% above the threshold [0.0] [17:35:32] PROBLEM - Puppet run on tools-exec-1220 is CRITICAL: CRITICAL: 10.00% of data above the critical threshold [0.0] [17:35:40] RECOVERY - Puppet run on tools-exec-1209 is OK: OK: Less than 1.00% above the threshold [0.0] [17:40:31] RECOVERY - Puppet run on tools-exec-1220 is OK: OK: Less than 1.00% above the threshold [0.0] [17:40:39] PROBLEM - Puppet run on tools-exec-1218 is CRITICAL: CRITICAL: 22.22% of data above the critical threshold [0.0] [17:45:36] PROBLEM - Puppet run on tools-exec-1217 is CRITICAL: CRITICAL: 10.00% of data above the critical threshold [0.0] [17:50:05] PROBLEM - Puppet run on tools-exec-1216 is CRITICAL: CRITICAL: 11.11% of data above the critical threshold [0.0] [17:50:08] ^all me - should get fixed soon [17:50:18] chasemp: should we silence this? I don't know how to [17:50:19] PROBLEM - Puppet run on tools-worker-1008 is CRITICAL: CRITICAL: 10.00% of data above the critical threshold [0.0] [17:50:28] PROBLEM - Puppet run on tools-worker-1007 is CRITICAL: CRITICAL: 10.00% of data above the critical threshold [0.0] [17:50:38] RECOVERY - Puppet run on tools-exec-1218 is OK: OK: Less than 1.00% above the threshold [0.0] [17:50:54] PROBLEM - Puppet run on tools-worker-1005 is CRITICAL: CRITICAL: 11.11% of data above the critical threshold [0.0] [17:50:57] !log tools Reenabling puppet across tools hosts. [17:50:57] Only way is to kill the bot I think [17:50:58] PROBLEM - Puppet run on tools-exec-1215 is CRITICAL: CRITICAL: 20.00% of data above the critical threshold [0.0] [17:51:02] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL, Master [17:51:15] aah [17:55:19] PROBLEM - Puppet run on tools-k8s-master-02 is CRITICAL: CRITICAL: 11.11% of data above the critical threshold [0.0] [17:55:27] PROBLEM - Puppet run on tools-exec-1214 is CRITICAL: CRITICAL: 20.00% of data above the critical threshold [0.0] [17:55:27] PROBLEM - Puppet run on tools-worker-1002 is CRITICAL: CRITICAL: 10.00% of data above the critical threshold [0.0] [17:55:35] RECOVERY - Puppet run on tools-exec-1217 is OK: OK: Less than 1.00% above the threshold [0.0] [17:56:15] PROBLEM - Puppet run on tools-worker-1004 is CRITICAL: CRITICAL: 10.00% of data above the critical threshold [0.0] [17:59:39] madhuvishy, the bot can be quieted [17:59:57] tom29739: ah how can I do that [18:00:05] @quiet shinken-wm [18:00:08] RECOVERY - Puppet run on tools-exec-1216 is OK: OK: Less than 1.00% above the threshold [0.0] [18:00:12] PROBLEM - Puppet run on tools-worker-1015 is CRITICAL: CRITICAL: 10.00% of data above the critical threshold [0.0] [18:00:16] I think it's that [18:00:20] RECOVERY - Puppet run on tools-k8s-master-02 is OK: OK: Less than 1.00% above the threshold [0.0] [18:00:20] RECOVERY - Puppet run on tools-worker-1008 is OK: OK: Less than 1.00% above the threshold [0.0] [18:00:26] PROBLEM - Puppet run on tools-bastion-03 is CRITICAL: CRITICAL: 20.00% of data above the critical threshold [0.0] [18:00:26] RECOVERY - Puppet run on tools-worker-1007 is OK: OK: Less than 1.00% above the threshold [0.0] [18:00:29] It's something. [18:00:30] PROBLEM - Puppet run on tools-worker-1017 is CRITICAL: CRITICAL: 10.00% of data above the critical threshold [0.0] [18:00:33] @q shinken-wm [18:00:33] !quiet shinken-wm [18:00:47] @uq shinken-wm [18:01:07] I am running http://meta.wikimedia.org/wiki/WM-Bot version wikimedia bot v. 2.8.0.0 [libirc v. 1.0.3] my source code is licensed under GPL and located at https://github.com/benapetr/wikimedia-bot I will be very happy if you fix my bugs or implement new features [18:01:07] @help [18:01:37] madhuvishy, it's @q and @unq [18:01:40] @unq shinken-wm [18:01:48] tom29739: cool! thanks [18:02:00] @q shinken-wm [18:02:01] Permission denied [18:02:16] ha [18:02:29] I'll add you. [18:02:35] thanks tom29739 :) [18:03:11] madhuvishy, you don't have a cloak :/ [18:03:37] sid60847@gateway/web/irccloud.com/x-scckqdtpkfwyyhed is your hostmask, which changes on each connect I think [18:03:38] 06Labs, 10Labs-Infrastructure, 07LDAP: Remove shell user "80686" - https://phabricator.wikimedia.org/T63967#2599116 (10demon) I don't understand why LDAP doesn't like the username. Can validnames be fixed? [18:03:47] weird - can you quiet it for now? I'll look into that after this migration [18:03:55] @q shinken-wm [18:03:58] Done. [18:04:00] thank you :) [18:14:16] !log gerrit gerrit-mysql instance has been setup following docs at https://phabricator.wikimedia.org/P3939 [18:14:20] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Gerrit/SAL, Master [18:15:02] arg, wrong project [18:15:42] !log git gerrit-mysql instance has been setup following docs at P3939 [18:15:45] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Git/SAL, Master [18:16:10] !log git gerrit-test3 can be replicated following docs at P3637 [18:16:13] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Git/SAL, Master [18:16:22] /win 2 [18:18:44] !log tools Scratch migration complete for all k8s workers (T134896) [18:18:46] T134896: move nfs /scratch to labstore1003 - https://phabricator.wikimedia.org/T134896 [18:18:48] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL, Master [18:18:56] 06Labs, 06Operations, 06Release-Engineering-Team, 10wikitech.wikimedia.org, 07LDAP: Rename specific account in LDAP, Wikitech and Gerrit - https://phabricator.wikimedia.org/T133968#2599177 (10demon) 05Open>03Resolved >>! In T133968#2551726, @Sophivorus wrote: > I modified my rename request. The reque... [18:35:42] !log extdist adding self as temp admin to fix role class names [18:35:46] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Extdist/SAL, Master [18:36:46] legoktm: ^ eh.. i found that and added myself but i still don't see the instances [18:37:03] looks empty to me [18:37:13] mutante: uhh, you don't see extdist-01 and extdist-02? [18:37:25] no [18:37:26] they do exist... :P [18:37:35] can you click "Configure instance" [18:37:42] do you see the current role name there? [18:37:54] that is the part we'd have to change when that gets merged [18:38:16] i'm adding the new one to the "puppet groups" [18:38:22] so that it can be selected... [18:38:54] * mutante clicks "manage puppet groups" and sees the existing classes . so i am in the extdist context [18:39:13] clicks "Add class" [18:39:44] mutante legoktm role::labs::extdist vs role::extdist [18:39:47] former doesn't exist, latter does [18:40:23] we are fixing it to "role::extdist::extension" [18:40:35] or that would be the plan [18:41:08] i was able to add the class to the puppet groups [18:41:16] before the merge [18:41:20] mutante: oh, log out and log in again from wikitech? just fixed it for me... [18:41:21] so kind of before it exists [18:41:41] whaaat [18:42:01] role::extdist::extension? that... makes no sense, but ok I should shut up and not care. [18:42:03] * yuvipanda does [18:42:58] as opposed to role::extdist it doesn't cause warnings and we can stop include manifests/role one day [18:43:28] i am confused now what role::labs::extdist is [18:43:49] here's the thing that made me fix it in the first place too: [18:43:59] # = Class: role::labs::extdist [18:44:01] description [18:44:16] class role::extdist { [18:44:18] actual class name [18:45:46] i wanna fix puppet warnings and that inconsistency there [18:49:33] abandoned [18:50:07] gives up [18:50:50] :/ sorry [19:08:42] wikibugs is down [19:09:14] huh [19:09:17] everything died [19:09:29] 2016-08-31 18:33:23,932 - wikibugs.wb2-phab - INFO - Shutting down [19:09:53] 2016-08-31 18:41:37,600 - irc3.wikibugs - INFO - Trying to join #wikimedia-labs [19:09:53] 2016-08-31 18:41:37,600 - irc3.wikibugs - DEBUG - > JOIN #wikimedia-labs [19:09:54] 2016-08-31 18:42:03,378 - irc3.wikibugs - DEBUG - > QUIT :INT [19:09:56] weird. [19:10:51] legoktm: connection hang maybe? [19:11:15] !log tools.wikibugs started redis2irc and wikibugs jobs [19:11:21] I guess? [19:11:22] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.wikibugs/SAL, Master [19:13:18] I think normally tools mail redirects all mails? [19:14:11] Luke0815152: yes [19:14:16] unless tools-mail is dead again [19:14:31] 06Labs, 10DBA, 06Editing-Analysis, 05Security: Replicate editor_month table from analytics-store to Labs - https://phabricator.wikimedia.org/T143955#2599417 (10jcrespo) I think it would be easier if you created a public dump, then created a tool with database access and made it public, importing it every s... [19:15:00] valhallasw`cloud: I don't guess so, I recived my test mail :) [19:21:13] valhallasw`cloud hi we can support also draft1 in the gerrit reviewer bot as long as we doint give it drafts view rights it wont view drafts but will resolve the issue around gerrit inline editting not adding reviewers :) [19:36:19] !log tools Scratch migration on all non exec/worker nodes complete (T134896) [19:36:20] T134896: move nfs /scratch to labstore1003 - https://phabricator.wikimedia.org/T134896 [19:36:23] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL, Master [19:37:07] hi there! [19:43:53] Hi could i have some help, i clicked deactive floating ip from an instance i was going to delete and re create [19:44:04] but i carnt add the floating ip now to any of the instances [19:44:16] the project is git [19:44:27] and the floating ip i had it on was gerrit-test3 [19:47:20] in my companies' network (which i need to use on my laptop) are only the ports 80 and 443 open, so i can't access tools via ssh [19:47:34] is it possible to install something like shellinabox somewhere, to get access through port 443? [19:47:52] https://code.google.com/p/shellinabox/ [19:50:59] paladox: uh, what? how would draft rights help? [19:51:30] paladox: mediawiki-svn doesn't receive any messages about drafts [19:51:42] (or whatever the list is called these days, mediawiki-commits?) [19:52:14] (03CR) 10Jean-Frédéric: "@Edouard, would you be able to make these changes ? :)" [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/303933 (https://phabricator.wikimedia.org/T142570) (owner: 10EdouardHue) [19:52:26] valhallasw`cloud i mean for inline edits [19:52:28] (03CR) 10Jean-Frédéric: [C: 032] Add project to dbPrimaryKey [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/306012 (https://phabricator.wikimedia.org/T143481) (owner: 10Lokal Profil) [19:53:10] (03CR) 10Jean-Frédéric: "Ah, damn, wanted to test it locally first. Oh well − I’m pretty sure it will work :)" [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/306012 (https://phabricator.wikimedia.org/T143481) (owner: 10Lokal Profil) [19:53:23] (03Merged) 10jenkins-bot: Add project to dbPrimaryKey [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/306012 (https://phabricator.wikimedia.org/T143481) (owner: 10Lokal Profil) [19:57:10] valhallasw`cloud What i mean is i spoken to upstream and they say we can do that [19:57:30] allow it to read draft-set or what ever it is for uploading drafts like it is for patchset [19:57:41] (03PS2) 10Jean-Frédéric: Setup local development environment for ErfgoedBot [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/303498 [19:57:57] (03CR) 10Jean-Frédéric: Setup local development environment for ErfgoedBot (032 comments) [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/303498 (owner: 10Jean-Frédéric) [19:58:00] but it wont have drafts rights so it wont read drafts but will read patchsets but inline edit fires the drafts instead of patchset which is intended [20:00:56] paladox: I have responded to your patchset on github -- please respond there. [20:02:47] !log tools.heritage Deployed latest from Git: 22496d6 (T143481), c964df5, 00ccf8a, 1891ee0, 4458eeb, 55fea41 [20:02:48] T143481: Fatal error: Cannot use object of type stdClass as array in FormatHtml.php on line 96 - https://phabricator.wikimedia.org/T143481 [20:02:52] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.heritage/SAL, Master [20:04:39] chasemp hi, it seems ssh into new instances is broken [20:04:55] Im getting this [20:05:00] $ ssh -i /c/Users/**/.ssh/id_rsa.pub gerrit-test3 [20:05:00] Password: [20:05:01] Permission denied (publickey,keyboard-interactive). [20:05:01] Killed by signal 1. [20:05:20] or andrewbogott or yuvipanda ^^ [20:06:37] Or Krenair ^^ [20:07:07] Even ssh into bastion manually [20:07:15] and doing ssh gerrit-test3 fails [20:07:29] paladox@bastion-01:~$ ssh gerrit-test3 [20:07:29] The authenticity of host 'gerrit-test3 (10.68.18.136)' can't be established. [20:07:29] ECDSA key fingerprint is 92:d3:31:ff:02:cb:3c:df:54:74:23:74:d4:6d:f2:9b. [20:07:30] Are you sure you want to continue connecting (yes/no)? yes [20:07:30] Warning: Permanently added 'gerrit-test3,10.68.18.136' (ECDSA) to the list of known hosts. [20:07:30] Password: [20:07:31] Permission denied (publickey,keyboard-interactive). [20:07:57] (03Abandoned) 10Jean-Frédéric: Safeguard against non-array $row [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/305794 (https://phabricator.wikimedia.org/T143481) (owner: 10Jean-Frédéric) [20:08:39] paladox, you can't ssh into the bastion and then ssh to an instance [20:08:50] unless you use agent forwarding or something, which you shouldn't be doing [20:08:55] Krenair i can ssh into bastion. [20:09:12] Yes [20:09:14] and nope i am manually ssh into bastion for testing to see if something is broken with ssh for me [20:09:24] but i did ssh paladox@bastion.wmflabs.org [20:09:30] If you ssh to a bastion and then ssh to an instance from there, you should get a password prompt [20:09:37] Your ssh keys won't work without agent forwarding which you should not be using [20:09:49] Yes, but it dosent work with my password [20:09:53] it just fails [20:10:01] It works on the other instances [20:10:12] Your password actually works for logging into instances? [20:10:15] Yes [20:10:31] Okay... I seriously suggest you never actually use that either [20:10:35] This is my config file [20:10:36] Host gerrit-test3 [20:10:36] ProxyCommand ssh -a -W %h:%p -A paladox@bastion.wmflabs.org [20:10:36] User paladox [20:10:40] ssh -A? [20:11:08] I think it does that [20:11:21] I know what it does [20:11:59] But this problem seems to have happened after upgrading labs [20:12:16] from kilo to liberty [20:12:24] uh huh. [20:12:27] Before then it was working when i was creating istances [20:12:43] I know that other users are having this problem like mutante [20:12:46] There have been many successful instance creations since we upgraded to Liberty [20:13:01] Oh, but it seems it wont work with a password [20:13:06] I use my ssh key [20:13:12] but it seems to always ask for a password [20:13:56] This is my command [20:14:01] ssh -i /c/Users/**/.ssh/id_rsa.pub gerrit-test3 [20:14:09] I replace my real name with ** [20:15:01] Okay, yes [20:15:04] I looked at the console log [20:15:07] This instance is broken [20:15:14] But it's not because of Liberty [20:15:42] Oh [20:15:59] Is it because of a broken kernal [20:16:40] I think there was an update for a kernal and that broke things [20:16:52] Not sure if the new jessie image that is on labs includes this [20:16:53] kernal [20:16:58] Can you please stop jumping to such conclusions? [20:17:04] Ok [20:17:05] sorry [20:17:46] The issue is that it thinks it's in a project called '{', instead of 'git' [20:17:56] Oh [20:18:01] I know why [20:18:10] Ok thanks [20:18:30] I'm going to replace it with an instance with a new testing image that should have the problem fixed [20:18:40] Ok thanks [20:18:41] :) [20:20:13] This issue is my fault [20:20:24] Oh [20:20:56] Krenair yay it work [20:20:59] worked thanks [20:21:09] could you publish that image for everyone please? [20:21:35] it is already public [20:21:51] just gotta select 'debian-8.5-jessie (testing)' in the image dropdown instead of debian-8.5-jessie [20:21:52] Oh, i thought it was a test image [20:21:56] it is [20:21:56] Ah [20:22:12] Could it be made debian-8.5-jessie instead of debian-8.5-jessie (testing) please? [20:22:13] (and.rew made the image) [20:22:23] otherwise users wont click on debian-8.5-jessie (testing) due to the name [20:22:28] I expect that will happen soon, paladox [20:22:28] Oh [20:22:32] Ok thanks [20:24:34] 06Labs: Replace debian-8.5-jessie image with debian-8.5-jessie (testing) image - https://phabricator.wikimedia.org/T144439#2599861 (10Paladox) [20:24:40] Krenair i created ^^ for a reminder [20:24:50] for andrewbogott [20:27:37] !log tools.heritage Pulled latest pywikibot (branch 2.0) from Git: 8 commits, including fix for T144438. [20:27:38] T144438: ErfgoedBot categorisation task keeps crashing - https://phabricator.wikimedia.org/T144438 [20:27:43] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.heritage/SAL, Master [20:32:00] Krenair hi when i try to associate a floating ip to the instance i get this error [20:32:01] × [20:32:01] Error: Unable to associate floating IP. [20:33:05] yeah you don't have any more available in that project [20:33:11] need to ask for a quota increase if you want an extra one [20:33:49] Oh i doint need an extra one [20:33:59] just that i carnt seem to use it for the other instance [20:34:23] What i mean is i deactivated it from the instance before i deleted it and re created it [20:34:32] but it isent allowing me to apply it to any instances [20:34:35] Krenair ^^ [20:35:14] going into dns and looking at the domain i get 208.80.155.149 (Unknown instance name) [20:36:25] Ah [20:36:26] It's this problem. [20:36:28] One sec [20:37:26] paladox, try now [20:37:29] I should file a task about that [20:38:19] Ok [20:38:23] thanks [20:38:44] Kreanir thanks that fixed it [20:41:30] maybe an upstream ticket [20:42:53] Oh [20:44:28] tom29739: can we bring back shinken-wm? I'll go look at my irc cloak stuff [20:44:46] @unq shinken-wm [20:45:04] !log Scratch migration complete on all grid exec nodes [20:45:05] Scratch is not a valid project. [20:45:15] madhuvishy, for cloaks: If you have a verified e-mail address, at least 250 edits on Wikimedia projects, and have been registered for at least three months, you may request a Wikipedia cloak at: http://is.gd/2RwcCz - Full details are available at: http://meta.wikimedia.org/wiki/IRC/Cloaks [20:45:35] !log tools Scratch migration complete on all grid exec nodes (T134896) [20:45:36] T134896: move nfs /scratch to labstore1003 - https://phabricator.wikimedia.org/T134896 [20:45:41] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL, Master [20:45:50] I'm pretty sure you can ignore those requirements, they're just to combat spam. [20:46:17] you muted the whole of *!*@internal-server-nat.wmflabs.org? :( [20:46:43] Krenair, that's how wm-bot does it [20:46:59] Krenair: is there a better way? [20:47:11] for well-behaving bots you can just mute their username or nickname [20:47:26] Krenair, if shinken-wm had a cloak, then it'd work better. [20:47:30] yes [20:47:52] And I can't quiet in here except with wm-bot, I don't have ops in here. [20:47:58] I guess we should register all wikimedia bots for the wikimedia cloak [20:48:09] and possibly even create a labs cloak. [20:48:10] labs-morebots doesn't have one either. [20:48:11] I am a logbot running on tools-exec-1213. [20:48:11] Messages are logged to wikitech.wikimedia.org/wiki/Server_Admin_Log. [20:48:11] To log a message, type !log . [20:48:18] grrrit-wm has a cloak :) [20:48:31] don't think we need a labs cloak paladox [20:48:36] Oh ok [20:48:45] !log tools Reenabled puppet across tools hosts [20:48:46] Would a wikimedia one be the better one? [20:48:49] Nor icinga-wm [20:48:50] yes [20:48:52] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL, Master [20:49:05] As it is owned by wikimedia i guess the limitation doint apply for it [20:49:07] Or stashbot [20:49:32] Yeh and jouncebot [20:49:33] The type of cloak you want is wikimedia/bot/piaget [20:49:56] https://meta.wikimedia.org/wiki/IRC/Cloaks/Bots [20:49:59] With "piaget" in that example being the bot's name. [20:50:46] https://docs.google.com/forms/d/e/1FAIpQLSedp6zF8Q7wFZ22OonFwxlIHjHF2v8vVBFU2TLgXEIL7rp6tA/viewform?hl=en&formkey=dG1FTWV1RnNBVHFOSnExMHF6aUhya2c6MA [20:50:52] Can we register them ^^ [20:51:13] shinken-wm and icinga-wm don't have a nickserv account. [20:51:36] WHo owns these bots [20:51:44] Who do i put as the bot owners [20:51:50] Bot operator's nickname (yours) * [20:51:56] paladox, don't request yet [20:52:05] Oh why? [20:52:22] Bot operator's nickname (yours) * [20:52:23] Bot's nickname * [20:53:12] bd808 we can register jouncebot by doing https://docs.google.com/forms/d/e/1FAIpQLSedp6zF8Q7wFZ22OonFwxlIHjHF2v8vVBFU2TLgXEIL7rp6tA/formResponse?hl=en [20:53:29] so labs-morebots, icinga-wm, stashbot and shinken-wm all need cloaks [20:53:35] Yep [20:53:41] plus all the other bots [20:53:49] labs-morebots and stashbot are good to go. [20:53:49] I am a logbot running on tools-exec-1213. [20:53:50] Messages are logged to wikitech.wikimedia.org/wiki/Server_Admin_Log. [20:53:50] To log a message, type !log . [20:53:58] They have nickserv accounts. [20:54:07] tom29739 ok who is the bot owner [20:54:12] who do i put as the owner [20:54:32] For stashbot it's bd808. [20:54:37] Oh bugger i really hope it dosent ask for a test on the next step [20:54:38] and ok [20:54:40] thanks [20:54:46] No, it doesn't. [20:55:00] Choose the first cloak type. [20:55:00] You're almost done! You must now sign on to IRC and type the following command:/msg MemoServ send wmfgc IRC cloak request [20:55:00] This will send us a confirmation memo so that we know that you are the one in control of your IRC account. [20:55:00] (Please note that if you're applying for a bot cloak, you need to do this from both the bot nick and the owner nick.) [20:55:04] Bugger ^^ [20:55:21] Bollocks. [20:55:27] LOL [20:55:28] which bots are you trying to get cloaked? [20:55:33] stashbot [20:55:38] and labs-morebots [20:56:03] I think those can be arranged [20:56:07] We need bd808 to come on and send the memo as the stashbot account for stashbot + his own account. [20:56:08] maybe check stashbot's with bd808 [20:56:15] Oh [20:56:54] * bd808 reads backscroll [20:56:57] AlexZ, can you help us with these cloaks? [20:57:54] I sent the memo for stashbot [20:58:00] Thanks [20:58:15] bd808 ok submitted now :) [20:58:17] thanks [20:58:32] Nows labs morebot turn [20:58:33] I'm honestly not sure what the cloak is good for. [20:58:43] I get cloaking a person [20:58:44] bd808, you'll need to send the memo from your own account too (unless you hadn't) [20:59:12] tom29739: I need to send a memo from bd808 for the stashbot cloak? [20:59:19] Makes it easier to ban/quiet/etc [20:59:34] bd808, "Please note that if you're applying for a bot cloak, you need to do this from both the bot nick and the owner nick." [20:59:35] does it need to mention stashbot shomehow? [20:59:42] k [20:59:46] who is the onwer of labs-morebots [20:59:47] ? [21:00:07] Labs ops I think. Not sure. [21:00:23] Ok, i am not sure if i can use labs op as a nick [21:00:24] yeah, its owned by a hive collective :) [21:00:29] Oh [21:00:40] Will labs ops work [21:00:58] paladox, nope [21:01:01] Ok [21:01:11] Umm, who develops it? [21:01:14] labs-morebots, [21:01:14] I am a logbot running on tools-exec-1213. [21:01:14] Messages are logged to wikitech.wikimedia.org/wiki/Server_Admin_Log. [21:01:14] To log a message, type !log . [21:01:24] tom29739: "us" [21:01:37] it's been here longer than I have [21:01:46] That's a pain. [21:01:48] Can i put you bd808 as the bot irc owner or someone else [21:01:49] "Author(s) Andrew Bogott, Antoine Musso, Kunal Mehta, Ori Livneh, Reedy, Ryan Lane" [21:01:51] please [21:01:58] you might be able to talk andrewbogott into claiming ownership :) [21:02:14] Yeah, he;s first in the list :D [21:02:17] Hmm, andrebogott hashar legoktm ori Reedy want me to put one of you as the owners of labs-morebots [21:02:24] its for a cloak for the bot [21:02:32] and is required for registering. [21:02:41] there are 4-5 accounts for that bot too [21:02:52] Oh [21:02:59] "* [labs-morebots] is logged in as labs-morebots" [21:03:03] that is tools.morebot isn't it ? [21:03:05] got it already [21:03:13] tools.morebots runs multiple irc bots [21:03:27] prod, labs, analytics, qa [21:03:28] shinken-wm needs to be done [21:03:36] It's the one that spams the channel. [21:03:39] shinken doesn't identify with nickserv [21:03:45] That's the trouble. [21:04:24] *shrug* I'm on the verge of having this channel removed from the shared block list anyway [21:04:27] the $-a bans are too much [21:05:11] bd808, this one? * #wikimedia-labs: $j:#wikimedia-bans on Sun Jul 17 20:13:54 2016 by hitchcock.freenode.net [21:05:24] yes [21:05:47] There don't seem to be any $-a bans [21:06:24] there have been 3 in the last 36 hours [21:07:20] So they carry to here? [21:07:21] That's annoying. [21:07:40] they did while we had +b $j:#wikimedia-bans [21:07:55] :) thanks Krenair [21:08:24] I thought Yuvi gave me +o here but when I try to use it I get denied [21:08:34] my irc fu is weak [21:08:42] I trust: .*@wikimedia/.* (2trusted), .*@mediawiki/.* (2trusted), .*@wikimedia/Ryan-lane (2admin), .*@wikipedia/.* (2trusted), .*@nightshade.toolserver.org (2trusted), .*@wikimedia/Krinkle (2admin), .*@[Ww]ikimedia/.* (2trusted), .*@wikipedia/Cyberpower678 (2admin), .*@wirenat2\.strw\.leidenuniv\.nl (2trusted), .*@unaffiliated/valhallasw (2trusted), .*@mediawiki/yuvipanda (2admin), .*@wikipedia/Coren (2admin), [21:08:42] @trusted [21:08:49] Oops. [21:09:06] bd808, /msg ChanServ op #wikimedia-labs [21:10:58] ah ha! [21:11:00] Stuff always seems to happen when no-one (no-one opped) is around. [21:11:05] thanks Krenair [21:12:15] Successfully added .*@wikimedia/BDavis-WMF [21:12:15] @trustadd .*@wikimedia/BDavis-WMF admin [21:12:34] bd808, you can use wm-bot ops commands now too. [21:12:44] what about me? [21:13:05] Successfully added .*@wikimedia/Krenair [21:13:05] @trustadd .*@wikimedia/Krenair admin [21:13:10] :P [21:13:11] tom29739 can you also add me please? [21:13:12] ty [21:13:19] Can't add too many. [21:13:25] I'll get shot by ops. [21:13:29] Ok [21:13:46] But could you add me, i would never use it, but just incase. [21:14:00] tom29739 ^^ [21:14:02] I don't have ops in here. [21:14:15] paladox, just ask the channel ops for that, not tom ;) [21:14:19] Ok [21:14:22] Like, proper ops. [21:14:24] paladox: don't beg for privs. It looks bad [21:14:28] Ok [21:14:39] So I really shouldn't be doing this anyway. [21:15:34] bd808: see the talk ~24h ago on -ops regarding the $-a bans [21:16:02] Don't they like channels having it or something? [21:17:20] yeah.. I'm gonna put that back unless they reinstate the $~a ban [21:21:13] * Platonides notices there have been a few op additions here recently [21:21:42] How'd you get the op list? [21:21:44] yeah we got a bunch of active people added [21:21:55] /msg ChanServ access #wikimedia-labs list [21:25:24] that's good [21:26:15] Krenair, could you add +t *!*@*wiki?edia* [21:26:28] hello, where was that really cool tool for kubernetes that showed usage of a tool over time? [21:26:29] That lets those people update the topic [21:27:04] musikanimal: this? https://grafana-labs-admin.wikimedia.org/dashboard/db/kubernetes-tool-combined-stats [21:28:00] madhuvishy: that's it! thank you [21:28:05] np :) [21:28:30] though it looks like for 'pageviews' the web responses graph isn't working [21:28:57] JS error? I see a little ! guy on the top left that says 'Cannot read property length of undefined' [21:29:10] woap, okay, it's showing now... [21:29:17] musikanimal: i think it just takes a second [21:29:24] gotcha [21:29:24] musikanimal: hit refresh once or twice [21:29:37] grafana and graphite don't always play nice [21:30:23] so for pageviews and xtools I've been using my own usage tracking system, built with an API in my Ruby app [21:30:34] https://tools.wmflabs.org/pageviews/meta [21:31:04] the only advantage (not shown in that /meta graph) is that I get per-project usage stats, so I can see which wikis are using the tool the most [21:31:39] maybe there's a way to pass custom parameters like that to Grafana? [21:32:41] because my Ruby app keeps dieing from all those requests coming in when someone loads pageviews or xtools, I guess it's running out of memory [21:32:47] in general you can collect whatever you want on graphite by sending stats via statsd usually [21:34:25] except... we have limited disk [21:34:32] so "whatever" has many limits [21:34:37] of course [21:36:15] musikanimal: how are you storing those metrics? [21:38:25] bd808: there's the "timeline" metrics which is overall page loads, equivalent to grafana, and then per-project usage which is recorded overall, like: [21:38:30] https://www.irccloud.com/pastebin/bSFbKdsp/ [21:39:00] so that just tells me which wikis are the most popular. It'd be really cool to see that over time, too [21:39:57] I'd like to have a nice analytics service for all tools to use, but there are a lot of more horrible problems to solve in front of it :) [21:40:17] I don't know what that would mean for storage, I have 6 different tools, and what, ~300 possible wikis [21:40:40] so 1800 time series files [21:40:46] yeah haha [21:40:47] bd808, if that logging stuff gets done then that could be used for stats [21:41:32] tom29739: well at least as a data feed, yes [21:41:54] elasticsearch is not a time series datastore [21:42:12] and the problem always becomes the granularity that people think they want/need [21:42:19] I don't really need to see it down to the minute, that's overkill for me. I actually tried zooming out in grafana and I'm not getting anything [21:43:03] that service got rebuilt not too long ago [21:43:08] so probably not much history [21:43:31] the oldest data I see is is Aug 31 at 9:03 [21:43:53] maybe it's just reallllly slow [21:44:31] at least some things have data back to 7/19 [21:44:43] musikanimal: even with changing datetime range? [21:44:54] yes, I've got it set to last 7 days [21:45:00] i see [21:45:08] last 6 hours works, no problem [21:45:13] actually I've got data to 7/15 for SAL [21:45:26] I can't get anything beyond last 6 hours for any tool [21:45:30] that was one of the early k8s services [21:46:14] oh, but not web data which is maybe what you are looking at [21:46:28] that seems to only be for the last day or so [21:47:19] yeah I'm looking at web responses [21:49:28] maybe it deletes the data after so long? [21:50:15] which looks like about 8 hours :/ [21:52:52] I guess I'm going to try to figure out why the Ruby app keeps dying. These are simple OPTIONS requests that writes to my Tool Labs db, nothing that would seem to consume a lot of memory [21:53:15] but I assume it's related to this metric system I added because ever since then the app dies maybe 2-3 days [21:53:24] *every 2-3 days [21:53:39] 06Labs: Use Designate for public/floating labs IPs - https://phabricator.wikimedia.org/T93088#2600099 (10AlexMonk-WMF) 05Open>03Resolved a:03AlexMonk-WMF I did the custom-code thing in T104521 [21:54:14] 06Labs, 10Tool-Labs: Rethink use of floating IPs in tools - https://phabricator.wikimedia.org/T141445#2600104 (10Krenair) [21:58:40] 06Labs, 10Horizon: Hostnames assigned to floating IP persist when deallocated - https://phabricator.wikimedia.org/T55816#2600112 (10AlexMonk-WMF) >>! In T55816#1751006, @Krenair wrote: > Is {T115194} an instance of this bug? Nope, those are fixed IPs. I think what this bug intends is: Say you have a project... [21:59:25] aude: The reason that you can't get past the "link phabricator account" step in striker is that your phab account doesn't match the SUL or LDAP accounts you are using in striker. [22:00:34] * bd808 needs to make a "skip/snooze" option for those golas [22:00:49] 06Labs, 10Labs-Infrastructure, 10Horizon: Disassociating floating IP does not make it available to other instances in the project - https://phabricator.wikimedia.org/T136872#2600119 (10AlexMonk-WMF) [22:03:24] paladox, I think https://phabricator.wikimedia.org/T136872 is the issue you had earlier [22:07:29] Krenair oh [22:07:30] thanks [22:08:21] 06Labs, 06Operations, 13Patch-For-Review, 07Tracking: overhaul labstore setup [tracking] - https://phabricator.wikimedia.org/T126083#2600186 (10madhuvishy) [22:08:23] 06Labs, 06Operations, 13Patch-For-Review: move nfs /scratch to labstore1003 - https://phabricator.wikimedia.org/T134896#2600184 (10madhuvishy) 05Open>03Resolved a:03madhuvishy [22:10:56] Krenair: we are trying to debug something with security groups [22:11:18] We are trying to remotly connect to mysql [22:11:20] we need a mysql connection from one instance to another [22:11:23] from one instance to another [22:11:25] in the same project [22:11:27] per mutante [22:11:40] on the source machine i see iptables rules [22:11:47] on the target instance i dont, empty [22:12:00] paladox tried to add a rule in horizon [22:12:12] but that is ingress [22:12:37] when i nmap it i get that it's still "filtered" [22:13:38] running tcpdum pon destination.. nothing [22:13:58] do we need a rule for outgoing as well? [22:14:38] and do it "manually" instead of using the existing template for mysql [22:14:39] ? [22:16:53] Krenair ^^ [22:17:21] which hosts are these? [22:18:14] Krenair: from: gerrit-test to :gerrit-mysql 3306/tcp [22:18:24] git project? [22:18:27] arg, i cant type, so much lage, let me try again [22:18:33] wrong host [22:18:39] gerrit-test3 to gerrit-mysql [22:18:42] from: gerrit-test3 [22:18:46] yes, git project [22:19:26] i wonder why the gerrit-mysql has no iptables rules at all [22:19:33] while the other instance does [22:19:53] and then i think "but if the destination has no rules, why not just work" [22:20:04] (03CR) 10EdouardHue: "> @Edouard, would you be able to make these changes ? :)" [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/303933 (https://phabricator.wikimedia.org/T142570) (owner: 10EdouardHue) [22:21:06] Krenair ^^ [22:23:56] hm [22:24:58] PORT STATE SERVICE [22:24:58] 3306/tcp filtered mysql [22:25:31] MAC Address: FA:16:3E:06:93:70 (Unknown) [22:27:37] Krenair ^^ [22:27:44] yeah I saw paladox [22:27:51] Sorry [22:31:31] Krenair would you know how we do remote mysql as in connecting to mysql from gerrit-test3 to gerrit-mysql please? [22:32:03] paladox, I'm looking into it for mutante [22:32:07] ok [22:32:11] Be patient. [22:32:14] thanks [22:32:20] and sorry [22:33:17] mutante, was gerrit-mysql set up manually instead of using puppet? [22:33:34] Yes [22:33:47] Krenair: yes, all steps are documented though [22:33:51] per https://phabricator.wikimedia.org/P3939 [22:34:18] paladox: you did not update the @localhost part yet, thought you did [22:34:29] Yep i did [22:34:39] but not in the docs [22:34:43] Oh wait [22:34:48] i forgot to do one bit [22:34:51] i did the other [22:34:52] anyways, it's not even able to connect, so it's not grant [22:34:55] but not that bit [22:34:56] sbut still update that [22:35:31] Krenair: the point is that the DB part is a separate instance, that makes the actual gerrit-instance 100% automatic, 0 manual steps [22:35:38] yes [22:35:40] Ok done updated paste [22:36:05] I installed base::firewall on gerrit-mysql [22:36:09] This created the iptables rules [22:36:42] Didn't fix the problem though [22:37:03] oh [22:37:25] gerrit-mysql has no floating IP so it's not that issue [22:37:57] Yep [22:38:07] Maybe we should vim /etc/mysql/my.cnf [22:38:13] and update the binding ip section [22:38:15] gerrit-test3 has a floating IP , or it had [22:38:16] to the gerrit-mysql ip [22:38:21] instead of localhost [22:38:24] does that maybe change it? [22:38:39] yep gerrit-test3 has a floating ip [22:38:42] paladox: we already did that earlier? [22:38:48] oh [22:38:54] that's what we did before we even asked Krenair, right [22:38:56] but that was on gerrit-mysql [22:39:02] im talking about gerrit-test3 [22:39:04] (first it was just listenign on 127.0.0.1) [22:39:12] for connecting to the other instance [22:39:46] wait, what [22:39:53] that is just a client [22:39:59] oh [22:40:03] why does it even have a server config in the first place [22:40:11] Note sure [22:41:20] there is no mysqld running there, and there shouldbe one [22:41:29] should not be one [22:41:44] we are just a client connecting elsewhere [22:42:00] oh [22:42:01] ok [22:43:42] mysql -h 10.68.23.211 [22:43:42] \ERROR 1130 (HY000): Host 'gerrit-mysql.git.eqiad.wmflabs' is not allowed to con| [22:43:45] nect to this MySQL server [22:43:57] well that is differnt and the GRANTS [22:44:05] to prrof the server is running though [22:44:33] that is when connecting from gerrit-mysql itself to its 10.68. IP [22:45:06] from gerrit-test3, same command.. times out [22:45:40] rebooting it [22:46:51] k [22:48:57] hm, nope [22:49:45] paladox: that rule you made in horizon, paste it again? [22:49:55] Ok [22:50:08] [22:50:08] Direction [22:50:08] Ether Type [22:50:08] IP Protocol [22:50:09] Port Range [22:50:09] Remote IP Prefix [22:50:10] paladox: and try adding the outgoing rule if that is possible? [22:50:11] Remote Security Group [22:50:13] Actions [22:50:15] Ingress - TCP 90 0.0.0.0/0 - Delete Rule [22:50:17] Ingress - TCP 3306 (MYSQL) 0.0.0.0/0 - [22:50:34] is that rule on gerrit-mysql now though? [22:50:42] Yes [22:50:53] 06Labs, 10DBA, 06Editing-Analysis, 05Security: Replicate editor_month table from analytics-store to Labs - https://phabricator.wikimedia.org/T143955#2584567 (10Bawolff) That has a column for deleted edits. Isnt the number of deleted edits somebody makes usually secret? [22:51:00] also i doint know how to add outgoing rule [22:51:11] mutante ^^ [22:51:24] is it possible it doesnt cover gerrit-test3 [22:51:29] because of the floating IP [22:51:48] paladox: there doesn't seem to be an option in Horizon for outgoing rules [22:51:50] Maybe not sure [22:51:53] Yep [22:52:02] But "Ingress" means ingress to the instance [22:52:11] oh [22:52:14] thanks [22:52:18] tom29739: and that applies on all instances in the project,right [22:52:32] So connections outside --> in [22:52:45] mutante: you need to add the instance to the security group [22:52:52] oh yep [22:52:57] paladox: which instance did you add to it? [22:53:01] the source or the destination [22:53:06] gerrit-test3 and gerrit-mysql [22:53:14] i have default and test [22:53:14] uhmm. ok [22:53:20] If it's inter-project then any connections should work [22:53:41] oh it seems to not be working for us [22:53:42] eh, even without the group? it's in the same project and does not work [22:53:54] So if instance 1 wants to connect to instance 2 and they're in the same project it should work [22:54:05] Without any security groups. [22:54:12] paladox: you just said the rule is on gerritmyusql [22:54:16] gerrit-mysql [22:54:28] but i cant confirm that [22:54:34] running iptables,, still empty [22:54:50] mutante: when you netcat the instance + port what happens? [22:54:51] Yep [22:54:54] even now after krenair added base::firewall [22:55:01] tom29739: timeout [22:55:01] it showing as it is added for [22:55:02] me [22:55:07] through the horozon interface [22:55:08] tom29739: "filtered" with nmap [22:55:13] mutante, I removed it again [22:55:16] Does it time out or refuse the connection? [22:55:18] and emptied iptables [22:55:19] Krenair: ok [22:55:23] tom29739: timeout [22:55:37] Security group problem most likely then. [22:55:41] what, that literally just changed [22:55:44] right now [22:55:52] who fixed it :) [22:56:02] If it refuses the connection then it's usually application based. [22:56:07] what? [22:56:21] it's not fixed [22:56:39] mutante: I heard about security groups taking time to update [22:56:41] then what did you jsut run paladox [22:57:34] I ran something since i wanted to see if that worked [22:57:37] but it didnt [22:57:42] it was for a different instance [22:57:51] ok, so it's not fixed [22:57:55] Yep [22:58:11] tom29739 yep your right, that is a bug [22:58:14] mutante ^^ [22:58:50] so i dont know then [22:58:53] we cant use it [22:58:59] Are you absolutely sure that the service is listening on 0.0.0.0 [22:59:11] krenair@gerrit-mysql:~$ sudo lsof -i :mysql [22:59:11] COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME [22:59:11] mysqld 1138 mysql 10u IPv4 16040 0t0 TCP gerrit-mysql.git.eqiad.wmflabs:mysql (LISTEN) [22:59:44] it's bound to 10.68.23.211:3306 [22:59:59] tcp 0 0 10.68.23.211:3306 0.0.0.0:* LISTEN 114 16040 1138/mysqld [23:00:24] root@gerrit-test3:/home/paladox# mysql -h 10.68.23.211 [23:00:30] ... timeout [23:00:36] Yep [23:00:36] mutante: and it works from localhost? [23:00:42] Yep [23:00:45] works on localhost [23:00:51] Weird. [23:00:52] yes [23:00:55] yes, from localhost itr refuses because GRANTs [23:01:11] which counts as "works" here [23:01:11] It's probably a security group issue then [23:01:32] yes [23:01:34] i guess leave it a day a re visit tomarror [23:01:46] yep, that is why instances creation was disabled [23:01:57] but you said it shoudl work _without_ security groups even [23:02:03] due to that bug, but they just made the timeout higher to prevent it from working [23:02:07] yep [23:02:07] so it's also an issue with the group ? [23:02:12] maybe [23:06:08] i dont know, so what you s "re-visit" [23:06:17] to confirm it's not just taking a ahile [23:06:19] while [23:06:37] i can barealy type with this lag i have anyways [23:07:18] 06Labs, 10DBA, 06Editing-Analysis, 05Security: Replicate editor_month table from analytics-store to Labs - https://phabricator.wikimedia.org/T143955#2600386 (10jcrespo) Privacy is a certainly a very complex issue; even replicating the number of watchers of a page required a non-trivial amount of filtering.... [23:07:29] tom29739: Krenair thanks for looking [23:07:38] paladox: if you kow a ticket.. let's link it to our ticket please [23:07:47] Ok [23:07:56] We will retry tomarror [23:07:59] ok [23:08:19] mutante https://phabricator.wikimedia.org/T142165 [23:08:21] now restarting that router args [23:08:28] Ok [23:08:29] lol [23:08:53] ok, can you link that as blocker or so [23:09:17] 06Labs, 10Labs-Infrastructure: Default source group (security group) allowances do not update properly - https://phabricator.wikimedia.org/T142165#2600394 (10Paladox) [23:09:22] mutante ^^ [23:09:44] thanks [23:10:23] Your welcome [23:16:21] mutante: I was going to suggest waiting a bit [23:16:59] Usually resolves itself after a little while, try again in 20 minutes - an hour [23:18:57] 06Labs, 10Labs-Infrastructure: Default source group (security group) allowances do not update properly - https://phabricator.wikimedia.org/T142165#2525709 (10Dzahn) [23:19:42] tom29739: ok, we will try later, thanks [23:20:21] np [23:21:16] modules/openstack/templates/liberty/nova/nova.conf.erb:allow_same_net_traffic=False [23:22:59] grumble grumble looks like I have zombie jobs not in grid [23:23:21] never mind seems to just be lag [23:27:38] Krenair: works for me (in project traffic) [23:28:39] Think it was yuvi or someone who said it would, I didn't think it did [23:28:52] But it seems to when I tried it, so :) [23:30:01] Okay [23:30:11] gerrit-test.git.eqiad.wmflabs can connect to it [23:30:30] So what is gerrit-test3's problem [23:30:32] Weird how it takes time to work. [23:32:09] I'll believe it just takes time to work when it miraculously starts working [23:32:46] I can't really debug, I don't have access to that project. [23:33:50] differences... [23:34:03] it has a slightly newer kernel (wmf3 instead of wmf2) [23:34:26] traceroute to the target fails on gerrit-test3, but is fine on gerrit-test [23:35:20] Ooh. Weird: Error: Unable to retrieve usage information [23:35:43] In Horizon. [23:35:55] Seems to work on the project I have project admin in. [23:35:55] doing what exactly? [23:36:10] Krenair: just the overview [23:36:11] trying to select a project you're a member of, but not an admin in? [23:36:15] Yeah. [23:36:24] yeah let's discuss that later :) [23:36:33] It used to work. [23:36:48] file a ticket [23:39:55] I can't look at the instances in that project either.. [23:40:02] But I can see the security groups. [23:40:34] And the domains and web proxies [23:40:37] Weird. [23:40:44] * tom29739 files a ticket [23:44:53] Krenair: is it the git project that you're having a problem with? [23:44:57] yes [23:45:14] Think I've found your problem. [23:45:17] https://wikitech.wikimedia.org/wiki/Nova_Resource:Git [23:45:25] It's like the instance doesn't exist. [23:46:24] Having said that, some of the instances in my project don't appear on there. [23:46:45] (on my project's page) [23:47:55] That's weird. [23:48:20] That's also a whole different story [23:48:23] For my project's page, there's 2 instances that I've deleted that are on there [23:49:09] * tom29739 backs off [23:49:23] When it doesn't work, leave alone and wait. [23:53:55] 06Labs: PUppet runs broken on some instances due to labstore::nfs_mount changes - https://phabricator.wikimedia.org/T144460#2600477 (10bd808) [23:57:14] PROBLEM - SSH on tools-exec-1219 is CRITICAL: Server answer