[00:33:27] 10Deployment-Systems, 3Scap3: scap3 should repack / pack-refs git repos under /srv/deployment - https://phabricator.wikimedia.org/T112509#1800144 (10thcipriani) [00:56:55] dr0ptp4kt: ish, been sick most of the day [00:58:41] greg-g: just wanted to check on meaning of week0 and week1 and wmf1 v wmf2. basically if stuff is merged before the tuesday morning cutoff in master, it will be on stable wikipedias by thursday, right? we can chat in person if there's more complexity than meets the eye. [00:59:41] dr0ptp4kt: this should answer all your questions: https://wikitech.wikimedia.org/wiki/Deployments/One_week [01:00:17] 10Beta-Cluster-Infrastructure, 6Collaboration-Team-Backlog, 10Flow, 3Collaboration-Team-Current: Beta Cluster Special:Contributions lags by a long time and notes slow Flow queries - https://phabricator.wikimedia.org/T78671#1800165 (10Catrope) I analyzed all possible query types for ContributionsQuery over... [01:00:22] greg-g: yeah, read that. mainly, i was wondering could the table just have one row instead of two? [01:00:44] yeah, it had two because we used to have a 2 week cycle :) [01:00:56] greg-g: :) [01:01:04] 10Beta-Cluster-Infrastructure, 6Collaboration-Team-Backlog, 10Flow, 3Collaboration-Team-Current: Beta Cluster Special:Contributions lags by a long time and notes slow Flow queries - https://phabricator.wikimedia.org/T78671#1800169 (10Catrope) >>! In T78671#1770170, @dduvall wrote: > Another instance of thi... [01:01:07] so just to be explicit I left it as two, but I can remove it now since we're probably mostly all used to the one week cycle now [01:01:49] greg-g: cool. just wanted to make sure i wasn't missing something! thanks! get well! [01:02:30] dr0ptp4kt: reload [01:02:52] greg-g: sweet [01:02:55] greg-g: 'night [01:03:11] :) g'night [01:10:16] 10Beta-Cluster-Infrastructure, 6Collaboration-Team-Backlog, 10Flow, 3Collaboration-Team-Current: Beta Cluster Special:Contributions lags by a long time and notes slow Flow queries - https://phabricator.wikimedia.org/T78671#1800176 (10Catrope) >>! In T78671#1800169, @Catrope wrote: >>>! In T78671#1770170, @... [01:17:50] PROBLEM - Host deployment-parsoidcache02 is DOWN: PING CRITICAL - Packet loss = 100% [01:39:03] Yippee, build fixed! [01:39:04] Project browsertests-Wikidata-SmokeTests-linux-firefox-sauce build #438: 09FIXED in 22 min: https://integration.wikimedia.org/ci/job/browsertests-Wikidata-SmokeTests-linux-firefox-sauce/438/ [03:13:49] Project beta-scap-eqiad build #78154: 04FAILURE in 9 min 2 sec: https://integration.wikimedia.org/ci/job/beta-scap-eqiad/78154/ [05:28:39] Project browsertests-MultimediaViewer-en.wikipedia.beta.wmflabs.org-windows_7-internet_explorer-11-sauce build #600: 04FAILURE in 26 min: https://integration.wikimedia.org/ci/job/browsertests-MultimediaViewer-en.wikipedia.beta.wmflabs.org-windows_7-internet_explorer-11-sauce/600/ [06:03:01] Yippee, build fixed! [06:03:02] Project beta-scap-eqiad build #78168: 09FIXED in 37 min: https://integration.wikimedia.org/ci/job/beta-scap-eqiad/78168/ [06:06:24] PROBLEM - Free space - all mounts on deployment-bastion is CRITICAL: CRITICAL: deployment-prep.deployment-bastion.diskspace._var.byte_percentfree (<11.11%) [06:18:31] (03PS1) 10Gilles: Configure thumbor/result-storage [integration/config] - 10https://gerrit.wikimedia.org/r/252638 [06:36:22] RECOVERY - Free space - all mounts on deployment-bastion is OK: OK: All targets OK [08:51:01] PROBLEM - puppet last run on scandium is CRITICAL: CRITICAL: puppet fail [09:06:03] RECOVERY - puppet last run on scandium is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [09:33:26] 10Beta-Cluster-Infrastructure, 6Collaboration-Team-Backlog, 10Flow, 3Collaboration-Team-Current, and 2 others: Beta Cluster Special:Contributions lags by a long time and notes slow Flow queries - https://phabricator.wikimedia.org/T78671#1800517 (10jcrespo) > This behavior is strange No it is not, you cann... [09:52:16] RECOVERY - Host deployment-parsoidcache02 is UP: PING OK - Packet loss = 0%, RTA = 0.71 ms [09:54:46] PROBLEM - Puppet staleness on integration-dev is CRITICAL: CRITICAL: 100.00% of data above the critical threshold [43200.0] [10:07:23] 10Beta-Cluster-Infrastructure, 10Flow, 3Collaboration-Team-Current, 5Patch-For-Review, 5WMF-deploy-2015-11-17_(1.27.0-wmf.7): Beta Cluster Special:Contributions lags by a long time and notes slow Flow queries - https://phabricator.wikimedia.org/T78671#1800539 (10Luke081515) [11:27:15] PROBLEM - Host deployment-parsoidcache02 is DOWN: CRITICAL - Host Unreachable (10.68.16.145) [11:29:05] 6Release-Engineering-Team, 7Ruby, 7Tracking: Fix easy problems reported by RuboCop - https://phabricator.wikimedia.org/T91485#1800677 (10zeljkofilipin) [11:29:40] 7Browser-Tests, 10MediaWiki-extensions-Translate, 5Patch-For-Review, 7Ruby, 5WMF-deploy-2015-11-10_(1.27.0-wmf.6): Update Translate mediawiki_selenium Ruby gem to version 1.x - https://phabricator.wikimedia.org/T117978#1800678 (10zeljkofilipin) 5Open>3Resolved [11:29:41] 6Release-Engineering-Team, 10Browser-Tests-Infrastructure, 7Ruby, 7Tracking: Update repositories that use mediawiki_selenium Ruby gem to version 1.x - https://phabricator.wikimedia.org/T94083#1800679 (10zeljkofilipin) [13:26:30] (03PS1) 10Zfilipin: Added Rakefile [ruby/api] - 10https://gerrit.wikimedia.org/r/252683 (https://phabricator.wikimedia.org/T117993) [13:51:29] 7Browser-Tests, 10Continuous-Integration-Config, 5Patch-For-Review, 7Ruby, 5WMF-deploy-2015-11-10_(1.27.0-wmf.6): Add Rakefile to repositories with Ruby code - https://phabricator.wikimedia.org/T117993#1800884 (10zeljkofilipin) [13:52:27] 7Browser-Tests, 10Continuous-Integration-Config, 5Patch-For-Review, 7Ruby, 5WMF-deploy-2015-11-10_(1.27.0-wmf.6): Add Rakefile to repositories with Ruby code - https://phabricator.wikimedia.org/T117993#1793788 (10zeljkofilipin) [13:54:10] hashar: this is done https://phabricator.wikimedia.org/T117993 [13:54:29] https://gerrit.wikimedia.org/r/#/q/status:open+topic:T117993,n,z [13:54:41] I will add rake-jessie job to the repos now [13:55:21] good good [13:55:29] https://phabricator.wikimedia.org/T114860 [13:55:42] don't apply it to ops-puppet for now though [13:55:57] hashar: no, I will create a separate patch for that [13:56:08] since we can not merge puppet ourselves [13:56:09] I don't think operations/puppet.git depends on Nodepool hosts at all [13:56:21] and that repo is rather critical to the infrastructure [13:56:29] so I don't want ops to be blocked because nodepool ends up having a trouble [13:56:32] (I am paranoid) [13:56:40] hashar: ok [13:56:49] I will create a separate patch [13:56:53] we can ignore it for now [14:19:42] (03PS1) 10Zfilipin: Added rake-jessie job to test and gate-and-submit pipelines for operations/puppet repository [integration/config] - 10https://gerrit.wikimedia.org/r/252689 (https://phabricator.wikimedia.org/T114860) [14:22:38] (03PS2) 10Zfilipin: Added rake-jessie job to test and gate-and-submit pipelines for operations/puppet repository [integration/config] - 10https://gerrit.wikimedia.org/r/252689 (https://phabricator.wikimedia.org/T114860) [14:36:21] (03PS2) 10Zfilipin: Added Rakefile [ruby/api] - 10https://gerrit.wikimedia.org/r/252683 (https://phabricator.wikimedia.org/T117993) [14:37:02] (03PS1) 10Zfilipin: Run Ruby jobs using Rake [integration/config] - 10https://gerrit.wikimedia.org/r/252690 (https://phabricator.wikimedia.org/T114860) [14:39:36] 7Browser-Tests, 10Continuous-Integration-Config, 5Patch-For-Review, 7Ruby: Delete ruby2.0lint job and only run bundle-rubocop job for repositories with Ruby code - https://phabricator.wikimedia.org/T114262#1800925 (10zeljkofilipin) [14:39:37] 10Continuous-Integration-Config, 5Patch-For-Review, 7Ruby: Move Bundler Jenkins jobs to Nodepool instances - https://phabricator.wikimedia.org/T114860#1800926 (10zeljkofilipin) [15:15:25] zeljkof: all those rubocop whitespaces changes should probably be made in a single commit :D [15:15:29] mass approving anyway [15:15:43] hashar: probably [15:15:48] but I am never sure [15:15:53] and I prefer smaller commits [15:16:02] and thanks :) [15:22:22] 10Gitblit-Deprecate, 10Diffusion: redirect gerrit repo paths to diffusion callsigns - https://phabricator.wikimedia.org/T110607#1800964 (10demon) I do! Gimme a minute to regenerate that list and I'll pastebin it here on Phab. [15:26:13] 10Gitblit-Deprecate, 10Diffusion: redirect gerrit repo paths to diffusion callsigns - https://phabricator.wikimedia.org/T110607#1800977 (10demon) This is everything as of a ~week ago: {P2306} I haven't gone and created any repos since then really so it should be most everything minus the ~30-50 repos that hav... [15:26:40] twentyafterfour: Ask and ye shall receive :) [15:29:13] (03PS1) 10Zfilipin: Added building the gem to `rake test` (CI entry point for Ruby) [selenium] - 10https://gerrit.wikimedia.org/r/252693 (https://phabricator.wikimedia.org/T117993) [15:29:18] ostriches: awesome! [15:29:29] I will try to put that to use before it gets more outdated ;) [15:29:51] It's already outdated :p [15:29:53] and I suppose we need a procedure to keep it updated until gerrit gets locked into read-only [15:30:05] I said _more_ outdated ;) [15:30:08] But it's like 95% of the repos and it's just adding more later, no changes :) [15:30:12] (03CR) 10Zfilipin: [C: 04-2] "fixing something" [ruby/api] - 10https://gerrit.wikimedia.org/r/252683 (https://phabricator.wikimedia.org/T117993) (owner: 10Zfilipin) [15:30:41] twentyafterfour: Yeah, I've been noodling the repo creation process a tad trying to help make sanity while we're in the migration period. [15:30:42] it's actually difficult to find/browse repos in phabricator, which is unfortunate [15:31:07] Yeah! Diffusion search kinda sucks bad. [15:31:09] I found a couple of duplicates, which I think were probably my fault. [15:31:44] no deterministic unique identifier other than the callsign is still an annoyance [15:32:44] (03PS1) 10Zfilipin: Added Rakefile [ruby/api] - 10https://gerrit.wikimedia.org/r/252695 (https://phabricator.wikimedia.org/T117993) [15:33:51] (03Abandoned) 10Zfilipin: Added Rakefile [ruby/api] - 10https://gerrit.wikimedia.org/r/252695 (https://phabricator.wikimedia.org/T117993) (owner: 10Zfilipin) [15:34:20] (03CR) 10Zfilipin: [C: 04-1] Added Rakefile [ruby/api] - 10https://gerrit.wikimedia.org/r/252683 (https://phabricator.wikimedia.org/T117993) (owner: 10Zfilipin) [15:35:29] (03CR) 10Zfilipin: Added Rakefile [ruby/api] - 10https://gerrit.wikimedia.org/r/252683 (https://phabricator.wikimedia.org/T117993) (owner: 10Zfilipin) [15:38:19] (03PS1) 10Zfilipin: Added Rakefile [ruby/api] - 10https://gerrit.wikimedia.org/r/252698 (https://phabricator.wikimedia.org/T117993) [15:39:06] (03Abandoned) 10Zfilipin: Added Rakefile [ruby/api] - 10https://gerrit.wikimedia.org/r/252683 (https://phabricator.wikimedia.org/T117993) (owner: 10Zfilipin) [15:39:29] (03PS2) 10Zfilipin: Added Rakefile [ruby/api] - 10https://gerrit.wikimedia.org/r/252698 (https://phabricator.wikimedia.org/T117993) [15:55:19] hey, can I get a hand with a sync-common issue in #wikimedia-operations? [15:55:34] looking for my headphones :/ [16:00:51] found them [16:00:58] andrewbogott: paste the error I guess ? [16:01:05] most of us are entering a meeting though [16:01:30] hashar: brian is helping, thanks. [16:24:25] (03CR) 10Hashar: "So mw-tools-scap-tox-doc-publish got refreshed and does poll Phabricator. There is a command error which is unrelated to CI itself (scap " [integration/config] - 10https://gerrit.wikimedia.org/r/251442 (https://phabricator.wikimedia.org/T117770) (owner: 10Thcipriani) [16:31:42] 10Continuous-Integration-Config, 10Continuous-Integration-Infrastructure, 10Gerrit, 5Patch-For-Review, 7Technical-Debt: Disable Gerrit replication to production slaves - https://phabricator.wikimedia.org/T86661#1801181 (10demon) Gerrit is no longer replicating to gallium either. Only replication targets... [16:33:42] RECOVERY - Host deployment-parsoidcache02 is UP: PING OK - Packet loss = 0%, RTA = 0.72 ms [16:49:33] 7Browser-Tests, 5Patch-For-Review, 3Reading Web Sprint 60 - Boom Headshot!: Investigate QuickSurveys browser tests failures - https://phabricator.wikimedia.org/T113534#1801209 (10KLans_WMF) [16:51:32] greg-g: So… wmf.7 next week (17th); wmf.8 two weeks after next (1st) as we're skipping Thanksgiving week; wmf.9 thereafter (8th); no deploys for the weeks of 15th/22nd/29th/5th; wmf.10 on 12 January 2016. Sound right? [16:52:51] James_F: I asked for a deployment freeze on releng internal li st [16:53:01] and yeah a month or so of freeze sounds good [16:54:44] OK, I did https://www.mediawiki.org/w/index.php?title=MediaWiki_1.27/Roadmap&diff=1937345&oldid=1937187 – please revert/edit if wrong. [17:02:10] James_F: 5th as in Jan 5th? I didn't think about all hands yet, and that's right, we should skip that, sadly, but dangit, that makes it 3 weeks :/ [17:02:19] cc twentyafterfour ^ just fyi [17:02:22] greg-g: Four. [17:02:25] twentyafterfour: also if you have any other opinoins [17:02:39] er, 15th we have deploys [17:03:04] week of the 14th we'll do our last of the year train run [17:03:12] cool [17:03:24] I was already in the loop ;) [17:03:41] and week of dec 7th will also be a train run [17:04:05] I gotta run to PT real quick, be back in an hour, but I'll make it explicit on [[wikitech:deployments]] more than it already is [17:04:36] someone broke h2 in production.. [17:04:47] * thedj checks if it's en.wp only [17:05:19] quick survey [17:05:57] @media all and (min-width:768px){.ext-qs-loader-bar,.ext-quick-survey-panel{margin-left:1.4em;width:300px;clear:right;float:right}.infobox,.last-modified-bar,h2{clear:both}} [17:06:04] that's a problem. [17:13:10] thedj: quicksurvey was deployed as part of SWAT this morning for enwiki [17:15:49] greg-g: Yeah, but if 8th is a train run then 15th/22nd/29th/5th is four weeks. [17:18:27] thcipriani: yeah, sorry, i noticed i was in the wrong chan. moved to -ops and -mobile [17:21:52] 6Release-Engineering-Team, 3Scap3, 7Security-General: Scap should be aware of security patches - https://phabricator.wikimedia.org/T118477#1801277 (10demon) 3NEW [17:23:39] 6Release-Engineering-Team, 3Scap3, 7Security-General: Scap should apply security patches - https://phabricator.wikimedia.org/T118478#1801285 (10demon) 3NEW [17:24:05] twentyafterfour: From our discussion earlier ^ [17:25:00] (Lots of overlap, probably solved together, but I think it's actually 2 different things) [17:29:33] (03PS1) 10Zfilipin: Move bundle-rubocop job from experimental to test pipeline for operations/puppet [integration/config] - 10https://gerrit.wikimedia.org/r/252716 (https://phabricator.wikimedia.org/T110019) [17:30:45] (03PS3) 10Zfilipin: Added rake-jessie job to test and gate-and-submit pipelines for operations/puppet repository [integration/config] - 10https://gerrit.wikimedia.org/r/252689 (https://phabricator.wikimedia.org/T110019) [18:15:44] thcipriani, twentyafterfour: Pushed .arcconfig to scap-vagrant (since we...erm...use phab for it :p) [18:16:06] ostriches: good call :) [18:34:06] hey releng folks, can I get a review from somebody on https://gerrit.wikimedia.org/r/#/c/251007 ? monolog stuff [18:34:14] * ostriches pokes [18:34:37] Wrong link? [18:34:55] well yeah... [18:35:03] ostriches: https://gerrit.wikimedia.org/r/#/c/252359/2 [18:37:23] So this is a workaround for a broken monolog "fix" pending another fix? [18:37:39] PROBLEM - Host deployment-parsoidcache02 is DOWN: CRITICAL - Host Unreachable (10.68.16.145) [18:40:16] ostriches: yeah [18:40:50] right now we have a small number (<200) of requests a day that are blowing up because they have user-agents with latin9 encoded data [18:40:59] the new monolog pukes on that [18:41:09] the old one jsut silently dropped those logs [18:41:18] this patch brings that back [18:42:15] lgtm then [18:44:02] James_F: the 15th is another train [18:44:39] greg-g: Oh, OK. [18:44:45] :) [18:45:05] I just sat back down, I'll make [[wikitech:Deployments]] more explicit real quick [18:50:16] James_F: https://wikitech.wikimedia.org/wiki/Deployments#Upcoming [18:52:38] btw, grip strength test went from around 60 (somethings, not sure of the unit) to 80 or 90. yay! [18:53:03] greg-g: https://www.mediawiki.org/w/index.php?title=MediaWiki_1.27/Roadmap&diff=1937372&oldid=1937345 [18:54:05] James_F: thank you [18:55:07] greg-g: And I've created the appropriate milestone projects in Phabricator. [18:58:15] <3 [18:58:48] * James_F waits patiently for the train in order to archive wmf.5. [18:58:53] subbu: btw, parsoidcache02 is down in beta cluster, apparently ^^^ (at 18:37) [19:01:02] hmm .. [19:01:23] i am wondering who set up these caches. [19:03:28] Krenair thinks hashar changed it recently, but I dont see him around. [19:03:47] I am not familiar with the varnish setup .. so, little hard for me to poke at it. [19:06:02] bblack is marked away in ops as well. mobrovac gwicke either of you familiar with varnish setup in beta cluster (parsoidcache02 is down there). [19:07:41] subbu: my only guess would be something limit-related [19:07:45] did you try restarting it? [19:07:56] labs was down recently when the rate limit change was deployed [19:08:02] a restart fixed it [19:08:07] i am not sure i have the permissions for it. [19:08:26] if you are in the deployment-prep project, then you should have root on that box [19:08:38] https://phabricator.wikimedia.org/T118362 [19:08:46] this was the limit issue ^^ [19:13:38] ssastry@bastion-01:~$ ssh -A ssastry@deployment-parsoidcache02 [19:13:38] ssh: Could not resolve hostname deployment-parsoidcache02: Name or service not known [19:14:05] this requires someone in ops to take a look [19:14:37] I think we removed deployment-parsoidcache02 some time ago [19:15:06] heh, then why was shinken recently annoyed? stupid shinken ;) [19:15:53] https://wikitech.wikimedia.org/wiki/Release_Engineering/SAL#2015-10-28 [19:16:18] greg-g: Is getting a bunch of HTTP 429s for image requests in Beta Cluster a known thing? [19:16:34] 10Beta-Cluster-Infrastructure, 10Flow, 3Collaboration-Team-Current, 5Patch-For-Review, 5WMF-deploy-2015-11-17_(1.27.0-wmf.7): Beta Cluster Special:Contributions lags by a long time and notes slow Flow queries - https://phabricator.wikimedia.org/T78671#1801612 (10Catrope) I agree it should be a proper JOI... [19:19:26] James_F: I... dont' think so? [19:19:40] greg-g: OK, will re-purpose the task. [19:20:17] lock errors are back? [19:20:17] 19:17:42 IOError: Lock at '/mnt/jenkins-workspace/workspace/mediawiki-extensions-qunit/src/extensions/MobileApp/.git/refs/heads/wmf/1.23wmf14.lock' could not be obtained [19:21:17] 10Beta-Cluster-Infrastructure, 10VisualEditor: [Regression pre-wmf.7] Images for musical scores, formulæ, heiroglyphics, thumbnails are returning 429s in the Beta Cluster when using VE (and other times?) - https://phabricator.wikimedia.org/T118486#1801621 (10Jdforrester-WMF) [19:23:42] (03PS1) 10Paladox: [Reflect] Add Jenkins tests to extension [integration/config] - 10https://gerrit.wikimedia.org/r/252755 [19:25:28] jzerebecki: Could you review https://gerrit.wikimedia.org/r/#/c/252755/ please. [19:26:46] paladox: not right now [19:26:53] Ok. [19:32:16] RECOVERY - Host deployment-parsoidcache02 is UP: PING OK - Packet loss = 0%, RTA = 0.66 ms [20:22:45] PROBLEM - Puppet staleness on deployment-restbase01 is CRITICAL: CRITICAL: 100.00% of data above the critical threshold [43200.0] [20:26:01] 10Gitblit-Deprecate, 10Diffusion: redirect gerrit repo paths to diffusion callsigns - https://phabricator.wikimedia.org/T110607#1801832 (10Spage) >>! In T110607#1633472, @mmodell wrote: > @spage: upstream is working on path/to/repo support in diffusion. See [[ https://secure.phabricator.com/T4245 | upstream ta... [20:59:41] 10Beta-Cluster-Infrastructure, 10VisualEditor: [Regression pre-wmf.7] Images for musical scores, formulæ, heiroglyphics, thumbnails are returning 429s in the Beta Cluster when using VE (and other times?) - https://phabricator.wikimedia.org/T118486#1801946 (10Jdforrester-WMF) Errors are being thrown by `deploym... [21:19:23] twentyafterfour: I think I was rejecting an idea you didn't propose. Sorry. [21:19:31] twentyafterfour: How would you add that commit in the other branch? [21:19:44] Without bypassing Gerrit [21:19:49] or Jenkins [21:27:08] Krinkle: wouldn't I just merge it to master? [21:27:41] twentyafterfour: You mean by pushing a merge commit to gerrit as review? [21:28:09] a merge commit between the previously merged commit and the head of master [21:28:15] or something like that [21:28:38] I'm not sure how Gerrit behaves in that regard, whether it's going to complain that the commit isn't merged yet. [21:28:51] It'll probably require it to go in master first, btu that could work yeah [21:28:56] then we'd have the same SHA1 in both branche [21:29:30] But then youd also pull in teh rest of master [21:29:31] Krinkle: that was my goal yeah [21:29:32] I see what you mean [21:29:42] You'd have to do it the other way around [21:29:52] I think that's conceptually wrong though [21:30:04] since the wmf/ branch contains only commits from master, you can always safely merge it back to master, ideally [21:32:45] as you can see I'm still thinking it through, and I appreciate your perspective on it. I agree it doesn't seem conceptually perfect. I wish git had a cherry-pick that wouldn't conflict with it's self [21:40:26] Krinkle: there may be some totally different solution, but eventually gerrit (and the convenient cherry-pick button) is going away so I'm trying to come up with a better way to manage hotfixes (to streamline deployments and generally improve the workflow) [21:40:36] so this was the best thing I came up with so far [21:44:53] 10Deployment-Systems, 3Scap3: Write setup.py for scap - https://phabricator.wikimedia.org/T118504#1802112 (10dduvall) 3NEW a:3dduvall [21:52:38] PROBLEM - Host deployment-parsoidcache02 is DOWN: CRITICAL - Host Unreachable (10.68.16.145) [22:02:32] hashar: Is anything even pointing at parsoidcache02 anymore? [22:02:36] It keeps flapping [22:02:55] I think we migrated out of it to Jessie [22:02:57] qunit workspace on integration-slave-trusty-1012 is busted, I'm rm-rf'ing [22:02:59] if it is trusty I guess you can nuke it [22:03:07] https://integration.wikimedia.org/ci/job/mediawiki-extensions-qunit/20240/consoleFull [22:03:50] !log rm -rf mediawiki-extensions-qunit workspace on trusty-1012 [22:03:56] Logged the message at https://wikitech.wikimedia.org/wiki/Release_Engineering/SAL, Master [22:05:09] parsoidcache02 dne. [22:05:17] That's just shinken being dumb hmm [22:05:27] You deleted it on the 28th [22:21:16] marxarelli: did you send me a link to the doc about Selenium upgrade/update? :) [22:21:31] etonkovidova: https://doc.wikimedia.org/rubygems/mediawiki-selenium/ [22:21:56] marxarelli: thx a lot! Cause mooeypoo does not have it :))) [22:22:04] there was an email to the QA about it a while back, too. but the readme should contain everything you need [22:22:06] np! [22:22:15] QA *list* [22:24:56] ostriches: i have no idea how Shinken collects its data :/ [22:25:11] black magic. [22:25:25] yuvipanda would know [22:25:41] maybe it grabs that using a Semantic query and the host is still on wikitech [22:34:37] (03CR) 1020after4: [C: 032] add QuickSurveys extension to make-wmf-branch conf [tools/release] - 10https://gerrit.wikimedia.org/r/251140 (owner: 1020after4) [22:35:10] (03Merged) 10jenkins-bot: add QuickSurveys extension to make-wmf-branch conf [tools/release] - 10https://gerrit.wikimedia.org/r/251140 (owner: 1020after4) [22:36:09] Krinkle: you're right, gerrit does funky things with my merge: https://gerrit.wikimedia.org/r/#/c/252854/ [22:37:24] it though I think it's the 'new wmf branch' commit that causes trouble. [22:37:39] * twentyafterfour goes back to the drawing board. [22:38:49] twentyafterfour: Aye, I'm sorry. [22:39:05] Krinkle: well thanks for humoring me ;) [22:39:30] https://gerrit.wikimedia.org/r/#/c/252857/ [22:42:21] 10Beta-Cluster-Infrastructure, 10VisualEditor, 6operations, 7Varnish: [Regression pre-wmf.7] Images for musical scores, formulæ, heiroglyphics, thumbnails are returning 429s in the Beta Cluster when using VE (and other times?) - https://phabricator.wikimedia.org/T118486#1802367 (10hashar) The error 429 is... [22:44:27] 10Beta-Cluster-Infrastructure, 10VisualEditor, 6operations, 7Varnish: [Regression pre-wmf.7] Images for musical scores, formulæ, heiroglyphics, thumbnails are returning 429s in the Beta Cluster when using VE (and other times?) - https://phabricator.wikimedia.org/T118486#1802380 (10hashar) And `//etc/varnis... [23:02:55] hi! http://en.m.wikipedia.beta.wmflabs.org/ is down? [23:03:34] AndyRussG: Not exactly [23:03:44] Reedy: ? [23:03:51] Just wanted to test some mobile stuff [23:04:06] AndyRussG: it's a rate limiting issue in varnish that was rolled out to the production cluster [23:04:25] Hmm [23:04:26] http://en.wikipedia.beta.wmflabs.org/wiki/Main_Page [23:04:27] works fine [23:04:37] Reedy: but the mobile site is down ^ [23:04:42] Yeah [23:04:45] Well, no [23:04:47] it's not down [23:04:56] You just can't access it :( [23:05:15] https://en.wikipedia.org/wiki/Main_Page?useformat=mobile seems OK, hmmm [23:05:26] Is mobile on a different varnish? [23:05:28] ostriches: ? [23:05:39] Ya [23:05:43] yes [23:05:45] I wonder if that didn't get updated [23:05:47] * Reedy looks [23:06:01] it's cache-mobile instead of cache-text [23:06:16] AndyRussG: https://phabricator.wikimedia.org/T118362 is the original issue [23:06:19] cheers [23:06:45] Reedy: K thanks!! [23:06:51] * Reedy reopens [23:06:58] 10Beta-Cluster-Infrastructure, 6operations, 7Blocked-on-Operations, 5Patch-For-Review: Varnish rate limiting has broken beta - https://phabricator.wikimedia.org/T118362#1802450 (10Reedy) 5Resolved>3Open http://en.m.wikipedia.beta.wmflabs.org/ is broken [23:08:14] session bug [23:08:26] I restarted varnish and it didn't fix anything :/ [23:08:51] I think it saves it in some file [23:08:56] Reedy: aaarg silly me that link above wasn't beta [23:09:37] Reedy: however this does work: http://en.wikipedia.beta.wmflabs.org/wiki/Main_Page?useformat=mobile [23:09:39] it saves what? [23:09:59] 10Beta-Cluster-Infrastructure, 6operations, 7Blocked-on-Operations, 5Patch-For-Review: Varnish rate limiting has broken beta - https://phabricator.wikimedia.org/T118362#1802464 (10Jdforrester-WMF) [23:10:01] 10Beta-Cluster-Infrastructure, 10VisualEditor, 6operations, 7Varnish: [Regression pre-wmf.7] Images for musical scores, formulæ, heiroglyphics, thumbnails are returning 429s in the Beta Cluster when using VE (and other times?) - https://phabricator.wikimedia.org/T118486#1802463 (10Jdforrester-WMF) [23:11:57] 505 -» // no sync to disk, tmpfs, truncate data on reload/restart - simpler [23:11:57] 506 -» // to reason about, and our ratelimits aren't long-term enough for [23:11:57] 507 -» // persistence across daemon restarts to matter much. [23:11:57] 508 -» tbf.open("/run/vmod_tbf/tbf.db", "mode=600;dbname=tbf.bdb;trunc"); [23:12:23] hm [23:13:14] I'll just reboot the whole thing [23:13:29] Reedy, got it. [23:13:37] ? [23:13:43] I took a look through deployment-cache-text04:/root/.bash_history and found the command they used [23:13:47] Ran it on deployment-cache-mobile04 [23:14:01] what was it? [23:14:03] #1447207753 [23:14:03] service varnish restart [23:14:03] #1447207761 [23:14:03] service varnish-frontend restart [23:14:07] ah, yes [23:14:10] restart both [23:14:11] duhh [23:14:12] AndyRussG: fixed? [23:14:31] Reedy: yep!! [23:14:33] :) [23:14:42] 10Beta-Cluster-Infrastructure, 10VisualEditor, 6operations, 7Varnish: [Regression pre-wmf.7] Images for musical scores, formulæ, heiroglyphics, thumbnails are returning 429s in the Beta Cluster when using VE (and other times?) - https://phabricator.wikimedia.org/T118486#1802485 (10Krenair) [23:14:44] 10Beta-Cluster-Infrastructure, 6operations, 7Blocked-on-Operations, 5Patch-For-Review: Varnish rate limiting has broken beta - https://phabricator.wikimedia.org/T118362#1802483 (10Krenair) 5Open>3Resolved ``` I took a look through deployment-cache-text04:/root/.bash_history and found the comma... [23:14:58] hahah [23:15:15] I tried to do the exact same [23:16:27] I had tried the first one already, but I had to find the second [23:19:23] 10Beta-Cluster-Infrastructure, 10VisualEditor, 6operations, 7Varnish: [Regression pre-wmf.7] Images for musical scores, formulæ, heiroglyphics, thumbnails are returning 429s in the Beta Cluster when using VE (and other times?) - https://phabricator.wikimedia.org/T118486#1802493 (10Jdforrester-WMF) @Krenair... [23:22:14] 10Beta-Cluster-Infrastructure, 10VisualEditor, 6operations, 7Varnish: [Regression pre-wmf.7] Images for musical scores, formulæ, heiroglyphics, thumbnails are returning 429s in the Beta Cluster when using VE (and other times?) - https://phabricator.wikimedia.org/T118486#1802504 (10Krenair) 5Open>3Resolv... [23:22:40] Krenair: Thanks! [23:23:23] np [23:23:29] * Krenair tries to stop getting distracted [23:38:42] 10Beta-Cluster-Infrastructure, 10VisualEditor, 6operations, 7Varnish, 7Verified: [Regression pre-wmf.7] Images for musical scores, formulæ, heiroglyphics, thumbnails are returning 429s in the Beta Cluster when using VE (and other times?) - https://phabricator.wikimedia.org/T118486#1802581 (10Ryasmeen) [23:53:58] 10Beta-Cluster-Infrastructure, 10VisualEditor, 6operations, 7Varnish, 7Verified: [Regression pre-wmf.7] Images for musical scores, formulæ, heiroglyphics, thumbnails are returning 429s in the Beta Cluster when using VE (and other times?) - https://phabricator.wikimedia.org/T118486#1802607 (10greg) Thanks...