[00:51:22] Beta-Cluster, VisualEditor, VisualEditor-MediaWiki: On Beta Cluster, switching from VisualEditor to edit source mode intermittently loads the wikitext editor without any CSS - https://phabricator.wikimedia.org/T86624#1111775 (Krenair) a:Krenair>None
[02:12:17] PROBLEM - Free space - all mounts on deployment-bastion is CRITICAL: CRITICAL: deployment-prep.deployment-bastion.diskspace._var.byte_percentfree.value (<12.50%)
[02:35:49] Deployment-Systems: l10nupdate user can't access scap shared ssh key causing nightly l10nupdate sync process to fail - https://phabricator.wikimedia.org/T76061#1111915 (bd808) scap and sync-* make the ssh connection as the mwdeploy user. The l10nupdate ssh key is only good for itself and not this shared deplo...
[02:47:24] Deployment-Systems, Patch-For-Review: l10nupdate user can't access scap shared ssh key causing nightly l10nupdate sync process to fail - https://phabricator.wikimedia.org/T76061#1111938 (bd808) >>! In T76061#1111915, @bd808 wrote: > scap and sync-* make the ssh connection as the mwdeploy user. The l10nupda...
[03:14:36] Staging, Patch-For-Review: Create staging-db* (databases) - https://phabricator.wikimedia.org/T91545#1112018 (Springle) >>! In T91545#1109576, @yuvipanda wrote: > @springle How would this be handled for master, where there's nowhere to clone from? Masters and slaves are the same. Cloning a new master is...
[03:23:47] Staging, Patch-For-Review: Create staging-db* (databases) - https://phabricator.wikimedia.org/T91545#1112023 (Springle) >>! In T91545#1109522, @thcipriani wrote: > Next steps: make manual steps ↑ as painless as possible. It would be nice to make de/re-pooling not require a mediawiki-config commit. That's...
[03:29:27] Continuous-Integration, Jenkins: PHP fatal errors are not visible in jenkins output in Wikibase phpunit jobs - https://phabricator.wikimedia.org/T92397#1112036 (Legoktm) It appears to be limited to the mwext-Wikibase-client-tests job, the repo ones are fine.
[03:45:58] Yippee, build fixed!
[03:45:58] Project browsertests-MultimediaViewer-en.wikipedia.beta.wmflabs.org-windows_8.1-internet_explorer-11-sauce build #365: FIXED in 38 min: https://integration.wikimedia.org/ci/job/browsertests-MultimediaViewer-en.wikipedia.beta.wmflabs.org-windows_8.1-internet_explorer-11-sauce/365/
[03:46:05] greg-g: triggered a bunch of fatals ("Class 'GlobalUsage' not found") by running a maintenance script on the wrong wikis
[03:46:13] they are harmless, sorry for the noise
[03:56:28] Continuous-Integration, MediaWiki-Codesniffer, Possible-Tech-Projects, Google-Summer-of-Code-2015, Outreachy-Round-10: Improving static analysis tools for MediaWiki - https://phabricator.wikimedia.org/T89682#1112054 (devunt) I'm interested in this project and I'm willing to apply GSoC as a cand...
[04:05:28] tgr: :) thanks
[05:01:57] I may have actually fixed l10nupdate this time
[05:02:07] And I'm sort of stoked about that
[05:02:57] The last piece was to tell sync-dir to connect to the remote hosts as the l10nupdate user instead of as the mwdeploy user
[05:03:03] PROBLEM - Puppet failure on deployment-mediawiki01 is CRITICAL: CRITICAL: 71.43% of data above the critical threshold [0.0]
[05:06:45] bd808: !!!
[05:07:21] PROBLEM - Puppet failure on deployment-elastic08 is CRITICAL: CRITICAL: 55.56% of data above the critical threshold [0.0]
[05:08:00] YuviPanda: is that you ^ (the puppet failures)?
[05:09:16] PROBLEM - Puppet failure on deployment-upload is CRITICAL: CRITICAL: 62.50% of data above the critical threshold [0.0]
[05:09:21] Project browsertests-UploadWizard-commons.wikimedia.beta.wmflabs.org-linux-firefox-sauce build #534: FAILURE in 35 min: https://integration.wikimedia.org/ci/job/browsertests-UploadWizard-commons.wikimedia.beta.wmflabs.org-linux-firefox-sauce/534/
[05:09:35] yuvipanda: are those puppet failures you?
[05:09:43] yup yup
[05:09:46] transient
[05:09:46] kk
[05:09:48] PROBLEM - Puppet failure on deployment-redis01 is CRITICAL: CRITICAL: 66.67% of data above the critical threshold [0.0]
[05:09:55] wmf-insecte killed itself
[05:10:00] PROBLEM - Puppet failure on deployment-memc03 is CRITICAL: CRITICAL: 62.50% of data above the critical threshold [0.0]
[05:10:04] oh.
[05:10:10] PROBLEM - Puppet failure on deployment-elastic07 is CRITICAL: CRITICAL: 62.50% of data above the critical threshold [0.0]
[05:10:47] PROBLEM - Puppet failure on deployment-salt is CRITICAL: CRITICAL: 12.50% of data above the critical threshold [0.0]
[05:11:29] PROBLEM - Puppet failure on deployment-jobrunner01 is CRITICAL: CRITICAL: 57.14% of data above the critical threshold [0.0]
[05:11:41] PROBLEM - Puppet failure on deployment-fluoride is CRITICAL: CRITICAL: 77.78% of data above the critical threshold [0.0]
[05:11:45] PROBLEM - Puppet failure on deployment-videoscaler01 is CRITICAL: CRITICAL: 50.00% of data above the critical threshold [0.0]
[05:11:49] greg-g: basically, two patches were merged, and puppet hit these in the time in between (I had to rebase again because someone else merged something in the interim)
[05:11:51] is all good now
[05:12:50] Guest56166: gotcha
[05:15:48] RECOVERY - Puppet failure on deployment-salt is OK: OK: Less than 1.00% above the threshold [0.0]
[05:16:10] PROBLEM - Puppet failure on deployment-mediawiki02 is CRITICAL: CRITICAL: 85.71% of data above the critical threshold [0.0]
[05:21:29] PROBLEM - Puppet failure on deployment-rsync01 is CRITICAL: CRITICAL: 100.00% of data above the critical threshold [0.0]
[05:24:44] RECOVERY - Puppet failure on deployment-redis01 is OK: OK: Less than 1.00% above the threshold [0.0]
[05:25:32] greg-g: betalabs scap is going to break for a few mins as well
[05:26:18] *cough* beta cluster
[05:32:25] RECOVERY - Puppet failure on deployment-elastic08 is OK: OK: Less than 1.00% above the threshold [0.0]
[05:34:23] RECOVERY - Puppet failure on deployment-upload is OK: OK: Less than 1.00% above the threshold [0.0]
[05:35:23] RECOVERY - Puppet failure on deployment-memc03 is OK: OK: Less than 1.00% above the threshold [0.0]
[05:36:35] RECOVERY - Puppet failure on deployment-fluoride is OK: OK: Less than 1.00% above the threshold [0.0]
[05:38:07] RECOVERY - Puppet failure on deployment-mediawiki01 is OK: OK: Less than 1.00% above the threshold [0.0]
[05:40:10] RECOVERY - Puppet failure on deployment-elastic07 is OK: OK: Less than 1.00% above the threshold [0.0]
[05:41:10] RECOVERY - Puppet failure on deployment-mediawiki02 is OK: OK: Less than 1.00% above the threshold [0.0]
[06:06:45] PROBLEM - Puppet failure on deployment-salt is CRITICAL: CRITICAL: 50.00% of data above the critical threshold [0.0]
[06:10:43] PROBLEM - Puppet failure on deployment-sca01 is CRITICAL: CRITICAL: 62.50% of data above the critical threshold [0.0]
[06:17:32] PROBLEM - Puppet failure on deployment-sentry2 is CRITICAL: CRITICAL: 55.56% of data above the critical threshold [0.0]
[06:17:34] PROBLEM - Puppet failure on deployment-fluoride is CRITICAL: CRITICAL: 25.00% of data above the critical threshold [0.0]
[06:19:04] PROBLEM - Puppet failure on deployment-apertium01 is CRITICAL: CRITICAL: 12.50% of data above the critical threshold [0.0]
[06:19:04] PROBLEM - Puppet failure on deployment-mediawiki01 is CRITICAL: CRITICAL: 42.86% of data above the critical threshold [0.0]
[06:20:12] PROBLEM - Puppet failure on deployment-pdf02 is CRITICAL: CRITICAL: 75.00% of data above the critical threshold [0.0]
[06:21:35] PROBLEM - Puppet failure on deployment-restbase01 is CRITICAL: CRITICAL: 12.50% of data above the critical threshold [0.0]
[06:22:41] PROBLEM - Puppet failure on deployment-bastion is CRITICAL: CRITICAL: 37.50% of data above the critical threshold [0.0]
[06:25:09] PROBLEM - Puppet failure on deployment-parsoid05 is CRITICAL: CRITICAL: 42.86% of data above the critical threshold [0.0]
[06:25:15] PROBLEM - Puppet failure on deployment-upload is CRITICAL: CRITICAL: 12.50% of data above the critical threshold [0.0]
[06:25:15] PROBLEM - Puppet failure on deployment-memc04 is CRITICAL: CRITICAL: 75.00% of data above the critical threshold [0.0]
[06:25:45] PROBLEM - Puppet failure on deployment-stream is CRITICAL: CRITICAL: 44.44% of data above the critical threshold [0.0]
[06:25:59] PROBLEM - Puppet failure on deployment-memc03 is CRITICAL: CRITICAL: 66.67% of data above the critical threshold [0.0]
[06:26:41] PROBLEM - Puppet failure on deployment-logstash1 is CRITICAL: CRITICAL: 25.00% of data above the critical threshold [0.0]
[06:27:59] PROBLEM - Puppet failure on deployment-elastic05 is CRITICAL: CRITICAL: 55.56% of data above the critical threshold [0.0]
[06:28:09] PROBLEM - Puppet failure on deployment-db1 is CRITICAL: CRITICAL: 55.56% of data above the critical threshold [0.0]
[06:30:18] PROBLEM - Puppet failure on deployment-parsoidcache02 is CRITICAL: CRITICAL: 62.50% of data above the critical threshold [0.0]
[06:31:12] PROBLEM - Puppet failure on deployment-elastic07 is CRITICAL: CRITICAL: 62.50% of data above the critical threshold [0.0]
[06:31:18] PROBLEM - Puppet failure on deployment-db2 is CRITICAL: CRITICAL: 85.71% of data above the critical threshold [0.0]
[06:31:24] PROBLEM - Puppet failure on deployment-mediawiki03 is CRITICAL: CRITICAL: 62.50% of data above the critical threshold [0.0]
[06:31:26] PROBLEM - Puppet failure on deployment-redis02 is CRITICAL: CRITICAL: 37.50% of data above the critical threshold [0.0]
[06:32:58] PROBLEM - Puppet failure on deployment-cxserver03 is CRITICAL: CRITICAL: 75.00% of data above the critical threshold [0.0]
[06:33:06] PROBLEM - Puppet failure on deployment-parsoid01-test is CRITICAL: CRITICAL: 42.86% of data above the critical threshold [0.0]
[06:33:24] PROBLEM - Puppet failure on deployment-elastic08 is CRITICAL: CRITICAL: 33.33% of data above the critical threshold [0.0]
[06:33:50] PROBLEM - Puppet failure on deployment-elastic06 is CRITICAL: CRITICAL: 55.56% of data above the critical threshold [0.0]
[06:35:46] PROBLEM - Puppet failure on deployment-redis01 is CRITICAL: CRITICAL: 14.29% of data above the critical threshold [0.0]
[06:37:14] RECOVERY - Free space - all mounts on deployment-bastion is OK: OK: All targets OK
[07:07:24] PROBLEM - Puppet staleness on deployment-test is CRITICAL: CRITICAL: 33.33% of data above the critical threshold [43200.0]
[07:11:23] !log rebooted puppetmaster, was dead
[07:11:29] Logged the message, Master
[07:11:51] !log scap still failing on beta, I'll check when I'm back from lunch
[07:11:55] Logged the message, Master
[07:19:05] RECOVERY - Puppet failure on deployment-mediawiki01 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:20:13] RECOVERY - Puppet failure on deployment-parsoid05 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:20:15] RECOVERY - Puppet failure on deployment-memc04 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:20:47] RECOVERY - Puppet failure on deployment-stream is OK: OK: Less than 1.00% above the threshold [0.0]
[07:22:40] RECOVERY - Puppet failure on deployment-bastion is OK: OK: Less than 1.00% above the threshold [0.0]
[07:22:50] RECOVERY - Puppet failure on deployment-elastic05 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:23:04] RECOVERY - Puppet failure on deployment-db1 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:26:20] RECOVERY - Puppet failure on deployment-mediawiki03 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:26:20] RECOVERY - Puppet failure on deployment-db2 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:26:36] RECOVERY - Puppet failure on deployment-restbase01 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:26:46] RECOVERY - Puppet failure on deployment-logstash1 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:26:48] RECOVERY - Puppet failure on deployment-salt is OK: OK: Less than 1.00% above the threshold [0.0]
[07:27:56] RECOVERY - Puppet failure on deployment-cxserver03 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:28:05] RECOVERY - Puppet failure on deployment-parsoid01-test is OK: OK: Less than 1.00% above the threshold [0.0]
[07:28:49] RECOVERY - Puppet failure on deployment-elastic06 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:30:17] RECOVERY - Puppet failure on deployment-parsoidcache02 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:31:28] RECOVERY - Puppet failure on deployment-redis02 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:33:31] RECOVERY - Puppet failure on deployment-elastic08 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:35:15] RECOVERY - Puppet failure on deployment-upload is OK: OK: Less than 1.00% above the threshold [0.0]
[07:36:08] RECOVERY - Puppet failure on deployment-memc03 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:36:18] RECOVERY - Puppet failure on deployment-elastic07 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:37:36] RECOVERY - Puppet failure on deployment-sentry2 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:37:36] RECOVERY - Puppet failure on deployment-fluoride is OK: OK: Less than 1.00% above the threshold [0.0]
[07:39:08] RECOVERY - Puppet failure on deployment-apertium01 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:40:16] RECOVERY - Puppet failure on deployment-pdf02 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:40:50] RECOVERY - Puppet failure on deployment-redis01 is OK: OK: Less than 1.00% above the threshold [0.0]
[07:40:50] RECOVERY - Puppet failure on deployment-sca01 is OK: OK: Less than 1.00% above the threshold [0.0]
[08:25:04] PROBLEM - Puppet failure on deployment-mediawiki01 is CRITICAL: CRITICAL: 14.29% of data above the critical threshold [0.0]
[08:40:08] RECOVERY - Puppet failure on deployment-mediawiki01 is OK: OK: Less than 1.00% above the threshold [0.0]
[08:42:04] This is Manoj. Working in Bangalore, India as a Principal QA Engineer for an SF-based ecommerce company
[08:56:35] Continuous-Integration, Flow: Jenkins reports test failures in current master: Cannot override frozen service "storage" - https://phabricator.wikimedia.org/T91951#1112315 (hashar)
[09:01:18] Continuous-Integration: Remove integration/kss.git - https://phabricator.wikimedia.org/T92482#1112322 (hashar) NEW
[09:04:06] Continuous-Integration, Wikimedia-Git-or-Gerrit: Gerrit: delete integration/kss.git repo - https://phabricator.wikimedia.org/T92483#1112334 (hashar) NEW
[09:09:32] PROBLEM - Puppet failure on deployment-bastion is CRITICAL: CRITICAL: 50.00% of data above the critical threshold [0.0]
[09:18:37] Beta-Cluster: Diamond logstash monitor fills /var/log/apache2 access log - https://phabricator.wikimedia.org/T74175#1112398 (hashar) Resolved>Open Reopening, that is still happening. The monitoring is using the vhost `logstash.beta.wmflabs.org` thus the spam ends up in `/var/log/apache2/other_vhosts_ac...
[09:21:00] aharoni: I am available in our usual time today, in 40 minutes
[09:21:12] good thank you
[09:21:20] aharoni: can you pair then?
[09:21:52] yes
[09:23:13] aharoni: great, see you then
[09:24:39] RECOVERY - Puppet failure on deployment-bastion is OK: OK: Less than 1.00% above the threshold [0.0]
[10:17:55] Continuous-Integration, Wikimedia-Git-or-Gerrit: Gerrit: delete integration/kss.git repo - https://phabricator.wikimedia.org/T92483#1112488 (QChris) I deleted the repo from gerrit itself. But since I am lacking access to the replication targets, @demon can you remove it from github & co?
[10:31:48] PROBLEM - Puppet failure on deployment-sca01 is CRITICAL: CRITICAL: 50.00% of data above the critical threshold [0.0]
[10:41:09] PROBLEM - Puppet failure on deployment-parsoid05 is CRITICAL: CRITICAL: 100.00% of data above the critical threshold [0.0]
[10:43:57] PROBLEM - Puppet failure on deployment-cxserver03 is CRITICAL: CRITICAL: 75.00% of data above the critical threshold [0.0]
[10:55:34] Continuous-Integration, Wikidata, § Wikidata-Sprint-2015-02-03, § Wikidata-Sprint-2015-02-25, § Wikidata-Sprint-2015-03-11: fix the qunit tests for wikidata: mwext-Wikibase-qunit - https://phabricator.wikimedia.org/T74184#1112565 (adrianheine) Two of the last three qunit jobs failed with databas...
[11:01:17] (PS8) Hashar: Package python deps with dh-virtualenv [integration/zuul] (debian/precise-wikimedia) - https://gerrit.wikimedia.org/r/195272 (https://phabricator.wikimedia.org/T48552)
[11:13:55] Continuous-Integration, Technical-Debt, Tracking: All repositories should pass jshint test (tracking) - https://phabricator.wikimedia.org/T62619#1112590 (adrianheine)
[11:29:50] Deployment-Systems, Patch-For-Review: LocalisationUpdate broken since 2014-12-16 - https://phabricator.wikimedia.org/T85790#1112617 (Se4598) any update here? I suspect it is still not working.
[11:32:14] Continuous-Integration, Jenkins: PHP fatal errors are not visible in jenkins output in Wikibase phpunit jobs - https://phabricator.wikimedia.org/T92397#1112624 (daniel) @legoktm so maybe it's an issue with the php output buffer? I would expect php to discard any buffers before reporting a fatal, but who k...
[12:02:57] aharoni: we can not hear or see you
[12:04:10] :(
[12:04:20] I can see your photos, but I cannot hear anything.
[12:05:47] aharoni: we still can not hear you
[12:05:59] what nonsense :(
[12:06:15] I tried from another device.
[12:08:02] PROBLEM - Long lived cherry-picks on puppetmaster on deployment-salt is CRITICAL: CRITICAL: 100.00% of data above the critical threshold [0.0]
[12:08:15] zeljkof: grrrrrrrrrrrr
[12:08:34] I tried three devices and you cannot hear me in any of them.
[12:09:23] aharoni: could you please join our hangout from earlier today?
[12:09:44] I will join you there, to see if this is because lu has created the hangout*
[12:19:33] (PS9) Hashar: Package python deps with dh-virtualenv [integration/zuul] (debian/precise-wikimedia) - https://gerrit.wikimedia.org/r/195272 (https://phabricator.wikimedia.org/T48552)
[12:31:00] I’m back
[12:31:02] let’s fix thisss
[12:31:06] this -> scap
[12:39:45] Project beta-scap-eqiad build #44925: FAILURE in 33 sec: https://integration.wikimedia.org/ci/job/beta-scap-eqiad/44925/
[13:09:24] Continuous-Integration, Wikidata, § Wikidata-Sprint-2015-02-25, § Wikidata-Sprint-2015-03-11: fix the qunit tests for wikidata: mwext-Wikibase-qunit - https://phabricator.wikimedia.org/T74184#1112805 (JanZerebecki) How do you know they failed because of the database locked errors? They too show the...
[13:37:07] Continuous-Integration, Wikimedia-Git-or-Gerrit: Gerrit: delete integration/kss.git repo - https://phabricator.wikimedia.org/T92483#1112885 (demon) Deleted from github and gitblit. I don't have rights to delete here on Phab so @chasemp can you `./bin/remove destroy rCIKS` for us?
[13:41:26] RECOVERY - Puppet failure on deployment-rsync01 is OK: OK: Less than 1.00% above the threshold [0.0]
[13:41:31] Deployment-Systems, Patch-For-Review: LocalisationUpdate broken since 2014-12-16 - https://phabricator.wikimedia.org/T85790#1112901 (Nikerabbit) I have seen patches in Gerrit trying to work out the permissions issues. Someone closer to those is needed to give a more detailed status update.
[13:46:26] RECOVERY - Puppet failure on deployment-jobrunner01 is OK: OK: Less than 1.00% above the threshold [0.0]
[13:56:46] RECOVERY - Puppet failure on deployment-videoscaler01 is OK: OK: Less than 1.00% above the threshold [0.0]
[14:50:50] Project browsertests-Wikidata-SmokeTests-linux-firefox-sauce build #184: FAILURE in 33 min: https://integration.wikimedia.org/ci/job/browsertests-Wikidata-SmokeTests-linux-firefox-sauce/184/
[14:53:51] thcipriani: hi
[14:54:00] howdy
[14:54:06] thcipriani: so that patch has been beaten into submission, and is live on deployment-prep now :)
[14:54:12] and scap works...
[14:54:30] and mwdeploy is no longer reliant on NFS for keys :D
[14:54:32] wait do you mean on staging?
[14:55:56] thcipriani: no deployment-prep
[14:56:12] thcipriani: now I need to merge it and make sure it doesn’t break anything on prod...
[14:56:16] and then we get back to staging
[14:56:25] so now there are 3 environs to worry about, deployment-prep, staging and prod...
[14:56:56] Deployment-Systems, Release-Engineering: Clean up erroneously created wmf/1.20wmf21 branches - https://phabricator.wikimedia.org/T92501#1113097 (Krinkle) NEW a:mmodell
[14:57:23] Deployment-Systems, Release-Engineering: Clean up erroneously created wmf/1.20wmf21 branches - https://phabricator.wikimedia.org/T92501#1113105 (Krinkle)
[14:57:26] YuviPanda: gotcha. Should I be testing patches on deployment-prep before moving over to staging?
[15:01:21] thcipriani: hmm, probably. or at least see if it breaks...
[15:01:39] thcipriani: another difference is that staging also requires the first time setup to be puppetized
[15:01:42] deployment-prep doesn't
[15:02:57] thcipriani: besides, problems deployment-prep faces now would also be problems staging faces soon enough...
[15:04:09] YuviPanda: sure. Been trying to be mindful about not actually "changing" anything. deployment-prep would be a good test. I'm not sure about the process for patching things there—is there anyone I should ping before making changes there?
[15:04:31] thcipriani: I usually !log here, and make sure to cherry-pick only as long as necessary...
[15:04:47] thcipriani: and if you’re anticipating major downtime, poking chrismcmahonbrb and greg-g would be good
[15:05:13] * thcipriani nods
[15:05:50] !log jenkins loves german again
[15:05:56] Logged the message, Master
[15:07:46] Deployment-Systems, Patch-For-Review: LocalisationUpdate broken since 2014-12-16 - https://phabricator.wikimedia.org/T85790#1113145 (bd808) We had a successful manual run in the early morning (UTC) of 2015-03-12. If the normal cron job is successful on 2015-03-13 we may be able to declare victory. Attempt...
[15:07:57] thcipriani: can you review https://gerrit.wikimedia.org/r/#/c/195340/?
[15:08:31] YuviPanda: yup, I'll take a look.
[15:15:12] (PS1) Krinkle: Remove MediaWiki REL1_21 and REL1_22 triggers [integration/config] - https://gerrit.wikimedia.org/r/196234
[15:19:09] Continuous-Integration, Wikidata, § Wikidata-Sprint-2015-02-25, § Wikidata-Sprint-2015-03-11: fix the qunit tests for wikidata: mwext-Wikibase-qunit - https://phabricator.wikimedia.org/T74184#1113158 (adrianheine) Sorry. It's way down in the mw-debug-www.log build artifact.
[15:21:17] thcipriani: cool. we have like.. a 6h window starting now, and I’m wondering if we should just do it.
[15:21:59] (CR) Krinkle: [C: 2] Remove MediaWiki REL1_21 and REL1_22 triggers [integration/config] - https://gerrit.wikimedia.org/r/196234 (owner: Krinkle)
[15:22:29] !log Fix Jenkins UI (was stuck in German)
[15:22:32] Logged the message, Master
[15:22:35] YuviPanda: everything is looking sane so far. Probably best to break stuff when you have more time to fix.
[15:22:58] thcipriani: 6h seems plenty...
[15:28:39] (Merged) jenkins-bot: Remove MediaWiki REL1_21 and REL1_22 triggers [integration/config] - https://gerrit.wikimedia.org/r/196234 (owner: Krinkle)
[15:29:37] (PS1) Krinkle: Don't run mediawiki-core-bundle-rubocop on <= REL1_23 [integration/config] - https://gerrit.wikimedia.org/r/196239
[15:29:47] (CR) Krinkle: [C: 2] Don't run mediawiki-core-bundle-rubocop on <= REL1_23 [integration/config] - https://gerrit.wikimedia.org/r/196239 (owner: Krinkle)
[15:31:01] (Merged) jenkins-bot: Don't run mediawiki-core-bundle-rubocop on <= REL1_23 [integration/config] - https://gerrit.wikimedia.org/r/196239 (owner: Krinkle)
[15:31:03] !log Reloading Zuul to deploy Ia289ebb0
[15:31:07] Logged the message, Master
[15:34:21] Deployment-Systems, Release-Engineering: Clean up erroneously created wmf/1.20wmf21 branches - https://phabricator.wikimedia.org/T92501#1113200 (greg) p:Triage>Normal
[15:35:08] YuviPanda: what has to be done to update all the beta scap targets?
[15:35:31] thcipriani: uncheck the role::beta::scap_target (or whatever that was) and add beta::deploymentaccess
[15:35:32] how manual is that going to be?
[15:35:35] thcipriani: I’ve already done that
[15:35:41] there were only 5 deployment targets
[15:35:41] ah, cool.
[15:41:38] Release-Engineering: Remove EOL MediaWiki release branches - https://phabricator.wikimedia.org/T92503#1113228 (Krinkle) NEW
[15:43:23] (CR) 20after4: [C: 2] move auth to an untracked file, auth.php [tools/release] - https://gerrit.wikimedia.org/r/195941 (owner: 20after4)
[15:43:50] (PS1) Krinkle: Update mediawiki-core jobs to use standard jsduck [integration/config] - https://gerrit.wikimedia.org/r/196245
[15:44:27] (PS2) 20after4: migrate-patch utility to retain security patches [tools/release] - https://gerrit.wikimedia.org/r/195942
[15:46:33] greg-g: so… looks like there’s a 5-hour window now, and I’d rather do it now than tomorrow. is that ok?
[15:46:39] it has a +1 from thcipriani :)
[15:47:18] YuviPanda: kk, I might have to mute wmf-insecte if the browser tests all start failing in the next couple hours (which is fine, unfortunately)
[15:47:54] greg-g: this will affect prod, more like, hence :)
[15:47:56] alright
[15:48:17] oh right, duh
[15:48:24] * greg-g hasn't finished first cup of coffee yet
[15:48:30] doit!
[15:50:05] (PS2) 20after4: Fix branched sub-submodule support [tools/release] - https://gerrit.wikimedia.org/r/195972
[15:51:17] (CR) 20after4: [C: 2] Fix branched sub-submodule support [tools/release] - https://gerrit.wikimedia.org/r/195972 (owner: 20after4)
[15:53:13] greg-g: DOING
[15:53:39] * thcipriani holds on to butts
[15:55:51] greg-g: thcipriani YAY IT WAS A NOOP IN PROD
[15:55:54] AS IT SHOULD HAVE BEEN
[15:55:58] :D :D
[15:56:19] YuviPanda: \o/ nice!
[15:57:32] I love those moments when you plan so much, worry so much, tremble and you hit enter, and then..... exactly as it should have gone.
[15:57:46] s/tremble and/tremble as/
[15:58:13] :D
[15:58:14] yeah
[15:58:45] greg-g: beta is also closer to prod in so many ways now.
[15:58:53] partly by moving beta, but also by moving prod
[15:58:55] so that’s nice
[16:02:16] :)
[16:09:10] Continuous-Integration, Quality-Assurance, Release-Engineering, Browser-Tests: browsertest jobs should not be allowed to run for 10 hours - https://phabricator.wikimedia.org/T92275#1113352 (zeljkofilipin) The three longest running [[ https://integration.wikimedia.org/ci/view/BrowserTests/view/-All/...
[16:10:42] Release-Engineering: Remove EOL MediaWiki release branches - https://phabricator.wikimedia.org/T92503#1113365 (greg) p:Triage>Normal
[16:13:59] (CR) Krinkle: [C: 2] "Deployed mediawiki-core-jsduck-publish and mediawiki-core-jsduck." [integration/config] - https://gerrit.wikimedia.org/r/196245 (owner: Krinkle)
[16:15:41] Continuous-Integration, Quality-Assurance, Release-Engineering, Browser-Tests: browsertest jobs should not be allowed to run for 10 hours - https://phabricator.wikimedia.org/T92275#1113395 (greg) >>! In T92275#1113352, @zeljkofilipin wrote: > I think it is safe to set the limit to 3 hours for all b...
[16:18:06] Continuous-Integration, Quality-Assurance, Release-Engineering, Browser-Tests: browsertest jobs should not be allowed to run for 10 hours - https://phabricator.wikimedia.org/T92275#1113403 (zeljkofilipin) >>! In T92275#1105133, @hashar wrote: > Can we look at the reason for the slowness? Maybe we a...
[16:18:23] Continuous-Integration, Quality-Assurance, Release-Engineering, Browser-Tests: browsertest jobs should not be allowed to run for 10 hours - https://phabricator.wikimedia.org/T92275#1113404 (zeljkofilipin) a:zeljkofilipin
[16:19:33] (Merged) jenkins-bot: Update mediawiki-core jobs to use standard jsduck [integration/config] - https://gerrit.wikimedia.org/r/196245 (owner: Krinkle)
[16:19:38] Continuous-Integration, Quality-Assurance, Release-Engineering, Browser-Tests: browsertest jobs should not be allowed to run for 10 hours - https://phabricator.wikimedia.org/T92275#1104513 (zeljkofilipin) >>! In T92275#1113395, @greg wrote: >>>! In T92275#1113352, @zeljkofilipin wrote: >> I think i...
[16:22:13] Release-Engineering, Browser-Tests: Investigate updating browser versions in Jenkins builds. - https://phabricator.wikimedia.org/T92005#1113423 (zeljkofilipin) Wait, I am not sure what is the problem here. The first job is a Firefox job, the second job is Chrome job. Firefox job failed maybe because Flow...
[16:43:41] Krinkle|detached: hashar where will I find the definitions for the beta scap job?
[16:43:50] it’s not in the jenkins-job-builder repo
[16:43:55] yes it is
[16:44:27] bd808: hmm, last commit on the repo seems to be from… October?
[16:44:31] YuviPanda: integration/config
[16:44:32] https://github.com/wikimedia/integration-config/blob/master/jjb/beta.yaml
[16:44:32] I think I’m looking at the wrong one
[16:44:33] gah
[16:44:34] right
[16:44:53] jjb is just our fork/mirror of the code
[16:45:00] https://github.com/wikimedia/integration-config/blob/master/jjb/beta.yaml#L210-L240
[16:45:19] but yeah that's not in the jjb repo any more
[16:45:23] bd808: btw, I merged the patch :D beta is using keyholder now, and there are no more keys in NFS for mwdeploy, and best of all, it was a no-op on prod :D
[16:45:34] perfect!
[16:45:51] thanks to ori for making that possible!
[16:45:59] bd808: yup! :D three days and lots of shaven yaks.
[16:46:16] if I start thanking ori for all the things he’s made possible I’ll be thanking ori all day :P
[16:46:22] same for you as well
[16:47:06] says the guy who has infrastructure named after him ;)
[16:58:23] bd808: vagrant you mean? :)
[16:58:34] heh
[17:03:57] Release-Engineering, Staging, releng-201415-Q3: Determine code update cycle/cadence for the staging cluster - https://phabricator.wikimedia.org/T91563#1113560 (mmodell) Why not just have staging-tin track mediawiki head and automatically run scap every time something changes?
[17:07:29] Release-Engineering, Staging, releng-201415-Q3: Determine code update cycle/cadence for the staging cluster - https://phabricator.wikimedia.org/T91563#1113567 (greg) >>! In T91563#1113560, @mmodell wrote: > Why not just have staging-tin track mediawiki head and automatically run scap every time somethi...
[17:08:48] Deployment-Systems, Patch-For-Review: l10nupdate user can't access scap shared ssh key causing nightly l10nupdate sync process to fail - https://phabricator.wikimedia.org/T76061#1113586 (mmodell) a:mmodell>bd808 looks like you have this one handled.
[17:13:31] uh oh
[17:13:53] bd808: scap question...
[17:14:15] bd808: in prod, scap doesn’t run *as* mwdeploy, right? on the deployment host, when someone invokes it? It only sshs to instances as mwdeploy...
[17:14:37] it runs as mwdeploy
[17:14:47] most of the time
[17:15:14] does things like "sudo -u mwdeploy rsync ..."
[17:15:31] oh on the deploy host....
[17:15:34] yeah not in prod
[17:15:52] it runs as whoever types "scap"
[17:16:18] anyone in the wikidev group
[17:16:57] but it runs as mwdeploy on the target hosts
[17:17:19] even when it connects over ssh as another user
[17:17:40] bd808: right, that was my thinking as well.
[17:18:06] so now things are a bit eugh on beta, since keyholder is accessible only to project-deployment-prep, and jenkins-bot isn’t project-deployment-prep...
[17:18:38] * YuviPanda thinks
[17:18:55] oooh
[17:18:58] jenkins-deploy?
[17:19:03] make the key available to another group?
[17:19:17] jenkins-deploy is a proper member...
[17:19:33] is that the user that kicks off scap from jenkins?
[17:19:40] *shrug*
[17:19:47] yeah, I’ll poke
[17:20:09] bd808: ideally, I’ll shave this yak too, although I’m mostly afraid of touching anything CI...
[17:20:14] so maybe not.
[17:20:19] thanks though :)
[17:20:35] Continuous-Integration, Wikidata, Jenkins: PHP fatal errors are not visible in jenkins output in mwext-Wikibase-client-tests job - https://phabricator.wikimedia.org/T92397#1113674 (Legoktm)
[17:23:09] Continuous-Integration, Wikidata, Jenkins: PHP fatal errors are not visible in jenkins output in mwext-Wikibase-client-tests job - https://phabricator.wikimedia.org/T92397#1113684 (Legoktm) Hmm, is something in client messing with the output buffer? I grepped for "ob_" but didn't see anything client-r...
[17:25:36] hashar: around?
[17:25:49] hashar: do you know what user jenkins uses to ssh to deployment-bastion?
[17:26:14] 10Continuous-Integration, 6Scrum-of-Scrums, 7Blocked-on-MediaWiki-Core: MediaWiki installs in Jenkins frequently fail to access their sqlite database due to locks - https://phabricator.wikimedia.org/T89180#1113698 (10Krinkle) 5Resolved>3Open https://integration.wikimedia.org/ci/job/mwext-VisualEditor-qun... [17:26:14] 10Continuous-Integration, 10VisualEditor, 7Tracking, 3VisualEditor 2014/15 Q3 blockers: Browser disconnects when running QUnit tests with local browsers (tracking) - https://phabricator.wikimedia.org/T89075#1113700 (10Krinkle) [17:27:29] 10Continuous-Integration, 6Scrum-of-Scrums, 7Blocked-on-MediaWiki-Core: MediaWiki installs in Jenkins frequently fail to access their sqlite database due to locks - https://phabricator.wikimedia.org/T89180#1113705 (10Krinkle) a:5Krinkle>3None [17:30:04] 10Continuous-Integration, 10Flow, 6Mobile-Web, 10VisualEditor, 10Wikimedia-Hackathon-2015: Create Jenkins builds for Editing across repositories (MobileFrontend, VisualEditor etc) - https://phabricator.wikimedia.org/T90647#1113707 (10Jdforrester-WMF) p:5Triage>3Normal [17:30:36] 6Release-Engineering, 10VisualEditor, 7Browser-Tests: Create VisualEditor tests targeting the older version of browsers (Chrome and Firefox for now) for better backward compatibility - https://phabricator.wikimedia.org/T90678#1113710 (10Jdforrester-WMF) p:5Triage>3High [17:30:45] 6Release-Engineering, 10VisualEditor, 7Browser-Tests: Selenium bug with Firefox causes VE test failure - https://phabricator.wikimedia.org/T90651#1113712 (10Jdforrester-WMF) p:5Triage>3High [17:31:24] (03PS1) 10Yuvipanda: Don't use separate ssh-agent for running scap [integration/config] - 10https://gerrit.wikimedia.org/r/196273 [17:31:56] legoktm: hashar ^ should unbreak scap on beta. help? [17:32:39] thcipriani: ^d I am considering just getting rid of deployment-rsync01... 
[17:32:48] 6Release-Engineering, 10VisualEditor, 7Browser-Tests: Selenium bug with Firefox causes VE test failure - https://phabricator.wikimedia.org/T90651#1113724 (10Cmcmahon) Note that I removed the "&" from the string with this commit: https://gerrit.wikimedia.org/r/#/c/195683 [17:33:04] <^d> YuviPanda: No objections here [17:33:13] twentyafterfour: ^ [17:34:18] looking [17:34:20] ? [17:35:24] twentyafterfour: getting rid of deployment-rscync01 [17:35:58] YuviPanda: https://integration.wikimedia.org/ci/view/Beta/job/beta-scap-eqiad/44941/console [17:36:16] YuviPanda: that's with your change, still failing [17:36:30] legoktm: yes, that’s rsync01, that I want to delete [17:36:36] legoktm: before that it was *all* failing [17:36:45] legoktm: let me find a bug to associate with it [17:36:52] actually fuck it :) [17:36:56] legoktm: but yeah, that output is desirable [17:36:59] ok [17:37:26] YuviPanda: why are you killing it? [17:37:36] thcipriani: why is it there? [17:37:42] thcipriani: in prod, we don’t have separate proxy servers [17:37:46] some mw servers are also proxy servers [17:37:59] (03CR) 10Legoktm: [C: 032] "Deployed and confirmed: https://integration.wikimedia.org/ci/view/Beta/job/beta-scap-eqiad/44941/console" [integration/config] - 10https://gerrit.wikimedia.org/r/196273 (owner: 10Yuvipanda) [17:38:06] so scap will fail here, because rsync01 doesn’t actually have any mediawiki in it [17:38:19] thcipriani: plus, it’s causing the job to fail ;D [17:38:43] by all means, kill it [17:38:46] YuviPanda: that sounds like justification enough for me. 
[17:38:54] +1 works for me [17:38:57] wheeeee [17:39:34] !log killll deployment-rsync01, wasn’t being used for anything discernable, and that’s not how proxies work in prod [17:39:38] Logged the message, Master [17:40:29] rsync01 was handy when I was figuring out how to bring scap into beta but I agree it doesn't help anything now [17:41:13] \o/ [17:41:14] cool [17:41:58] bd808: with that, I wonder if I can declare that prod and beta have same deployment setup going... [17:42:04] or if there are wast swathes of stuff I’m missing [17:43:00] the only thing I can think of is the hack in scap for beta to ignore host keys [17:43:23] bd808: aah, right. [17:43:23] that [17:43:27] (03Merged) 10jenkins-bot: Don't use separate ssh-agent for running scap [integration/config] - 10https://gerrit.wikimedia.org/r/196273 (owner: 10Yuvipanda) [17:44:51] bd808: I want to kill ./files/scripts/beta-apaches as well. I think it’s trivial enough to be salted if needed... [17:44:58] I can keep it if you think it’s useful... [17:45:10] * YuviPanda is trying to pare down the beta module to as lean as possible [17:45:45] I don't use it [17:45:57] it was an ask from hashar in the past [17:46:12] oh, hmm. [17:48:23] bd808: also [17:48:24] class role::beta::trebuchet_testing { [17:48:25] that? [17:50:59] YuviPanda: not sure. what hosts is it applied on? [17:52:40] bd808: hmm, none that I can find, actually [17:52:57] then kill with fire :) [17:54:34] deployment-db1 has it... [17:54:38] wat. [17:54:41] ... [17:54:53] why would deployment-db1 have… trebuchet_testing [17:55:32] deployment-mathoid and deployment-rsync01...I guess..._had_ it [17:55:49] deployment-mathoid _has_ it [17:56:03] it might be something ariel was playing with at some point [17:56:11] apergos: ^ [17:56:37] Yippee, build fixed! 
[17:56:37] Project beta-scap-eqiad build #44946: FIXED in 40 sec: https://integration.wikimedia.org/ci/job/beta-scap-eqiad/44946/ [17:56:50] legoktm: https://integration.wikimedia.org/ci/view/Beta/job/beta-scap-eqiad/44946/console scap success \o/ [17:57:05] I don’t think I’ve ever felt as good about a green before. [17:57:13] it's just for testing [17:57:13] YuviPanda: https://github.com/wikimedia/operations-puppet/commit/22857337f3f17433e8f04d518129fdae672b1bfa [17:57:19] nothing else, so please don't kill it [17:57:33] apergos: hmm, what is it testing? [17:57:37] trebuchet! [17:57:41] * YuviPanda is trying to move stuff off beta into places elsewhere. [17:57:54] hmm [17:58:03] well this is stuff I want in beta because that's the good subset to test trebuchet out after salt [17:58:30] I want a little cluster with a few different oses that looks mostly like prod etc, and it fits the bill [17:58:33] apergos: right, having it be in beta is completely fine. [17:58:37] ok [17:58:38] apergos: I just want to move the *code* [17:58:41] ahhh [17:58:44] :-D [17:58:54] like, all the scap code just got moved / merged with scap/ module [17:58:59] oh I see [17:59:00] YuviPanda: yay! [17:59:09] legoktm: it’s running scap the same was as prod now :) [18:00:05] I'm not coming up with any bright ideas [18:00:28] yeah, me neither. [18:00:40] PROBLEM - Puppet failure on deployment-bastion is CRITICAL: CRITICAL: 22.22% of data above the critical threshold [0.0] [18:02:15] uh [18:02:23] transient? [18:02:25] * YuviPanda looks [18:05:08] seems to be [18:05:31] RECOVERY - Puppet failure on deployment-bastion is OK: OK: Less than 1.00% above the threshold [0.0] [18:09:52] that’s better [18:10:14] thcipriani: so for staging-tin we still have to do a lot of ‘first time setup steps’ in puppet. like cloning the mediawiki repo, cloning mediawiki-config repo, etc [18:10:29] thcipriani: and there are going to be some manual steps (like setting up db password?) 
[18:10:36] not sure how to handle the manual steps [18:11:14] YuviPanda: for mariadb, right now, I've just got a bash script that is nowhere :\ so I'm not sure either [18:12:18] thcipriani: I suppose we can start a document of ‘how to build yourself a beta cluster’, which has links to these scripts (which can be put in puppet) [18:14:22] YuviPanda: that sounds good to me. [18:14:40] thcipriani: cool :) [18:24:20] Krinkle: are you fine with ori his request to remove $wgResourceLoaderMaxQueryLength and hardcode the default?: https://gerrit.wikimedia.org/r/#/c/193130/ [18:24:27] ^d, I think this causes the 5::duration bug: http://en.wikipedia.org/w/api.php?action=query&list=recentchanges&rcprop=loginfo&rctype=log&rcstart=20150312084618 [18:24:37] log_id = 63139902 [18:25:03] log_params is just indefinite\nnocreate [18:25:24] ApiQueryRecentChanges does not pass $legacy [18:25:49] 10Continuous-Integration, 6MediaWiki-Core-Team, 6operations: add a check for whitespace before leading so it defaults to assuming not legacy and tries to access 5::duration, when we actually want 0 [18:26:11] ApiQueryWatchlist has a similar issue [18:27:10] 10Continuous-Integration, 6MediaWiki-Core-Team, 6operations: add a check for whitespace before leading Krenair: Ah, I see it now [18:28:36] do we know why legacy logs are still being recorded though? [18:30:43] 10Deployment-Systems: scap's check_php_syntax() should check for text before ' 10Continuous-Integration, 6MediaWiki-Core-Team, 6operations: add a check for whitespace before leading >! In T92531#1113969, @Krinkle wrote: > See {T46875}. > > This and various other errors are already caught by phpcs. Please ensure proj... [18:32:49] 10Continuous-Integration, 6MediaWiki-Core-Team, 6operations: add a check for whitespace before leading >! In T92531#1113969, @Krinkle wrote: > See {T46875}. > > This and various other errors are already caught by phpcs. Please ensure project(... 
[18:33:02] RECOVERY - Long lived cherry-picks on puppetmaster on deployment-salt is OK: OK: Less than 100.00% above the threshold [0.0] [18:33:38] 10Continuous-Integration, 10Incident-20150312-whitespace, 6MediaWiki-Core-Team, 6operations: add a check for whitespace before leading 10Continuous-Integration, 10Incident-20150312-whitespace, 6MediaWiki-Core-Team, 6operations: add a check for whitespace before leading 3High [18:35:43] ^d [18:37:38] 10Continuous-Integration, 10Incident-20150312-whitespace, 6MediaWiki-Core-Team, 6operations: add a check for whitespace before leading >! In T92531#1114010, @ori wrote: >>>! In T92531#1113969, @Krinkle wrote: >> See {T46875}. >> >> This... [18:37:51] 10Deployment-Systems, 10Incident-20150312-whitespace: scap's check_php_syntax() should check for text before ' ^d: ooh, is checkuser still doing legacy block logging? [18:38:44] <^d> Krenair: No clue [18:38:50] <^d> dunno [18:39:01] I pulled up the last 5 block logs on enwiki without 5::duration in params [18:39:03] all checkuser blocks [18:40:57] Yep. [18:42:16] https://phabricator.wikimedia.org/diffusion/ECHU/browse/master/specials/SpecialCheckUser.php;df3079408a5b95c82dc0bb42dd083f0af1bd7dde$290-297 [18:42:22] that can't possibly be new logging [18:43:49] <^d> YuviPanda: ugh [18:43:50] <^d> Error: Could not retrieve catalog from remote server: Error 400 on SERVER: Duplicate declaration: File[/etc/ssh/userkeys] is already declared in file /etc/puppet/private/modules/passwords/manifests/init.pp:36; cannot redeclare at /etc/puppet/modules/ssh/manifests/server.pp:31 on node i-0000093e.eqiad.wmflabs [18:43:51] 10Deployment-Systems, 10Incident-20150312-whitespace: scap's check_php_syntax() should check for text before ' ^d: need to update private repo [18:44:14] pull I mean [18:44:19] this is staging, isn’t it? 
[18:44:20] let me do that [18:44:38] <^d> staging, yeah [18:47:56] Project browsertests-MultimediaViewer-en.wikipedia.beta.wmflabs.org-windows_8.1-internet_explorer-11-sauce build #366: FAILURE in 40 min: https://integration.wikimedia.org/ci/job/browsertests-MultimediaViewer-en.wikipedia.beta.wmflabs.org-windows_8.1-internet_explorer-11-sauce/366/ [18:50:40] 10Deployment-Systems, 10Incident-20150312-whitespace: scap's check_php_syntax() should check for text before '3High [18:51:23] <^d> Krenair: Would passing $logEntry->isLegacy() through at least help? [18:51:32] <^d> (easier to fix, quick to swat) [18:52:33] yeah I think that'd do the trick [18:53:08] <^d> Patch incoming [18:53:49] and also to ApiQueryWatchlist [18:58:27] <^d> Krenair: https://gerrit.wikimedia.org/r/#/c/196299/ [19:01:16] ^d: staging puppet master should work again, labs/private needed updated to match operations/puppet [19:09:22] Project browsertests-MultimediaViewer-en.wikipedia.beta.wmflabs.org-os_x_10.9-safari-sauce build #520: FAILURE in 59 min: https://integration.wikimedia.org/ci/job/browsertests-MultimediaViewer-en.wikipedia.beta.wmflabs.org-os_x_10.9-safari-sauce/520/ [19:12:08] (03PS1) 10Legoktm: check_php_syntax: Check for any content before opening (03CR) 10Legoktm: "Untested besides running "tox"" [tools/scap] - 10https://gerrit.wikimedia.org/r/196306 (https://phabricator.wikimedia.org/T92534) (owner: 10Legoktm) [19:23:04] (03CR) 10Legoktm: check_php_syntax: Check for any content before opening 6Release-Engineering, 6Engineering-Community, 6Team-Practices, 10Wikimedia-Hackathon-2015, 3ECT-March-2015: RelEng team offsite - May 2015 - Pre Wikimedia Hackathon - https://phabricator.wikimedia.org/T89036#1114307 (10Rfarrand) Finalizing contract with venue in Annecy - once I have a first draft I will... 
[19:25:40] (03PS2) 10Legoktm: check_php_syntax: Check for any content before opening 10Deployment-Systems, 10Incident-20150312-whitespace, 5Patch-For-Review: scap's check_php_syntax() should check for text before ' 10Continuous-Integration, 10Incident-20150312-whitespace, 6MediaWiki-Core-Team, 6operations: add a check for whitespace before leading 10Deployment-Systems, 10Incident-20150312-whitespace, 5Patch-For-Review: scap's check_php_syntax() should check for text before ' (03CR) 10BryanDavis: check_php_syntax: Check for any content before opening 10Continuous-Integration: Create generic "tox" jobs - https://phabricator.wikimedia.org/T92557#1114534 (10Legoktm) 3NEW [20:09:15] 10Continuous-Integration: Replace project-specific "{name}-thing" jobs with generic "thing" ones - https://phabricator.wikimedia.org/T91997#1114541 (10Legoktm) [20:15:00] Yippee, build fixed! [20:15:01] Project browsertests-CentralNotice-en.wikipedia.beta.wmflabs.org-os_x_10.9-safari-sauce build #172: FIXED in 1 min 6 sec: https://integration.wikimedia.org/ci/job/browsertests-CentralNotice-en.wikipedia.beta.wmflabs.org-os_x_10.9-safari-sauce/172/ [20:22:50] Project browsertests-VisualEditor-en.wikipedia.beta.wmflabs.org-linux-chrome-sauce build #573: ABORTED in 7 min 49 sec: https://integration.wikimedia.org/ci/job/browsertests-VisualEditor-en.wikipedia.beta.wmflabs.org-linux-chrome-sauce/573/ [20:22:51] Project browsertests-MobileFrontend-en.m.wikipedia.beta.wmflabs.org-linux-firefox-sauce build #602: ABORTED in 16 min: https://integration.wikimedia.org/ci/job/browsertests-MobileFrontend-en.m.wikipedia.beta.wmflabs.org-linux-firefox-sauce/602/ [20:22:53] Project browsertests-VisualEditor-en.wikipedia.beta.wmflabs.org-windows_8-internet_explorer-sauce build #357: ABORTED in 14 min: https://integration.wikimedia.org/ci/job/browsertests-VisualEditor-en.wikipedia.beta.wmflabs.org-windows_8-internet_explorer-sauce/357/ [20:22:54] Project 
browsertests-MobileFrontend-en.m.wikipedia.beta.wmflabs.org-linux-chrome-sauce build #566: ABORTED in 36 min: https://integration.wikimedia.org/ci/job/browsertests-MobileFrontend-en.m.wikipedia.beta.wmflabs.org-linux-chrome-sauce/566/ [20:26:40] !log Restablished Gearman connection from Zuul due to deadlock [20:26:45] Logged the message, Master [20:31:07] (03PS1) 10Legoktm: Create generic tox-* jobs, use for labs projects [integration/config] - 10https://gerrit.wikimedia.org/r/196392 (https://phabricator.wikimedia.org/T92557) [20:40:24] <^d> thcipriani: Ty sir [20:40:57] ^d: yw sir [20:41:00] (03CR) 10Legoktm: [C: 032] Create generic tox-* jobs, use for labs projects [integration/config] - 10https://gerrit.wikimedia.org/r/196392 (https://phabricator.wikimedia.org/T92557) (owner: 10Legoktm) [20:45:14] 10Continuous-Integration, 10Wikidata, 3§ Wikidata-Sprint-2015-02-25, 3§ Wikidata-Sprint-2015-03-11: fix the qunit tests for wikidata: mwext-Wikibase-qunit - https://phabricator.wikimedia.org/T74184#1114678 (10adrianheine) [20:46:16] ^d: thcipriani I like the respect you show each other :) [20:46:20] (03Merged) 10jenkins-bot: Create generic tox-* jobs, use for labs projects [integration/config] - 10https://gerrit.wikimedia.org/r/196392 (https://phabricator.wikimedia.org/T92557) (owner: 10Legoktm) [20:46:48] <^d> thcipriani: staging-mc[1-3] are now all done, on precise like prod eqiad [20:47:44] ^d: beauty, I'm working on mx for right now. This it's actually a fairly simple patch, looks like, that'll be done this afternoon, likely. [20:47:59] <^d> awesome [20:48:40] ^d: db is done-ish, minus some documentation, bash scripts, patch acceptance, and instance right-sizing. 
[20:50:21] <^d> Awesome [20:53:16] 10Staging: Create staging cluster (tracking) - https://phabricator.wikimedia.org/T88702#1114705 (10demon) [20:58:32] whoa, tables in phab [20:58:36] eg on https://phabricator.wikimedia.org/T89084 [20:59:09] 10Continuous-Integration, 10Wikimedia-Git-or-Gerrit: Gerrit: delete integration/kss.git repo - https://phabricator.wikimedia.org/T92483#1114740 (10chasemp) >>! In T92483#1112885, @demon wrote: > Deleted from github and gitblit. > > I don't have rights to delete here on Phab so @chasemp can you `./bin/remove d... [20:59:10] (03PS1) 10Legoktm: Use generic tox jobs for analytics [integration/config] - 10https://gerrit.wikimedia.org/r/196422 (https://phabricator.wikimedia.org/T92557) [20:59:35] <^d> greg-g: I save all my snark for bd808 :p [20:59:55] * bd808 needs more snark [21:00:00] greg-g: the syntax for them is even worse than mediawiki's. [21:00:22] MatmaRex: oh, well then :) [21:00:59] 10Continuous-Integration, 5Patch-For-Review: Remove integration/kss.git - https://phabricator.wikimedia.org/T92482#1114754 (10demon) [21:01:00] 10Continuous-Integration, 10Wikimedia-Git-or-Gerrit: Gerrit: delete integration/kss.git repo - https://phabricator.wikimedia.org/T92483#1114752 (10demon) 5Open>3Resolved a:3demon [21:03:20] ^d: question—I think I need a public url (or an internally resolvable url) for a mediawiki instance in staging for the VERP bouncer. What is that going to be? Any reason http://deployment.wikimedia.beta.wmflabs.org is so long? [21:03:37] 6Release-Engineering, 10Ops-Access-Requests, 6Phabricator, 6operations: Chad H. needs access to iridium (Phabricator host) to manage repos - https://phabricator.wikimedia.org/T92564#1114785 (10demon) 3NEW [21:04:24] <^d> thcipriani: We could probably come up with nicer urls. 
[21:04:44] (03CR) 10Legoktm: [C: 032] Use generic tox jobs for analytics [integration/config] - 10https://gerrit.wikimedia.org/r/196422 (https://phabricator.wikimedia.org/T92557) (owner: 10Legoktm) [21:05:28] ^d: heh, I'm just going to use staging.wmflabs.org for now. Can be changed later. [21:05:38] bd808: in scap, how/where do I add a test case? [21:05:55] It uses doc tests right now [21:06:15] The >>> something comments [21:07:00] hmm, can I add invalid php files to the repo and just point to those paths? [21:07:14] you could I suppose... [21:07:43] or can I add a real unittest case thing? [21:07:55] yeah. tox runes nose over the code [21:08:32] so things in a test module should be picked up magically [21:08:47] *tox runs [21:09:08] "tox runes nose" is the name of my next ska band [21:10:03] (03Merged) 10jenkins-bot: Use generic tox jobs for analytics [integration/config] - 10https://gerrit.wikimedia.org/r/196422 (https://phabricator.wikimedia.org/T92557) (owner: 10Legoktm) [21:11:05] thcipriani: staging.wmflabs.org sounds fine, that would mean {wikiname}.staging.wmflabs.org, which is fine/good. [21:12:16] <^d> We'd still end up with foo.bar.staging.wmflabs.org [21:13:04] <^d> Why do we have a deployment.wm anyway? what is that supposed to mirror from prod? [21:13:11] oh right, en.wikipedia.staging.wmflabs and commons.staging.wmflabs [21:13:23] <^d> No, commons.wikimedia.staging.wmflabs.org [21:13:28] gah [21:13:33] right [21:15:12] <^d> I mean even if we had our own tld, we'd still be at commons.wikimedia.$foo.org [21:15:30] 6Release-Engineering, 10Ops-Access-Requests, 6Phabricator, 6operations, 5Patch-For-Review: Chad H. needs access to iridium (Phabricator host) to manage repos - https://phabricator.wikimedia.org/T92564#1114839 (10chasemp) give me a day or so to poke at this and make sure the perms and abilities are sane (... 
[21:15:49] yeah, see also ssl cert cost issues [21:16:24] 6Release-Engineering, 10Wikimedia-Hackathon-2015: Release/QA tasks at the Wikimedia Hackathon 2015 - https://phabricator.wikimedia.org/T92565#1114841 (10greg) 3NEW a:3Qgil [21:16:31] * greg-g just creates the task, that one ^ [21:16:48] 6Release-Engineering, 10Wikimedia-Hackathon-2015: Release/QA tasks at the Wikimedia Hackathon 2015 - https://phabricator.wikimedia.org/T92565#1114859 (10greg) a:5Qgil>3None [21:16:56] 6Release-Engineering, 10Ops-Access-Requests, 6Phabricator, 6operations, 5Patch-For-Review: Chad H. needs access to iridium (Phabricator host) to manage repos - https://phabricator.wikimedia.org/T92564#1114862 (10Dzahn) first we should create a new admin group for this, either phab-roots , that would assu... [21:18:17] 6Release-Engineering, 10Ops-Access-Requests, 6Phabricator, 6operations, 5Patch-For-Review: Chad H. needs access to iridium (Phabricator host) to manage repos - https://phabricator.wikimedia.org/T92564#1114863 (10Dzahn) p:5Triage>3Normal [21:18:31] wait, that last /me was meant for -devtools [21:24:15] 10Staging: Create staging-mx (Mail server, pollonium replacement) - https://phabricator.wikimedia.org/T91562#1114892 (10greg) a:3thcipriani [21:24:35] 10Staging, 5Patch-For-Review: Create staging-db* (databases) - https://phabricator.wikimedia.org/T91545#1114899 (10greg) a:3thcipriani [21:25:36] 6Release-Engineering, 10Ops-Access-Requests, 6Phabricator, 6operations, 5Patch-For-Review: Chad H. needs access to iridium (Phabricator host) to manage repos - https://phabricator.wikimedia.org/T92564#1114906 (10greg) I want to have a meme I can use to indicate my manager approval. Approve. 
[21:30:27] (03PS1) 10Legoktm: Convert all other jobs to use generic tox ones [integration/config] - 10https://gerrit.wikimedia.org/r/196440 [21:33:08] (03PS11) 10Awight: Jenkins job builder definition for CRM job [integration/config] - 10https://gerrit.wikimedia.org/r/195063 (https://phabricator.wikimedia.org/T91895) [21:34:37] (03PS2) 10Legoktm: Convert all other jobs to use generic tox ones [integration/config] - 10https://gerrit.wikimedia.org/r/196440 [21:35:01] (03PS2) 10Awight: WIP: Set to voting once the job passes [integration/config] - 10https://gerrit.wikimedia.org/r/195343 (https://phabricator.wikimedia.org/T91895) [21:35:59] 6Release-Engineering, 10Ops-Access-Requests, 6Phabricator, 6operations, 5Patch-For-Review: Chad H. needs access to iridium (Phabricator host) to manage repos - https://phabricator.wikimedia.org/T92564#1114939 (10Dzahn) http://knowyourmeme.com/memes/approval-guy What are you approving though? :) [21:36:02] (03CR) 10Legoktm: [C: 032] Convert all other jobs to use generic tox ones [integration/config] - 10https://gerrit.wikimedia.org/r/196440 (owner: 10Legoktm) [21:40:45] 6Release-Engineering, 10Ops-Access-Requests, 6Phabricator, 6operations, 5Patch-For-Review: Chad H. needs access to iridium (Phabricator host) to manage repos - https://phabricator.wikimedia.org/T92564#1114948 (10greg) {F85729} >>! In T92564#1114939, @Dzahn wrote: > What are you approving though? :) What... [21:41:39] (03Merged) 10jenkins-bot: Convert all other jobs to use generic tox ones [integration/config] - 10https://gerrit.wikimedia.org/r/196440 (owner: 10Legoktm) [21:45:51] (03PS1) 10Legoktm: Convert integration/* to use generic tox jobs [integration/config] - 10https://gerrit.wikimedia.org/r/196446 [21:49:26] legoktm: Nice going :) [21:49:33] Feels good? [21:49:43] yes :D [21:50:03] Every job deleted makes 'jenkins-jobs' run faster to compile the whole thing, too. 
[21:50:15] Those infamous 6 minutes are quite annoying [21:50:48] (03CR) 10Legoktm: [C: 032] Convert integration/* to use generic tox jobs [integration/config] - 10https://gerrit.wikimedia.org/r/196446 (owner: 10Legoktm) [21:51:17] it takes about 2.5 min on my laptop [21:51:33] but that's still pretty slow [21:54:17] woah, it's down to 1m52s :D [21:55:25] 6Release-Engineering, 10Wikimedia-Hackathon-2015: Release/QA tasks at the Wikimedia Hackathon 2015 - https://phabricator.wikimedia.org/T92565#1114996 (10greg) [21:56:04] (03Merged) 10jenkins-bot: Convert integration/* to use generic tox jobs [integration/config] - 10https://gerrit.wikimedia.org/r/196446 (owner: 10Legoktm) [21:56:38] (03CR) 10Krinkle: [C: 031] "Shallow submodules is supported by Git, but not implemented by Jenkins GIT Plugin. When we re-implement this shallow macro using Zuul we c" [integration/config] - 10https://gerrit.wikimedia.org/r/195985 (https://phabricator.wikimedia.org/T92042) (owner: 10Legoktm) [21:56:54] legoktm: ls -l output/ | wc -l [21:56:59] legoktm: How many do you got? [21:57:13] It was 3400 last week. 3075 yesterday. 
[21:57:34] 3045 [21:57:42] which sounds about right [22:03:02] (03PS1) 10Legoktm: Convert labs/tools/extdist,wikibugs2 to use generic tox jobs [integration/config] - 10https://gerrit.wikimedia.org/r/196451 [22:06:15] (03CR) 10Legoktm: [C: 032] Convert labs/tools/extdist,wikibugs2 to use generic tox jobs [integration/config] - 10https://gerrit.wikimedia.org/r/196451 (owner: 10Legoktm) [22:11:42] (03CR) 10jenkins-bot: [V: 04-1] Convert labs/tools/extdist,wikibugs2 to use generic tox jobs [integration/config] - 10https://gerrit.wikimedia.org/r/196451 (owner: 10Legoktm) [22:12:18] (03CR) 10Legoktm: [C: 032] "try again plz" [integration/config] - 10https://gerrit.wikimedia.org/r/196451 (owner: 10Legoktm) [22:12:35] 10Continuous-Integration, 10Wikimedia-Git-or-Gerrit: Gerrit: delete integration/kss.git repo - https://phabricator.wikimedia.org/T92483#1115022 (10hashar) Merci beaucoup! [22:14:52] 6Release-Engineering, 10Security-Reviews, 10Wikimedia-Extension-setup, 10Wikimedia-Site-requests: Install SMW on AffCom wiki - https://phabricator.wikimedia.org/T88748#1115023 (10greg) 5Open>3stalled @80686: I'm setting this request to "stalled" for now given the reality of what is likely to happen any... [22:17:39] (03Merged) 10jenkins-bot: Convert labs/tools/extdist,wikibugs2 to use generic tox jobs [integration/config] - 10https://gerrit.wikimedia.org/r/196451 (owner: 10Legoktm) [22:21:32] (03PS2) 10Legoktm: disable submodules for git-remote-zuul-shallow-clone [integration/config] - 10https://gerrit.wikimedia.org/r/195985 (https://phabricator.wikimedia.org/T92042) [22:29:44] (03CR) 10Legoktm: [C: 032] disable submodules for git-remote-zuul-shallow-clone [integration/config] - 10https://gerrit.wikimedia.org/r/195985 (https://phabricator.wikimedia.org/T92042) (owner: 10Legoktm) [22:30:54] Has anyone written phpunit tests that account for a different response from hhvm vs zend? Every way I try feels ugly.. 
[22:31:57] csteipp: in the core API tests there are a few if wfIsHHVM() { $this->markTestSkipped() } [22:32:25] I think, or we converted them to general feature checks [22:34:05] legoktm: Do you remember which class? I'm not seeing anything like that. [22:34:56] (03Merged) 10jenkins-bot: disable submodules for git-remote-zuul-shallow-clone [integration/config] - 10https://gerrit.wikimedia.org/r/195985 (https://phabricator.wikimedia.org/T92042) (owner: 10Legoktm) [22:37:47] csteipp: oh, I misremembered. In ApiFormatDumpTest we checked if xdebug was installed and skipped if so [22:40:04] legoktm: Ah, cool. That still feel dirty, but at least I can be consistently dirty. [22:43:41] greg-g / whomever: Is 1.25wmf22 going to be the last 1.25 one or are we going to keep going for a bit? [22:44:39] James_F: right, I forgot about that... [22:44:51] chad reminded me today about branch day coming up [22:45:11] * James_F nods. [22:46:23] greg-g: I'm created the wmf22 release tag yesterday and it reminded me. [22:47:40] 5 weeks pre May 25th is is April 20th, no? [22:58:33] (03PS3) 10Legoktm: check_php_syntax: Check for any content before opening (03CR) 10Legoktm: check_php_syntax: Check for any content before opening James_F: so, wmf22 is on the 18th, which is 2 months pre 1.25 release, vs what M&M did last time which was 6 weeks prior. I'm fine with either. the 2 months is probably good for backports purposes [23:02:50] * greg-g thinking outloud [23:06:11] * James_F nods. [23:07:30] legoktm: do you have branching timing opinions? ^^ [23:07:51] yes [23:07:59] What are they? [23:08:15] :) [23:08:17] I was still expecting the April branch date that hexmode announced [23:08:30] So after wmf25 or something? [23:09:22] https://www.mail-archive.com/wikitech-l@lists.wikimedia.org/msg78127.html said April 15th [23:09:32] what are we on right now?21? 
[23:09:48] yes [23:09:50] April 20th is about wmf26 [23:10:06] * James_F edits https://www.mediawiki.org/wiki/MediaWiki_1.25/Roadmap [23:10:06] going until wmf25 for 1.25 sounds good :) [23:10:14] ah, 15th, so 25th [23:10:18] er wmf25 [23:13:19] 10Continuous-Integration: Replace project-specific "{name}-thing" jobs with generic "thing" ones - https://phabricator.wikimedia.org/T91997#1115268 (10Legoktm) [23:13:19] 10Continuous-Integration, 5Patch-For-Review: Create generic "tox" jobs - https://phabricator.wikimedia.org/T92557#1115265 (10Legoktm) 5Open>3Resolved a:3Legoktm We now have generic tox jobs for "doc", "flake8", "flake8-bin", "py27", and "py24" environments. Some jobs that are non-voting or are using cust... [23:13:27] greg-g: :-) [23:13:33] 10Continuous-Integration: Replace project-specific "{name}-thing" jobs with generic "thing" ones - https://phabricator.wikimedia.org/T91997#1101137 (10Legoktm) [23:13:53] 10Continuous-Integration, 10Librarization: Zuul-cloner should check out submodules - https://phabricator.wikimedia.org/T84942#1115270 (10awight) The logic to do this correctly seems pretty straightforward, my only question is what the clone statement would look like for submodule projects that should be tested... [23:17:59] greg-g: https://www.mediawiki.org/w/index.php?title=MediaWiki_1.25/Roadmap&diff=1443397&oldid=1441107 [23:18:03] It's official now. :-) [23:20:44] (03PS1) 10Legoktm: Create generic perllint job and use it [integration/config] - 10https://gerrit.wikimedia.org/r/196479 [23:26:30] (03CR) 10Legoktm: [C: 032] Create generic perllint job and use it [integration/config] - 10https://gerrit.wikimedia.org/r/196479 (owner: 10Legoktm) [23:29:50] legoktm: CI changes during SWAT isn't exactly optimal… [23:30:09] James_F: oh, is someone updating the timeline extension? 
[23:30:17] ;) [23:30:49] I didn't realize it was SWAT though, I'll keep my other ones until after then [23:31:05] legoktm: About 20 people are waiting for Zuul so they can test things, and having it however-smartly pull new config changes which probably only effect non-deploying code is extra risk. [23:31:07] * James_F nods. [23:31:21] legoktm: Clearly we need to add you to the SWAT deployers so you remember… oh. ;-) [23:31:45] (03Merged) 10jenkins-bot: Create generic perllint job and use it [integration/config] - 10https://gerrit.wikimedia.org/r/196479 (owner: 10Legoktm) [23:34:57] !log Depooling integration-slave1402 to play with T92351 [23:35:01] Logged the message, Master [23:39:36] 10Continuous-Integration, 6Labs, 10OOjs, 10Wikimedia-Labs-Infrastructure, 6operations: Jenkins failing with "Error: GET https://saucelabs.com: Couldn't resolve host name." - https://phabricator.wikimedia.org/T92351#1115326 (10Dzahn) related to the DNS work on labs i would suspect. https://phabricator.wi... [23:40:07] 10Continuous-Integration, 6Labs, 10OOjs, 10Wikimedia-Labs-Infrastructure, 6operations: Jenkins failing with "Error: GET https://saucelabs.com: Couldn't resolve host name." - https://phabricator.wikimedia.org/T92351#1115328 (10Dzahn) @Coren ^ is that possible ? [23:45:51] 10Continuous-Integration: Replace project-specific "{name}-thing" jobs with generic "thing" ones - https://phabricator.wikimedia.org/T91997#1115349 (10Legoktm) [23:52:02] legoktm: Don't forget to also delete them on Jenkins. If there I many, I checkout the revision from when they exist in integration/config and use jenkins-jobs --delete 'pattern' [23:52:12] That [23:52:18] That's how I deleted all the -phplint jobs [23:52:37] There's a notable overhead in Jenkins from all those existing. [23:52:39] they have to exist locally so I can delete them? [23:52:53] or can I just pass a glob that it'll process on the server? [23:53:00] Yeah, that's the only way JJB takes the pattern. 
It uses its own database, not Jenkins. Much faster that way [23:53:15] and also because it has a no-touching policy on jobs not managed by itself [23:53:23] ok [23:53:46] I'll do that after SWAT finishes [23:54:36] 6Release-Engineering, 10Ops-Access-Requests, 6Phabricator, 6operations, 5Patch-For-Review: Chad H. needs access to iridium (Phabricator host) to manage repos - https://phabricator.wikimedia.org/T92564#1115364 (10RobH) a:3chasemp I'm assigning this to @chasemp for his check that this works, and all chan... [23:55:12] legoktm: OTOH, do you want to merge and deploy https://gerrit.wikimedia.org/r/#/c/191063/ for me after SWAT? ;-) [23:56:27] 6Release-Engineering, 10Ops-Access-Requests, 6Phabricator, 6operations, 5Patch-For-Review: Chad H. needs access to iridium (Phabricator host) to manage repos - https://phabricator.wikimedia.org/T92564#1115377 (10Dzahn) >>! In T92564#1115364, @RobH wrote: > Additionally, as this is requesting sudo, it wil... [23:56:37] James_F: is there a jjb commit that creates the npm job? [23:56:52] legoktm: I think it was merged months ago. [23:57:03] I don't see one... [23:57:35] https://integration.wikimedia.org/ci/job/mwext-Citoid-npm/ [23:59:08] legoktm: Meh. Krinkle tells me what to do. [23:59:14] 6Release-Engineering, 10Ops-Access-Requests, 6Phabricator, 6operations, 5Patch-For-Review: Chad H. needs access to iridium (Phabricator host) to manage repos - https://phabricator.wikimedia.org/T92564#1115390 (10RobH) Policy states https://wikitech.wikimedia.org/wiki/Requesting_shell_access: 5. If your...