[00:18:52] Continuous-Integration-Config, Analytics, WMDE-Analytics-Engineering, Wikidata: Add basic jenkins linting to analytics-limn-wikidata-data - https://phabricator.wikimedia.org/T116007#1740759 (Dzahn)
[00:19:05] twentyafterfour, greg-g: hence why you shouldn't leave these things unsync'd on tin
[00:19:05] Continuous-Integration-Config, Analytics, WMDE-Analytics-Engineering, Wikidata: Add basic jenkins linting to analytics-limn-wikidata-data - https://phabricator.wikimedia.org/T116007#1738262 (Dzahn)
[00:19:11] anyone can come along later and scap
[00:19:14] not just l10nupdate
[00:19:43] Krenair: all deployments were called off, or so I thought
[00:19:49] yes
[00:19:58] it really should have just been sync'd at the time and been done with
[00:20:11] that doesn't mean you can leave stuff alone unsync'd
[00:20:11] but there was chaos and opsen were annoyed
[00:21:02] I was under the impression that it had been reverted (in fact, ori committed the revert patch last monday at the very beginning of my deployment)
[00:33:39] From my perspective this is what happened: people jumped the gun, reverted my in-progress deployment without warning me and without confirming that it was actually the cause of the problems (it wasn't) then I was told no more deployments before I got to finish deploying in the first place.
[00:55:39] PROBLEM - Puppet staleness on integration-dev is CRITICAL: CRITICAL: 100.00% of data above the critical threshold [43200.0]
[02:47:46] Project beta-scap-eqiad build #75186: FAILURE in 3 min 3 sec: https://integration.wikimedia.org/ci/job/beta-scap-eqiad/75186/
[03:27:32] Yippee, build fixed!
[03:27:32] Project browsertests-MobileFrontend-en.m.wikipedia.beta.wmflabs.org-linux-firefox-sauce build #855: FIXED in 45 min: https://integration.wikimedia.org/ci/job/browsertests-MobileFrontend-en.m.wikipedia.beta.wmflabs.org-linux-firefox-sauce/855/
[04:22:56] Beta-Cluster-Infrastructure, Deployment-Systems, Beta-Cluster-reproducible: Scap failing with 'bool' object has no attribute 'encode' - https://phabricator.wikimedia.org/T114852#1741042 (greg) This same error is happening again :( ``` 02:47 < wmf-insec> Project beta-scap-eqiad build #75186: FAILURE...
[05:55:34] (CR) Legoktm: [C: 2] Use pipeline name as context for Zuul diff [integration/config] - https://gerrit.wikimedia.org/r/247543 (owner: Hashar)
[05:57:29] (Merged) jenkins-bot: Use pipeline name as context for Zuul diff [integration/config] - https://gerrit.wikimedia.org/r/247543 (owner: Hashar)
[06:03:01] Continuous-Integration-Config, MediaWiki-Codesniffer: Craft a Jenkins job that runs PHP_CodeSniffer for mw/core using composer - https://phabricator.wikimedia.org/T102598#1741147 (Legoktm) >>! In T102598#1675133, @Legoktm wrote: > I didn't bother trying to create a job that only runs on changed files bec...
[06:41:04] Beta-Cluster-Infrastructure, operations, Shinken: Make the Shinken IRC alert bot use colors - https://phabricator.wikimedia.org/T113785#1741252 (Dzahn) p:Normal>Low
[06:43:34] Beta-Cluster-Infrastructure, operations, Shinken: Make the Shinken IRC alert bot use colors - https://phabricator.wikimedia.org/T113785#1741255 (Dzahn) colors for icinga-wm as well. wikibugs has them. so yay
[06:47:56] Yippee, build fixed!
[06:47:57] Project beta-scap-eqiad build #75209: FIXED in 3 min 24 sec: https://integration.wikimedia.org/ci/job/beta-scap-eqiad/75209/
[07:11:13] moar colorssssss
[07:34:45] Release-Engineering-Team, Performance-Team: Investigate loadEventEnd / domComplete regression - https://phabricator.wikimedia.org/T116137#1741433 (ori) p:High>Unbreak!
[07:41:22] Release-Engineering-Team, Performance-Team: Investigate loadEventEnd / domComplete regression - https://phabricator.wikimedia.org/T116137#1741458 (ori) @greg, could someone from your team help us isolate this by going over code changes that rolled out during this period? From graphite it looks like the reg...
[08:34:30] Project browsertests-MultimediaViewer-en.wikipedia.beta.wmflabs.org-os_x_10.9-safari-sauce build #758: FAILURE in 24 min: https://integration.wikimedia.org/ci/job/browsertests-MultimediaViewer-en.wikipedia.beta.wmflabs.org-os_x_10.9-safari-sauce/758/
[08:57:11] Beta-Cluster-Infrastructure, Browser-Tests: Make selenium users use botflags at beta-cluster - https://phabricator.wikimedia.org/T116027#1741631 (zeljkofilipin) a:zeljkofilipin
[08:58:17] Browser-Tests, VisualEditor, Patch-For-Review: Delete or fix failed VisualEditor browsertests Jenkins jobs - https://phabricator.wikimedia.org/T94162#1741633 (zeljkofilipin) a:Amire80>zeljkofilipin
[09:25:01] zeljkof: took longer than expected :/
[09:25:20] zeljkof: had to visit my accountant, come back home, take some railway tickets etc
[09:31:45] hashar: Could you review https://gerrit.wikimedia.org/r/#/c/247072/ please
[09:32:09] hashar: Can you also review https://gerrit.wikimedia.org/r/#/c/246710/ please.
[09:32:10] paladox: will do some reviews after lunch
[09:32:19] hashar: Ok thanks.
[09:32:21] unpiling the few hundred emails I received overnight :-}
[09:37:27] hashar: no problem
[09:37:42] We can talk later today or on Friday
[09:38:30] (Abandoned) Paladox: Add new phplint template [integration/config] - https://gerrit.wikimedia.org/r/245190 (owner: Paladox)
[09:46:42] Beta-Cluster-Infrastructure, operations, Shinken: Make the Shinken IRC alert bot use colors - https://phabricator.wikimedia.org/T113785#1741714 (hashar) Seems the notification commands are defined in puppet `modules/nagios_common/templates/notification_commands.cfg.erb` and simply append to a file that...
[09:47:19] !sal
[09:47:19] https://tools.wmflabs.org/sal/releng
[10:10:22] hashar: It doesn't seem https://www.mediawiki.org/wiki/User:Legoktm/ci is updated correctly because it shows the column as still Extension not Project.
[10:11:10] paladox: been refreshed a couple hours ago, I guess legoktm is still using some old commit
[10:11:33] hashar: Oh. How would it be updated?
[10:12:06] that is running on tools-labs, so would need legoktm to update the source code there
[10:12:18] but I have pushed a bunch of scary changes so he might want to test them first
[10:12:25] hashar: Oh ok.
[10:43:43] (PS1) Paladox: [ApiFeatureUsage] Update Jenkins tests [integration/config] - https://gerrit.wikimedia.org/r/247805
[10:44:23] (PS2) Paladox: [ApiFeatureUsage] Update Jenkins tests [integration/config] - https://gerrit.wikimedia.org/r/247805
[10:47:44] Could someone review https://gerrit.wikimedia.org/r/#/c/213825/ please. It is the last of the confirmedit files to be converted to extension.json.
[11:12:08] Release-Engineering-Team, Performance-Team: Investigate loadEventEnd / domComplete regression - https://phabricator.wikimedia.org/T116137#1741822 (Peter) Checked WebPageTest for that period, the data in Graphite is dead but when I compared between different runs in our WPT instance, there's one extra javas...
[11:40:46] PROBLEM - Puppet staleness on deployment-restbase01 is CRITICAL: CRITICAL: 22.22% of data above the critical threshold [43200.0]
[12:39:00] Continuous-Integration-Scaling, Tracking: Investigate using Drydock for CI - https://phabricator.wikimedia.org/T116038#1742014 (JanZerebecki) Docs about it: https://secure.phabricator.com/book/phabricator/article/drydock/
[12:40:05] Continuous-Integration-Scaling, Tracking: [tracking] Disposable VMs need a cache for package managers - https://phabricator.wikimedia.org/T112560#1742018 (JanZerebecki)
[12:40:06] Continuous-Integration-Scaling, Tracking: Investigate using Drydock for CI - https://phabricator.wikimedia.org/T116038#1742019 (JanZerebecki)
[12:41:51] Release-Engineering-Team, Performance-Team: Investigate loadEventEnd / domComplete regression - https://phabricator.wikimedia.org/T116137#1742028 (hashar) + @Ebernhardson who deployed some potentially related changes to Search ----- From production SAL https://tools.wmflabs.org/sal/production?d=2015-10-1...
[12:51:13] Release-Engineering-Team, Performance-Team: Investigate loadEventEnd / domComplete regression - https://phabricator.wikimedia.org/T116137#1742061 (hashar) TL;DR above: Both WikimediaEvents make use of `$.jStorage` and are related to the EventLogging schema `TestSearchSatisfication2`. So that is inline wit...
[13:05:12] Yippee, build fixed!
[13:05:13] Project browsertests-MobileFrontend-en.m.wikipedia.beta.wmflabs.org-linux-chrome-sauce build #823: FIXED in 33 min: https://integration.wikimedia.org/ci/job/browsertests-MobileFrontend-en.m.wikipedia.beta.wmflabs.org-linux-chrome-sauce/823/
[13:25:09] Continuous-Integration-Infrastructure, Commons, Multimedia, UploadWizard: UploadWizard API smoke tests falling because Selenium_user has been blocked on commons.wikimedia.org - https://phabricator.wikimedia.org/T115465#1742130 (Steinsplitter) I have reblocked the account. The account is still editi...
[13:25:43] Continuous-Integration-Infrastructure, Commons, Multimedia, UploadWizard: UploadWizard API smoke tests falling because Selenium_user has been blocked on commons.wikimedia.org - https://phabricator.wikimedia.org/T115465#1742137 (Steinsplitter) Resolved>Open
[13:27:59] Browser-Tests, MediaWiki-extensions-WikibaseRepository, Wikidata: [Bug] timeouts during selenium tests with sourcelabs - https://phabricator.wikimedia.org/T68071#1742151 (JanZerebecki)
[13:28:41] Browser-Tests, MediaWiki-extensions-WikibaseRepository, Wikidata: [Bug] timeouts during selenium tests with beta - https://phabricator.wikimedia.org/T68071#715239 (JanZerebecki)
[13:29:49] Browser-Tests, MediaWiki-extensions-WikibaseRepository, Wikidata: [Task] investigate failing Wikidata browsertests on jenkins - https://phabricator.wikimedia.org/T92619#1742163 (JanZerebecki)
[13:29:53] Browser-Tests, MediaWiki-extensions-WikibaseRepository, Wikidata: [Bug] timeouts during selenium tests with beta - https://phabricator.wikimedia.org/T68071#715239 (JanZerebecki) Open>Resolved a:JanZerebecki All the failures I see are usually because of saucelabs problems. Beta itself being t...
[13:29:56] Browser-Tests, Wikidata, Tracking: Wikidata Browsertests (tracking) - https://phabricator.wikimedia.org/T88541#1742164 (JanZerebecki)
[13:32:09] Continuous-Integration-Scaling, operations: install/deploy scandium as zuul merger (ci) server - https://phabricator.wikimedia.org/T95046#1742165 (hashar) Note the zuul-merger process on scandium will need to be able to reach the Gearman server on gallium (production).
[13:32:46] Browser-Tests, MobileFrontend, Upstream: Upstream: Issue with Chrome driver with resizing window - https://phabricator.wikimedia.org/T88288#1742167 (JanZerebecki) I also seem to see the same issue with firefox on saucelabs. Causes like 1 or 2 failures per run for wikidata.
[13:33:01] Browser-Tests, MobileFrontend, Upstream: Upstream: Issue with Chrome driver with resizing window - https://phabricator.wikimedia.org/T88288#1742169 (JanZerebecki)
[13:33:03] Browser-Tests, MediaWiki-extensions-WikibaseRepository, Wikidata: [Task] investigate failing Wikidata browsertests on jenkins - https://phabricator.wikimedia.org/T92619#1742168 (JanZerebecki)
[13:35:19] Browser-Tests: jenkins doesn't show all failed tests, but raita does - https://phabricator.wikimedia.org/T116162#1742170 (JanZerebecki) NEW
[13:38:12] Browser-Tests, Wikidata: No test report files were found: job fails in jenkins but is shown as successful in raita - https://phabricator.wikimedia.org/T116164#1742187 (JanZerebecki) NEW
[13:38:24] Browser-Tests, Wikidata: No test report files were found: job fails in jenkins but is shown as successful in raita - https://phabricator.wikimedia.org/T116164#1742195 (JanZerebecki)
[13:38:28] Browser-Tests, MediaWiki-extensions-WikibaseRepository, Wikidata: [Task] investigate failing Wikidata browsertests on jenkins - https://phabricator.wikimedia.org/T92619#1742194 (JanZerebecki)
[13:41:50] Browser-Tests, Wikidata: move wikidata browsertests to not use saucelabs - https://phabricator.wikimedia.org/T116166#1742216 (JanZerebecki)
[13:43:06] Browser-Tests, Wikidata: move wikidata browsertests to not use saucelabs - https://phabricator.wikimedia.org/T116166#1742206 (JanZerebecki)
[13:43:10] Browser-Tests, MediaWiki-extensions-WikibaseRepository, Wikidata: [Task] investigate failing Wikidata browsertests on jenkins - https://phabricator.wikimedia.org/T92619#1742221 (JanZerebecki)
[13:51:00] Continuous-Integration-Infrastructure, Commons, Multimedia, UploadWizard: UploadWizard API smoke tests falling because Selenium_user has been blocked on commons.wikimedia.org - https://phabricator.wikimedia.org/T115465#1742266 (hashar) a:hashar>None
[13:56:08] Continuous-Integration-Infrastructure, Commons, Multimedia, UploadWizard: UploadWizard API smoke tests falling because Selenium_user has been blocked on commons.wikimedia.org - https://phabricator.wikimedia.org/T115465#1742290 (hashar) Surely blocking the user again is a constructive way to priorit...
[13:56:18] Continuous-Integration-Infrastructure, Commons, Editing-Department, Multimedia, UploadWizard: UploadWizard API smoke tests falling because Selenium_user has been blocked on commons.wikimedia.org - https://phabricator.wikimedia.org/T115465#1742292 (hashar)
[14:33:57] PROBLEM - Puppet failure on deployment-restbase02 is CRITICAL: CRITICAL: 20.00% of data above the critical threshold [0.0]
[14:39:02] PROBLEM - Puppet failure on deployment-restbase01 is CRITICAL: CRITICAL: 66.67% of data above the critical threshold [0.0]
[14:45:44] RECOVERY - Puppet staleness on deployment-restbase01 is OK: OK: Less than 1.00% above the threshold [3600.0]
[14:53:51] RECOVERY - Puppet failure on deployment-restbase02 is OK: OK: Less than 1.00% above the threshold [0.0]
[14:54:03] RECOVERY - Puppet failure on deployment-restbase01 is OK: OK: Less than 1.00% above the threshold [0.0]
[15:03:50] Release-Engineering-Team, Performance-Team: Investigate loadEventEnd / domComplete regression - https://phabricator.wikimedia.org/T116137#1742501 (EBernhardson) rECIRab19311a7fc8: Report common terms usage to javascript unlikely to be the cause, the change here is just too small. it amounts to a couple d...
[15:07:40] Release-Engineering-Team, Performance-Team: Investigate loadEventEnd / domComplete regression - https://phabricator.wikimedia.org/T116137#1742511 (ori) @EBernhardson: Could you revert {rEWMVc5b6de098f16}? It'd be easy to un-revert if we're wrong.
[15:10:14] hasharMeeting: Could you review https://gerrit.wikimedia.org/r/246710 please.
[15:12:07] paladox: in meeting loop hole then I head back home :/
[15:12:38] hasharMeeting: Could you also review https://gerrit.wikimedia.org/r/#/c/246991 and https://gerrit.wikimedia.org/r/#/c/246424/ and https://gerrit.wikimedia.org/r/#/c/247072/ and https://gerrit.wikimedia.org/r/#/c/247294/ They are to do with adding php code sniffer or adding composer.json, and some are fixing jenkins.
[15:12:44] hasharMeeting: Ok.
[15:12:59] paladox: you might want to explain what you are doing between patchsets :D
[15:13:15] hasharMeeting: Ok.
[15:13:22] as for the other changes, if they are on extensions there are some other reviewers floating around
[15:13:31] but yeah will try to do a few of them either this evening or tomorrow
[15:14:09] code sniffer changes, I am unlikely to review them
[15:14:16] too scary on legacy code
[15:14:51] Flow / Echo there are a few active reviewers
[15:14:59] will deal with Webplatformauth eventually
[15:15:18] Beta-Cluster-Infrastructure, Deployment-Systems, Beta-Cluster-reproducible: Scap failing with 'bool' object has no attribute 'encode' - https://phabricator.wikimedia.org/T114852#1742546 (greg) It fixed itself? https://integration.wikimedia.org/ci/job/beta-scap-eqiad/75209/console
[15:17:01] hasharMeeting: Ok thanks. There are probably a few changes I've done that you could probably review without anything breaking. I've added the relevant authors to the php code sniffer but some repos don't have an active author, only active users who update the code.
[15:19:41] Yippee, build fixed!
[15:19:41] Project browsertests-CentralNotice-en.wikipedia.beta.wmflabs.org-os_x_10.9-chrome-sauce build #216: FIXED in 1 min 40 sec: https://integration.wikimedia.org/ci/job/browsertests-CentralNotice-en.wikipedia.beta.wmflabs.org-os_x_10.9-chrome-sauce/216/
[15:27:39] (PS3) JanZerebecki: pywikibot/core: also run tox-jessie in check pipeline [integration/config] - https://gerrit.wikimedia.org/r/247623 (https://phabricator.wikimedia.org/T87169)
[15:27:45] (CR) JanZerebecki: [C: 2] pywikibot/core: also run tox-jessie in check pipeline [integration/config] - https://gerrit.wikimedia.org/r/247623 (https://phabricator.wikimedia.org/T87169) (owner: JanZerebecki)
[15:28:52] (Merged) jenkins-bot: pywikibot/core: also run tox-jessie in check pipeline [integration/config] - https://gerrit.wikimedia.org/r/247623 (https://phabricator.wikimedia.org/T87169) (owner: JanZerebecki)
[15:34:16] (CR) JanZerebecki: "Confusing context in the diff :)" [integration/config] - https://gerrit.wikimedia.org/r/247649 (https://phabricator.wikimedia.org/T114250) (owner: Hashar)
[15:34:49] (PS2) JanZerebecki: fundraising/slander now has tox-jessie [integration/config] - https://gerrit.wikimedia.org/r/247649 (https://phabricator.wikimedia.org/T114250) (owner: Hashar)
[15:34:54] (CR) JanZerebecki: [C: 2] fundraising/slander now has tox-jessie [integration/config] - https://gerrit.wikimedia.org/r/247649 (https://phabricator.wikimedia.org/T114250) (owner: Hashar)
[15:35:46] (Merged) jenkins-bot: fundraising/slander now has tox-jessie [integration/config] - https://gerrit.wikimedia.org/r/247649 (https://phabricator.wikimedia.org/T114250) (owner: Hashar)
[15:42:32] (CR) JanZerebecki: [C: -1] "Repo to test has no composer.json. Could you check that yourself?" [integration/config] - https://gerrit.wikimedia.org/r/245928 (owner: Paladox)
[15:43:01] (PS3) JanZerebecki: Work around cucumber pretty formater bug [integration/config] - https://gerrit.wikimedia.org/r/247602 (https://phabricator.wikimedia.org/T110510)
[15:43:22] (CR) Paladox: "Hi I added the file here https://gerrit.wikimedia.org/r/#/c/245929/" [integration/config] - https://gerrit.wikimedia.org/r/245928 (owner: Paladox)
[15:44:04] (CR) JanZerebecki: "Yes it does. I chose disabling the pretty formatter over disabling raita as raita requires less scrolling." [integration/config] - https://gerrit.wikimedia.org/r/247602 (https://phabricator.wikimedia.org/T110510) (owner: JanZerebecki)
[15:56:09] Release-Engineering-Team, Performance-Team, Discovery-Cirrus-Sprint: Investigate loadEventEnd / domComplete regression - https://phabricator.wikimedia.org/T116137#1742609 (EBernhardson)
[16:03:15] PROBLEM - Puppet failure on integration-slave-jessie-1001 is CRITICAL: CRITICAL: 100.00% of data above the critical threshold [0.0]
[16:03:15] PROBLEM - Puppet failure on deployment-parsoidcache02 is CRITICAL: CRITICAL: 100.00% of data above the critical threshold [0.0]
[16:05:24] (CR) JanZerebecki: "Why did you not add that information before I reviewed this patch? That would have saved me time." [integration/config] - https://gerrit.wikimedia.org/r/245928 (owner: Paladox)
[16:05:57] (CR) Paladox: "Sorry. I will do that in future." [integration/config] - https://gerrit.wikimedia.org/r/245928 (owner: Paladox)
[16:06:16] (PS2) Paladox: [Offline] Update jenkins tests [integration/config] - https://gerrit.wikimedia.org/r/245928
[16:10:09] (CR) JanZerebecki: [C: 2] "Deployed to Jenkins jobs: ['browsertests-CentralAuth-en.wikipedia.beta.wmflabs.org-linux-firefox-sauce', 'browsertests-CentralNotice-en.wi" [integration/config] - https://gerrit.wikimedia.org/r/247602 (https://phabricator.wikimedia.org/T110510) (owner: JanZerebecki)
[16:12:42] (Merged) jenkins-bot: Work around cucumber pretty formater bug [integration/config] - https://gerrit.wikimedia.org/r/247602 (https://phabricator.wikimedia.org/T110510) (owner: JanZerebecki)
[16:12:53] thcipriani: hey, !log the switch to scap3 for restbase!
[16:13:01] it's huge and awesome and worthy!
[16:16:42] (PS3) JanZerebecki: Remove redundant Android tests [integration/config] - https://gerrit.wikimedia.org/r/247629 (owner: Niedzielski)
[16:18:10] greg-g: is this anything we should be aware of?
[16:18:22] are you starting to test in labs?
[16:19:24] PROBLEM - Free space - all mounts on deployment-bastion is CRITICAL: CRITICAL: deployment-prep.deployment-bastion.diskspace._var.byte_percentfree (<60.00%)
[16:19:27] (CR) JanZerebecki: [C: 2] Remove redundant Android tests [integration/config] - https://gerrit.wikimedia.org/r/247629 (owner: Niedzielski)
[16:20:30] gwicke: mobrovac didn't tell you?
[16:20:43] (CR) JanZerebecki: [V: -1] "Waiting for dependency." [integration/config] - https://gerrit.wikimedia.org/r/247805 (owner: Paladox)
[16:20:44] Release-Engineering-Team, Performance-Team, Discovery-Cirrus-Sprint: Investigate loadEventEnd / domComplete regression - https://phabricator.wikimedia.org/T116137#1742680 (EBernhardson) Reverted wmf.2 and wmf.3 back to before rEWMVc5b6de098f16: Add new fields to TestSearchSatisfaction with https://gerr...
[16:21:04] yeah there was a session today
[16:21:10] greg-g: no, this is the first I am hearing about this
[16:21:14] jzerebecki: Could you review https://gerrit.wikimedia.org/r/#/c/247072/ please since I want to test to see whether it fixes this extension https://gerrit.wikimedia.org/r/#/c/247864/ if it uses the zend-composer test.
[16:21:16] we deployed rb with scap3 in beta
[16:21:21] (CR) JanZerebecki: [V: -1] "Waiting on dependency." [integration/config] - https://gerrit.wikimedia.org/r/246773 (https://phabricator.wikimedia.org/T88343) (owner: Paladox)
[16:21:41] gwicke: today was the first iteration
[16:21:45] (Merged) jenkins-bot: Remove redundant Android tests [integration/config] - https://gerrit.wikimedia.org/r/247629 (owner: Niedzielski)
[16:22:01] so, should we start testing scap vs. our requirements?
[16:22:42] gwicke: that's what mobrovac has been doing...
[16:22:48] I'm honestly confused if you two talk right now :)
[16:23:04] greg-g: you created a cabal, you got a cabral
[16:23:07] *cabal
[16:23:21] today was the first day that scap3 was tried, that's the most appropriate way of putting it
[16:23:22] right, but tyler and everyone talk to me (I'm not in the cabal)
[16:23:32] why would mobrovac talk to you? why don't you ask him questions?
[16:23:40] come on guys
[16:23:41] wouldn't*
[16:24:13] so it sounds like now would be a good time to test scap3?
[16:24:19] we did :-)
[16:24:35] tyler showed us how to set up and deploy restbase with scap3 on the beta cluster
[16:24:45] there are a few follow up items to be dealt with but overall that seems to work fine (on beta)
[16:24:51] there's a lot of notes (or so I hear) from today's deployment
[16:24:52] okay, I'd be interested in other labs projects too
[16:24:53] gwicke: as i said, today was the first iteration, found stuff that needs to be taken care of, and then we continue
[16:25:03] and other VMs
[16:25:04] then we can follow up with other services. parsoid comes to mind
[16:25:13] seems apparently parsoid suffers from Trebuchet misbehavior
[16:25:14] *oids in general
[16:25:27] basically, how to deploy using scap to $VM
[16:25:41] and ideally on beta cluster, have scap3 being run by Jenkins whenever a change is merged to one of the repos
[16:26:04] that's not happening yet right? today was just a "do a manual deploy"?
[16:26:10] we might need some good documentation / tutorials for deployers
[16:26:36] one sure thing, is that end users (the deployers) will need some introduction courses. Potentially on beta cluster first
[16:26:40] before we switch the prod deploy
[16:26:47] but overall, the session went smoothly
[16:26:53] gwicke: yes, me too, and once we get to a flawless scap3 deploy, there should be docs explaining how to deploy in other projects as well
[16:27:00] one of the requirements is to really simplify the process outlined in https://wikitech.wikimedia.org/wiki/Labs_node_setup
[16:27:17] there shouldn't be a need to set up a handful of services just to be able to do a test deploy
[16:27:50] so that it actually scales down, for testing and third-party use
[16:27:51] gwicke: marxarelli set up a vagrant-based scap3 thingy
[16:27:54] (PS2) JanZerebecki: Update extension-unittests zuul layout template [integration/config] - https://gerrit.wikimedia.org/r/247072 (owner: Paladox)
[16:27:55] lemme find the link
[16:27:57] https://phabricator.wikimedia.org/diffusion/GSV/
[16:28:03] that ^
[16:28:10] (PS3) JanZerebecki: Update extension-unittests zuul layout template [integration/config] - https://gerrit.wikimedia.org/r/247072 (owner: Paladox)
[16:28:18] how does vagrant help with this?
[16:28:20] gwicke: https://github.com/marxarelli/scap-vagrant
[16:28:28] mobrovac: now lives in phab :)
[16:28:30] (CR) JanZerebecki: [C: 2] Update extension-unittests zuul layout template [integration/config] - https://gerrit.wikimedia.org/r/247072 (owner: Paladox)
[16:28:42] gwicke: did you read the README?
[16:28:43] (CR) Paladox: "Thanks." [integration/config] - https://gerrit.wikimedia.org/r/247072 (owner: Paladox)
[16:28:56] gwicke: or maybe I'm misunderstanding your question :)
[16:29:15] (myself and mobrovac ;) )
[16:29:24] (Merged) jenkins-bot: Update extension-unittests zuul layout template [integration/config] - https://gerrit.wikimedia.org/r/247072 (owner: Paladox)
[16:29:25] it seems to be a way of testing scap, rather than a way to deploy to $VM
[16:29:49] it's a way of testing the deploy process, yes
[16:29:51] gwicke: a good part of https://wikitech.wikimedia.org/wiki/Labs_node_setup is simplified nowadays. salt/puppet auto sign keys and we have hiera to set the variables on a per project basis
[16:30:43] (CR) JanZerebecki: [V: -1] "Waiting on dependency." [integration/config] - https://gerrit.wikimedia.org/r/247309 (owner: Paladox)
[16:31:08] (CR) JanZerebecki: [V: -1] "Waiting on dependency." [integration/config] - https://gerrit.wikimedia.org/r/247069 (owner: Paladox)
[16:31:29] for a third-party user (and even a labs user), setting up puppet and salt are fairly heavy requirements
[16:31:40] (CR) JanZerebecki: [V: -1] "Waiting on dependency." [integration/config] - https://gerrit.wikimedia.org/r/247067 (owner: Paladox)
[16:32:26] hasharMeeting, thcipriani: could you outline the process of deploying to a VM in a new labs project?
[16:32:50] i'd prefer we handle our use-case first and then move on to others
[16:33:12] gwicke: one of today's action items is documentation, and i'd love to see that in there as well
[16:33:24] testing new services is an important use case
[16:33:47] ok ....
[16:33:59] it should be easy to do so, with the least amount of dependencies possible
[16:34:11] gwicke: on deployment-prep, just spin a new VM, add whatever role you are interested in and it should just work
[16:34:17] (CR) JanZerebecki: "recheck" [integration/config] - https://gerrit.wikimedia.org/r/246986 (owner: Paladox)
[16:34:20] (CR) jenkins-bot: [V: -1] [RelatedArticles] Move jshint to check: [integration/config] - https://gerrit.wikimedia.org/r/246986 (owner: Paladox)
[16:34:23] labs has been dramatically improved
[16:34:42] I think there is even a vagrant file to manage your labs instance
[16:34:54] (CR) JanZerebecki: [V: -1] "Waiting for dependency." [integration/config] - https://gerrit.wikimedia.org/r/246983 (owner: Paladox)
[16:35:16] right, for other projects it's premature to talk about that, imho, but i fear puppet will still be needed
[16:35:24] deploy user keys bla bla
[16:35:36] paladox: care to manually rebase https://gerrit.wikimedia.org/r/#/c/246986/ ?
[16:35:38] even though, hm, wait, the actual deploy user should be configurable
[16:36:01] so if you have ssh set up between the machines, it should work if i'm not mistaken
[16:36:06] gwicke: there is also https://www.mediawiki.org/wiki/MediaWiki-Vagrant and https://wikitech.wikimedia.org/wiki/Help:MediaWiki-Vagrant_in_Labs
[16:36:56] (PS2) Paladox: [RelatedArticles] Move jshint to check: [integration/config] - https://gerrit.wikimedia.org/r/246986
[16:37:00] jzerebecki: Ok done.
[16:37:12] mobrovac: if scap can deploy WIP code to a random jessie VM, then that would improve the ability to test services and config changes significantly
[16:37:46] and satisfy the "ease of testing / scale down" requirement
[16:38:28] gwicke: i agree ofc, but my point is that for the time being having to have puppet in your labs project (which you have automatically now iirc), it's not that hard (for us) to do just that
[16:38:47] and, as i said, i think these things can be worked around
[16:39:00] as quite a lot of stuff in scap3 is actually configurable
[16:39:38] so, i'd like to see it working super-flawlessly in beta first, and then test in other projects and then production
[16:40:09] Release-Engineering-Team, Performance-Team, Discovery-Cirrus-Sprint: Investigate loadEventEnd / domComplete regression - https://phabricator.wikimedia.org/T116137#1742744 (Krinkle) The sheer payload size of adding a new script should in itself not cause a regression of this sort, so I suspect is it not...
[16:40:35] sure, I'm fine with waiting with the requirement evaluation until it works for a limited use case
[16:40:43] and keep in mind that today was the first time we actually tried it
[16:41:09] yes, but that's only one of the requirements ...
[16:41:23] just as important as the other ones
[16:42:23] I am not leading the scap3 effort in any way but I am pretty sure we all want the transition to be as smooth as possible for deployers
[16:42:25] scap3 will need comprehensive docs hasharMeeting thcipriani
[16:42:41] those docs should also be short
[16:42:43] (CR) JanZerebecki: "Until T114691 is solved you have the choice of testing with optional dependencies or without, but not both. This extension is used togethe" [integration/config] - https://gerrit.wikimedia.org/r/245903 (owner: Paladox)
[16:42:48] reflecting the underlying ease of use
[16:42:54] https://doc.wikimedia.org/mw-tools-scap/ is sparse
[16:43:17] we will probably want to add a page describing the big picture overview
[16:43:18] (CR) JanZerebecki: [C: 2] [RelatedArticles] Move jshint to check: [integration/config] - https://gerrit.wikimedia.org/r/246986 (owner: Paladox)
[16:43:24] and probably a step-by-step 101 tutorial
[16:43:33] docs are on the radar
[16:43:41] gwicke: and for the "i just want to deploy from tin" part they'll be, for sure "put this here, that there and run deploy", but setting it up is something completely different as there are bits and pieces
[16:44:05] all of the features are documented in the patch commit messages, I've started transferring that to a wiki page: https://wikitech.wikimedia.org/wiki/Scap3
[16:44:21] cool twentyafterfour
[16:44:35] twentyafterfour: can we get that attached to the scap code? I.e. via the Sphinx .rst doc?
[16:44:39] Release-Engineering-Team, Performance-Team, Discovery-Cirrus-Sprint: Investigate loadEventEnd / domComplete regression - https://phabricator.wikimedia.org/T116137#1742756 (EBernhardson) Can refer to this graph (from ori) later to compare the today vs. yesterday timings: http://graphite.wikimedia.org/ren...
[16:44:39] i think we are suggesting here we'll need a "user manual" as well
[16:45:06] hashar: yeah but sphinx hates me I got very frustrated trying to document it that way so I did what was expedient
[16:45:06] yup definitely
[16:45:24] twentyafterfour: at least you gather the doc bits from commit messages!
[16:45:43] I think a wiki page makes more sense for user manual stuff really [16:45:45] Sphinx has its learning curve, but it should not be too hard to catch up with [16:45:55] at least it's easier for non-pythonistas to contribute to [16:48:17] I just don't like that everything else we use is either markdown or wikitext (or phabricator's mixture of markdown and wikitext) but rst is quite different and sphinx is a white-space Nazi [16:49:31] lol [16:50:03] 10Deployment-Systems, 3Scap3: (Re)implement logging output for scap targets - https://phabricator.wikimedia.org/T115137#1742768 (10dduvall) 5Open>3Resolved [16:50:30] jzerebecki: Hi the new zend-composer isn't running on experimental on the new test that was merged. [16:50:37] remembering 4 markup languages....is only slightly better than just using xml ;) [16:50:44] paladox: not yet deployed [16:50:57] jzerebecki: Ok. [16:52:58] (03CR) 10JanZerebecki: [C: 032] [RelatedArticles] Move jshint to check: [integration/config] - 10https://gerrit.wikimedia.org/r/246986 (owner: 10Paladox) [16:53:59] (03Merged) 10jenkins-bot: [RelatedArticles] Move jshint to check: [integration/config] - 10https://gerrit.wikimedia.org/r/246986 (owner: 10Paladox) [16:55:39] (03PS3) 10JanZerebecki: [Translate] Add dependance on cldr extension [integration/config] - 10https://gerrit.wikimedia.org/r/245903 (owner: 10Paladox) [16:55:42] 10Browser-Tests, 10Wikidata: move wikidata browsertests to not use saucelabs - https://phabricator.wikimedia.org/T116166#1742206 (10zeljkofilipin) This should be really easy to do. Our framework supports running tests on a local browser or Sauce Labs. Let me know if you need help with this, or if you would lik... 
[16:55:56] 10Browser-Tests, 10Wikidata, 7Easy: move wikidata browsertests to not use saucelabs - https://phabricator.wikimedia.org/T116166#1742789 (10zeljkofilipin) p:5Triage>3Normal [16:56:11] 10Deployment-Systems, 3Scap3: (Re)implement logging output for scap targets - https://phabricator.wikimedia.org/T115137#1742791 (10dduvall) [16:57:56] !log deployed restbase to deployment-restbase0{1,2} with scap3 [16:58:23] :) [16:58:37] late to the party: downstairs bathroom is getting demolished today. [16:59:35] (03CR) 10JanZerebecki: [C: 032] "Deployed to Jenkins jobs: ['mwext-Translate-testextension-hhvm', 'mwext-Translate-testextension-zend']" [integration/config] - 10https://gerrit.wikimedia.org/r/245903 (owner: 10Paladox) [17:00:08] 6Release-Engineering-Team, 6Performance-Team, 3Discovery-Cirrus-Sprint: Investigate loadEventEnd / domComplete regression as of 2015-10-13 - https://phabricator.wikimedia.org/T116137#1742827 (10Krinkle) [17:01:04] (03CR) 10Paladox: "Thanks." [integration/config] - 10https://gerrit.wikimedia.org/r/245903 (owner: 10Paladox) [17:01:19] (03Merged) 10jenkins-bot: [Translate] Add dependance on cldr extension [integration/config] - 10https://gerrit.wikimedia.org/r/245903 (owner: 10Paladox) [17:02:29] (03PS2) 10JanZerebecki: [Translate] Add qunit tests [integration/config] - 10https://gerrit.wikimedia.org/r/245493 (owner: 10Paladox) [17:02:39] (03CR) 10JanZerebecki: [C: 032] [Translate] Add qunit tests [integration/config] - 10https://gerrit.wikimedia.org/r/245493 (owner: 10Paladox) [17:03:17] (03CR) 10Paladox: "Thanks." 
[integration/config] - 10https://gerrit.wikimedia.org/r/245493 (owner: 10Paladox) [17:03:29] (03Merged) 10jenkins-bot: [Translate] Add qunit tests [integration/config] - 10https://gerrit.wikimedia.org/r/245493 (owner: 10Paladox) [17:22:05] 10Beta-Cluster-Infrastructure, 6Release-Engineering-Team: Proposal: Force any WARNINGs on Beta Cluster to fail completely - https://phabricator.wikimedia.org/T115633#1742895 (10mmodell) We should build an error console that integrates into the page so that errors surface instead of getting buried in the logs.... [17:33:17] !log reloading zuul for 9362473..ec1313d [17:34:22] paladox: ^^ its now deployed [17:35:09] can someone get the logmsgbot back? [17:37:01] eh, it's still running in -operations [17:37:09] is it really "logmsgbot" that's missing? [17:37:19] there are 2 bots working together [17:41:31] qa-morebots is what we need [17:41:47] (03CR) 10JanZerebecki: "(Testrun: https://gerrit.wikimedia.org/r/#/c/242572/ )" [integration/config] - 10https://gerrit.wikimedia.org/r/247649 (https://phabricator.wikimedia.org/T114250) (owner: 10Hashar) [17:43:18] 10Continuous-Integration-Infrastructure, 10pywikibot-core, 5Patch-For-Review: run at least pep8 and pep257 for new changesets submitted to pywikibot/core for any user - https://phabricator.wikimedia.org/T87169#1742964 (10JanZerebecki) The tox-jessie job now runs for non-whitelisted users. Example: https://ge... 
[17:43:58] (03CR) 10JanZerebecki: "(Testrun: https://gerrit.wikimedia.org/r/#/c/247658/ )" [integration/config] - 10https://gerrit.wikimedia.org/r/245903 (owner: 10Paladox) [17:44:54] (03CR) 10JanZerebecki: "(Testrun: https://gerrit.wikimedia.org/r/#/c/247658/ )" [integration/config] - 10https://gerrit.wikimedia.org/r/245493 (owner: 10Paladox) [17:45:17] 10Beta-Cluster-Infrastructure, 7Blocked-on-RelEng, 10Continuous-Integration-Config, 10Parsoid, 5Patch-For-Review: Parsoid patches don't update Beta Cluster automatically -- only deploy repo patches seem to update that code - https://phabricator.wikimedia.org/T92871#1742971 (10cscott) @subbu, @arlolra --... [17:47:33] (03CR) 10JanZerebecki: "Sorry wrong one. https://gerrit.wikimedia.org/r/#/c/237734/" [integration/config] - 10https://gerrit.wikimedia.org/r/245903 (owner: 10Paladox) [17:47:42] (03CR) 10JanZerebecki: "Sorry wrong one. https://gerrit.wikimedia.org/r/#/c/237734/" [integration/config] - 10https://gerrit.wikimedia.org/r/245493 (owner: 10Paladox) [17:50:50] marktraceur: yo yo, quick question about the UploadWizard job that runs from Jenkins against production commons that, i think, gah, what's his name, the volunteer duded from Google Switzerland... [17:51:01] marktraceur: question is: do you care if it is deleted? [18:00:49] greg-g: We decided that we definitely wanted it deleted this morning at the team meeting. 
[18:01:25] 10Deployment-Systems, 3Scap3, 5Patch-For-Review: Fetch submodules from the deploy host - https://phabricator.wikimedia.org/T113777#1743089 (10thcipriani) 5Open>3Resolved [18:04:38] 10Deployment-Systems, 3Scap3: Scap3's checks.yaml file should be optional - https://phabricator.wikimedia.org/T116204#1743109 (10thcipriani) 3NEW [18:05:09] greg-g: no spam notices for phab mail for me today [18:05:22] they seem to have stopped [18:06:57] James_F: https://waffle.io/jquery/qunit [18:07:04] Workboards for GitHub issues [18:08:41] James_F: kk, I was just making sure the tech lead was aware (as I understand Editing/Multimedia structure) [18:08:56] 10Beta-Cluster-Infrastructure, 10Analytics, 6Services: Set up AQS in Beta - https://phabricator.wikimedia.org/T116206#1743135 (10mobrovac) 3NEW [18:08:58] mobrovac: yay! [18:09:12] re spam :) [18:10:22] 10Deployment-Systems, 3Scap3: enforcing deployment from `/srv/deployment` is wrong - https://phabricator.wikimedia.org/T116207#1743145 (10thcipriani) 3NEW [18:16:15] 10Deployment-Systems, 3Scap3: default lock file for scap3 should be repo-dependent - https://phabricator.wikimedia.org/T116208#1743174 (10thcipriani) 3NEW [18:17:18] 6Release-Engineering-Team, 6operations, 7Database, 5Patch-For-Review: Recover missing values from user_properties tables - https://phabricator.wikimedia.org/T114899#1743184 (10demon) 5Open>3Resolved a:3demon >>! In T114899#1709471, @jcrespo wrote: > We can recover preferences on an individual bases,... 
[18:20:02] (03PS1) 10Paladox: [ParserHooks] Update Jenkins tests [integration/config] - 10https://gerrit.wikimedia.org/r/247893 [18:27:17] (03PS2) 10Paladox: [ParserHooks] Update Jenkins tests [integration/config] - 10https://gerrit.wikimedia.org/r/247893 [18:33:12] (03PS1) 10Greg Grossmeier: Remove the UploadWizard API test from production [integration/config] - 10https://gerrit.wikimedia.org/r/247895 (https://phabricator.wikimedia.org/T115465) [18:39:44] Project browsertests-Wikidata-WikidataTests-linux-firefox-sauce build #397: 15ABORTED in 1 min 59 sec: https://integration.wikimedia.org/ci/job/browsertests-Wikidata-WikidataTests-linux-firefox-sauce/397/ [18:42:50] Project browsertests-Wikidata-WikidataTests-linux-firefox-sauce build #398: 04STILL FAILING in 4 sec: https://integration.wikimedia.org/ci/job/browsertests-Wikidata-WikidataTests-linux-firefox-sauce/398/ [18:53:31] Project UploadWizard-api-commons.wikimedia.org build #2174: 04FAILURE in 30 sec: https://integration.wikimedia.org/ci/job/UploadWizard-api-commons.wikimedia.org/2174/ [18:54:44] (03PS1) 10JanZerebecki: [WIP] Run Wikidata browsertests without saucelabs [integration/config] - 10https://gerrit.wikimedia.org/r/247901 (https://phabricator.wikimedia.org/T116166) [19:10:08] 10Continuous-Integration-Infrastructure: Re-create ci slaves (April 2015) - https://phabricator.wikimedia.org/T94916#1743392 (10hashar) [19:10:10] 10Continuous-Integration-Infrastructure: Re-create ci slaves (March 2015) - https://phabricator.wikimedia.org/T91524#1743393 (10hashar) [19:10:13] 10Continuous-Integration-Infrastructure, 6operations, 7Puppet: Puppet (silently) fails to setup apache on new trusty instances - https://phabricator.wikimedia.org/T91832#1743389 (10hashar) 5Open>3Resolved a:3hashar I haven't hit that issue when building a new Trusty slave. 
[19:27:17] 6Release-Engineering-Team, 6Performance-Team, 3Discovery-Cirrus-Sprint: Investigate loadEventEnd / domComplete regression as of 2015-10-13 - https://phabricator.wikimedia.org/T116137#1743482 (10ori) a:3hashar Looks like rEWMVc5b6de was indeed the cause of the regression. Thanks for the help, @hashar, and t... [19:27:33] 6Release-Engineering-Team, 6Performance-Team, 3Discovery-Cirrus-Sprint: Investigate loadEventEnd / domComplete regression as of 2015-10-13 - https://phabricator.wikimedia.org/T116137#1743485 (10ori) 5Open>3Resolved [19:38:25] (03CR) 10Hashar: [C: 032] "Great thank you. I have deleted the job https://integration.wikimedia.org/ci/job/UploadWizard-api-commons.wikimedia.org/" [integration/config] - 10https://gerrit.wikimedia.org/r/247895 (https://phabricator.wikimedia.org/T115465) (owner: 10Greg Grossmeier) [19:40:12] (03Merged) 10jenkins-bot: Remove the UploadWizard API test from production [integration/config] - 10https://gerrit.wikimedia.org/r/247895 (https://phabricator.wikimedia.org/T115465) (owner: 10Greg Grossmeier) [19:40:21] 10Continuous-Integration-Config, 6Commons, 6Multimedia, 10UploadWizard: Adjust UploadWizard API test suite to match commons.wikimedia.org policy for bots - https://phabricator.wikimedia.org/T115469#1743528 (10hashar) 5Open>3declined a:3hashar The job has been deleted following discussion T115465. In... [19:45:45] 6Release-Engineering-Team, 6Performance-Team, 3Discovery-Cirrus-Sprint: Investigate loadEventEnd / domComplete regression as of 2015-10-13 - https://phabricator.wikimedia.org/T116137#1743556 (10hashar) @ori wrote: > Thanks for the help, @hashar, You are most welcome. It is been quite fun to dig in the log w... 
[19:46:09] 6Release-Engineering-Team, 6Performance-Team, 3Discovery-Cirrus-Sprint, 7WorkType-Maintenance: Investigate loadEventEnd / domComplete regression as of 2015-10-13 - https://phabricator.wikimedia.org/T116137#1743558 (10hashar) [19:46:55] 6RelEng-Admin, 15User-greg, 7WorkType-NewFunctionality: Tag some portion of RelEng team tasks with "New" or "Maint" - https://phabricator.wikimedia.org/T109375#1743559 (10hashar) [19:48:00] 6Release-Engineering-Team, 6operations, 7Database, 5Patch-For-Review, 7WorkType-Maintenance: Recover missing values from user_properties tables - https://phabricator.wikimedia.org/T114899#1743574 (10hashar) [19:49:33] 10Beta-Cluster-Infrastructure, 10Analytics, 6Services: Set up AQS in Beta - https://phabricator.wikimedia.org/T116206#1743578 (10hashar) [19:50:07] 10Beta-Cluster-Infrastructure, 10Analytics, 6Services, 7WorkType-NewFunctionality: Set up AQS in Beta - https://phabricator.wikimedia.org/T116206#1743135 (10hashar) [19:50:39] 10Beta-Cluster-Infrastructure, 10Browser-Tests, 7WorkType-NewFunctionality: Make selenium users use botflags at beta-cluster - https://phabricator.wikimedia.org/T116027#1743585 (10hashar) p:5Triage>3Normal [19:51:02] 10Beta-Cluster-Infrastructure, 6Release-Engineering-Team, 7WorkType-NewFunctionality: Proposal: Force any WARNINGs on Beta Cluster to fail completely - https://phabricator.wikimedia.org/T115633#1743590 (10hashar) p:5Triage>3Normal [19:51:14] 10Beta-Cluster-Infrastructure, 10Continuous-Integration-Infrastructure, 10MediaWiki-Database, 7Database, 7WorkType-NewFunctionality: Enable MariaDB/MySQL's Strict Mode - https://phabricator.wikimedia.org/T108255#1743593 (10hashar) [19:54:36] haha [19:54:36] Subject: [Maniphest] T109375: Tag some portion of RelEng team tasks with "New" or "Maint" [19:54:41] hashar added a project: WorkType-NewFunctionality. 
[19:54:43] :) [19:57:30] meta [20:03:36] 10Beta-Cluster-Infrastructure, 10Continuous-Integration-Infrastructure, 10MediaWiki-Database, 7Database, 7WorkType-NewFunctionality: Enable MariaDB/MySQL's Strict Mode - https://phabricator.wikimedia.org/T108255#1743624 (10hashar) Lets get it enabled on beta. A good way to exercise beta is to pick some... [20:08:12] 10Beta-Cluster-Infrastructure, 7Blocked-on-RelEng, 10Continuous-Integration-Config, 10Parsoid, 5Patch-For-Review: Parsoid patches don't update Beta Cluster automatically -- only deploy repo patches seem to update that code - https://phabricator.wikimedia.org/T92871#1743637 (10hashar) Current state: ``` l... [20:08:50] is jenkins a little spotty today? we've been having trouble with a couple patches not running their test triggers [20:08:55] 10Beta-Cluster-Infrastructure, 7Blocked-on-RelEng, 10Continuous-Integration-Config, 10Parsoid, and 2 others: Parsoid patches don't update Beta Cluster automatically -- only deploy repo patches seem to update that code - https://phabricator.wikimedia.org/T92871#1743642 (10hashar) [20:09:16] for example, i did a recheck on this a minute ago and it still isn't running: https://gerrit.wikimedia.org/r/#/c/247616/ [20:09:47] 6Release-Engineering-Team, 10Gerrit, 7Jenkins, 7Zuul: Clarification/documentation needed with relation to branches and tests run via Jenkins - https://phabricator.wikimedia.org/T116221#1743646 (10Jdlrobson) 3NEW [20:12:00] 10Beta-Cluster-Infrastructure, 6Release-Engineering-Team, 7WorkType-NewFunctionality: Proposal: Force any WARNINGs on Beta Cluster to fail completely - https://phabricator.wikimedia.org/T115633#1743659 (10hashar) MediaWiki has an integrated debug toolbar that does contain a bunch of logs https://www.mediawik... 
[20:14:27] Hi could someone fix metrolook merging at https://integration.wikimedia.org/zuul/ since it seems it stuck in gate and submit i updated the access rules and it said jenkins cannot merge so i just forced merged it is merged but is now stuck in gate and submit [20:17:48] 10Continuous-Integration-Infrastructure, 6Release-Engineering-Team, 7Jenkins, 7Zuul: Clarification/documentation needed with relation to branches and tests run via Jenkins - https://phabricator.wikimedia.org/T116221#1743685 (10greg) [20:22:44] (03PS1) 10Legoktm: Whitelist Wikinaut [integration/config] - 10https://gerrit.wikimedia.org/r/247916 [20:24:48] paladox: I don't see it? [20:25:00] (03CR) 10Legoktm: [C: 032] Whitelist Wikinaut [integration/config] - 10https://gerrit.wikimedia.org/r/247916 (owner: 10Legoktm) [20:25:27] greg-g: Seems to have gone now. [20:25:45] kk [20:25:53] (03Merged) 10jenkins-bot: Whitelist Wikinaut [integration/config] - 10https://gerrit.wikimedia.org/r/247916 (owner: 10Legoktm) [20:26:15] 10Beta-Cluster-Infrastructure, 6operations, 7WorkType-NewFunctionality: etcd/confd is not started on deployment-cache-mobile04 - https://phabricator.wikimedia.org/T116224#1743724 (10hashar) 3NEW [20:26:19] !log deploying https://gerrit.wikimedia.org/r/247916 [20:28:30] legoktm: thanks for figuring out niedzielski's issue [20:28:47] 10Continuous-Integration-Infrastructure, 6Release-Engineering-Team, 7Zuul: Clarification/documentation needed with relation to branches and tests run via Jenkins - https://phabricator.wikimedia.org/T116221#1743759 (10hashar) [20:28:58] greg-g legoktm: +1, thanks! [20:31:07] :) np [20:41:19] Could i have some help to fix a bug in TimedMediaHandler tests. I am getting errors in https://integration.wikimedia.org/ci/job/mediawiki-extensions-hhvm/36748/console since re enabling a specific test hashar disabled because it was also erroring out. 
[20:41:22] https://gerrit.wikimedia.org/r/#/c/247917/1 [20:41:38] Task is at https://phabricator.wikimedia.org/T63877 [20:44:15] 10Continuous-Integration-Infrastructure, 6Release-Engineering-Team, 7Zuul: Clarification/documentation needed with relation to branches and tests run via Jenkins - https://phabricator.wikimedia.org/T116221#1743825 (10hashar) Jobs that have multiple repositories tends to use `zuul-cloner` which for each repos... [20:45:13] 10Continuous-Integration-Infrastructure, 6Release-Engineering-Team, 7Zuul: Clarification/documentation needed with relation to branches and tests run via Jenkins - https://phabricator.wikimedia.org/T116221#1743827 (10hashar) @Jdlrobson if you have any idea of a good place to document all of that mess I am wi... [20:50:52] 10Continuous-Integration-Infrastructure, 10pywikibot-core, 5Patch-For-Review: run at least pep8 and pep257 for new changesets submitted to pywikibot/core for any user - https://phabricator.wikimedia.org/T87169#1743847 (10jayvdb) 5Open>3Resolved >>! In T87169#1742964, @JanZerebecki wrote: > The tox-jessie... [20:51:01] 10Continuous-Integration-Infrastructure, 6Release-Engineering-Team, 7Documentation, 7Zuul: Clarification/documentation needed with relation to branches and tests run via Jenkins - https://phabricator.wikimedia.org/T116221#1743850 (10hashar) [20:51:33] 10Continuous-Integration-Infrastructure, 10pywikibot-core, 5Patch-For-Review: run at least pep8 and pep257 for new changesets submitted to pywikibot/core for any user - https://phabricator.wikimedia.org/T87169#1743853 (10jayvdb) a:5jayvdb>3JanZerebecki [20:53:24] 10Continuous-Integration-Infrastructure, 6Release-Engineering-Team, 7Documentation, 7Zuul: Clarification/documentation needed with relation to branches and tests run via Jenkins - https://phabricator.wikimedia.org/T116221#1743858 (10hashar) There is also mail I posted on the QA list. **[QA] How Zuul gatin... 
[21:02:51] 6Release-Engineering-Team, 6Performance-Team, 3Discovery-Cirrus-Sprint, 7WorkType-Maintenance: Investigate loadEventEnd / domComplete regression as of 2015-10-13 - https://phabricator.wikimedia.org/T116137#1743893 (10EBernhardson) I've asked our two new frontend engineers in discovery, @jgirault and @jdrew... [21:02:54] 10Beta-Cluster-Infrastructure, 10Deployment-Systems, 7Beta-Cluster-reproducible: Scap failing with 'bool' object has no attribute 'encode' - https://phabricator.wikimedia.org/T114852#1743897 (10hashar) The Jenkins job beta-scap-eqiad fails in creative way and that last failure is unrelated to this task:... [21:03:19] 10Continuous-Integration-Infrastructure, 10pywikibot-core, 5Patch-For-Review: run at least pep8 and pep257 for new changesets submitted to pywikibot/core for any user - https://phabricator.wikimedia.org/T87169#1743899 (10XZise) PAH! Anyway ;) The tasks to watch are probably https://gerrit.wikimedia.org/r/243... [21:24:08] (03PS1) 10Paladox: Add new extension-unittests-composer template [integration/config] - 10https://gerrit.wikimedia.org/r/247920 [21:24:37] (03PS2) 10Paladox: Add new extension-unittests-composer template [integration/config] - 10https://gerrit.wikimedia.org/r/247920 [21:42:05] greg-g: (Not urgent) Do you know who "owns" https://phabricator.wikimedia.org/T87036 (last blocker of HHVM-everywhere)? RelEng I guess? Is it going to get resourced? [21:44:02] ops+releng [21:44:11] we sound like we own a lot of stuff, but we can't really :) [21:44:46] snapshot hosts block it too [21:45:05] Reedy: They're all converted to Trusty, AIUI. [21:45:14] Yeah [21:45:19] You said blocker of hhvm-everywhere [21:45:22] https://phabricator.wikimedia.org/T94277 [21:45:28] snapshots are using zend still, but on trustry [21:45:33] Due to hhvm upstream bug [21:45:35] OK, I actually care about MW moving to PHP 5.4. 
[21:45:42] Well, state that :P [21:45:58] Reedy: Sadly someone came along and "fixed" all the bugs about 5.4 to turn them into HHVM. [21:46:06] :/ [21:46:15] Indeed. [21:46:16] I'll need to get this whole chain of things back into my mental ram, don't really have time for that now... I wonder who can help with this... [21:46:17] Shouldn't we be going to 5.5? [21:46:27] based on what's in 14.04 [21:46:28] Reedy: Yes, but the two-year-old-agreed-RfC was for 5.4 [21:46:44] Reedy: I'd not be sad to ignore that and move straight to 5.5. [21:46:52] But 7.0 will be out by the time we get there. :-( [21:47:21] but why are we/wmf caring about that rfc for our maint/work machines? [21:47:33] * greg-g needs to not multitask right now, I think, [21:47:35] ignore me [21:47:44] So we can polluteuse PHP 5.4 features all over mediawiki [21:47:45] /ignore greg-g!* [21:47:56] Reedy: right right, gotcha [21:48:27] Specifically, there are libraries some want to use in core that are 5.5+ only. [21:48:31] biggest thing with tin/terbium is what turns out to be unpuppetised [21:48:35] [ arrrayyyyyyyyyys ] [21:48:43] right [21:48:48] BUT! I think Ops were going to give new hardware as tin/terbium are out of warranty, or all but [21:48:50] 10Beta-Cluster-Infrastructure, 6Release-Engineering-Team, 7WorkType-NewFunctionality: Proposal: Force any WARNINGs on Beta Cluster to fail completely - https://phabricator.wikimedia.org/T115633#1744006 (10mmodell) In order to test anything effectively we would need to bypass cache, right? You won't see an er... [21:49:06] fwiw, 5.4 has already left the supported time period [21:49:13] no security patches will be released for 5.4 [21:49:19] lols [21:49:34] hmm [21:49:44] Why do php.net still list it on the front page then? 
[21:49:51] and 5.5 is only supported for 8 more months [21:49:52] http://php.net/supported-versions.php [21:50:08] heh [21:50:22] we actually employ the 5.4 release manager, but that's not his job anymore because it's not getting any more releases :P [21:59:23] I don't think anyone would lose sleep if we went straight to 5.5 for our min php version [22:02:07] BUT! I think Ops were going to give new hardware as tin/terbium are out of warranty, or all but [22:02:35] We already have mira and AFAIK we could switch over to that if tin died tomorrow, as long as we could rebuild the mediawiki-staging dir easily enough (even from a backup) [22:03:10] we should just start rsyncing it [22:04:08] I think there's only the private repo that we'd have issues if we left it [22:06:26] Well we'd lose the history of the private repo [22:06:39] But all of the other hosts would still have it, and it'd probably be in the backups etc. [22:07:23] there is a task about rsyncing fully (i.e. including .git) between scap masters [22:08:17] yeah [22:08:29] I think bd808 has a WIP patch for that [22:08:38] We discussed it originally ages ago [22:08:54] https://gerrit.wikimedia.org/r/#/c/224313/ [22:10:02] 10Browser-Tests, 10Continuous-Integration-Infrastructure, 10VisualEditor, 5WMF-deploy-2015-10-13_(1.27.0-wmf.3): browsertests-VisualEditor-language-screenshot-os_x_10.10-firefox jenkins job failing - https://phabricator.wikimedia.org/T111510#1744055 (10Jdforrester-WMF) [22:14:44] I made something that mostly works. It had some file permissions issues on the beta cluster and I never got around to double checking vs tin to see if the files with ownership outside the wikidev group were on purpose or an accident on deployment-bastion [22:16:31] I was trying to encourage ostriches to take it over and make things work. 
I kind of ran into "real work" that has kept me from poking at scap much [22:17:53] There's some silly files in prod that only root can deploy, iirc [22:18:22] /srv/mediawiki-staging/wmf-config/db-*.php [22:18:46] I think the beta ones were directories... the tests are lots to time in jenkins [22:18:54] *lost to time [22:19:01] sure, but we have to make this work in prod too eventually :) [22:19:32] It should be easy to turn back on and test again. I think I just pulled the patch from bastion [22:56:57] 6Release-Engineering-Team, 6operations, 5Patch-For-Review: deployment: user trebuchet gets added and removed from group wikidev on every puppet run - https://phabricator.wikimedia.org/T115760#1744246 (10chasemp) 5Open>3Resolved done uid=995(trebuchet) gid=10004(trebuchet) groups=10004(trebuchet) [23:10:21] 10Deployment-Systems, 6operations, 5Patch-For-Review: install/deploy mira as codfw deployment server - https://phabricator.wikimedia.org/T95436#1744283 (10chasemp) [23:10:23] 10Deployment-Systems, 6operations: errors reported by "eventual_consistency_deployment_server_init" on new deploy server - https://phabricator.wikimedia.org/T99928#1744280 (10chasemp) 5Open>3Resolved a:3chasemp this is no longer happening [23:12:14] 10Beta-Cluster-Infrastructure: Set 'cluster' salt grain appropriately for all instances in beta cluster - https://phabricator.wikimedia.org/T87199#1744289 (10chasemp) [23:19:18] 10Deployment-Systems, 6operations, 5Patch-For-Review: install/deploy mira as codfw deployment server - https://phabricator.wikimedia.org/T95436#1744308 (10Dzahn) so, additional confirmation that the rules on mira look like everything that is needed on tin would not hurt. or we could try deploying from mira,... [23:23:45] mutante: I think we just need to fixup co-master syncing [23:23:54] So the staging dir gets copied around correctly etc [23:24:47] Reedy: so i think it's blocked by "scap multi-master support" last time i checked. 
adding firewalls to tin is easier [23:25:03] just that we should be around during the first deploy after that [23:25:37] the rules on mira _should_ be all that is needed [23:25:55] just that we haven't actual proof [23:26:20] 10Deployment-Systems, 6Release-Engineering-Team, 10Staging, 10Salt, 7Puppet: provider => trebuchet doesn't work until manual 'git deploy start' on deployment-server - https://phabricator.wikimedia.org/T92978#1744327 (10chasemp) [23:26:22] bd808, is there still that issue where deploying from mira actually tells all hosts to pull from tin? [23:26:24] feel free to check if the mira rules look like all there is to it or not [23:26:42] either on the host itself or just by looking at the puppet role [23:26:45] ferm rules [23:31:18] Krenair: I don't think so. The mirror list should work the same from tin or anywhere else [23:31:49] Unless mira is missing the dsh files needed I guess [23:32:07] So I can run a deployment from mira to test it? [23:32:37] (after running a full rsync of mediawiki-staging from tin to mira) [23:34:00] they should have identical roles, incl dsh [23:34:09] unless we need to puppetize it on tin too [23:34:46] we just removed an include dsh from bast1001 because we said it's only used on deployment servers [23:36:48] I once tried to deploy from mira only to find that all of the target hosts still tried to pull from tin [23:37:13] So I think all the dsh stuff needed was there [23:46:45] 10Deployment-Systems, 3Scap3: enforcing deployment from `/srv/deployment` is wrong - https://phabricator.wikimedia.org/T116207#1744417 (10mmodell) See also: serve deployment files directly from the scap process, rather than a separate http server. 
[23:55:19] I wonder how I got the files there in the first place [23:55:49] oh, I know [23:56:25] SSH_AUTH_SOCK=/run/keyholder/proxy.sock rsync -ar mwdeploy@tin:/srv/mediawiki-staging /srv [23:58:20] 10Differential, 10Gerrit-Migration: Support automatic commit generation in Phabricator - https://phabricator.wikimedia.org/T182#1744480 (10mmodell) >>! In T182#1632547, @mmodell wrote: > > Also, upstream is going to implement this eventually. And now it looks like "eventually" became "soon": it's already d...