[05:34:05] 10DBA: Drop flaggedrevs tables on wikis where it is not enabled - https://phabricator.wikimedia.org/T174801#4177490 (10Marostegui) [05:34:07] 10DBA: Drop flaggedrevs tables at eswiki - https://phabricator.wikimedia.org/T193678#4177486 (10Marostegui) 05Open>03Resolved p:05Triage>03Normal a:03Marostegui They were all empty, so I have dropped them [05:36:05] 10DBA: Drop flaggedrevs tables at metawiki - https://phabricator.wikimedia.org/T193390#4167712 (10Marostegui) I would assume the data is necessary to be backed up, no? There is only data on these tables: ``` flaggedrevs +----------+ | count(*) | +----------+ | 185 | +----------+ flaggedrevs_promote +-------... [05:37:26] 10DBA: Drop flaggedrevs tables at eswikibooks - https://phabricator.wikimedia.org/T193676#4177494 (10Marostegui) p:05Triage>03Normal There is not much data, I assume it is not needed to back it up? ``` flaggedimages +----------+ | count(*) | +----------+ | 43 | +----------+ flaggedpage_config +--------... [05:39:48] 10DBA, 10Operations, 10Goal, 10Patch-For-Review: Convert all sanitarium hosts to multi-instance and increase its reliability/redundancy - https://phabricator.wikimedia.org/T190704#4177496 (10Marostegui) db1116 is now replicating a multi-instance sanitized copy (also checked with check_private_data) of the... [05:40:29] 10DBA, 10Operations, 10Goal, 10Patch-For-Review: Convert all sanitarium hosts to multi-instance and increase its reliability/redundancy - https://phabricator.wikimedia.org/T190704#4177497 (10Marostegui) [05:47:14] 10DBA: Drop flaggedrevs tables on wikis where it is not enabled - https://phabricator.wikimedia.org/T174801#4177502 (10Marostegui) [05:55:28] 10DBA: Drop flaggedrevs tables at metawiki - https://phabricator.wikimedia.org/T193390#4177505 (10Marostegui) I have backuped them just in case and left them at: `dbstore1001:/srv/backups/tmp/T193390.tar.gz` [06:02:23] 10DBA: Drop flaggedrevs tables on wikis where it is not enabled - https://phabricator.wikimedia.org/T174801#4177517 (10Marostegui) [06:02:25] 10DBA: Drop flaggedrevs tables at metawiki - https://phabricator.wikimedia.org/T193390#4177513 (10Marostegui) 05Open>03Resolved p:05Triage>03Normal a:03Marostegui Dropped them [06:02:41] 10DBA: Drop flaggedrevs tables on wikis where it is not enabled - https://phabricator.wikimedia.org/T174801#3573416 (10Marostegui) [07:24:58] 10DBA: Drop flaggedrevs tables at eswikibooks - https://phabricator.wikimedia.org/T193676#4177599 (10Marostegui) Left a backup at: `dbstore1001:/srv/backups/tmp/T193676.tar.gz` [07:28:11] 10DBA: Drop flaggedrevs tables on wikis where it is not enabled - https://phabricator.wikimedia.org/T174801#4177616 (10Marostegui) [07:28:13] 10DBA: Drop flaggedrevs tables at eswikibooks - https://phabricator.wikimedia.org/T193676#4177613 (10Marostegui) 05Open>03Resolved a:03Marostegui Tables dropped [07:28:22] 10DBA: Drop flaggedrevs tables on wikis where it is not enabled - https://phabricator.wikimedia.org/T174801#3573416 (10Marostegui) [07:34:03] 10DBA, 10Operations, 10ops-codfw, 10Patch-For-Review: db2081 crashed/rebooted, probably due to hardware failure - https://phabricator.wikimedia.org/T193325#4177621 (10jcrespo) The check detected some difference, but they could be false positives, checking again. [08:02:30] 10DBA, 10Operations-Software-Development, 10Patch-For-Review: Debmonitor: request for misc DB allocation - https://phabricator.wikimedia.org/T192875#4152620 (10jcrespo) a:03jcrespo [08:27:22] jynus: for debmonitor, assuming that it will end up on m2, what DNS/IP should I put in the config? (I'm just working on the puppet patches) [08:27:45] m2-master.eqiad.wmnet [08:28:00] we don't yet have a proper deployment on codfw [08:28:10] but it is not yet ready [08:28:52] ack, thanks [08:29:36] I was checking the available collations and I ended up filing a mysql bug [08:29:44] lol [08:29:55] url! [08:29:58] as always when getting down a rabbit hole [08:30:52] but don't worry, I know is not yet there (the DB account) and most CRs are still pending final review, so don't worry [08:30:52] so I will create the database as utf8mb4_unicode_520_ci [08:30:54] This is jynus http://i.imgur.com/QMFZ8Zq.gif [08:30:55] it's not a blocker [08:31:07] because there is not yet recent unicode support [08:34:17] but that doesn't support emoji [08:35:59] lol, I don't plan to have emojis in debmonitor :D [08:36:09] this is not because of you [08:36:24] we need its support, and I am testing it at the same time [08:36:38] I imagined ;) [08:41:12] 10DBA, 10Operations-Software-Development, 10Patch-For-Review: Debmonitor: request for misc DB allocation - https://phabricator.wikimedia.org/T192875#4177729 (10jcrespo) ``` root@db1051[debmonitor]> SELECT * FROM test; +------+ | c | +------+ | 💩 | +------+ 1 row in set (0.00 sec) root@db1051[debmonit... [08:42:53] thanks for the encouraging emoji :-P [08:45:36] xddddddd [08:48:04] volans: do you really need all privileges from the web interfaces? [08:48:20] normally the web interfaces get select, or modify rights at most [08:48:26] and all on admin interfaces [08:48:38] but I don't know how that that works for you applicatino [08:49:51] sooo, the web interface surely need select/insert/updates because the data will come through it. The schema changes are made by a CLI (django manage.py migrate) and I still need to fully understand if that will be done by a scap step from the deployment hosts [08:50:09] in that case we could have a separate account there [08:50:11] I will add all [08:50:15] no problem [08:50:19] just a heads up [08:52:16] ack, as soon as I fully understand the deploy procedure I'll let you know how we can split it [08:52:23] for a more safer config [09:07:28] 10DBA, 10Operations-Software-Development, 10Patch-For-Review: Debmonitor: request for misc DB allocation - https://phabricator.wikimedia.org/T192875#4177782 (10jcrespo) @volans You can test it now, but remember this is a full production host- no load testing or other silly things, as this shares service with... [09:15:13] 10DBA, 10Operations-Software-Development, 10Patch-For-Review: Debmonitor: request for misc DB allocation - https://phabricator.wikimedia.org/T192875#4177818 (10jcrespo) Backups added on codfw (there are not, at the time, backups for m2 on eqiad) for dbmonitor. [09:17:27] 10DBA, 10Operations-Software-Development, 10Patch-For-Review: Debmonitor: request for misc DB allocation - https://phabricator.wikimedia.org/T192875#4177831 (10jcrespo) a:05jcrespo>03Volans Assigning to you for verification and read the caveats at T192875#4177782. [09:24:23] 10DBA, 10Operations-Software-Development, 10Patch-For-Review: Debmonitor: request for misc DB allocation - https://phabricator.wikimedia.org/T192875#4177848 (10Volans) Great! Thanks a lot. >>! In T192875#4177782, @jcrespo wrote: > @volans You can test it now, but remember this is a full production host- no... [09:38:51] 10DBA: Drop flaggedrevs tables on wikis where it is not enabled - https://phabricator.wikimedia.org/T174801#4177876 (10MarcoAurelio) [09:56:03] jynus: please ping me when you have a little bit of time [10:06:04] so [10:06:05] jynus: marostegui: here? [10:06:06] hangouts.google.com cannot be resolved [10:06:21] o/ [10:11:44] yeah this is not working [10:11:44] shall we skip this meeting? :) [10:11:44] or discuss things here [10:12:09] I guess his connection is the one not working :) [10:12:10] sure [11:00:34] smart health alerts downtime expired, which reminded me I should ping you about what to do with those [11:01:13] ^ marostegui let'a talk after lunch? [11:01:30] sure [11:01:31] works for me, I'm going to lunch shortly [11:01:39] IO am going for lunch now [12:30:41] godog jynus my proposal for the alerts: https://phabricator.wikimedia.org/P7073 [12:37:34] jynus: are you touching s4 codfw? I would like to deploy a schema change directly on codfw master with replication [13:01:40] 10DBA, 10Patch-For-Review: Productionize 8 eqiad hosts - https://phabricator.wikimedia.org/T192979#4178179 (10jcrespo) [13:02:02] no [13:02:12] I'll proceed then! [13:07:37] 10DBA: Decommission db1060 - https://phabricator.wikimedia.org/T193732#4178195 (10Marostegui) p:05Triage>03Normal [13:21:41] marostegui: plan looks good to me! [14:03:35] 10DBA: Decommission db1056 - https://phabricator.wikimedia.org/T193736#4178332 (10jcrespo) p:05Triage>03Normal [14:05:20] 10DBA: Decommission db1056 - https://phabricator.wikimedia.org/T193736#4178383 (10jcrespo) a:03jcrespo [14:37:13] I'll open tasks for the remaining hosts [14:37:37] godog: No need to I think [14:37:53] I will manually fail them and the automated alert will create the degraded raid one [14:38:02] oh that's right, even better [14:38:03] thanks! [14:39:29] I will fail them on Tuesday, don't want to leave them failed during the weekend [14:39:44] Actually, I will check if chris is at the dc now [14:39:45] and fail one [14:40:33] you're lazy :-P [14:40:46] get the automation work for you and create tasks for you [14:40:57] Just to check if it keeps working [14:40:58] :p [14:41:31] sure sure, good excuse though ;) [14:43:19] volans: when it works [14:43:46] #roast [14:45:01] can someone double check that the wrong disk on db1063 is 1 so: megacli -PDOffline -PhysDrv \[32:1\] -aALL [14:45:04] ? [14:45:22] it should say it on the alert [14:45:30] it does :) [14:45:47] but I want another set of eyes if that's possible [14:46:19] $ sudo /usr/local/lib/nagios/plugins/get-raid-status-megacli [14:46:37] Enclosure Device ID: 32 ; Slot Number: 1 [14:46:39] :) [14:46:58] thanks [14:47:33] de nada [14:55:05] sorry, got distracted by manager [14:55:45] 10DBA, 10Operations, 10ops-eqiad: Degraded RAID on db1063 - https://phabricator.wikimedia.org/T193747#4178575 (10Marostegui) p:05Triage>03Normal a:03Cmjohnson This is m1 master - we failed the disk manually as it has errors [15:05:13] 10DBA, 10Operations, 10ops-eqiad: Degraded RAID on db1063 - https://phabricator.wikimedia.org/T193747#4178568 (10RobH) The warranty on db1063 has expired, and is no longer under warranty support. Any failed disks will need to be replaced from shelf spares. [15:26:39] marostegui the disk in slot 1 on db1063 has been replaced [15:26:47] \o/ [15:26:49] Thanks cmjohnson1 [15:28:41] 10DBA, 10Operations, 10ops-eqiad: Degraded RAID on db1063 - https://phabricator.wikimedia.org/T193747#4178684 (10Marostegui) Disk replaced by @Cmjohnson ``` root@db1063:~# megacli -PDRbld -ShowProg -PhysDrv [32:1] -aALL Rebuild Progress on Device at Enclosure 32, Slot 1 Completed 2% in 1 Minutes. ``` [15:38:45] 10DBA, 10Operations, 10hardware-requests, 10ops-codfw, 10Patch-For-Review: Decommission db2011 - https://phabricator.wikimedia.org/T187886#4178726 (10Papaul) [15:39:11] 10DBA, 10Operations, 10hardware-requests, 10ops-codfw, 10Patch-For-Review: Decommission db2011 - https://phabricator.wikimedia.org/T187886#3989185 (10Papaul) a:05Papaul>03RobH @RobH done on my end [15:42:28] 10DBA, 10Operations, 10hardware-requests, 10ops-codfw: Decommission db2011 - https://phabricator.wikimedia.org/T187886#4178762 (10RobH) [15:42:33] 10DBA, 10Operations, 10hardware-requests, 10ops-codfw: Decommission db2011 - https://phabricator.wikimedia.org/T187886#3989185 (10RobH) 05Open>03Resolved [15:42:36] 10DBA, 10Operations, 10Patch-For-Review: Setup newer machines and replace all old misc (m*) and x1 codfw machines - https://phabricator.wikimedia.org/T183470#4178768 (10RobH) [15:44:00] 10DBA, 10Goal, 10Patch-For-Review: Decommission database hosts <= db2031 (tracking) - https://phabricator.wikimedia.org/T176243#4178782 (10Marostegui) 05Open>03Resolved This task is now completely done. Thanks everyone who helped on getting this goal finished! [15:44:02] 10DBA, 10Goal, 10Patch-For-Review: Decommission database hosts <= db2031 (tracking) - https://phabricator.wikimedia.org/T176243#4178784 (10Marostegui) This task is now completely done. Thanks everyone who helped on getting this goal finished! [16:52:10] 10DBA, 10Operations: Multiple pages with no revisions - https://phabricator.wikimedia.org/T112282#4179032 (10RobH) p:05High>03Low [22:42:27] 10DBA, 10Cloud-Services, 10Patch-For-Review: Make watchlist table available as curated foo_p.watchlist_count on labsdb - https://phabricator.wikimedia.org/T59617#1458677 (10kaldari) FYI, this is needed for T188151. [22:42:53] 10DBA, 10Operations, 10decommission, 10Goal: Decommission old coredb machines (<=db1050) - https://phabricator.wikimedia.org/T134476#4180499 (10RobH) [22:58:43] 10DBA, 10Operations, 10decommission, 10ops-eqiad: Decommission db1034 - https://phabricator.wikimedia.org/T182556#4180550 (10RobH) [22:58:58] 10DBA, 10Operations, 10decommission, 10ops-eqiad: Decommission db1029 and db1031 - https://phabricator.wikimedia.org/T184054#4180551 (10RobH) [23:00:09] 10DBA, 10Operations, 10decommission, 10ops-eqiad, 10Patch-For-Review: Decommission db1039 - https://phabricator.wikimedia.org/T184262#4180561 (10RobH) [23:03:16] 10DBA, 10Operations, 10decommission, 10Goal: reclaim and return all cisco servers - https://phabricator.wikimedia.org/T128821#4180580 (10RobH)