[01:30:06] 10DBA, 10MediaWiki-Categories, 03Community-Tech-Sprint, 13Patch-For-Review: Increase size of categorylinks.cl_collation column - https://phabricator.wikimedia.org/T158724#3051978 (10Bawolff) I guess theres always the question of what happens when someone makes an even longer collation name. (Consider me... [07:16:18] 10DBA, 13Patch-For-Review: Rampant differences in indexes on enwiki.revision across the DB cluster - https://phabricator.wikimedia.org/T132416#3052329 (10Marostegui) db2069 ``` | revision | CREATE TABLE `revision` ( `rev_id` int(8) unsigned NOT NULL AUTO_INCREMENT, `rev_page` int(8) unsigned NOT NULL DEFAU... [09:18:17] 10DBA, 13Patch-For-Review: run pt-table-checksum before decommissioning db1015, db1035,db1044,db1038 - https://phabricator.wikimedia.org/T154485#3052487 (10Marostegui) >>! In T154485#3050716, @Marostegui wrote: > db1036 and db2035 were still lagging behind more than 30k: > > ``` > ˜/jynus 19:26> it is now doi... [09:20:34] 10DBA, 13Patch-For-Review: run pt-table-checksum before decommissioning db1015, db1035,db1044,db1038 - https://phabricator.wikimedia.org/T154485#3052496 (10jcrespo) I wouldn't be against it, but I would ask to start puppetizing that- Something like a replication filter, and add those through hiera with a new p... [09:27:56] 10DBA, 13Patch-For-Review: run pt-table-checksum before decommissioning db1015, db1035,db1044,db1038 - https://phabricator.wikimedia.org/T154485#3052512 (10Marostegui) >>! In T154485#3052496, @jcrespo wrote: > I wouldn't be against it, but I would ask to start puppetizing that- Something like a replication fil... [09:55:54] 07Blocked-on-schema-change, 10DBA, 06Community-Tech, 06Stewards-and-global-tools (Temporary-UserRights): Schema changes for expiring user groups - https://phabricator.wikimedia.org/T155605#3052564 (10Marostegui) I have taken a look at all the hosts in s1 (enwiki) and the table there already has the PK, so... [09:58:44] BTW, I did the increase of threads for the pool with sql.py [10:00:24] 07Blocked-on-schema-change, 10DBA, 06Community-Tech, 06Stewards-and-global-tools (Temporary-UserRights): Schema changes for expiring user groups - https://phabricator.wikimedia.org/T155605#3052574 (10TTO) Note that if the PK already exists, the unique index may not exist, so it would just be a matter of ad... [10:02:49] 07Blocked-on-schema-change, 10DBA, 06Community-Tech, 06Stewards-and-global-tools (Temporary-UserRights): Schema changes for expiring user groups - https://phabricator.wikimedia.org/T155605#3052576 (10Marostegui) >>! In T155605#3052574, @TTO wrote: > Note that if the PK already exists, the unique index may... [10:08:17] 07Blocked-on-schema-change, 10DBA, 06Community-Tech, 06Stewards-and-global-tools (Temporary-UserRights): Schema changes for expiring user groups - https://phabricator.wikimedia.org/T155605#3052579 (10TTO) Eh? Where are we dropping the `ug_group` index? From https://gerrit.wikimedia.org/r/#/c/328377/22/main... [10:19:36] 07Blocked-on-schema-change, 10DBA, 06Community-Tech, 06Stewards-and-global-tools (Temporary-UserRights): Schema changes for expiring user groups - https://phabricator.wikimedia.org/T155605#3052589 (10Marostegui) >>! In T155605#3052579, @TTO wrote: > Eh? Where are we dropping the `ug_group` index? From http... [11:16:40] labsdb1005 may have crashed yesterday? [11:17:32] looks so [11:19:02] doesn't look like a oom [11:19:29] We think the query pointer is invalid, but we will try to print it anyway. [11:19:32] Query: OPTIMIZE TABLE `ipp_changes` [11:19:34] which doesn't mean it was that one though [11:20:44] it complainms about several tables, but those are probably myisam [11:20:58] nothing we can do if people want to create tables on /dev/null [11:22:38] but that is why it is almost impossible to get a consistent replication to the slave [11:23:15] yeah, it has been complaining for years, but as you said, not much we can do [11:23:24] s/years/days obviously [11:23:30] I am going for lunch, will be back later [11:23:43] see you [14:08:37] 10DBA, 06Labs, 10Labs-Infrastructure: Database upgrade MariaDB 10: Engine / Option mismatch on table `user_properties` - https://phabricator.wikimedia.org/T70942#3053003 (10Marostegui) 05Open>03Resolved I have, by chance, arrived to this old ticket, which I believe can be closed. labsdb1001 runs 10.0.15... [14:24:24] is it worrying that I know most of the wikis in which shard they are from the top of my head? [14:26:38] no [14:26:55] you were surprised I knew that a few months ago [14:27:15] you achieved enlightment now, too [14:27:31] hahaha [14:27:34] but wait a few months, I will change all of them [14:28:29] Make sure you clean up s3, please [14:28:54] clean up? [14:29:11] yes, if you are going to change all of them, leave s3 with less wikis [14:29:14] spread them [14:29:16] :p [14:29:17] yes [14:29:31] specially some of the largest [14:39:37] 07Blocked-on-schema-change, 10DBA, 06Community-Tech, 06Stewards-and-global-tools (Temporary-UserRights): Schema changes for expiring user groups - https://phabricator.wikimedia.org/T155605#3053102 (10Marostegui) Apart from s1: s2 all databases there have the PK on their user_groups table s4, commonswiki h... [14:41:22] ^did you use sql.py ? [14:47:42] I did! -h multi :) [14:49:14] I had to look at the code though to see if there was an option for ALL the wikis, so maybe having —shard SHARD or even providing a list of available shards ie: available options,s1,s2,m3,m4 whatever can be useful [14:49:22] or if you prefer just —shard-all as an option [14:49:29] because I tried: "all" [14:49:36] and didn't think it could be capital letters [14:49:57] I think it is --shard ALL [14:50:04] it is [14:50:23] I will have to redo that part [14:50:27] on my own usage [14:50:37] I realized that there are several scenarios [14:50:45] all servers [14:50:52] all servers, one per wiki [14:50:55] can you also pass a list of hosts? [14:51:21] one per table [14:51:24] etc. [14:51:32] only masters [14:51:35] etc. [14:51:57] I can separate the list generation from the exec [14:52:08] (it is already separate on code) [14:54:33] I would also say that when using multi, it might be good to ignore temporary hosts down [14:54:36] for example [14:54:47] root@neodymium:/home/jynus# ./sql.py -h multi -e "show create table user_groups" --shard ALL --no-dry-run [14:54:50] ERROR: Could not connect to labsdb1001.eqiad.wmnet:3306/arwiki [14:54:53] and it stops there [14:55:22] maybe it would be good t have something like —ignore-errors or something similar [14:57:18] it could be an option [14:57:25] I wanted it to stop by default [14:57:38] if it was an alter you would want to do some action, then continue [14:58:15] that is true, but maybe for selects or stuff like that, having an option to ignore hosts down or slow [14:58:24] I wonder why i cannot connect to labs1001 btw [14:58:26] lets see [14:58:35] it is the ssl [14:58:44] needs an upgrade + reboot [14:59:46] ah :) [15:20:53] I think that is a bug that may actually be fixed on the repo version [15:21:01] let me check [15:40:22] I think it should work now [15:40:33] ERROR: Could not connect to dbstore1001.eqiad.wmnet:3306/frwiki [15:40:38] but then it continues [15:42:02] don't use it yet for updates or alters, ok? [15:42:21] no, of course not [15:43:24] I would add a format option [15:43:35] one that is pure text [16:22:05] 10DBA, 06Operations, 10ops-eqiad: Degraded RAID on db1070 - https://phabricator.wikimedia.org/T158969#3053253 (10Marostegui) [16:23:07] 10DBA, 06Operations, 10ops-eqiad: Degraded RAID on db1070 - https://phabricator.wikimedia.org/T158969#3053239 (10Marostegui) Hey @Cmjohnson it should be safe to change this disk when you have time Thanks! [16:33:51] 07Blocked-on-schema-change, 10DBA, 06Community-Tech, 06Stewards-and-global-tools (Temporary-UserRights): Schema changes for expiring user groups - https://phabricator.wikimedia.org/T155605#3053297 (10Marostegui) And the list of wikis of s3 that DO NOT HAVE the PK but an UNIQUE key instead (this is mostly f... [17:20:15] ETA 45 minutes to finish the transfer to dbstore1001! [17:41:21] :-) [17:48:28] 10DBA, 10Wiki-Loves-Monuments-Database: mysqldump is timing out preventing all tables from being included in the dump - https://phabricator.wikimedia.org/T138517#3053448 (10Lokal_Profil) [17:50:59] 15 minutes left, my plan would be to start —skip-slave-start do the mysql_upgrade, restart…and cross my fingers? [17:57:30] mysql_upgrade will take 30 minutes [17:57:33] maybe more [17:57:53] also the scheduler will start and will break replication stuff [17:58:22] there is a start option to disable the event_Scheduler [17:58:26] lets see [17:58:35] not "break" [17:58:42] but hope you understand [17:58:50] yes yes, i do :) [17:59:05] we need to get rid of the event [17:59:07] --event-scheduler=DISABLED [17:59:25] Starting the MySQL server with the https://dev.mysql.com/doc/refman/5.7/en/server-options.html#option_mysqld_skip-grant-tables option causes https://dev.mysql.com/doc/refman/5.7/en/server-system-variables.html#sysvar_event_scheduler to be set to DISABLED, [17:59:26] a service would be much more easy to handle [17:59:39] I think you may have to restart after that [18:00:03] in any case, check twice all repl threads [18:00:38] we can: —skip-slave-start —event-scheduler=disabled then run mysql_upgrade, then restart with —event-scheduler=disabled again, then check all the threads, start replication [18:00:45] and when we are good, we can enable the scheduler [18:01:04] i would need to leave around 8-8:15 by the way [18:01:17] 4 minutes left! [18:02:05] you shouldn't have stayed late [18:02:26] neither have you and here we are :p [18:02:29] there is really no need [18:02:36] yeah, but I was here to do other things [18:03:25] i want to leave it catching up the weekend, and I got to advance a bit on that schema change ticket, and got to see a few slides from fosdem finally! [18:04:38] ok, so /etc/init.d/mysql start —skip-slave-start —event-schduler=disabled sounds good to you? [18:05:03] scheduler obviously [18:06:32] it should be [18:06:57] but I have almost giving up trying to have a sane dbstore :-) [18:07:03] haha [18:07:07] ok, so the transfer is done [18:07:10] let's go ahead [18:07:22] you want to tail the error log to see the fun? [18:07:32] sure [18:08:16] here we go! [18:09:12] check if toku is active [18:09:17] before everthing [18:09:20] ok [18:09:22] if you hve the change [18:09:25] cannot log in yet [18:09:26] like reading a table [18:09:35] oh, true [18:09:39] it is take some time [18:09:42] yeah [18:09:50] like 2001 [18:09:53] now [18:09:55] yeop [18:10:07] (but much faster) [18:10:18] tokudb plugin seems loaded as per show plugins at least [18:10:26] strange [18:10:31] that it works [18:11:10] I do not remember fixing the systmctl flags [18:11:40] and i can read a table with tokudb [18:11:52] arwiki.abuse_filter_action for instance [18:11:56] that is cool [18:12:02] I believe you [18:12:03] let's go for the mysql ugprade? [18:12:08] yes [18:12:15] I just do not know when that was fixed [18:12:37] Unsupported sql_command=46 [18:12:41] yep [18:12:54] Repairing tables [18:12:54] mysql.master_status [18:12:54] Error : CONNECT Unsupported command [18:12:54] Error : Can't lock file (errno: 122 "Internal (unspecified) error in handler") [18:12:57] error : Corrupt [18:13:04] really? [18:13:17] it is related to mysql.master_status and slave_status tables only [18:13:18] (so far) [18:13:22] ah [18:13:24] good [18:13:30] yeah, so far so good [18:13:40] I thought it was regarding data tables [18:13:44] no no [18:13:45] just mysql [18:13:48] I couldn't care more about those [18:13:53] *less [18:13:53] XD [18:14:05] now on phase 2, fixing views [18:14:06] In theory, this upgrade should be easy [18:14:13] that is what it takes 30 minutes [18:14:17] only tokudb worries me [18:14:26] opening every single files and db [18:14:34] you can go take a coffe here [18:15:53] I've just sent you some literature for your amusement [18:16:04] hahaha [18:16:06] lets see [18:16:55] wtf!! [18:17:25] haha they mention your nickname [18:17:53] date: 2007 [18:17:57] I was going to ask [18:17:58] haha [18:18:12] 23 servers on Seoul [18:18:14] what about those other that they mention, still active? [18:20:10] surprisingly, most of them do [18:20:40] polish was really popular at that time I can see! [18:20:47] It says, polish right? [18:21:04] it is hard to read [18:21:18] I may get a better version [18:21:34] yes [18:21:42] it still is, remember pl and pt [18:21:48] on which shard? [18:22:18] you learn a lot about iso codes for you resume! [18:22:58] it is weird because internet in poland I am sure it wasn't that developed at the time [18:23:23] from eswiki, I think the excuse was "botting" [18:23:32] not sure the reality [18:29:23] do you know what would fix most of our pains with many tables? [18:29:33] I have replied to you with something for your amusement too [18:29:34] the new MySQL data dictionary! [18:29:46] that one you tested and reported a bug for? :) [18:29:50] yeah [18:29:56] but it is really fast [18:29:57] was it fixed? [18:30:05] yes, it was [18:30:16] but no more milestone releases [18:30:26] unless it has gone unnoticed [18:30:45] you've got the bug report somewhere? [18:30:49] lol [18:30:54] hahaha [18:30:57] I guess you so the email [18:30:58] XDDD [18:31:04] *Saw [18:31:25] year? [18:31:29] 2006 [18:31:36] XDD [18:32:14] https://bugs.mysql.com/bug.php?id=83706 [18:32:29] it was a stupid bug, actually [18:32:44] lack of quotes in binary because it is a keyword [18:33:45] haha i see the last answer yeah [18:33:50] yea, not 8.0.1 yet [18:34:09] but I saw the data dictionary in action [18:34:22] and it may sound stupid when they say [18:34:37] 10x more performance for information_schema [18:34:43] who cares? [18:34:59] but then here you are, running mysql_upgrade for half an hour [18:35:07] mostly doing show tables :-) [18:35:07] hahaha that is a good point [18:35:29] although it is a bit crazy to have 900 databases in once place too :p [18:35:40] oh, I do not disagree [18:35:55] however [18:36:00] I've seen it worse [18:36:07] yes me too [18:36:08] and i cried [18:36:09] 100000 databases [18:36:18] it no longer worked [18:36:18] 100k?????? [18:36:20] yep [18:36:24] don't ask [18:36:25] I saw 60k and I cried [18:36:35] 100k? does that even start? [18:36:39] maybe they had smaller tables [18:36:52] I think they were installations of [18:37:09] what is the name of the open source course stuff? [18:37:54] moodle [18:38:06] now imagine supporting a whole country, all public system [18:38:11] schools [18:38:16] in a single mysql instance :-) [18:38:28] oh moodle [18:38:29] one db per class and year [18:38:37] :o [18:38:55] did mysql even start with 100k databases??? [18:39:09] it did [18:39:18] xtrabackup, of course, it crashed [18:40:19] do you know what else would be great? [18:40:31] not having toku? :) [18:40:32] multiple-table tablespaces [18:40:45] single file per db [18:41:05] 5.7 has it [18:41:25] you will not solve the table problem [18:41:30] but at least the file problem [18:41:32] yes, I was going to say that it sounded familiar that they were going towards that direction [18:43:01] we are now on mrwikibooks already [18:43:07] pfff [18:43:27] at this point I will tell you to go away :-) [18:43:34] Yeah, I think I will [18:43:38] you can do screen -x transfer [18:43:40] on dbstore1001 [18:43:45] and it is there [18:43:55] events disabled, right? [18:43:59] yep [18:44:00] so just reboot? [18:44:14] events disabled with —event-scheduler=disabled [18:44:23] so if you restart without it they will come back [18:44:27] yeah, I was going to reboot with no parameeters [18:44:32] ah, then yes [18:44:33] and let things happen [18:44:42] when it is finished, of course [18:44:49] nywiki [18:44:51] already [18:45:09] see you next week [18:45:39] pfl [18:45:42] I might stay!!! [18:45:47] it is doing phabricator [18:46:02] you know wikidata and zhwiki and plwiki and ptwiki... [18:46:37] ptwiki is already done, but wikidata, yes pfff [18:52:45] I need to leave the house now :_( [18:52:59] Email me to let me know if all went good, so I can have a relaxing dinner! [18:53:20] ok, bye! [18:53:29] it is doing urwiki now [18:53:32] almost there [18:53:35] see you later! [18:53:39] thanks for all the help [18:56:40] restarting now [22:52:56] 10DBA, 10MediaWiki-Categories, 03Community-Tech-Sprint, 13Patch-For-Review: Increase size of categorylinks.cl_collation column - https://phabricator.wikimedia.org/T158724#3054267 (10kaldari) >I guess theres always the question of what happens when someone makes an even longer collation name. If anyone make...