[09:35:56] it seems that db1048 lagged tonight- probably will do it more weeks, as they do dumps/OLAP on the slave [09:36:35] they should either downtime the lag or we should increase the lag tolerance [09:41:57] agree, can you remind me the usage for the "m" and "x" shards? [09:42:30] s == shard [09:42:41] m == misc [09:43:08] es == external storage [09:43:46] x == ? [09:44:21] x = Extension Store [09:44:30] thx :) [09:44:31] I had to look that up for the official repo [09:44:36] *name [09:44:51] names are not important [09:45:06] I vagely defined them on the slides I shared with you [09:45:16] but s == core relational data [09:45:50] m == non mediawiki services, with different degrees of impact (but requiring HA) [09:46:02] e.g. wordpress, puppet, gerrit, openstak [09:46:14] think, "internal services" [09:46:18] ok [09:46:24] es I know :) [09:46:46] x was, at the time, mostly created for storage needs [09:47:06] now it is a division based mostly on privacy and access patterns [09:51:07] es > s > x in terms of impact [09:51:32] with the note that es almost never fails [09:51:48] becase it is a pure key-value store with primary key access [09:52:04] while s is a purely relational system with complex queries [15:56:35] I'm working with paupaul on es2012 [15:57:27] great! [15:58:02] there is a chance that the stripe can be changed without rebuild [15:59:02] * volans disabled notification on icinga for es2012 [15:59:05] great! that was my biggest concern [15:59:15] :-D [16:39:55] no luck, we need to reimage as expected [19:44:27] so thanks to papaul all servers are reimaged, I'll do a pass to check them after dinner [19:44:54] thats very fast! [19:45:04] congrats! [19:45:20] j.ynus do you want to test XFS formatting too? I'd say that given current timeframe and no current errors/bad performances we might skip [19:45:23] thoughts? [19:45:41] not sure why not ping me, I am aswering you now [19:46:06] aka == I am here [19:46:29] * volans is lost... [19:46:32] what do you mean? [19:46:40] j.ynus [19:46:50] or was just a typo? [19:47:05] was not urgent given that I was leaving for dinner :) [19:47:15] ok [19:47:20] in any case [19:47:30] skip it [19:47:45] +1 [19:47:56] your idea, I am just supporting it [19:48:32] see you next week? [19:48:38] another failed disk on db1021... was not the same you changed 2 disks? [19:48:54] it is probably them being rebuilt [19:49:02] I asked chris to change a pair [19:49:10] ok, make sense [19:50:07] I discover the other day that icinga says the same error on failed disk and on rebuilding [19:50:27] not cool [19:50:53] send a patch! [19:51:07] open a ticket :)