[00:01:10] (CR) Elukey: "I had a chat with Madhu today about the overall code, I added some comments in the Fabric's code to make it clearer but nothing super impo" (11 comments) [analytics/wikimetrics-deploy] - https://gerrit.wikimedia.org/r/261579 (https://phabricator.wikimedia.org/T122228) (owner: Madhuvishy)
[00:47:45] madhuvishy: hey! around?
[00:47:54] YuviPanda: yes
[00:48:11] madhuvishy: can you help me debug why I think some of my events might be getting lost?
[00:48:20] YuviPanda: sure, where are you?
[00:48:33] madhuvishy: near the hammock
[00:48:39] madhuvishy: I can come there
[01:07:29] Analytics: Pageview API demo doesn't list be-tarask - https://phabricator.wikimedia.org/T119291#1933649 (Ijon) Ping @Milimetric ?
[01:12:15] Analytics-Tech-community-metrics, Developer-Relations, DevRel-January-2016: Check whether it is true that we have lost 40% of (Git) code contributors in the past 12 months - https://phabricator.wikimedia.org/T103292#1933663 (Qgil) Thank you! If we assume that it is true that we have lost 23% (within a...
[01:12:44] Analytics-Tech-community-metrics, Developer-Relations, DevRel-January-2016, developer-notice: Check whether it is true that we have lost 40% of (Git) code contributors in the past 12 months - https://phabricator.wikimedia.org/T103292#1933668 (Qgil)
[01:38:46] (PS7) Madhuvishy: Fabric deployment setup for wikimetrics [analytics/wikimetrics-deploy] - https://gerrit.wikimedia.org/r/261579 (https://phabricator.wikimedia.org/T122228)
[01:56:49] (CR) Madhuvishy: Fabric deployment setup for wikimetrics (10 comments) [analytics/wikimetrics-deploy] - https://gerrit.wikimedia.org/r/261579 (https://phabricator.wikimedia.org/T122228) (owner: Madhuvishy)
[02:00:30] (CR) Madhuvishy: "recheck" [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/263782 (owner: Madhuvishy)
[02:10:57] (PS8) Madhuvishy: Fabric deployment setup for wikimetrics [analytics/wikimetrics-deploy] - https://gerrit.wikimedia.org/r/261579 (https://phabricator.wikimedia.org/T122228)
[09:28:01] (PS1) Hashar: Restrain pep8 to 1.5.x [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/264057
[09:28:24] (CR) Hashar: "The flake8 issues are due to a new version of pep8. https://gerrit.wikimedia.org/r/264057 fix it." [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/263782 (owner: Madhuvishy)
[09:47:08] Analytics-Wikimetrics, Education-Program-Dashboard: I want WikiMetrics integration with the education dashboard that lets you easily pull reports about courses, institutions, etc. - https://phabricator.wikimedia.org/T92454#1934045 (awight)
[11:43:53] Analytics-Tech-community-metrics, DevRel-January-2016, Easy, Patch-For-Review: Entered text in Typeahead search field nearly not visible in Firefox 42: Fix the CSS - https://phabricator.wikimedia.org/T121101#1934145 (Aklapper) Merged upstream in https://github.com/VizGrimoire/VizGrimoireJS/commit/d7...
[12:46:26] Analytics-Tech-community-metrics, DevRel-January-2016: Make GrimoireLib display *one* consistent name for one user, plus the *current* affiliation of a user - https://phabricator.wikimedia.org/T118169#1934183 (Aklapper) ...and merged in https://github.com/VizGrimoire/GrimoireLib/commit/d646bcd07b584932a0f...
[13:36:32] Analytics-Tech-community-metrics, DevRel-January-2016: Make GrimoireLib display *one* consistent name for one user, plus the *current* affiliation of a user - https://phabricator.wikimedia.org/T118169#1934240 (Lcanasdiaz) Library deployed in our server. Waiting to generate a new set of JSON files.
[13:47:32] Analytics-Tech-community-metrics, DevRel-January-2016, Easy, Google-Code-In-2015, Patch-For-Review: Clarify Demographics definitions on korma (Attracted vs. time served; retained) - https://phabricator.wikimedia.org/T97117#1934249 (Aklapper) >>! In T97117#1932785, @Nemo_bis wrote: >> Attracted: N...
[14:19:16] Analytics, MediaWiki-extensions-WikimediaEvents, The-Wikipedia-Library, Wikimedia-General-or-Unknown, Patch-For-Review: Implement Schema:ExternalLinkChange - https://phabricator.wikimedia.org/T115119#1934262 (Krenair) Well since I was already on it, a review had already been requested...
[14:31:38] nuria: Please review my patch. I posted the link at Google Code-In website. :)
[14:44:28] Analytics-Tech-community-metrics, DevRel-January-2016: "Unavailable section name" displayed on repository.html - https://phabricator.wikimedia.org/T121102#1934309 (Lcanasdiaz) The reason is a wrong URL returned by the search box, if we add &ds=scr to the URL it works. My workmate Quan and I are working on...
[14:56:51] Analytics: Pageview API demo doesn't list be-tarask - https://phabricator.wikimedia.org/T119291#1934326 (Milimetric) We've been pondering this, @Ijon. I'm leaning towards declining it because this really was just a simple demo and proper support for *all* projects is not trivial (our project codes, dbnames,...
[15:05:04] Analytics, MediaWiki-extensions-WikimediaEvents, The-Wikipedia-Library, Wikimedia-General-or-Unknown, Patch-For-Review: Implement Schema:ExternalLinkChange - https://phabricator.wikimedia.org/T115119#1934342 (Sadads) @Krenair Thanks for the review, also I am new to gerrit. Learning the ins and...
[15:17:00] Analytics: python-mwviews does not handle unicode in titles - https://phabricator.wikimedia.org/T123200#1934361 (Milimetric) Thanks @ResMar, getting to these issues will take a bit, as I have a bunch of high priority work, but I appreciate the code and thoughts.
[15:21:58] Analytics-Backlog: Update reportcard.wmflabs.org with July-October data - https://phabricator.wikimedia.org/T116244#1934362 (Milimetric) > Thanks for the explanation! (For the record, the Phabricator convention has since been [[https://lists.wikimedia.org/pipermail/analytics/2016-January/004763.html |changed...
[15:52:24] Analytics, Analytics-Kanban, Patch-For-Review: Foundation-only Geowiki stopped updating - https://phabricator.wikimedia.org/T106229#1934472 (Milimetric) The new problem seems to be that all the databases Evan's scripts write to are now in read-only mode: ``` ERROR 1290 (HY000): The MariaDB server is...
[15:52:53] Analytics-Kanban: Foundation-only Geowiki stopped updating - https://phabricator.wikimedia.org/T106229#1934473 (Milimetric)
[15:53:33] does anyone know why s1-analytics-slave.eqiad.wmnet is in --read-only mode?
[15:53:44] (same with s*-analytics-slave)
[16:01:10] milimetric: hola
[16:01:14] hi
[16:01:40] milimetric: is this your ops week?
[16:02:04] no, but I kind of skipped mine during all-staff
[16:04:53] hi ottomata, maybe you know this:
[16:04:58] Analytics: Pageview API demo doesn't list be-tarask - https://phabricator.wikimedia.org/T119291#1934481 (Nuria) Agreed with @milimetric, our demo is just a showcase of what you can do with Api and it has many shortcomings,.We are planning on developing tools that are more robust.
[16:05:05] why is s1-analytics-slave.eqiad.wmnet in --read-only mode
[16:05:32] nuria: It's my week
[16:05:49] Analytics-Kanban: EL replication having issues since at least January 11th - https://phabricator.wikimedia.org/T123634#1934482 (Nuria) NEW
[16:07:26] nuria: Is it for EL you were asking ?
[16:07:51] joal; yes, are you subscribed to ops list?
[16:07:58] Yes
[16:08:02] I have seen faidon comment
[16:08:29] Analytics-Tech-community-metrics, DevRel-January-2016: gerrit_review_queue can have incorrect data about patchsets "waiting for review" - https://phabricator.wikimedia.org/T121495#1934489 (Lcanasdiaz) It seems I was wrong! I was talking with @dicortazar and it seems the metric is right but **it does not r...
[16:09:03] its in read only mode?
[16:09:54] ottomata: yeah, I get an error about it from geowiki, which stopped updating because all the s*-analytics-slave(s) are in read only mode
[16:10:01] Analytics-Kanban, DBA: EL replication having issues since at least January 11th - https://phabricator.wikimedia.org/T123634#1934492 (Nuria)
[16:10:32] by "they are in read only mode" I mean when you try to insert you get a db error about MariaDB being in --read-only mode
[16:10:52] Hm, ok, don't know why, that is a different db than the EL one, right?
[16:10:58] i will look into it in a bit
[16:11:21] Analytics-Kanban, DBA: EL replication having issues since at least January 11th - https://phabricator.wikimedia.org/T123634#1934482 (Nuria) From ops list: "SELECT * FROM information_schema.processlist ORDER BY time DESC" informs us of this: | 5599890 | research | 10.64.36.103:53669 | enwiki...
[16:12:04] nuria: do we take a few minutes in cave to discuss what is expected as an answer to DB issues like that ?
[16:13:29] joal: sure, let's do it after standup. I have filed a ticket to compile the info, so far I think that is the only thing we can do.
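[Editor's note] The geowiki failure above is a write hitting a MariaDB server running with --read-only (ERROR 1290). As a generic illustration of what the script is running into, not the actual geowiki code, a client can probe writability by attempting a throwaway write and catching the error. This sketch uses sqlite3's read-only URI mode to simulate the behavior; the table and path names are hypothetical.

```python
import os
import sqlite3
import tempfile

# Create a database file with one table, then reopen it read-only.
path = os.path.join(tempfile.mkdtemp(), "geowiki.db")  # hypothetical path
conn = sqlite3.connect(path)
conn.execute("CREATE TABLE edits (id INTEGER PRIMARY KEY, n INTEGER)")
conn.commit()
conn.close()

ro = sqlite3.connect("file:" + path + "?mode=ro", uri=True)

def is_writable(conn):
    """Return False if the database rejects writes (here sqlite's
    'attempt to write a readonly database'; on MariaDB the analogous
    failure is ERROR 1290 about the --read-only option)."""
    try:
        conn.execute("INSERT INTO edits (n) VALUES (1)")
        conn.rollback()  # probe only; do not keep the row
        return True
    except sqlite3.OperationalError:
        return False

print(is_writable(ro))  # False: opened read-only
```

A probe like this lets a batch job fail fast with a clear message instead of dying mid-run when the slave flips to read-only.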
[16:13:40] ok, thx
[16:14:38] o/
[16:14:45] Hi elukey :)
[16:18:15] Analytics: Restore MobileWebSectionUsage_14321266 and MobileWebSectionUsage_15038458 - https://phabricator.wikimedia.org/T123595#1934501 (Tbayer) >>! In T123595#1933431, @Nuria wrote: > Tables will start existing once blacklisting is lifted, let us know when new sampling ratio has taken effect. > > I unde...
[16:21:46] Analytics, Wikimedia-Developer-Summit-2016: Developer summit session: Pageview API from the Event Bus perspective - https://phabricator.wikimedia.org/T112956#1934503 (Milimetric) >>! In T112956#1906630, @Tgr wrote: >>>! In T112956#1904116, @Milimetric wrote: >> I just mean, can we find like five to ten...
[16:24:10] Analytics: Restore MobileWebSectionUsage_14321266 and MobileWebSectionUsage_15038458 - https://phabricator.wikimedia.org/T123595#1934514 (Nuria) @Tbayer: Given the many issues we have in our data store right now. Hardware: https://phabricator.wikimedia.org/T123546 Replication: https://phabricator.wikimedia...
[16:28:04] Analytics: Pageview API demo doesn't list be-tarask - https://phabricator.wikimedia.org/T119291#1934519 (Milimetric) Speaking of a proper robust tool, check out this class of 10 students that's about to tackle the problem: T120497 !!
[16:28:42] (CR) Nuria: [C: 2] Restrain pep8 to 1.5.x [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/264057 (owner: Hashar)
[16:30:05] Analytics: Track pageview stats for outreach.wikimedia.org - https://phabricator.wikimedia.org/T118987#1934526 (Milimetric) >>! In T118987#1900916, @TFlanagan-WMF wrote: > Thanks, @Nuria. Is the process you mention quick and easy? I'm just thinking ahead if we need to report some pageview numbers for interna...
[16:32:57] Analytics-Kanban, DBA: EL replication having issues since at least January 11th - https://phabricator.wikimedia.org/T123634#1934530 (Nuria) From mobile team: https://gerrit.wikimedia.org/r/263538 was merged Monday 11th so new sampling rate of 0 should be applied to all wikis from tomorrow (Thursday 14th)...
[16:33:17] Analytics-Tech-community-metrics, DevRel-January-2016: Legend for "review time for reviewers" and other strings on repository.html - https://phabricator.wikimedia.org/T103469#1390947 (Lcanasdiaz) Ok, so you want to display the content of the "desc" field for each metric displayed on the box on the left wh...
[16:35:48] Analytics: update comScore description on report card - https://phabricator.wikimedia.org/T122059#1934536 (Milimetric) does anything else need to happen here?
[16:39:27] a-team, FYI, see email I just sent to analytics list. i've disabled public access to yarn.wikimedia.org
[16:39:48] ok ottomata
[16:39:53] Awesome
[16:39:55] oh elukey, btw, you should add pingyness to the 'a-team' keyword
[16:39:55] ottomata: sweet, less holes
[16:39:57] ottomata: any issue ?
[16:39:59] we use that to ping each other
[16:40:07] :)
[16:40:32] cool ottomata
[16:40:33] joal: no, real issue. someone emailed the security list and was worried about it, i actually wasn't aware that the REST API was on the same port as the GUI
[16:40:46] you couldn't do anything more with the API than you could witih the GUI
[16:40:51] makes sense to close it :)
[16:41:02] but, it still wasn't good, it was very easy for me to curl the job history
[16:41:12] i couldn't POST any actions
[16:41:13] it sure was
[16:41:17] but i didn't like that I could try to POST them
[16:41:22] :)
[16:41:44] I think with job history you get 7 days of details infos (like full quesries and stuff)
[16:42:02] ottomata: So to access it now, VPN ?
[16:42:04] full queries I also wasn't aware of, i thought it was just the beginning
[16:42:08] ssh tunnel :/
[16:42:17] MwaaaaAAAAAAArf :(
[16:42:27] unless we put some auth in front of it
[16:42:39] That would be best
[16:42:50] Analytics-Kanban, DBA: EL replication having issues since at least January 11th - https://phabricator.wikimedia.org/T123634#1934539 (Nuria) {F3227882}
[16:42:50] ja there is a ticket
[16:43:05] or we really need something to make hue stable (hue has a job browser)
[16:43:32] ok milimetric tell me about this db read only thing again?
[16:43:37] this is not the EL db, rigth?
[16:43:41] it's not, yes
[16:43:50] it's the mediawiki db slaves
[16:43:58] Hmmm ottomata I thought we could put it behind LDAP. But no?
[16:44:05] and i think it's the old name, but i didn't follow up with how they changed that setup
[16:44:24] madhuvishy: not via hadoop itself
[16:44:27] maybe with varnish, dunno
[16:44:41] though, ottomata some of those slaves have a "log" database in them, which is very weird IMO
[16:45:06] i guess maybe they're all just pointing to analytics-store? :/
[16:47:15] seesh no idea
[16:49:53] * elukey highlights a-team
[16:50:07] * elukey probably pinged everybody
[16:50:07] hmmm milimetric dunno, eventlogging_sync runs on this slave too?
[16:50:09] :)
[16:50:31] interesting...
[16:50:41] it's not just one slave, it's all s*-analytics-slave(s)
[16:50:50] there's like s1, s2, ..., s9 I think
[16:51:11] so we're lost without Jaime...
[16:51:13] hm
[16:51:50] why the heck does eventlogging get replicated to all these slaves?!
[16:55:29] oh hm
[16:56:23] oh
[16:56:25] wha?
[16:56:35] milimetric: s1-analytics-slave is one of 2 m4-masters....
[16:56:44] ....
[16:56:45] that are proxied to round robin by haproxy
[16:57:05] er.... so Evan's script was writing to a prod db?!!
[16:57:53] no um
[16:58:13] i *think* 'm4-master' is an alias for eventlogging prod db, but it is actually spread across two dbs for inserts
[16:58:21] and then this custom script selects from both of them
[16:58:26] and inserts into analytics-store
[16:58:43] * elukey is confused
[16:58:49] so, eventlogging inserts into the log db on a prod slave
[16:58:57] elukey is not confused, the setup is confused, elukey is normal
[16:59:00] and then stuff from all prod slaves is replicated to analytics-store
[16:59:10] oof
[16:59:13] i have no idea though
[16:59:16] i can't find any docs about this
[16:59:35] i'm about 65% that's how things are
[17:01:09] standuuuppp
[17:01:13] ping ottomata madhuvishy standdddupppp
[17:02:17] wha thaaa is going on?
[17:02:17] ok!
[17:04:48] Analytics-Kanban, Patch-For-Review: Add install instructions to script that calculates kanban metrics [1] - https://phabricator.wikimedia.org/T122626#1934565 (Nuria) Open>Resolved
[17:05:56] Analytics-Kanban: Quaterly review 2016/01/22 (slides due on 19th) - https://phabricator.wikimedia.org/T120844#1934568 (Nuria)
[17:05:58] Analytics-Kanban: Gather metrics about cluster usage {hawk} [5 pts] - https://phabricator.wikimedia.org/T121783#1934567 (Nuria) Open>Resolved
[17:08:31] Analytics-Kanban: Investigate sample cube pageview_count vs unsampled log pageview count [13 pts] {hawk} - https://phabricator.wikimedia.org/T108925#1934579 (Tbayer) Update: So @JAllemandou was in SF last week for the Dev Summit and All Hands, and he and I took the opportunity on Friday to sit down in person...
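[Editor's note] The topology ottomata describes above at ~65% confidence (inserts spread across two m4-master databases, with a custom script selecting from both and copying into analytics-store) can be sketched generically. This is an illustration of that described shape, not the real eventlogging_sync code; all table and variable names are hypothetical.

```python
def sync_to_store(master_a, master_b, store):
    """Merge event rows from two insert masters into one analytics store,
    deduplicating on uuid. Each event carries a uuid assigned at insert
    time, so the same event should only ever appear on one master, and
    re-running the sync is idempotent."""
    seen = {row["uuid"] for row in store}
    for source in (master_a, master_b):
        for row in source:
            if row["uuid"] not in seen:
                store.append(row)
                seen.add(row["uuid"])
    return store

# Hypothetical event rows as they might look in a 'log' database table.
a = [{"uuid": "u1", "event": "click"}, {"uuid": "u2", "event": "view"}]
b = [{"uuid": "u3", "event": "view"}]
store = sync_to_store(a, b, [{"uuid": "u1", "event": "click"}])
print(len(store))  # 3: u1 was already present, u2 and u3 are copied over
```

Keying on uuid is what makes a setup like this tolerate re-runs and overlapping dumps; without it, syncing from two masters would duplicate rows.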
[17:09:49] Analytics-Kanban, Patch-For-Review: Add piwiki beacon to financial report website [5] - https://phabricator.wikimedia.org/T123263#1934586 (Nuria) Open>Resolved
[17:10:24] Analytics-Kanban, Patch-For-Review: Piwik beacon on prod instance should be accessible [5 pts] - https://phabricator.wikimedia.org/T123260#1934589 (Nuria) Open>Resolved
[17:10:41] Analytics-Kanban: Quaterly review 2016/01/22 (slides due on 19th) - https://phabricator.wikimedia.org/T120844#1934592 (Nuria)
[17:10:43] Analytics-Kanban: Gather preliminary metrics of Pageview API usage for quaterly review {slug} [5pts] - https://phabricator.wikimedia.org/T120845#1934591 (Nuria) Open>Resolved
[17:26:06] Analytics: Restore MobileWebSectionUsage_14321266 and MobileWebSectionUsage_15038458 - https://phabricator.wikimedia.org/T123595#1934622 (Tbayer) Understood that these are timely and severe issues; I really appreciate the Analytics' team's hard work on fixing them. I should check with other stakeholders to b...
[17:30:01] Analytics: Restore MobileWebSectionUsage_14321266 and MobileWebSectionUsage_15038458 - https://phabricator.wikimedia.org/T123595#1934628 (Nuria) >would that be that a realistic timeframe for restoring them? Before adding more data we need to do the tokudb conversion, once we have an ETA for that we will u...
[17:35:04] Analytics-Tech-community-metrics, DevRel-January-2016: Legend for "review time for reviewers" and other strings on repository.html - https://phabricator.wikimedia.org/T103469#1934642 (Aklapper) @Lcanasdiaz: That was my idea, indeed. But please tell me if it does not make sense or if it is way too complica...
[17:39:32] Analytics-Tech-community-metrics, DevRel-January-2016, Easy, Google-Code-In-2015, Patch-For-Review: Clarify Demographics definitions on korma (Attracted vs. time served; retained) - https://phabricator.wikimedia.org/T97117#1934662 (Nemo_bis) > No idea how not mentioning additional terms helps eit...
[17:40:34] Analytics: Find performance thresholds of piwik production instance - https://phabricator.wikimedia.org/T123640#1934665 (Nuria) NEW
[17:58:24] madhuvishy: jfyi, I'm now collecting data for https://meta.wikimedia.org/wiki/Schema:CommandInvocation
[17:58:32] about 50k events in 17h and steady state
[17:58:44] a-team, will be 5 or 10 late to grooming, ok?
[17:58:55] am running simple queries (select count(*)) on m4 master directly but will make sure to not overdo it
[17:58:57] np
[17:58:57] YuviPanda: okay - replication lag etc so data won't show up in slaves for a while
[17:59:01] * YuviPanda picks out lice from ottomata
[17:59:28] * milimetric watches in horror as YuviPanda starts to eat the lice
[17:59:29] YuviPanda: can you add the SchemaDoc template to your talk?
[17:59:31] madhuvishy: yeah, I'm just checking master to verify
[17:59:39] madhuvishy: what's the SchemaDoc template?
[17:59:51] I guess I can find out :D
[17:59:53] ok
[18:00:08] https://meta.wikimedia.org/wiki/Template:SchemaDoc
[18:00:11] YuviPanda: ^
[18:00:19] yup
[18:00:21] adding
[18:00:23] now
[18:00:27] I wasn't aware of this
[18:00:29] https://meta.wikimedia.org/wiki/Schema_talk:Echo
[18:00:30] cool
[18:13:55] Analytics, operations, ops-eqiad: Possible bad mem chip or slot on dbproxy1004 - https://phabricator.wikimedia.org/T123546#1934748 (Cmjohnson) The server is out of warranty but I have several spare DIMM for the R610's on-site. It appears that DIMM A3 is bad and needs to be replaced. I will need abou...
[18:14:15] Analytics, Project-Creators: Dedicated and/or automated Wikimedia pageviews API project/tag in Phabricator Maniphest [1 pts] - https://phabricator.wikimedia.org/T119151#1934749 (Milimetric) a:madhuvishy
[18:14:21] ottomata: bam, ^ might explain thing!
[18:15:32] Analytics: Productionize last access jobs for daily and monthly calculations {bear} - https://phabricator.wikimedia.org/T122514#1934753 (Milimetric) p:Triage>High
[18:16:00] YuviPanda: the mem chip?
[18:16:01] Analytics-Kanban: Productionize last access jobs for daily and monthly calculations {bear} - https://phabricator.wikimedia.org/T122514#1906263 (Milimetric)
[18:16:06] yeah
[18:16:29] Analytics: Productionize last access jobs for daily and monthly calculations {bear} - https://phabricator.wikimedia.org/T122514#1906263 (Milimetric)
[18:17:07] ottomata: I think you can failover to use a different dbproxy first
[18:17:44] Analytics, operations, ops-eqiad: Possible bad mem chip or slot on dbproxy1004 - https://phabricator.wikimedia.org/T123546#1934760 (Nuria) @ottomata: can you coordinate a 5 minutes outage today?
[18:20:36] Analytics, Analytics-Cluster, Patch-For-Review: Single Kafka partition replica periodically lags - https://phabricator.wikimedia.org/T121407#1934764 (Milimetric) a:Ottomata>elukey
[18:21:18] Analytics-Tech-community-metrics: Mismatch between six names and certain email address in mediawiki-identities data - https://phabricator.wikimedia.org/T123643#1934774 (Aklapper) NEW
[18:21:47] Analytics: Pageview API demo doesn't list be-tarask - https://phabricator.wikimedia.org/T119291#1934787 (Ironholds_backup) It's worth noting that the API itself does include be-tarask data (I just checked it). Agreed on the need for a stats.grok.se replacement. Began idly noodling on one using the stats.grok...
[18:28:39] Analytics: Create Pageview API dashboard to monitor response times - https://phabricator.wikimedia.org/T121277#1934817 (Milimetric) a:GWicke
[18:28:49] Analytics: Create Pageview API dashboard to monitor response times - https://phabricator.wikimedia.org/T121277#1934819 (Milimetric) Open>Resolved Done thanks to Gabriel, re-assigning. https://grafana.wikimedia.org/dashboard/db/pageviews
[18:29:12] Analytics: 'is_spider' column in eventlogging user agent data {flea} - https://phabricator.wikimedia.org/T121550#1934823 (JAllemandou) The easiest way to add user-agent refinement to eventlogging would be to use the refinery code through hive or spark on eventlogging logged into hadoop.
[18:29:41] Analytics: 'is_spider' column in eventlogging user agent data {flea} - https://phabricator.wikimedia.org/T121550#1934826 (Nuria) Adding this column to the capsule requires work on the EL mysql database end of things which is having a lot of issues right now (as a new column needs to be added to every single...
[18:30:00] Analytics: Use a new approach to compute monthly top 1000 articles (brute force probably works) - https://phabricator.wikimedia.org/T120113#1934827 (Milimetric)
[18:31:11] ottomata: https://phabricator.wikimedia.org/T114199 - Educational task? What do you think??
[18:31:45] (we all love systemd)
[18:31:59] heheh, yeah! hm.
[18:32:01] could be good.
[18:32:08] tricky though, because eventlogging is currently all about upstart
[18:32:16] but ja
[18:32:20] * milimetric out for lunch
[18:32:45] elukey: the newish eventlogging-service (for eventbus) has been puppetized for systemd and jessie
[18:33:16] so we'll probably want to model the other daemons after that
[18:33:17] maybe.
[18:33:20] if I did it right :)
[18:33:43] Analytics, operations, ops-eqiad: Possible bad mem chip or slot on dbproxy1004 - https://phabricator.wikimedia.org/T123546#1934841 (Ottomata) Eeee, I'm not so sure. Are we sure eventlogging is the only user of m4-master?
[18:34:24] ottomata1: I'll need to familiarize with event logging first (and its new incarnations) so it might be really good for me
[18:35:01] otherwise if it is already in your plans I'll be glad if you CC me :)
[18:37:50] elukey: it is, but it isn't first priority
[18:38:01] elukey: i recently merged some eventlogging docker stuff
[18:38:08] :O
[18:38:19] if you get docker set up and check out recen tmaster, you'll be able to run eventlogging in a docker instance
[18:39:17] ottomata1: nuria I was talking to YuviPanda yesterday and we were wondering if we should make the wikimetrics dev setup docker too, instead on mw vagrant - because there's no real mediawiki dependency
[18:39:20] milimetric: i have no idea about the s*-analytics slaves being in read only
[18:39:32] madhuvishy: not a bad idea
[18:39:37] especially if only for development purposes
[18:39:42] ya just for dev
[18:39:43] although
[18:39:48] doesn't wikimetrics query mw dbs?
[18:40:07] yeah - YuviPanda says we can just use the labs replica directly
[18:40:59] ottomata: i don't know if it conflicts with the way it's currently setup, but seems feasible
[18:41:59] madhuvishy: what would change in the move between Vagrat to Docker only for dev purposes? Just curious
[18:42:12] neilpquinn: yt?
[18:42:21] It would be great to have the container even for Labs
[18:42:28] the labs replica is queriable from outside of labs?!
[18:42:49] elukey: we'd just stop using the mw vagrant puppet setup
[18:42:51] ottomata: kindof.
[18:42:53] ottomata: ssh tunnels
[18:42:57] ottomata: and I want to do it anyway
[18:42:59] ah
[18:43:06] since that'll make lcoal development for tools folks too far easier
[18:43:21] nuria: do you know who neil is? i want to contact him about this query
[18:43:28] i'm really not sure what we should do
[18:43:32] ottomata: i know neil
[18:43:37] oh?
[18:43:48] he's a product analyst in editing
[18:44:01] is it insane to just ask that no one do long queries on anatlyics-store until next week and jaime can address these issues?
[18:44:53] he has an select/insert query on analytics-store running for the last 20 hours
[18:44:57] YuviPanda: if you have time today/tomorrow it would be great to see the difference, I am still completely ignorant (about current dev cycle in wikimedia)
[18:45:00] ottomata: oh
[18:45:18] likely not the source of hte problems
[18:45:22] but it surely isn't helping
[18:46:10] ottomata: I usually just kill it (on labsdb at least)
[18:46:15] ottomata: we've been having replag too
[18:46:25] usually just 'KILL ALL THE QUERIES, FIND PEOPEL RUNNING QUERY AND STOP THEM'
[18:46:27] fixes it
[18:46:36] ottomata: yes ping neilpquinn
[18:46:57] ottomata: i think asking that should be fine
[18:46:57] elukey: sure! I wasn't going to come to office today but can come by in the afternoon to chat
[18:47:06] ottomata: given that those queries are not likely to succeed
[18:47:08] even tomorrow, don't worry :)
[18:47:40] elukey: i don't think i'm coming today too. YuviPanda lets do it tomorrow?
[18:47:49] madhuvishy: sure
[18:47:53] what is 'it' though?
[18:48:05] hello mforns. :-) I have left a comment about my assessment of the pageview stats tool (from the community wishlist) and mentioned the demo you have worked on. just fyi: https://meta.wikimedia.org/wiki/Talk:2015_Community_Wishlist_Survey/Top_10/Status_draft
[18:48:06] YuviPanda: he he docker things
[18:48:16] * leila says hi to folks in this channel.
[18:48:21] madhuvishy: ah ok
[18:48:22] hiii!
[18:48:44] leila: o/
[18:51:16] ottomata: I agree with YuviPanda on stopping queries that are not likely to work I mean, what else can we do?
[18:52:50] ottomata: pinging neilpquinn again ...
[18:53:05] yeah, plus I kill them first and then tell people
[18:53:07] :D
[18:53:11] rather than the other way around
[18:58:01] milimetric:
[18:58:18] nuria: can you think of any reason why this custom replication script would need to have records inserted in order of uuid?
[18:58:39] hmmMmMM
[18:58:45] maybe I should limit hte mysqldump query that this thing is doing
[18:58:50] to like 10000 ro 1000000 rose
[18:58:52] rows
[18:59:59] ottomata: it used to be we had an autoincrement on the tables that is no longer there , maybe the uuid order is from that?
[19:00:20] ottomata: cause no, there is no need to preserve uuid order that i can think of
[19:00:21] it might just be what mysqldump does by default
[19:00:30] this is for replicaiton though, so auto increment won't matter
[19:00:36] this is just the query the mysqldump does to grab the data
[19:00:40] so it does the inserts in order
[19:00:48] but uuid is already set in the master
[19:00:51] so it shouldn't matter
[19:00:53] that should help speed up
[19:01:04] as well as maybe limiting the number of records inserted at a time
[19:01:44] Ironholds: good news
[19:01:49] i special cased your table and ran a sync
[19:01:54] good news?
[19:01:55] WikipediaPortal_14377354 is up to date now
[19:02:00] ooohhhh
[19:02:03] ja, its just that big tables are blocking small ones
[19:02:06] since htey have so many records
[19:02:19] i'm going to try limiting the number of records batched by the mysqldump and see if it helps
[19:02:28] ottomata, thanks!
[19:02:51] will that persist, or?
[19:02:56] IOW, will my scripts break again tomorrow? ;p
[19:02:59] its up to date as of now
[19:03:03] i just ran a sync specially for your table
[19:03:10] the main sync is still busted blocking on other large tables
[19:03:15] i'm going to try to fix that
[19:03:24] so that those large tables might lag, but smaller ones won't
[19:03:27] not sure how it will go, but will try
[19:03:38] oh, weird meeting happening now, ja?
[19:03:43] gotta put that on in backrounnnd
[19:03:44] :)
[19:03:52] pffft ottomata join the meetingless :P
[19:05:19] ottomata: 4 years!
[19:06:38] woo ottomata!
[19:06:56] * YuviPanda is also 4y but won't count because contractor for part of it
[19:07:16] :)
[19:07:29] YuviPanda: I think it should? Talk to HR.
[19:07:41] James_F: I was also part time for another 5 months afterwards.
[19:07:51] when I was in uni
[19:07:55] but yeah, maybe I should
[19:08:15] YuviPanda: Correcting 'start' dates is a continuing effort.
[19:08:37] :D
[19:08:42] I'll talk to them
[19:08:46] thanks for the poke, etc :D
[19:09:18] I was also theoretically not working for the WMF for a few months in the middle (but wrote the Commons app at that time :P) but not sure if that's a leave of absence or not
[19:09:20] oh well
[19:11:30] wait whahhhh
[19:11:37] i thought i deleted on the MobileWebSectionUsage_15038458 tables nuria
[19:11:38] apparently not
[19:11:43] one of the masters (that i didn't know existed)
[19:11:47] whatata?
[19:11:48] still has it
[19:11:56] and it is being selected from and inserted back into
[19:12:12] inserted?
[19:12:16] ottomata: how so?
[19:12:28] ottomata: selected maybe
[19:12:33] ottomata: but inserted?
[19:12:34] dunno, i'm still confused by which databases are where
[19:12:38] i thought there were two
[19:12:40] but there are 3
[19:12:44] ottomata: what timestamps do inserts have?
[19:12:45] 2 of which are 'm4-master'
[19:12:55] 20160113192418 on analytlics-storre
[19:13:09] the dump processes are selecitng LOTS of data from db1046
[19:13:10] ottomata: so that is yesterday's
[19:13:22] i think that is hte real master
[19:13:29] yes
[19:13:32] db1046 for writes
[19:13:40] 1046 is teh master for writes
[19:13:43] db1047 is another slave?
[19:13:48] but also called m4-master?
[19:13:58] ?
[19:14:04] no idea
[19:14:30] [@dbproxy1004:/home/otto] $ cat /etc/haproxy/conf.d/db-master.cfg
[19:14:30] listen mariadb 0.0.0.0:3306
[19:14:30]     mode tcp
[19:14:30]     balance roundrobin
[19:14:30]     option tcpka
[19:14:31]     option mysql-check user haproxy
[19:14:31]     server db1046 10.64.16.35 check inter 3s fall 3 rise 99999999
[19:14:32]     server db1047 10.64.16.36 check backup
[19:15:14] let me see , i ahd : https://grafana.wikimedia.org/dashboard/db/server-board?from=1446931397231&to=1449523157231&var-server=db1046&var-network=eth0
[19:15:17] for master
[19:15:38] what is db1047, why is it running eventlogging_sync, and why does haproxy to m4-master include it in configs?
[19:16:05] ottomata, is there a phab task for these problems? Tomasz wants something to subscribe to
[19:16:21] ottomata: looking at graphs teh one working is 1046
[19:16:34] Ironholds: yes. https://phabricator.wikimedia.org/T123634#1934492
[19:17:12] ta!
[19:17:25] ok, i'm going to stop those queries and make sure this table is gone, we can talk about bringing it back later
[19:17:30] but re-replicating it all now is not good
[19:17:55] ottomata: agreed
[19:19:13] ottomata: if you look at graphana definitely 1047 is receiving cyclic updates: https://grafana.wikimedia.org/dashboard/db/server-board?var-server=db1047&var-network=eth0
[19:23:42] PROBLEM - Host erbium is DOWN: PING CRITICAL - Packet loss = 100%
[19:24:09] ottomata: is that you --^
[19:24:14] ?
[19:24:25] joal: i doubt it
[19:24:30] hmm
[19:24:38] nuria: what do we have on erbium ?
[19:24:39] joal: what's erbium ? wasn't it udp2log?
[19:24:49] no, but erbium is decommed
[19:24:49] hmmm, synchronicity :)
[19:24:55] probably someone just taking it down?
[19:25:01] ok ...
[19:25:09] joal: ya, https://wikitech.wikimedia.org/wiki/Erbium
[19:25:27] joal: so it is dying in fron of our eyes
[19:25:31] *in front
[19:26:07] ottomata must be happy :)
[19:26:21] ops channel have noticed as well
[19:27:25] leila, just read your line, thx!
[19:27:41] sure, mforns.
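[Editor's note] The haproxy stanza pasted above is less "round robin" than its `balance roundrobin` line suggests: db1046 is the only active server, and db1047 is marked `backup`, so it only receives traffic while db1046's health check has it marked down (and with `rise 99999999`, a downed db1046 would effectively never be readmitted automatically). A minimal model of that selection behavior, purely illustrative and not haproxy internals:

```python
def pick_backend(health):
    """Choose a backend the way the pasted stanza behaves: prefer any
    healthy non-backup server; fall back to a healthy 'backup' server
    only when no active server is up; return None if nothing is up."""
    active = [name for name, meta in health.items()
              if not meta["backup"] and meta["up"]]
    if active:
        return active[0]
    backups = [name for name, meta in health.items()
               if meta["backup"] and meta["up"]]
    return backups[0] if backups else None

health = {
    "db1046": {"up": True, "backup": False},
    "db1047": {"up": True, "backup": True},
}
print(pick_backend(health))   # db1046 while it is healthy
health["db1046"]["up"] = False
print(pick_backend(health))   # db1047 takes over only on failover
```

Read this way, the config answers part of the channel's confusion: db1047 is not load-shared with db1046, it is a failover target that happens to also be reachable under the m4-master name.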
[19:50:56] (CR) Hashar: "recheck" [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/263782 (owner: Madhuvishy) [19:51:17] hashar: thank you :) [19:51:29] madhuvishy: hello :-} [19:51:43] hashar: hi! [19:52:00] the new pep8 version is causing a bunch of errors on most of our python projects :-( [19:52:23] luckily we have some volunteers that set the upper version limit and/or fix the new linting errors :D [19:52:43] hashar: hmmm :/ should I change anything on wikimetrics? [19:53:55] madhuvishy: it got fixed via https://gerrit.wikimedia.org/r/#/c/264057/1/tox.ini [19:54:18] and your patch that was previously failing ( https://gerrit.wikimedia.org/r/#/c/263782/ ) is now passing [19:54:25] hashar: oh cool! awesome, thanks :) [19:54:26] (I triggered the job by commenting 'recheck' in Gerrit) [19:54:34] yeah, i saw that [19:54:37] which tests the patch against the tip of the branch after it got merged [19:55:00] every morning I look at all patches that have been rejected by Jenkins :-D [19:55:29] (CR) Hashar: "recheck" [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/263911 (owner: Wassan.anmol) [19:55:31] !log restarted eventlogging_sync script to insert batches of 1000 [19:55:33] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log, Master [19:55:43] hashar: i was gonna ping you myself but so many meeting [19:55:45] meetins [19:55:56] gah i can't type [19:56:31] madhuvishy: I left comments on the wikimetrics change, mostly minor [19:58:11] YuviPanda: thanks [19:58:16] madhuvishy: or you can ping folks in #wikimedia-releng as well :} [19:58:24] will do :) [19:58:39] YuviPanda: if you have time tomorrow I'd also like to talk about how to push "secrets" in labs to mimic as much as possible Prod (not this specific use case) [19:58:58] elukey: that one :D [19:59:00] elukey: sure [19:59:12] hmm, ja interesting nuria, the full MobileWebSectionUsage tables were not removed from db1047 [19:59:19] I think current consensus is 'you can
not replicate prod without hosting your own puppetmaster, and hosting your own puppetmaster makes YuviPanda sad' [19:59:30] ottomata: I'm so sorry, I didn't see the notifications on my phone! [19:59:32] haahhaha [19:59:45] yeah I don't want to make Yuvi sad [19:59:46] neilpquinn: its ok, i haven't killed your query :) [19:59:51] bad ottomata [19:59:52] just have been thinking about it [19:59:53] baad [20:00:01] but it would be great to have a "standard" process [20:00:06] must kill queries first before asking people, how else do you get them to loathe you? [20:00:08] elukey: I agree. [20:00:19] elukey: this is the first time we're using a 'private'ish gerrit repo [20:00:28] elukey: labs in general isn't supposed to have really private data [20:00:31] for example, we could use the "fake" private repo and then use a common solution to replace the placeholders with "secrets" in some way [20:00:36] ottomata: that query is for generating quarterly review metrics so it's moderately important. but the sky won't fall if you kill it. so if I'm causing a server meltdown feel free to do it. [20:00:37] nuria: , so we could possibly restore the old data from db1047, instead of backfilling it all from files [20:00:39] yep yep [20:00:43] they look like MyISAM tables (I think???) [20:01:04] oh, no [20:01:05] Aria? [20:01:18] elukey: yeah, problem is that you have to only give access to instances from a particular project [20:01:26] so, we could probably just shut down the dbs, copy the files over [20:01:28] elukey: and that's a hard problem to do totally right, since people have root on those machines and can lie [20:01:29] anyway thanks for not killing it! [20:01:29] iunno if it is worth it though [20:01:55] neilpquinn: how long do you think it will run?
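For readers following the "should we kill neilpquinn's query" thread: on MariaDB/MySQL this is normally done from another session, and you can kill just the statement without dropping the client's connection. A minimal sketch (the id 12345 is a placeholder, not a real process id from this incident):

```sql
-- List running statements and their ids; long Time values are the suspects
SHOW FULL PROCESSLIST;

-- Terminate only the statement, keeping the connection open
KILL QUERY 12345;

-- Or terminate the whole connection
KILL 12345;
```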
[20:03:21] YuviPanda: yep :( [20:03:57] elukey: in the Glorious Future(TM) projects like this will end up being on kubernetes, which has built in secret management [20:04:45] looking forward to it :) [20:05:36] ottomata: I'm honestly not sure—I didn't expect it to take this long. I imagine you've looked at the full query? [20:06:49] Analytics-Kanban, DBA, Patch-For-Review: EL replication having issues since at least January 11th - https://phabricator.wikimedia.org/T123634#1935163 (Ottomata) Ok, I've modified the eventlogging_sync.sh script to do the custom mysqldump | insert based replication in batches of 1000 rows. This means t... [20:07:17] neilpquinn: not in detail [20:11:12] madhuvishy: do you have a few minutes to help me with https://phabricator.wikimedia.org/T120900 [20:11:42] * madhuvishy looks [20:11:56] YuviPanda: ummm sure [20:12:23] madhuvishy: ok, so... [20:12:29] madhuvishy: they are all in the 'analytics' project, right? [20:12:44] YuviPanda: yes [20:12:49] so if you look at https://wikitech.wikimedia.org/wiki/Hiera:Tools [20:12:53] and we probably dont need it for wikimetrics [20:12:56] you see how valhallasw and scfc have added their root key [20:12:58] right [20:13:00] but for limn :D [20:13:03] right [20:13:06] which is still self hosted I think [20:13:12] anyway [20:13:15] yup [20:13:20] add a key called "passwords::root::extra_keys": [20:13:24] with the value being a dict [20:13:27] with username: ssh key [20:13:35] should i make a new key, or put my labs one? 
[20:13:39] to https://wikitech.wikimedia.org/wiki/Hiera:Analytics [20:13:43] madhuvishy: it's ok to put in your labs one [20:13:50] YuviPanda: okay [20:13:53] i'll add [20:14:11] madhuvishy: then need to run puppet on the instance to see if it worked [20:14:18] okay [20:14:21] and if not we need to just add the key manually (I can help with that too) [20:14:28] mm hmmm [20:14:34] give me a minute to add this [20:14:42] madhuvishy: and if/when you change your labs key you should remember to change this too [20:14:45] madhuvishy: thanks! [20:14:50] okay [20:14:52] this isn't as big a deal for the wikimetrics project [20:14:56] because it's fully puppetized [20:14:58] ottomata: how would we restore data from 1047? [20:15:02] and so we (labs roots) can fix underlying issues [20:15:07] drops must be propagated there too right? [20:15:30] no [20:15:33] drops? [20:15:46] "drop table" [20:15:51] naw [20:15:52] it's not a slave [20:15:53] Analytics-Tech-community-metrics, DevRel-January-2016: gerrit_review_queue can have incorrect data about patchsets "waiting for review" - https://phabricator.wikimedia.org/T121495#1935183 (Aklapper) @Lcanasdiaz: Pull request to fix this task created in https://github.com/Bitergia/mediawiki-dashboa...
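The Hiera edit being walked through above ("add a key called `passwords::root::extra_keys` with the value being a dict with username: ssh key") would look roughly like this on the Hiera:Analytics wiki page. The key material below is a made-up placeholder, not anyone's real key:

```yaml
# Hypothetical Hiera:Analytics entry; each item maps a username to the ssh
# public key that puppet will append to root's authorized keys.
"passwords::root::extra_keys":
  madhuvishy: "ssh-rsa AAAAB3NzaC1yc2E...placeholder madhuvishy@labs"
  milimetric: "ssh-rsa AAAAB3NzaC1yc2E...placeholder milimetric@labs"
```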
[20:15:56] log is not replicated like that [20:16:11] and i didn't run drop on this box [20:16:18] because i didn't know it existed yesterday [20:16:20] ottomata: ok, I know even less about this than i thought [20:16:23] this box is kinda like analytics-store [20:16:28] log is replicated there in the same way [20:16:43] Analytics-Tech-community-metrics, DevRel-January-2016: gerrit_review_queue's "waiting for review" column name misleading (also includes unmerged CR+1 patches) - https://phabricator.wikimedia.org/T121495#1935190 (Aklapper) a:Lcanasdiaz>Aklapper [20:17:01] Analytics-Tech-community-metrics, DevRel-January-2016, Patch-For-Review: gerrit_review_queue's "waiting for review" column name misleading (also includes unmerged CR+1 patches) - https://phabricator.wikimedia.org/T121495#1880244 (Aklapper) [20:17:22] can you run : SELECT table_name, (DATA_LENGTH + INDEX_LENGTH)/1024/1024/1024 as `TOTAL SIZE (GB)`, ENGINE, CREATE_OPTIONS FROM information_schema.tables WHERE TABLE_SCHEMA='log' /* AND `ENGINE` <> 'TokuDB' */ ORDER BY (DATA_LENGTH + INDEX_LENGTH) DESC LIMIT 30; [20:17:42] ottomata: and paste the results of that on 1047 on the ticket ? [20:17:52] ottomata: that way we get a glimpse of how big tables are [20:18:18] YuviPanda: done - should i run puppet on limn1? [20:18:22] madhuvishy: yup [20:18:25] ok [20:18:36] nuria [20:18:37] you want all tables? [20:18:45] you have a comment in there to not select tokudb [20:18:49] do you want to not select? [20:18:57] ottomata: that will get you just the 30 biggest [20:19:39] done [20:19:42] Analytics-Kanban, DBA, Patch-For-Review: EL replication having issues since at least January 11th - https://phabricator.wikimedia.org/T123634#1935198 (Ottomata) db1047(???) big tables: ``` MariaDB EVENTLOGGING m4 localhost log > SELECT table_name, (DATA_LENGTH + INDEX_LENGTH)/1024/1024/1024 as `TOT... [20:19:46] YuviPanda: i cannot get into limn1 :/ [20:19:53] hahaha [20:19:55] nice [20:20:50] ottomata: okay.
I've been reading scrollback, but I don't fully understand what the issue is or how helpful canceling my query would be. So if you just keep in mind that this is fairly (but not massively) important, I leave whether to cancel it up to you. If you think it's necessary, I trust you. Let me know if you need more info (e.g. to gauge its [20:20:50] importance). [20:21:12] madhuvishy: yeah, puppet has been broken for months [20:21:36] YuviPanda: hmm :/ [20:22:03] oh, I just saw the Phab ticket. That might clarify. [20:22:11] madhuvishy: try sshing in as root now [20:22:13] should work [20:22:18] I've added your and milimetric's key to it [20:22:24] YuviPanda: okay trying [20:22:25] but the instance itself is in many ways unrecoverable otherwise [20:22:35] elukey: ^ is why self hosted puppetmasters make me sad [20:22:50] YuviPanda: ya i can get in [20:22:59] madhuvishy: cool. [20:23:02] hopefully we can get limn out of self hosted soon as well [20:23:15] madhuvishy: I want to show you how to add other people to root keys [20:23:24] sure [20:23:31] madhuvishy: on a working labs instance, you can use ssh-key-ldap-lookup [20:23:33] to get their keys [20:23:37] madhuvishy: and then you can add it to [20:23:40] /etc/ssh/userkeys/root [20:23:43] and that's it [20:23:47] that's the same thing adding it to Hiera: does [20:23:57] when you have working puppet [20:23:58] okay [20:24:16] do i need to be root to edit userkeys/root? [20:24:20] madhuvishy: feel free to add others in your team to root key in hiera too ( elukey maybe?) so you're less reliant on us :) [20:24:22] madhuvishy: yes [20:24:25] madhuvishy: well, or have sudo [20:24:37] madhuvishy: but usually the root key manual change is only needed when puppet is completely broken [20:24:38] as is the case now [20:24:45] okay - don't we all have sudo on labs instances we can get to?
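A minimal sketch of the manual fallback described above: look up a user's public key and append it to the instance's root key file. `ssh-key-ldap-lookup` is the wikitech tool named in the chat; it is stubbed here with a fake key, and a temp file stands in for `/etc/ssh/userkeys/root`, so the sketch runs anywhere.

```shell
# On a real instance, as root, the whole fix is roughly:
#   ssh-key-ldap-lookup <user> >> /etc/ssh/userkeys/root
# The stub and temp file below only make the flow demonstrable offline.
ssh_key_ldap_lookup() { printf 'ssh-rsa AAAAB3...fakekey %s@wikimedia\n' "$1"; }

keyfile=$(mktemp)                        # stands in for /etc/ssh/userkeys/root
ssh_key_ldap_lookup milimetric >> "$keyfile"
ssh_key_ldap_lookup madhuvishy >> "$keyfile"
wc -l < "$keyfile"                       # one line per appended key
```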
[20:24:45] 90% of the time just hiera is good enough [20:24:47] right [20:24:53] makes sense [20:25:02] this is just the emergency manual fix [20:25:05] cool [20:25:11] i'll add anyone who needs to be, for our self hosted ones [20:25:14] madhuvishy: can you add milimetric's key to the analytics hiera? [20:25:19] and then I can call that ticket closed :D [20:25:23] doing [20:26:17] YuviPanda: done [20:26:18] neilpquinn: ok thanks [20:26:20] i think i don't need to [20:26:24] thx both [20:26:32] milimetric: \o/ thanks [20:26:35] err [20:26:38] madhuvishy: ^ \o/ thanks [20:26:41] YuviPanda: also fixed puppet stuff [20:26:47] based on your CR [20:26:59] madhuvishy: yup [20:27:08] madhuvishy: shall I merge now or do you want someone to merge the deploy patch first? [20:27:38] YuviPanda: those are independent i think - i wanna make sure the submodule will go away though [20:27:49] ok [20:27:51] yeah good point [20:28:14] I removed it from .gitmodules [20:28:16] ottomata: you've merged patches removing submodules before, right? [20:28:21] madhuvishy: I didn't follow 100%, but you're working on limn1? [20:28:28] milimetric: no [20:28:33] we got root [20:28:38] oh ok, good [20:28:47] but you're not changing the self-hosted setup there, right? [20:28:54] YuviPanda: yes [20:28:56] and hopefully in the near future we can kill its self hosted ness [20:28:58] noo [20:28:58] that instance has a few things that need to get manually migrated before we do that [20:29:04] i'm not touching it [20:29:07] k [20:29:26] ottomata: can you take a look at https://gerrit.wikimedia.org/r/#/c/260687/10 to see if it's doing it right?
[20:30:26] YuviPanda: if not we can first merge a separate remove submodule commit, and then do this [20:30:45] madhuvishy: it's all the same I think, this is probably easier anyway to do it in one go [20:30:52] okay [20:30:58] if it works i'm for it [20:31:01] ok [20:31:12] let's try to get it merged today then [20:31:57] milimetric: will you have time to review the fabric part? elukey did one pass yesterday, but it would be good for you to look and merge it [20:32:19] sure, looking [20:32:29] YuviPanda: , i think that should do it [20:32:31] milimetric: can you also merge https://gerrit.wikimedia.org/r/#/c/263782/ minor path change in wsgi file [20:32:40] ottomata: ok! [20:32:43] although i might do the removal of the submodule as a separate commit [20:32:47] just in case gerrit or something gets confused [20:32:55] (PS2) Milimetric: Change config path in wsgi file [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/263782 (owner: Madhuvishy) [20:32:58] it'll get confused anyway right? [20:33:02] (CR) Milimetric: [C: 2 V: 2] Change config path in wsgi file [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/263782 (owner: Madhuvishy) [20:33:03] iunno [20:33:07] milimetric: thanks [20:33:08] teehee :D [20:33:18] ottomata: i'm going to merge now and see if anything gets confused.
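Background on the "remove submodule" worry above: deleting only the `.gitmodules` entry leaves the checked-out files behind, which is exactly the problem being discussed. A sketch of the full removal in throwaway repos (repo names and the `protocol.file.allow` override are just to make the demo self-contained; newer git versions block file-protocol submodules by default, older ones ignore the setting):

```shell
set -e
work=$(mktemp -d)
git init -q "$work/sub"
git -C "$work/sub" -c user.email=a@b -c user.name=t commit -q --allow-empty -m init
git init -q "$work/main"
cd "$work/main"
git -c protocol.file.allow=always submodule --quiet add "$work/sub" vendor
git -c user.email=a@b -c user.name=t commit -q -m "add submodule"
git rm -q vendor                 # drops the gitlink AND the .gitmodules entry
rm -rf .git/modules/vendor       # clears git's cached clone of the submodule
git -c user.email=a@b -c user.name=t commit -q -m "remove submodule"
test ! -e vendor && echo "submodule fully removed"
```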
[20:33:43] YuviPanda: ya - nothing will die because of it anyway [20:33:51] well [20:33:57] prod puppetmaster could theoretically die :D [20:34:03] if it's stuck on submodule stuff [20:34:10] lol that i dunno [20:34:18] wikimetrics wont die [20:34:22] k [20:34:48] madhuvishy: needs manual rebase [20:35:18] YuviPanda: i can try doing that [20:35:29] ok, update on EL stuff: at least tables are now being iterated through [20:35:32] 1000 rows at a time [20:35:43] it still takes way too long to do this though [20:36:25] YuviPanda: rebased and pushed [20:37:18] (CR) Milimetric: [C: 2 V: 2] Fabric deployment setup for wikimetrics [analytics/wikimetrics-deploy] - https://gerrit.wikimedia.org/r/261579 (https://phabricator.wikimedia.org/T122228) (owner: Madhuvishy) [20:37:36] looks great to me, Madhu [20:37:42] nice and clean [20:37:44] meh [20:37:48] it broke puppetmaster on prod [20:37:49] let me clean up [20:38:03] milimetric: thanks :) [20:38:06] YuviPanda: awww [20:41:53] milimetric: ok i'll setup a prod server with the role once YuviPanda fixes puppetmaster. we should load the backup from current prod into labsdb sometime [20:42:17] I think [20:42:18] we've to revert [20:42:21] and do it in two steps [20:42:24] as the wise ottomata suggested [20:42:25] YuviPanda: okay [20:42:29] me toooo [20:42:35] as everyone except me suggested [20:42:39] i'll make a separate patch [20:42:49] git does not take lightly to hubris [20:42:55] he he [20:43:05] you have to revert though [20:43:32] done [20:52:29] YuviPanda: i'm a little unsure of how to resubmit the old patch [20:52:39] if i rebase, it reverts everything [20:52:43] hmm [20:52:44] good question [20:52:59] i can make a new patch [20:53:12] madhuvishy: I think you can revert the revert [20:53:15] madhuvishy: with gerrit [20:53:18] oho [20:53:22] then checkout locally [20:53:25] and then do [20:53:30] git reset HEAD^ [20:53:33] and then add them back? [20:53:35] that will also have submodule remove stuff, that's okay? 
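On the "revert the revert" question above: in Gerrit this is done by pressing Revert on the revert commit itself; the plain-git equivalent, sketched in a throwaway repo (file names and messages are illustrative):

```shell
set -e
repo=$(mktemp -d)
cd "$repo"
git init -q
git -c user.email=a@b -c user.name=t commit -q --allow-empty -m "init"
echo "fabric setup" > deploy.py
git add deploy.py
git -c user.email=a@b -c user.name=t commit -q -m "add deploy"
# The revert: deploy.py disappears from the working tree
git -c user.email=a@b -c user.name=t revert --no-edit HEAD
# Reverting the revert: the original change comes back as a new commit
git -c user.email=a@b -c user.name=t revert --no-edit HEAD
test -f deploy.py && echo "change restored"
```

After this, resubmitting the old patch is just pushing the revert-of-the-revert for review, which avoids the "rebase reverts everything" confusion mentioned in the chat.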
[20:53:50] hmm [20:53:52] good point [20:53:54] idk [20:53:56] try reset HEAD^ [20:53:58] see what happen [20:54:00] s [20:54:30] i can't revert the revert [20:54:41] if there's even such a thing [20:54:56] no revert button [20:54:58] ? [20:55:12] there is [20:55:38] but that's not for reverting the revert no [21:03:56] milimetric: do you think we can backup the db and deploy to new prod today? [21:04:06] i would need your help in restoring the backup [21:04:18] because YuviPanda will soon break our prod [21:04:29] well, not necessarily [21:04:33] yeah [21:04:47] if you stop salt-minion in your instances salt won't hit them [21:04:51] but the solution is to stop puppet runs on the instance all together [21:04:58] no [21:05:00] salt [21:05:00] madhuvishy: sure, I'm ok with that. [21:05:03] puppet is already broken there I think [21:05:06] right [21:05:10] salt is just our remote command execution thingy [21:05:13] do we still have access to NFS from prod, I forget? [21:06:07] milimetric: i think so. we can access from anywhere on the analytics project right? [21:09:47] YuviPanda: do you have to first remove the submodule everywhere before merging? [21:10:04] merging new module [21:10:11] madhuvishy: yeah [21:10:19] madhuvishy: we removed the submodule already, problem is that the files are left behind [21:10:27] yeah [21:11:18] YuviPanda: i think you should go ahead - i'll set up the new prod instance meanwhile [21:11:30] madhuvishy: ok [21:11:34] we're planning in -labs [21:13:00] madhuvishy: so I'll send a note to wikimetrics-l and back up the db to NFS [21:13:14] milimetric: awesome, thanks [21:13:38] so wait, we're not expecting anything crazy to go wrong, right? [21:13:46] like, we're just backing up the db to be safe [21:13:57] because there are also the files to back up otherwise [21:14:12] milimetric: which files? [21:14:28] oh the reports [21:14:31] the report results [21:14:36] we should move them over too right? 
[21:15:45] milimetric: as far as I understand, nothing will happen to the prod instance. only the puppet folder will get deleted [21:15:58] YuviPanda: ^ [21:16:15] well [21:16:18] the wikimetrics module will [21:16:24] yeah [21:16:26] but I'm going to skip the current wikimetrics instances [21:16:26] that's fine [21:16:28] when doing it [21:16:32] okay [21:16:33] just to give you guys more breathing room [21:16:42] thanks [21:19:05] madhuvishy: ok, so the backup on that was running normally, except the redis file (which was broken anyway) [21:19:14] the last backup is in /data/project/wikimetrics/backup/wikimetrics1/hourly [21:19:18] and they're saved daily also [21:19:21] so we're good [21:20:09] madhuvishy: let me know when you're done running puppet so I can clear the crontab [21:21:40] milimetric: okay - waiting on YuviPanda to re-merge our new module [21:22:04] i will setup staging once again non self-hosted to make sure, and then prod [21:27:22] milimetric: what are the three endpoints you are referring to re the pagestats tool? [21:27:51] ooh, I should clarify [21:28:24] I'll wait for it then, milimetric. thanks! :-) [21:31:38] k, sent [21:32:27] * elukey wants documentation about how to deploy projects like wikimetrics on labs/prod [21:36:08] elukey: we can do it together over batcave if you want [21:36:15] after timo's talk [21:36:34] a-team, see you tomorrow, good luck with the db! [21:37:01] mforns: o/ [21:37:23] madhuvishy: let's do it tomorrow if you are in the office! [21:38:24] a-team: please check your ssh clients :) [21:38:34] Ref: [Wmfall] Update your openssh clients [21:39:12] homebrew has the new openssh version for the mac users (brew update && brew upgrade) [21:46:08] thanks elukey, I added UseRoaming no [22:11:12] YuviPanda: you haven't merged yet right? 
[22:11:16] madhuvishy: no [22:11:28] okay [22:11:29] madhuvishy: and don't wait on me, since I'm excluding your instances from all of this [22:11:52] YuviPanda: no, i want to setup a staging instance based on the merged module, and then prod [22:12:00] aaah [22:12:02] right [22:12:04] ok [22:12:06] makes sense [22:12:15] madhuvishy: I'll be able to merge in <10min, waiting for another run to complete [22:12:28] ya np, just let me know whenever [22:18:33] madhuvishy: so we're restoring the database to labsdb then? [22:18:40] milimetric: yeah [22:19:01] we'll need someone to create the db I think, not sure our users have that right [22:19:15] we do - if you clone wikimetrics-deploy with submodules, you should have the db name, host, creds in secrets/private/production [22:19:46] milimetric: they made it such that you can create dbs with names that are like labsdbuser__dbname [22:20:18] it'll get created as part of the initialize_server from fab [22:20:25] oh ok, I was thinking it would still be called wikimetrics, but doesn't have to be [22:20:27] and then we can restore [22:20:33] yeah [22:20:41] what'd you end up doing with the backup processes? [22:20:54] it doesn't exist anymore in puppet [22:20:54] is that still puppetized or [22:20:57] oh ok [22:21:04] so we'd just manually back up? [22:21:06] labsdb has its own backups [22:21:12] we dont have to do anything [22:21:21] well, the files and redis files [22:21:35] ah, are they backed up on nfs too [22:21:35] not that we ever used those [22:21:45] hmmm i didn't think of that [22:21:52] ok [22:22:07] we don't have nfs - and i'm not sure we should enable it for this [22:22:46] YuviPanda: it seems we have some persistent files that are also being backed up currently on nfs [22:23:59] milimetric: do you think we should figure out a way to back those up, or do those manually? [22:24:19] i dont know where we'd put the backups though [22:24:20] don't think the redis stuff needs to be actually backed up, right?
since it's just in-flight requests [22:24:25] how big are the files? [22:24:38] there are just a ton of files, and it's not the redis ones [22:24:46] it's kind of dumb [22:25:00] and complicated [22:25:20] quite a bit of work went into the backup scripts [22:25:35] and I don't think anyone really needs those files to be safe except for us as part of vital signs [22:25:59] and we really have to move that to another system anyway, 'cause the reports fail half the time [22:26:11] milimetric: hmmm [22:26:20] in my opinion, we don't need to back them up, we can just keep the current backups we have and go from there [22:26:34] ok lets do that then [22:26:41] sooo [22:26:44] I'll merge the puppet patch? [22:26:48] but we still have to copy them over before we change the web proxy [22:27:06] so as long as wikimetrics1 stays online and keeps NFS access, we're ok [22:27:12] yeah [22:27:15] milimetric: okay cool [22:27:21] YuviPanda: yeah i think you can merge [22:27:25] so you can just agent forward and scp it from old host to new host [22:27:31] or scp it through your local [22:27:51] i was never able to scp with agent forwarding [22:27:56] (-A you mean right?) [22:28:05] yeah [22:28:10] you agent forward and ssh to bastion [22:28:14] then scp from X to bastion [22:28:17] then from bastion to Y [22:28:19] that should work [22:28:23] oh, ok [22:28:26] even if X <-> Y directly doesn't [22:28:30] madhuvishy: merged [22:28:58] YuviPanda: thanks [22:29:15] I'll take down staging and set it up again [22:29:20] can I log on labs? [22:29:30] madhuvishy: sure! [22:29:34] !log wikimetrics [22:29:36] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log, Master [22:29:48] heh [22:29:50] ok [22:29:55] YuviPanda: Tsk. :-) [22:36:52] madhuvishy: what's this thing called again? I can't login to wikimetrics-01 or wikimetrics01 [22:37:11] milimetric: the new one? 
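An alternative to the two-hop copy described above (scp from X to bastion, then bastion to Y): an OpenSSH client config can tunnel the connection through the bastion so scp appears to go directly to the labs host. The host names below are placeholders, not the real labs bastion configuration:

```
# Hypothetical ~/.ssh/config fragment
Host *.eqiad.wmflabs
    # 'ssh -W' forwards the connection through the bastion (OpenSSH >= 5.4);
    # clients with OpenSSH >= 7.3 can write 'ProxyJump bastion.example.org'
    # instead.
    ProxyCommand ssh -W %h:%p bastion.example.org
    ForwardAgent yes
```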
[22:37:26] yes [22:37:42] milimetric: no prod instance yet - [22:37:52] staging instance is wikimetrics-staging.wikimetrics [22:37:59] it's in a different project [22:38:01] oh :) [22:38:01] you have access [22:38:17] makes sense that I can't connect to a nonexistent thing then [22:38:31] mmm, I might have to go shopping for a couple hours soon [22:39:34] milimetric: no problem [22:39:45] our prod is still up [22:39:57] right, 'course but the email I sent I probably shouldn't have sent [22:40:07] right [22:40:19] so what's left to do [22:44:15] milimetric: i'm setting up staging again [22:46:12] Analytics, Analytics-Cluster: https://yarn.wikimedia.org/cluster/scheduler should be behind ldap - https://phabricator.wikimedia.org/T116192#1935684 (Tbayer) Right now I'm getting the following error at https://yarn.wikimedia.org/cluster/scheduler : //Error: 404, Public access disabled. See https://wiki... [22:46:28] madhuvishy: ok, let me know where you leave off and I'll try to finish up if I can [22:46:45] milimetric: okay [22:46:57] milimetric: nuria thanks for prioritizing this and moving it off [22:47:03] madhuvishy: ^ [22:47:38] limn1 is still in bad shape though :) is almost unrecoverable now, and become more so with every day. [22:50:51] YuviPanda: i messed up the paths a little bit it seems [22:51:09] the wsgi file is looking for config files in /srv/wikimetrics [22:51:34] oh [22:51:43] madhuvishy: make additional patch? I'll just merge [22:51:54] when it's actually inside /srv/wikimetrics/config - which has the checked out wikimetrics-deploy - inside which there's a config folder [22:52:01] why does this seem stupid only now [22:52:19] paths are always stupid [22:52:24] Analytics, Analytics-Cluster: https://yarn.wikimedia.org/cluster/scheduler should be behind ldap - https://phabricator.wikimedia.org/T116192#1935708 (Ottomata) Earlier today I sent the following email to the analytics mailing list. 
**Public YARN ResourceManager HTTP UI disabled** Hi all, Due to a rece... [22:53:00] madhuvishy: the code for scap's source deployment is at: [22:53:02] root@tin:/srv/deployment/scap/scap/scap# ls [22:53:19] YuviPanda: okay i'll patch both fabric and the wsgi file - will call the config folder in wikimetrics-deploy as config_templates [22:53:24] which is what it really is [22:53:25] ok! [22:55:46] (PS1) Madhuvishy: Fix the config paths in wsgi file again [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/264202 [23:01:42] HaeB: (are you tbayer?) [23:02:10] Analytics: Restore MobileWebSectionUsage_14321266 and MobileWebSectionUsage_15038458 - https://phabricator.wikimedia.org/T123595#1935731 (Ottomata) [23:03:13] (PS1) Madhuvishy: Move config template yaml files to config_templates dir [analytics/wikimetrics-deploy] - https://gerrit.wikimedia.org/r/264204 [23:03:34] ottomata: yes he is :) [23:04:13] Analytics, Analytics-Cluster: https://yarn.wikimedia.org/cluster/scheduler should be behind ldap - https://phabricator.wikimedia.org/T116192#1935736 (Ottomata) @Tbayer, the ssh command works for me, buuuuuut hm. What bastion do you usually use to connect to stat1002? You might even be able to replace... [23:04:14] YuviPanda: I'm working on some logic to fix limn1 as well, no worries [23:04:22] ok, now I go grocery shopping for realz, bbl [23:04:54] milimetric: :D [23:06:44] YuviPanda: added you to merge these two patches - i think that should fix it [23:07:32] madhuvishy: they're both wikimetrics repos right? 
[23:07:42] I can merge if that's ok with rest of a-team :D [23:07:44] yup i can self merge too i think [23:07:52] ah [23:07:54] ok [23:07:59] cool then self merge :D [23:08:05] they are minor, yup okay [23:08:19] Analytics, Research-and-Data: Historical analysis of edit productivity for English Wikipedia - https://phabricator.wikimedia.org/T99172#1935743 (Halfak) I've started trying to get this data loaded onto the altiscale Research Cluster so that I can use HIVE to query it. I'll be working on ways to flag bots... [23:08:23] (CR) Madhuvishy: [C: 2 V: 2] "Self merging - minor fix" [analytics/wikimetrics-deploy] - https://gerrit.wikimedia.org/r/264204 (owner: Madhuvishy) [23:08:46] (CR) Madhuvishy: [C: 2 V: 2] "Self merging, minor fix" [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/264202 (owner: Madhuvishy) [23:12:09] YuviPanda: gah i missed one place [23:12:38] :D [23:16:48] (PS1) Madhuvishy: Fix path for local_config_dir [analytics/wikimetrics-deploy] - https://gerrit.wikimedia.org/r/264207 [23:17:27] YuviPanda: anyway, it was a good idea to redo staging [23:18:14] yeah [23:18:16] caught these things [23:18:31] yup [23:19:06] * YuviPanda goes to stat1002 this time to check his events [23:20:34] YuviPanda: :) should I give the prod instance more RAM and CPUs? [23:20:51] madhuvishy: medium or large should do... [23:20:57] madhuvishy: what's the current one? medium? [23:20:57] okay [23:21:02] madhuvishy: also these are debian right? [23:21:04] not ubuntu? [23:21:12] YuviPanda: new ones debian [23:21:17] cooool [23:21:47] 8.2 [23:21:49] shinyy [23:21:53] :D [23:22:00] YuviPanda: old wikimetrics prod instance was large [23:22:06] ah [23:22:08] yeah then just use large [23:22:37] ok [23:27:45] hmm [23:27:52] we're producing 70k events per day [23:27:54] approximately [23:28:01] I suppose that's fine?
[23:28:14] that's less than 1 per second [23:28:16] so should be fine [23:48:44] (PS2) Madhuvishy: Make db root user configurable for different environments [analytics/wikimetrics-deploy] - https://gerrit.wikimedia.org/r/264207 [23:49:16] Analytics, Research consulting, Research-and-Data: Update official Wikimedia press kit with accurate numbers - https://phabricator.wikimedia.org/T117221#1935826 (DarTar) @ezachte this hasn't seen any update in a while, is there a status update or shall we close this? [23:50:02] Analytics-General-or-Unknown, Community-Advocacy, Wikimedia-Extension-setup, Wikipedia-iOS-App-Product-Backlog: enable Piwik on ru.wikimedia.org - https://phabricator.wikimedia.org/T91963#1935829 (Krenair) See {T116308}
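The back-of-envelope check above (~70k events/day being well under one per second) works out like this:

```shell
# 70,000 events per day spread over 86,400 seconds in a day
awk 'BEGIN { printf "%.2f events/sec\n", 70000 / 86400 }'
# prints: 0.81 events/sec
```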