[00:00:11] 10Analytics, 10EventBus, 06Services (done): Create mediawiki.page-restrictions-change event - https://phabricator.wikimedia.org/T160942#3138967 (10mobrovac) Merged. To be deployed on the next MW train. [00:35:14] 06Analytics-Kanban: Pageview Spike in Tagalog Wikipedia mid-June 2016 - https://phabricator.wikimedia.org/T144635#3139028 (10Nuria) a:03Nuria [00:46:50] 06Analytics-Kanban: Pageview Spike in Tagalog Wikipedia mid-June 2016 - https://phabricator.wikimedia.org/T144635#3139048 (10Nuria) {F7065290} {F7065293} [00:48:08] 06Analytics-Kanban: Pageview Spike in Tagalog Wikipedia mid-June 2016 - https://phabricator.wikimedia.org/T144635#3139051 (10Nuria) {F7065290} {F7065293} [00:50:47] 06Analytics-Kanban: Pageview Spike in Tagalog Wikipedia mid-June 2016 - https://phabricator.wikimedia.org/T144635#3139054 (10Nuria) [00:55:52] 06Analytics-Kanban: Pageview Spike in Tagalog Wikipedia mid-June 2016 - https://phabricator.wikimedia.org/T144635#3139088 (10Nuria) Will be closing ticket as I cannot really find anything that might point to a bug, from my brief analysis traffic seems organic. [03:47:00] 10Analytics, 10MediaWiki-extensions-WikimediaEvents, 10The-Wikipedia-Library, 10Wikimedia-General-or-Unknown, and 4 others: Implement Schema:ExternalLinksChange - https://phabricator.wikimedia.org/T115119#3139172 (10Beetstra) @Samwalton9, what do you mean with 'at some point'? Do you mean that this has an... [06:38:18] (03PS3) 10Mforns: Use both projectcounts raw and all sites to load cassandra [analytics/refinery] - 10https://gerrit.wikimedia.org/r/345144 (https://phabricator.wikimedia.org/T161494) [06:49:39] heloooo [07:54:12] 10Analytics-Tech-community-metrics: Maniphest statistics show implausible numbers of submitted reports - https://phabricator.wikimedia.org/T161682#3139486 (10Nemo_bis) [07:57:40] 10Analytics-Tech-community-metrics, 06Developer-Relations (Jan-Mar-2017): Deployment of Maniphest panel - https://phabricator.wikimedia.org/T138002#3139504 (10Nemo_bis) I'd have multiple feature requests depending on what's the expected purpose of this statistical tool (I think the main purpose should be to co... [09:18:49] Hi there. I have started working as a Data Analyst for WMDE. Does anyone here know, by any chance, do we have RStudio Server running on stat1002 or stat1003? Thx. [09:35:22] GoranSM: no, doesn't seem so [09:36:57] moritzm: Thanks. So, one question, just to check if get the workflow correctly: we mainly use production machines like stat1002/1003 to gather data from the various sources that are hosted there, then move the data to some other place (Labs, for example, or our own locals) to analyze there? [09:40:45] no, I'm not using R on those machines, better wait until someone else is around [09:41:01] moritzm: Thanks. [09:44:14] 10Analytics-Tech-community-metrics: Maniphest statistics show implausible numbers of submitted reports - https://phabricator.wikimedia.org/T161682#3139639 (10Aklapper) [09:46:06] 10Analytics-Tech-community-metrics: Maniphest statistics show implausible numbers of submitted reports - https://phabricator.wikimedia.org/T161682#3139486 (10Aklapper) 05Open>03Invalid For future reference, please use [[ https://www.mediawiki.org/wiki/Phabricator/Help#Formatting | table markup ]] for tables... [09:52:54] 10Analytics, 10MediaWiki-extensions-WikimediaEvents, 10The-Wikipedia-Library, 10Wikimedia-General-or-Unknown, and 4 others: Implement Schema:ExternalLinksChange - https://phabricator.wikimedia.org/T115119#3139666 (10Samwalton9) @Beetstra Good question. @Milimetric thinks that if/when this is working correc... [10:05:57] GoranSM: hi! I am not super familiar with R but I can speak for the stat machines.. [10:07:04] elukey: Support, please: I am trying to connect to Jupyter notebooks with ssh -N notebook1001.eqiad.wmnet -L 8000:127.0.0.1:8000 as explained on https://meta.wikimedia.org/wiki/Discovery/Analytics#Analysis_with_R. I should be able to do this with my LDAP (Wikitech credentials); however, all that I get is username/password incorrect. [10:08:23] GoranSM: you'd need to have a shell account on notebook1001.eqiad.wmnet, LDAP credentials are valid for Wikitech and some other web interfaces only [10:08:57] there is a process for this that should be part of your onboarding [10:09:47] (also there is a bit of paperwork to do for NDAs between WMDE and WMF to protect sensitive data that we store) [10:10:44] matthiasmullie: I have LDAP confirmed, NDA signed, and access to stat 1002 and 1003 already. [10:11:57] ahhh nice! Didn't know it [10:12:00] elukey: Goran has signed an NDA with Legal [10:12:08] elukey: Will a Phabricator ticket be enough to ask for a shell account on notebook1001.eqiad.wmnet? [10:12:24] GoranSM: let me check what access is needed [10:12:31] elukey: Thanks a lot. [10:13:37] elukey: researchers or statistics-privatedate-users [10:13:53] there you go, but I can see your username on it [10:14:16] ah yes you already are in researchers [10:14:20] elukey: I am in researchers, yes [10:14:58] GoranSM: does a standard SSH login to notebook1001.eqiad.wmnet work for you? (it should) [10:15:40] elukey: Again, to make my question simpler: (A) My SSH config is set to acess stat 1002 and 1003 via bastion host; (B) I do ssh -N notebook1001.eqiad.wmnet -L 8000:127.0.0.1:8000 as advised on the Wiki; (C) I open localhost:8000 in my browser and enter my Wikitech credentials; output == incorrect username password. [10:16:23] elukey: Yes the standard shh to notebook1001.eqiad.wmnet works, I'm there right now [10:17:13] GoranSM: ahhh sorry I thought that the problem was during the ssh connection [10:18:03] elukey: Well the problem was during during the ssh connection: once I do ssh -N notebook1001.eqiad.wmnet -L 8000:127.0.0.1:8000, I cannot login to Jupyter notebook from localhost:8000 [10:18:34] elukey: I want to say, the SSH tunneling is opened while I'm trying to connect to localhost:8000 for Jupyter, as it should be opened [10:20:18] I get login failures too with my credentials, so probably a specific ldap group is needed? I am in the middle of something but I'll get back to you in a bit! [10:20:32] (ops work pending sorry0 [10:20:45] elukey [10:20:59] elukey: (sorry for a typo) Thanks :) [10:40:11] joal hi :] yt? [10:44:08] 10Analytics-Tech-community-metrics: Maniphest statistics show implausible numbers of submitted reports - https://phabricator.wikimedia.org/T161682#3139791 (10Nemo_bis) 05Invalid>03Open > Your Phab query is not restricted to the last two years. Indeed, and a superset of tasks can't have less tasks. Yet 262 <... [10:44:59] 10Analytics-Tech-community-metrics: Maniphest statistics show implausible numbers of submitted reports - https://phabricator.wikimedia.org/T161682#3139795 (10Nemo_bis) [10:58:53] GoranSM: just managed to get in, have you tried with your Wikitech username lowercase? [10:59:08] No let me try [11:00:36] elukey: no change, still incorrect [11:00:44] Invalid username or password [11:01:22] afaics it needs wikitech credentials.. I went in with 'elukey' and my wikitech pass [11:02:20] elukey: I am trying with my wikitech username/password [11:02:56] elukey: let me try to change my wikitech password and then re-try [11:08:37] elukey: Nothing. Changed password, tried with the new password, username normal and username lower case, nothing. Invalid password username. [11:09:19] GoranSM: super weird, will ask to other people in analytics when they'll be online [11:09:46] elukey: Thanks. This is very important for my work. Can I ask you another (much simpler) question? [11:10:46] sure [11:11:48] Here it goes: when at production, say stat1002 or stat stat1004, do we have any tools like Jupyter, RStudio and similar, or we generally access Hadoop, mySQL and similar via konsole and scripts written out in vim and similar? [11:12:57] elukey: I simply want to understand the organizational culture and the usual workflow here. If people do not use such tools to access data and process on production, neither will I do - I don't want to complicate others' (read: sysadmins') lives just in order to be able to use GUIs for dev/analytics. [11:17:00] GoranSM: from what I know it is (mostly) the second one that you mentioned, with some efforts like notebooks (and labs tools) to overcome the console limitation [11:25:26] elukey: Ok, thanks. [11:26:07] elukey: Thank you for all the support that you have provided. Please, when someone who could know how to resolve this access issue turns online, let me know. Thanks again. [11:26:47] GoranSM: sure! I'll try to get back to you with an answer today [11:27:12] elukey: Thank you so much! [11:39:05] * elukey lunch! [11:46:09] 10Analytics-Tech-community-metrics: Maniphest statistics show implausible numbers of submitted reports - https://phabricator.wikimedia.org/T161682#3139486 (10Albertinisg) This is a clear bug. I've been digging into it and I've found we have an issue with the `[[mw:User:This, that and the other]` identity. Rega... [11:58:08] (03PS4) 10Mforns: Use both projectcounts raw and all sites to load cassandra [analytics/refinery] - 10https://gerrit.wikimedia.org/r/345144 (https://phabricator.wikimedia.org/T161494) [12:14:05] Hi mforns ! [12:14:10] Excuse me I missed your ping ! [12:17:24] joal, np :] [12:17:42] I wanted to present you with a riddle, but I think I got it [12:18:14] Ah ! What is it? [12:18:36] the projectcounts oozie hql query was wrong [12:18:51] the resulting data for the mobile-site slice was veeery weird [12:19:09] at least at first glance, then actually it made sense [12:19:28] if you want I can explain more, but I'd need to share screen [12:19:50] sure mforns, let's batcave [12:19:55] k! [12:20:27] joal, having problems with hangouts... [12:39:56] 06Analytics-Kanban, 06Operations, 06WMDE-Analytics-Engineering, 13Patch-For-Review, 15User-Addshore: /a/mw-log/archive/api on stat1002 no longer being populated - https://phabricator.wikimedia.org/T160888#3140077 (10elukey) @Addshore: I am going to close this task but we might want to open another one to... [12:40:21] 06Analytics-Kanban, 06Operations, 06WMDE-Analytics-Engineering, 13Patch-For-Review, 15User-Addshore: /a/mw-log/archive/api on stat1002 no longer being populated - https://phabricator.wikimedia.org/T160888#3113734 (10elukey) a:03elukey [12:40:52] 10Analytics-Cluster, 06Analytics-Kanban, 06Operations: Reinstall Analytics Hadoop Cluster with Debian Jessie - https://phabricator.wikimedia.org/T157807#3140081 (10elukey) a:03elukey [12:41:09] 10Analytics-Cluster, 06Analytics-Kanban, 06Operations: Reinstall Analytics Hadoop Cluster with Debian Jessie - https://phabricator.wikimedia.org/T157807#3017036 (10elukey) a:05elukey>03None [12:41:37] 06Analytics-Kanban, 06Operations, 15User-Elukey: Reimage all the Hadoop worker nodes to Debian Jessie - https://phabricator.wikimedia.org/T160333#3140084 (10elukey) [12:42:04] joal: reimaging an1045 to debian :) [12:44:51] k elukey :) [12:45:25] elukey: so that you know - We've been discussing with mforns, and some data reloading will happen on pagecounts keyspace [12:46:20] elukey: Given that after that, it shouldn't be touched before some time (no regular insertion), it'd be good to do a keysapce clean-up [12:51:29] sure [12:51:56] elukey: that keysapce is not huge, so ti shouldn't be that long [12:53:02] elukey, joal, for now doing some loading to the test keyspace still, will ping you later when I need your help elukey :] [12:53:32] sure :) [12:54:54] stopped hadoop daemons on 1045, some jobs might fail [13:03:35] (03PS1) 10Joal: [WIP] Update sqoop job to better handle failures [analytics/refinery] - 10https://gerrit.wikimedia.org/r/345327 [13:04:05] 10Analytics-Cluster, 06Analytics-Kanban, 06Operations, 13Patch-For-Review, 15User-Elukey: Reimage a Trusty Hadoop worker to Debian jessie - https://phabricator.wikimedia.org/T159530#3140160 (10ops-monitoring-bot) Script wmf_auto_reimage was launched by elukey on neodymium.eqiad.wmnet for hosts: ``` ['ana... [13:10:40] 10Analytics-Tech-community-metrics: Maniphest statistics show implausible numbers of submitted reports - https://phabricator.wikimedia.org/T161682#3140185 (10Aklapper) @Albertinisg: You're fast. :) I hadn't reloaded the page to see your comment before adding mine... [13:12:21] o/ [13:12:31] hey joal & milimetric [13:12:41] no live systems stuff from me today [13:12:52] Hi halfak [13:13:07] I have a question for you on scikit-learn, but that'll be all [13:13:15] halfak: I can do with IRC [13:13:23] sure! [13:13:44] halfak: Trying to make sklearn with spark, we are encountering dependencies issues [13:17:12] halfak: It'd be easier for ottomata if we could use scikit-learn 0.18 instead of 0.17 [13:17:21] halfak: Do you think it would be a big change? [13:17:25] No problem [13:18:13] halfak: So I could myself change the deps in requirement.txt? [13:18:41] joal, not sure which requirements.txt you are referring to. [13:19:13] the one in revscoring forlder, sorry halfak [13:20:03] joal, OK. So I think that everything will just work, but we might need to rebuild the models (which is not too much trouble) [13:20:24] halfak: ok, awesome :) [13:20:48] halfak: It should facilitate us being to test revscoring in spark :) [13:21:24] I can rebuild a model with sklearn 0.18 if that'll help [13:22:36] halfak: I don't need that yet, I'll for sure ask when I will; :) [13:23:24] ottomata: in case you haven't seen: We can try with scikit-learn 0.18 :) [13:23:38] oh i'm reading :) [13:23:39] haha [13:23:41] hi! [13:23:48] cool! [13:23:53] ok i'll try to backport [13:23:55] should work [13:23:58] Hi (sorry, even forgetting to be polite by being happy) [13:25:30] \o/ [13:25:31] :) [13:28:35] 10Analytics-Cluster, 06Analytics-Kanban, 06Operations, 13Patch-For-Review, 15User-Elukey: Reimage a Trusty Hadoop worker to Debian jessie - https://phabricator.wikimedia.org/T159530#3140234 (10ops-monitoring-bot) Completed auto-reimage of hosts: ``` ['analytics1045.eqiad.wmnet'] ``` and were **ALL** suc... [13:37:01] 10Analytics-Tech-community-metrics: Maniphest statistics show implausible numbers of submitted reports - https://phabricator.wikimedia.org/T161682#3140245 (10Nemo_bis) I only checked TTO so far. Feel free to merge, I can reopen if I find other suspicious numbers. [13:38:42] 10Analytics-Tech-community-metrics, 06Developer-Relations (Jan-Mar-2017): Deployment of Maniphest panel - https://phabricator.wikimedia.org/T138002#3140257 (10Aklapper) >>! In T138002#3139504, @Nemo_bis wrote: > I'd have multiple feature requests depending on what's the expected purpose of this statistical too... [13:40:02] 1045 is already up and running, fixing perms at the momnet [13:40:04] *moment [13:40:10] should be up and running before standup [13:42:11] nie [13:42:13] nice [13:48:19] ottomata,joal - I'd need a bit of time to complete my work on mediawiki ssl certs (now it is time for the eqiad ones), do you mind if I skip the ops meeting? [13:49:10] elukey: we can skip it this week i think if you like, i want you to show me this auto-reimage script one day though...:) [13:49:20] also I got a Q for you about RAID in new Kafka brokers, wondering what you think [13:50:55] ah yes I saw it but didn't follow up sorry! :( [13:51:00] no hurry! [13:51:18] joal: looks like this isn't going to be a snap [13:51:23] joblib tests fail when building [13:51:25] grrrr [13:51:36] ottomata: I added https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Hadoop/Administration#Worker_Reimage_.2812_disk.2C_2_flex_bay_drives_-_analytics1028-analytics1057.29 but I [13:51:42] I'll finish it today [13:51:50] should be good after that :) [13:52:20] oo [13:52:45] right elukey i'm curious though, you have some auto-reimage script, no? [13:53:12] ahhh ok! Yes it is Riccardo's magic, available on neodymium [13:56:08] ha! amazing! [13:56:10] haven't used that [13:56:13] pretty cool [13:56:28] elukey: are you telling me that you can reimage these hosts without logging into mgmt console? [13:57:51] ottomata: oh yes! [13:57:59] and it logs in the task [13:58:02] reboots the host [13:58:03] etc.. [13:58:54] amazing [14:00:56] (03CR) 10Hashar: "recheck" [analytics/wikistats] - 10https://gerrit.wikimedia.org/r/337452 (owner: 10Milimetric) [14:01:34] joal, mforns [14:01:35] PROBLEM - aqs endpoints health on aqs1004 is CRITICAL: /legacy/pagecounts/aggregate/{project}/{access-site}/{granularity}/{start}/{end} (Get pagecounts) is CRITICAL: Test Get pagecounts returned the unexpected status 404 (expecting: 200) [14:01:37] (03CR) 10Hashar: "Did a recheck to verify the job pass all fine (for T97514). And it does!" [analytics/wikistats] - 10https://gerrit.wikimedia.org/r/337452 (owner: 10Milimetric) [14:01:44] a lot of alarms like these on the ops chan [14:01:47] elukey, ops [14:02:19] elukey, I truncated the table [14:02:19] mforns: Have you truncated the table? [14:02:24] yes [14:02:28] mforns: please don't do that :) [14:02:41] I have to add the monitoring data [14:02:46] mforns: correct [14:02:49] k [14:03:01] mforns: I thought we had talked about relaoding, not truncating :) [14:03:23] mforns: in case of data deletion (truncate), no need to cleanup keyspace [14:03:30] joal, I thought the opposite, sorry :[ [14:03:50] mforns: no problem, let's correct that :) [14:03:54] I just added the monitoring check fake data [14:04:02] alarms should stop in theory now [14:04:29] mforns: aqs recovered :) [14:04:34] my bad! [14:16:10] ottomata: on neodym - sudo -E wmf-auto-reimage analytics1046.eqiad.wmnet -p T160333 [14:16:11] T160333: Reimage all the Hadoop worker nodes to Debian Jessie - https://phabricator.wikimedia.org/T160333 [14:16:13] that's it [14:16:37] it takes care of icinga, updating phab, rebooting the host in the end, etc.. [14:18:02] amaing [14:46:15] ottomata: a minute in ops meeting for python deps ? [14:46:51] k [14:53:16] a-team: still working on the ssl cert update in eqiad, will probably be a bit late to the standup.. if I don't make it I'll send e-scrum.. [14:53:26] ok [14:57:11] 10Analytics, 10MediaWiki-extensions-WikimediaEvents, 10The-Wikipedia-Library, 10Wikimedia-General-or-Unknown, and 4 others: Implement Schema:ExternalLinksChange - https://phabricator.wikimedia.org/T115119#3140478 (10Nuria) @Beestra: @samwalton is looking at data in a database replica the data does not appe... [15:00:24] a-tem: staddduppppp [15:01:08] milimetric, fdans : staddduppp [15:06:31] 10Analytics-Cluster, 06Analytics-Kanban: Provision new Kafka clusters in eqiad and codfw with security features - https://phabricator.wikimedia.org/T152015#3140502 (10Ottomata) [15:07:18] 10Analytics-Cluster, 06Analytics-Kanban: Hadoop cluster expansion. Add Nodes - https://phabricator.wikimedia.org/T152713#3140507 (10Ottomata) [15:20:44] fdans, I have no task, I thought maybe you'd want someone to pair with you? I'd be happy to do so :] [15:21:22] mforns: sure! batcave-2? [15:21:26] k! [15:22:40] mforns: do you have any old screenshots of ash's visual design of the topic explorer [15:22:55] it's not in her latest and I just noticed [15:23:04] milimetric, mmm looking [15:23:17] I remember it was basic, just headings of each area followed by the questions for that area, right? [15:24:15] oh yea, it's only in the wireframes mforns, got it: https://www.dropbox.com/sh/948iggybzgkzrzp/AABVjHg_4MwagVtox0p_L0t0a/Wireframe%20flats%20feedback%20round%201/Research%20Mocks%20Round%201?dl=0&preview=Topic+Selector+with+Subcategories.png [15:24:31] milimetric, OK [15:28:03] 06Analytics-Kanban, 10DBA, 13Patch-For-Review: Change length of userAgent column on EL tables - https://phabricator.wikimedia.org/T160454#3140604 (10Marostegui) Hi! What's the status of this? [15:28:30] an1045 is up and running btw :) [15:30:28] (03PS6) 10Fdans: [wip] Add legacy pageviews metric [analytics/dashiki] - 10https://gerrit.wikimedia.org/r/344114 (https://phabricator.wikimedia.org/T143906) [15:33:35] 06Analytics-Kanban, 10DBA, 13Patch-For-Review: Change length of userAgent column on EL tables - https://phabricator.wikimedia.org/T160454#3140617 (10Ottomata) Happening today! :) [15:40:02] nuria: https://etherpad.wikimedia.org/p/analytics-el-table-rename [15:41:53] ottomata: updated wiki [15:41:59] ottomata: updated etherpad, that is [15:42:54] nuria: do we need a revert of hte EL revert? [15:43:14] for the ua parser thing? [15:43:25] ah no [15:43:29] its already in [15:43:29] i remember [15:43:38] because i deployed it to beta and tested there [15:50:37] ottomata: yes [15:51:25] together with bumping len of varchar [15:51:47] fdans: i think you had a prior dashiki change you need to abandon that had also changes for AQS [15:52:10] nuria: I'm using that one [15:52:25] fdans: ah ok, sounds good [15:52:37] (03Abandoned) 10Nuria: Bump up pageviews.js to version that supports pagecounts [analytics/dashiki] - 10https://gerrit.wikimedia.org/r/345197 (https://phabricator.wikimedia.org/T149358) (owner: 10Nuria) [15:52:49] nuria: fyi I'm pairing with marcel and we've got this working, starting work on multiple wikis now [15:54:02] ottomata: the ony thing i wonder is about that custom replication thing [15:54:08] ottomata: do we need to stop it? [15:57:06] nuria: right, cause it is not a cron, reading.. [15:57:08] uhhhh [15:57:11] how does this script ever work? [15:57:19] it seems to only grab the list of tables to replicate when it starts [15:57:28] how does it know to get a new table if it is created after it starts...???? [15:57:50] ottomata: it must, right? [15:58:04] ottomata: cause new schemas are being added w/o script being restarted [16:00:51] nuria: yeah i guess so [16:01:28] welp, i dunno how that works, but yeah, let's stop [16:01:29] adding [16:02:55] ottomata: be right abck [16:02:57] *back [16:19:47] 10Analytics-Cluster, 06Analytics-Kanban: Provision new Kafka clusters in eqiad and codfw with security features - https://phabricator.wikimedia.org/T152015#3140829 (10elukey) @Ottomata the perfect scenario would be to have a comparative test of performances on production hw but it would be really painful and h... [16:20:46] ottomata: --^ not sure if helpful but I tried to give me opinion :) [16:21:04] thanks elukey [16:21:05] yeah, hard to say [16:37:44] ottomata: also we need to check perf metrics, let's give a holler to performance [16:37:49] ottomata: (added that) [16:38:45] k [16:40:09] ottomata: let's get on batcave while we do this [16:40:22] ja [16:44:19] (03PS2) 10Joal: Update sqoop script to better handle failures [analytics/refinery] - 10https://gerrit.wikimedia.org/r/345327 [16:44:34] ottomata: --^ it has been tested, let me know what you think of it :) [16:45:17] 10Analytics-Tech-community-metrics: Author names that include commata or "and" are split into separate identities in the frontend - https://phabricator.wikimedia.org/T161241#3140991 (10Aklapper) [16:45:19] 10Analytics-Tech-community-metrics: Maniphest statistics show implausible numbers of submitted reports - https://phabricator.wikimedia.org/T161682#3140988 (10Aklapper) [16:46:30] ottomata: let's be on batcave while we do all this no? [16:47:59] nuria: ya [16:48:04] oh you there now? [16:48:05] o [16:48:05] k [16:48:43] omw [16:53:51] 10Analytics: Upgrade pivot - https://phabricator.wikimedia.org/T161725#3141015 (10Nuria) [17:09:07] 10Analytics, 10EventBus, 10Wikimedia-Stream, 13Patch-For-Review: Implement server side filtering (if we should) - https://phabricator.wikimedia.org/T152731#2858381 (10Smalyshev) I'd definitely like to have per-wiki filtering for events. More granular would be nice but not a must. Something like `enedit` ma... [17:13:11] !log deploying eventlogging latetst: 28740773cea545215ea610c8c3e1a3ba36ef5a6a (UA changes) [17:13:13] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [17:19:45] !log restarted EL on eventlog1001 with new changeset and tables renamed [17:19:46] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [17:22:23] 10Analytics, 10EventBus, 10Wikimedia-Stream, 13Patch-For-Review: Implement server side filtering (if we should) - https://phabricator.wikimedia.org/T152731#3141090 (10GWicke) I really think we should consider exposing separate streams, one per wiki, ideally (if we can make the technical stuff work), as par... [17:22:44] team just completed the ssl upgrade work, all fine :) [17:23:06] analytics1045 seems working good, interestingly I noticed DiskChecker$DiskErrorException in there too (gone after a yarn restart) [17:23:20] anyhow, ping me if you see job failures etc. [17:23:28] hopefully there will be none :) [17:23:34] I am heading out for the evening! [17:23:38] talk with you tomorrow! :) [17:23:40] * elukey off [17:24:33] oh interesting elukey. cool laters! [17:26:15] mforns: πŸ‘ŒπŸΌπŸ‘ŒπŸΌπŸ‘ŒπŸΌπŸ‘ŒπŸΌ [17:26:53] 06Analytics-Kanban, 10DBA, 13Patch-For-Review: Change length of userAgent column on EL tables - https://phabricator.wikimedia.org/T160454#3141118 (10Ottomata) Done! https://etherpad.wikimedia.org/p/analytics-el-table-rename [17:26:55] (03CR) 10Nuria: [C: 031] Adding renamed tables to sql union statements [analytics/discovery-stats] - 10https://gerrit.wikimedia.org/r/344049 (https://phabricator.wikimedia.org/T160454) (owner: 10Nuria) [17:27:04] fdans, what is that :] ? [17:27:27] milimetric: someone with +2 needs to merge these changes: https://gerrit.wikimedia.org/r/#/c/344049/1 [17:27:33] cc bearloga [17:27:45] (03CR) 10Nuria: [V: 032 C: 032] Adding renamed tables to sql union statements [analytics/limn-edit-data] - 10https://gerrit.wikimedia.org/r/344054 (https://phabricator.wikimedia.org/T160454) (owner: 10Nuria) [17:27:56] (03CR) 10Nuria: [V: 032 C: 032] Adding renamed tables to sql union statements [analytics/limn-flow-data] - 10https://gerrit.wikimedia.org/r/344055 (https://phabricator.wikimedia.org/T160454) (owner: 10Nuria) [17:28:02] mforns: solved the promises conundrum, data is returning fiiine [17:28:10] working now on merging it and we're set :) [17:28:15] nuria: I certainly don't have +2 on that repo [17:28:51] bearloga: funny, me neither, milimetric must, i have +2 it on all the other ones for reportupdater [17:29:06] (03CR) 10Milimetric: [C: 04-1] Adding renamed tables to sql union statements (031 comment) [analytics/discovery-stats] - 10https://gerrit.wikimedia.org/r/344049 (https://phabricator.wikimedia.org/T160454) (owner: 10Nuria) [17:29:28] I have +2 nuria & bearloga, I'll merge after the small fix (missing a select) [17:29:41] brb though, gotta go grab lunch [17:30:01] milimetric: no, will fix, give me a sec [17:30:27] milimetric: i had not see your comment, sorry, fixing [17:32:20] (03PS2) 10Nuria: Adding renamed tables to sql union statements [analytics/discovery-stats] - 10https://gerrit.wikimedia.org/r/344049 (https://phabricator.wikimedia.org/T160454) [17:32:57] (03CR) 10Nuria: Adding renamed tables to sql union statements (031 comment) [analytics/discovery-stats] - 10https://gerrit.wikimedia.org/r/344049 (https://phabricator.wikimedia.org/T160454) (owner: 10Nuria) [17:33:54] (03CR) 10Milimetric: [V: 032 C: 032] Adding renamed tables to sql union statements [analytics/discovery-stats] - 10https://gerrit.wikimedia.org/r/344049 (https://phabricator.wikimedia.org/T160454) (owner: 10Nuria) [17:36:32] 10Analytics, 10EventBus, 10Wikimedia-Stream, 13Patch-For-Review: Implement server side filtering (if we should) - https://phabricator.wikimedia.org/T152731#3141153 (10Ottomata) > Does it mean I could "reconnect" to the stream at any point? Would it be possible to make it so I could choose that point e.g. b... [17:50:23] heading to cafe... [17:50:47] fdans, sorry had to restart, what was the news? [17:51:49] mforns: problem with promises solved, now merging data :) [17:51:57] woohoo! [17:52:32] fdans, SoS is over, do you want help, otherwise I'm going to log off for today [17:52:52] mforns: I'm going out now to run a couple of errands pre trip [17:52:55] will be back later [17:53:01] ok, ok [17:53:06] let's talk tomorrow mforns :) [17:53:10] so, good luck and see you guys tomorrow! [17:53:22] k :] [18:04:29] 10Analytics-Tech-community-metrics: Git's "Last Attracted Developers" lists established developers and developers without a First Commit Date - https://phabricator.wikimedia.org/T161309#3141264 (10Aklapper) [18:16:54] milimetric: quick question [18:17:01] shoot [18:17:05] milimetric: What should I do of this page: https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Mediawiki_history/Presentation [18:17:27] oh, joal those were just brainstorms, you wanna delete it? [18:17:31] I'm thinking of renaming it to: https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/communication/Presentation_DATE [18:17:42] I'll delete it [18:17:57] as you prefer, I don't mind keeping it [18:18:55] nope, just clutter, forgot about it [18:18:56] gone [18:18:57] thx [18:19:03] thank you :) [18:19:23] milimetric: also, provided a new version of CR for sqoop (tested and all) [18:20:41] cool, will take a look [18:22:57] 10Analytics-Tech-community-metrics: Check if we actually index all code repositories - https://phabricator.wikimedia.org/T161211#3141303 (10Aklapper) https://gerrit.wikimedia.org/r/labs/tools/grrrit is listed in https://github.com/Bitergia/mediawiki-repositories/blob/master/git_repositories.conf so I'd expect ac... [18:25:11] 10Analytics, 06Discovery, 10EventBus, 10MediaWiki-API, and 4 others: Create reliable change stream for specific wiki - https://phabricator.wikimedia.org/T161731#3141307 (10Smalyshev) [18:26:43] 10Analytics, 06Discovery, 10EventBus, 10MediaWiki-API, and 4 others: Create reliable change stream for specific wiki - https://phabricator.wikimedia.org/T161731#3141325 (10Smalyshev) p:05Triage>03Normal [18:30:11] 10Analytics, 06Discovery, 10EventBus, 10Wikidata, and 3 others: Create reliable change stream for specific wiki - https://phabricator.wikimedia.org/T161731#3141332 (10Anomie) The pony being requested here is not going to happen in the action API, removing tag. This is probably an effective duplicate of {T1... [18:30:20] 10Analytics-Tech-community-metrics, 06Developer-Relations (Apr-Jun 2017): Go through default Kibana widgets; decide which ones are not relevant for us and remove them - https://phabricator.wikimedia.org/T147001#3141334 (10Aklapper) p:05High>03Low T138002 got fixed so three more dashboard to check, however... [18:37:04] 10Analytics, 06Discovery, 10EventBus, 10Wikidata, and 3 others: Create reliable change stream for specific wiki - https://phabricator.wikimedia.org/T161731#3141367 (10Smalyshev) I think EventStreams is closest to the goal too, but I want to have a complete description of the pony for the record so that we... [19:00:05] 10Analytics-EventLogging, 06Analytics-Kanban, 13Patch-For-Review: Change userAgent field to user_agent_map in EventCapsule - https://phabricator.wikimedia.org/T153207#2872765 (10Nuria) Changes have been deployed to production. [19:03:46] milimetric: i am so proud of my bubble plots: https://phabricator.wikimedia.org/F7065293 [19:04:13] https://phabricator.wikimedia.org/F7065290 [19:05:11] cc joal, while doing those i also realized that the number of unidentified pageviews ('-') was akin to Search:Requests [19:16:27] woah cool! [19:19:43] nuria: Not sure I understand the point about "-" and Search:Request [19:20:25] joal, that the number of unidentified pageviews is ~equal to the number of Search "pageviews" [19:20:28] ottomata: I think I succesfully ran a spark job with scikit learn [19:20:51] nuria: I think this is unrelated --> If they both come from pageivew, [19:21:17] ottomata: Can you explain to me how I could setup a full jessie cluster in labs? [19:21:43] nuria: When looking at title in pageviews table, there is no duplicate [19:22:25] nuria: If you took the uri_path, then it makes sense :) [19:22:49] joal: ay sorry, no, I mean that numbers "in bulk" are similar as in "the number of pageviews for which we cannot find out the title" and "the number of search requests" is about the same [19:22:51] joal: !!! amaazing! [19:23:01] joal: ya we just have to put new workers in it [19:23:07] you don't need the master to be jessie, right? [19:23:20] joal: but nevermind, it was my own learning [19:23:21] joal: we can probably jsut delete the trusty workers [19:23:30] then you'd have a single worker node jessie cluster [19:23:31] ottomata: only the worlkers, and I need some setup (packages installed [19:24:00] ottomata: forgot to copy the numpy special package you pasted earlier, can you paste again? [19:25:09] joal: https://packages.debian.org/jessie-backports/python-numpy [19:25:15] python-numpy=1:1.12.0-2~bpo8+1 [19:25:32] thanks ottomata [19:35:44] ottomata: master is cdh3-1, right ? [19:35:52] sounds right :) [19:36:28] double checking [19:37:10] joal: correct [19:37:16] k thanks [19:38:41] 06Analytics-Kanban: Pageview Spike in Tagalog Wikipedia mid-June 2016 - https://phabricator.wikimedia.org/T144635#3141582 (10Nuria) 05Open>03Resolved [19:39:02] 06Analytics-Kanban, 06Operations, 06WMDE-Analytics-Engineering, 13Patch-For-Review, 15User-Addshore: /a/mw-log/archive/api on stat1002 no longer being populated - https://phabricator.wikimedia.org/T160888#3141583 (10Nuria) 05Open>03Resolved [19:39:23] 10Analytics-Cluster, 06Analytics-Kanban, 13Patch-For-Review: Review Druid's logging configuration - https://phabricator.wikimedia.org/T155491#3141587 (10Nuria) 05Open>03Resolved [19:39:35] 10Analytics-Cluster, 06Analytics-Kanban, 13Patch-For-Review: Move away Hue and Camus (and other crons) from analytics1027 - https://phabricator.wikimedia.org/T159527#3141589 (10Nuria) 05Open>03Resolved [19:41:21] ottomata: Trying to add a new node to the cluster - How can I setup conf? [19:45:35] actually ottomata - I deleted cdh3-[2|3|4], and it seems the cluster didn't like :( [19:45:37] 10Analytics: Update pivot to latest version - https://phabricator.wikimedia.org/T161630#3141606 (10Nuria) [19:54:01] mwarf, I have lost ottomata :( [20:25:37] 10Analytics, 10ChangeProp, 10EventBus, 06Services (later), 15User-mobrovac: [EPIC] Develop a JobQueue backend based on EventBus - https://phabricator.wikimedia.org/T157088#3141759 (10Pchelolo) [20:33:49] 06Analytics-Kanban: Check abnormal pageviews for XHamster - https://phabricator.wikimedia.org/T158071#3141805 (10Nuria) Traffic starts getting significantly higher from 1/8/2016 onwards Xhamster blog has loads of stats as of their traffic: https://xhamster.com/blog but couldn't find anything that might be rela... [20:36:41] 10Analytics, 10ChangeProp, 10EventBus, 06Revision-Scoring-As-A-Service, and 3 others: Create generalized "precache" endpoint for ORES - https://phabricator.wikimedia.org/T148714#3141818 (10Halfak) [20:36:41] sorry joal! [20:36:49] id din't realize i had signed off of irc [20:36:52] cafe here being weird [20:37:00] you can create a new node [20:37:03] and then go to puppet class config [20:37:09] and include role analytics cluster worker [20:37:10] then run puppet [20:37:11] should be it! [20:38:00] 10Analytics, 10ChangeProp, 10EventBus, 06Revision-Scoring-As-A-Service, and 3 others: Create generalized "precache" endpoint for ORES - https://phabricator.wikimedia.org/T148714#2730864 (10Halfak) [20:40:31] ah you have to run puppet twice? hm. [20:40:34] ok oh well [20:40:37] joal: am i doing it for you [20:40:42] on cdh3-6 [21:06:43] 10Analytics-EventLogging, 06Analytics-Kanban, 10DBA, 13Patch-For-Review: Add autoincrement id to EventLogging MySQL tables. {oryx} - https://phabricator.wikimedia.org/T125135#3142024 (10Ottomata) Just a note: now that EL tables were renamed, and active tables recreated in T160454, they should all have an...