[00:48:32] Analytics, Operations, Product-Analytics, SRE-Access-Requests: Access to analytics infrastructure for SNowick_WMF - https://phabricator.wikimedia.org/T242026 (Dzahn)
[00:53:21] Analytics, Operations, Product-Analytics, SRE-Access-Requests: Access to analytics infrastructure for SNowick_WMF - https://phabricator.wikimedia.org/T242026 (Dzahn) I see on https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Hue " If you already have cluster access, but can't log into Hue, it...
[01:34:36] Analytics, Operations, Product-Analytics, SRE-Access-Requests: Access to analytics infrastructure for SNowick_WMF - https://phabricator.wikimedia.org/T242026 (SNowick_WMF) Thanks, yes it is a manual sync process: The ticket attached to this one says "Currently, Hue users are manually synced from...
[01:39:19] Analytics, Operations, Product-Analytics, SRE-Access-Requests: Access to analytics infrastructure for SNowick_WMF - https://phabricator.wikimedia.org/T242026 (Dzahn) a:elukey
[02:11:40] Analytics, Pywikibot, Wikimedia-Site-requests: Provide some Pywikibot usage statistics for Python2.7 and Python3.x - https://phabricator.wikimedia.org/T242157 (Xqt)
[02:22:05] Analytics, Pywikibot, Wikimedia-Site-requests: Provide some Pywikibot usage statistics for Python2.7 and Python3.x - https://phabricator.wikimedia.org/T242157 (Xqt) An all usage statistic for 2019 is given here: https://www.jetbrains.com/lp/devecosystem-2019/python/ where one of 10 users still takes...
[03:49:38] (PS2) Milimetric: Encode to pagecounts-ez format with a UDF [analytics/refinery/source] - https://gerrit.wikimedia.org/r/562597 (https://phabricator.wikimedia.org/T192474)
[03:52:15] (CR) jerkins-bot: [V: -1] Encode to pagecounts-ez format with a UDF [analytics/refinery/source] - https://gerrit.wikimedia.org/r/562597 (https://phabricator.wikimedia.org/T192474) (owner: Milimetric)
[04:03:19] (PS3) Milimetric: Encode to pagecounts-ez format with a UDF [analytics/refinery/source] - https://gerrit.wikimedia.org/r/562597 (https://phabricator.wikimedia.org/T192474)
[05:59:43] good morning team
[07:49:33] good morning fdans
[07:50:02] elukey: helloooo whenever you want we can do the wikistats move
[07:50:57] fdans: sure, gimme ~30 mins to bootstrap ok?
[07:51:05] sounds good!
[07:51:07] and then I'll be ready :)
[07:51:35] fdans: yesterday I was thinking to make a list of stat v1 urls to check after the move, to see if they are broken or not
[07:51:49] sort of unit testing :D
[07:52:03] can you come up with a good list or urls?
[07:52:05] yeah that's a good idea
[08:05:03] (PS3) Fdans: Add vue-i18n integration, English strings [analytics/wikistats2] - https://gerrit.wikimedia.org/r/558702 (https://phabricator.wikimedia.org/T240617)
[08:05:24] (CR) jerkins-bot: [V: -1] Add vue-i18n integration, English strings [analytics/wikistats2] - https://gerrit.wikimedia.org/r/558702 (https://phabricator.wikimedia.org/T240617) (owner: Fdans)
[08:06:19] (PS4) Fdans: Add vue-i18n integration, English strings [analytics/wikistats2] - https://gerrit.wikimedia.org/r/558702 (https://phabricator.wikimedia.org/T240617)
[08:07:29] (CR) jerkins-bot: [V: -1] Add vue-i18n integration, English strings [analytics/wikistats2] - https://gerrit.wikimedia.org/r/558702 (https://phabricator.wikimedia.org/T240617) (owner: Fdans)
[08:12:50] elukey: I have a bunch of urls!
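The "list of stat v1 urls to check after the move" idea from 07:51 could look roughly like the sketch below. It is only an illustration under assumptions: the actual URL list is not in this log, the function names `http_status` and `find_broken` are made up here, and treating any non-200 final status as "broken" is a choice of this sketch rather than something the team agreed on.

```python
from typing import Callable, Dict, Iterable
from urllib.error import HTTPError
from urllib.request import urlopen


def http_status(url: str, timeout: float = 10.0) -> int:
    """Return the final HTTP status for url, or 0 if no response was received."""
    try:
        # urlopen follows redirects, so this is the status of the final target.
        with urlopen(url, timeout=timeout) as response:
            return response.status
    except HTTPError as err:
        # 4xx/5xx responses are raised as HTTPError; keep their status code.
        return err.code
    except OSError:
        # DNS failures, refused connections and timeouts end up here.
        return 0


def find_broken(
    urls: Iterable[str],
    fetch: Callable[[str], int] = http_status,
) -> Dict[str, int]:
    """Return {url: status} for every URL whose status is not 200."""
    broken = {}
    for url in urls:
        status = fetch(url)
        if status != 200:
            broken[url] = status
    return broken
```

Running `find_broken(urls)` once before and once after the move and comparing the two results would show which old links regressed. The `fetch` parameter exists so the check can be exercised without network access.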
[08:15:46] \o/
[08:15:56] I am sending emails to people using too much space on notebooks again :(
[08:35:47] elukey: ban everybody
[08:40:38] I am wondering if the new 'stat1008' should be 'notebook1005', with a GPU and a lot of disk space like a stat node
[08:40:51] so people can work on it while we experiment with newpyter
[08:43:24] anyway, stats.w.o
[08:44:36] ah snap we haven't cleaned up the old stuff yet
[08:44:40] just remembered
[08:46:13] I am re-reviewing the task, I still have some doubts
[08:57:51] elukey: cleaned up the old stuff?
[08:58:20] fdans: yes the old geowiki things etc..
[08:58:26] from the vhost config
[09:01:42] fdans: time to bc for a sec?
[09:01:43] Analytics, Analytics-Kanban: Make stats.wikimedia.org point to wikistats2 by default - https://phabricator.wikimedia.org/T237752 (elukey) >>! In T237752#5667573, @elukey wrote: > The other idea that me and Fran had about moving on with the stats.w.o -> v2 transition is the following: > > 1) Add somethin...
[09:01:49] see --^
[09:03:09] elukey: omw
[09:16:03] Analytics, Pywikibot, Wikimedia-Site-requests, User-Urbanecm: Provide some Pywikibot usage statistics for Python2.7 and Python3.x - https://phabricator.wikimedia.org/T242157 (Urbanecm) a:Urbanecm Maybe I'm doing something wrong, but I logged to [Turnilo](https://turnilo.wikimedia.org) and sub...
[10:01:11] Analytics, Pywikibot, Wikimedia-Site-requests, User-Urbanecm: Provide some Pywikibot usage statistics for Python2.7 and Python3.x - https://phabricator.wikimedia.org/T242157 (Dvorapa) This seems similar to core/compat times. One of the reasons is that help pages and docs suggest `python pwb.py`,...
[12:13:21] ottomata: re https://wikitech.wikimedia.org/w/index.php?title=Analytics/Systems/Superset&diff=next&oldid=1837211, does this mean we can remove the part about accounts not being automatically created from the access section?
[12:15:30] neilpquinn: the account is created when you first log in by superset now (do you remember the horrible http 500 used to be returned for first time users?)
[12:16:12] so it is sufficient to be in wmf/nda to access superset to have your account created
[12:16:57] fdans: I came up with https://phabricator.wikimedia.org/T237752#5785128
[12:17:23] that is a little bit more verbose in httpd's settings, but probably easier to follow/maintain
[12:19:59] elukey: it does seem more clear on its intent
[12:20:04] fine with me!
[12:21:46] Analytics: Kerbereros Identity for Gabriel Birke - https://phabricator.wikimedia.org/T242215 (gabriel-wmde)
[12:27:36] fdans: ack!
[12:29:34] Analytics: Kerbereros Identity for Gabriel Birke - https://phabricator.wikimedia.org/T242215 (elukey) Open→Resolved a:elukey ` elukey@krb1001:~$ sudo manage_principals.py create gbirke --email_address=gabriel.birke@wikimedia.de Principal successfully created. Make sure to update data.yaml in Pupp...
[12:38:48] fdans: ah no wait, there is one thing still missing, namely index.html
[12:39:00] in theory it should be looked into v2 first
[12:39:06] but I'd need to check
[12:39:08] forgot about it
[12:43:18] probably not, uff
[12:43:46] back to square one
[12:58:17] fdans: sorry for the hassle, updated the task with a new version
[12:58:34] that involves the creation of /v1 but not /v2, that is moved to the main root dir
[12:58:35] Hi team - siesta time :)
[12:59:32] nono elukey thank you for doing this
[13:05:55] Analytics: Kerbereros Identity for Corinna Hillebrand - https://phabricator.wikimedia.org/T242217 (CorinnaHillebrand_WMDE)
[13:05:59] with this version we shouldn't need any symlink etc..
[13:06:38] elukey: sounds good to me!
[13:06:57] (PS5) Fdans: Add vue-i18n integration, English strings [analytics/wikistats2] - https://gerrit.wikimedia.org/r/558702 (https://phabricator.wikimedia.org/T240617)
[13:12:07] (CR) Joal: [V: +2 C: +2] "Merging for deploy later today" [analytics/refinery] - https://gerrit.wikimedia.org/r/562325 (https://phabricator.wikimedia.org/T242015) (owner: Joal)
[13:12:42] fdans: I'd need your consult if you have time in bc
[13:14:33] elukey: omw
[13:15:48] (PS2) Joal: Reduce mediawiki-history oozie accepted SLA delay [analytics/refinery] - https://gerrit.wikimedia.org/r/562500 (https://phabricator.wikimedia.org/T242015)
[13:17:53] (CR) Joal: [V: +2 C: +2] "Merging for deploy later today" [analytics/refinery] - https://gerrit.wikimedia.org/r/562500 (https://phabricator.wikimedia.org/T242015) (owner: Joal)
[13:29:31] Analytics, Pywikibot, Wikimedia-Site-requests, User-Urbanecm: Provide some Pywikibot usage statistics for Python2.7 and Python3.x - https://phabricator.wikimedia.org/T242157 (Xqt) The result is surprising me a bit: - good news: Python 3.4 isn't used and we can drop support for that release soon....
[13:35:38] Analytics: Kerbereros Identity for Corinna Hillebrand - https://phabricator.wikimedia.org/T242217 (elukey) ` elukey@krb1001:~$ sudo manage_principals.py create cohi --email_address=corinna.hillebrand@wikimedia.de Principal successfully created. Make sure to update data.yaml in Puppet. Successfully sent email...
[14:10:45] Analytics, Analytics-Kanban, Patch-For-Review, User-Elukey: Investigate sporadic failures in oozie hive actions due to Kerberos auth - https://phabricator.wikimedia.org/T241650 (elukey)
[14:11:41] Analytics, Analytics-Kanban, User-Elukey: Investigate Hue alarms - https://phabricator.wikimedia.org/T241649 (elukey)
[14:12:40] Analytics, Analytics-Kanban, User-Elukey: Investigate Hue alarms - https://phabricator.wikimedia.org/T241649 (elukey) I am inclined to mark this as done and re-open if the issue re-occurs (wasn't able to repro it).
[14:22:52] ottomata: o/
[14:22:55] ehllooooo
[14:23:10] do you have a min for a chat about an idea? (on irc I mean)
[14:23:21] sure!
[14:23:28] thanks :)
[14:23:51] today I checked the work to add user quotas to notebooks, and since there's nothing in puppet (afaics) it is a bit of a hassle
[14:24:06] so I was wondering if we could do something like the following
[14:24:20] 1) order stat1008 with the GPU, and move stat1004's homes to it
[14:24:49] 2) reimage stat1004 as notebook1005, and allow users to use notebooks with more space until newpyter is ready
[14:25:32] Analytics: Kerberos password for user mepps - https://phabricator.wikimedia.org/T242222 (mepps)
[14:25:37] just as interim solution to facilitate everybody's job
[14:26:27] sounds fine to me elukey!
[14:27:00] ack then, I'll propose it to the team later on then :)
[14:27:21] also, how do you prefer to proceed with newpyter?
[14:28:23] possibly start from https://phabricator.wikimedia.org/T188275 ?
[14:29:56] Analytics, Analytics-EventLogging, Analytics-Kanban, Event-Platform, Services (watching): Switch all eventgate clients to use new TLS port - https://phabricator.wikimedia.org/T242224 (Ottomata)
[14:29:57] ah I found https://phabricator.wikimedia.org/T224658
[14:30:04] Analytics, Analytics-EventLogging, Analytics-Kanban, Event-Platform, Services (watching): Switch all eventgate clients to use new TLS port - https://phabricator.wikimedia.org/T242224 (Ottomata)
[14:30:30] yeah! elukey i'll have headspace hopefully by next week to start thinking about that stuff!
[14:30:39] excited to work on that with ya!
[14:30:55] Analytics: Newpyter - First Class Jupyter Notebook system - https://phabricator.wikimedia.org/T224658 (Ottomata)
[14:36:24] Analytics, Operations, Product-Analytics, SRE-Access-Requests: Access to analytics infrastructure for SNowick_WMF - https://phabricator.wikimedia.org/T242026 (Ottomata) Manual syncing is still needed for Hue (users are in MySQL, not SQLite, syncing is still needed).
[14:36:45] ottomata: I'll start reading docs then, super ignorant in the subject, let me know if I can help on this
[14:37:21] Analytics, Operations, Product-Analytics, SRE-Access-Requests: Access to analytics infrastructure for SNowick_WMF - https://phabricator.wikimedia.org/T242026 (Ottomata) Done. Use your shell username and ldap password to login.
[14:37:30] ok great
[14:37:41] elukey: i'm ignorant too, we will learn together!
[14:40:34] Analytics, Operations, Product-Analytics, SRE-Access-Requests: Access to analytics infrastructure for SNowick_WMF - https://phabricator.wikimedia.org/T242026 (Ottomata) Also hi and welcome! :D
[14:41:36] Analytics, Pywikibot, Wikimedia-Site-requests, User-Urbanecm: Provide some Pywikibot usage statistics for Python2.7 and Python3.x - https://phabricator.wikimedia.org/T242157 (Urbanecm) >>!
In T242157#5785695, @Xqt wrote: > The result is surprising me a bit: > - good news: Python 3.4 isn't used a...
[14:43:04] oh elukey i have more tls analytics vlan changes!
[14:43:17] we are doing a migration, so this should be adding a rule
[14:43:19] not just changing
[14:43:19] for now
[14:44:18] https://phabricator.wikimedia.org/T242224#5785876
[14:44:46] Analytics, Analytics-EventLogging, Analytics-Kanban, Event-Platform, Services (watching): Switch all eventgate clients to use new TLS port - https://phabricator.wikimedia.org/T242224 (Ottomata) Ah @elukey, we need to allow port 4192 for eventgate-analytics from analytics VLAN.
[14:47:46] ottomata: is it needed now or can it be done around standup?
[14:48:07] can wait elukey ty
[14:48:25] super, because I'll ask Arzhel to use https://wikitech.wikimedia.org/wiki/Homer !
[14:48:36] oh mny
[14:48:40] cooool
[14:50:24] ottomata: the port needs to be added to the range or swapped with the current one?
[14:50:53] we need to change configs for clients to use it too
[14:51:03] so wee need to add it for now
[14:51:06] ack
[14:51:09] we can remove the old one after
[14:54:02] running an errand for ~1h, ttl!
[14:56:16] hi elukey
[14:56:52] qq: are the notebook machines working? I can open the notebook, but can't run any command there
[15:04:57] Analytics, Analytics-EventLogging, Analytics-Kanban, Event-Platform, and 2 others: Switch all eventgate clients to use new TLS port - https://phabricator.wikimedia.org/T242224 (Ottomata)
[15:06:20] hm, dsaez what do you mean?
[15:18:00] joal / mforns: I'm meeting with PA after standup to talk about canonical data / refinery/static_data, let me know if you're interested
[16:12:05] ottomata, I can open the notebooks, but can't excute commands there
[16:12:33] ottomata, this for pyspark notebook, python is working ok.
[16:29:17] dsaez: o/ - what notebook host?
[16:31:35] dsaez: did you kinit?
I see errors on 1003 for Caused by: GSSException: No valid credentials provided (Mechanism level: Failed to find any Kerberos tgt
[16:53:25] Hey analytics - do individual users need their own kerb tokens to use beeline now? https://phabricator.wikimedia.org/T242236
[16:55:21] Analytics: Problem connecting to database from stat1007.eqiad.wmnet - https://phabricator.wikimedia.org/T242236 (sbassett)
[16:59:52] sorry team 2fa
[17:01:50] ping fdans staduppp
[17:09:34] sbassett: yep, we enabled kerberos in December :)
[17:13:44] Analytics: Problem connecting to database from stat1007.eqiad.wmnet - https://phabricator.wikimedia.org/T242236 (elukey) We enabled kerberos authentication in December, more info about how to get new credentials in https://wikitech.wikimedia.org/wiki/Analytics/Systems/Kerberos/UserGuide#Get_a_password_for_Ke...
[17:14:41] sbassett: I added some info to the task, please ping me if you need clarification or have doubts etc..
[17:15:10] elukey: I'm creating a kerb creds request task rn. I imagine dsharpe will be doing the same soon :)
[17:15:57] sbassett: yes sorry I just realized that it is completely separate, I thought you were working on the same problem :)
[17:17:47] hello dsharpe :)
[17:17:57] Hello there!
[17:18:19] Analytics: Kerberos credentials for sbassett - https://phabricator.wikimedia.org/T242242 (sbassett)
[17:18:21] Analytics: Kerberos credentials for sbassett - https://phabricator.wikimedia.org/T242242 (sbassett)
[17:21:16] dsharpe: do you need a kerb principal?
[17:21:49] (I am doing one know this is why I am asking)
[17:23:46] Yes please. I just submittted https://phabricator.wikimedia.org/T242244. I don't know how to do check that "Your shell username is in analytics-privatedata-users."
from https://wikitech.wikimedia.org/wiki/Analytics/Systems/Kerberos/UserGuide#Authenticate_via_Kerberos
[17:24:29] dsharpe: basically if you are in the analytics-privatedata-user posix group (you are), so I'll add it
[17:27:52] Analytics: Kerberos credentials for dsharpe - https://phabricator.wikimedia.org/T242244 (elukey) ` elukey@krb1001:~$ sudo manage_principals.py create dsharpe --email_address=dsharpe@wikimedia.org Principal successfully created. Make sure to update data.yaml in Puppet. Successfully sent email to dsharpe@wikim...
[17:32:05] Analytics: Kerberos credentials for jfishback - https://phabricator.wikimedia.org/T242245 (JFishback_WMF)
[17:36:25] Analytics, Analytics-Cluster, Product-Analytics: Improve Hue user management - https://phabricator.wikimedia.org/T127850 (Dzahn)
[17:36:28] Analytics, Operations, Product-Analytics, SRE-Access-Requests: Access to analytics infrastructure for SNowick_WMF - https://phabricator.wikimedia.org/T242026 (Dzahn) Open→Resolved Cool, thanks, Ottomata. Closing ticket.
[17:36:28] elukey: Thanks!
[17:40:48] Analytics, Android-app-Bugs, Wikipedia-Android-App-Backlog: EventLogging sees MobileWikiAppFindInPage parsing errors - https://phabricator.wikimedia.org/T147196 (Dbrant)
[17:41:52] dsharpe: yw!
[17:43:39] (PS6) Mforns: [WIP] Add Spark/Scala module for anomaly detection [analytics/refinery/source] - https://gerrit.wikimedia.org/r/561674 (https://phabricator.wikimedia.org/T235486)
[17:47:14] Analytics: Problem connecting to database from stat1007.eqiad.wmnet - https://phabricator.wikimedia.org/T242236 (Dsharpe) Open→Resolved a:Dsharpe I am all set now. Thanks!
[17:51:39] Analytics, Patch-For-Review: Kerberos credentials for sbassett - https://phabricator.wikimedia.org/T242242 (elukey) >>! In T242242#5786883, @sbassett wrote: > @elukey - thanks. Can I create a keytab for myself on `stat1007` or does #analytics need to do that for me?
Wasn't quite certain of the process...
[17:52:31] !log Rerun webrequest-load-wf-text-2020-1-8-15 with updated thresholds after frontend issue
[17:52:33] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[17:52:38] (PS1) Elukey: Add the 'response_size' dimension back to webrequest_sampled_128 druid indexation [analytics/refinery] - https://gerrit.wikimedia.org/r/562888 (https://phabricator.wikimedia.org/T240681)
[17:53:32] joal: not sure if correct or not --^
[17:53:47] I can quickly test it with my test dataset
[17:53:50] just to be sure
[17:54:05] elukey: I feel conformata
[17:54:06] (I mean launching a coord for 1 hour and see in turnilo)
[17:54:18] confortable - but if you can test easily that's even better :)
[17:54:40] elukey: you tell me :)
[17:54:54] joal: if you think the change is ok please go ahead :)
[17:55:05] I just merged the two puppet changes
[17:55:12] let me know when you want to test aqs
[17:55:47] elukey: now!
[17:55:48] :)
[17:55:53] (CR) Joal: [V: +2 C: +2] "Merging for deploy later today" [analytics/refinery] - https://gerrit.wikimedia.org/r/562888 (https://phabricator.wikimedia.org/T240681) (owner: Elukey)
[17:57:39] (Merged) jenkins-bot: Correct MW XML dumps converter parameter parsing [analytics/refinery/source] - https://gerrit.wikimedia.org/r/560833 (owner: Joal)
[17:58:06] mforns: any idea why reportupdater would get a Permission denied error when trying to access wmf_raw data? Are files owned as analytics:hdfs not accessible to the RU user on stat1007?
[17:58:33] milimetric, I think ru runs with analytics user no?
[17:59:09] is analytics user able to access those files?
[17:59:22] oh!
of course
[17:59:32] I though you were saying hdfs:hdfs
[17:59:58] milimetric, the permission denied error might be when trying to access a pid file though
[18:00:03] joal: aqs1004 depooled and ready
[18:00:06] or the history file
[18:00:14] elukey: gone for a test :)
[18:00:34] milimetric, or the rerun directory if any
[18:00:52] milimetric, in any case those temp files can be deleted and RU restarted
[18:01:11] elukey: good for me :)
[18:01:19] hm...
[18:01:27] elukey: do you do it, or do you prefer me to do it ? (using scap)
[18:01:54] mforns: no pid file in this case, it seems like it can't access the data somehow
[18:02:25] milimetric: do you want me to wait for https://gerrit.wikimedia.org/r/#/c/analytics/refinery/+/555578/ ?
[18:03:09] joal: I can do it via cookbook
[18:03:19] elukey: works for me :) As ou prefer
[18:03:37] * joal feels a new keYboard will be needed soon
[18:06:38] joal: oh shoot, sorry, I can do it in a few minutes is that ok?
[18:06:52] milimetric: it is :)
[18:07:32] joal: done!
[18:08:07] also milimetric, I guess we move to using canonical table being sure we'll use that in the future (getting info from your meeting ;)
[18:08:11] Analytics, Analytics-Kanban, Product-Analytics: Keep canonical_data.wikis updated - https://phabricator.wikimedia.org/T241741 (Milimetric) A productive meeting with Connie and Neil resulted in the following draft proposal: * create a separate repository and iteratively migrate the contents of refine...
[18:08:22] awesome - Thanks elukey - Testing UI
[18:09:12] Analytics, Patch-For-Review: Kerberos credentials for sbassett - https://phabricator.wikimedia.org/T242242 (sbassett) >>! In T242242#5786945, @elukey wrote: > You can use the one that all the members of `analytics-privatedata-users` can access, sudoing as the `analytics-privatedata` users. We don't creat...
[18:10:50] all good for me elukey - Thanks a lot :)
[18:11:03] !log AQS deployed with new druid datasource (2019-12)
[18:11:04] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[18:13:37] Analytics: Kerberos credentials for sbassett - https://phabricator.wikimedia.org/T242242 (elukey) Open→Resolved
[18:13:39] (PS5) Milimetric: Add and update columns on edit_hourly table [analytics/refinery] - https://gerrit.wikimedia.org/r/555578 (https://phabricator.wikimedia.org/T232659) (owner: Conniecc1)
[18:13:55] ok joal ^
[18:14:30] yes, we'll be using canonical_data, but it'll be updated regularly, so best of both worlds
[18:14:30] (PS1) Joal: Bump changelog.md to 0.0.110 [analytics/refinery/source] - https://gerrit.wikimedia.org/r/562893
[18:15:37] milimetric: two things - Do we want the is_redirect field in druid?
[18:16:07] milimetric: And, maybe can we name it is_redirect_currently ? in order to have is_redirect abailable later for the real one?
[18:16:24] ooops
[18:17:02] joal: will add to druid template, but I think we can replace it in place when we have a historified one
[18:17:05] or use _historical
[18:19:56] milimetric: it feels kinda weird, since all other fields use historical values when possible - Since it;s the only one not being so, I'd rather have it explicit - Also - This is my last push, if you feel strongly it should be is_redirect, let's keep it :)
[18:20:44] joal: I kind of support your idea but _currently is confusing because it's a relative term so it could easily mean currently as in relative to that event
[18:20:53] that's why we went with _latest and that confused everyone
[18:20:54] I donno
[18:21:03] right milimetric :( hm
[18:21:39] I don't know either :(
[18:22:35] milimetric: given it'll be the only 'currently' one, maybe it's explicit enough?
[18:22:43] I was just thinking that
[18:22:44] sold!
[18:22:48] \o/
[18:25:08] (PS6) Milimetric: Add and update columns on edit_hourly table [analytics/refinery] - https://gerrit.wikimedia.org/r/555578 (https://phabricator.wikimedia.org/T232659) (owner: Conniecc1)
[18:25:08] (CR) Joal: [V: +2 C: +2] "Merging for deploy" [analytics/refinery/source] - https://gerrit.wikimedia.org/r/562893 (owner: Joal)
[18:25:44] (PS7) Milimetric: Add and update columns on edit_hourly table [analytics/refinery] - https://gerrit.wikimedia.org/r/555578 (https://phabricator.wikimedia.org/T232659) (owner: Conniecc1)
[18:26:07] milimetric: while we are it: add the is_redirect to ocmmit-message please?
[18:26:16] np
[18:27:13] Ah! found an error - sending a comment on CR
[18:27:19] (comment only)
[18:29:04] milimetric: just sent a comment about a comment
[18:29:14] (on patch 7)
[18:29:17] (CR) Joal: "1 comment + add is_redirect to commit message and we're good :)" (1 comment) [analytics/refinery] - https://gerrit.wikimedia.org/r/555578 (https://phabricator.wikimedia.org/T232659) (owner: Conniecc1)
[18:30:49] !log Releasing refinery-0.0.110 to archiva using Jenkins
[18:30:51] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[18:33:58] (PS1) Joal: Update oozie wikidata article placeholder jar [analytics/refinery] - https://gerrit.wikimedia.org/r/562905
[18:34:55] milimetric: have you seen my last comment?
[18:35:08] also, if you don't mind looking at the above milimetric --^
[18:35:19] no rush, jenkins is releasing ;)
[18:35:51] yep, removing the .org and already updated the commit msg, will look at the placeholder thing in a sec
[18:36:47] (PS9) Milimetric: Add and update columns on edit_hourly table [analytics/refinery] - https://gerrit.wikimedia.org/r/555578 (https://phabricator.wikimedia.org/T232659) (owner: Conniecc1)
[18:37:15] (CR) Milimetric: [V: +2 C: +2] Update oozie wikidata article placeholder jar [analytics/refinery] - https://gerrit.wikimedia.org/r/562905 (owner: Joal)
[18:38:19] Analytics, Pywikibot, Wikimedia-Site-requests, User-Urbanecm: Provide some Pywikibot usage statistics for Python2.7 and Python3.x - https://phabricator.wikimedia.org/T242157 (Urbanecm) Do you think we should inform the user about it?
[18:48:00] milimetric: if you have some time we can discuss wikistats, otherwise tomorrow :)
[18:48:29] elukey: yeah, sorry I should monitor this deploy and get some lunch
[18:48:48] elukey: can we chat on IRC?
[18:48:53] (CR) Joal: [V: +2 C: +2] "Merging for deploy!"
[analytics/refinery] - https://gerrit.wikimedia.org/r/555578 (https://phabricator.wikimedia.org/T232659) (owner: Conniecc1)
[18:50:02] milimetric: sure, but we can do tomorrow without rushing
[18:50:04] no problem
[18:50:26] either way, I'm happy to chat on IRC while stuffing my face :)
[18:50:36] just doing it over video seems mean :)
[18:53:27] ahahah
[18:53:40] ok so very briefly, I tried the solution that we discussed
[18:53:54] more info in https://phabricator.wikimedia.org/T237752#5785128
[18:54:09] but it doesn't work for the /dist dir issue that I mentioned during standup
[18:54:22] so I thought another solution
[18:56:07] also outlined in the task
[18:56:20] buuut no idea if it diverge too much from what you want t odo
[18:56:37] so just wanted to re-sync and see if we can find a better solution or similar
[18:56:53] I don't have any strong opinion, just trying to make all the use cases to work
[18:58:37] elukey: that seems ideal, no?
[18:59:01] this way we can easily migrate the root anytime (via changing the symlink) and all the old links work
[18:59:52] elukey: so if I get it right, the "fallback look into v1" will mean that even if a link is absolute as in /some-link-to-old-wikistats.html Apache will look for it in /v2/some-link-to-old-wikistats.html and then /v1/some-link-to-old-wikistats.html
[19:00:37] it kind of doesn't scale if we add a v3 and v4 but hopefully we really don't do that :)
[19:00:49] so I say +1 if my understanding above is correct
[19:00:52] nice idea
[19:03:41] Analytics, Pywikibot, Wikimedia-Site-requests, User-Urbanecm: Provide some Pywikibot usage statistics for Python2.7 and Python3.x - https://phabricator.wikimedia.org/T242157 (Xqt) >>! In T242157#5787298, @Urbanecm wrote: > Do you think we should inform the user about it? Sure.
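The "look in /v2 first, then fall back to /v1" lookup that milimetric describes at 18:59:52 can be illustrated with a small Python sketch. This is not the Apache configuration from T237752 (which is not reproduced in this log); the function name `resolve` and the directory layout are assumptions made only to show the lookup order.

```python
from pathlib import Path
from typing import Optional, Sequence


def resolve(
    request_path: str,
    doc_root: Path,
    versions: Sequence[str] = ("v2", "v1"),
) -> Optional[Path]:
    """Return the first existing file for request_path, trying each version directory in order.

    Mirrors the described behaviour: /some-link.html is looked up as
    <doc_root>/v2/some-link.html and then as <doc_root>/v1/some-link.html.
    Path sanitising (e.g. rejecting "..") is deliberately left out of this sketch.
    """
    relative = request_path.lstrip("/")
    for version in versions:
        candidate = doc_root / version / relative
        if candidate.is_file():
            return candidate
    # Not found in any version directory: the server would answer 404.
    return None
```

With this ordering, a page present in both trees is always served from v2, and old absolute v1 links keep working as long as v2 has no file at the same path. As noted at 19:00:37, every extra version adds one more lookup per miss.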
[19:04:11] !log Deploy refinery using scap
[19:04:12] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[19:04:26] (CR) CDanis: "Oh cool, so now it's a metric *and* a dimension? That's great!" [analytics/refinery] - https://gerrit.wikimedia.org/r/562888 (https://phabricator.wikimedia.org/T240681) (owner: Elukey)
[19:04:52] milimetric: the caveat is that we'll need to add a symlink called 'index.html' -> /v2/index.html in the root, IIRC you didn't like the idea
[19:05:08] but I might not remember correctly, if so I'll proceed with that :)
[19:05:16] in this way we can keep the current v2 deployment
[19:05:29] elukey: no, that idea's fine, I didn't love the complicated apache stuff
[19:05:30] (PS7) Mforns: [WIP] Add Spark/Scala module for anomaly detection [analytics/refinery/source] - https://gerrit.wikimedia.org/r/561674 (https://phabricator.wikimedia.org/T235486)
[19:05:38] milimetric: ack then, thanks!
[19:05:42] :) thank you
[19:10:12] Analytics, Event-Platform, MediaWiki-Maintenance-scripts, WMF-JobQueue, Wikimedia-General-or-Unknown: showJobs.php maintenance script useless and misleading in production - https://phabricator.wikimedia.org/T221224 (Pchelolo) Perhaps we could make it log a warning and error-out if JobQueueEve...
[19:12:41] * elukey off!
[19:12:48] bye elukey
[19:12:51] o/
[19:14:22] (PS8) Mforns: [WIP] Add Spark/Scala module for anomaly detection [analytics/refinery/source] - https://gerrit.wikimedia.org/r/561674 (https://phabricator.wikimedia.org/T235486)
[19:16:53] !log Deploy refinery on HDFS
[19:16:55] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[19:29:04] !log Kill-restart webrequest-druid-daily-coord and webrequest-druid-hourly-coord after deploy
[19:29:06] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[19:42:47] !log Kill-restart mediawiki-history-load-coord,
[19:42:49] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[19:44:34] !log Kill-restart mediawiki-history-load-coord, mediawiki-history-denormalize-coord, mediawiki-history-check_denormalize-coord, mediawiki-history-metrics-coord, mediawiki-history-reduced-coord, mediawiki-history-dumps-coord
[19:44:35] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[19:48:22] !log Kill-restart wikidata-articleplaceholder_metrics-coord
[19:48:24] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[19:50:12] (PS9) Mforns: [WIP] Add Spark/Scala module for anomaly detection [analytics/refinery/source] - https://gerrit.wikimedia.org/r/561674 (https://phabricator.wikimedia.org/T235486)
[19:51:31] (PS10) Mforns: Add Spark/Scala module for anomaly detection [analytics/refinery/source] - https://gerrit.wikimedia.org/r/561674 (https://phabricator.wikimedia.org/T235486)
[19:56:39] !log Kill-restart edit-hourly-coord and edit-hourly-druid-coord
[19:56:41] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[19:56:52] !log kill wikidata-articleplaceholder_metrics-coord as it is buggy
[19:56:54] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[19:59:05] (CR) Joal: "This patch doesn't work in prod - Let's have another patch
for next week deploy." (1 comment) [analytics/refinery/source] - https://gerrit.wikimedia.org/r/556988 (https://phabricator.wikimedia.org/T236895) (owner: Ladsgroup)
[20:00:14] Analytics, ArticlePlaceholder, Wikidata, wikidata-tech-focus, and 3 others: ArticlePlaceholder dashboard stopped tracking page views - https://phabricator.wikimedia.org/T236895 (JAllemandou) The patch merged by @Nuria had a bug. I commented on the already merged patch on a solution. For the momen...
[20:00:35] Analytics, Analytics-Kanban, ArticlePlaceholder, Wikidata, and 4 others: ArticlePlaceholder dashboard stopped tracking page views - https://phabricator.wikimedia.org/T236895 (JAllemandou)
[20:06:00] !log Prepare and start learning-features-actor-hourly-coord
[20:06:01] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[20:17:53] !log rerun edit-hourly-wf-2019-12 after having updated the underlying table
[20:17:54] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[20:21:36] git st
[20:21:38] oops
[20:32:16] joal, is there an example of oozie spark action where we collect the job's output?
[20:32:20] in refinery?
[20:32:37] I don't think so mforns
[20:32:39] I mean, oozie collects it an uses it in a decision
[20:32:48] I see, you think that is possible?
[20:33:05] possible - we do it in hive, whu mnot in spark
[20:33:14] pff - tired - fatty fingers
[20:33:24] hehe
[20:33:35] ok, will look for the hive example
[20:33:49] mforns: IIRC webrequest is one of them
[20:33:54] ok, cool :]
[20:34:11] mforns: oh and actually mediawiki-history-check is another, from spark
[20:34:26] I didn't think about it
[20:34:48] oh!
great
[20:44:17] !log Restart webrequest-load-bundle to update queue to production
[20:44:18] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[20:47:47] (PS1) Joal: Fix oozie learning/features/actor/hourly [analytics/refinery] - https://gerrit.wikimedia.org/r/562957
[20:48:28] (CR) Joal: "Fixed in prod by overwriting CLI parameter." [analytics/refinery] - https://gerrit.wikimedia.org/r/562957 (owner: Joal)
[20:51:48] (PS2) Joal: Fix oozie learning/features/actor/hourly [analytics/refinery] - https://gerrit.wikimedia.org/r/562957 (https://phabricator.wikimedia.org/T238360)
[20:52:55] ok team, deploy is done - One failure (wikidata article-placeholder see T236895), and 2 jobs still running (https://hue.wikimedia.org/oozie/list_oozie_coordinator/0028067-191216160148723-oozie-oozi-C/ and https://hue.wikimedia.org/oozie/list_oozie_coordinator/0028069-191216160148723-oozie-oozi-C/)
[20:52:56] T236895: ArticlePlaceholder dashboard stopped tracking page views - https://phabricator.wikimedia.org/T236895
[20:53:03] I'm gonna get diner
[20:53:06] See y'all tomorrow
[20:58:32] byeeeee
[21:21:24] (PS11) Mforns: Add Spark/Scala module for anomaly detection [analytics/refinery/source] - https://gerrit.wikimedia.org/r/561674 (https://phabricator.wikimedia.org/T235486)
[21:49:11] Analytics, Analytics-EventLogging, Analytics-Kanban, Event-Platform, and 2 others: Create and use new schema repositories - https://phabricator.wikimedia.org/T240985 (Ottomata) Ok, new images and configs deployed to all staging instances. I tested POSTing an example of each event type that each...
[22:38:47] Analytics, Pywikibot, Wikimedia-Site-requests, User-Urbanecm: Provide some Pywikibot usage statistics for Python2.7 and Python3.x - https://phabricator.wikimedia.org/T242157 (Multichill) @Urbanecm thanks for looking into this. Appreciated. Not sure if bot usernames are a privacy issue, I think no...
[23:03:44] Analytics, Datasets-Archiving, Research-Backlog: Make HTML dumps available - https://phabricator.wikimedia.org/T182351 (leila) @ArielGlenn thanks for engaging on this ticket further. From the current conversation, it doesn't look to me as we can have a solution for having HTML dumps on a regular basi...