[00:58:56] 10Analytics, 10Discovery: Ingest wdqs metrics into druid - https://phabricator.wikimedia.org/T240498 (10Nuria) [00:59:34] 10Analytics, 10Discovery: Ingest wdqs metrics into druid - https://phabricator.wikimedia.org/T240498 (10Nuria) [02:57:44] 10Analytics, 10Analytics-Kanban, 10Product-Analytics, 10SDC General, 10Wikidata: Create reportupdater reports that execute SDC requests - https://phabricator.wikimedia.org/T239565 (10Milimetric) Ok, seems like some of this confusion is getting cleared up. For my part, here's what I'm planning to do next... [03:12:05] (03CR) 10Milimetric: "This looks really good. I haven't read the requirements, so I'd need to go over those more closely, but I couldn't find anything wrong wi" (032 comments) [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/556449 (https://phabricator.wikimedia.org/T239625) (owner: 10Lex Nasser) [04:20:27] PROBLEM - Check the last execution of monitor_refine_mediawiki_job_events on an-coord1001 is CRITICAL: CRITICAL: Status of the systemd unit monitor_refine_mediawiki_job_events https://wikitech.wikimedia.org/wiki/Analytics/Systems/Managing_systemd_timers [07:57:07] Hi team [07:58:46] mber2019 [08:12:11] o/ [08:12:48] elukey: if you wanna have a look (not yet finished, but looking reasonably ok IMO): https://github.com/jobar/hdfs-tools [08:20:10] joal: my experience with Scala is zero so I cannot really make any judgement, but I am sure it is super good :) [08:20:46] elukey: I thank you for the blind trust, but probably would not do it myself for myself :D [08:21:08] joal: we can surely test it today/tomorrow :) [08:21:44] elukey: when you wish :) main missing feature is exclude, the rest should work as expected [08:22:15] elukey: The shaded jar is 5M - I guess this is ok ;) [08:22:59] joal: looks super good [08:23:05] how will it be deployed? [08:23:12] via refinery or another repo? [09:25:39] excuse me elukey I missed your last ping [09:25:48] I think it'll [09:25:59] be deployed via a new repo [09:33:12] ok I didn't get this part [09:33:29] I am asking since we'd need to come up with the puppet code before monday (possibly) [09:33:41] completely agreed elukey [09:37:51] joal: so let's try to schedule a plan, since I'll need to follow up with Brooke and Ariel [09:38:34] yes elukey - I hope to have a full version (with exclude) either tonight or tomorrow morning [09:38:56] elukey: Andrew reviews my code (hopefully there are not too big of changes :S) [09:39:21] elukey: Andrew told me yesterday he'd set up a github for the project once reviewed etc [09:40:22] We then need to have a new repo to store the jar, the scripts (to facilitate running the jar) and the deploy config [09:40:33] elukey: -- Does that sound correct about actions to be taken? [09:41:06] joal: do you have a min for batcave?
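For anyone following the monitor_refine_mediawiki_job_events alert above: the usual way to inspect a unit like this is with standard systemd commands (nothing WMF-specific; the unit name is taken from the alert text, and reset-failed is the same command elukey runs later in the day once the cause is understood):

    # check why the unit is marked failed and read its recent output
    systemctl status monitor_refine_mediawiki_job_events
    journalctl -u monitor_refine_mediawiki_job_events -n 50
    # once the failure is understood/fixed, clear the failed state so the alert recovers
    systemctl reset-failed monitor_refine_mediawiki_job_events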
[09:41:11] sure [09:48:17] 10Analytics, 10Performance-Team, 10Security-Team, 10WMF-Legal, 10Privacy: A Large-scale Study of Wikipedia Users' Quality of Experience: data release - https://phabricator.wikimedia.org/T217318 (10Gilles) a:05Gilles→03Slaporte [09:59:50] joal: other thing that I forgot to tell you is https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/556633/ [10:00:10] just to give you the idea of what I am doing [10:00:26] the hadoop-config.sh script is installed by our dear packages [10:00:27] elukey: I have no idea what augeas is :S [10:00:46] nono leave it aside, it was an experiment, look only to the file in puppet [10:00:50] hadoop-config.sh [10:00:58] that thing contains, by cloudera, this [10:01:13] # Disable ipv6 as it can cause issues [10:01:14] HADOOP_OPTS="$HADOOP_OPTS -Djava.net.preferIPv4Stack=true" [10:01:34] for some horrible reasons it ends up everywhere, in hive-server2/metastore, hdfs-datanode,etc.. [10:01:41] (in their run time parameters) [10:01:51] \o/ ! Welcome to a the new old-internet :) [10:02:17] so in the past to fix the problem in hadoop I appended -Djava.net.preferIPv4Stack=false via hadoop-env.sh [10:02:22] err hdfs-env.sh, etc.. [10:02:25] I think I recall that [10:02:35] now the problem is that with hive this is not possible [10:02:54] so I decided to remove the problem for source :D [10:03:10] in hadoop test hive is now correctly binding to ipv4 and ipv6 [10:03:24] wow - nice! [10:06:06] elukey: while being awesome, this frightens me a bit :) [10:06:33] joal: all the other daemons are running with the ipv6 settings, so I am reasonably sure it is ok [10:06:53] * joal trust elukey blindly :) [10:08:00] I will apply the change to all the workers and roll restart gently, to be sure [10:08:12] so we decouple this from monday's maintenance [10:16:28] ack elukey :) [10:19:00] 10Analytics, 10Product-Analytics, 10SDC General, 10Wikidata: Data about how many file pages on Commons contain at least one structured data element - https://phabricator.wikimedia.org/T238878 (10daniel) >>! In T238878#5730630, @Milimetric wrote: >>>! In T238878#5708257, @daniel wrote: >> By the way, if you... [11:14:29] !log stop timers on an-coord1001 as prep step for hive/oozie restart [11:14:32] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [11:21:26] interesting: /usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java -Djava.net.preferIPv4Stack=true [11:21:30] this is a yarn container [11:21:54] not really a big deal but worth to follow up [11:22:27] tested the restart of a datanode btw, all good [11:29:29] ok so going to have lunch and then I'll come back to restart hive/oozie and the hadoop workers [11:39:44] heya teammmm :] [11:49:53] (03CR) 10Mforns: [V: 03+2] "@Milimetric, @Nuria: If you both suggested data_quality_stats, I think it's good! I also like data_quality_stats :], I was more against us" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/547320 (https://phabricator.wikimedia.org/T235486) (owner: 10Mforns) [12:14:59] is "CRITICAL: Status of the systemd unit monitor_refine_mediawiki_job_events" a known issue for an-coord1001= [12:15:02] is "CRITICAL: Status of the systemd unit monitor_refine_mediawiki_job_events" a known issue for an-coord1001? [12:20:11] moritzm: yes it is, we are working on it, will be solved hopefully by EOD [12:21:36] ack, thx [12:28:15] fdans: you got the mediawiki_job refine thing? [12:28:23] oh! 
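To make the ipv6 explanation above concrete: the packaged hadoop-config.sh line elukey quotes at 10:01, and the old per-daemon workaround he mentions (appending the opposite flag in hadoop-env.sh / hdfs-env.sh, relying on the JVM taking the last -D value on the command line), look roughly like this. The workaround snippet is a sketch, not the exact puppet-managed content:

    # shipped in hadoop-config.sh by the cloudera packages:
    HADOOP_OPTS="$HADOOP_OPTS -Djava.net.preferIPv4Stack=true"
    # old workaround appended via hadoop-env.sh / hdfs-env.sh (later -D wins):
    HADOOP_OPTS="$HADOOP_OPTS -Djava.net.preferIPv4Stack=false"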
he's in Austin, duh, sorry, nvm [12:39:39] !log restart hive and oozie on an-coord1001 to pick up ipv6 settings [12:39:40] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [12:40:18] !log enable timers on an-coord1001 after maintenance [12:40:19] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [12:41:39] 10Analytics, 10Analytics-Cluster, 10Analytics-Kanban: an-coord1001 hive metastore not listening on ipv6 - https://phabricator.wikimedia.org/T240255 (10elukey) Looks better now! ` elukey@stat1004:~$ telnet an-coord1001.eqiad.wmnet 9083 Trying 2620:0:861:105:10:64:21:104... Connected to an-coord1001.eqiad.wmn... [12:41:41] 10Analytics, 10Analytics-Cluster, 10Analytics-Kanban: an-coord1001 hive metastore not listening on ipv6 - https://phabricator.wikimedia.org/T240255 (10elukey) Looks better now! ` elukey@stat1004:~$ telnet an-coord1001.eqiad.wmnet 9083 Trying 2620:0:861:105:10:64:21:104... Connected to an-coord1001.eqiad.wmn... [12:54:25] joal: on an-coord1001 all is binding on ipv4/6, nothing exploding so far [12:54:37] I am going to roll restart the hadoop workers team [12:54:56] (since I have removed the prefer ipv4 false option, not needed anymore) [12:55:10] k elukey [12:59:25] !log roll restart hadoop workers to pick up the new settings (removed prefer ipv4 false after T240255) [12:59:28] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [12:59:28] T240255: an-coord1001 hive metastore not listening on ipv6 - https://phabricator.wikimedia.org/T240255 [14:46:21] joal: decided to propose this first step https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/556681/ [14:46:41] basically replacing crons with timers, and make sure all works fine [14:47:05] then I'll prep a patch to merge on monday, so we'll change only what it runs in the define [14:48:38] (03PS14) 10Mforns: Refactor data_quality oozie bundle to fix too many partitions [analytics/refinery] - 10https://gerrit.wikimedia.org/r/547320 (https://phabricator.wikimedia.org/T235486) [14:52:48] wow - I have -1 a patch on puppet ... first tine [14:54:22] joal: I copied the current version of the cron, not added something new.. 
I'd prefer not to change it a lot from the cron if possible [14:54:45] works for me in that case :p [14:56:22] joal: I didn't get "Having a trailing slash in source means only the source content will be copied, not the last-dir of the source path" [14:58:17] elukey: this is one of rsync's tricks: if you do rsync /my/src/folder /my/dst, you'll end up with 'folder' inside /my/dst - If you do rsync /my/src/folder/ /my/dst you'll have only folder's content in /my/dst, not the parent folder [14:59:02] joal: sure but this seems how the rsyncs are set up now [14:59:53] elukey: my complete bad then - I have not looked at the rsync-cron, assuming it would take src as is - please disregard :S [15:00:11] like [15:00:13] 31 * * * * bash -c '/usr/bin/rsync -rt --delete --exclude readme.html --chmod=go-w stat1007.eqiad.wmnet::hdfs-archive/unique_devices/ /srv/dumps/xmldatadumps/public/other/unique_devices/' [15:00:25] I see [15:00:26] joal: no no I am double checking with you, that's it [15:00:29] nothing more :D [15:00:33] sounds great :) [15:01:11] let's keep trailing slashes then - But I'd rather put them in $src variable than hardcoded in command :) [15:01:14] Maybe for later [15:01:18] elukey: --^ [15:01:51] joal: yes yes we'll refactor it as second step, now I'd prefer to avoid diverging from the cron if you agree [15:02:05] ack elukey :) [15:02:34] Gone for kids :) [15:03:20] 10Analytics, 10Operations, 10SRE-Access-Requests: Add accraze to analytics-privatedata-users - https://phabricator.wikimedia.org/T240243 (10jcrespo) 05Open→03Resolved @ACraze seems to be unavailable. Resolving, but please reopen if you found issues later. [15:03:23] 10Analytics, 10Operations, 10SRE-Access-Requests: Requesting access to stats machines/ores hosts hosts for Andy Craze - https://phabricator.wikimedia.org/T226204 (10jcrespo) [15:03:32] 10Analytics, 10Operations, 10SRE-Access-Requests: Add accraze to analytics-privatedata-users - https://phabricator.wikimedia.org/T240243 (10jcrespo) a:05ACraze→03jcrespo [15:21:03] RECOVERY - Check the last execution of monitor_refine_mediawiki_job_events on an-coord1001 is OK: OK: Status of the systemd unit monitor_refine_mediawiki_job_events https://wikitech.wikimedia.org/wiki/Analytics/Systems/Managing_systemd_timers [15:23:11] !log execute systemctl reset-failed monitor_refine_mediawiki_job_events after Andrew's comment on alerts@ [15:23:16] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [15:40:24] (03PS15) 10Mforns: Refactor data_quality oozie bundle to fix too many partitions [analytics/refinery] - 10https://gerrit.wikimedia.org/r/547320 (https://phabricator.wikimedia.org/T235486) [15:44:12] hi team, looking at the data loss alarms if no one has [15:44:24] fdans: how dare you [15:45:15] elukey: luca this all hands we'll finally have a duel on the corridors of the hotel [15:45:42] these constant attacks shall not remain unpunished [15:46:05] fdans: I accept with pleasure [15:46:25] oh! I'll be the ref :D [15:47:09] mforns: did you see the update for superset presto and kerberos from github? [15:47:12] :( [15:47:22] elukey, I saw it yesterday, is there sth new? [15:47:46] mforns: nono, but it is kinda troublesome, I didn't find a package to replace pyhive.. [15:47:59] maybe we could try to contact dropbox somehow [15:48:25] hm [15:49:50] the immediate alternative for us is Druid, but it's not as appealing from the dashboarding point of view...
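As a side note on the trailing-slash behaviour joal describes at 14:58, a minimal illustration with placeholder paths:

    # without a trailing slash the directory itself is copied into the destination
    rsync -a /my/src/folder  /my/dst/    # result: /my/dst/folder/...
    # with a trailing slash only the directory's contents are copied
    rsync -a /my/src/folder/ /my/dst/    # result: /my/dst/...

which matches the trailing slashes kept on both source and destination in the production cron quoted at 15:00.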
[15:50:32] oh, actually, if we ingest data quality stats into druid, they could be shown in superset anyway... [15:52:10] mforns, elukey : Hola! i think for the presto case it is worth contacting the superset owners to see if they have contacts/plan/ideas [15:52:20] elukey, already did [15:52:21] nuria: Hola! [15:52:28] hehe [15:52:31] how long should headers be truncated to? 400? [15:52:35] HOLA EVERYBODY [15:52:37] that is what we have as MAX_UA_LENGTH [15:52:48] also, if we want to consider XFF parsing in eventgate to set client_ip [15:52:55] should I just grab the left most one? [15:52:56] always? [15:52:58] ottomata: sounds fine, anything beyond 200 chars is likely automated traffic [15:53:02] or sorry, rightmost one? [15:53:36] ottomata: let me remember what some proxies like googleweblight do [15:54:08] (03PS8) 10Mforns: Add data quality metric: traffic variations per country [analytics/refinery] - 10https://gerrit.wikimedia.org/r/550498 (https://phabricator.wikimedia.org/T234484) [15:54:27] ah left most [15:55:20] and, if that is not set nuria, should I use the connected socket's remote ip addr? [15:56:12] joal: want to see something cool? ssh -L 8080:an-airflow1001.eqiad.wmnet:8778 an-airflow1001.eqiad.wmnet [15:56:15] :D [15:56:46] all team probably would like it as well --^ [15:56:49] COOOL [15:57:11] very cool [16:00:27] I am trying to figure out what user is running those tasks now [16:01:22] ottomata: so no x-forwarded-for to be found on googleweblight, but looks like list starts from left [16:02:03] ya [16:02:04] ottomata: header starts from left that is [16:02:20] i think varnish uses the rightmost non WMF IP address [16:02:33] but, we don't have a good way of getting WMF IPs in eventgate code [16:02:54] that makes sense, as we really want the IP that sent the request to us [16:06:34] elukey: hmm, the script that checks if the data loss is a false positive is getting stuck in [16:06:36] 19/12/12 15:59:50 WARN Utils: Truncated the string representation of a plan since it was too large. This behavior can be adjusted by setting 'spark.debug.maxToStringFields' in SparkEnv.conf. [16:07:02] it's been 7 min like that, not sure if that's normal [16:07:21] never seen this before [16:10:00] elukey: nvm, it's gone through, but I don't remember it taking so long last time, maybe I'm wrong [16:10:17] confirmed false positive [16:11:13] (03PS9) 10Mforns: Add data quality metric: traffic variations per country [analytics/refinery] - 10https://gerrit.wikimedia.org/r/550498 (https://phabricator.wikimedia.org/T234484) [16:13:27] ottomata: ya, the only list of IPs that is any trustworthy is the one maintained by Bryan with addresses on labs [16:13:49] (03CR) 10Milimetric: [C: 04-1] "oh wait, no I tested the wrong branch by accident, you do have one test failure:" [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/556449 (https://phabricator.wikimedia.org/T239625) (owner: 10Lex Nasser) [16:16:58] 10Analytics, 10Analytics-Kanban: Estimate percentage wise the number of requests on mediarequest dataset that are previews - https://phabricator.wikimedia.org/T240362 (10fdans) Added note about percentage in the "Limitations" section of the API docs and added date range to study of signal and noise docs. [16:20:45] (03CR) 10Nuria: [C: 03+1] "I see, this is just a correction of typo of filename, correct?
If so +1 on my end" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/550498 (https://phabricator.wikimedia.org/T234484) (owner: 10Mforns) [16:24:07] ebernhardson: o/ - sorry to keep asking the same question, but I am wondering what user is launching jobs to hadoop from airflow (spark jobs I assume) [16:27:27] I am checking now the airflow dashboard of an-airflow1001 and I can't find anything.. maybe the current DAGs are not doing any hadoop related thing? [16:32:30] elukey: the service is run by the 'airflow' user, the jobs are submitted to the cluster as 'analytics-search' [16:33:45] elukey: currently the dag is in the "off" state, so it won't run automagically. I do test runs with `airflow test ` [16:35:12] (03PS5) 10Mforns: Add Spark job to update data quality table with incoming data [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/549115 (https://phabricator.wikimedia.org/T235486) [16:35:46] and we can make the daemon run as analytics-search if that makes everything significantly easier, it is simply for conceptual reasons that i wanted the service to run as an unrelated user (and in theory that would make it easier for separate airflow instances to be spun up for other teams, maybe) [16:36:21] ebernhardson: ah okok [16:36:48] ebernhardson: where is 'analytics-search' specified? [16:37:58] elukey: arguments to individual tasks (generally inherited from the DAG defaults) [16:38:20] elukey: basically the spark task takes two arguments, principal and keytab, and goes from there [16:38:56] ebernhardson: ok so the airflow user runs the spark task, that then has to read the keytab. [16:39:06] (03CR) 10Mforns: "@Nuria, yes. The last patch set (9) is just replacing data_quality_metrics by data_quality_stats, and I took the opportunity to change the" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/550498 (https://phabricator.wikimedia.org/T234484) (owner: 10Mforns) [16:39:08] elukey: right, the spark task just runs the spark-submit CLI command [16:39:16] (basically the user running the scheduler is the one executing tasks) [16:39:17] (03PS6) 10Mforns: Add Spark job to update data quality table with incoming data [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/549115 (https://phabricator.wikimedia.org/T235486) [16:39:21] elukey: yes [16:40:22] ebernhardson: all right thanks :) will file a change after meetings, I have clearer ideas [16:40:33] ok, thanks! Because my ideas are much less clear atm :) [16:44:58] (03PS2) 10Lex Nasser: Modified external webrequest search engine classification and added tests. [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/556449 (https://phabricator.wikimedia.org/T239625) [16:45:02] (03PS1) 10Lex Nasser: Fix style and correct incorrect test case. [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/556730 (https://phabricator.wikimedia.org/T239625) [17:00:44] ping milimetric [17:00:54] ping fdans [17:01:06] omw [17:04:37] 10Analytics, 10Analytics-Cluster, 10Analytics-Kanban: an-coord1001 hive metastore not listening on ipv6 - https://phabricator.wikimedia.org/T240255 (10elukey) p:05Triage→03Normal a:03elukey [17:04:58] haha elukey you beat me to that by a millisecond [17:09:56] (03PS1) 10Milimetric: Report structured data use for commons [analytics/reportupdater-queries] - 10https://gerrit.wikimedia.org/r/556741 (https://phabricator.wikimedia.org/T239565) [18:25:45] ebernhardson: an-airflow1001 kerberized [18:31:11] elukey: awsome! 
thanks [18:32:03] ebernhardson: so the idea is that user 'airflow' is now able to read the airflow.keytab file, that contains credentials for the 'analytics-search' principal [18:33:12] we'll maybe tune it in the future, but let's see if this works first [18:33:21] what do you think? [18:33:32] elukey: ahh, ok that makes sense. I'll put up a patch and try it (can it auth yet? or do i need to wait for monday) [18:34:20] ebernhardson: it can auth now but probably best to test it on monday after kerberos is up, since spark may be confused if you ask it to authenticate on a non-secured cluster [18:34:46] alright, i'll just prep and deploy them, but wont expect it to work yet [18:35:11] ack [18:38:44] ottomata, milimetric: https://github.com/jobar/hdfs-tools [18:38:49] gone for diner, back after [18:43:01] (03CR) 10Nuria: Report structured data use for commons (033 comments) [analytics/reportupdater-queries] - 10https://gerrit.wikimedia.org/r/556741 (https://phabricator.wikimedia.org/T239565) (owner: 10Milimetric) [18:45:32] (03CR) 10Nuria: "code looks good, before merging it we should throughly tested it on the cluster with 1 hour of data using a jar build with this code on th" (031 comment) [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/556730 (https://phabricator.wikimedia.org/T239625) (owner: 10Lex Nasser) [18:46:42] !log rsync timers deployed on labstore100[6,7] [18:46:44] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [18:46:47] \o/ [18:49:44] (03CR) 10Nuria: [C: 03+1] "Looks good, +2 when we have merged" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/550498 (https://phabricator.wikimedia.org/T234484) (owner: 10Mforns) [18:50:10] (03CR) 10Nuria: [C: 03+1] "Sorry , +2 when we have tested it" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/550498 (https://phabricator.wikimedia.org/T234484) (owner: 10Mforns) [18:59:18] 10Analytics: Archive /home/ezachte data on stat1007 - https://phabricator.wikimedia.org/T238243 (10elukey) @Erik_Zachte Hi! Gentle ping to see if you have time to review the files during the next days :) [19:00:19] all right timers on labstore are working [19:00:21] gooooood [19:00:26] going to dinner :) [19:36:47] hurray for timers on labstore :) [19:45:11] joal: o/ [19:45:14] hi [19:45:30] if you create a branch in github at your initial commit [19:45:35] then you can create a PR against it [19:45:40] and we can use that for review [19:45:45] (otherwise I don't know how to leave comments! :) ) [19:45:50] Ah! [19:45:57] makes sense [19:48:58] ottomata: https://github.com/jobar/hdfs-tools/pull/1 [19:49:27] joal: i thikn you want to make the branch at https://github.com/jobar/hdfs-tools/commit/eed7a7cff21ffd1ad1649fdf3eb8ea24d614b602 [19:49:48] hm [19:49:57] if you make it there, you can make a PR from master to it [19:50:08] and all the commits in master will be part of the PR, and will be able to review them [19:50:23] lke this [19:50:23] https://github.com/wikimedia/eventgate/pull/1 [19:51:45] ok - will merge the current PR then push a branch from 1st commit [19:51:58] k [19:52:06] joal: i think i don't get "We need to differentiate those cases as copying a folder also copies its content, [19:52:06] * and we don't want that when doing recursion into folders." [19:52:11] in createOrCopy new [19:52:28] copying a folder also copies its content? [19:52:33] ottomata: yessir [19:52:38] not sure i understand what that means [19:52:51] ottomata: batcave? 
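On the Airflow/Kerberos thread above (16:38 and 18:32): the pattern described is the scheduler process (running as the 'airflow' user) shelling out to spark-submit with an explicit principal and keytab, so the job itself runs on YARN as analytics-search. A rough sketch of such an invocation follows; the realm, paths and script name are placeholders, not the actual configuration:

    spark-submit \
        --master yarn \
        --principal analytics-search@EXAMPLE.REALM \
        --keytab /path/to/airflow.keytab \
        /path/to/some_job.py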
[19:52:53] k [20:12:37] ottomata: https://github.com/jobar/hdfs-tools/pull/2 [20:12:51] ottomata: please note the comments at the top of the HdfsRsyncExec file :) [20:13:23] Nice [20:20:44] joal: i did not know rsync had --chmod=CHMOD flag! [20:21:11] ottomata: it is one of the required features to mimic the current rsync :) [20:23:54] joal: do we use --filter? or just --exclude [20:23:55] ? [20:24:17] ottomata: any :) exclude is an alias for filter - X [20:24:23] huh. [20:24:55] The way it works is it stacks the rules, and applies them in order for every file, the first match is the good one: inclide or exclude - No match means include [20:29:24] joal, does dst have to be an existing folder? or does its parent just have to exist? [20:29:27] i often do [20:29:49] cd mystuff; rsync -av ./ remote.host.wmnet:~/mystuff/ [20:29:52] ottomata: dst needs to exist I think [20:30:02] remote.host.wment:~/mystuff/ might not exist [20:30:03] but will be created [20:30:11] ah? [20:30:25] yeah, i think it creates the dst dir if it doesn't exist [20:30:27] I fail in that case... [20:30:30] i think it will fail of the full path doesn't exist [20:30:42] if dst is A/B/C [20:30:47] and A or B don't exist, it will fail [20:30:51] but it will create C [20:31:14] that is nice for first rsyncs; kinda sucks to have to create a directory before your first copy [20:32:28] 10Analytics: Add mediarequests dataset to druid (just some dimensions) - https://phabricator.wikimedia.org/T240613 (10Nuria) [20:34:12] ottomata: later feature? [20:34:24] k [20:34:44] hm joal that means we'll have to puppetize dir creation, (or create dir in script), right? [20:34:57] hm [20:35:36] joal i'm not sure i understand the mlutiple chmod command parser thing [20:35:44] huhu [20:35:45] i get that you might have 2, one for F one for D [20:35:49] or you might have just one for both [20:35:55] but why do you need fold them all together? [20:35:59] ottomata: correct - No prefix means both [20:36:17] ottomata: I loop over the list once [20:36:40] I wondered about looping multiple times - more readable I guess? [20:37:31] i thikn you loop over the list twice? once for files and once for dirs? [20:38:20] ottomata: 3 passes - 1 for validation, 1 for files ChmodParser creation, 1 for dirs ChmodParser creation [20:38:47] would it be better to just validate the number of commands and do it one way or th eother [20:39:01] you should either be given: 1 chmodCommand with NO prefix [20:39:15] or, 1 or 2 commands with prefixes [20:39:16] right? [20:39:27] seems weird to provide one command with prefix and one without? [20:40:03] ottomata: you can do: go-wx D+x [20:40:33] for instance - can be written differently though [20:40:43] ya but why would you want to allow that? [20:40:58] (03PS2) 10Milimetric: Report structured data use for commons [analytics/reportupdater-queries] - 10https://gerrit.wikimedia.org/r/556741 (https://phabricator.wikimedia.org/T239565) [20:41:00] (03CR) 10Milimetric: Report structured data use for commons (033 comments) [analytics/reportupdater-queries] - 10https://gerrit.wikimedia.org/r/556741 (https://phabricator.wikimedia.org/T239565) (owner: 10Milimetric) [20:41:11] hmmm [20:41:13] ottomata: multi-chmod command for symbolic is by default [20:41:14] oh i think i see... [20:41:27] it actually works in chmod :) [20:41:27] that is crazy stuff [20:41:36] D F don't do they? [20:41:38] that's an rsync thing, no? [20:41:41] yessir [20:41:52] so multi-command, over 2 slots ... 
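For readers not familiar with the rsync option being mimicked here: plain chmod takes comma-separated symbolic clauses but has no notion of "directories only" vs "files only", while rsync's --chmod adds the D/F prefixes joal refers to, so a single option can carry several clauses over the two "slots". Roughly:

    # plain chmod: comma-separated clauses, applied to whatever you name
    chmod go-w,u+x somefile
    # rsync --chmod: unprefixed clauses apply to files and dirs,
    # D... to directories only, F... to files only (joal's "go-wx D+x" example)
    rsync -rt --chmod=go-wx,D+x src/ dst/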
[20:41:58] ⊙ω⊙ [20:42:06] mwahahaha :) [20:42:23] wait [20:42:25] no not in chmod [20:42:29] you can't prefix D in chmod, can you? [20:42:32] just in rsync. [20:42:33] right? [20:42:37] nope - But in rsync you can [20:42:39] ah ok [20:42:40] yes [20:42:48] phew thought i was a bonkers man [20:42:53] ok [20:43:32] geez ok [20:44:20] ottomata: I alwas wanted to learn about rsync - Well now I'm kinda ok :) [20:53:05] joal: added some comments :) [20:53:29] 10Analytics, 10Analytics-Wikistats: Create English strings json for vue-i18n to use - https://phabricator.wikimedia.org/T240617 (10fdans) [20:55:38] 10Analytics, 10Analytics-Wikistats: Include locale string jsons as webpack chunks so that only the required language is bundled - https://phabricator.wikimedia.org/T240618 (10fdans) [20:58:20] 10Analytics, 10Analytics-Wikistats: Add stats.wikimedia.org/v2 as a TranslateWiki project - https://phabricator.wikimedia.org/T240621 (10fdans) [20:58:48] nuria: just tasked i18n as requested: https://phabricator.wikimedia.org/T238752 [21:19:33] joal: FYI one of the tests is failing for me [21:19:41] meeh :( [21:20:00] - should copy src to dst recursively with size-only and not copy existing *** FAILED *** [21:20:00] 2 did not equal 3 (TestHdfsRsyncExec.scala:160) [21:20:48] :( [21:21:01] all works for me (intellij +- maven [21:28:11] i'm just doing mvn package on CLI hm [21:28:24] (03PS1) 10Milimetric: Migrate to new dashiki instances [analytics/dashiki] - 10https://gerrit.wikimedia.org/r/556818 (https://phabricator.wikimedia.org/T236586) [21:28:32] (03CR) 10Milimetric: [V: 03+2 C: 03+2] Migrate to new dashiki instances [analytics/dashiki] - 10https://gerrit.wikimedia.org/r/556818 (https://phabricator.wikimedia.org/T236586) (owner: 10Milimetric) [21:29:43] hm joal i guess we should install hdfs-tools.jar into its on directory? not in default hadoop classpath usr/lib/hadoop, because of the shaded jar? [21:30:22] It so so ottomata - No haddop def present, and only scala related stuff, but who knows [21:30:37] yeah, but if we run spark [21:30:40] i think it will load the hadoop CP [21:30:53] yup [21:31:03] might get conflicting scala version if we upgrade spark or something [21:35:41] 10Analytics, 10Analytics-Kanban: Dashiki: Read multiple wikis from single file - https://phabricator.wikimedia.org/T236941 (10Milimetric) @srishakatux: just a ping that this is done. I still want to update the Dashiki docs which are in a very sad state, but before I get to that. To use the feature, you have... [21:37:37] 10Analytics, 10Analytics-Kanban, 10Cloud-VPS (Debian Jessie Deprecation), 10Patch-For-Review: "dashiki" Cloud VPS project jessie deprecation - https://phabricator.wikimedia.org/T236586 (10Milimetric) This is done, updated docs and deployment code, deleting the instances now. [21:40:19] ottomata: I answered to the comments and pushed a new patch with new stuff for FilterRule [21:40:59] looking! [21:41:45] joal: why dropWhile instead of filter? [21:41:54] where? [21:42:01] iiuc dropWhile stops dropping at first non-match? [21:42:08] Yes [21:42:56] We drop first char if it is F or D, or We keep the rest [21:42:57] OH you are just removing it for parsing [21:42:58] GOT It [21:43:00] :) [21:43:04] not for filtering for the type [21:43:25] still, a little weird to use drop while, no? [21:43:28] ottomata: Yes, we already know the type since it is filtered [21:43:28] that will allow someone to do [21:43:35] DDDDDDDDDDo+w [21:43:35] ? 
[21:43:54] Nope, fails regexp - But would be processed correctly indeed [21:43:57] oh [21:44:21] ottomata: this function is about preparing config internals, not validation [21:44:28] why not just mod.tial [21:44:30] mod.tail [21:44:30] ? [21:44:35] oh [21:44:36] or [21:44:39] huhu [21:44:42] if startWith D or F [21:44:44] mod.tail [21:44:44] ? [21:44:52] Would work the same [21:45:07] i think it would be more readable, is strange to loop over the string with a while to drop the first character [21:45:07] if you prefer :) [21:46:14] hm also, don't you always need to drop the first char if F or D, even if not acc.isEmpty? [21:46:19] ottomata: I like the dropWhile syntax, concise - but can change :) [21:46:50] good catch !!! [21:46:56] milimetric: do you have a couple minutes on the bc? I want to run something by you as an i18n sanity check :) [21:47:09] ofc fdans, omw cave [21:48:16] (03CR) 10Nuria: "Thanks for taking care of this." [analytics/dashiki] - 10https://gerrit.wikimedia.org/r/556818 (https://phabricator.wikimedia.org/T236586) (owner: 10Milimetric) [21:48:37] ottomata: https://gist.github.com/jobar/9a2fb8b0a3ba622f40bfbc06dcd7f2c9 [21:50:42] 10Analytics, 10Analytics-Kanban: Dashiki: Read multiple wikis from single file - https://phabricator.wikimedia.org/T236941 (10Nuria) @srishakatux might not need this cause she is not using the vital-signs layout [21:51:07] lgtm joal, much more readable, maybe just add a comment about removing the rsync specific F or D qualifier to make it compatible with normal chmod arg [21:51:36] yup [22:02:00] (03CR) 10Nuria: Report structured data use for commons (031 comment) [analytics/reportupdater-queries] - 10https://gerrit.wikimedia.org/r/556741 (https://phabricator.wikimedia.org/T239565) (owner: 10Milimetric) [22:08:31] ottomata: can i get +2 at eventgate depot? [22:10:00] nuria: i dont [22:10:04] i don't seem to have powers to change [22:10:09] ottomata: k [22:10:26] ottomata: also, are tests run with npm test? or is there anything else? [22:10:31] just npm test [22:12:46] (03CR) 10Nuria: "This change should probably be a new patch on https://gerrit.wikimedia.org/r/#/c/analytics/refinery/source/+/556449/" [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/556730 (https://phabricator.wikimedia.org/T239625) (owner: 10Lex Nasser) [22:14:52] nuria: dan showed me how to make it a new patch, will fix later today [22:15:04] lexnasser: sounds good, no rush [22:29:40] 10Analytics, 10Cloud-Services, 10Developer-Advocacy (Oct-Dec 2019): Further improvements to the WMCS edits dashboard - https://phabricator.wikimedia.org/T240040 (10srishakatux) Here is an update on suggested improvements: * In the [[ https://wmcs-edits.wmflabs.org/#wmcs-edits/wmcs-edits-tabular-view | Tabula... [22:30:28] 10Analytics, 10Cloud-Services, 10Developer-Advocacy (Oct-Dec 2019): Further improvements to the WMCS edits dashboard - https://phabricator.wikimedia.org/T240040 (10srishakatux) And, changes can be seen here for testing https://wmcs-edits.wmflabs.org. [22:32:20] ottomata: I have a final version I think :) [22:35:18] ya? [22:36:59] joal: transferTree? [22:37:12] transferRootTree ) [22:38:09] that's just to use glob or to use listStatus? [22:38:17] if source had /* in it? [22:39:08] used for filter-rules [22:40:20] ah [22:42:52] hmm, joal i have a hard time understanding how that is used... [22:44:38] transferTreeRoot is supposed to be the root of the src independent of any globs in src path?
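The treeRoot/srcBasePath question ties back to how rsync itself evaluates filters: rules are checked in the order given for every path below the transfer root, the first matching rule decides include vs exclude, and a path matching no rule is included (joal's summary at 20:24). A small illustrative example of that ordering, with placeholder paths:

    # keep only .gz files: directories are included so recursion can continue,
    # *.gz matches before the catch-all rule, everything else hits --exclude='*'
    rsync -rt --include='*/' --include='*.gz' --exclude='*' src/ dst/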
[22:45:15] ottomata: it needs to be computed per file at tree-root only - BUG ! [22:45:22] i think the word 'transfer' is confusing me [22:45:29] treeRoot? [22:45:47] ottomata: treeRoot can be thought of as / [22:45:57] is it always from source path? [22:45:59] here we talk about the / in the context of the transfer [22:46:03] it is used in filter match [22:46:06] but not computed from it, right? [22:46:15] maybe a better name: [22:46:18] srcBasePath [22:46:19] ? [22:46:24] ottomata: used in src and dst (we don't delete extraneous excluded) [22:46:37] aye ok, but it means the same? [22:46:46] dst just doesn't have a glob [22:46:46] so [22:46:48] But no need to pass it or dst as it doesn't change [22:46:50] yup [22:46:51] it will be just dest? [22:46:54] correct [22:47:14] hm [22:47:31] i guess treeRoot is ok, if it is explained what it means a little bit. [22:47:34] srcTreeRoot [22:47:34] ? [22:47:52] you need to pass that around just for the filter matching? [22:47:55] is that right? [22:47:56] works for me (basePath doesn't sound bad either) [22:48:02] yup [22:48:04] i like base path better [22:48:06] srcBasePath [22:48:09] srcBasePath [22:48:19] yeah, esp. since you have srcPath [22:48:24] makes sense to get the 'base path' out of it [22:49:30] joal i gotta go pretty soon! i have a rudimentary deb ready to just deploy a jar file [22:49:38] dunno if we need a little wrapper or not [22:49:47] Great - doing some more testing [22:49:52] k cool [22:50:02] wrapper would be good ottomata I think [22:50:14] java -cp /home/joal/code/hdfs-tools/target/hdfs-tools-0.0.1-SNAPSHOT.jar:/usr/lib/spark2/jars/*:$(/usr/bin/hadoop classpath) org.wikimedia.analytics.hdfstools.HdfsRsyncCLI [22:50:23] Not so nice ottomata --^ [22:52:46] aye k [22:53:34] joal cool, will do tomorrow! hopefully we can even deploy it tomorrow yesss and puppetize [22:53:36] let'sg OoOOoO [22:53:42] \o/ [22:53:46] Thanks ottomata [22:54:12] laterrrss! [23:01:59] (03CR) 10Nuria: "Virtual +2 if we have tested the job. Seems quite straight forward, thanks for changing the name of the class" [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/549115 (https://phabricator.wikimedia.org/T235486) (owner: 10Mforns) [23:03:42] (03CR) 10Nuria: "Given that the last patch should only be a name change, virtual +2 if we have tested the job" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/547320 (https://phabricator.wikimedia.org/T235486) (owner: 10Mforns) [23:06:13] 10Analytics, 10Analytics-Kanban: Dashiki: Read multiple wikis from single file - https://phabricator.wikimedia.org/T236941 (10srishakatux) Thanks @Milimetric for sharing the updates! Currently, we are using the `tabs` layout and if/when we plan to use the `metrics-by-layout` these changes will be helpful. Ma... [23:17:12] 10Analytics, 10Operations, 10decommission, 10ops-eqiad: Decommission analytics100[1,2] - https://phabricator.wikimedia.org/T205507 (10Jclark-ctr) a:05Cmjohnson→03Jclark-ctr
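A possible shape for the little wrapper discussed at 22:49-22:50, built around joal's java -cp command above; the installed jar location is an assumption (the chat only floats installing the jar in its own directory rather than on the default hadoop classpath), and the rest simply mirrors the quoted invocation:

    #!/bin/bash
    # run the HdfsRsync CLI with the spark scala and hadoop jars on the classpath,
    # passing all arguments through to the tool
    HDFS_TOOLS_JAR="${HDFS_TOOLS_JAR:-/usr/lib/hdfs-tools/hdfs-tools.jar}"
    exec java -cp "${HDFS_TOOLS_JAR}:/usr/lib/spark2/jars/*:$(/usr/bin/hadoop classpath)" \
        org.wikimedia.analytics.hdfstools.HdfsRsyncCLI "$@"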