[00:17:07] (03PS3) 10Nuria: Add new search engines test cases [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/423336 (https://phabricator.wikimedia.org/T191714) (owner: 10Jonas Kress (WMDE)) [00:18:54] (03CR) 10jerkins-bot: [V: 04-1] Add new search engines test cases [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/423336 (https://phabricator.wikimedia.org/T191714) (owner: 10Jonas Kress (WMDE)) [06:46:29] joal: o/ [06:46:54] from the logs I can see that refinery has been deployed [06:47:42] so when you are online I think we can restart namenodes and check trash! [06:48:02] (confirmed that my change is live on an1003, just to be sure) [06:48:35] in the meantime, I am going to bounce the Yarn Resource Manager daemons to let them pick up the new prometheus config [06:53:44] done! [07:20:43] Hi elukey - kids day today [07:20:52] I'll have time later in the afternoon [07:22:22] joal: ahahha I need to write it on my arm so I'll be able to remember it, sorry :) [07:22:48] I definitely prefer you/andrew around when enabling the feature [07:22:50] no rush! [07:23:05] np elukey [07:23:53] elukey: we went through some issues yesterday, still one issue to be fixed, but globally ok [07:23:59] later :) [07:25:15] laterrr [07:26:59] RECOVERY - Kafka MirrorMaker main-eqiad_to_jumbo-eqiad max lag in last 10 minutes on einsteinium is OK: OK - scalar(max(max_over_time(kafka_burrow_partition_lag{group=kafka-mirror-main-eqiad_to_jumbo-eqiad,topic!.*(change-prop https://grafana.wikimedia.org/dashboard/db/kafka-mirrormaker?var-datasource=eqiad%2520prometheus%252Fops&var-mirror_name=main-eqiad_to_jumbo-eqiad [07:27:26] this is me fixing the alert --^ [10:25:35] going afk for lunch + errand! [10:53:44] (03PS1) 10Joal: Correct wikidata articleplaceholder job [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/425511 [12:49:45] mforns: o/ [12:58:22] 10Analytics, 10ChangeProp, 10EventBus, 10MediaWiki-JobQueue, 10Services (next): Enable CP4JQ support for private wikis - https://phabricator.wikimedia.org/T191464#4123032 (10Pchelolo) Technically nothing prevents us from just enabling the new JobQueue for private wikis. Note that ChangeProp and RESTBas... [13:08:10] o/ [13:08:14] o/ [13:10:52] ottomata: I am going to restart the analytics kafka brokers for openjdk upgrades (but not mm on them), and then I'll do jumbo [13:11:27] k [13:15:36] ottomata: there is a console-consumer on kafka1012 under your username that needs a restart as well :P [13:15:45] (joking, whenever you have time) [13:17:24] 10Analytics, 10ChangeProp, 10EventBus, 10MediaWiki-JobQueue, 10Services (next): Enable CP4JQ support for private wikis - https://phabricator.wikimedia.org/T191464#4123056 (10Ottomata) Hm, currently the data we import into Hadoop is readable by anyone with a Hadoop account (not just analytics-privatedata-... [13:18:33] console-consumer! [13:18:33] 10Analytics, 10ChangeProp, 10EventBus, 10MediaWiki-JobQueue, 10Services (next): Enable CP4JQ support for private wikis - https://phabricator.wikimedia.org/T191464#4123058 (10Ottomata) Hm, however, we’re trying to make internal ‘private’ cross DC data all go over TLS. If we do this, we would want to have... [13:18:34] looking [13:18:48] ah! now that I think about it, I'd need to restart all the kafka brokers for the conf1001->conf1004 switch [13:19:13] weird... [13:19:28] that just looks stuck! i don't have any screens. killing [13:19:38] elukey: do you have a couple mins in batcavity? [13:20:35] fdans: a bit busy now, would it be ok in ~20/30 mins? [13:21:02] yes yes, whenever you are available, I'm with other stuff :) [13:21:04] thank you elukey [13:23:57] (03CR) 10Ottomata: Update spark jobs jar and correct assembly path (031 comment) [analytics/refinery] - 10https://gerrit.wikimedia.org/r/425441 (owner: 10Joal) [13:50:46] 10Analytics: Clickstream dataset for Persian Wikipedia only includes external values - https://phabricator.wikimedia.org/T191964#4123143 (10DarTar) [13:51:11] 10Analytics: Clickstream dataset for Persian Wikipedia only includes external values - https://phabricator.wikimedia.org/T191964#4122737 (10DarTar) Thanks for reporting this @Ladsgroup. I added #Analytics since the data is maintained by Joseph et al. [14:09:34] fdans: sorryyy [14:10:05] heey elukey don't worry, not blocking! [14:21:09] elukey, hellooo, what's up? [14:22:34] mforns: hola! I was wondering if you wanted to review/discuss with me the code review to apply the yaml whitelist to db1108 [14:23:49] elukey, yeahhhh [14:23:52] bc? [14:25:21] mforns: need to chat with fran before! (I am lagging a bit :P) [14:25:57] elukey, np, I need to charge my headphones [14:26:14] ping me whenever :] [14:26:16] fdans: bc? [14:26:32] yep! [14:26:47] mforns: in the meantime, if you want to check https://gerrit.wikimedia.org/r/#/c/425498/ [14:27:03] on it [14:44:09] yargh joal, hm, some of the refine jobs are failling because of...i'm not totally sure why now, but because of empty arrays in source data? [14:44:16] which parquet doesn't allow? [14:50:39] (03PS1) 10Amitjoki: Add Vue Filters to make the code clean and use them as necessary for casing [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/425541 (https://phabricator.wikimedia.org/T191824) [14:53:40] 10Analytics, 10Analytics-Wikistats, 10Patch-For-Review: Add Vue Filters to make the code clean and use them as necessary - https://phabricator.wikimedia.org/T191824#4123382 (10Amitjoki) Here's a quick preview of how the change affects the UI: https://www.useloom.com/share/e840f13d82f5478094fe656d8979a974 [14:58:30] heya :) [14:59:10] heeeey [15:00:26] mforns: deploying the change to db1108! [15:00:41] db1107 will not change as expected [15:01:43] mforns: if you want we can chat in bc now! [15:04:16] elukey, yeeeeeees [15:04:19] omw [15:11:08] 10Analytics, 10Analytics-Wikistats: The popup in the line chart is obstructive - https://phabricator.wikimedia.org/T191985#4123443 (10Amitjoki) [15:11:52] 10Analytics, 10Analytics-Wikistats: The popup in the line chart is obstructive - https://phabricator.wikimedia.org/T191985#4123469 (10Amitjoki) [15:21:53] 10Analytics, 10ChangeProp, 10EventBus, 10MediaWiki-JobQueue, 10Services (next): Enable CP4JQ support for private wikis - https://phabricator.wikimedia.org/T191464#4123511 (10Pchelolo) > Hm, however, we’re trying to make internal ‘private’ cross DC data all go over TLS. If we do this, we would want to hav... [15:23:12] 10Analytics, 10ChangeProp, 10EventBus, 10MediaWiki-JobQueue, 10Services (next): Enable CP4JQ support for private wikis - https://phabricator.wikimedia.org/T191464#4123522 (10Ottomata) Timeline for upgrading main is Q4, but MirrorMaker +TLS wasn't in the plan. I don't think we should block your work thou... [15:23:49] 10Analytics, 10Analytics-Wikistats: The popup in the line chart is obstructive - https://phabricator.wikimedia.org/T191985#4123526 (10Amitjoki) [15:24:52] 10Analytics, 10ChangeProp, 10EventBus, 10MediaWiki-JobQueue, 10Services (next): Enable CP4JQ support for private wikis - https://phabricator.wikimedia.org/T191464#4123528 (10Pchelolo) Ok, in that case, are you ok with enabling it in very near future and adding a mental note that this increases the priori... [15:28:51] ottomata: i can help look at refine issues [15:28:58] ottomata: are you in da cave? [15:29:32] nuria_: about to go for ops sync [15:29:39] ottomata: k [15:29:51] i figured out a work around, but i don't know why it used to work in spark1 but now doesn't [15:30:02] problem is similar to https://issues.apache.org/jira/browse/HIVE-11625 [15:30:09] but with empty arrays instead of null map keys [15:30:16] the error is raised from hive's parquet code [15:30:27] but the versions of that look to be the same in spark 1.6 [15:30:43] work around: use spark's parquet writer to write hte file [15:30:50] then msck repair table (or add partition) manually [15:30:53] instead of going through spark-hive [15:30:53] (03CR) 10Nuria: [C: 032] Correct wikidata articleplaceholder job [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/425511 (owner: 10Joal) [15:38:23] (03PS2) 10Mforns: Make the graphData updateable when using table-chart [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/424765 (https://phabricator.wikimedia.org/T191661) (owner: 10Amitjoki) [15:41:22] (03CR) 10Mforns: Make the graphData updateable when using table-chart (031 comment) [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/424765 (https://phabricator.wikimedia.org/T191661) (owner: 10Amitjoki) [15:51:52] 10Analytics-Kanban, 10Analytics-Wikistats, 10Patch-For-Review: Wikistat Beta: expand topic explorer by default - https://phabricator.wikimedia.org/T186335#4123654 (10mforns) [15:53:10] Hey ottomata [15:53:18] ottomata: just saw your message [15:53:25] ottomata: l'ets brainstorm post-standup [15:53:45] 10Analytics-Kanban, 10Analytics-Wikistats, 10Patch-For-Review: Add Vue Filters to make the code clean and use them as necessary - https://phabricator.wikimedia.org/T191824#4123674 (10mforns) [15:55:18] 10Analytics-Kanban, 10Analytics-Wikistats, 10Patch-For-Review: Limit pan in Wikistats2 maps - https://phabricator.wikimedia.org/T189195#4123689 (10mforns) [15:56:39] (03CR) 10Amitjoki: [C: 031] ">" (031 comment) [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/424765 (https://phabricator.wikimedia.org/T191661) (owner: 10Amitjoki) [15:57:16] 10Analytics-Kanban, 10Analytics-Wikistats, 10Easy, 10Patch-For-Review: [Wikistats2] The detail page for tops and maps metrics does not indicate time range - https://phabricator.wikimedia.org/T182990#4123698 (10mforns) [15:58:22] (03CR) 10jerkins-bot: [V: 04-1] Make the graphData updateable when using table-chart [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/424765 (https://phabricator.wikimedia.org/T191661) (owner: 10Amitjoki) [15:58:27] 10Analytics-Kanban, 10Analytics-Wikistats, 10Patch-For-Review: Display of radio buttons in Wikistats 2 is somewhat confusing - https://phabricator.wikimedia.org/T183185#4123707 (10mforns) [15:58:52] 10Analytics, 10Analytics-Wikistats, 10Patch-For-Review: Display of radio buttons in Wikistats 2 is somewhat confusing - https://phabricator.wikimedia.org/T183185#3846152 (10mforns) [16:00:18] ping mforns [16:00:24] trying to join! [16:26:09] 10Analytics, 10Analytics-Kanban, 10Analytics-Wikistats, 10Patch-For-Review: Wikistats2 GraphPanel computeds and watchers do not update as expected when using table-chart. - https://phabricator.wikimedia.org/T191661#4123869 (10mforns) a:03Amitjoki [16:44:27] all right failing over the name node to an1002 [16:45:09] restarting namenode on an1001 [16:45:16] hanks elukey [16:48:24] 1001 back as master, 1002 restarting [16:48:43] !log restart hadoop namenodes to pick up HDFS trash settings [16:48:44] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [16:49:24] wow elukey - That was fast :) [16:49:43] (03CR) 10Milimetric: "I'm glad there's now a hover on bar charts. I think a couple of things are needed for consistency:" [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/423904 (https://phabricator.wikimedia.org/T188277) (owner: 10Fdans) [16:50:35] joal: trash should be enabled :) [16:51:24] elukey: I'm gonna try, give me a minute [16:52:15] 18/04/11 16:52:09 INFO fs.TrashPolicyDefault: Moved: 'hdfs://analytics-hadoop/user/elukey/test' to trash at: hdfs://analytics-hadoop/user/elukey/.Trash/Current/user/elukey/test [16:52:19] \o/ [16:52:26] (03CR) 10Milimetric: "Oh, and when you hover over one bar out of a group of bars when you're splitting, the popup only shows the bar you're hovering, but it sho" [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/423904 (https://phabricator.wikimedia.org/T188277) (owner: 10Fdans) [16:53:44] tried also to get the file in the trash [16:53:49] on local fs, all good [16:53:54] it seems working fine! [16:54:19] I'll check tomorrow the checkpoint creation [17:01:54] nuria_, you're not coming to the meeting with Lucas right? [17:01:54] nuria_: not at meeting with Luca? [17:02:19] joal: joining in a sec [17:09:30] 10Analytics, 10Analytics-Wikistats: The popup in the line chart is obstructive - https://phabricator.wikimedia.org/T191985#4123443 (10sahil505) Hey @Amitjoki, I believe this task somewhat overlaps with T187429. Resolving point 3 of T187429 will automatically resolve this. Correct me if I'm wrong. [17:19:05] 10Quarry: Recurring queries - https://phabricator.wikimedia.org/T101835#4124081 (10Framawiki) It can be great to have dedicate runners for scheduled queries, rather than interfere with end-user performance. [17:25:39] 10Analytics, 10ChangeProp, 10EventBus, 10MediaWiki-JobQueue, 10Services (blocked): Disable MirrorMaker for job queue events - https://phabricator.wikimedia.org/T192005#4124128 (10Pchelolo) p:05Triage>03Normal [17:43:16] (03PS1) 10Joal: Correct Mediawiki-history denormalize job [analytics/refinery] - 10https://gerrit.wikimedia.org/r/425569 [18:00:08] (03PS1) 10Ottomata: DataFrameToHive - Use DataFrame .write.parquet instead of .insertInto [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/425578 (https://phabricator.wikimedia.org/T159962) [18:00:17] joal: ^ [18:00:20] reading ottomata [18:02:30] (03PS2) 10Ottomata: DataFrameToHive - Use DataFrame .write.parquet instead of .insertInto [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/425578 (https://phabricator.wikimedia.org/T159962) [18:05:10] (03CR) 10Joal: [V: 031] "Tested on cluster." [analytics/refinery] - 10https://gerrit.wikimedia.org/r/425569 (owner: 10Joal) [18:05:21] (03CR) 10Joal: [C: 031] "LGTM :) I'm assume you've tested it :)" [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/425578 (https://phabricator.wikimedia.org/T159962) (owner: 10Ottomata) [18:08:24] * elukey off! [18:08:38] Bye elukey [18:10:26] ottomata: do we deploy now?b [18:13:06] joal: i want to maybe see if we can make a couple more of those things more efficient, [18:13:11] especially that infer data type thing [18:13:27] ottomata: ok [18:13:57] ottomata: Something about he df.head --> We could do count, it would prevent doing it somewhere else [18:14:19] And, cache the df before counting, like that, we reuse from RAM if feasible [18:14:50] ottomata: About data inference, I really think it should be removed and provided via conf (in a properties file :-P [18:15:02] I'm going to have diner, will be back for deploy after [18:20:11] hmm, aye [18:25:16] 10Analytics, 10Analytics-Wikistats: Adding ranks to the map tooltip - https://phabricator.wikimedia.org/T191141#4124405 (10Nuria) >Usability will improve a lot, I think. Would love to know what you think of this :) I disagree that this is needed, rank is really not that helpful "as is" now. To be so metrics n... [18:29:14] (03CR) 10Nuria: Add Vue Filters to make the code clean and use them as necessary for casing (033 comments) [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/425541 (https://phabricator.wikimedia.org/T191824) (owner: 10Amitjoki) [18:43:22] (03PS3) 10Mforns: Make the graphData updateable when using table-chart [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/424765 (https://phabricator.wikimedia.org/T191661) (owner: 10Amitjoki) [18:47:38] (03CR) 10Mforns: [C: 032] Make the graphData updateable when using table-chart [analytics/wikistats2] - 10https://gerrit.wikimedia.org/r/424765 (https://phabricator.wikimedia.org/T191661) (owner: 10Amitjoki) [19:07:03] 10Analytics, 10Analytics-Wikistats, 10Accessibility, 10Easy, 10Patch-For-Review: Wikistats Beta: Fix accessibility/markup issues of Wikistats 2.0 - https://phabricator.wikimedia.org/T185533#4124589 (10Volker_E) @MCornacchio The wikiselector search is wrapped in a `h5` currently, which is superfluos seman... [19:09:18] 10Analytics, 10ChangeProp, 10EventBus, 10MediaWiki-JobQueue, and 2 others: Enable CP4JQ support for private wikis - https://phabricator.wikimedia.org/T191464#4124595 (10mobrovac) For posterity, in today's JobQueue biweekly meeting we agreed that having unencrypted mirroring of private wikis' data is not ac... [19:09:42] 10Analytics, 10ChangeProp, 10EventBus, 10MediaWiki-JobQueue, and 2 others: Enable CP4JQ support for private wikis - https://phabricator.wikimedia.org/T191464#4124598 (10mobrovac) [19:39:33] (03PS1) 10Ottomata: RefineTarget - Use Hadoop FS to infer input format rather than Spark [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/425597 (https://phabricator.wikimedia.org/T159962) [20:13:11] (03CR) 10Joal: "A bit hacky, but would work in most cases I assume :)" (031 comment) [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/425597 (https://phabricator.wikimedia.org/T159962) (owner: 10Ottomata) [20:17:26] ottomata: if you agree, I'll deploy those things tomorrow morning (except if you want to do it tonight) [20:17:35] With yesterday late night, I'm gonna stop soon [20:18:30] 10Analytics, 10ChangeProp, 10EventBus, 10Services (done): Support reliable delayed job execution in ChangeProp - https://phabricator.wikimedia.org/T186261#4124795 (10Pchelolo) 05Open>03Resolved [20:21:19] joal: i think there isn't a hurry [20:21:21] let's do it tomorrow [20:21:27] i can replace the jar that refine is usign (just did) [20:21:31] since that is temp puppetized anyway [20:21:38] i'm re running the failed stuff now [20:22:15] ok ottomata - I'd like to fix the wikidata job - So deploy tomorrow would be better for me [20:25:10] ya let's wait [20:25:11] no hurry [20:26:50] ottomata: wait for refine you mean? [20:27:01] meaning, I'll deploy without refine patches [20:34:07] ok - I'm gone for tonight team, see you tomorrow ! [20:40:35] 10Analytics, 10Analytics-Wikistats: Change 'NaN' to something more helpful in metrics % change over the selected time range - https://phabricator.wikimedia.org/T192028#4124963 (10sahil505) [20:42:10] oh joal, you are going to make a release? [20:42:13] you can do with refine patches, that's fine [20:42:19] i don't have the one for df.head vs count [20:42:27] but if you need to make a release, yes include them [20:55:21] Hello A-team! Can I create a table to record users' current state ([appInstallID, property, property_value]), and use eventlogging to update the property_value whenever there is a change? [20:56:30] chelsyx, hi! hmmmm, EventLogging does not update current data, it just stores events with their timestamps [20:57:07] chelsyx: yeah, you'd need to either fire the events from the app or process the corresponding EventLogging kafka topic with some custom code [20:57:34] you could probably have queries that get a user's current state, given a list of status changes, but that might be not very efficient [20:59:29] Hmmmm, I can do that through a list of status changes, but how and where can I store users' initial state? [20:59:33] chelsyx: wow what a stream processing use case! [21:03:11] chelsyx: well, the 1st event you get for a user is initial state [21:03:31] chelsyx: also think that purging happens every 90 days thus history is restricted [21:03:43] chelsyx: rather than a snapshot you have a"history table" [21:04:11] nuria_: right, but not if everyone has a different initial state from somewhere else. In that case you'd need to do custom processing and have a custom table somewhere, yeah [21:05:11] milimetric: unless you time travel to the time where they installed app i think that initializing to their actual state is best you can do [21:05:21] https://www.confluent.io/blog/ksql-open-source-streaming-sql-for-apache-kafka/ chelsyx this kiiiinda works now, with some manual poking [21:08:01] nuria_: yeah, like milimetric said, for existing users, everyone has a different initial state, I can't time travel :P [21:09:30] 10Analytics, 10Analytics-Wikistats, 10Patch-For-Review: Display of radio buttons in Wikistats 2 is somewhat confusing - https://phabricator.wikimedia.org/T183185#4125011 (10Nuria) This seems a task that needs mocks to decide in which are the best visuals. Let's put it on backlog. One thing we for sure need... [21:09:50] Some schema just record the state in a EL table whenever a event is log, like this one (listcount and itemcount), https://meta.wikimedia.org/wiki/Schema:MobileWikiAppReadingLists [21:10:20] chelsyx: yes, el will record a sequence of states in your case [21:10:48] but I was hoping there was a way to store the state in a separate place [21:11:08] 10Analytics, 10Analytics-Wikistats: Upgrading Wikistats 2.0 footer UI/design - https://phabricator.wikimedia.org/T191672#4125013 (10Nuria) +1 to this work, thank you. [21:12:07] chelsyx: what is what you hope to find from your data [21:12:20] chelsyx: as in , what questions does this data need to answer? [21:12:59] mforns: looked at the list of tickets from GSoC, commented in all. This one specially i think it needs to go back to beta column: https://phabricator.wikimedia.org/T183185 [21:13:30] mforns: i think this we should definitely do: https://phabricator.wikimedia.org/T191672 [21:13:43] 10Analytics, 10Analytics-Wikistats: Change 'NaN' to something more helpful in metrics % change over the selected time range - https://phabricator.wikimedia.org/T192028#4125015 (10mforns) @sahil505 Good catch! Feel free to work on this if you want. [21:14:22] nuria_: Usage metrics for the new reading list feature. We want to know how many users save articles to reading list, how many list and how many articles do they save, etc... [21:15:07] nuria_, I agree that one still needs discussion and will take a while [21:15:28] nuria_: I wanted to have a EL table to record the events with structure like this: https://matomo.org/docs/event-tracking/ [21:15:35] chelsyx: don't have time now, but there a lot of cool ways to do what you say :D ping me tomorrow maybe we can jump in a hangout [21:15:44] chelsyx: ok, and how does your schema is used in this case? [21:15:56] nuria_, the second one can go ahead on my opinion as well [21:16:26] chelsyx: that is basically eventlogging 1to 1 [21:16:43] chelsyx: the matomo events that is [21:17:02] thanks for commenting! [21:17:34] chelsyx: quite different from state we were talking about before, if that makes sense [21:17:42] nuria_: Instead of storing state values in the event table, I was hoping to have a separate user table so that I can store the most current state, and compute things like on average, how many articles do users save [21:17:45] chelsyx: (we can talk on batcave if you want) [21:18:22] ottomata: I will contact you tomorrow! :) Thanks [21:18:24] chelsyx: as the events are "interactions" [21:18:53] nuria_: Thanks, joining batcave [21:19:21] chelsyx: omw [21:25:23] (03PS3) 10Mforns: [WIP] Add job and query for page previews aggregation [analytics/refinery] - 10https://gerrit.wikimedia.org/r/425281 (https://phabricator.wikimedia.org/T186728) [21:38:55] mforns: agreed [21:39:05] mforns: let me know if you need more help with this [21:39:11] thanks! [21:43:09] (03CR) 10Nuria: [WIP] Add job and query for page previews aggregation (031 comment) [analytics/refinery] - 10https://gerrit.wikimedia.org/r/425281 (https://phabricator.wikimedia.org/T186728) (owner: 10Mforns) [21:46:35] (03CR) 10Mforns: [WIP] Add job and query for page previews aggregation (031 comment) [analytics/refinery] - 10https://gerrit.wikimedia.org/r/425281 (https://phabricator.wikimedia.org/T186728) (owner: 10Mforns) [22:00:30] ottomata: question: why did we build spark as >zip rather than jars? [22:18:39] nuria_: it is easier, and i followed some instructions i found online. [22:18:51] i don't know where cloudera got its assembly jar [22:19:02] the zip file is just a zip of all files in a dir [22:19:04] and works just as well [22:19:22] ottomata: k