[00:35:21] Analytics, Product-Analytics: Hive Runtime Error - Query on event.MobileWikiAppDailyStats failing with errors - https://phabricator.wikimedia.org/T277348 (SNowick_WMF)
[00:40:21] PROBLEM - Throughput of EventLogging EventError events on alert1001 is CRITICAL: 186.3 ge 30 https://wikitech.wikimedia.org/wiki/Analytics/Systems/EventLogging/Administration https://grafana.wikimedia.org/dashboard/db/eventlogging?panelId=13&fullscreen&orgId=1
[00:43:57] Analytics, Analytics-Kanban, Better Use Of Data: Create Oozie job for session length - https://phabricator.wikimedia.org/T273116 (kzimmerman) Open→Resolved
[00:46:09] Analytics-Radar, Better Use Of Data, Product-Analytics, Product-Data-Infrastructure: Roll-up raw sessionTick data into distribution - https://phabricator.wikimedia.org/T271455 (kzimmerman) Open→Resolved
[00:49:37] RECOVERY - Throughput of EventLogging EventError events on alert1001 is OK: (C)30 ge (W)20 ge 0.07143 https://wikitech.wikimedia.org/wiki/Analytics/Systems/EventLogging/Administration https://grafana.wikimedia.org/dashboard/db/eventlogging?panelId=13&fullscreen&orgId=1
[01:24:42] Analytics, Event-Platform, Inuka-Team (Kanban): KaiOSAppFeedback Event Platform Migration - https://phabricator.wikimedia.org/T267345 (SBisson) @Ottomata I've been trying to send event for the KaiOS feedback schema. I always receive a 500 Internal server error but no actual error that I can see. This...
[02:08:54] Analytics, Event-Platform, Research: TranslationRecommendation* Schemas Event Platform Migration - https://phabricator.wikimedia.org/T271163 (bmansurov) I think so. For example, here's what was sent: `lang=json {"schema":"TranslationRecommendationUIRequests","$schema":"/analytics/legacy/translationr...
[15:11:58] hey quick question... for a new data dive, should I use Jupyter or Newpyter?
[15:12:19] the latter :)
[15:12:50] (it should become the only way soon-ish, we are ironing out the last details!)
[15:18:08] elukey: fantasmic thanks! yeah I was looking forward to trying it, just didn't know if it was stable enough yet :) :)
[15:43:03] elukey: ahhh since I see you're about, I could bug you for another quick question heheh... apologies for the bother, especially since it's a weekend, and it's not urgent... I was just wondering if you have any thoughts on what current system to use to pull a random sample of rows in pageview actor and webrequest tables for a specified time period and other query parameters? I haven't used Hive
[15:43:05] in a while so I'm not sure what new stuff there may be..........
[15:43:22] Doesn't have to be a random sample, could also be the full data
[15:43:45] but I was just thinking with the new systems (like Presto) in place, maybe that's now doable and faster?
[17:32:31] AndyRussG: for small data Presto is fine, for big datasets we still suggest/recommend to go for hive or spark if possible
[17:33:24] AndyRussG: if you need to explore data, we have webrequest_128 in turnilo (it is a sampled, 1/128th, webrequest basically)
[17:42:29] elukey: ahh cool beans thanks! so I want to do a distribution of dates in last visited cookies (in X-analytics)... I think...
[17:42:55] basically what I want to find is: over the course of a 2-week period, what percentage of unique devices only visit the site once
[17:43:14] for a few specific period/language/country combinations
[17:47:04] AndyRussG: let's open a task in case for next week, so people of my team will chime in :)
[17:57:43] elukey: thanks ahhh actually it's just at this point a personal development/volunteer project
[17:58:26] elukey: probably I'd be scolded for taking up your team's time if I opened a task... but thanks so much for suggesting it :) :)
[17:58:45] (scolded not by anyone from analytics, but elsewhere)
[18:05:47] nah it is really fine, we are happy to help :)
[18:24:01] ehhh thanks!
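
Note on the last-visited-cookie question above: a minimal sketch of the "distribution of dates" step, assuming the X-Analytics value has already been pulled out of a small sample as plain `key=value;key=value` strings. The key name `WMF-Last-Access`, the date format, and the sample rows are assumptions made up for illustration; none of them come from this log, and the sketch does not touch Hive, Spark, or Presto.

```python
from collections import Counter


def parse_x_analytics(header: str) -> dict:
    """Split a 'key=value;key=value' string into a dict (items without '=' are skipped)."""
    pairs = (item.split("=", 1) for item in header.split(";") if "=" in item)
    return {key.strip(): value.strip() for key, value in pairs}


def last_access_distribution(headers, key="WMF-Last-Access"):
    """Count how often each last-access date appears; rows without the key count as '(none)'."""
    counts = Counter()
    for header in headers:
        counts[parse_x_analytics(header).get(key, "(none)")] += 1
    return counts


# Hypothetical sample rows, invented for this sketch.
sample = [
    "WMF-Last-Access=13-Mar-2021;https=1",
    "https=1",
    "WMF-Last-Access=12-Mar-2021;https=1",
    "WMF-Last-Access=13-Mar-2021",
]

dist = last_access_distribution(sample)
for date, n in dist.most_common():
    print(date, n)
```

Rows that land in `(none)` carry no last-access date at all. Whether they should be read as first-time visits is a methodology question this sketch leaves open.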