[00:28:06] HaeB: hmmm, sorry I was caught up in stuff, leaving to India tomorrow and had to do things
[00:28:45] HaeB: what's the query you're running? Can you split it up into weeks/15 days?
[00:30:34] no, it's about most viewed pages over a longer timespan
[00:34:39] gaah
[00:34:42] hmmm
[00:44:47] HaeB: can you paste query?
[00:48:55] madhuvishy: well, it's not really the query's fault ;) (as i said, it works fine on hive except the unicode issue), but FWIW:
[00:48:57] SELECT CONCAT('https://',project,'.org/wiki/',page_title), SUM(view_count) AS views FROM wmf.pageview_hourly WHERE year = 2015 AND agent_type = "user" GROUP BY project, page_title ORDER BY views DESC LIMIT 200;
[00:50:25] i'll try splitting it up into an inner query that generates the list of pages, and an other that does the order and limit
[00:54:54] nah, doesn't help, "java.lang.OutOfMemoryError" again (even though i also restricted the inner query to pages with > 100000 views)
[03:35:09] Analytics-Engineering, Analytics-Wikimetrics: New cohorts not validating on Wikimetrics - https://phabricator.wikimedia.org/T116456#1750053 (TFlanagan-WMF) NEW
[06:44:36] PROBLEM - Analytics Cassanda CQL query interface on aqs1001 is CRITICAL: Connection timed out
[06:51:36] RECOVERY - Analytics Cassanda CQL query interface on aqs1001 is OK: TCP OK - 0.000 second response time on port 9042
[07:50:36] PROBLEM - Analytics Cassanda CQL query interface on aqs1001 is CRITICAL: Connection timed out
[07:52:16] RECOVERY - Analytics Cassanda CQL query interface on aqs1001 is OK: TCP OK - 0.025 second response time on port 9042
[08:12:06] PROBLEM - Analytics Cassanda CQL query interface on aqs1001 is CRITICAL: Connection timed out
[08:12:54] Analytics-Tech-community-metrics, DevRel-October-2015: Backlogs of open changesets by affiliation - https://phabricator.wikimedia.org/T113719#1750142 (Qgil) The main use cases today are: * We want to focus on code review of oldest patches submitted by independent / unknown volunteers. Can we have a list?...
[08:13:47] RECOVERY - Analytics Cassanda CQL query interface on aqs1001 is OK: TCP OK - 3.003 second response time on port 9042
[08:15:00] Analytics-Tech-community-metrics, DevRel-October-2015: Affiliations and country of resident should be visible in Korma's user profiles - https://phabricator.wikimedia.org/T112528#1750144 (Qgil) The (admittedly vague today) use cases for country data are: * Helping developers to find each other for potent...
[08:56:13] Analytics-Backlog, Datasets-General-or-Unknown, operations: Requests to dumps.wikimedia.org should end up in hadoop wmf.webrequest via kafka! - https://phabricator.wikimedia.org/T116430#1750184 (Addshore) Hmm, this isn't a duplicate..? lists.wm.o != dumps.wm.o !!!
[08:57:51] Analytics-Backlog, Wikimedia-Mailing-lists, operations: Requests to lists.wikimedia.org should end up in hadoop wmf.webrequest via kafka! - https://phabricator.wikimedia.org/T116429#1750185 (Addshore) Well, this mainly applies to dumps.wm.o (which the other ticket was open for). But I was looking to se...
[15:20:57] Analytics-Engineering, Analytics-Wikimetrics: New cohorts not validating on Wikimetrics - https://phabricator.wikimedia.org/T116456#1750621 (TFlanagan-WMF) Open>Resolved a:TFlanagan-WMF Problem appears to have resolved itself overnight? Not sure what the root cause was.
[16:14:22] (PS1) Christopher Johnson (WMDE): adds parameterized dygraphs adds chart links in datatables [wikidata/analytics/dashboard] - https://gerrit.wikimedia.org/r/248627
[16:18:07] (PS2) Christopher Johnson (WMDE): adds parameterized dygraphs adds chart links in datatables [wikidata/analytics/dashboard] - https://gerrit.wikimedia.org/r/248627
[16:23:50] (PS3) Christopher Johnson (WMDE): adds parameterized dygraphs adds chart links in datatables [wikidata/analytics/dashboard] - https://gerrit.wikimedia.org/r/248627
[16:29:00] (CR) Christopher Johnson (WMDE): [C: 2 V: 2] adds parameterized dygraphs adds chart links in datatables [wikidata/analytics/dashboard] - https://gerrit.wikimedia.org/r/248627 (owner: Christopher Johnson (WMDE))
[16:34:22] Analytics-Tech-community-metrics, DevRel-November-2015: Explain / sort out / fix SCM repository number mismatch on korma - https://phabricator.wikimedia.org/T116483#1750662 (Aklapper) NEW a:Dicortazar
[16:34:28] Analytics-Tech-community-metrics, DevRel-November-2015: Explain / sort out / fix SCR repository number mismatch on korma - https://phabricator.wikimedia.org/T116484#1750669 (Aklapper) NEW a:Dicortazar
[16:35:35] Analytics-Tech-community-metrics, Developer-Relations, DevRel-October-2015: Check whether it is true that we have lost 40% of (Git) code contributors in the past 12 months - https://phabricator.wikimedia.org/T103292#1750677 (Aklapper) >>! In T103292#1404495, @Aklapper wrote: > There are differences in...
[17:02:13] Analytics-Tech-community-metrics, Developer-Relations, DevRel-October-2015: Check whether it is true that we have lost 40% of (Git) code contributors in the past 12 months - https://phabricator.wikimedia.org/T103292#1750715 (Aklapper) >>! In T103292#1387470, @Qgil wrote: > in http://korma.wmflabs.org/b...
[17:50:55] PROBLEM - Difference between raw and validated EventLogging overall message rates on graphite1001 is CRITICAL: CRITICAL: 40.00% of data above the critical threshold [30.0]
[17:52:45] RECOVERY - Difference between raw and validated EventLogging overall message rates on graphite1001 is OK: OK: Less than 25.00% above the threshold [20.0]
[18:10:47] PROBLEM - Difference between raw and validated EventLogging overall message rates on graphite1001 is CRITICAL: CRITICAL: 40.00% of data above the critical threshold [30.0]
[18:12:36] RECOVERY - Difference between raw and validated EventLogging overall message rates on graphite1001 is OK: OK: Less than 25.00% above the threshold [20.0]
[19:24:42] Analytics-Tech-community-metrics, Developer-Relations, DevRel-October-2015: Check whether it is true that we have lost 40% of (Git) code contributors in the past 12 months - https://phabricator.wikimedia.org/T103292#1750831 (Aklapper) More Git repos that seem to have upstream commits only: * hhvm-dev (...
[19:32:09] Analytics-Tech-community-metrics, Developer-Relations, DevRel-October-2015: Check whether it is true that we have lost 40% of (Git) code contributors in the past 12 months - https://phabricator.wikimedia.org/T103292#1750839 (Aklapper) >>! In T103292#1415360, @Dicortazar wrote: > I do not know if there'...
[19:36:32] Analytics-Backlog, Analytics-Kanban: Druid testing on labs to asses whether is a suitable Cassandra replacement. {slug} - https://phabricator.wikimedia.org/T116409#1750841 (Krenair)
[20:23:46] PROBLEM - Difference between raw and validated EventLogging overall message rates on graphite1001 is CRITICAL: CRITICAL: 26.67% of data above the critical threshold [30.0]
[20:25:36] RECOVERY - Difference between raw and validated EventLogging overall message rates on graphite1001 is OK: OK: Less than 25.00% above the threshold [20.0]
[20:29:58] Analytics-Tech-community-metrics, Developer-Relations, DevRel-October-2015: Check whether it is true that we have lost 40% of (Git) code contributors in the past 12 months - https://phabricator.wikimedia.org/T103292#1751038 (Aklapper) My current assumption is that `hhvm-dev` (and its imported upstream-...
[20:32:46] Analytics-Tech-community-metrics, DevRel-October-2015: Correct affiliation for code review contributors of the past 30 days - https://phabricator.wikimedia.org/T112527#1751042 (Luiscanasdiaz) @aklapper the data on korma is already updated
[22:26:54] Analytics-Tech-community-metrics, DevRel-October-2015: Correct affiliation for code review contributors of the past 30 days - https://phabricator.wikimedia.org/T112527#1751147 (Aklapper) >>! In T112527#1751042, @Luiscanasdiaz wrote: > @aklapper the data on korma is already updated @Luiscanasdiaz: I don't...