[00:14:40] (CR) Nuria: [C: 2] "Looks good, tested in vagrant and conversion and display are working well, merging." [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/175169 (owner: Bmansurov) [00:37:27] Analytics-Dashiki, Analytics-Engineering: Vital Signs user reads description of metric - https://phabricator.wikimedia.org/T76741#820203 (kevinator) [00:38:12] Analytics-Wikimetrics: Accept more timezones as input - https://phabricator.wikimedia.org/T74116#820205 (Nuria) Open>Resolved [00:40:09] Analytics-Dashiki, Analytics-Engineering: User sees banner in Dashiki when loading the site - https://phabricator.wikimedia.org/T76695#820210 (kevinator) p:Normal>High [00:41:26] (PS1) QChris: Abort deployment, if Oozie's Hive config seems to contain passwords [analytics/refinery] - https://gerrit.wikimedia.org/r/177714 [00:41:28] (PS1) QChris: Improve error message in deploy script, if Oozie's Hive config is not readable [analytics/refinery] - https://gerrit.wikimedia.org/r/177715 [00:41:30] (PS1) QChris: Drop jars that are not on all worker nodes from Oozie's Hive config [analytics/refinery] - https://gerrit.wikimedia.org/r/177716 [00:41:45] Analytics-Dashiki, Analytics-Engineering: Vital Signs user sees banner in Dashiki when loading the site - https://phabricator.wikimedia.org/T76695#820216 (kevinator) [00:45:35] Analytics-Dashiki: Cannot add project to dashboard - https://phabricator.wikimedia.org/T73333#820223 (kevinator) p:Triage>Normal [00:46:15] Analytics-Dashiki: Cannot add project to dashboard - https://phabricator.wikimedia.org/T73333#820225 (kevinator) [00:54:38] Analytics: Backfill pageview data with WebstatsCollector data and/or Sampled Log data - https://phabricator.wikimedia.org/T76768#820242 (Krenair) [01:43:45] Analytics-Wikimetrics: User sees TOS when loging in - https://phabricator.wikimedia.org/T76826 (kevinator) NEW p:Triage [01:44:05] Analytics-Wikimetrics: User sees TOS when loging in - https://phabricator.wikimedia.org/T76826#820369 
(kevinator) [01:48:23] Analytics-Wikimetrics, Analytics-Engineering: Wikimetrics User clicks on Terms or Use link on website - https://phabricator.wikimedia.org/T76107#820383 (kevinator) [01:51:15] Analytics-Wikimetrics: User sees TOS when loging in - https://phabricator.wikimedia.org/T76826#820387 (kevinator) [02:06:31] Analytics-Wikimetrics, Analytics-Engineering: Support page should mention Phabricator, not Bugzilla - https://phabricator.wikimedia.org/T76521#820406 (Nuria) Open>Resolved [03:12:32] Analytics-Engineering: Story: Data Warehouse manages schema migrations with alembic - https://phabricator.wikimedia.org/T76829 (Nuria) NEW p:Triage a:Nuria [03:17:55] (PS1) Nuria: [WIP] Manage warehouse migrations with alembic [analytics/data-warehouse] - https://gerrit.wikimedia.org/r/177739 [03:37:16] (PS2) Nuria: [WIP] Manage warehouse migrations with alembic [analytics/data-warehouse] - https://gerrit.wikimedia.org/r/177739 [08:12:04] Wikimedia-General-or-Unknown, Analytics: Sudden drop in number of articles on nowiki on Nov29 (by 34k articles) - https://phabricator.wikimedia.org/T76356#820767 (Nemo_bis) Open>Invalid No articles vanished, Special:Statistics/NUMBEROFARTICLES was simply wrong. https://meta.wikimedia.org/w/index.php?tit... [08:14:03] Wikimedia-General-or-Unknown, Analytics: Sudden drop in number of articles on nowiki on Nov29 (by 34k articles) - https://phabricator.wikimedia.org/T76356#820776 (Nemo_bis) [12:40:21] Engineering-Community, Analytics-Tech-community-metrics, Phabricator: Monthly report of total / active Phabricator users - https://phabricator.wikimedia.org/T1003#821136 (Aklapper) Let's have a separate ticket for some stats app in Phab. To avoid confusion: This ticket was originally about the cronjob that on... 
[13:50:26] Engineering-Community, Analytics-Tech-community-metrics, Phabricator: Monthly report of total / active Phabricator users - https://phabricator.wikimedia.org/T1003#821248 (chasemp) There is a "Fact" app that is pretty young, but it may make more sense to put our time into that https://secure.phabricator.com/fact/ [14:09:37] (CR) Ottomata: "Looks good. I think you should add an override flag for this. It is ok to deploy with passwords if we are using vagrant or labs for test" [analytics/refinery] - https://gerrit.wikimedia.org/r/177714 (owner: QChris) [14:10:10] (CR) Ottomata: [V: 2] Improve error message in deploy script, if Oozie's Hive config is not readable [analytics/refinery] - https://gerrit.wikimedia.org/r/177715 (owner: QChris) [14:10:15] (CR) Ottomata: [C: 2] Improve error message in deploy script, if Oozie's Hive config is not readable [analytics/refinery] - https://gerrit.wikimedia.org/r/177715 (owner: QChris) [14:11:18] (CR) Ottomata: "Perhaps we should deploy to worker nodes?" [analytics/refinery] - https://gerrit.wikimedia.org/r/177716 (owner: QChris) [15:18:16] Analytics-Refinery: Hive freezes starting a query, and produces the following error... - https://phabricator.wikimedia.org/T63100#821428 (Ironholds) Open>Resolved a:Ironholds Seems resolved. [15:18:31] Analytics-Refinery: Another Kraken error around nan in webrequests_mobile - https://phabricator.wikimedia.org/T62776#821432 (Ironholds) Open>Resolved a:Ironholds Not seen this since. [15:20:06] Wikimedia-General-or-Unknown, Analytics: Sudden drop in number of articles on nowiki on Nov29 (by 34k articles) - https://phabricator.wikimedia.org/T76356#821443 (Ironholds) To be entirely clear, for Jeblad's sake: Special:Statistics' article count is not generated by "number of articles", it's generated by t... [15:42:00] qchris_away: https://gerrit.wikimedia.org/r/#/c/169974/ is not merged yet [15:42:06] what's left? 
[15:48:55] (CR) Nuria: "Thank you, i have marked the google code-in task as done. Now, I feel that more work could be done here to adapt graph to different viewpo" [analytics/dashiki] - https://gerrit.wikimedia.org/r/177487 (owner: Unicodesnowman) [15:55:12] (CR) Milimetric: [C: 2 V: 2] "Following reports of rendering problems, I tried to replicate by wrapping config data in setTimeout calls. None of that seemed to matter " [analytics/dashiki] - https://gerrit.wikimedia.org/r/177005 (owner: Mforns) [15:55:34] mforns: couldn't find anything wrong - well done [16:01:46] milimetric: Yup. This change turned into analytics/aggregator project. [16:02:02] cool - so should I abandon? [16:02:04] Should I just abandon the gerrit change you linked to? Was not sure [16:02:11] sure, no prob [16:02:18] ok. [16:02:34] (Abandoned) Milimetric: Transform projectcounts hourly files [analytics/refinery] - https://gerrit.wikimedia.org/r/169974 (https://bugzilla.wikimedia.org/72740) (owner: Milimetric) [16:35:49] Wikimedia-General-or-Unknown, Analytics: Sudden drop in number of articles on nowiki on Nov29 (by 34k articles) - https://phabricator.wikimedia.org/T76356#821551 (jeblad) The number I refer to is the number on the top right side in [[https://no.wikipedia.org/wiki/Spesial:Siste_endringer|Special:RecentChanges]... [16:37:17] Analytics-Wikimetrics, Analytics-Engineering: [Dev 13 pts] Fix oauth and do a quick pre-security review. - https://phabricator.wikimedia.org/T76779#821553 (Milimetric) [16:38:42] (PS3) Milimetric: [WIP] Manage warehouse migrations with alembic [analytics/data-warehouse] - https://gerrit.wikimedia.org/r/177739 (owner: Nuria) [16:49:35] Wikimedia-General-or-Unknown, Analytics: Sudden drop in number of articles on nowiki on Nov29 (by 34k articles) - https://phabricator.wikimedia.org/T76356#821562 (Ironholds) Again, NUMBEROFARTICLES and Special:Statistics' count are the same count, generated through the same method. This is based on the number... 
[16:53:59] Wikimedia-General-or-Unknown, Analytics: Sudden drop in number of articles on nowiki on Nov29 (by 34k articles) - https://phabricator.wikimedia.org/T76356#821564 (Ironholds) Hmn. Actually, it looks like the edit number should include edits in the archive table. This is interesting. [16:57:56] Wikimedia-General-or-Unknown, Analytics: Sudden drop in number of articles on nowiki on Nov29 (by 34k articles) - https://phabricator.wikimedia.org/T76356#821569 (Ironholds) [16:58:28] could we please get wikibugs to not include usernames, or something? [16:58:34] because getting pinged for doing my job is infuriating. [16:59:13] Engineering-Community, Analytics-Tech-community-metrics, Phabricator: Monthly report of total / active Phabricator users - https://phabricator.wikimedia.org/T1003#821570 (Dzahn) https://gerrit.wikimedia.org/r/#/c/177792/ [17:01:11] Wikimedia-General-or-Unknown, Analytics: Sudden drop in number of articles on nowiki on Nov29 (by 34k articles) - https://phabricator.wikimedia.org/T76356#821572 (Ironholds) [17:08:34] Wikimedia-General-or-Unknown, Analytics: Sudden drop in number of articles on nowiki on Nov29 (by 34k articles) - https://phabricator.wikimedia.org/T76356#821595 (Ironholds) Okay, will check the dumps; this looks like it could be a problem. I'll report back; if it /is/, this becomes Ops's issue. Hrm. [17:08:47] GodDAMMIT. 
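Note on the config-loading exchange that follows (17:14 to 17:21): rtnpro proposes moving the whole loading logic, "taking into account the default file and the specially mentioned file", into one function. A minimal sketch of that idea is below. It is not the wikimetrics code: the function name and signature, the JSON file format, and the shallow merge are all assumptions made for illustration.

```python
import json
import os
import tempfile


def load_config(default_path, override_path=None):
    """Return settings from default_path, updated by override_path if given.

    Hypothetical helper: the name, the JSON format, and the shallow merge
    are assumptions for illustration, not the project's actual code.
    """
    with open(default_path) as f:
        config = json.load(f)
    # Only read the override when one was named and actually exists.
    if override_path and os.path.exists(override_path):
        with open(override_path) as f:
            config.update(json.load(f))  # shallow merge: override wins
    return config


def demo():
    """Write two throwaway config files and merge them."""
    with tempfile.TemporaryDirectory() as tmp:
        default_path = os.path.join(tmp, "default.json")
        override_path = os.path.join(tmp, "override.json")
        with open(default_path, "w") as f:
            json.dump({"DEBUG": False, "LOG_LEVEL": "INFO"}, f)
        with open(override_path, "w") as f:
            json.dump({"DEBUG": True}, f)
        return load_config(default_path, override_path)
```

Keeping the default-plus-override logic in a single function with no side effects is one way to address the readability concern raised in the discussion; whether that is worth the rework is exactly what the participants leave open.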
[17:11:11] YuviPanda, milimetric, Hi [17:11:41] I couldn't do much work today, I am down with a cold and fever :( [17:12:37] hi rtnpro [17:13:07] Ironholds: yeah, that's really annoying [17:13:22] we gotta fix that, but I'm not sure who set it up and how [17:13:40] milimetric, I will work on the patch as soon as I feel better, hopefully tomorrow morning [17:14:05] rtnpro: no problem at all, hope you feel better [17:14:25] milimetric, so, I am working on two things on wikimetrics now: fixing logging and refactoring loading the configuration [17:14:36] in Romania we make sodou: mix an egg with honey and hot milk [17:14:52] rtnpro: that's great - but no rush. Take it easy [17:15:17] milimetric, let's discuss a bit, so that I have some food for thought [17:15:38] milimetric, did we used to have the dumps on stat1002? Or are they on 3? [17:15:53] milimetric, I am at a loss with refactoring the loading of the config file [17:17:55] milimetric, the load_config file does what it does well [17:18:28] milimetric, it extends the self.config variable [17:18:53] milimetric, could you enlighten me on the issues that you see with the current approach to loading the config file? [17:19:32] milimetric, I could move the entire logic of loading the config file, taking into account the default file and the specially mentioned file, into one function [17:19:43] rtnpro: the main issue i have is the code is a bit hard to read. But I guess that would require a lot of re-working and might not give us too much value [17:19:45] milimetric, but it will have to do the same thing at the end [17:20:35] rtnpro: let's maybe just deploy the logging changes and i'll collect what problems we have with it over the next month or so [17:20:35] milimetric, yes, it is not that readable, I can work on making it more readable [17:20:51] then we can be more concrete about what we want to improve [17:21:02] milimetric, ok, makes sense :) [17:21:44] cool. 
Then I'll take a look when you feel better and in the meantime, if you're bored, you can check out our other projects and see if anything's interesting to work on next. [17:50:47] ottomata: kafka, and librdkafka especially, are pretty damn cool. [17:51:33] we should migrate eventlogging to use it instead of udp [17:52:08] ottomata: analytics1021 is getting pretty active. Is that only due to the rebooting, or are you running things to get partitions replicated too? [17:54:00] it is back online since being off for a day [17:54:10] chris replaced the disk [17:54:24] i bumped up the replication speed while it catches up [17:54:28] The disk is there already? nice! [17:54:49] yup! [17:55:02] ori :) [17:56:10] is magnus still around? did he ever convince the python-kafka guy to port his lib to librdkafka? [17:56:13] Engineering-Community, Analytics-Tech-community-metrics, Phabricator: Monthly report of total / active Phabricator users - https://phabricator.wikimedia.org/T1003#821700 (Qgil) >>! In T1003#821248, @chasemp wrote: > There is a "Fact" app that is pretty young, but it may make more sense to put our time into th... [17:56:20] if not i might take a stab at python bindings [17:57:12] qchris, nuria__: btw, what would be your gut feeling about dropping python 2 support? we'd be able to remove a lot of code [17:57:40] ori: in one word BAD [17:57:43] I have not tested EventLogging on python 3. [17:57:49] But I am all for it. [17:58:17] nuria__: why? [17:58:18] in my opinion, if it doesn't deliver customer value immediately it is not worth the effort [17:58:32] ottomata: you don't need to hear from all of us saying if we have objections to the stat* VLAN move, right? [17:58:52] ori: but i am all about "if it's not broken do not change it" [17:58:54] nuria__: i wouldn't try to argue for it in a priorities meeting, true, but it wouldn't be hard and i think it'd be fun. i'd enjoy it, anyway. [17:59:03] ahhh .. 
that is different [17:59:17] ori, magnus is still around, but not doing work for us [17:59:21] i ping him now and then with questions [17:59:21] ori: but it will require a bunch of testing, especially the sqlalchemy mysql driver [17:59:23] nuria__: And, it would reduce the LOC for EventLogging. Less LOC, less code to maintain. Less worries. [17:59:25] not sure about python-kafka [17:59:44] ori: the driver -docs say- "it is not known to work under py3" [18:00:36] huh? where? [18:01:07] ori: do check out mysql driver for alchemy in python 3 and lemme know, last time we checked authors basically told you not to use it [18:01:23] ori: but that was [18:01:31] thinking ...6 months ago? [18:01:59] ugh, looks like you're right: https://filippo.io/sqlalchemy-plus-mysql-plus-python-3-plus-pip/ [18:02:24] switching to oursql could be risky [18:03:52] (PS4) Nuria: Manage warehouse migrations with alembic [analytics/data-warehouse] - https://gerrit.wikimedia.org/r/177739 [18:04:11] yeah, this would be a blocker [18:04:35] ori: ya, I think it needs baking time [18:05:00] ori: everything else (minus changes related to encoding) is probably pretty straightforward [18:13:42] Analytics-EventLogging, Mobile-Web: MobileWebClickTracking table is huge and thus querying too slow - https://phabricator.wikimedia.org/T76671#821725 (phuedx) a:Jdlrobson [18:25:39] ottomata: i might have to miss the real-time, or attend on my cell phone, so i want to jot down some notes ahead of time. is there an etherpad already or should i create one? 
[18:27:18] not one yet [18:27:35] we can call your cell from the hangout if you want [18:28:17] cool [19:09:48] Analytics-Engineering: [Ops] new group that would allow you to sudo to the wikimetrics user (to enable dev to run admin script) - https://phabricator.wikimedia.org/T76792#821819 (ggellerman) [19:11:50] Analytics-Engineering: [Ops] new group that would allow you to sudo to the wikimetrics user (to enable dev to run admin script) - https://phabricator.wikimedia.org/T76792#821826 (ggellerman) [19:12:01] Analytics-Engineering: [Ops] new group that would allow you to sudo to the wikimetrics user (to enable dev to run admin script) - https://phabricator.wikimedia.org/T76792#819814 (ggellerman) [19:15:28] Analytics-Engineering: Puppet Production role class for wikimetrics scheduler/queue - https://phabricator.wikimedia.org/T76791#821853 (ggellerman) [19:16:39] Analytics-Engineering, Analytics-Refinery: Backfill pageview data with WebstatsCollector data and/or Sampled Log data - https://phabricator.wikimedia.org/T76768#821866 (ggellerman) [19:17:15] Analytics-Engineering, Analytics-Refinery: Backfill pageview data with WebstatsCollector data and/or Sampled Log data - https://phabricator.wikimedia.org/T76768#819553 (ggellerman) [19:18:25] Analytics-Engineering: changing puppetization for scheduler/queue - https://phabricator.wikimedia.org/T76790#821876 (ggellerman) [19:19:31] Analytics-Wikimetrics, Analytics-Engineering: Fix oauth and do a quick pre-security review [13 pts] - https://phabricator.wikimedia.org/T76779#821882 (ggellerman) [19:29:17] Analytics-Refinery: Story: AnalyticsEng has kafkatee on analytics1003 - https://phabricator.wikimedia.org/T70246#821953 (ggellerman) p:Normal>Low Changing prio to low per @kevinator [19:29:21] Analytics-Refinery: Story: Transparently switch from udp2log datafiles over to kafkatee generated datafiles - https://phabricator.wikimedia.org/T70250#821958 (kevinator) [19:29:34] Analytics-Refinery: Story: AnalyticsEng generates 
new datafiles using kafkatee - https://phabricator.wikimedia.org/T70247#821964 (ggellerman) p:Unbreak!>Low Changing prio to low per @kevinator [19:30:11] Analytics-Refinery: Story: Vet the kafkatee generated files - https://phabricator.wikimedia.org/T70248#821982 (kevinator) [19:30:35] Analytics-Refinery: Epic: AnalyticsEng has kafkatee running in lieu of varnishcsa and udp2log - https://phabricator.wikimedia.org/T70139#821987 (kevinator) [19:33:05] Analytics-Engineering: [Ops] new group that would allow you to sudo to the wikimetrics user (to enable dev to run admin script) - https://phabricator.wikimedia.org/T76792#822015 (Krenair) As long as it's got a relevant project listed, that's fine. [19:35:42] Analytics-Refinery: Epic: AnalyticsEng has kafkatee running in lieu of varnishcsa and udp2log - https://phabricator.wikimedia.org/T70139#822029 (Ironholds) [19:35:56] wikibugs, SOD OFF. [19:36:36] YuviPanda, could you get wikibugs to stop adding usernames? or, do...something, to them? [19:37:08] Analytics, Analytics-Engineering: THEME: Analyst uses an operationalized Saiku - https://phabricator.wikimedia.org/T75246#822036 (kevinator) p:Triage>Normal [19:37:36] Analytics-Engineering, Analytics-Refinery: Backfill pageview data with WebstatsCollector data and/or Sampled Log data - https://phabricator.wikimedia.org/T76768#822040 (kevinator) p:Triage>Low [19:37:46] Analytics, Analytics-Engineering: THEME: Analyst uses an operationalized Saiku - https://phabricator.wikimedia.org/T75246#822041 (Ironholds) [19:38:39] Analytics-Refinery: Story: Vet the kafkatee generated files - https://phabricator.wikimedia.org/T70248#822047 (Ironholds) [19:38:43] Analytics-Refinery: Story: AnalyticsEng generates new datafiles using kafkatee - https://phabricator.wikimedia.org/T70247#822048 (Ironholds) [19:38:46] Analytics-Engineering: EPIC: Getting Mondrian & Saiku productionized - https://phabricator.wikimedia.org/T76739#822051 (ggellerman) [19:38:54] Analytics-Engineering, 
Analytics-Refinery: Creating the data in Hadoop from the raw request data, with Oliver's definition - https://phabricator.wikimedia.org/T76762#822058 (kevinator) [19:38:57] Analytics-Engineering: EPIC: Getting Mondrian & Saiku productionized - https://phabricator.wikimedia.org/T76739#822060 (ggellerman) p:Low>Normal Changing prio to Normal per @kevinator [19:38:57] Analytics-Engineering, Analytics-Refinery: Define Oozie-executable job to aggregate the data (Hive?) - https://phabricator.wikimedia.org/T76763#822063 (kevinator) [19:39:12] Analytics-Refinery: Raw webrequest partitions that were not marked successful due to configuration updates - https://phabricator.wikimedia.org/T74300#822070 (Ironholds) [19:39:15] Analytics-Refinery: Make webrequest partition validation handle races between time and sequence numbers - https://phabricator.wikimedia.org/T71615#822071 (Ironholds) [19:39:19] Analytics-Refinery: Raw webrequest partitions that were not marked successful - https://phabricator.wikimedia.org/T72085#822072 (Ironholds) [19:39:23] Analytics-Refinery: cp1064.eqiad.wmnet lost a kafka message on 2014-11-18T20:05:24 - https://phabricator.wikimedia.org/T75609#822074 (Ironholds) [19:39:27] Analytics-Engineering, Analytics-Refinery: Schedule Oozie job and set up monitoring - https://phabricator.wikimedia.org/T76764#822075 (kevinator) p:Triage>Normal [19:39:31] Analytics-Refinery: Raw webrequest partitions that were not marked successful due to depooled servers interfering with monitoring - https://phabricator.wikimedia.org/T74649#822077 (Ironholds) [19:39:31] Analytics-Refinery: Raw webrequest partitions that were not marked successful due to only esams caches causing unknown problems - https://phabricator.wikimedia.org/T74809#822078 (Ironholds) [19:39:44] Analytics-Engineering, Analytics-Refinery: Sqoop? 
data from Hadoop into a PostgreSQL database, Oozify and monitor this - https://phabricator.wikimedia.org/T76765#822079 (kevinator) p:Triage>Normal [19:39:53] Analytics-Refinery: Raw webrequest partitions for 2014-10-30T21/1H not marked successful - https://phabricator.wikimedia.org/T74810#822081 (Ironholds) [19:39:57] Analytics-Refinery: Raw webrequest partitions for 2014-10-08T1[89]:xx:xx not marked successful - https://phabricator.wikimedia.org/T73881#822082 (Ironholds) [19:40:00] Analytics-Refinery: Duplicates/missing logs from esams bits for 2014-09-28T{18,19,20}:xx:xx - https://phabricator.wikimedia.org/T73435#822083 (Ironholds) [19:40:00] Analytics-Refinery: Raw webrequest partitions for 2014-10-07T1[789]:xx:xx not marked successful - https://phabricator.wikimedia.org/T73882#822084 (Ironholds) [19:40:04] Analytics-Refinery: Several raw webrequest partitions now marked successful between 2014-10-13T13:xx:xx and 2014-10-13T22:xx:xx - https://phabricator.wikimedia.org/T74028#822085 (Ironholds) [19:40:08] Analytics-Refinery: Raw webrequest partitions that were not marked successful due to network issues - https://phabricator.wikimedia.org/T74298#822086 (Ironholds) [19:40:12] Analytics-Refinery: Kafka partition leader elections causing a drop of a few log lines - https://phabricator.wikimedia.org/T72087#822091 (Ironholds) [19:40:12] Analytics-General-or-Unknown: Kafka broker analytics1021 not receiving messages every now and then - https://phabricator.wikimedia.org/T71667#822094 (Ironholds) [19:40:15] Analytics-Wikimetrics, Analytics-Engineering: Mondrian has access to the PostgreSQL database - https://phabricator.wikimedia.org/T76766#822092 (kevinator) [19:40:15] Analytics-Refinery: Raw webrequest partitions that were not marked successful due to deployments gone wrong - https://phabricator.wikimedia.org/T74299#822095 (Ironholds) [19:40:34] Analytics-Wikimetrics, Analytics-Engineering: make cubes as windows into warehouse - 
https://phabricator.wikimedia.org/T76767#822096 (kevinator) p:Triage>Normal [19:41:02] Analytics-Wikimetrics, Analytics-Engineering: EPIC: Aggregating and Shaping the data for browsing - https://phabricator.wikimedia.org/T76761#822098 (kevinator) [19:41:05] Analytics-Engineering: [Ops] new group that would allow you to sudo to the wikimetrics user (to enable dev to run admin script) - https://phabricator.wikimedia.org/T76792#822103 (ggellerman) Thanks for your patience, @Krenair! We are still ramping up in Phab, and figuring out how to manage backlogs and workf... [19:41:58] Analytics-General-or-Unknown: kafkatee not consuming for some partitions - https://phabricator.wikimedia.org/T73056#822119 (Ironholds) [19:42:09] Analytics-General-or-Unknown: X-Analytics header is "php=zend;php=zend" instead of "php=zend" on bits for some requests - https://phabricator.wikimedia.org/T72463#822120 (Ironholds) [19:42:12] Analytics-Refinery: Kafkatee generated files in /a/log/webrequest not updating since 2014-09-18 - https://phabricator.wikimedia.org/T73290#822121 (Ironholds) [19:42:15] Analytics-General-or-Unknown: Make kafka write to graphite (instead of / as well as) ganglia - https://phabricator.wikimedia.org/T73322#822122 (Ironholds) [19:42:18] Analytics-Refinery: MD5 checksums missing from pagecounts-all - https://phabricator.wikimedia.org/T73710#822123 (Ironholds) [19:42:25] Analytics-General-or-Unknown: Package libcidr + libanon + libdclass for Ubuntu Trusty - https://phabricator.wikimedia.org/T70997#822124 (Ironholds) [19:42:31] Analytics-General-or-Unknown: Listing oozie coordinators on stat1002 fails with type conversion error - https://phabricator.wikimedia.org/T73192#822125 (Ironholds) [19:42:45] Analytics-General-or-Unknown: Turn off default vhost on stat1001.wikimedia.org - https://phabricator.wikimedia.org/T70150#822126 (Ironholds) [19:42:49] Analytics-Refinery: Decide on job.properties vs. 
{workflow,coordinator,bundle}.properties - https://phabricator.wikimedia.org/T70570#822127 (Ironholds) [19:42:53] Analytics-Refinery: Make oozie write external stats per default - https://phabricator.wikimedia.org/T70569#822128 (Ironholds) [19:42:53] Analytics-General-or-Unknown: Create a table in labs with replication lag data - https://phabricator.wikimedia.org/T71463#822129 (Ironholds) [19:42:56] Analytics-General-or-Unknown: http://reportcard.wikimedia.org/ - redirect and delete old stuff - https://phabricator.wikimedia.org/T71625#822131 (Ironholds) [19:43:00] Analytics-General-or-Unknown: Packetloss_Average alarm on erbium on 2014-08-23 - https://phabricator.wikimedia.org/T72092#822132 (Ironholds) [19:43:18] Analytics-EventLogging: Remove ad-hoc UA logging from existing schemas - https://phabricator.wikimedia.org/T61832#822137 (Ironholds) [19:43:21] Analytics-Refinery: Change license to Apache license, Version 2.0 - https://phabricator.wikimedia.org/T65084#822138 (Ironholds) [19:43:30] Analytics-Refinery: Auto-delete non sanctioned tables - https://phabricator.wikimedia.org/T67949#822141 (Ironholds) [19:43:38] Analytics-Refinery: Support Wikipedia Zero change from X-CS to X-Analytics - https://phabricator.wikimedia.org/T68546#822151 (Ironholds) [19:43:41] Analytics-General-or-Unknown: Current puppet does not allow to bring up a cluster in labs - https://phabricator.wikimedia.org/T70161#822152 (Ironholds) [19:43:47] Analytics-General-or-Unknown: Inventory Analytics systems for operational support - https://phabricator.wikimedia.org/T70450#822155 (Ironholds) [19:43:50] Analytics-Refinery: Make oozie use system's libpath per default - https://phabricator.wikimedia.org/T70568#822156 (Ironholds) [19:43:53] Analytics-EventLogging: Add sanitized User-Agent to default fields logged by EventLogging - https://phabricator.wikimedia.org/T54295#822157 (Ironholds) [19:44:42] Analytics-Refinery: Hive queries inconsistently failing - 
https://phabricator.wikimedia.org/T67420#822165 (Ironholds) [19:45:11] Analytics-General-or-Unknown: datasets.wikimedia.org SSL error - https://phabricator.wikimedia.org/T74805#822172 (Ironholds) [19:45:49] Analytics-Refinery: Raw webrequest partitions that were not marked successful - https://phabricator.wikimedia.org/T72085#822178 (kevinator) p:Triage>Normal [19:46:20] Analytics-Refinery: Kafka partition leader elections causing a drop of a few log lines - https://phabricator.wikimedia.org/T72087#822185 (kevinator) p:Triage>Normal [19:46:20] Analytics-General-or-Unknown: Kafkatee zero files having 10% less requests than udp2log zero files - https://phabricator.wikimedia.org/T66181#822188 (Ironholds) [19:46:27] Analytics-Refinery: Raw webrequest partitions that were not marked successful due to network issues - https://phabricator.wikimedia.org/T74298#822193 (kevinator) p:Triage>Normal [19:46:34] Analytics-Refinery: Raw webrequest partitions that were not marked successful due to configuration updates - https://phabricator.wikimedia.org/T74300#822198 (kevinator) p:Triage>Normal [19:46:49] Analytics-Refinery: Raw webrequest partitions that were not marked successful due to only esams caches causing unknown problems - https://phabricator.wikimedia.org/T74809#822202 (kevinator) p:Triage>Normal [19:47:14] Analytics-General-or-Unknown: "data:" URLs accounting for 6 of the top 10 most viewed articles reported by stats.grok.se - https://phabricator.wikimedia.org/T68112#822212 (Ironholds) [19:47:17] Analytics-General-or-Unknown: convert reports to use kafka-tee output - https://phabricator.wikimedia.org/T66016#822213 (Ironholds) [19:47:20] Analytics-General-or-Unknown: Icinga should notice people when /home partition on stat1002 fills up - https://phabricator.wikimedia.org/T65522#822214 (Ironholds) [19:47:22] Analytics-General-or-Unknown: Files in /srv on stat1003 owned by non-existing users - https://phabricator.wikimedia.org/T66749#822215 (Ironholds) [19:47:29] 
Analytics-Refinery: Make webrequest partition validation handle races between time and sequence numbers - https://phabricator.wikimedia.org/T71615#822216 (kevinator) [19:47:45] Analytics-Refinery: cp1064.eqiad.wmnet lost a kafka message on 2014-11-18T20:05:24 - https://phabricator.wikimedia.org/T75609#822219 (kevinator) [19:48:11] Analytics-General-or-Unknown: Number of Wikipedia Zero increasing drastically in mid March 2014 - https://phabricator.wikimedia.org/T64848#822224 (Ironholds) [19:48:15] Analytics-Refinery: Archiva web UI formats links to download artifacts with local IP address - https://phabricator.wikimedia.org/T67628#822226 (Ironholds) [19:48:21] Analytics-General-or-Unknown: Using puppet's active_nodes or pybal to determine mobile frontend caches - https://phabricator.wikimedia.org/T66240#822227 (Ironholds) [19:48:45] Analytics-General-or-Unknown: Analytics: Can we start quoting our logging fields? - https://phabricator.wikimedia.org/T62184#822228 (Ironholds) Open>Resolved a:Ironholds This will be resolved by switching to hadoop; done. 
[19:49:02] Analytics-General-or-Unknown: Replication checks disabled in Icinga for most analytics slaves - https://phabricator.wikimedia.org/T66088#822238 (Ironholds) [19:49:16] Analytics-General-or-Unknown: Analytics: Upgrade R on Analytics machines to 3.1.0 - https://phabricator.wikimedia.org/T66126#822239 (Ironholds) Open>Resolved a:Ironholds Done [19:49:19] Analytics-General-or-Unknown: Attempts of Hadoop tasks randomly fail "Bad connect ack with firstBadLink as $SOME_CLUSTER_IP" - https://phabricator.wikimedia.org/T65693#822242 (Ironholds) [19:49:22] Analytics-General-or-Unknown: ~30% increase of number of lines zero tsvs between 20140218 and 20140220 file - https://phabricator.wikimedia.org/T63660#822243 (Ironholds) [19:49:36] Analytics-General-or-Unknown: udp2log and/or demux.py filename corruption - https://phabricator.wikimedia.org/T64082#822244 (Ironholds) [19:49:39] Analytics-General-or-Unknown: Hive queries can bring load on cluster slaves > #CPUs - https://phabricator.wikimedia.org/T65222#822245 (Ironholds) [19:49:39] Analytics-General-or-Unknown: Safe-guard against double counting SSL zero traffic - https://phabricator.wikimedia.org/T64980#822246 (Ironholds) [19:50:08] Ironholds: you need legoktm [19:51:27] Analytics-Refinery: Story: Transparently switch from udp2log datafiles over to kafkatee generated datafiles - https://phabricator.wikimedia.org/T70250#822263 (Ironholds) [19:51:32] Analytics-Refinery: Story: AnalyticsEng has kafkatee on analytics1003 - https://phabricator.wikimedia.org/T70246#822264 (Ironholds) [19:53:32] aand done [19:53:37] Analytics-Engineering: Write new Test Script for pipeline to generate visualizations from EL data - https://phabricator.wikimedia.org/T76409#822314 (kevinator) p:Triage>Normal [20:03:34] Engineering-Community, Analytics-Tech-community-metrics, Phabricator: Monthly report of total / active Phabricator users - https://phabricator.wikimedia.org/T1003#822392 (Jdforrester-WMF) [20:29:33] Hey ottomata, let's say I 
wrote 50 TB to the hadoop cluster over the weekend. How bad would that be? [20:29:50] (assuming the write heads can handle the data rate without issue) [20:37:11] I just realized that I can use /mnt/hdfs in my Makefiles! <3 ottomata [20:46:16] Analytics-EventLogging, Mobile-Web: MobileWebClickTracking table is huge and thus querying too slow - https://phabricator.wikimedia.org/T76671#822539 (Jdlrobson) I created some cards to make this table smaller and solve this on short term (see https://trello.com/c/thwAJVcN) but on long term analytics team wil... [21:05:26] (PS5) Nuria: Manage warehouse migrations with alembic [analytics/data-warehouse] - https://gerrit.wikimedia.org/r/177739 [21:11:32] halfak 50 TB! [21:11:52] should be fine, i suppose, just watch how much space you are using, probably keeping 50TB forever is a lot [21:12:17] i'm out for the weekend, latterrrs [21:15:25] FINCH, Analytics-Engineering: Define user segments in a way that Product and Analytics can actually use in database queries - https://phabricator.wikimedia.org/T76908 (Jaredzimmerman-WMF) NEW p:Triage [21:15:44] FINCH, Analytics-Engineering: Define user segments in a way that Product and Analytics can actually use in database queries - https://phabricator.wikimedia.org/T76908#822593 (Jaredzimmerman-WMF) [21:19:52] FINCH, Analytics-Engineering: Define user segments in a way that Product and Analytics can actually use in database queries - https://phabricator.wikimedia.org/T76908#822612 (Jaredzimmerman-WMF) p:Triage>Normal [21:24:24] FINCH, Analytics-Engineering: Define user segments in a way that Product and Analytics can actually use in database queries - https://phabricator.wikimedia.org/T76908#822616 (Ironholds) This is an interesting task. So, at the moment, identifying readers is not easy. We can probably write up some example quer... 
[21:26:31] FINCH, Analytics-Engineering: Define user segments in a way that Product and Analytics can actually use in database queries - https://phabricator.wikimedia.org/T76908#822622 (Jaredzimmerman-WMF) [21:29:56] FINCH, Analytics-Engineering: Define user segments in a way that Product and Analytics can actually use in database queries - https://phabricator.wikimedia.org/T76908#822629 (Jaredzimmerman-WMF) [21:57:50] Analytics-Wikimetrics, Analytics-Engineering: Story: WikimetricsUser deletes user from cohort [21 pts] - https://phabricator.wikimedia.org/T75350#822683 (kevinator) [22:06:11] milimetric: do you have a sec? [22:20:54] Analytics-Wikimetrics, Analytics-Engineering: User reads result of validation after creating a cohort - https://phabricator.wikimedia.org/T76914 (kevinator) NEW p:Normal [22:22:53] Analytics-Wikimetrics, Analytics-Engineering: User reads result of validation after creating a cohort - https://phabricator.wikimedia.org/T76914#822708 (kevinator) [22:24:40] Analytics-EventLogging: Support third-party use by eliminating hard dependency on Varnish - https://phabricator.wikimedia.org/T45601#822723 (EBernhardson) [22:47:29] kevinator: just saw your ping [22:47:31] yep, what's up [22:48:21] milimetric: I was going to ask you for opinions/advice [22:48:29] milimetric: instead I logged a task: https://phabricator.wikimedia.org/T76914 [22:48:39] reading [22:49:09] Analytics-Refinery: Make oozie use system's libpath per default - https://phabricator.wikimedia.org/T70568#822740 (kevinator) p:Triage>Normal [22:49:40] kevinator: agreed with the proposed change [22:49:45] should be easy to do btw [22:49:57] Analytics-Refinery: Respect X-Forwarded-For only from trustworthy sources - https://phabricator.wikimedia.org/T56783#822743 (kevinator) p:Triage>Normal [22:51:20] Analytics-Refinery: Support Wikipedia Zero change from X-CS to X-Analytics - https://phabricator.wikimedia.org/T68546#822750 (kevinator) p:Triage>Normal [22:51:28] Analytics-Refinery: 
Support Wikipedia Zero change from X-CS to X-Analytics - https://phabricator.wikimedia.org/T68546#708948 (kevinator) p:Normal>Low [22:52:29] I would like to apologise for all the mean things I have said about Java. Having spent all day writing C++ makefiles that work on more than my machine: I Get It Now. [22:53:05] Analytics-Refinery: System deletes non-sanction tables older than 90 days - https://phabricator.wikimedia.org/T67949#822759 (kevinator) p:Triage>Normal [22:55:07] Analytics-Refinery: Change license to Apache license, Version 2.0 - https://phabricator.wikimedia.org/T65084#822762 (kevinator) @qchris is this still an issue? Since we have a new cluster & refinery repository I don't know if this still applies. [22:55:18] Analytics-Refinery: Change license to Apache license, Version 2.0 - https://phabricator.wikimedia.org/T65084#822764 (kevinator) [22:55:49] Analytics-Refinery: Make oozie write external stats per default - https://phabricator.wikimedia.org/T70569#822766 (kevinator) [22:56:10] Analytics-Wikimetrics, Analytics-Engineering: Story: WikimetricsUser deletes user from cohort [21 pts] - https://phabricator.wikimedia.org/T75350#822768 (Capt_Swing) Not sure I quite understand how it's weird to group them. Is it weird from the user's perspective, or only ours? I personally hate it when web... [22:56:53] Analytics-Refinery: Decide on job.properties vs. {workflow,coordinator,bundle}.properties - https://phabricator.wikimedia.org/T70570#822770 (kevinator) [22:58:13] Analytics-Refinery: Decide on job.properties vs. {workflow,coordinator,bundle}.properties - https://phabricator.wikimedia.org/T70570#728021 (kevinator) [22:59:21] milimetric: yeah, I thought it would be easy. 
mforns was ready to do it now… but I thought it should be logged and vetted [23:00:12] Analytics-Refinery: MD5 checksums missing from pagecounts-all - https://phabricator.wikimedia.org/T73710#822778 (kevinator) p:Triage>Low [23:06:17] Analytics-Refinery: Kafkatee generated files in /a/log/webrequest not updating since 2014-09-18 - https://phabricator.wikimedia.org/T73290#822783 (kevinator) p:Triage>Normal [23:07:36] Analytics-Refinery: pagecount for api.php - https://phabricator.wikimedia.org/T47121#822785 (kevinator) p:Normal>Low [23:09:29] Analytics-Engineering, Analytics-Refinery: Creating the data in Hadoop from the raw request data, with Oliver's definition - https://phabricator.wikimedia.org/T76762#822787 (kevinator) p:Normal>High [23:10:11] Analytics-Refinery: Raw webrequest partitions that were not marked successful due to deployments gone wrong - https://phabricator.wikimedia.org/T74299#822788 (kevinator) p:Triage>Normal [23:11:00] Analytics-Refinery: Several raw webrequest partitions now marked successful between 2014-10-13T13:xx:xx and 2014-10-13T22:xx:xx - https://phabricator.wikimedia.org/T74028#822790 (kevinator) p:Triage>Normal [23:11:31] Analytics-Refinery: Raw webrequest partitions for 2014-10-07T1[789]:xx:xx not marked successful - https://phabricator.wikimedia.org/T73882#822793 (kevinator) p:Triage>Normal [23:15:05] Analytics-Refinery: Hive stats not working - https://phabricator.wikimedia.org/T63279#822796 (kevinator) Open>declined a:kevinator We have a new cluster since this bug was logged, and I don't think it's an issue anymore or even reproducible. @qchris @ottomata please confirm. 
[23:15:16] Analytics-Refinery: Hive stats not working - https://phabricator.wikimedia.org/T63279#822800 (kevinator) p:Triage>Low [23:15:42] Analytics-Refinery: Duplicates/missing logs from esams bits for 2014-09-28T{18,19,20}:xx:xx - https://phabricator.wikimedia.org/T73435#822802 (kevinator) [23:16:03] Analytics-Refinery: Raw webrequest partitions for 2014-10-08T1[89]:xx:xx not marked successful - https://phabricator.wikimedia.org/T73881#822804 (kevinator) p:Triage>Normal [23:16:23] Analytics-Refinery: Raw webrequest partitions for 2014-10-30T21/1H not marked successful - https://phabricator.wikimedia.org/T74810#822806 (kevinator) [23:17:08] Analytics-Refinery: Raw webrequest partitions that were not marked successful due to depooled servers interfering with monitoring - https://phabricator.wikimedia.org/T74649#822809 (kevinator) p:Triage>Normal [23:18:08] Analytics-Refinery: Find deployment host to deploy refinery from that has neither refinery-hive, nor passwords in hive-site.xml - https://phabricator.wikimedia.org/T76806#822814 (kevinator) p:Triage>Normal [23:19:51] Analytics-Refinery: Kraken unit-tests on Jenkins are failing - https://phabricator.wikimedia.org/T56046#822817 (kevinator) [23:21:25] Analytics-Refinery: Kraken unit-tests on Jenkins are failing - https://phabricator.wikimedia.org/T56046#822820 (kevinator) [23:22:12] Analytics-Refinery: Kraken data flow monitoring not working properly - https://phabricator.wikimedia.org/T52195#822824 (kevinator) p:Triage>Low [23:22:30] Analytics-Refinery: Kraken data flow monitoring not working properly - https://phabricator.wikimedia.org/T52195#577633 (kevinator) [23:29:05] Analytics-Refinery: Story: AnalyticsEng has UDF in Hadoop for UA parsing - https://phabricator.wikimedia.org/T69803#822832 (kevinator) [23:31:42] Analytics-Refinery: Epic: Analyst has Page View Report from hadoop prototype - https://phabricator.wikimedia.org/T70961#822834 (kevinator) Open>Resolved a:kevinator This was done. 
webstatscollector was implemented in Hive and pageview dumps were created, compatible with the old ones. ETL was not ne... [23:34:07] have a nice weekend everyone! [23:51:29] (PS1) Jforrester: Provide some initial raw graphs for basic numbers [analytics/limn-edit-data] - https://gerrit.wikimedia.org/r/177943 [23:52:13] milimetric: Hey, if you happen to be around a sanity-check on https://gerrit.wikimedia.org/r/#/c/177943/ would be hugely appreciated. [23:56:17] Analytics-Engineering, Analytics-Refinery: PageView reports by hive-webstatscollector should return undefined values when data is not available - https://phabricator.wikimedia.org/T76406#822903 (kevinator) [23:56:21] Analytics-Engineering, Analytics-Refinery: PageView reports by hive-webstatscollector should return undefined values when data is not available - https://phabricator.wikimedia.org/T76406#799656 (kevinator) p:Normal>High [23:58:58] (Or nuria__ or Ironholds of course.) [23:59:32] I don't do Limn, I'm afraid. [23:59:50] Ironholds: I'm more interested in if the SQL looks sane.