[00:02:41] spagewmf, we shut stat1001 down, if I recall
[00:02:49] but if you have 1003 access you can get in through bast1001.
[00:03:58] Ironholds! long time no see. If so can you update wikitech? https://wikitech.wikimedia.org/wiki/Datasets.wikimedia.org says "The site is hosted on stat1001"
[00:04:29] ooh, nope, apparently it's still live, just a new machine. my bad.
[00:11:57] Ironholds: Thanks. I don't have access to either {stat1001,stat1003}.wikimedia.org through bast1001, doesn't like my deploy key or my gerrit/labs key.
[00:13:32] then ask ottomata in the morning, is my advice.
[00:16:03] thanks will do (now playing: "Touch ottomata in the morning, then just walk away" by Diana Ross)
[00:21:55] milimetric, stat1002 has rsync to public-datasets, right? Or wrong?
[00:22:14] because, I threw some files/dirs over >10 hours ago, and nada.
[00:24:01] Ironholds: I think stat1003 is the machine
[00:24:03] one sec
[00:24:13] and it shouldn't take longer than 30 min.
[00:24:21] well, stat1002 has a public-datasets folder, and I'm not sure why it should if there's no rsyncing
[00:24:34] i think that one syncs too... hm....
[00:24:35] and if we /don't/ have syncing on 1002, that's a massive blocker on...a lot of work, and we should set it up.
[00:24:47] look at the /readership/ folder, or /enwiki/test.txt
[00:25:08] they're in stat1002:/a/public-datasets/ - they ain't in http://datasets.wikimedia.org/public-datasets/
[00:26:05] Ironholds: yeah, they're not on stat1003, and now I'm forgetting how C & A set this up
[00:26:24] there was some long debate about it... grr.
[00:26:30] i'll try to read puppet if it's urgent
[00:28:13] milimetric, I'm blocked on all the urgent mobile tasks until it's done. So if you could? Although I hate the idea of you staying up late for this :/
[00:28:33] (PS1) Milimetric: [WIP] Transform projectcounts hourly files [analytics/refinery] - https://gerrit.wikimedia.org/r/169974
[00:28:51] no problem, I've built up early-leaving. I'm not sure I can help though I'll try
[00:29:55] okay!
[00:29:57] thanks :)
[00:30:04] (in that case I will be back in 5, making a run to the convenience store)
[00:30:51] i'll just type out what I find as I find it:
[00:31:09] stat1002 is not set up to rsync in the same place stat1003 is (in puppet)
[00:32:53] kk
[00:35:04] * milimetric is about to go into hulk smash mode
[00:37:22] Ironholds: it looks to me as if /a/aggregate-datasets is the place to put stuff on stat1002
[00:37:38] aha. I'll test. Thanks!
[00:37:40] and /a/public-datasets is the place to put stuff on stat1003
[00:37:43] I'm testing Ironholds
[00:37:49] but /a/public-datasets is synced to stat1002
[00:37:51] this is profoundly silly
[00:37:57] grr, Ops of Christmas Past!
[00:38:04] okay! Lemme know what you find?
[00:38:27] yeah, the comments are all out of date on the puppet definition
[00:39:47] i *really* hate that something called aggregate datasets could be made public without any notice in there. Like - at least put a README there, goodness
[00:41:52] Ironholds: http://datasets.wikimedia.org/aggregate-datasets/ should get a little test file soon, but I re-read the puppet and I'm fairly confident that this will work. I'd dump what you have on there and I'll leave myself a message here to ping the opsy folks tomorrow
[00:42:17] milimetric, cool; thanks!
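A minimal sketch of the layout milimetric works out above, assuming the directories and sync behaviour described in the chat (stat1003:/a/public-datasets and stat1002:/a/aggregate-datasets both ending up under datasets.wikimedia.org); the file name and the exact sync cadence are assumptions:

    # on stat1003: files dropped here should appear under http://datasets.wikimedia.org/public-datasets/
    cp readership-summary.tsv /a/public-datasets/       # hypothetical file

    # on stat1002: the synced directory is aggregate-datasets, not public-datasets
    cp readership-summary.tsv /a/aggregate-datasets/    # hypothetical file

    # sanity check once the periodic sync (roughly 30 min per the chat) has run
    curl -I http://datasets.wikimedia.org/aggregate-datasets/readership-summary.tsv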
[06:05:32] Analytics / Wikimetrics: Story: Wikimetrics uses some EL data - https://bugzilla.wikimedia.org/72735 (Kevin Leduc) NEW p:Unprio s:enhanc a:None Move some data in EL to LabsDBs or Data Warehouse so Wikimetrics can use it. The driving use case is generating target-site breakdowns for Vital Si...
[06:06:57] Analytics / Wikimetrics: Story: Wikimetrics uses some EL data - https://bugzilla.wikimedia.org/72735#c1 (Kevin Leduc) p:Unprio>High Collaborative tasking on etherpad: http://etherpad.wikimedia.org/p/analytics-72735
[06:07:27] Analytics / Wikimetrics: Story: Wikimetrics uses some EL data - https://bugzilla.wikimedia.org/72735 (Kevin Leduc)
[06:20:14] Analytics / Wikimetrics: Story: Wikimetrics compiles target-site breakdown on metrics based on MW tags - https://bugzilla.wikimedia.org/72736#c1 (Kevin Leduc) p:Unprio>High Implement breakdowns for existing Vital Signs metrics where we can use Mediawiki tags (mobile, mobile-app or other [desktop]).
[06:20:27] Analytics / Wikimetrics: Story: Wikimetrics compiles target-site breakdown on metrics based on MW tags - https://bugzilla.wikimedia.org/72736#c2 (Kevin Leduc) collaborative etherpad at http://etherpad.wikimedia.org/p/analytics-72736
[06:24:29] Analytics / Wikimetrics: Story: Wikimetrics has connection to Data Warehouse - https://bugzilla.wikimedia.org/72737 (Kevin Leduc) NEW p:Unprio s:enhanc a:None - create new wikimetrics connections to warehouse - puppet changes, vagrant setup, localhost setup
[06:25:42] Analytics / Wikimetrics: Story: Wikimetrics has connection to Data Warehouse - https://bugzilla.wikimedia.org/72737#c1 (Kevin Leduc) p:Unprio>High collaborative tasking on etherpad: http://etherpad.wikimedia.org/p/analytics-72737
[06:31:31] Analytics / Wikimetrics: Story: Wikimetrics compiles target-site breakdown for remaining metrics - https://bugzilla.wikimedia.org/72738 (Kevin Leduc) NEW p:Unprio s:enhanc a:None Daily report of target site breakdown for remaining existing metrics in Vital Signs.
[06:32:59] Analytics / Wikimetrics: Story: Wikimetrics has connection to Data Warehouse - https://bugzilla.wikimedia.org/72737 (Kevin Leduc)
[06:32:59] Analytics / Wikimetrics: Story: Wikimetrics compiles target-site breakdown for remaining metrics - https://bugzilla.wikimedia.org/72738#c1 (Kevin Leduc) p:Unprio>High Collaborative tasking on etherpad: http://etherpad.wikimedia.org/p/analytics-72738
[06:36:59] Analytics / Dashiki: Story: User selects breakdown in Vital Signs - https://bugzilla.wikimedia.org/72739 (Kevin Leduc) NEW p:Unprio s:enhanc a:None According to Pau's design: http://pauginer.github.io/prototypes/analytics-dashboard/index.html Implement a 'button' in the left nav bar to dis...
[06:37:58] Analytics / Dashiki: Story: User selects breakdown in Vital Signs - https://bugzilla.wikimedia.org/72739 (Kevin Leduc)
[06:38:41] Analytics / Dashiki: Story: User selects breakdown in Vital Signs - https://bugzilla.wikimedia.org/72739#c1 (Kevin Leduc) p:Unprio>High Collaborative tasking on etherpad: http://etherpad.wikimedia.org/p/analytics-72739
[06:44:45] Analytics / Dashiki: Story: Vital Signs User selects the Daily or Monthly Pageviews metrics - https://bugzilla.wikimedia.org/72740 (Kevin Leduc) NEW p:Unprio s:enhanc a:None Generate data on the cluster, move it on labsdbs for wikimetrics or make it directly available to Vital Signs There...
[06:46:57] Analytics / Dashiki: Story: Vital Signs User selects the Daily or Monthly Pageviews metrics - https://bugzilla.wikimedia.org/72740#c1 (Kevin Leduc) p:Unprio>High Collaborative tasking on etherpad: http://etherpad.wikimedia.org/p/analytics-72740
[06:55:15] Analytics / EventLogging: List tables/schemas with data retention needs - https://bugzilla.wikimedia.org/72741 (Kevin Leduc) NEW p:Unprio s:enhanc a:None Do the ground work to identify - with product's help - tables/schemas, what they are used for and what sort of data they want to keep bey...
[06:55:42] Analytics / EventLogging: List tables/schemas with data retention needs - https://bugzilla.wikimedia.org/72741 (Kevin Leduc) p:Unprio>Highes
[06:55:43] Analytics / EventLogging: List tables/schemas with data retention needs - https://bugzilla.wikimedia.org/72741 (Kevin Leduc) a:Kevin Leduc
[06:56:56] Analytics / EventLogging: List tables/schemas with data retention needs - https://bugzilla.wikimedia.org/72741#c1 (Kevin Leduc) - Aaron can help review schemas, he could show which tables contain sensitive data - Talk to individual teams to identify what they need
[06:58:43] Analytics / EventLogging: Automate purge of raw logs older than 90 days - https://bugzilla.wikimedia.org/72742 (Kevin Leduc) NEW p:Unprio s:enhanc a:None - ops task - set up log polling mechanism with deletion (puppet change)
[06:58:56] Analytics / EventLogging: Automate purge of raw logs older than 90 days - https://bugzilla.wikimedia.org/72742 (Kevin Leduc) p:Unprio>High
[07:05:59] Analytics / EventLogging: Automate pruning of sampled logs after 90 days - https://bugzilla.wikimedia.org/72743 (Kevin Leduc) NEW p:Unprio s:enhanc a:None - similar to pruning raw logs after 90 days. Needs a cron running on stat1002, Oxygen and wherever else the data exists.
[07:06:27] Analytics / EventLogging: Automate pruning of sampled logs after 90 days - https://bugzilla.wikimedia.org/72743#c1 (Kevin Leduc) Reposting a comment Christian made in a google doc: "It will probably be a puppet change too, but it might be a different one. Like cron vs. logrotate. I am not sure about a...
[07:06:41] Analytics / EventLogging: Automate pruning of sampled logs after 90 days - https://bugzilla.wikimedia.org/72743 (Kevin Leduc) p:Unprio>High
[07:11:14] Analytics / EventLogging: Automate purge of rows older than 90 days for select tables/schemas - https://bugzilla.wikimedia.org/72744 (Kevin Leduc) NEW p:Unprio s:enhanc a:None - don't turn it on, just make it available per schema. This is already done PER Table by sean, not per column - Thi...
[07:11:28] Analytics / EventLogging: Automate purge of rows older than 90 days for select tables/schemas - https://bugzilla.wikimedia.org/72744 (Kevin Leduc) p:Unprio>High
[07:14:12] Analytics / EventLogging: Story: User clicks on link to event capsule schema while viewing a schema - https://bugzilla.wikimedia.org/72745#c1 (Kevin Leduc) p:Unprio>Normal s:normal>enhanc On schema pages (for example) https://meta.wikimedia.org/wiki/Schema:NewEditorEdit Add a link to the ev...
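The purge bugs above (72742/72743) describe a cron-driven cleanup on stat1002, Oxygen and friends; a rough sketch of what such a job could look like, where the log path, file pattern and schedule are assumptions and the real change would be a puppetized cron (or a logrotate rule, per the comment quoted above):

    # hypothetical crontab entry: once a day, delete EventLogging log files older than 90 days
    0 4 * * * find /srv/log/eventlogging -type f -name '*.log*' -mtime +90 -delete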
[07:14:56] Analytics / EventLogging: Story: User clicks on link to event capsule schema while viewing a schema - https://bugzilla.wikimedia.org/72745#c2 (Kevin Leduc) Collaborative tasking on etherpad: http://etherpad.wikimedia.org/p/analytics-72745
[07:37:14] Analytics / Wikimetrics: Story: WikimetricsUser tags a cohort using a pre-defined tag - https://bugzilla.wikimedia.org/72746 (Kevin Leduc) NEW p:Unprio s:enhanc a:None The story is here https://www.mediawiki.org/wiki/Analytics/Wikimetrics/Stories#Tag_your_own_cohort_with_existing_tags tran...
[07:37:43] Analytics / Wikimetrics: Story: WikimetricsUser tags a cohort using a pre-defined tag - https://bugzilla.wikimedia.org/72746 (Kevin Leduc) p:Unprio>Highes
[07:41:44] Analytics / Wikimetrics: Story: WikimetricsUser reads user names in a JSON report - https://bugzilla.wikimedia.org/72747 (Kevin Leduc) NEW p:Unprio s:enhanc a:None Story is here: https://www.mediawiki.org/wiki/Analytics/Wikimetrics/Stories#List_usernames_in_reports_that_include_individual_r...
[07:42:13] Analytics / Wikimetrics: Story: WikimetricsUser reads user names in a JSON report - https://bugzilla.wikimedia.org/72747 (Kevin Leduc) p:Unprio>Highes
[10:09:36] Analytics / Refinery: Raw webrequest upload partition for 2014-10-29T14/1H not marked successful - https://bugzilla.wikimedia.org/72756 (christian) NEW p:Unprio s:normal a:None The upload webrequest partition [1] for 2014-10-29T14/1H has not been marked successful. What happened? [1] ___...
[10:09:53] Analytics / Refinery: Raw webrequest upload partition for 2014-10-29T14/1H not marked successful - https://bugzilla.wikimedia.org/72756#c1 (christian) NEW>RESO/FIX Commit 37228258e8680aa035206e8b89eaa9f57b28555f got merged, which updated the varnishkafka configuration for the upload caches. This...
[10:10:06] Analytics / Refinery: Raw webrequest partitions that were not marked successful due to configuration updates - https://bugzilla.wikimedia.org/72300 (christian)
[10:12:06] !log Marked raw upload webrequest partition for 2014-10-29T14/1H ok (See {{bug|72756}})
[13:58:27] Analytics / EventLogging: Add test flag to EventLogging - https://bugzilla.wikimedia.org/72365#c6 (christian) NEW>RESO/WON (In reply to nuria from comment #5) > [...] I see little value in adding a test flag > and I am of the opinion that we should not do it. Same here. (Hence, being bold and R...
[14:02:22] haha
[14:33:31] https://docs.google.com/a/wikimedia.org/spreadsheets/d/1UEnlRIRKKGBhQluyWUiCiwyn2pX9DSzK8kxfBw3kdDk/edit#gid=1717503637
[14:38:43] Analytics / Wikimetrics: report table performance, cleanup, and number of items - https://bugzilla.wikimedia.org/72635#c1 (nuria) There are several ways to go about this: #1. Purge from db anything older than 30 days that is not a recurrent report. This can be done via a scheduler task #2 do not wri...
[14:41:41] Analytics / Wikimetrics: Story: WikimetricsUser tags a cohort using a pre-defined tag - https://bugzilla.wikimedia.org/72746 (Kevin Leduc)
[14:45:12] Analytics / EventLogging: Automate pruning of sampled logs after 90 days - https://bugzilla.wikimedia.org/72743#c2 (christian) (In reply to Kevin Leduc from comment #1) > Reposting a comment Christian made in a google doc: > [...]
> * gadolinium (might be a on-time thing there), and s/on-time/one-time/
[14:47:11] Analytics / Wikimetrics: Story: WikimetricsUser reads user names in a JSON report - https://bugzilla.wikimedia.org/72747 (Kevin Leduc)
[14:50:56] Analytics / Wikimetrics: report table performance, cleanup, and number of items - https://bugzilla.wikimedia.org/72635 (Kevin Leduc)
[14:51:41] Analytics / Wikimetrics: report table performance, cleanup, and number of items - https://bugzilla.wikimedia.org/72635#c2 (nuria) We estimated #2, please have in mind recurrent reports need to be working as they are today.
[15:22:21] nuria__: hey! we've statsd running on labmon1001.eqiad.wmnet and you can send metrics to that from your application using any statsd client library
[15:26:00] YuviPanda: can you write a little 1 pager on how to get the statsd client on your lab instance and a dummy "hello world" example? When I send data to graphite in prod i found testing it was not that easy.
[15:26:48] ah, hmm. usually you just use a library (like https://pypi.python.org/pypi/python-statsd) and specify the host
[15:28:45] YuviPanda: but you can 1) do it directly (pip statsd) 2) have puppet install statsd for you so it's ready
[15:29:18] nuria__: statsd client library, you mean?
[15:29:59] YuviPanda: yes, also it would be nice to have the graphite endpoint configured on puppet ( asking here like it's x-mas...)
[15:30:12] nuria__: I'm somewhat confused now...
[15:30:20] nuria__: what do you mean by 'graphite endpoint configured on puppet'?
[15:30:34] nuria__: if that is 'send to different hosts based on labs or prod' that is already there...
[15:31:33] YuviPanda: maybe graphite endpoint is not a good description, rather the node endpoint that listens for your counters
[15:32:01] YuviPanda: that came out not so clear either..ahem...let me see if i can find the example
[15:32:08] in the puppet files in prod
[15:34:24] YuviPanda: not sure if something like this exists in puppet for labs already
[15:34:57] https://www.irccloud.com/pastebin/4EgtEcGM
[15:35:31] "my=instance" should be myinstance
[15:59:04] qchris: ok
[15:59:07] so, to understand
[15:59:19] in general, we miss kafkatee lines near the beginning of hours.
[15:59:20] right?
[15:59:23] right
[15:59:27] first few seconds of each hour
[15:59:36] and it doesn't matter the host or datacenter
[15:59:38] I mean ... I did not look elsewhere
[15:59:47] so we at least miss it there.
[16:00:02] right, and, as far as we can tell those lines are in hive
[16:00:05] so they are in kafka
[16:00:10] Right, it doesn't matter the host or datacenter
[16:00:15] which, would make sense if it happens to all hosts at the same time
[16:00:23] sounds like either kafkatee or something on analytics1003 then
[16:00:37] the only thing that I know that happens hourly on analytics1003 is webstatscollector
[16:01:21] Does kafka not guarantee to produce to the consumer?
[16:01:38] the consumer is responsible for consuming
[16:01:52] if the messages are in kafka (which we are sure they are), then it has to be downstream from kafka
[16:01:54] I'd count that as a kafkatee issue then.
[16:01:56] yes
[16:02:21] i somehow doubt that collector is the problem
[16:02:24] but, shall I stop it anyway?
[16:02:26] just in case?
[16:02:33] Would be a nice way to test.
[16:02:38] ok
[16:03:04] It does not look implausible ... after all ... a few hundred MB are written in short time.
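Going back to the statsd question earlier in the hour: a minimal "hello world" of the kind nuria asks for, assuming the labmon1001.eqiad.wmnet host YuviPanda names and the conventional statsd UDP port 8125; the metric name is made up, and python-statsd (linked above) would be the client-library route for real application code:

    # statsd speaks a tiny plaintext protocol over UDP: "<name>:<value>|<type>"
    # this fires a single counter increment; it should surface in graphite after statsd's next flush
    echo -n "myinstance.hello_world:1|c" | nc -u -w1 labmon1001.eqiad.wmnet 8125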
[16:03:13] milimetric, nuria, kevinator: trying to reconnect
[16:07:36] qchris: but it is not all hours that miss data during those first few seconds
[16:07:37] right?
[16:07:46] just, usually, if data is missing, that is where it is missing from
[16:08:02] I did not check that. Instead, I checked if for some second
[16:08:23] (where udp2log files have messages), there are no messages in the kafkatee files
[16:08:33] i am reading the manpage for ts for the first time.....:o
[16:08:44] And for those seconds, I checked if there are missing timestamps.
[16:08:46] :-P
[16:09:13] that inline function thingie is quite nice to fight DRY
[16:09:30] ja, what is <( )???
[16:09:34] that is new to me too!
[16:09:39] bash redirection.
[16:09:43] from a subshell?
[16:09:47] Yes.
[16:09:54] The thing within <( ... )
[16:09:55] diff will take two streams of stdin?
[16:10:10] i guess not stdin.
[16:10:16] gets written to a tmp file, and the name of that tmp file gets substituted there.
[16:10:22] does it open two fds?
[16:10:23] whoa.
[16:10:27] yen.
[16:10:35] s/yen/yes, 2 fds/
[16:10:42] your foo is great.
[16:10:46] :-D
[16:11:08] OH
[16:11:10] bash grew a lot the last years ... so I never got around to learn zsh.
[16:11:10] hahah
[16:11:23] i somehow missed the ts() func declaration
[16:11:23] Now I am a lamer, as I still use bash :-(
[16:11:29] i was reading
[16:11:30] man ts
[16:11:33] and all like
[16:11:38] what is going on here!?
[16:11:43] ts - Time Stamping Authority tool (client/server)
[16:11:49] whoa :-D
[16:11:54] I did not know that existed.
[16:12:03] me neither
[16:12:27] During prototyping I just use "X" as function name ... but I figured that is less descriptive for timestamps.
[16:12:51] aye
[16:12:53] I thought you were kidding before when you said you're just reading "man ts"
[16:14:28] ok, webstats not running on an03 anymore
[16:14:42] Great!
[16:16:54] ha, qchris, it would be nice if we had the kafka partition...and maybe even the kafkaoffset, of each message in the json
[16:16:55] :)
[16:17:22] hm, guess we can't know the offset...can we? dunno.
[16:17:33] yes, that would help I guess.
[16:18:09] partition would be nice, because then we could immediately see if there are problems with certain partitions
[16:18:15] that would be easy to add, I think
[16:18:17] maybe...
[16:18:17] hm
[16:18:24] we use a random partitioner, so, not sure
[16:18:30] might have to be an rdkafka feature?
[16:18:32] dunno.
[16:18:34] will ask snaps sometime.
[16:18:52] It would certainly help debugging things.
[16:19:16] But wouldn't that mean that kafka would have to mangle the message?
[16:19:35] no, the partition is selected by the producer
[16:19:38] Or would camus/kafkatee add that information on the fly?
[16:19:40] Oh.
[16:19:47] varnishkafka then?
[16:20:12] Mhmmm.
[16:21:32] Varnishkafka just passes it to librdkafka.
[16:21:56] yes
[16:21:59] So varnishkafka does not know what partition it sends to.
[16:22:02] either varnishkafka or librdkafka would do it
[16:22:11] yeah, varnishkafka would have to know by librdkafka, i suppose
[16:22:16] dunno, it probably has a way to select or know
[16:22:17] who knows
[16:22:18] but ja
[16:22:32] camus probably could add that on the fly
[16:22:37] if we worked hard enough to make it do so :)
[16:22:43] Right.
[16:23:17] Hahahaha ... but I guess there are bigger fish to fry :-(
[16:28:06] so, hm, are there more things we can troubleshoot with this, or do we wait another 24 hours and then check again?
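To make the exchange above concrete: a small bash sketch of the two features being puzzled over, an inline function plus process substitution; the ts() body and the file names are hypothetical, not the actual ones from qchris's script:

    # hypothetical helper: pull out and sort the timestamp column of a tsv
    ts() { cut -f3 "$1" | sort; }

    # <( ... ) runs a command in a subshell and substitutes a /dev/fd/NN (or temp FIFO) path,
    # so diff sees two file arguments -- i.e. two fds -- rather than stdin
    diff <(ts udp2log-sampled.tsv) <(ts kafkatee-sampled.tsv)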
[16:29:56] IIRC, you checked field #1 in the tsvs (hostname), which showed issues with field #3 (timestamps)
[16:30:05] All the other fields are not yet vetted.
[16:30:30] But those will just suffer the same missing lines.
[16:31:04] yeah, if we have one issue we'll see the others
[16:31:17] I guess it's fine to wait over the weekend.
[16:31:26] Then there will be full, good tsvs.
[16:31:42] well, this seems to happen enough times in 24 hours
[16:31:44] I guess one could look at the sampled ones (if you have time)
[16:31:45] according to your email
[16:32:45] Yes, it would happen often enough. But the tsvs only become available at 6:30.
[16:32:58] Especially the ones from the production udp2log instances.
[16:33:07] aye, so tomorrow?
[16:33:35] Tomorrow, there'll only be a file that is good after 16:00.
[16:33:50] On Saturday, we'd have a full good file (hopefully).
[16:33:58] that is nowish!
[16:34:00] tomorrow
[16:34:04] haha
[16:34:09] tomorrow it is :-D
[16:34:53] I mean ... we could look at the file on analytics1003 in one hour. If it shows the holes
[16:35:19] for 17:00:10 ... then turning off webstatscollector did not help :-)
[16:36:02] oh, true. ah, but if it does not show the holes then we have to wait
[16:36:05] yeah, let's check
[16:36:08] if we see holes, we keep thinking
[16:36:50] k
[17:10:41] Analytics / EventLogging: List tables/schemas with data retention needs - https://bugzilla.wikimedia.org/72741 (Dan Andreescu)
[17:11:43] Analytics / Wikimetrics: report table performance, cleanup, and number of items - https://bugzilla.wikimedia.org/72635 (Dan Andreescu)
[17:11:43] Analytics / Wikimetrics: Story: WikimetricsUser tags a cohort using a pre-defined tag - https://bugzilla.wikimedia.org/72746 (Dan Andreescu)
[17:11:56] Analytics / EventLogging: database consumer could batch inserts (sometimes) - https://bugzilla.wikimedia.org/67450 (Dan Andreescu)
[17:12:42] Analytics / EventLogging: Automate purge of raw logs older than 90 days - https://bugzilla.wikimedia.org/72742#c1 (nuria) NEW>RESO/DUP *** This bug has been marked as a duplicate of bug 72642 ***
[17:12:42] Analytics / EventLogging: Story: Identify and direct the purging of Event logging raw logs older than 90 days in stat1002 - https://bugzilla.wikimedia.org/72642#c2 (nuria) *** Bug 72742 has been marked as a duplicate of this bug. ***
[17:17:57] Analytics / EventLogging: Story: Identify and direct the purging of Event logging raw logs older than 90 days in stat1002 - https://bugzilla.wikimedia.org/72642 (Kevin Leduc) p:Unprio>High s:normal>enhanc
[17:21:12] Analytics / Dashiki: Story: Vital Signs User selects the Daily or Monthly Pageviews metrics - https://bugzilla.wikimedia.org/72740 (Dan Andreescu)
[17:23:59] Analytics / EventLogging: Create staging environment - https://bugzilla.wikimedia.org/72767 (Toby Negrin) NEW p:Unprio s:normal a:None Currently we don't have an environment where we can test event logging backend changes. This is risky and non-optimal.
[17:40:33] nuria, milimetric: hey guys, we are deploying in 20 mins, right?
[17:40:46] or you already started?
[17:43:49] YuviPanda: yt?
[17:43:56] I had a dream
[17:44:14] and you were in it, and quarry too
[17:45:12] and the dream said we should have a private instance of quarry for internal use (with access to EventLogging data)
[17:46:12] this could be used as a shared sandbox for people to play with the schemas they own, as a way of prototyping Vital Signs queries but also as an easy way to share queries within the org
[17:48:55] DarTar, you have weird dreams
[17:49:12] Ironholds: I know, I’m working on that
[17:50:31] Ironholds: my brain is trying to tell me how to make myself (and my team) less and less involved in ad-hoc data requests
[17:51:16] DarTar, now if we can just start having dreams about not giving me engineering projects ;p
[17:51:45] I’ll add that to the dream queue for tonight
[17:51:53] ta
[17:53:10] mforns: I was about to deploy right now
[17:53:30] I'm going to hop in the hangout to do it - cc: nuria__
[17:56:35] milimetric: be there
[17:57:15] ok
[18:00:19] DarTar: heh :)
[18:00:30] DarTar: it's easy to set up, needs a machine (from toby)
[18:00:38] hi toby
[18:01:05] oops, he left his desk :-/
[18:01:29] heh
[18:02:39] DarTar: yo
[18:02:55] do we have a box for YuviPanda ?
[18:03:13] we have a meeting
[18:03:18] ah
[18:03:20] right
[18:03:22] coming
[18:03:47] nuria__: sorry had gone away. I've no idea what https://www.irccloud.com/pastebin/4EgtEcGM means :|
[18:04:13] Analytics / Refinery: Spike: Assess feasibility and effort to add fields to webrequest logs - https://bugzilla.wikimedia.org/72651#c3 (ewulczyn) Another thing that came up in our research group meeting today is to add the browser session cookie. I added this to the etherpad.
[18:07:39] ottomata: can you merge https://gerrit.wikimedia.org/r/170103
[18:21:12] mforns / nuria__: he merged it, but now my burger's getting cold - let's reconvene in 20 min?
[18:21:17] thanks ottomata !
[18:21:19] hehe
[18:21:29] of course
[18:33:02] yup!
[18:33:45] milimetric, mforns: have a meeting in 10 mins
[18:33:52] will join right after
[18:33:55] ok
[18:48:17] mforns:
[18:48:24] let's do it now?
[18:48:25] yep
[18:48:32] im in the batcave
[18:54:43] Analytics / Refinery: Raw webrequest bits partition for 2014-10-26T21/1H not marked successful - https://bugzilla.wikimedia.org/72548#c4 (christian) NEW>RESO/FIX ottomata had a look at the logs on cp3019 and said that there were produce errors about full buffers. So we're writing it off as tempor...
[18:54:57] Analytics / Refinery: Raw webrequest partitions that were not marked successful due to network issues - https://bugzilla.wikimedia.org/72298 (christian)
[18:54:58] Analytics / Refinery: Raw webrequest partitions that were not marked successful - https://bugzilla.wikimedia.org/70085 (christian)
[19:18:20] milimetric: are you done deploying?
[19:19:02] cc mforns
[19:19:49] nuria__: yep
[19:20:30] milimetric: did you delete pages created and start fresh?
[19:20:48] nuria__: so I was going to wait to hear from kevinator about that
[19:20:52] but he's not on here
[19:21:00] 'cause i want to sync up with his announcement
[19:21:34] ok, if you did not send an e-mail i can do that
[19:21:40] to kevinator i mean
[19:21:57] cause leaving the other metrics with bots data i do not think is a big deal,
[19:22:06] but the "pages created" changed in nature
[19:23:25] qchris_away: yt?
[19:23:26] no, you are away!
[19:25:52] ottomata: i don't know if you've seen osquery yet, but I was just playing with it and I'm having fun doing opsy things...
so that says a lot: https://github.com/facebook/osquery/wiki/using-osqueryi
[19:26:35] nuria__: yea, i was just going to wait for him - he's probably just at lunch
[19:27:02] whoa
[19:27:30] that's awesome
[19:39:02] nuria__, milimetric: I was about to send an email to kevinator, but if you want to tell him about pages created fresh start..
[19:41:11] no rush mforns / nuria__ we can wait 'till he's back
[19:41:22] the recurrent stuff runs at 3 am, we got time
[19:41:36] ok
[19:48:14] milimetric, mforns, ottomata: taking long lunch (spanish style), will be back in couple hours
[19:49:26] enjoy!
[19:53:25] qchris_away: fyi, i see about 175 more lines in kafkatee for the 10-30T17 hour for the zero logs,
[19:53:27] good sign
[19:53:51] kafkatee has all logs that udp2log has for that hour for zero logs
[19:59:49] Now ottomata is gone :-/
[20:00:33] nuria__: not sure if my email made it to your internal mailing list
[20:10:23] tnegrin: not sure if my email made it to your internal mailing list
[20:11:18] YuviPanda: /me checks email ...
[20:12:20] YuviPanda: I received three recent emails from you via the internal list.
[20:12:33] So it seems they made it to the internal list.
[20:12:57] qchris: ah, cool :)
[20:18:07] (CR) Milimetric: [WIP] Add schema for edit fact table (1 comment) [analytics/data-warehouse] - https://gerrit.wikimedia.org/r/167839 (owner: QChris)
[20:19:32] (CR) Milimetric: [WIP] Add schema for edit fact table (1 comment) [analytics/data-warehouse] - https://gerrit.wikimedia.org/r/167839 (owner: QChris)
[20:47:40] (PS1) Mforns: Avoid exception accessing unknown project database [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/170152 (https://bugzilla.wikimedia.org/72582)
[20:53:57] Analytics / Wikimetrics: report table performance, cleanup, and number of items - https://bugzilla.wikimedia.org/72635 (Marcel Ruiz Forns) NEW>ASSI a:Marcel Ruiz Forns
[20:56:11] kevinator: question for you
[20:56:18] hi
[20:56:25] we deployed
[20:56:32] mforns was going to ping you
[20:56:45] but then we realized we didn't fully talk through how we were going to re-generate reports
[20:56:51] or maybe we did and I forgot the conclusion
[20:57:02] so we have changes for pages / edits (namespace 0 is no longer the default)
[20:57:07] kevinator: I wrote to you in private
[20:57:17] and changes for the "rolling" metrics (bots are filtered out)
[20:57:25] for both of those we could:
[20:57:32] * delete all old reports and regenerate
[20:57:41] * just let the system start using the new definitions as of today
[20:57:55] when making that decision, note: we will regenerate everything most likely once we have the DW
[20:58:54] Let’s keep data for the time being… and watch if we see a noticeable step on the dashboard.
[20:59:17] Once we get the DW we can re-generate everything
[21:00:30] Should Vital Signs get a mailing list for these kinds of announcements: new metric definition, changes in data…
[21:01:18] If I hear people getting confused about the data, we can regenerate everything sooner.
[21:03:04] ok
[21:17:41] kevinator: that sounds like a good plan.
cc: nuria__ ^ few lines above from kevin
[21:17:49] basically: no regeneration right now
[21:18:30] i mean, everyone will be confused if they try to look at the data
[21:18:40] but as far as I know nobody's using this as their primary data source yet
[21:31:57] Analytics / Wikimetrics: Apache's logs containing "client denied by server configuration: /srv/wikimetrics/wikimetrics/api.wsgi" - https://bugzilla.wikimedia.org/71606#c9 (Dan Andreescu) PATC>RESO/FIX deployment train picked this up choo choo!
[21:33:30] :-)
[23:02:58] kevinator: so we are on the same page: we shall get a jump on the dashboard for sure
[23:03:19] kevinator: for bots, numbers will go down
[23:03:34] kevinator: for pages created we are counting all pages thus they will go up
[23:04:08] nuria__, milimetric: I was just thinking the ideal way to let people know why there’s a jump: annotations ;-)
[23:04:41] kevinator: right, until then I think we probably need a public log
[23:46:00] (CR) Nuria: Avoid exception accessing unknown project database (2 comments) [analytics/wikimetrics] - https://gerrit.wikimedia.org/r/170152 (https://bugzilla.wikimedia.org/72582) (owner: Mforns)