[00:16:13] Nettrom: That's what I'm doing, too. It's gross. I think the proper solution would be switching the partition to a string: "YYYY-MM-DD:HH"
[00:16:26] But that is probably a painful change :)
[06:52:18] (03PS1) 10Shilad Sen: WIP: Spark job to create page ids viewed in each session [analytics/refinery/source] (nav-vectors) - 10https://gerrit.wikimedia.org/r/381169 (https://phabricator.wikimedia.org/T174796)
[07:19:36] joal: morning!
[07:19:51] I have a theory for the mess that happened yesterday to the history server
[07:22:32] I checked the metrics and it seems that the heap pressure was already at 80/90% right before the event, and then $something triggered more allocation that ended up causing more GC pressure and work to do minor/major collections
[07:22:55] and eventually GC overhead -> daemon kaput
[07:23:55] after the explanation that you gave us yesterday during standup about what the history server does, I am wondering if huge jobs that allocate a ton of containers (that need to register themselves when they finish, etc.) could cause peaks of heap utilization
[07:24:07] the current setting is Xmx1g, not enough
[07:25:58] we could go to 2g, but since we have a ton of space on an1001 not utilized, I'd go for 4g
[07:26:33] so if my theory is correct, we should be ok when huge jobs kick in and the cluster is already busy
[07:29:54] just merged the puppet change :P
[07:30:02] need to restart the history server though
[08:36:48] Thanks a lot elukey :)
[08:36:53] That is great analysis :)
[08:37:23] I have double checked jobs this morning, everything seems back on track, and with the change to 4G heap for the history server, we should be on the safe side
[08:38:02] super :)
[08:38:19] do you think that we could restart it now? Or maybe let the cluster drain?
[08:43:51] elukey: I'd suspend camus, wait for the drain, then restart
[08:44:06] We've seen that running jobs don't like the history server dying in the middle
[08:47:41] all right let's do it
[08:48:52] Ok !!!
[08:49:02] camus disabled
[08:49:19] elukey: We have a job from bearloga :(
[08:49:35] elukey: It should be small
[08:50:48] elukey: One webrequest load job still to finish
[08:50:55] elukey: You have time for coffee ;)
[08:51:50] there is always time for coffee!
[08:52:00] :D
[09:44:49] hellooooo
[09:49:20] elukey: the jobs from bearloga are automated by report updater, we won't manage to drain the cluster - let's restart the history server now
[09:49:23] Hi mforns :)
[09:51:41] !log restart mapreduce history server on an1001 to apply new heap settings (Xmx/Xms to 4g)
[09:51:42] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[10:01:17] joal: all good from my side, ok to re-enable camus?
[10:01:38] elukey: YES !
[10:01:44] Thanks a lot elukey
[10:02:31] !log re-enabled camus after maintenance
[10:02:32] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[10:03:09] joal: take a look at the current heap size of the namenodes - https://grafana.wikimedia.org/dashboard/db/analytics-hadoop?panelId=4&fullscreen&orgId=1
[10:03:42] no more old gen collections \o/
[10:03:46] https://grafana.wikimedia.org/dashboard/db/analytics-hadoop?panelId=4&fullscreen&orgId=1&from=now-7d&to=now
[10:03:58] Looks like something happened yesterday :)
[10:04:03] it is like "yessss more spaceeeee"
[10:04:55] :)
[10:05:05] the datanodes' heap utilization is a bit high and I can see old gen collections, but it is usually like that..
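(For reference, the heap-pressure numbers discussed above can be read straight off a Hadoop daemon's JMX servlet. Below is a minimal sketch, assuming the history server's web UI sits on its default port 19888 and that "an1001" expands to analytics1001.eqiad.wmnet — both are assumptions for illustration, not confirmed by this log.)

```python
# Minimal sketch: read JVM heap usage from a Hadoop daemon's JMX endpoint.
# Host and port are illustrative assumptions (MapReduce history server web
# UI defaults to 19888); adjust for the actual deployment.
import json
import urllib.request

JMX_URL = "http://analytics1001.eqiad.wmnet:19888/jmx?qry=java.lang:type=Memory"

with urllib.request.urlopen(JMX_URL) as response:
    beans = json.load(response)["beans"]

# The java.lang:type=Memory bean exposes HeapMemoryUsage with used/max bytes.
heap = beans[0]["HeapMemoryUsage"]
print("heap used: {:.0%} of max".format(heap["used"] / heap["max"]))
```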
[10:05:11] (4g would be nice in there :P)
[10:05:39] :)
[10:05:48] elukey: What prevents us from doing so?
[10:08:05] joal: theoretically nothing, I am seeing space on the worker nodes (old and new gen) - https://grafana.wikimedia.org/dashboard/file/server-board.json?var-server=analytics1060&refresh=1m&orgId=1
[10:08:53] elukey: looking back a month, for that machine, looks like there is enough space to bump it
[10:12:37] good news, the kafka jumbo cluster should have prometheus metrics very soon
[10:20:42] Great elukey :)
[10:20:54] elukey: sorry for bothering you - how are we moving forward with Druid?
[10:21:37] joal: no bother :) - My plan was to re-review one of the huge puppet changes that andrew made to refactor our codebase and allow multiple cluster definitions
[10:21:43] and then merge it after lunch
[10:21:55] after that, there will be another code review to split the clusters
[10:21:57] elukey: looks awesome :)
[10:22:02] but it will require manual work
[10:22:08] ok
[10:22:26] let me know if I can help with the manual work (I'm no good at puppet, but I can move stuff around)
[10:28:05] joal: sure! I have to warn you though that Andrew stated clearly that we'll probably not make it in time for the EOQ
[10:28:30] elukey: I know, I just continue to push to make it fast :)
[10:28:37] even if not fast enough for EOQ
[10:28:47] okok :)
[10:30:32] * elukey lunch!
[12:07:18] * fdans chinese food for lunch!
[12:12:35] Taking a
[12:12:39] break
[12:12:43] !
[12:26:11] (03PS1) 10Addshore: instanceof.php short sleep to avoid rate limit [analytics/wmde/scripts] - 10https://gerrit.wikimedia.org/r/381207 (https://phabricator.wikimedia.org/T176577)
[12:26:29] (03PS1) 10Addshore: instanceof.php short sleep to avoid rate limit [analytics/wmde/scripts] - 10https://gerrit.wikimedia.org/r/381208 (https://phabricator.wikimedia.org/T176577)
[12:26:34] (03CR) 10Addshore: [C: 032] instanceof.php short sleep to avoid rate limit [analytics/wmde/scripts] - 10https://gerrit.wikimedia.org/r/381208 (https://phabricator.wikimedia.org/T176577) (owner: 10Addshore)
[12:26:36] (03CR) 10Addshore: [C: 032] instanceof.php short sleep to avoid rate limit [analytics/wmde/scripts] - 10https://gerrit.wikimedia.org/r/381207 (https://phabricator.wikimedia.org/T176577) (owner: 10Addshore)
[12:26:42] (03Merged) 10jenkins-bot: instanceof.php short sleep to avoid rate limit [analytics/wmde/scripts] - 10https://gerrit.wikimedia.org/r/381208 (https://phabricator.wikimedia.org/T176577) (owner: 10Addshore)
[12:26:45] (03Merged) 10jenkins-bot: instanceof.php short sleep to avoid rate limit [analytics/wmde/scripts] - 10https://gerrit.wikimedia.org/r/381207 (https://phabricator.wikimedia.org/T176577) (owner: 10Addshore)
[13:02:08] joal, about to merge Andrew's refactoring for Druid
[13:22:22] heyall
[13:26:12] helooo
[13:31:20] fyi people I am working on druid1001
[13:31:32] let me know if you are doing anything like restarts etc..
[13:31:33] :)
[13:46:48] 10Analytics, 10Proton, 10Readers-Web-Backlog, 10Patch-For-Review, 10Readers-Web-Kanban-Board: Implement Schema:Print purging strategy - https://phabricator.wikimedia.org/T175395#3592394 (10pmiazga) @ovasileva - do we need to store also the sessionToken (to detect how many prints we have per session)? It'...
[14:03:04] 10Analytics, 10Proton, 10Readers-Web-Backlog, 10Patch-For-Review, 10Readers-Web-Kanban-Board: Implement Schema:Print purging strategy - https://phabricator.wikimedia.org/T175395#3592394 (10Jdlrobson) This has been sitting here for a week. To get code review we'll need to be a little more proactive. Have...
[14:52:05] 10Analytics, 10Operations, 10Traffic: Invalid "wikimedia" family in unique devices data due to misplaced WMF-Last-Access-Global cookie - https://phabricator.wikimedia.org/T174640#3643098 (10ema) p:05Triage>03Normal
[14:53:37] (03CR) 10Nuria: WIP: Spark job to create page ids viewed in each session (031 comment) [analytics/refinery/source] (nav-vectors) - 10https://gerrit.wikimedia.org/r/381169 (https://phabricator.wikimedia.org/T174796) (owner: 10Shilad Sen)
[14:53:49] Shilad: I added some comments to your CR
[14:54:07] Shilad: we can talk about it in more detail if they do not make sense
[14:57:54] Shilad: the number of different signatures that you get in a day (per domain) should be on the order of magnitude of unique devices per day per domain, but i think your methodology will return a smaller number
[15:00:17] ping mforns joal milimetric
[15:00:24] hello
[15:00:28] oh!
[15:09:13] hangouts kicked me out again!
[15:23:50] 10Analytics: Add action api counts to graphite-restbase job - https://phabricator.wikimedia.org/T176785#3643192 (10mforns)
[15:24:06] 10Analytics-Kanban: Add action api counts to graphite-restbase job - https://phabricator.wikimedia.org/T176785#3636597 (10mforns)
[15:27:13] 10Analytics, 10Proton, 10Readers-Web-Backlog, 10Patch-For-Review, 10Readers-Web-Kanban-Board: Implement Schema:Print purging strategy - https://phabricator.wikimedia.org/T175395#3643229 (10mforns)
[15:30:16] 10Analytics-Cluster, 10Analytics-Kanban: CamusPartitionChecker does not work when topic names have '.' or '-' in them. - https://phabricator.wikimedia.org/T171099#3643246 (10mforns)
[15:30:56] 10Analytics-Cluster, 10Analytics-Kanban: CamusPartitionChecker does not work when topic names have '.' or '-' in them. - https://phabricator.wikimedia.org/T171099#3454060 (10mforns)
[15:35:44] 10Analytics, 10EventBus, 10Wikimedia-Stream: Hits from private AbuseFilters aren't in the stream - https://phabricator.wikimedia.org/T175438#3593335 (10mforns) Hi @Nirmos, I'm not super familiar with the AbuseFilters. Can you please give us an explanation of the flow that you are seeing and the one that you'...
[15:36:13] 10Analytics, 10Proton, 10Readers-Web-Backlog, 10Patch-For-Review, 10Readers-Web-Kanban-Board: Implement Schema:Print purging strategy - https://phabricator.wikimedia.org/T175395#3643274 (10Jdlrobson) @mforns it looks like the fix for T169730 is in production now (it's in 1.30.0-wmf.19 which is everywhere).
[15:36:35] 10Analytics-Kanban, 10Analytics-Wikistats: Stub new mediawiki history-based metrics - https://phabricator.wikimedia.org/T175268#3643276 (10mforns)
[15:37:43] 10Analytics-Kanban, 10Analytics-Wikistats: Wikistats unique devices metrics needs some copy that says "monthly" - https://phabricator.wikimedia.org/T176240#3643284 (10Nuria)
[15:37:55] 10Analytics-Kanban, 10Analytics-Wikistats: Stub new mediawiki history-based metrics - https://phabricator.wikimedia.org/T175268#3588195 (10mforns)
[15:38:43] 10Analytics-Kanban, 10Analytics-Wikistats: Add top articles by pageviews metric - https://phabricator.wikimedia.org/T175266#3643286 (10mforns)
[15:39:04] 10Analytics-Kanban, 10Analytics-Wikistats: Add top articles by pageviews metric - https://phabricator.wikimedia.org/T175266#3588171 (10mforns)
[15:40:39] 10Analytics-Kanban, 10Analytics-Wikistats: Stub new mediawiki history-based metrics - https://phabricator.wikimedia.org/T175268#3643322 (10JAllemandou) URIs to be mocked are defined as swagger-config in Restbase pull request: https://github.com/wikimedia/restbase/pull/875
[15:42:17] 10Analytics, 10Proton, 10Readers-Web-Backlog, 10Patch-For-Review, 10Readers-Web-Kanban-Board: Implement Schema:Print purging strategy - https://phabricator.wikimedia.org/T175395#3643327 (10bmansurov) >>! In T175395#3642855, @Jdlrobson wrote: > This has been sitting here for a week. To get code review we'...
[15:44:43] 10Analytics: Rename datasources and fields in Druid to use underscores instead of hyphens - https://phabricator.wikimedia.org/T175162#3584677 (10mforns) Let's change banner_activity_minutely to hyphens and that's that.
[15:45:00] 10Analytics: Rename datasources and fields in Druid to use hyphens instead of underscores - https://phabricator.wikimedia.org/T175162#3643336 (10mforns)
[15:45:38] 10Analytics: Rename datasources and fields in Druid to use hyphens instead of underscores - https://phabricator.wikimedia.org/T175162#3584677 (10mforns) p:05Triage>03Normal
[15:45:46] 10Analytics, 10Analytics-Wikistats: Address design feedback from Volker - https://phabricator.wikimedia.org/T167673#3643343 (10Nuria)
[15:46:06] 10Analytics-Kanban: Rename datasources and fields in Druid to use hyphens instead of underscores - https://phabricator.wikimedia.org/T175162#3584677 (10mforns)
[15:48:11] 10Analytics: Productionize streaming jobs - https://phabricator.wikimedia.org/T176983#3643365 (10Nuria)
[15:49:02] 10Analytics-Kanban: Productionize streaming jobs - https://phabricator.wikimedia.org/T176983#3643380 (10Nuria)
[15:50:06] 10Analytics-Kanban: Productionize streaming jobs - https://phabricator.wikimedia.org/T176983#3643365 (10Nuria)
[15:50:21] 10Analytics: R execution on stat1005 -> 'stack smashing error' - https://phabricator.wikimedia.org/T174946#3577730 (10mforns) @Erik_Zachte This is probably because of the new Debian Stretch that stat1005 is running on.
[15:50:31] 10Analytics: Productionize netflow job - https://phabricator.wikimedia.org/T176984#3643389 (10Nuria)
[16:10:00] 10Analytics, 10Operations, 10hardware-requests, 10Patch-For-Review: Decommission stat1002.eqiad.wmnet - https://phabricator.wikimedia.org/T173097#3643514 (10Nuria)
[16:29:58] gone for dinner, will be back after
[16:56:06] nuria: I saw your note about issues constructing session identifiers from IP for mobile sessions. Thanks! Do you know if this also confuses X-Forwarded-For? My understanding is that NAT typically sets this correctly, but perhaps this is not the case for particular OSes/carriers?
[16:56:52] sorry.. in wrong channel... I'll move to wikimedia-analytics, the correct one, and resend.
[16:57:44] ...and i guess i am there. Definitely have not mastered
[16:58:00] going offline people!
[16:58:01] * elukey off!
[17:05:38] 10Analytics, 10Proton, 10Readers-Web-Backlog, 10Patch-For-Review, 10Readers-Web-Kanban-Board: Implement Schema:Print purging strategy - https://phabricator.wikimedia.org/T175395#3643767 (10mforns) @bmansurov @Jdlrobson Yes, thanks! Will have a look at this tomorrow.
[17:07:18] hey team, leaving for today, tomorrow I'll also start the day earlier, byeee
[17:08:44] 10Analytics-EventLogging, 10Analytics-Kanban, 10Page-Previews, 10Readers-Web-Backlog, and 5 others: EventLogging subscriber module in ready state but not sending tracked events - https://phabricator.wikimedia.org/T175918#3643776 (10phuedx) Per T175918#3632663, this can't be signed off until 21:00 ([[ https...
[17:36:11] 10Analytics-Kanban: vet edit data on the data lake - https://phabricator.wikimedia.org/T153923#3643922 (10Nuria) a:05Milimetric>03ezachte
[17:36:21] 10Analytics-Kanban: vet edit data on the data lake - https://phabricator.wikimedia.org/T153923#2895122 (10Nuria) Assigning to Eric and moving to radar
[17:36:41] 10Analytics: vet edit data on the data lake - https://phabricator.wikimedia.org/T153923#3643924 (10Nuria)
[17:49:55] phuedx: did you get the data you needed for pdf rendering
[17:49:56] ?
[17:51:20] 10Analytics: Private geo wiki data in new analytics stack - https://phabricator.wikimedia.org/T176996#3643940 (10Nuria)
[17:53:21] 10Analytics: Private geo wiki data in new analytics stack - https://phabricator.wikimedia.org/T176996#3643977 (10Nuria)
[18:06:20] nuria_: i did, thanks!
[18:06:35] phuedx: via pivot or command line?
[18:07:36] pivot was great for initial discovery
[18:10:01] phuedx: ok
[18:10:03] and then we used the command line for drilling down into an hour or two's worth of data
[18:10:29] phuedx: ok, ya, that is a more effective use of resources
[18:10:35] phuedx: please evangelize in your team
[18:12:27] nuria_: absolutely!
[18:18:41] Shilad: i just saw your ping, sorry, something is missing on irc today
[18:30:28] Shilad: yt?
[18:33:46] Shilad: but.. what would X-Forwarded-For be set to in that case? you do not really have anything but an umbrella ip
[18:38:08] Shilad: are you thinking of an internal ip, assigned by the operator?
[19:05:02] 10Analytics-EventLogging, 10Analytics-Kanban, 10Patch-For-Review, 10Readers-Web-Backlog (Tracking): Schema:Popups suddenly stopped logging events in MariaDB, but they are still being sent according to Grafana - https://phabricator.wikimedia.org/T174815#3644260 (10Nuria) @tbayer: I imported popups tables t...
[19:22:48] Shilad: That is not what you will find in the data, it is empty more often than not. We can talk, but I would restrict your queries to desktop if you want the signatures to work as you have them
[19:32:04] Shilad: a good ballpark for signatures for a domain, to cross-check your data, should be the uniques_underestimate from the unique devices calculations; take a look at that data in the tables in the webrequest database: https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Traffic/Unique_Devices
[20:02:34] nuria_: Sorry for not responding. I was teaching, but thanks for the info. That all makes sense. It's a bummer that there is no way to reliably follow uniques on mobile. Have you all thought about using a cookie to do so? I'm sure there are policy issues there... just curious.
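(As an aside on the signature methodology discussed above: it usually amounts to hashing a few request fields together. A minimal sketch, with illustrative field names and a daily salt that are assumptions, not the actual job:)

```python
# Minimal sketch of a per-device "signature": a salted hash of client IP
# and user agent. Field names and the daily salt are illustrative
# assumptions. As discussed in the chat, this undercounts on mobile, where
# carrier NATs put many devices behind one IP and one signature.
import hashlib

def device_signature(client_ip: str, user_agent: str, salt: str) -> str:
    raw = "{}|{}|{}".format(salt, client_ip, user_agent)
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()

# Example with placeholder values; a daily salt bounds signatures to one day.
sig = device_signature("203.0.113.7", "Mozilla/5.0 (...)", salt="2017-09-27")
print(sig[:16])
```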
[20:03:30] Shilad: ya, there are huge privacy issues; we do not do it on purpose
[20:04:27] Shilad: for your purpose, since you are looking at signatures over short time spans, you can do it, but it requires a more sophisticated approach than that one
[20:07:48] Hi! I did try to reach druid from a SWAP Jupyter internal notebook, and got a "HTTP Error 503: Service Unavailable"
[20:08:19] Here's how I tried to connect:
[20:08:21] from pydruid.client import *
[20:08:23] query = PyDruid( 'http://druid1001.eqiad.wmnet:8082', 'druid/v2')
[20:09:28] Hmmm grepping around the refinery repo, it says the port is 8090
[20:16:55] Hm still "HTTP Error 503: Service Unavailable"
[20:22:25] However, I'm able to query Druid fine from the console on notebook1001, like so: curl -X POST 'druid1001.eqiad.wmnet:8082/druid/v2/?pretty' -H 'Content-Type:application/json' -d @test_druid_query
[20:22:35] Maybe some Jupyter sandboxing or firewall?
[20:22:48] joal: madhuvishy: ^ thx in advance!! :)
[20:35:34] AndyRussG: I looked at it a bit before and found that I can connect to the port with something like `!nc -vz druid1001.eqiad.wmnet 8082` (tcp) from the jupyter notebook or the server
[20:35:40] but couldn't curl
[20:36:19] I'm not sure what's up, I left messages for andrew and joal a couple days ago in backscroll
[20:37:01] I've seen the messages, but didn't investigate, madhuvishy
[20:53:18] nuria_: That makes sense. Can you tell me more about this idea: "requires a more sophisticated approach than that one." Doesn't sound like something I'll do, but I'm curious.
[20:53:44] nuria_: And sorry for the intermittent delays... I'm holding office hours.
[21:34:09] madhuvishy: joal Thanks so much!! (sorry I was away from the keyboard for a bit)... I guess there's a puppet config for this stuff somewhere, just to see if anything obvious jumps out? thx again
[22:14:19] AndyRussG: say that you get to connect to druid, do you know how to query?
[22:14:50] nuria_: hey... Just looking at the documentation for Druid and pydruid
[22:15:14] Also scraped some config in refinery for some relevant details
[22:15:39] Here's the pydruid query I tried:
[22:15:40] query.timeseries(
[22:15:42] datasource='pageviews-hourly',
[22:15:44] granularity='day',
[22:15:46] intervals = '2017-06-01T00:00/2017-07-01T00',
[22:15:48] aggregations = { 'view_count': doublesum('view_count' ) }
[22:15:50] )
[22:17:32] AndyRussG: and you have tried on the console with pydruid and that works too?
[22:18:30] nuria_: ah no... good idea! I tried with a curl from the console, but not pydruid
[22:18:38] AndyRussG: right
[22:19:10] AndyRussG: let's first try whether pydruid actually works (it might) but i do not think any of us has used it
[22:20:04] K will do!
[22:26:13] With the curl it worked fine from the console, got a valid Druid data response
[22:26:28] AndyRussG: ya, i just tried too
[22:38:40] AndyRussG: trying pydruid from inside the jupyter notebook terminal, it doesn't work; i think the install needs a couple more things
[22:57:31] AndyRussG: i think the dependencies of pydruid require packages that cannot be easily installed on the virtualenv (from my brief tests)
[22:59:27] nuria_: ah hmm interesting...!! thanks! (I'll look at it again in a few, just in a call now)
[23:15:27] AndyRussG: I think pydruid requires pygobject, which requires a system install: https://pygobject.readthedocs.io/en/latest/faq.html
[23:24:54] nuria_: this didn't return any errors from within the notebook: !pip install pydruid
[23:25:07] AndyRussG: right, now try to use the package
[23:26:29] nuria_: mmm it does stuff. For example, the query.timeseries I tried (above) did correctly convert to a valid Druid json query, which it printed out as part of the error
[23:27:53] nuria_: https://tools.wmflabs.org/paste/view/8046eb00
[23:28:41] Apparently it makes an http call and gets a response
[23:31:14] Funny, the query takes a long time to come back
[23:44:18] Hi A Team :D
[23:44:54] general question, how much lag in general can one expect on the mysql event logging tables? how close to real time are they kept?
[23:47:35] nuria_: madhuvishy joal I was able to query Druid using pydruid from the console, using the same virtual python environment we get in the notebooks
[23:47:37] https://tools.wmflabs.org/paste/view/b114d636
[23:48:04] used the packages I'd already installed from within the notebook
[23:48:35] could it be something about how the query is made from the notebook that is blocked by a firewall, or some sort of notebook sandboxing?
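(One way to test the firewall/sandboxing hypothesis from inside the notebook: pydruid, at least in versions from around this time, issues its HTTP calls through Python's urllib, which honors http_proxy/https_proxy environment variables that a plain curl from the terminal may not be using. The sketch below is a guess to rule out, not a confirmed diagnosis; hosts and the datasource name are taken from the chat above.)

```python
# Sketch: check whether notebook requests are being routed through an HTTP
# proxy inherited from the environment -- one possible cause of a 503 for
# internal hosts even though plain curl works. This is an assumption to
# rule out, not a confirmed diagnosis.
import os
import urllib.request

# Non-empty output here (e.g. a webproxy URL) would support the theory.
print(urllib.request.getproxies())

# If a proxy shows up, clear it for this process (best done before the
# first request, since urllib caches its default opener) and retry the
# same query from the chat above.
for var in ("http_proxy", "https_proxy", "HTTP_PROXY", "HTTPS_PROXY"):
    os.environ.pop(var, None)

from pydruid.client import PyDruid
from pydruid.utils.aggregators import doublesum

client = PyDruid('http://druid1001.eqiad.wmnet:8082', 'druid/v2')
result = client.timeseries(
    datasource='pageviews-hourly',
    granularity='day',
    intervals='2017-06-01T00:00/2017-07-01T00',
    aggregations={'view_count': doublesum('view_count')},
)
print(result.result)
```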