[00:03:21] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 2.39, 4.13, 4.89
[00:10:01] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.92, 5.90, 5.26
[00:40:05] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.98, 7.98, 7.61
[01:10:10] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.55, 7.32, 6.85
[01:40:15] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 4.90, 6.08, 7.06
[02:10:19] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.86, 6.65, 6.62
[02:25:21] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 1.42, 3.61, 4.88
[02:30:01] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.71, 5.31, 5.17
[02:32:52] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 2.92, 4.51, 4.92
[02:39:13] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 6.68, 5.51, 5.11
[02:57:21] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 1.28, 3.71, 4.95
[03:02:01] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.82, 5.36, 5.22
[03:06:46] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 2.23, 4.28, 4.90
[03:14:05] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.35, 5.79, 5.15
[03:15:04] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 4.11, 5.15, 4.96
[03:22:20] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.51, 5.91, 5.18
[03:52:24] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 4.35, 3.75, 5.12
[03:59:21] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 1.64, 3.69, 4.85
[04:05:00] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 8.13, 5.73, 5.24
[04:08:48] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 2.51, 4.57, 4.94
[04:13:16] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.24, 5.59, 5.20
[04:43:19] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 4.71, 6.81, 7.00
[05:13:24] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.85, 7.88, 7.50
[05:43:29] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.14, 5.72, 5.70
[06:13:34] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.14, 5.31, 5.72
[06:43:39] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 1.17, 3.53, 5.38
[07:13:44] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.44, 7.33, 6.84
[07:43:49] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 8.00, 7.57, 6.96
[08:00:21] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 2.15, 2.88, 4.86
[08:13:00] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.17, 5.98, 5.20
[08:43:03] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 1.16, 3.88, 5.43
[08:44:21] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 1.00, 2.91, 4.88
[08:46:30] <wikibugs_>	 10Scoring-platform-team, 10MediaWiki-extensions-ORES, 10MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), 10Patch-For-Review, and 2 others: [Discuss] Make ORES Review Tool preferences more prominent - https://phabricator.wikimedia.org/T167910#3377876 (10Trizek-WMF) Adding user-notice: create a...
[08:52:01] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.97, 5.85, 5.30
[09:15:36] <wikibugs_>	 10Scoring-platform-team-Backlog, 10Labs, 10Labs-Infrastructure, 10Operations: Keep wmflabs scoring boxes up-to-date - https://phabricator.wikimedia.org/T168478#3377954 (10ArielGlenn) p:05Triage>03Normal
[09:22:03] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 8.00, 8.02, 7.69
[09:32:21] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 2.25, 2.81, 4.93
[10:05:00] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.57, 6.95, 5.15
[10:35:03] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 6.90, 5.06, 5.50
[11:05:08] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 6.33, 4.66, 5.42
[11:35:13] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 3.22, 5.78, 6.29
[11:45:21] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 1.69, 3.68, 4.91
[11:50:01] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.87, 5.38, 5.20
[11:52:52] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 2.63, 4.32, 4.85
[11:58:16] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.06, 5.51, 5.13
[12:28:17] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 8.06, 7.49, 6.54
[12:36:21] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 2.07, 3.25, 4.85
[12:45:00] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.15, 5.68, 5.16
[13:15:02] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.08, 7.19, 7.00
[13:45:07] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 2.09, 4.41, 5.81
[13:48:22] <icinga2-wm>	 RECOVERY - check load on Ores-Compute-01 is OK: OK - load average: 2.00, 3.07, 4.93
[13:53:00] <icinga2-wm>	 PROBLEM - check load on Ores-Compute-01 is WARNING: WARNING - load average: 7.29, 5.05, 5.21
[14:02:55] <halfak>	 ^ this needs to be turned off
[14:10:49] <paladox>	 halfak you can ack it on http://gerrit-icinga.wmflabs.org/dashboard :)
[14:10:57] <paladox>	 it will remove the ack when ever it recovers
[14:11:07] <icinga2-wm>	 ACKNOWLEDGEMENT - check load on Ores-Compute-01 is WARNING: WARNING - load average: 3.44, 5.88, 6.23 paladox ack
[14:12:51] <halfak>	 Looks like my old password doesn't work. 
[14:13:03] <halfak>	 zppix gave me a password to log in with
[14:13:57] <paladox>	 halfak, i can create an account for you if you doint want to use ldap.
[14:14:16] <paladox>	 I had to fix it over the weekend after doing jessie to stretch upgrade on the host.
[14:14:19] <halfak>	 Oh yeah.  I don't want to type my prod ldap for sure. 
[14:14:25] <paladox>	 ok
[14:28:09] <halfak>	 paladox, did you turn off the load warning?
[14:28:17] <paladox>	 halfak i acked it.
[14:28:25] <paladox>	 So it should come back when ever it recovers
[14:28:30] <halfak>	 Great. 
[14:28:39] <halfak>	 We should not have the load warning fire on the compute node.  That's what it is for :) 
[14:28:57] <paladox>	 ok
[14:29:12] <paladox>	 So you want it removed? The load check?
[14:30:18] <halfak>	 Yup
[14:30:30] <halfak>	 Thanks!
[14:30:39] <halfak>	 I'd do it if the UI worked for me :/ 
[14:31:03] <paladox>	 halfak that's done through the puppet repo
[14:31:12] <paladox>	 ui wont allow you to remove it unless done through director
[14:31:31] <paladox>	 https://gerrit.wikimedia.org/r/#/admin/projects/labs/icinga2
[14:31:34] <paladox>	 halfak ^^
[14:32:13] <halfak>	 gotcha. 
[14:33:17] <paladox>	 and the file is service
[14:37:41] <paladox>	 halfak https://gerrit.wikimedia.org/r/#/c/361462/
[14:38:26] <halfak>	 (y)
[14:38:58] <paladox>	 deployed
[14:52:12] <halfak>	 great. 
[14:52:24] <halfak>	 FYI, Amir1, I don't think we're going to be able to deploy today. 
[14:52:37] <Amir1>	 halfak: why :(
[14:52:40] <halfak>	 the model building process takes more than 24 hours these days
[14:52:46] <halfak>	 Working on ptwiki now
[14:52:48] <Amir1>	 I see
[14:52:50] <Amir1>	 okay
[14:52:53] <halfak>	 Alphabetically. 
[14:53:06] <halfak>	 The problem is re-extracting all the features. 
[14:53:12] <halfak>	 We've done lots of optimizations but still...
[14:56:17] <Amir1>	 Yeah, I know :(
[15:06:26] <glorian_wd>	 halfak: o/
[15:06:42] <glorian_wd>	 halfak: I have re-submitted the PR. I guess this should be the last one
[15:20:45] <wikibugs_>	 10Scoring-platform-team, 10User-Zppix: Graphite access for Zppix - https://phabricator.wikimedia.org/T168014#3379263 (10RobH) I'm removing the #operations and #ops-access-requests tags, so this doesn't show in clinic duty triage, since its no longer an active request.  I'd have declined it, but since its on tw...
[16:01:34] <Amir1>	 halfak: ping
[16:02:57] <halfak>	 sorry bio
[16:02:59] <halfak>	 here now
[16:18:03] <wikibugs_>	 10Scoring-platform-team, 10User-Zppix: Graphite access for Zppix - https://phabricator.wikimedia.org/T168014#3379558 (10Zppix) >>! In T168014#3379263, @RobH wrote: > I'm removing the #operations and #ops-access-requests tags, so this doesn't show in clinic duty triage, since its no longer an active request.  I...
[16:24:00] <wikibugs_>	 10Scoring-platform-team, 10editquality-modeling, 10revscoring, 10artificial-intelligence: Build damaging/goodfaith models for Romanian Wikipedia - https://phabricator.wikimedia.org/T156503#3379601 (10Halfak) a:03Sumit
[16:32:00] <wikibugs_>	 10Scoring-platform-team, 10Bad-Words-Detection-System, 10revscoring, 10artificial-intelligence: Add language support for Albanian - https://phabricator.wikimedia.org/T168369#3379637 (10Halfak) Looks like the generated list is there.    Instructions: https://www.mediawiki.org/wiki/ORES/BWDS_review  Ping: @M...
[16:35:27] <wikibugs_>	 10Scoring-platform-team, 10ORES, 10Services (watching), 10User-Ladsgroup: ORES POST precaching always fails with 500 - https://phabricator.wikimedia.org/T168674#3371682 (10Halfak)
[16:35:29] <wikibugs_>	 10Scoring-platform-team, 10ORES: Mid June 2017 ORES deployment - https://phabricator.wikimedia.org/T168099#3379655 (10Halfak)
[16:37:25] <wikibugs_>	 10Scoring-platform-team, 10revscoring, 10artificial-intelligence: Fix degenerate regular expressions for matching "hahaha" and "jajaja" - https://phabricator.wikimedia.org/T168888#3379659 (10Halfak)
[16:37:32] <wikibugs_>	 10Scoring-platform-team, 10revscoring, 10artificial-intelligence: Fix degenerate regular expressions for matching "hahaha" and "jajaja" - https://phabricator.wikimedia.org/T168888#3379674 (10Halfak) 05Open>03Resolved
[16:37:53] <wikibugs_>	 10Scoring-platform-team, 10revscoring, 10artificial-intelligence: Fix degenerate regular expressions for matching "hahaha" and "jajaja" - https://phabricator.wikimedia.org/T168888#3379659 (10Halfak)
[16:37:54] <wikibugs_>	 10Scoring-platform-team, 10ORES: Mid June 2017 ORES deployment - https://phabricator.wikimedia.org/T168099#3379676 (10Halfak)
[16:38:29] <wikibugs_>	 10Scoring-platform-team, 10ORES, 10articlequality-modeling, 10editquality-modeling, 10artificial-intelligence: Rebuild all of the models for ORES (new regexes) - https://phabricator.wikimedia.org/T168889#3379678 (10Halfak)
[16:39:01] <wikibugs_>	 10Scoring-platform-team, 10revscoring, 10artificial-intelligence: Fix degenerate regular expressions for matching "hahaha" and "jajaja" - https://phabricator.wikimedia.org/T168888#3379710 (10ssastry)
[16:40:50] <wikibugs_>	 10Scoring-platform-team, 10Bad-Words-Detection-System, 10revscoring, 10artificial-intelligence: Add language support for Albanian - https://phabricator.wikimedia.org/T168369#3379718 (10Halfak) a:03Sumit
[16:41:00] <wikibugs_>	 10Scoring-platform-team, 10ORES: Mid June 2017 ORES deployment - https://phabricator.wikimedia.org/T168099#3379720 (10Halfak) a:03Halfak
[16:43:56] <wikibugs_>	 10Scoring-platform-team, 10editquality-modeling, 10User-Ladsgroup, 10artificial-intelligence: Flagged revs approve model to fiwiki - https://phabricator.wikimedia.org/T166235#3379730 (10Halfak) a:05Ladsgroup>03awight
[16:47:21] <codezee>	 awight: I cannot download and train the draftquality model because of permissions, let me know if you need to know anything on the PR
[16:50:01] <halfak>	 awight, I needed to request access to deleted text via my staff account in order to run the extractor. 
[16:54:37] <codezee>	 halfak: while i was playing aroung with bwds on my laptop, it was taking a lot of time due to limited bandwitdth, same would probably happend with model training, is it possible for me to access one of the labs accounts used for training models?
[16:54:47] <codezee>	 *would happen
[17:09:53] <wikibugs_>	 10Scoring-platform-team, 10MediaWiki-JobQueue, 10ORES, 10Performance-Team, and 5 others: Job queue corruption after codfw switch over (Queue growth, duplicate runs) - https://phabricator.wikimedia.org/T163337#3379884 (10elukey) The idea about the experiment is to remove rdb2004 as slave of rdb2003, to see...
[17:15:58] <awight>	 codezee: Okay thanks, I'm sure I'll have some questions shortly!
[17:18:11] <Amir1>	 halfak: ping :D https://github.com/wiki-ai/ores/pull/209
[17:50:48] <wikibugs_>	 10Scoring-platform-team, 10MediaWiki-Vagrant, 10ORES, 10Wikilabels, 10Patch-For-Review: ORES services should have vagrant roles - https://phabricator.wikimedia.org/T159105#3380009 (10awight)
[17:53:17] <halfak>	 Amir1, on it
[17:53:40] <Amir1>	 thanks
[17:58:09] <halfak>	 Weird.  I merged it but it didn't auto close.  I can see the commit in master
[17:58:51] <Amir1>	 Thanks!
[18:06:52] <wikibugs_>	 10Scoring-platform-team, 10draftquality-modeling, 10artificial-intelligence: Experiment with Sentiment score feature for draftquality - https://phabricator.wikimedia.org/T167305#3323948 (10awight) >>! In T167305#3340316, @Sumit wrote: > So I could setup a test with the library - https://github.com/kevincobai...
[18:13:32] <wikibugs_>	 10Scoring-platform-team, 10draftquality-modeling, 10artificial-intelligence: Experiment with Sentiment score feature for draftquality - https://phabricator.wikimedia.org/T167305#3380074 (10Sumit) >>! In T167305#3380066, @awight wrote: >>>! In T167305#3340316, @Sumit wrote: >> So I could setup a test with the...
[18:19:11] <wikibugs_>	 10Scoring-platform-team, 10MediaWiki-Vagrant, 10ORES, 10Wikilabels, 10Patch-For-Review: ORES services should have vagrant roles - https://phabricator.wikimedia.org/T159105#3380076 (10awight) I think I mentioned this in code review, but with a fresh vagrant checkout and VM, I get: > ==> default: Execution...
[18:24:51] <halfak>	 awight, where did that work you were doing on the basic Meta ORES docs end up getting saved?
[18:27:07] <awight>	 halfak: This is the entry point, https://www.mediawiki.org/wiki/Meta-ORES
[18:27:26] <halfak>	 Thanks!
[18:28:55] <awight>	 ooh--I had a better diagram, I'll try to iterate on that shortly.
[18:37:47] <halfak>	 Nice.  I saved a couple edits too :) 
[18:38:02] <halfak>	 keegan is hoping to use that as a case study in outreach around a new software thingie
[18:41:50] <awight>	 codezee: It looks like I have sufficient permissions to run the draftquality makefile...
[18:42:05] <awight>	 Currently, I'm pulling datasets/enwiki.draft_quality.201508.tsv.bz2
[18:42:15] <awight>	 Do you think that will be enough to run your stuff?
[18:44:11] <awight>	 Also--how do I run your stuff?  You mentioned something about running the utilities in the Makefile, it looks like I would have to pull all data sets and train the model?
[18:45:13] <codezee>	 awight: yes model training would be necessary, since I've added features to the original draftquality model, its like retraining the model with added feature set
[18:46:03] <codezee>	 awight: if we could see a tuning report like - https://github.com/wiki-ai/draftquality/blob/master/tuning_reports/enwiki.draft_quality.md with some improvement it would mean the features work
[18:46:14] <halfak>	 o/ Keegan 
[18:46:20] <Keegan>	 \o
[18:46:37] <awight>	 halfak: Amir1: Any thoughts about how to train models that require privileged data?  We shouldn't be exporting that stuff to wmflabs...
[18:46:50] <halfak>	 Was just talking to awight about that meta-ORES page.  It's pretty sparse at the moment.  I think we need a nice coherent statement and maybe even a wireframed ui before reachin out. 
[18:47:26] <codezee>	 awight: any way you could extract individual feature values of the three sentiment features with associated labels of spam, attack, vandalism or normal? that'd be very helpful for feature study
[18:47:34] <halfak>	 awight, fair point.  I don't think it's crazy to have a dataset with a random sample of deleted pages on a labs VM, but it's not the best./ 
[18:47:39] <codezee>	 like the one I showed on task
[18:47:42] <halfak>	 we need a secure place to train model files. 
[18:48:11] <awight>	 halfak: Better yet, we should be able to completely decouple feature value caching.
[18:48:31] <awight>	 Then, we can import the feature values from external sources.
[18:48:41] <halfak>	 awight, not sure how to decouple any more than we currently are.
[18:48:53] <halfak>	 Oh!  You mean "don't store the text -- just extract the features"
[18:48:56] <awight>	 yes
[18:48:57] <halfak>	 We can do that!  No problem.
[18:49:06] <halfak>	 Just a bit slower for re-training. 
[18:49:12] <halfak>	 But not too crazy. 
[18:49:31] <halfak>	 Just skip the w_text step and go right to the w_cache step
[18:50:01] <halfak>	 I think you could even pipe the output. 
[18:50:57] * awight looks for the howto :)
[18:51:19] <halfak>	 Ha.  I think the makefile should give strong indications of where stdout and stdin can be matched. 
[18:51:28] <halfak>	 lol @ hotwo
[18:51:32] <halfak>	 *howto
[18:51:39] <awight>	 hot potato
[18:53:28] <awight>	 Another example of what I'm imagining for decoupling, I want to pull just the sentiment features codezee is adding, but AFAICT I currently have to go also process the entire feature set for the data.
[18:53:47] <glorian_wd>	 halfak: do you have a time to look at my PR today?
[18:53:57] <halfak>	 awight, that'd take a little coding but you could do it. 
[18:54:13] <halfak>	 It'd be a somewhat happy interpreter dance. 
[18:54:20] <halfak>	 See https://github.com/wiki-ai/revscoring/blob/master/ipython/feature_engineering.ipynb
[18:54:28] <halfak>	 for how to extract arbitrary sets of features
[18:55:06] <awight>	 cool, this does sound like a fun adventure
[18:55:34] <halfak>	 I love that notebook.  It comes in super handy for a lot of ORES stuff :) 
[18:56:04] <codezee>	 yes, its been helpful for understanding revscoring feature datasources thingy
[18:57:06] <halfak>	 Keegan, anything obvious you want to deal with in that mw page for Meta-ORES? 
[18:59:29] <Keegan>	 halfak: Overall the content looks good
[18:59:55] <halfak>	 Seems it still lacks a clear statement and maybe a UI example explaining WTF we are talking about. 
[19:01:49] <Keegan>	 halfak: It could probably use a nutshell in simple English of what the tool will be for at the top
[19:02:02] <Keegan>	 I'm not sure everyone is going to be able to understand on first pass
[19:03:35] <Keegan>	 halfak: I also added that suggestion as a talk page topic simple to turn the discussion tab blue :)
[19:03:41] <Keegan>	 *simply to
[19:03:45] <halfak>	 Nice
[19:03:47] <halfak>	 :D
[19:03:50] <halfak>	 Legitimizing
[19:04:10] <codezee>	 halfak: in case you missed earlier, any possibility to get labs access for training models?
[19:05:59] <halfak>	 "labs access"?
[19:06:07] <halfak>	 As in access to our instances?  
[19:06:20] <halfak>	 I'm surprised we didn't already get to that yet!
[19:06:27] <halfak>	 Do you have an labs shell name? 
[19:06:31] <halfak>	 codezee, ^
[19:07:25] <codezee>	 halfak: yes, same as this nick i think you'll just need to add me to the required groups afaik
[19:07:40] <halfak>	 Sure.  Can do 
[19:19:37] <halfak>	 awight, don't forget to make tasks for your nice little PRs :) 
[19:19:41] <halfak>	 e.g. https://github.com/wiki-ai/draftquality/pull/4
[19:19:44] <awight>	 harr
[19:19:45] <awight>	 k
[19:19:52] <halfak>	 They are great things to report in our status updates :D
[19:21:33] <codezee>	 halfak: whats with the no_review labels in the makefile of editquality? I see not all languages have that no_review set...
[19:21:54] <halfak>	 codezee, yeah.  This had to do with the way we used to sample revisions for review. 
[19:22:12] <halfak>	 There was a short period of time and subset of projects that used a different partern. 
[19:22:18] <halfak>	 You don't need that for albanian
[19:22:26] <wikibugs_>	 10Scoring-platform-team: Minor cleanup in Makefiles - https://phabricator.wikimedia.org/T168904#3380217 (10awight)
[19:22:34] <codezee>	 or romanian i guess?
[19:23:34] <wikibugs_>	 10Scoring-platform-team: Minor cleanup in Makefiles - https://phabricator.wikimedia.org/T168904#3380233 (10awight) https://github.com/wiki-ai/draftquality/pull/4 https://github.com/wiki-ai/editquality/pull/75
[19:23:38] <codezee>	 i might disappear in a while so let me know through mail or a task the instance(s) name i'm granted access to
[19:23:56] <halfak>	 codezee, gotcha. 
[19:25:00] <halfak>	 codezee, https://wikitech.wikimedia.org/wiki/User:Codezee
[19:25:14] <awight>	 codezee: I could use a few more hints about how to extract the features you want...
[19:25:28] <awight>	 I'm currently pulling all the draft extracts listed in the draftquality makefile
[19:25:46] <awight>	 Then, I'll figure out how to evaluate your new features
[19:26:10] <awight>	 But what's most useful for you--is a list of the new feature values for a small sample of deleted drafts all you need?
[19:26:25] <codezee>	 halfak: https://wikitech.wikimedia.org/wiki/User:Sumit
[19:26:35] <codezee>	 i though you were asking the shell name earlier :/
[19:26:41] <codezee>	 *thought
[19:27:09] <halfak>	 Oh yeah.  FOrgot they can be different
[19:27:20] <halfak>	 And forgot I needed the wiki name to do it through wikitech
[19:27:50] <codezee>	 yeah they get confusing...
[19:28:08] <halfak>	 codezee, are you set up to ssh to other labs instances?
[19:28:16] <halfak>	 ssh ores-compute-01.eqiad.wmflabs
[19:28:18] <codezee>	 awight: yes if i get the feature values of all three sentiment features for a decent number of drafts of each category that'd be nice... :)
[19:28:38] <awight>	 codezee: cool--I'll split by category
[19:28:39] <codezee>	 halfak: i can ssh to eqiad let me try this one
[19:28:58] <halfak>	 If you just want to run some tests with data, you could work from https://figshare.com/articles/Deleted_Wikipedia_articles_spam_vandalism_attack_/4245035
[19:29:03] <awight>	 codezee: A decent number, is like 10,000? or 100?
[19:29:08] <awight>	 sorry I'm new here ;-)
[19:30:04] <awight>	 halfak: Seems problematic to only use censored data which is... not censored?
[19:30:09] <codezee>	 awight: 10k is always better than 100 when it comes to AI ;)
[19:30:17] <awight>	 hehe
[19:30:23] <codezee>	 i think we should get a good enough idea with 10k
[19:30:31] <Platonides>	 oh, "sentiment features"
[19:30:37] <Platonides>	 not "sentient features" :P
[19:30:52] <awight>	 lol stay tuned for the sentiment sentience
[19:31:17] <codezee>	 halfak: I'm able to ssh, i suppose i'll not need to install any ubuntu specific deps ? just create a virtualenv right ?
[19:31:30] <halfak>	 right. 
[19:31:42] <codezee>	 thanks that should be it :)
[19:31:47] <halfak>	 Platonides, this is AI, right?
[19:32:37] <Platonides>	 that's why it mostly made sense :)
[19:34:22] <awight>	 codezee: lmk if halfak's figshare lets you do the processing you need?  It might be nicer for you to own the data, than wait for me each time you change something...
[19:34:31] <codezee>	 halfak awight: the above link is already the same as the github sample and its results are on the phab task
[19:34:38] <awight>	 lol
[19:35:09] <halfak>	 \o/
[19:35:23] <codezee>	 awight: halfak on that dataset the hypothesis holds, imo the litmus test is if the scoring report shows a rise in accuracy
[19:35:44] <codezee>	 which has to be on the full dataset which I can't touch :/ :/
[19:36:25] <awight>	 aah--so rather than extract specific feature values, I should be training a model, and post the results?
[19:37:36] <codezee>	 awight: model report should be fine, i meant if you're able to get feature specific values it was even better :)
[19:37:57] <codezee>	 awight: and i think for the immediate case retraining the model is far easier
[19:38:09] <awight>	 +1 I'll start with that
[19:39:00] <codezee>	 maybe we can have AI someday to censor text for us automatically, :D
[19:39:37] <Platonides>	 that's the previous step to the AI realizing nothing we want to write about is worthy
[19:39:47] <Platonides>	 at least, that would make its work easier!
[19:41:44] <wikibugs_>	 10Scoring-platform-team-Backlog: Design how we'll train models which depend on private data - https://phabricator.wikimedia.org/T168908#3380311 (10awight)
[19:44:38] <awight>	 BTW, what's the theory behind .gitignoring the datasets dirs?
[19:46:37] <halfak>	 awight, mostly we don't want to check anything in from there. 
[19:46:50] <halfak>	 I like git add -f to add a dataset when I really mean it. 
[19:47:16] <awight>	 haha fair enough
[19:48:33] <awight>	 omg that .tsv.bz2 is useless
[19:54:16] <wikibugs_>	 10Scoring-platform-team: draftquality should be trained on a sample, rather than humongous everything - https://phabricator.wikimedia.org/T168909#3380334 (10awight)
[19:54:26] <wikibugs_>	 10Scoring-platform-team-Backlog: draftquality should be trained on a sample, rather than humongous everything - https://phabricator.wikimedia.org/T168909#3380346 (10awight)
[19:55:20] <awight>	 halfak: Hey that reminds me of something I noticed last week.  Please chat w/ me some time about the theory behind sampling / randomizing
[19:55:43] <halfak>	 E_OUTOFBRAINERROR
[19:55:51] <codezee>	 :D
[19:55:56] <halfak>	 There was no more brain to allocate at the moment
[19:55:57] <awight>	 kill -9
[19:56:02] * halfak dies
[19:56:05] * halfak is reborn
[19:56:08] <codezee>	 systemctl restart
[19:56:13] <halfak>	 lol
[19:56:13] <awight>	 nohup
[19:57:06] <Platonides>	 we need to throw more halfaks to the problem
[19:57:06] <awight>	 gotta set that zombie process loose
[19:57:14] <awight>	 -j6
[19:57:42] <awight>	 oh no!  we've catalyzed geekpocalypse
[20:47:53] <wikibugs_>	 10Scoring-platform-team, 10MediaWiki-Vagrant, 10ORES, 10Wikilabels, 10Patch-For-Review: ORES services should have vagrant roles - https://phabricator.wikimedia.org/T159105#3380445 (10awight) I think the ORES service might not be running.  ``` vagrant roles list -e Enabled roles:  ores...
[20:55:39] <wikibugs_>	 10Scoring-platform-team: Minor cleanup in Makefiles - https://phabricator.wikimedia.org/T168904#3380454 (10awight) Some nastiness I just discovered: many commands will overwrite their output on failure.  E.g.,  ``` datasets/enwiki.draft_quality.201601.tsv.bz2: \         sql/draft_quality.variables.sql     echo '...
[21:36:37] <halfak>	 https://gerrit.wikimedia.org/r/361576
[21:37:12] <wikibugs_>	 10Scoring-platform-team-Backlog: Document nuances of training data - https://phabricator.wikimedia.org/T168912#3380543 (10awight)
[21:38:24] <wikibugs_>	 10Scoring-platform-team, 10MediaWiki-Vagrant, 10ORES, 10Wikilabels, 10Patch-For-Review: ORES services should have vagrant roles - https://phabricator.wikimedia.org/T159105#3380555 (10Tgr) >>! In T159105#3380076, @awight wrote: >> ==> default: Execution of '/bin/systemctl start ores-wsgi' returned 6: Fail...
[21:41:27] <wikibugs_>	 10Scoring-platform-team, 10MediaWiki-Vagrant, 10ORES, 10Wikilabels, 10Patch-For-Review: ORES services should have vagrant roles - https://phabricator.wikimedia.org/T159105#3380564 (10awight) Yep, the process was roughly, * vagrant role enable ... * vagrant destroy * vagrant box update * vagrant up
[21:44:50] <wikibugs_>	 10Scoring-platform-team-Backlog: Investigate parallelizing the model makefile - https://phabricator.wikimedia.org/T168913#3380574 (10awight)
[21:45:15] * halfak submits enormous PR for editquality
[21:45:47] <awight>	 halfak: need +2 on that, or are you self-merging?
[21:46:13] <wikibugs_>	 10Scoring-platform-team, 10MediaWiki-Vagrant, 10ORES, 10Wikilabels, 10Patch-For-Review: ORES services should have vagrant roles - https://phabricator.wikimedia.org/T159105#3380589 (10Tgr) >>! In T159105#3380445, @awight wrote: > I think the ORES service might not be running.  After a successful provision...
[21:46:47] <halfak>	 awight, https://github.com/wiki-ai/editquality/pull/77 
[21:46:48] <halfak>	 <3
[21:47:47] <wikibugs_>	 10Scoring-platform-team, 10MediaWiki-Vagrant, 10ORES, 10Wikilabels, 10Patch-For-Review: ORES services should have vagrant roles - https://phabricator.wikimedia.org/T159105#3380593 (10awight) ``` vagrant provision ==> default: Running provisioner: lsb_check... ==> default: Running provisioner: shell......
[21:50:39] <halfak>	 Amir1, I think we might not make it with the draft quality model 
[21:51:20] <Amir1>	 halfak: we can delay it a little
[21:51:26] <Amir1>	 what can I do?
[21:51:30] <wikibugs_>	 10Scoring-platform-team, 10MediaWiki-Vagrant, 10ORES, 10Wikilabels, 10Patch-For-Review: ORES services should have vagrant roles - https://phabricator.wikimedia.org/T159105#3380604 (10awight) Looks pretty simple, actually!  Missing Python dependency?  Here are snippets from syslog,  ``` Jun 26 21:47:21 me...
[21:52:22] <halfak>	 Amir1, not sure.  I'm waiting on ores-compute-01 to finish training the draft quality model and it is mega slow. 
[21:52:27] <halfak>	 500k observations
[21:52:40] <halfak>	 We can only run two processes in parallel because of memory issues. 
[21:53:14] <paladox>	 halfak how much storage does ores-compute-01 one have?
[21:53:24] <paladox>	 Is it an xtra large one which has alot of ram?
[21:53:43] <halfak>	 It has 16GB of ram
[21:53:45] <halfak>	 8 "cores"
[21:53:53] <paladox>	 oh lol
[21:53:54] <halfak>	 But I don't think we're the biggest size. 
[21:53:56] <halfak>	 We could upgrade. 
[21:54:08] <paladox>	 it takes up 16gb?
[21:54:16] <awight>	 paladox: If you're curious, https://tools.wmflabs.org/openstack-browser/project/ores
[21:54:41] <paladox>	 thanks
[21:55:04] <paladox>	 according to the ui
[21:55:06] <paladox>	 it has 8
[21:55:10] <paladox>	 8gb of ram.
[21:55:16] <paladox>	 for ores-computer-01
[21:55:29] <awight>	 paladox: I think the columns are confusing
[21:55:48] <awight>	 i.e. off by one
[21:55:51] <paladox>	 oh
[21:55:55] <paladox>	 yeh
[21:55:59] <awight>	 We seem to be using open source :p
[21:56:09] <paladox>	 o
[21:56:14] <paladox>	 oh i see
[21:56:14] <paladox>	 16384M
[21:56:19] <awight>	 j/k, proprietary software would be whitescreening inexplicably at this point.
[21:56:41] <halfak>	 paladox, 16GB 
[21:56:45] <halfak>	 :P 
[21:56:47] <paladox>	 yep
[21:56:54] <paladox>	 it was out of place :)
[21:57:52] <awight>	 Here's an interesting one:
[21:57:52] <awight>	 21066 halfak    20   0 1055076 470508  24028 R  98.7  2.9  16:09.88 revscoring                    
[21:57:55] <awight>	 21065 halfak    20   0 5505852 5.247g   1748 S   0.7 33.5   1:12.35 shuf                          
[21:58:04] <awight>	 It's actually "shuf" doing the ram killing
[21:58:10] <halfak>	 awight, right
[21:58:12] <halfak>	 I saw that
[21:58:16] <halfak>	 It'll be revscoring soon
[21:58:38] <awight>	 If the order doesn't matter for training and we're not sampling, we can take shuf out of the chain...
[21:58:59] <halfak>	 head -n 500000 wouldn't have the same guarantees
[21:59:42] <halfak>	 I think that shuf will exit as soon as one revscoring process has all of the data.  Then revscoring will spawn 2 more processes to do the cv pattern. 
[22:00:24] <halfak>	 OK here we are.  I say we try to move forward without draftquality
[22:00:28] <awight>	 gotcha
[22:00:33] <halfak>	 Amir1, ^
[22:00:47] <Amir1>	 I don't see anything
[22:00:49] <Amir1>	 :(
[22:00:54] <halfak>	 ?
[22:00:55] <awight>	 fwiw I see that other people are annoyed at shuf.  There are algorithms which can do this without 5GB of memory.
[22:01:09] <Amir1>	 sorry, I thought it seems I need to merge something
[22:01:12] <awight>	 Amir1: you looking for the PR? I just merged
[22:01:16] <Amir1>	 okay
[22:01:54] <Amir1>	 halfak: I'm okay with moving without draftquality
[22:02:00] <halfak>	 kk
[22:02:03] <awight>	 I'd like to shoulder-surf this deploy, lmk
[22:02:06] <halfak>	 Will have a prod config pr soon
[22:02:10] <Amir1>	 we can deploy twice, I can do it tomorrow later
[22:02:20] <halfak>	 awight, call when ready
[22:03:56] <Amir1>	 should we jump into a call?
[22:36:54] <wikibugs_>	 10Scoring-platform-team-Backlog, 10Operations, 10Ops-Access-Requests: Grant AWight accounts on ores production clusters - https://phabricator.wikimedia.org/T168442#3380731 (10awight) 05Resolved>03Open Looks like I'll need shell access to scb1002.eqiad.wmnet, in order to do canary tests while deploying....
[22:49:20] <wikibugs_>	 10Scoring-platform-team: Get Adam all the rights - https://phabricator.wikimedia.org/T168917#3380745 (10Halfak)
[22:49:49] <wikibugs_>	 10Scoring-platform-team-Backlog, 10Operations, 10Ops-Access-Requests: Grant AWight accounts on ores production clusters - https://phabricator.wikimedia.org/T168442#3380759 (10awight) Sounds like I'll need shell on scb[1-2]* and also the ores-admin group, so I can do terrible things on production boxes.
[22:49:51] <wikibugs_>	 10Scoring-platform-team: Get Adam all the rights - https://phabricator.wikimedia.org/T168917#3380760 (10Halfak)
[22:50:20] <wikibugs_>	 10Scoring-platform-team: Get Adam all the rights - https://phabricator.wikimedia.org/T168917#3380745 (10Halfak)
[22:50:23] <wikibugs_>	 10Scoring-platform-team-Backlog, 10Operations, 10Ops-Access-Requests: Grant AWight accounts on ores production clusters - https://phabricator.wikimedia.org/T168442#3380762 (10Halfak)
[22:50:33] <wikibugs_>	 10Scoring-platform-team: Get Adam all the rights - https://phabricator.wikimedia.org/T168917#3380745 (10Halfak)
[22:57:03] <wikibugs_>	 10Scoring-platform-team-Backlog, 10Operations, 10Ops-Access-Requests, 10Patch-For-Review: Grant AWight accounts on ores production clusters - https://phabricator.wikimedia.org/T168442#3380797 (10Ladsgroup) This is the only thing that needs to be done
[22:58:44] <wikibugs_>	 10Scoring-platform-team: Get Adam all the rights - https://phabricator.wikimedia.org/T168917#3380803 (10RobH)
[22:58:48] <wikibugs_>	 10Scoring-platform-team-Backlog, 10Operations, 10Ops-Access-Requests, 10Patch-For-Review: Grant AWight accounts on ores production clusters - https://phabricator.wikimedia.org/T168442#3380801 (10RobH) 05Open>03Resolved Addition to the ores-admins is a sudo group, and thus will require review during the...
[22:59:41] <wikibugs_>	 10Scoring-platform-team: Get Adam all the rights - https://phabricator.wikimedia.org/T168917#3380745 (10RobH)
[22:59:52] <wikibugs_>	 10Scoring-platform-team-Backlog, 10Operations, 10Ops-Access-Requests, 10Patch-For-Review: Grant AWight accounts on ores production clusters - https://phabricator.wikimedia.org/T168442#3380805 (10RobH) 05Resolved>03Open Also no one reopened this when requesting more rights be added, opening it back up now.
[23:01:03] <wikibugs_>	 10Scoring-platform-team-Backlog, 10Operations, 10Ops-Access-Requests, 10Patch-For-Review: Grant AWight accounts on ores production clusters - https://phabricator.wikimedia.org/T168442#3380813 (10awight) My fault, I've been flapping this task like crazy... T168442#3380731  Thanks for taking a look!
[23:03:52] <wikibugs_>	 10Scoring-platform-team-Backlog, 10ORES, 10Easy: ORES 500's on integers that can't be processed - https://phabricator.wikimedia.org/T168920#3380833 (10Halfak)
[23:07:27] <awight>	 So, is this only wmflabs?  
[23:07:27] <awight>	 https://logstash.wikimedia.org/app/kibana#/dashboard/ORES?_g=()&_a=(filters:!(),options:(darkTheme:!f),panels:!((col:1,id:Dashboards,panelIndex:1,row:1,size_x:12,size_y:2,type:visualization),(col:1,id:Events-Over-Time,panelIndex:2,row:3,size_x:9,size_y:2,type:visualization),(col:1,id:Event-Types,panelIndex:3,row:5,size_x:5,size_y:3,type:visualization),(col:6,id:Event-Level,panelIndex:4,row:5,size_x:3,
[23:07:33] <awight>	 size_y:3,type:visualization),(col:1,columns:!(type,level,wiki,host,message),id:Default-Events-List,panelIndex:5,row:8,size_x:12,size_y:25,sort:!(%27@timestamp%27,desc),type:search),(col:10,id:Top-20-Hosts,panelIndex:6,row:3,size_x:3,size_y:2,type:visualization),(col:9,id:Events-Over-Time-By-Channel,panelIndex:7,row:5,size_x:4,size_y:3,type:visualization)),query:(query_string:(analyze_wildcard:!t,query
[23:07:39] <awight>	 :%27type:ores%27)),title:ORES,uiState:(P-2:(vis:(legendOpen:!f)),P-3:(vis:(legendOpen:!f)),P-4:(vis:(legendOpen:!f)),P-6:(vis:(params:(sort:(columnIndex:!n,direction:!n))))))
[23:07:42] <awight>	 barf.
[23:07:44] <awight>	 https://logstash.wikimedia.org/app/kibana#/dashboard/ORES
[23:12:48] <awight>	 I ask because the server errors we just triggered are nowhere to be seen.
[23:16:09] * halfak looks
[23:17:12] <wikibugs_>	 10Scoring-platform-team, 10ORES, 10Easy: ORES 500's on integers that can't be processed - https://phabricator.wikimedia.org/T168920#3380873 (10Halfak)
[23:17:38] <halfak>	 https://github.com/wiki-ai/ores/pull/210
[23:17:43] <halfak>	 {{done}}
[23:17:43] <AsimovBot>	 How efficient, halfak!
[23:17:48] <halfak>	 Damn right AsimovBot 
[23:18:22] <awight>	 wat
[23:18:29] <awight>	 AsimovBot: help
[23:18:29] <AsimovBot>	 Asimov v. 2, By jem (IRC) / -jem- (Wikimedia), 2010-17 - Bot IRC de apoyo a los proyectos y al movimiento Wikimedia programado en PHP - Las órdenes deben escribirse precedidas de alguno de los prefijos admitidos (@!-=) - 13Lista de órdenes: -ord - 13Problemas o sugerencias: -sug - 13Ayuda: -? 15,02orden / -ic - 10http://wikimedia.es/asimov?uselang=en
[23:19:57] <halfak>	 awight, I'm not very familiar with logstash
[23:20:14] <halfak>	 How do you search for type:ores and level:ERROR
[23:20:28] <paladox>	 why does that bot speak which i presume spanish?
[23:20:48] <halfak>	 jem speaks spanish, I guess
[23:21:07] <halfak>	 jem used to hang out around here, but I don't see them much anymore.  But AsimovBot remains. 
[23:21:22] <halfak>	 [[Foobar]]
[23:21:22] <AsimovBot>	 10[1] 04https://meta.wikimedia.org/wiki/Foobar
[23:21:25] <awight>	 halfak: me neither--my uneducated guess is that the server error is falling through all the safety nets and might not be using python's logging settings.
[23:21:43] <halfak>	 Hmm... Certainly possible
[23:21:46] <awight>	 [[es:AsimovBot]]
[23:21:46] <AsimovBot>	 10[2] 04https://es.wikipedia.org/wiki/AsimovBot
[23:22:45] <paladox>	 oh
[23:23:08] <wikibugs_>	 10Scoring-platform-team, 10ORES, 10articlequality-modeling, 10editquality-modeling, 10artificial-intelligence: Rebuild all of the models for ORES (new regexes) - https://phabricator.wikimedia.org/T168889#3379678 (10Halfak) https://github.com/wiki-ai/editquality/pull/77
[23:23:41] <wikibugs_>	 10Scoring-platform-team, 10ORES, 10Easy: ORES 500's on integers that can't be processed - https://phabricator.wikimedia.org/T168920#3380833 (10Halfak) https://github.com/wiki-ai/ores/pull/210
[23:24:45] <awight>	 From T149010: > I am wondering however how we could change the logformat to support more level (like ERROR, WARN) etc.
[23:24:45] <stashbot>	 T149010: Send ORES logs to logstash - https://phabricator.wikimedia.org/T149010
[23:26:50] <wikibugs_>	 10Scoring-platform-team-Backlog: Send error logs to logstash - https://phabricator.wikimedia.org/T168921#3380887 (10awight)
[23:34:19] <awight>	 I wonder why our redis client count is so high?  https://grafana.wikimedia.org/dashboard/db/ores?orgId=1&panelId=22&fullscreen&from=1498515346973&to=1498518706973
[23:35:15] <awight>	 Ah, maybe we have 100 workers per box?...
[23:41:48] <halfak>	 awight, that sounds about right
[23:42:01] <halfak>	 both celery and uwsgi workers get a connection