[10:15:57] hello halfak
[10:16:00] how is you today
[10:16:05] Hey ToAruShiroiNeko
[10:16:29] Main source of my preoccupation is over :3
[10:16:41] Oh!?
[10:16:42] So I intend to return back :)
[10:17:05] Yes, the mayorate people finally read their snail mail. I did push them to it, but hey, the outcome is all I care about at this point. :)
[10:17:16] It has only taken since December
[10:17:19] :p
[10:17:26] anywho, I had a question for you
[10:17:41] last time I was involved you guys were working on language identification
[10:18:30] Hmm... not sure what you mean by "language identification"
[10:18:39] identifying if a post is in Chinese or English
[10:19:01] Oh! So we might apply the apt model to, say, a meta page/edit?
[10:19:02] etc. like when foreign languages are posted on a wiki not in that language
[10:19:13] I had a different idea
[10:19:19] Oh yeah. That was an idea I had, yeah. We haven't implemented it
[10:19:28] I gained access to OTRS and it is kind of chaotic
[10:19:36] posts don't have language identification etc.
[10:19:54] if someone sends a permission email to -en it may remain lost in there for eons too
[10:20:03] Seems like we could identify language pretty easily based on a small set of features.
[10:20:04] there is considerable spam as well
[10:20:18] yeah, even a dictionary-specific or character-set-specific approach would do wonders
[10:21:05] emails coming to OTRS tend to be classified and sent to the relevant email address
[10:21:18] but if something is sent to the wrong email address it isn't identified as such
[10:21:28] moreover a good chunk of emails are spam
[10:21:46] I have been offered so many jobs but no viagra.
[10:21:47] :(
[10:28:37] Revision-Scoring-As-A-Service, rsaas-articlequality : Generate monthly article quality dataset - https://phabricator.wikimedia.org/T145655#2637137 (Halfak)
[10:29:15] Revision-Scoring-As-A-Service, rsaas-articlequality : Generate monthly article quality dataset - https://phabricator.wikimedia.org/T145655#2637137 (Halfak) All datasets are here: https://datasets.wikimedia.org/public-datasets/all/wp10/20160801/ I'm traveling so it's hard to upload to figshare. I'll do...
[10:34:00] halfak so I think it would be interesting to have an internal system for OTRS
[10:34:06] since OTRS is private info
[10:39:09] ToAruShiroiNeko, I'm at a conference, so I'll be in and out today.
[10:39:20] ah ok
[10:39:22] you enjoy that
[10:39:31] get some coffee for me too :3
[13:13:12] Revision-Scoring-As-A-Service, MediaWiki-extensions-ORES, Patch-For-Review: Introduce ORES rvprop - https://phabricator.wikimedia.org/T143614#2699255 (Ladsgroup)
[13:13:34] Revision-Scoring-As-A-Service, MediaWiki-extensions-ORES, MW-1.28-release-notes, Patch-For-Review, WMF-deploy-2016-10-11_(1.28.0-wmf.22): Introduce rcshow=oresreview and similar ones - https://phabricator.wikimedia.org/T143616#2699256 (Ladsgroup)
[13:14:09] Revision-Scoring-As-A-Service, MediaWiki-extensions-ORES, Patch-For-Review: Expose ores_model data in API using meta=ores - https://phabricator.wikimedia.org/T143617#2699269 (Ladsgroup)
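
Editor's note: the character-set and dictionary idea floated in the 10:20 exchange could be sketched roughly as below. This is only an illustration under stated assumptions, not anything the participants implemented: the function names, the tiny stopword lists, and the language codes are all hypothetical, and treating any CJK-dominant text as Chinese is a deliberate simplification (Japanese text also contains CJK ideographs).

```python
import re
import unicodedata
from collections import Counter

# Hypothetical, deliberately tiny stopword lists (assumption for illustration).
STOPWORDS = {
    "en": {"the", "and", "is", "of", "to", "in"},
    "de": {"der", "die", "und", "ist", "das", "nicht"},
    "fr": {"le", "la", "et", "est", "les", "des"},
}


def dominant_script(text):
    """Return the most common script prefix (e.g. 'LATIN', 'CJK') or None."""
    counts = Counter()
    for ch in text:
        if ch.isalpha():
            # The first word of the Unicode character name is a rough script tag.
            name = unicodedata.name(ch, "")
            if name:
                counts[name.split()[0]] += 1
    return counts.most_common(1)[0][0] if counts else None


def guess_language(text):
    """Guess a language code from character set first, then stopword hits."""
    script = dominant_script(text)
    if script == "CJK":
        return "zh"  # simplification: ignores Japanese and other CJK users
    if script != "LATIN":
        return None  # no rule for this script in the sketch
    words = re.findall(r"[^\W\d_]+", text.lower())
    hits = {lang: sum(w in vocab for w in words) for lang, vocab in STOPWORDS.items()}
    best = max(hits, key=hits.get)
    return best if hits[best] > 0 else None
```

A call such as `guess_language(email_body)` would give a coarse label that a router could compare against the queue an email arrived in; anything returning `None` would still need a human or a stronger model.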