[17:35:07] ello Amir1 [17:35:16] want to work on something in the hack session? :) [18:27:42] O_o [18:27:47] O_o [18:52:07] hey ToAruShiroiNeko, I was afk [18:52:25] I noticed :) [18:53:01] What Can I do? I can't promise [18:53:27] well, I was hoping you to generate stop words locally [18:53:34] say 1000 or 1500 of them per wiki [18:53:57] once you have a library of 1000-1500 of them per wiki you can run tf-idf on this new list [18:54:03] to find words common in all wikis [18:54:28] 1500 because we will get local stop words as well as many many on wiki specific ones [18:54:42] on the final list we would take the top 250 or 500 or something [18:55:07] you wanted to work on this on tuesday [18:55:23] but the tf=idf approach is already there I think [18:55:36] and would take uite a bit of time to run them [18:56:22] hmm [18:56:22] ok [18:59:32] I am holding back on generating language models just yet [18:59:39] since I think this will have good signal gain [19:15:34] oh I am reading through your wikidata mail [19:21:01] kian looks like somehting uite exciting. :) [19:26:01] :)