[00:03:04] Hercule Hercule * Re: [Toolserver-l] Encoding issue using SGE [02:35:04] Hazard-SJ * Re: [Toolserver-l] TS web broken [02:41:46] [[Special:Log/newusers]] create 10 * Biengoi2 * (New user account) [04:27:57] 2013/05/23 04:20 WARN cassia /sql/data/dewiki DISK WARNING - free space: /sql 247850 MB (20% inode=99%): [04:54:02] 2013/05/23 04:52 OK cassia /sql/data/dewiki DISK OK - free space: /sql 252023 MB (21% inode=99%): [07:15:04] Dr. Trigon * Re: [Toolserver-l] TS web broken [08:13:04] Merlijn van Deen * Re: [Toolserver-l] Encoding issue using SGE [11:08:20] 2013/05/23 11:03 CRIT ortelius toolserver.org HTTP HTTP CRITICAL: HTTP/1.1 200 OK - 239 bytes in 8.522 second response time [11:10:20] 2013/05/23 11:09 OK ortelius toolserver.org HTTP HTTP OK: HTTP/1.1 200 OK - 239 bytes in 0.018 second response time [11:17:21] 2013/05/23 11:14 CRIT ortelius toolserver.org HTTP HTTP CRITICAL: HTTP/1.1 200 OK - 239 bytes in 8.462 second response time [11:23:22] 2013/05/23 11:22 OK ortelius toolserver.org HTTP HTTP OK: HTTP/1.1 200 OK - 239 bytes in 0.070 second response time [11:31:05] Patricia Pintilie * Re: [Toolserver-l] Encoding issue using SGE [11:31:06] Patricia Pintilie * Re: [Toolserver-l] TS web broken [11:31:06] Patricia Pintilie * Re: [Toolserver-l] Introducing the new toolserver admin [11:32:05] Patricia Pintilie * Re: [Toolserver-l] SGE fail to run any of my tasks, they error out as "cgroup change of group failed" [11:32:05] Patricia Pintilie * Re: [Toolserver-l] Status of the toolserver [11:34:22] 2013/05/23 11:30 CRIT ortelius toolserver.org HTTP HTTP CRITICAL: HTTP/1.1 200 OK - 239 bytes in 2.412 second response time [11:37:22] 2013/05/23 11:36 OK ortelius toolserver.org HTTP HTTP OK: HTTP/1.1 200 OK - 239 bytes in 0.008 second response time [12:34:40] hey [12:41:04] Hercule Hercule * Re: [Toolserver-l] Encoding issue using SGE [12:49:27] 2013/05/23 12:48 OK ha-www HTTP svn HTTP OK: HTTP/1.1 200 OK - 274 bytes in 0.025 second response time [12:52:04] Marlen Caemmerer * Re: [Toolserver-l] TS web broken [13:16:04] Merlijn van Deen * Re: [Toolserver-l] Encoding issue using SGE [13:49:04] Dr. Trigon * Re: [Toolserver-l] TS web broken [14:08:04] Jeremy Baron * [Toolserver-l] Patricia Pintilie [14:09:48] hi [14:09:50] anyone around? [14:10:13] can I get an article count on pictures on penises etc on commons versus the total number of files [14:10:29] total number of files is readily available I know but the other isnt [14:10:47] My goal is to disprove the notion that commons is full of porn. [14:14:21] hah [14:14:34] can you better define what you want? [14:15:04] Lars Aronsson * Re: [Toolserver-l] Encoding issue using SGE [14:21:20] jeremyb I can try [14:21:31] the categories that have nudity or genetelia [14:21:39] breats for women too [14:21:59] if you can give me the counts of each of those (and subcats when relevant) it would be helpful [14:22:31] I'd also like a few example non-problematic categories to illustrate how many images some categories have [14:22:39] to be able to say [14:22:58] "yes we have n many images of penises but we also have m many images of roosters" [14:23:13] both are cocks :p [14:23:14] the counts of all members or just of images or non-images or what? [14:23:22] make me a list of cats [14:23:45] if it could be a tree with sums of children in ()'s would be nice [14:23:46] like [14:24:10] Category:Penis 50 (90) [14:24:18] what are 50 and 90? [14:24:31] -Category:Penises with condoms 40 [14:24:48] 50 is the number of files in that category alone ()'s incudles subcats [14:24:48] ok [14:24:57] to get a good idea on content [14:25:09] some subcats would be removed manually probably [14:25:23] you somehow end up with non problematic content , its commons :p [14:25:26] well i'm not going to commit to it and others should feel to take the job. but maybe I will tonight (US time) [14:25:30] if no one gets to it first [14:25:38] ook [14:25:41] I can provide a list of cats [14:25:46] please do [14:25:58] bbl [14:26:17] ToAruShiroiNeko, a list of cats? How many do you have? :p [14:26:37] I think I can handle with a few [14:26:38] http://commons.wikimedia.org/wiki/Category:Genitalia_by_gender [14:26:46] obviously it has animal category too [14:27:07] ooooook. [14:27:31] I just saw the rest of the thread. :/ [14:28:00] which may be interesting to have the numbers on but the root totals should reflect that maybe [14:28:22] http://commons.wikimedia.org/wiki/Category:Nudity_or_partial_nudity [14:29:04] these two should cover breasts too [14:29:16] not sure if they overlap at some point [14:29:28] WTF?!? [14:30:07] we're on a toolserver channel and you're talking about breasts and penises? [14:30:21] sure, because people want to mass delete them [14:30:43] because "commons is full of porn" I want to demonstrate that with numbers [14:31:07] My assumption is that a minority of images on commons is like that [14:31:39] No. Porn is a bunch of videos of some kind of intercourse. A dick will always be a dick. [14:31:53] I know that [14:32:07] Therefore, commons is not full of porn. [14:32:16] It's full of penises. [14:32:18] :p [14:33:17] full of dicks you mean? :D [14:33:26] yep. [14:33:28] :D [14:35:04] Daniel Schwen * Re: [Toolserver-l] Patricia Pintilie [14:37:18] Cyberpower678 for the past 7 years time and time again people wanted to remove all nudity from commons with this argument [14:37:27] I just want to reflect it objectively [14:38:21] I agree porn has no place in commons. I can't find any, so there's no problem. [14:39:24] #wikimedia-commons [14:39:56] Wouldn't this conversation be better had over at commons? [14:46:48] http://commons.wikimedia.org/wiki/Category:Sex [14:47:04] Cyberpower678 I am asking for toolserver query help [14:47:12] to count the files [14:47:58] ToAruShiroiNeko: what specifically would you like a count of? [14:49:04] Hersfold Wikipedia * Re: [Toolserver-l] Patricia Pintilie [15:05:04] What would you like a count of? [15:27:00] Cyberpower678: i think he said it pretty clearly? only question i had left was whether only to include File NS or other cat members too [15:27:06] ToAruShiroiNeko: cat list? [15:35:20] Does someone know if there is a SGE resource I can request so that only one of my scripts is run at any time? (Serial instead of parallel?) [15:45:14] scfc_de: I think you can set limits on resources, let me look it up [15:46:31] scfc_de: according to https://wiki.toolserver.org/view/Job_scheduling#Optional_resources you could use the user_slot parameter for it? [15:47:25] as far as I understand the documentation, it'll only apply to scripts that actually request that resource [15:51:05] Nettrom: Ah! I should have RTFM. I only googled for SGE + user resources, came up empty and instead spread some jobs via "qsub -a $TIMESTAMP". Thanks for pointing out user_slot. [17:02:55] jeremyb sure [17:03:03] 3 cats I think are a good start [17:03:22] Category:Sex Category:Nudity_or_partial_nudity Category:Genitalia_by_gender [17:03:35] subcats have bulk of the content I think [17:03:38] they may overlap [17:23:58] sure, overlap is expected [17:24:06] how to do sums then? [18:33:04] Hercule Hercule * Re: [Toolserver-l] Encoding issue using SGE [19:15:41] jeremyb, I think you would need to join with the actual list of files [19:20:55] Platonides: i meant mathematically not SQL [19:21:08] double count? dedupe? how? [19:21:13] https://github.com/mzmcbride/database-reports/blob/master/reports/__init__.py has a function to find all subcategories, and somewhere in there there's probably a report that does something similar that could be adapted. But I don't think that this topic will be discussed rationally, so I wouldn't invest in it :-). People opposed to "porn" in Commons will probably have fundamental issues with it and not say, "x % is okay, but not [19:21:13] more". [19:21:33] jeremyb: If you join the actual list of files, that dedupes it. [19:21:54] then how do you choose which cat to assign? [19:22:18] anyway, back to work [19:22:44] I am trying to run asubcategory list [19:22:48] but it's sloow... [19:23:10] jeremyb: I thought the purpose of the enterprise was to get a number of files under a category. Why do you need to assign a category? [19:23:21] "a number" = "the number". [19:26:05] Platonides: Yes, once I looked into using CatScan's cache, but couldn't find it. On Tools, we should set up something more usable. In most cases, the level of "certainty" that CatScan provides (x hours old results, only y levels deep) is good enough. There's also still the GraphServ thingy that WMDE once set up. [19:32:04] Hercule Hercule * Re: [Toolserver-l] Encoding issue using SGE [19:58:12] see this porn category: http://commons.wikimedia.org/wiki/Category:Unidentified_Nudibranchia [19:58:46] there are 8209 subcategories of Nudity_or_partial_nudity [19:58:56] including eg. http://commons.wikimedia.org/wiki/Category:Saints_Peter_and_Paul_churches_in_Latvia [20:01:41] Platonides: So they are right: Commons is full of porn :-). [20:07:32] I see 152090 files below Nudity_or_partial_nudity [20:07:50] including monuments and churches [20:08:27] 152090 / 17086740 = 0.89% [20:12:38] the three category trees contain 192678 files = 1.127% of commons [20:12:46] ToAruShiroiNeko: ^ [20:12:57] 14009 subcategories in total [20:13:12] but there must be some derivation to non-sex categories [20:16:34] ToAruShiroiNeko, are you interested in that list? [20:17:20] this is porn? :S http://commons.wikimedia.org/wiki/Category:Bus_Stop_%28film%29 [20:18:44] completely pornographic: http://commons.wikimedia.org/wiki/:Category:Self-portraits_by_Tintoretto [20:20:36] so many self-portraits are porn because they are inside [[Category:Beards in art]] [20:23:23] So: Nudity_or_partial_nudity -> Eroticism -> Human_surface_anatomy -> Male_faces -> Faces_of_men_in_art -> Beards in art -> Self-portraits_by_Tintoretto [20:25:42] what's pornographic in http://commons.wikimedia.org/wiki/Category:Beyonc%C3%A9_Knowles ? [20:49:15] Beyoncé_Knowles <- Women_with_long_hair <- Female_long_hair <- Human_surface_anatomy <- Eroticism <- Nudity_or_partial_nudity [20:50:50] Human_surface_anatomy has 8826 subcategories [20:59:08] this makes no sense [21:00:13] Nudity_or_partial_nudity has 8210 subcats, and its subcat Human_surface_anatomy has more?? (8826) [22:41:05] Platonides * [Toolserver-l] Toolserver db outperformed by labs [23:03:04] Tim Landscheidt * Re: [Toolserver-l] Toolserver db outperformed by labs