[00:04:06] hey are there anyone? [00:12:32] yes [00:51:37] MatmaRex: I'm trying to import the whole database of tr.wikipedia.org [00:51:47] I was importing this file: https://dumps.wikimedia.org/trwiki/20170501/trwiki-20170501-pages-articles-multistream.xml.bz2 [00:52:09] tr.wikipedia holds 1.490.460 pages: https://tr.wikipedia.org/wiki/%C3%96zel:%C4%B0statistikler [00:52:50] MatmaRex: But my importing process somehow finished on 148.596 page https://lugat.org/%C3%96zel:%C4%B0statistikler [00:53:35] MatmaRex: this is the last line of importing: 148500 (2.68 pages/sec 2.68 revs/sec) [00:53:51] Then it stopped I don't know if it's crashed or finished. [00:54:30] jackmcbarn: giby_the_kid do you have any idea [00:54:43] How can I import the rest of the data? [00:57:16] wc -l trwiki-20170501-pages-articles-multistream-index.txt tells me: [00:57:27] there are 810114 lines [01:03:50] mertyildiran: pages-articles doesn't contain *all* pages – it contains the content pages, and some other stuff (i'm not exactly sure) [01:04:21] the numbers still don't add up, but i would expect around 292748 pages to be imported, rather than 1490461 [01:04:54] if you want really all of the pages, you need to use the pages-meta dump [01:06:33] i don't know why the importing stopped for you. it sounds like it indeed crashes. [01:06:47] MatmaRex: Which of these https://dumps.wikimedia.org/trwiki/20170501/ [01:07:04] i guess look at the 148.596th page, or the next 148.597th one, and see if there's anything weird about it? i don't hve the dumps downloaded [01:07:07] Are you talking about this dump?: https://dumps.wikimedia.org/trwiki/20170501/trwiki-20170501-pages-meta-current.xml.bz2 [01:07:40] mertyildiran: "All pages, current versions only." trwiki-20170501-pages-meta-current.xml.bz2 [01:07:43] yes [01:07:48] MatmaRex: templates are missing for example [01:07:57] OK great I will import that file. [01:08:09] Is there a way to start importing from nth record? [01:08:11] note that this is also only the current versions, and no edit history… so it really depends on why you're doing this [01:08:31] It seems like it always starting from to first record in the dump [01:08:36] hm, i don't know [01:08:42] I don't need history [01:09:30] MatmaRex: I can't see any "Start from nth file" option in here: https://github.com/wikimedia/mediawiki/blob/master/maintenance/importDump.php [01:09:42] Am I missing something? Can you see anything? [01:10:04] it doesn't seem like there is one :( [01:11:52] *"Start from nth record" I'm sorry [01:12:17] Yeah it's annoying because I think it crashed not finished [01:13:01] MatmaRex: because I get 810114 wc -l trwiki-20170501-pages-articles-multistream-index.txt [01:13:21] So there are actually 810114 records in the dump [01:13:46] Do you know what is the difference between multistream dump and the other dumps? [01:22:34] mertyildiran: i think it's just that it's split into multiple files [02:22:30] No… [09:15:36] Hi [09:15:58] Other than as a terminator for tags, is > at the start of a line used for anything? [09:16:41] Currently : is used (wrongly) as an indent marker on talk pages [09:17:00] It would be nice to have something that's an indent marker specfically [11:18:20] ShakespeareFan00: it isn't and definetly something to consider.. [11:25:44] ShakespeareFan00: related https://phabricator.wikimedia.org/T6521 [11:26:55] ShakespeareFan00: I do note however that > is often used for quotations, not for indentation. as such might be a bit confusing [11:27:15] Wouldn't quotations be indented anyway... [11:27:17] ? [11:27:59] the use of > for quotes is a carry over from when certain early e-mail clients had what was effectively in-data headers [11:28:03] i mean in other types of media > is used for quotation. think mail, but also most MD forms [11:28:13] MD? [11:28:16] markdown [11:28:21] Ah.. [11:28:39] Also ">" used to be the terminal prompt on many systems [11:28:41] so > corresponds more to
[11:29:09] In any event what I was asking about already seems to be in phabricator [11:30:04] Another markup issue, is starting a list at an arbitary value, or indicating one should be continued... [11:30:17] As you know wikimarkup was originally for single pages [11:30:24] ShakespeareFan00: i agree we need to fix : somehow though. It's just bad to have that output
's maybe it's about time we fix that. I think we would have the ability to make that happen nowadays. [11:30:54] ShakespeareFan00: there is a ticket for that too. For which the current approach is "this is so rare, you can use raw html for it". [11:31:10] thedj: On Wikipedia maybe [11:31:27] On Wikisource with ProofreadPage/LST it's more common than you would think [11:32:05] On a simmilar note tables that span multilpe LST'ed sections get hideously complex to code... [11:32:45] how so ? [11:33:03] Multi-page tables... [11:33:30] You have to put some of the table syntax inside on the indvidual pages [11:33:33] but why is it complex ? Because the header is on the previous page or something ? [11:33:47] The header is a "ribbon" on the pages [11:34:03] When showing the indvisual pages for proofreading, it's in the header on each PAGE [11:34:11] ah, you want a single table, but still split it over multiple wikicode pages. [11:34:17] Exactly [11:34:48] do you have a good example link for that ? [11:35:31] ShakespeareFan00: I've recently been looking at some things that print markup systems (like latex) still are better at than HTML and this is a nice addition to that. [11:38:51] thedj : Not a good example, but https://en.wikisource.org/wiki/Chronological_Table_and_Index_of_the_Statutes/Chronological_Table/Hen3 is one such work where you have multiple page combined [11:39:14] (and subsequent entries). That it works at all suprised me :) [13:33:25] Help [13:33:33] \help [13:33:39] Wtf [14:59:01] hi [14:59:33] the math API at api.formulasearchengine.com has been down for a few days, are there plans to fix it or i'm on my own? [15:01:54] We don't maintain that... [15:02:27] who maintains it? [15:04:10] It's MediaWiki (and related) software [15:04:12] But we don't maintain it [15:04:18] And we don't keep lists of who maintains what [16:29:31] having problems with VE installed but not working. plz help if you're familiar with VE setup https://www.mediawiki.org/wiki/Topic:Tquqwezxalce1wfl [22:38:54] is this the right place for help with wiki markup? [22:51:05] Hi KataiKou. [22:51:06] Sure. [22:52:28] It's kind of simple, but kind of messy? I'm trying to make a table that's a perfect grid, except for a title bar at the top. Is there an easy way to do this without doing "height" "width" on every row/column? [22:54:04] (this gets messier because I want to have some cells span columns, but still have everything grid aligned) [23:01:37] KataiKou: you can try style="table-layout: fixed;" (this is just CSS, not a MediaWiki feature) [23:01:57] KataiKou: but it will be a *perfect* grid if you do so. it won't adjust to the width of the contents of the table cells. [23:03:14] that's fine. Pretty much what I'm looking for. What determines the row/col sizes?