<?xml version="1.0" encoding="UTF-8" ?>
<rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wikidot="http://www.wikidot.com/rss-namespace">

	<channel>
		<title>Unicode page titles allowed in the future?</title>
		<link>http://community.wikidot.com/forum/t-2839/unicode-page-titles-allowed-in-the-future</link>
		<description>Posts in the discussion thread &quot;Unicode page titles allowed in the future?&quot; - Is it possible to allow Unicode characters in the page titles?</description>
				<copyright></copyright>
		<lastBuildDate></lastBuildDate>
		
					<item>
				<guid>http://community.wikidot.com/forum/t-2839#post-57572</guid>
				<title>Re: Unicode page titles allowed in the future?</title>
				<link>http://community.wikidot.com/forum/t-2839/unicode-page-titles-allowed-in-the-future#post-57572</link>
				<description></description>
				<pubDate>Sat, 13 Oct 2007 19:15:28 +0000</pubDate>
				<wikidot:authorName>BobChao</wikidot:authorName>				<wikidot:authorUserId>40343</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>Please fix this bug, it's a user-stopper for us. Users want to use Unicode (in our case, Chinese) characters both for page title and (escaped) URL, just like what MediaWiki can do.</p> 
				 	]]>
				</content:encoded>							</item>
					<item>
				<guid>http://community.wikidot.com/forum/t-2839#post-7087</guid>
				<title>Re: Unicode page titles allowed in the future?</title>
				<link>http://community.wikidot.com/forum/t-2839/unicode-page-titles-allowed-in-the-future#post-7087</link>
				<description></description>
				<pubDate>Wed, 17 Jan 2007 21:31:22 +0000</pubDate>
				<wikidot:authorName>MilchFlasche</wikidot:authorName>				<wikidot:authorUserId>6241</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>Then currently how about adding some documentation about the effect of non-ASCII titles when creating a content page or a forum thread to notify users using non-Western languages? :) I guess this might help them to name their page first in English, and know that they can change the title later :) While people might already know that they should name their content pages in Unix-form, but there seems to be no hint about this yet when people create a forum thread :)</p> <p>My regards!</p> 
				 	]]>
				</content:encoded>							</item>
					<item>
				<guid>http://community.wikidot.com/forum/t-2839#post-7047</guid>
				<title>Re: Unicode page titles allowed in the future?</title>
				<link>http://community.wikidot.com/forum/t-2839/unicode-page-titles-allowed-in-the-future#post-7047</link>
				<description></description>
				<pubDate>Wed, 17 Jan 2007 12:15:58 +0000</pubDate>
				<wikidot:authorName>michal frackowiak</wikidot:authorName>				<wikidot:authorUserId>1</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>I think this is a good idea but currently it would be quite difficult to implement… I hope in the future… Quite many people ask for better support for non-ascii characters so finally some solution will have to be implemented.</p> 
				 	]]>
				</content:encoded>							</item>
					<item>
				<guid>http://community.wikidot.com/forum/t-2839#post-6563</guid>
				<title>Re: Unicode page titles allowed in the future?</title>
				<link>http://community.wikidot.com/forum/t-2839/unicode-page-titles-allowed-in-the-future#post-6563</link>
				<description></description>
				<pubDate>Fri, 12 Jan 2007 02:58:01 +0000</pubDate>
				<wikidot:authorName>MilchFlasche</wikidot:authorName>				<wikidot:authorUserId>6241</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>Not feasible?</p> 
				 	]]>
				</content:encoded>							</item>
					<item>
				<guid>http://community.wikidot.com/forum/t-2839#post-6141</guid>
				<title>Re: Unicode page titles allowed in the future?</title>
				<link>http://community.wikidot.com/forum/t-2839/unicode-page-titles-allowed-in-the-future#post-6141</link>
				<description></description>
				<pubDate>Sat, 06 Jan 2007 09:14:22 +0000</pubDate>
				<wikidot:authorName>MilchFlasche</wikidot:authorName>				<wikidot:authorUserId>6241</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>There are tens of thousands of Hanzi, shared by Chinese, Japanese, Korean and many other Sinitic languages, and all these languages have all their own pronunciation of the Hanzi. So it's really burdensome trying to transcode Hanzi into romanzations, because it won't satisfy everyone :( So I guess we don't have to expect the transcoders to work on CJK languages. It would be natural inconvenience for East Asia people to name their web pages; blame that it's the Westerners who have invented computers :p</p> <p>I myself can accept that we name the pages with meaningful English phrases, but there is one trouble: people might not know they have to do this when they type "the title in the original language" in the "add new page" field; and if they type "20070105：開站感言" like I did, they might not notice that the page name has been cut to "20070105" only, and more pages with "20070105…" would result in confusing (and conflicts?). Such problem also happened when I add a new thread on my forum: I enter the Chinese post title, but not until I have saved did I know that the Chinese part in my title were all gone, because the system automatically takes the title as the page name, and ignores non-alphanumeric characters at all, so the page name could be much less meaningful than expected. Furthermore, if I have entered a title full of Chinese characters? Would there even be anything for the page name? Hmm, I should try this later.</p> <p>So for the East Asia trouble, here are my humble suggestions:</p> <ul> <li>Allowing people to name the pages with multibyte characters, and transcode them into <tt>%xx%xx%xx</tt> as wikis like Wikipedia do. Although the page names would be impossible to understand by any human, but East Asians won't need to translate their mental page name into English, and they can directly link to these pages with [[[漢字]]](Hanzi), without any need of aliases such as [[[Hanzi|漢字]]] <ul> <li>In this case, maybe an option for people to choose to keep the current way (suitable for languages written in alphabetics), or transcode their page names always.</li> </ul> </li> <li>Or if you don't wish to break the semanticity of current Roman characters (no transcoding into <tt>%xx%xx</tt>), then maybe when multibyte characters are typed in the "add new page" field or "post title" field (i.e. whenever it would affect the page name), a bubble or some warning pops up to notify users that what they type are illegal names and could be ignored or truncated, and suggests them to name the page in Western characters first and key in their preferred title later in another field. <ul> <li>For the content pages, page name and title are separated fields, so at least multibyte characters can be kept in the second field. But such distinction is missing in the forum threads! So could we do a separation of page names and titles for forum thread too? (But this option can be turned off if Western users don't need it)</li> </ul> </li> </ul> <p>Personally I prefer the first way because it's simpler for East Asians (although those "<tt>%xx%xx%xx</tt>" thing are really ugly! :wink: ) , because they don't have to create and memorize two sets of page names and titles, and it's much easier for them to make internal links; or the <span style="text-decoration: underline;"><a href="http://zh.wikipedia.org/" >Chinese Wikipedia</a></span> won't be able to reach 100 thousand of articles!</p> <p>What do you think?</p> 
				 	]]>
				</content:encoded>							</item>
					<item>
				<guid>http://community.wikidot.com/forum/t-2839#post-6082</guid>
				<title>Re: Unicode page titles allowed in the future?</title>
				<link>http://community.wikidot.com/forum/t-2839/unicode-page-titles-allowed-in-the-future#post-6082</link>
				<description></description>
				<pubDate>Fri, 05 Jan 2007 09:53:24 +0000</pubDate>
				<wikidot:authorName>michal frackowiak</wikidot:authorName>				<wikidot:authorUserId>1</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>Hi,</p> <p>indeed, you have found the solution ;-) I will try to fix the <em>add new page</em> to copy the original title.</p> <p>There are several reasons to keep <em>unix names</em> with only letters and numbers — URL addresses are much better readable instead of %3B%4A%20… The problem for Chinese, Korean, Japanese etc. is that you have to use alphanumeric values for unix names…</p> <p>This works perfectly for character sets where only some characters are specific — the "name unixifier" transcodes special characters into alphanumeric, e.g. ą -&gt;a, ć -&gt; c etc. But I have no idea how to do this e.g. for Chinese…</p> <p>Any ideas? I believe one can live with how it works now but if there is a nice solution — it would be worth considering!</p> <p>michal</p> 
				 	]]>
				</content:encoded>							</item>
					<item>
				<guid>http://community.wikidot.com/forum/t-2839#post-6037</guid>
				<title>Re: Unicode page titles allowed in the future?</title>
				<link>http://community.wikidot.com/forum/t-2839/unicode-page-titles-allowed-in-the-future#post-6037</link>
				<description></description>
				<pubDate>Fri, 05 Jan 2007 04:52:53 +0000</pubDate>
				<wikidot:authorName>MilchFlasche</wikidot:authorName>				<wikidot:authorUserId>6241</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>Oh! Please pardon my carelessness! Now I know that multibyte characters are not allowed in the "<strong>unix name</strong>" of documents, so they are ignored when we "<strong>add a new page</strong>" through the module; b<span style="text-decoration: underline;">ut they can be typed into the titles</span>. Sorry, my bad :p</p> <p>Solved :)</p> <hr /> <p><span style="color: #FF1493;">Oh! I really like these AJAX and Javascript-powered interfaces! Full of surprises everywhere!</span></p> 
				 	]]>
				</content:encoded>							</item>
					<item>
				<guid>http://community.wikidot.com/forum/t-2839#post-6030</guid>
				<title>Unicode page titles allowed in the future?</title>
				<link>http://community.wikidot.com/forum/t-2839/unicode-page-titles-allowed-in-the-future#post-6030</link>
				<description></description>
				<pubDate>Fri, 05 Jan 2007 04:27:18 +0000</pubDate>
				<wikidot:authorName>MilchFlasche</wikidot:authorName>				<wikidot:authorUserId>6241</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>Hello Michal,</p> <p>I really like this wiki engine of yours! I thought only TiddlyWiki, DokuWiki or MediaWiki could satisfy my needs, and hosted wiki services are never to be perfect. But today when I found here, I really can't imagine that there is still such a great service and elegant piece of work on earth!</p> <p>So even no localization of Chinese yet, I still registered right away and have just created my own site here. But then I found something pitiful: <span style="text-decoration: underline;">the page titles are still ignoring Unicode characters such as Chinese Hanzi now.</span> I'm here just to ask that do you have any plans to implement more support for Unicode? Because it would always be more convenient to display the page title in one's own language instead of English. (But at least multibyte contents are well displayed in the source, which is <strong>way better than XWiki!</strong>) Take DokuWiki for example, it allows multibyte page names, and <span style="text-decoration: underline;">encode them into things like <tt>%3B%4A%20</tt> for the real file name</span>. If Wikidot can do this, then it could attract more non-Western users I think :)</p> <p>But before the implementation, hundreds of thanks are still presented to you.</p> <p>My best regards,</p> <p>MilchFlasche</p> 
				 	]]>
				</content:encoded>							</item>
				</channel>
</rss>