<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Why is there no pronunciation-based sort for Japanese?</title><link>http://blogs.msdn.com/b/michkap/archive/2004/12/27/332618.aspx</link><description>(From the Suggestion Box ) 
 When people start looking at East Asian languages, they notice that most of the regions have a sort based on pronunciation: Korea has a sort based on the Hangul pronunciation of the Hangul and Hanja codepoints, Taiwan has</description><dc:language>en-US</dc:language><generator>Telligent Evolution Platform Developer Build (Build: 5.6.50428.7875)</generator><item><title>A&amp;P of Sort Keys, part 11 (aka It's not like ideographic sorts were developed idiopathically)</title><link>http://blogs.msdn.com/b/michkap/archive/2004/12/27/332618.aspx#5095017</link><pubDate>Mon, 24 Sep 2007 13:54:39 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:5095017</guid><dc:creator>Sorting It All Out</dc:creator><description>&lt;p&gt;Previous posts in this series: Part 0: The empty string sorts the same in every language Part 1: The&lt;/p&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=5095017" width="1" height="1"&gt;</description></item><item><title>Knock knock! Who's there? Kana! Kana Who?</title><link>http://blogs.msdn.com/b/michkap/archive/2004/12/27/332618.aspx#423712</link><pubDate>Wed, 01 Jun 2005 07:46:32 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:423712</guid><dc:creator>Sorting It All Out</dc:creator><description>You Kana wonder how we order Japanese strings? :-)&lt;br&gt;Some time yesterday, one of the testers over on the...&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=423712" width="1" height="1"&gt;</description></item><item><title>re: Why is there no pronunciation-based sort for Japanese?</title><link>http://blogs.msdn.com/b/michkap/archive/2004/12/27/332618.aspx#352680</link><pubDate>Fri, 14 Jan 2005 04:15:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:352680</guid><dc:creator>Michael Kaplan</dc:creator><description>I also found an interesting article about all of this, including more info on Kanji and the readings thereof, at &lt;a target="_new" href="http://www.nationmaster.com/encyclopedia/Kanji"&gt;http://www.nationmaster.com/encyclopedia/Kanji&lt;/a&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=352680" width="1" height="1"&gt;</description></item><item><title>The Fourth Reading</title><link>http://blogs.msdn.com/b/michkap/archive/2004/12/27/332618.aspx#352662</link><pubDate>Fri, 14 Jan 2005 03:47:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:352662</guid><dc:creator>Curt Sampson</dc:creator><description>Don't forget the fourth method of reading: &amp;quot;special&amp;quot; readings. These are basically arbitrary. For example, the characters for &amp;quot;one&amp;quot; and &amp;quot;day,&amp;quot; 一日, can, and not infrequently is, read using on-readings for both characters: &amp;quot;ichi nichi.&amp;quot; In this case it expresses the idea of a length of time one day long. However, when you're using it to express the idea of &amp;quot;first day&amp;quot; (e.g., the first day of the month), you pronounce it &amp;quot;tsuitachi.&amp;quot; This bears no relation to the on or kun readings of either character. Moreover, it's a single word; you can't for example assign the &amp;quot;tsui&amp;quot; to 一 and the &amp;quot;tachi&amp;quot; to 日; the reading is valid only for the two characters in combination.&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=352662" width="1" height="1"&gt;</description></item><item><title>re: Why is there no pronunciation-based sort for Japanese?</title><link>http://blogs.msdn.com/b/michkap/archive/2004/12/27/332618.aspx#333028</link><pubDate>Tue, 28 Dec 2004 01:47:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:333028</guid><dc:creator>Michael Kaplan</dc:creator><description>Chris -- I agree, by all means!&lt;br&gt;&lt;br&gt;But a solution that put it in and let you override it is superior to one that forcess you to always type it, even though it may well be what you just typed in order to enter the string in the first place.&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=333028" width="1" height="1"&gt;</description></item><item><title>re: Why is there no pronunciation-based sort for Japanese?</title><link>http://blogs.msdn.com/b/michkap/archive/2004/12/27/332618.aspx#332834</link><pubDate>Mon, 27 Dec 2004 18:37:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:332834</guid><dc:creator>Chris Pearce</dc:creator><description>One of the things you will need to be aware of if you are going to use the IME input as a reading for the kanji is that you're going to have times when that input is not correct.&lt;br&gt;&lt;br&gt;Sometimes for place and people names, or other words that don't want to come up nicely with the IME, it's easier to type in alternative readings for a kanji and select it from the list.&lt;br&gt;&lt;br&gt;Take for example '真乗院', a minor temple in Tokyo.  It doesn't come up in the IME when you input 'しんじょういん(shinjouin)', but typing in the readings for individual kanji 'ma', 'noru' (delete the ru after you select the kanji) and 'in'.  But now 'Manoin' would sort completely differently than 'Shinjouin.'&lt;br&gt;&lt;br&gt;Little IME tricks like this could play havoc if you don't give people an opportunity to correct their kanji readings.&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=332834" width="1" height="1"&gt;</description></item><item><title>re: Why is there no pronunciation-based sort for Japanese?</title><link>http://blogs.msdn.com/b/michkap/archive/2004/12/27/332618.aspx#332628</link><pubDate>Mon, 27 Dec 2004 09:59:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:332628</guid><dc:creator>Michael Kaplan</dc:creator><description>In both Windows XP and Windows Server 2003, the help for the Japanese IME include keystrokes that can be used to revert a converted string or character to a reading, which is what I think you are talking about.&lt;br&gt;&lt;br&gt;I was thinking of an application that would do this behind the scenes so if you type in a name it would get the pronunciation and stick it in a &amp;quot;reading&amp;quot; field. No extr work required from the user!&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=332628" width="1" height="1"&gt;</description></item><item><title>re: Why is there no pronunciation-based sort for Japanese?</title><link>http://blogs.msdn.com/b/michkap/archive/2004/12/27/332618.aspx#332623</link><pubDate>Mon, 27 Dec 2004 09:31:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:332623</guid><dc:creator>Norman Diamond</dc:creator><description>&amp;gt; In theory, this is something an application&lt;br&gt;&amp;gt; can do when a name&lt;br&gt;&lt;br&gt;a name or anything else&lt;br&gt;&lt;br&gt;&amp;gt; is typed when the IME mode is based on&lt;br&gt;&amp;gt; pronunciation; this is the one time that the&lt;br&gt;&amp;gt; pronunciation information is present without&lt;br&gt;&amp;gt; it being queried separately -- during the&lt;br&gt;&amp;gt; composition phase.&lt;br&gt;&lt;br&gt;Agreed.&lt;br&gt;&lt;br&gt;&amp;gt; As far as I know, this is not something that&lt;br&gt;&amp;gt; is done right now&lt;br&gt;&lt;br&gt;I have read that in some cases it is possible to reconvert a Kanji string after mistakenly converting an undesired Kanji string with the same pronunciation.  I've never been able to do that myself.  As a guess, it might be something particular to Word 2002 and later plus IME 2002 and later, maybe or maybe not related to the &amp;quot;Natural&amp;quot; IME.  But even when I had Word 2002 temporarily installed on a machine, I wasn't able to do recoversions myself.  Also if I recall enough correctly of what I read, reconversion is no longer possible after the document is closed and reopened, because the pronunciation is not stored.&lt;br&gt;&lt;br&gt;By the way some characters have more than 10 readings.&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=332623" width="1" height="1"&gt;</description></item></channel></rss>