<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>And if your language starts playing a different TUNE</title><link>http://blogs.msdn.com/b/michkap/archive/2006/08/31/733302.aspx</link><description>Warning to readers : this post is completely and totally my own opinions based on my efforts to assist with Tamil's representation in Unicode, and truly have nothing to do with Microsoft's opinions on the matter (whatever they are). If you quote anything</description><dc:language>en-US</dc:language><generator>Telligent Evolution Platform Developer Build (Build: 5.6.50428.7875)</generator><item><title>On Thokks who don't give a Frigg, under the mistletoe</title><link>http://blogs.msdn.com/b/michkap/archive/2006/08/31/733302.aspx#961824</link><pubDate>Sat, 04 Nov 2006 23:55:44 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:961824</guid><dc:creator>Sorting It All Out</dc:creator><description>&lt;p&gt;I was thinking about Balder yesterday. Balder is a fascinating character in Norse mythology, and a good&lt;/p&gt;
&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=961824" width="1" height="1"&gt;</description></item><item><title>re: And if your language starts playing a different TUNE</title><link>http://blogs.msdn.com/b/michkap/archive/2006/08/31/733302.aspx#774764</link><pubDate>Thu, 28 Sep 2006 05:24:25 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:774764</guid><dc:creator>Michael S. Kaplan</dc:creator><description>Tamil improving itself has nothing to do with its encoding, because language is not just encoding. &lt;br&gt;&lt;br&gt;But Unicode is on record as rejecting these schemes, so eventually the illogic of waste is the factor....&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=774764" width="1" height="1"&gt;</description></item><item><title>re: And if your language starts playing a different TUNE</title><link>http://blogs.msdn.com/b/michkap/archive/2006/08/31/733302.aspx#774715</link><pubDate>Thu, 28 Sep 2006 04:32:40 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:774715</guid><dc:creator>CAPital Z</dc:creator><description>East Asian Language encoding dilema and Balinese are not pre-dated problems.&lt;br&gt;&lt;br&gt;Anyhow, yes whatever written in current unicode will be unreadable. &amp;nbsp;But that's the evolution right?. &amp;nbsp;Latin encoding had so many perfections &amp;nbsp;during the course of Computer. &amp;nbsp;Tamil is just a baby in computing. &amp;nbsp;So now the Tamil doesn't get the chance to improve just because, what is already there has to be THE ONE.&lt;br&gt;&lt;br&gt;So Tamil can never improve itself in the future. &amp;nbsp;Because Unicode Consortium will never accept any modification to what is already there!&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=774715" width="1" height="1"&gt;</description></item><item><title>re: And if your language starts playing a different TUNE</title><link>http://blogs.msdn.com/b/michkap/archive/2006/08/31/733302.aspx#769305</link><pubDate>Sun, 24 Sep 2006 17:36:51 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:769305</guid><dc:creator>Michael S. Kaplan</dc:creator><description>CAPital, this is hardly breaking the rules of Unicode. &lt;BR&gt;&lt;BR&gt;In some cases, there were legacy standards that pre-dated Unicode which had to be represented. And other cases where you see rule breaking, the actual proposals give the justifications (as do the block descriptions in the Unicode book, in many cases). &lt;BR&gt;&lt;BR&gt;Did you look at the links I put in? They explain many of the reasons why strategies like TUNE are simply too late and do involve a re-encoding of a script already encoded, and would set bzck Tamil computing by 5-10 years or more. &lt;BR&gt;&lt;BR&gt;Even just looking at your blog (and the many other Tamil blogs out there), it would invalidate all of this existing data!&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=769305" width="1" height="1"&gt;</description></item><item><title>re: And if your language starts playing a different TUNE</title><link>http://blogs.msdn.com/b/michkap/archive/2006/08/31/733302.aspx#769295</link><pubDate>Sun, 24 Sep 2006 17:15:07 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:769295</guid><dc:creator>CAPital Z</dc:creator><description>In the sets of Latin, there are encoded characters as &amp;#224;, &amp;#225;, &amp;#226;, &amp;#227;, &amp;#229; [for the European languages]. &amp;nbsp;It is encoded as a single character. &amp;nbsp;Meanwhile, the same glyphs are present seperately, like a, ̀, ́, ˆ, ˜, ˚. &lt;br&gt;&lt;br&gt;So only for Tamil like scripts [South Asian languages] similar encoding is denied.&lt;br&gt;&lt;br&gt;East Asian languages did encoded their all characters. &amp;nbsp;And you know the famous Chinese governments stand about all chinese character encoding.&lt;br&gt;&lt;br&gt;Most recently, the new language to be included in Unicode, Balinese break the rules of Unicode. &amp;nbsp;As you said, it did not only encode the &amp;quot;basic abugida&amp;quot; but did others as well. &amp;nbsp;You may have already seen it in the new Unicode Charts.&lt;br&gt;&lt;br&gt;Even in Tamil, your point is right that basic abugida is enough to display the most characters. &amp;nbsp;But the &amp;quot;ku கு, kuu கூ&amp;quot; &amp;nbsp;[and similar for all other characters] are almost totally different than the basic abugidas.&lt;br&gt;&lt;br&gt;I'm not saying TUNE is the best thing, but Tamil do lack the efficiency of what other similar anguages have in Unicode.&lt;br&gt;&lt;br&gt;&lt;br&gt;______&lt;br&gt;CAPital&lt;br&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=769295" width="1" height="1"&gt;</description></item><item><title>re: And if your language starts playing a different TUNE</title><link>http://blogs.msdn.com/b/michkap/archive/2006/08/31/733302.aspx#768846</link><pubDate>Sun, 24 Sep 2006 09:21:36 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:768846</guid><dc:creator>Michael S. Kaplan</dc:creator><description>Actually, it did not. It encoded an abugida. &lt;br&gt;&lt;br&gt;The rules that you envisage are not actually rules of Unicode, which may be the problem here? You are expecting promises to be kept that were never made. :-(&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=768846" width="1" height="1"&gt;</description></item><item><title>re: And if your language starts playing a different TUNE</title><link>http://blogs.msdn.com/b/michkap/archive/2006/08/31/733302.aspx#768837</link><pubDate>Sun, 24 Sep 2006 09:13:58 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:768837</guid><dc:creator>CAPital</dc:creator><description>I am not trying to say that only the ka + dot should have been encoded and not the other. &amp;nbsp;Then it would be almost impossible to display the other.&lt;br&gt;&lt;br&gt;All I'm saying is Unicode already broke its rules. &amp;nbsp;So to other European and East Asian langues.&lt;br&gt;&lt;br&gt;IF it did NOT break its rules for ANY language, then I wouldn't even say a word.&lt;br&gt;&lt;br&gt;So it looks like whoever had the power, broke the rules according to their &amp;quot;standardization&amp;quot; policies.&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=768837" width="1" height="1"&gt;</description></item><item><title>re: And if your language starts playing a different TUNE</title><link>http://blogs.msdn.com/b/michkap/archive/2006/08/31/733302.aspx#768824</link><pubDate>Sun, 24 Sep 2006 09:01:01 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:768824</guid><dc:creator>Michael S. Kaplan</dc:creator><description>See also &lt;A href="http://blogs.msdn.com/michkap/archive/2005/06/13/428755.aspx"&gt;this post&lt;/A&gt; from over a year ago, which makes some additional related points, ones that I wish those who want to set Tamil implementations back by years would pay more attention to....&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=768824" width="1" height="1"&gt;</description></item><item><title>re: And if your language starts playing a different TUNE</title><link>http://blogs.msdn.com/b/michkap/archive/2006/08/31/733302.aspx#768819</link><pubDate>Sun, 24 Sep 2006 08:55:07 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:768819</guid><dc:creator>Michael S. Kaplan</dc:creator><description>Actually, no -- that is not how abugidas work. But see &lt;A href="http://blogs.msdn.com/michkap/archive/2006/09/16/758310.aspx"&gt;this post&lt;/A&gt; and &lt;A href="http://blogs.msdn.com/michkap/archive/2006/09/20/763705.aspx"&gt;this one&lt;/A&gt; for more info on the approach that was taken in the encoding, and why it is okay for you to not agree with the approach and still be able to work in Unicode....&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=768819" width="1" height="1"&gt;</description></item><item><title>re: And if your language starts playing a different TUNE</title><link>http://blogs.msdn.com/b/michkap/archive/2006/08/31/733302.aspx#768816</link><pubDate>Sun, 24 Sep 2006 08:46:58 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:768816</guid><dc:creator>CAPital</dc:creator><description>When you say:&lt;br&gt;&lt;br&gt;///This post brought to you by க் (U+0b95 U+0bcd, a.k.a. TAMIL LETTER KA + TAMIL SIGN VIRAMA, a.k.a. TAMIL KA puLLi, a.k.a. TAMIL LETTER K)///&lt;br&gt;&lt;br&gt;I right away notice that there is NO Tamil sign called VIRAMA!&lt;br&gt;&lt;br&gt;In fact Tamil language doesn't add the dot to form the 'ka' sound. &amp;nbsp;the form you have presented is the original form. &amp;nbsp;the letter without the dot is NOT the basic letter. &amp;nbsp;So you can see Unicode is already broken its rules of ONLY encoding the basic characters!&lt;br&gt;&lt;br&gt;______&lt;br&gt;CAPital&lt;br&gt;&lt;br&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=768816" width="1" height="1"&gt;</description></item></channel></rss>