<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Unicode without UNICODE/_UNICODE?</title><link>http://blogs.msdn.com/b/michkap/archive/2010/07/30/10033760.aspx</link><description>A familiar question I got the other day: 
 We are considering porting a Win32 application to use Unicode for internal string handling and are trying to decide which encoding to use. We would like to use UTF-8 and wondered whether there is any way to</description><dc:language>en-US</dc:language><generator>Telligent Evolution Platform Developer Build (Build: 5.6.50428.7875)</generator><item><title>re: Unicode without UNICODE/_UNICODE?</title><link>http://blogs.msdn.com/b/michkap/archive/2010/07/30/10033760.aspx#10126352</link><pubDate>Tue, 08 Feb 2011 18:10:03 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10126352</guid><dc:creator>Yuhong Bao</dc:creator><description>&lt;p&gt;The funny thing is that UTF-8 was invented one year before NT 3.1.&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10126352" width="1" height="1"&gt;</description></item><item><title>re: Unicode without UNICODE/_UNICODE?</title><link>http://blogs.msdn.com/b/michkap/archive/2010/07/30/10033760.aspx#10077092</link><pubDate>Sun, 17 Oct 2010 22:08:43 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10077092</guid><dc:creator>Yuhong Bao</dc:creator><description>&lt;p&gt;Don&amp;#39;t forget the max two bytes per *Unicode* character converted from ACP/OEMCP assumption too (mentioned in this blog article, for example: &lt;a target="_new" rel="nofollow" href="http://blogs.msdn.com/b/michkap/archive/2005/02/08/369197.aspx"&gt;blogs.msdn.com/.../369197.aspx&lt;/a&gt;). It is why HKSCS-2004 and Shift_JIS-2004 can&amp;#39;t be the ACP (cp951 only worked because they mapped to Unicode PUA instead of the official code points beyond the BMP).&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10077092" width="1" height="1"&gt;</description></item><item><title>re: Unicode without UNICODE/_UNICODE?</title><link>http://blogs.msdn.com/b/michkap/archive/2010/07/30/10033760.aspx#10049147</link><pubDate>Thu, 12 Aug 2010 00:04:32 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10049147</guid><dc:creator>Yuhong Bao</dc:creator><description>&lt;p&gt;And BTW, some of the &amp;quot;code that assumes a maximum of two bytes per character&amp;quot; have been discussed over time in this blog. Search for &amp;quot;CharNextExA UTF-8&amp;quot; in this blog, for example.&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10049147" width="1" height="1"&gt;</description></item><item><title>re: Unicode without UNICODE/_UNICODE?</title><link>http://blogs.msdn.com/b/michkap/archive/2010/07/30/10033760.aspx#10047064</link><pubDate>Fri, 06 Aug 2010 14:39:46 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10047064</guid><dc:creator>Economista Gerardo salazar Franco</dc:creator><description>&lt;p&gt;Muy &amp;nbsp;expectacular &amp;nbsp;todo.thank you&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10047064" width="1" height="1"&gt;</description></item><item><title>re: Unicode without UNICODE/_UNICODE?</title><link>http://blogs.msdn.com/b/michkap/archive/2010/07/30/10033760.aspx#10045999</link><pubDate>Wed, 04 Aug 2010 16:01:38 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10045999</guid><dc:creator>John Cowan</dc:creator><description>&lt;p&gt;&amp;quot;Our server can handle 35 simultaneous connections fine, but thrashes when there are 70. &amp;nbsp;Can you fix that?&amp;quot;&lt;/p&gt;
&lt;p&gt;Programmer starts searching the source code for instances of &amp;quot;35&amp;quot; and changes them all to &amp;quot;70&amp;quot;.&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10045999" width="1" height="1"&gt;</description></item><item><title>re: Unicode without UNICODE/_UNICODE?</title><link>http://blogs.msdn.com/b/michkap/archive/2010/07/30/10033760.aspx#10044619</link><pubDate>Sun, 01 Aug 2010 01:43:42 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10044619</guid><dc:creator>Michael S. Kaplan</dc:creator><description>&lt;p&gt;I have blogged many times about how the process of working on the MSLU project taught me that it is *much* more complicated than that.&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10044619" width="1" height="1"&gt;</description></item><item><title>re: Unicode without UNICODE/_UNICODE?</title><link>http://blogs.msdn.com/b/michkap/archive/2010/07/30/10033760.aspx#10044617</link><pubDate>Sun, 01 Aug 2010 01:37:40 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10044617</guid><dc:creator>Dan Bishop</dc:creator><description>&lt;p&gt;But aren&amp;#39;t the &amp;quot;ANSI&amp;quot; functions just MultiByteToWideChar + call the corresponding &amp;quot;W&amp;quot; function + WideCharToMultiByte? &amp;nbsp;And since the conversion functions DO support UTF-8, it should just be a matter of searching for hard-coded 2&amp;#39;s in the &amp;quot;A&amp;quot; functions? &amp;nbsp;Or is it more subtle than that?&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10044617" width="1" height="1"&gt;</description></item><item><title>re: Unicode without UNICODE/_UNICODE?</title><link>http://blogs.msdn.com/b/michkap/archive/2010/07/30/10033760.aspx#10044614</link><pubDate>Sun, 01 Aug 2010 01:27:24 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10044614</guid><dc:creator>Michael S. Kaplan</dc:creator><description>&lt;p&gt;This is not a core Notepad scenario that a typical Windows user ever would hit....&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10044614" width="1" height="1"&gt;</description></item><item><title>re: Unicode without UNICODE/_UNICODE?</title><link>http://blogs.msdn.com/b/michkap/archive/2010/07/30/10033760.aspx#10044613</link><pubDate>Sun, 01 Aug 2010 01:20:53 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10044613</guid><dc:creator>Yuhong Bao</dc:creator><description>&lt;p&gt;&amp;quot;Better for who?&amp;quot;&lt;/p&gt;
&lt;p&gt;Better for users who have to open text files in various encodings.&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10044613" width="1" height="1"&gt;</description></item><item><title>re: Unicode without UNICODE/_UNICODE?</title><link>http://blogs.msdn.com/b/michkap/archive/2010/07/30/10033760.aspx#10044608</link><pubDate>Sun, 01 Aug 2010 00:34:02 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10044608</guid><dc:creator>Michael S. Kaplan</dc:creator><description>&lt;p&gt;The &amp;quot;ANSI&amp;quot; code path. Since no Windows &amp;quot;ANSI&amp;quot; code page is ever more than two bytes per &amp;quot;character&amp;quot;, the assumption is valid for all intended cases. Of course this design limits options for making use of some code pages or UTF-8 as an ACP, it is as true design limitation, which was the point of this blog -- that there is no option to magically support a UTF-8 ACP.&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10044608" width="1" height="1"&gt;</description></item></channel></rss>