<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Smart quotes: The hidden scourge of text meant for computer consumption</title><link>http://blogs.msdn.com/b/oldnewthing/archive/2009/02/25/9443404.aspx</link><description>They look pretty but act differently.</description><dc:language>en-US</dc:language><generator>Telligent Evolution Platform Developer Build (Build: 5.6.50428.7875)</generator><item><title>re: Smart quotes: The hidden scourge of text meant for computer consumption</title><link>http://blogs.msdn.com/b/oldnewthing/archive/2009/02/25/9443404.aspx#9457685</link><pubDate>Wed, 04 Mar 2009 01:57:57 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9457685</guid><dc:creator>DWalker</dc:creator><description>&lt;p&gt;@ulric and others: &amp;nbsp;I was pasting a short piece of SQL code into an e-mail message using Outlook 2007, today, and since my message format is HTML by default, and uses Word as the editor, I had a heck of a time getting the quotes in the SQL to stay as straight quotes. &lt;/p&gt;
&lt;p&gt;I could probably change the default for Word when it's used as the e-mail editor and not when I'm creating a regular Word document, but I don't happen to know how to change one setting without the other.&lt;/p&gt;
&lt;p&gt;I occasionally need to underline something or make it RED and BOLD in an e-mail or risk having it overlooked; otherwise, I would use plain text e-mails.&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9457685" width="1" height="1"&gt;</description></item><item><title>re: Smart quotes: The hidden scourge of text meant for computer consumption</title><link>http://blogs.msdn.com/b/oldnewthing/archive/2009/02/25/9443404.aspx#9456131</link><pubDate>Tue, 03 Mar 2009 10:41:49 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9456131</guid><dc:creator>SuperKoko</dc:creator><description>&lt;p&gt;&amp;gt; There's no reason that you couldn't do the same to escape curly quotes.&lt;/p&gt;
&lt;p&gt;Actually, it's easier to do it for curly quotes than for normal double quotes, because existing programs are unaware of curly quotes, and so, accept them as normal command line input.&lt;/p&gt;
&lt;p&gt;Usually, double quotes can be used to include space characters in a command line parameter.&lt;/p&gt;
&lt;p&gt;some_command &amp;quot;hello world&amp;quot;&lt;/p&gt;
&lt;p&gt;However, I'm aware of at least TWO 32 bits hexadecimal editors not supporting this feature, making it IMPOSSIBLE to open files with space characters in their path, through the command line.&lt;/p&gt;
&lt;p&gt;On the topic of smart quotes:&lt;/p&gt;
&lt;p&gt;They're bad on the Web, because of shared responsibilities involving buggy Web browsers, Web servers and Web scripts.&lt;/p&gt;
&lt;p&gt;Typically:&lt;/p&gt;
&lt;p&gt;A normal person copy text in Word and paste it in a text form on a Web page, including smart quotes.&lt;/p&gt;
&lt;p&gt;The page is encoded in ISO-8859-1, and so, the Web browser figures that it must send a POST request with charset=ISO-8859-1.&lt;/p&gt;
&lt;p&gt;Since the buggy browser thinks that ISO-8859-1 = Windows-1252, it sends code points used for control characters in ISO-8859-1 (between 0x80 and 0xA0).&lt;/p&gt;
&lt;p&gt;The buggy Web server doesn't validate input and store this buggy data in its data base, believing it's valid ISO-8859-1.&lt;/p&gt;
&lt;p&gt;Then, if the POST data appears on a Web page (e.g. a blog comment), it serves the page as ISO-8859-1 or UTF-8. At best, with ISO-8859-1, it serves Windows-1252 and claims it to be ISO-8859-1, which is ok with buggy Web browsers believing that ISO-8859-1 = Windows-1252. For UTF-8, it might mess up the thing, either deleting the characters, or not translating them (sending sort of Windows-UTF-8), or, if we're lucky, properly translating them to UTF-8.&lt;/p&gt;
&lt;p&gt;Things are so messy that, HTML5 actually REQUIRES Web browsers to assume that ISO-8859-1 = Windows-1252.&lt;/p&gt;
&lt;p&gt;&amp;lt;&lt;a rel="nofollow" target="_new" href="http://www.whatwg.org/specs/web-apps/current-work/#character-encodings-0&amp;gt;"&gt;http://www.whatwg.org/specs/web-apps/current-work/#character-encodings-0&amp;gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Whether the WHATWG can, cannot, should or shouldn't require HTML clients to violate HTTP and ISO-8859 standards is another topic.&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9456131" width="1" height="1"&gt;</description></item><item><title>re: Smart quotes: The hidden scourge of text meant for computer consumption</title><link>http://blogs.msdn.com/b/oldnewthing/archive/2009/02/25/9443404.aspx#9455155</link><pubDate>Mon, 02 Mar 2009 23:54:40 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9455155</guid><dc:creator>Unicode General Punctuation</dc:creator><description>&lt;p&gt;Nevermind, maybe. I think I figured it out.&lt;/p&gt;
&lt;p&gt;By &amp;quot;extended ASCII range&amp;quot;, he's referring to the upper half of a codepage like 1252, and stating that it is &amp;quot;completely incompatible with UTF-8&amp;quot; because you have to go through the effort of actually /mapping/ the codepoints.&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9455155" width="1" height="1"&gt;</description></item><item><title>re: Smart quotes: The hidden scourge of text meant for computer consumption</title><link>http://blogs.msdn.com/b/oldnewthing/archive/2009/02/25/9443404.aspx#9454769</link><pubDate>Mon, 02 Mar 2009 20:04:28 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9454769</guid><dc:creator>Unicode General Punctuation</dc:creator><description>&lt;p&gt;Not going to discuss the pros and cons of smart-quotes, but I am confused by Dean's comment that they use &amp;quot;an extended ASCII range that makes it completely incompatible with UTF-8.&amp;quot;&lt;/p&gt;
&lt;p&gt;Unless I'm mistaken, we're talking about characters in the U+2000 range, but UTF-8 can easily encode most of U+0000 to U+10FFFF, even restricted as it is by RFC 3629.&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9454769" width="1" height="1"&gt;</description></item><item><title>re: Smart quotes: The hidden scourge of text meant for computer consumption</title><link>http://blogs.msdn.com/b/oldnewthing/archive/2009/02/25/9443404.aspx#9453013</link><pubDate>Sun, 01 Mar 2009 23:56:59 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9453013</guid><dc:creator>Yuhong Bao</dc:creator><description>&lt;p&gt;Daniel Earwicker: In fact Raymond devoted an entire article to exactly this:&lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_new" href="http://blogs.msdn.com/oldnewthing/archive/2008/08/19/8877486.aspx"&gt;http://blogs.msdn.com/oldnewthing/archive/2008/08/19/8877486.aspx&lt;/a&gt;&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9453013" width="1" height="1"&gt;</description></item><item><title>re: Smart quotes: The hidden scourge of text meant for computer consumption</title><link>http://blogs.msdn.com/b/oldnewthing/archive/2009/02/25/9443404.aspx#9450601</link><pubDate>Sat, 28 Feb 2009 12:48:24 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9450601</guid><dc:creator>Bulletmagnet</dc:creator><description>&lt;p&gt;There is a solution: use the demoronizer!&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9450601" width="1" height="1"&gt;</description></item><item><title>re: Smart quotes: The hidden scourge of text meant for computer consumption</title><link>http://blogs.msdn.com/b/oldnewthing/archive/2009/02/25/9443404.aspx#9449238</link><pubDate>Fri, 27 Feb 2009 20:41:21 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9449238</guid><dc:creator>Stephen Jones</dc:creator><description>&lt;p&gt;I once got send code for a SQL select. Very simple because I'm not a programmer.&lt;/p&gt;
&lt;p&gt;It was only finally when I sent the .mdb with the data stripped out that we realized why it worked for him but not for me. Yup, he had Word as his email editor.&lt;/p&gt;
&lt;p&gt;I've lost count of the times I've read about people complaining some forum software doesn't show quotes. I tell them to cut and paste into the text box from Notebook not Word, but they still say it's the newspaper/forum software that is wrong.&lt;/p&gt;
&lt;p&gt;(Another common complaint is how the discussion forum software has a lousy American spell check. The forum software doesn't have a spell check at all of course, it's the default spell check that comes with Firefox (which incidentally has stopped working for the last two or three updates).&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9449238" width="1" height="1"&gt;</description></item><item><title>re: Smart quotes: The hidden scourge of text meant for computer consumption</title><link>http://blogs.msdn.com/b/oldnewthing/archive/2009/02/25/9443404.aspx#9448970</link><pubDate>Fri, 27 Feb 2009 18:40:43 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9448970</guid><dc:creator>Doug Kavendek</dc:creator><description>&lt;p&gt;At least it's easy enough to keep yourself from making that mistake. &amp;nbsp;A while back I made a really simple program that just reloads whatever is in the clipboard as plaintext, and it corrects for all those curved quotes, em-dashes, etc., as well as stripping out all the formatting nonsense. &amp;nbsp;Bind it to a global key, like Ctrl-Shift-V and just do that before pasting. &amp;nbsp;I use it all the time now; way easier than looking for paste-special menu items or remembering how each application deals with rich text.&lt;/p&gt;
&lt;p&gt;If anyone's interested:&lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_new" href="http://www.kavendek.com/stuff/SimplifyClipboard.cpp"&gt;http://www.kavendek.com/stuff/SimplifyClipboard.cpp&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_new" href="http://www.kavendek.com/stuff/SimplifyClipboard.exe"&gt;http://www.kavendek.com/stuff/SimplifyClipboard.exe&lt;/a&gt;&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9448970" width="1" height="1"&gt;</description></item><item><title>re: Smart quotes: The hidden scourge of text meant for computer consumption</title><link>http://blogs.msdn.com/b/oldnewthing/archive/2009/02/25/9443404.aspx#9447766</link><pubDate>Fri, 27 Feb 2009 03:50:14 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9447766</guid><dc:creator>Technage</dc:creator><description>&lt;p&gt;It's like seeing the insides of a television and being amazed at the little intricacies especially the ones that shock you when you touch them.&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9447766" width="1" height="1"&gt;</description></item><item><title>re: Smart quotes: The hidden scourge of text meant for computer consumption</title><link>http://blogs.msdn.com/b/oldnewthing/archive/2009/02/25/9443404.aspx#9446673</link><pubDate>Thu, 26 Feb 2009 18:01:23 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9446673</guid><dc:creator>Daniel Earwicker</dc:creator><description>&lt;p&gt;Somewhat similar irritation - people who refuse to email me a plain bitmap. They insist on pasting it into a Word document and sending that instead. This is of course extremely helpful because otherwise they wouldn't be able to introduce me to their collection of macro viruses.&lt;/p&gt;
&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9446673" width="1" height="1"&gt;</description></item></channel></rss>