Welcome to MSDN Blogs Sign in | Join | Help

May 2005 - Posts

Something special for IT Pros (and maybe for some developers!) I will be doing miniature versions of talks I have done before on keyboards -- how they work, how to develop them, how to deploy them. This will happen in Dr. International's clinic ( located Read More...
Raymond Chen did a post yesterday entitled You can't simulate keyboard input with PostMessage . He did touch on the complicated language issues, saying: First of all, keyboard input is a more complicated matter than those who imprinted on the English Read More...
Scott Hanselman posted about GrokTalk , and explained what it is all about: The deal is this: We've all sat through some pretty lousy technical sessions at conferences. For the most part, sessions at TechEd are filled with good information, but every Read More...
Or should I say, Háts off to David Beaver ? :-) Over on the Language Log , David noticed some interesting issues with Google search in his post entitled PASS THE HÁT . The basic issue comes up with Google's HYPHEN-MINUS operator. According to their documentation Read More...
In years past, I had seen shows at the Front Row Theatre in Cleveland (it is no longer around, the Rock and Roll Hall of Fame is on the site where it used to be). I was struck at the time by the way that the performer would be facing different parts of Read More...
A little over a month ago, I was talking about how SetLocaleInfo really stinks . Buried in my tirade was the germ of a question that people have asked in the past -- why is there no setting analogue to the LOCALE_RETURN_NUMBER flag used by functions like Read More...
What was I saying? I can't remember. Maybe I will look up on Adi Oltean's blog how much memory my version of Windows can have. Good thing I remembered where my RSS reader was! :-) Read More...
(a tiny little bit if technical content in this post, but its in the second half and may not be worth the trouble sifting through this stream of consciousness blathering!) About a month ago in my post Getting Enough Exercise? I talked about my exciting Read More...
Yesterday, Manip post a rant about the fact that the three settings in the title of my post today are not integrated well (in response to a post by Larry Osterman , who gets good comments even to his non-posts!). The rant: I'm using this misc story to Read More...
Yesterday, Jeff Parker post a comment to a non-post about time zones and backcompat from Larry Osterman ( Larrys blog is so cool that he has ideas pop up even when he only posts about the fact that he could do put up anything substantive since he was Read More...
So I am setting up various Virtual PC images for SQL Server Shiloh and Yukon demos for my upcoming TechEd presentations. I was installing on different platforms, also -- because hard drive space is cheap and variety is the spice of life. :-) Suddenly Read More...
Ram Mallika mentioned in the suggestion box: Can you please talk about the issues in a out-of-the-box support for complex scripts and languages, instead of making it as a supplemental language installation? Also, What can we expect on this issue on the Read More...
Back in the beginning of April, I explained about the one scenario where casing does not need to roundtrip in .NET -- the Greek final sigma. Anyway, the day before yesterday I got an email from someone who had been reading my blog and was looking at all Read More...
Ok, by now you know the drill -- I am comparing the various ways of expressing text in Unicode. In prior posts I have talked about the issues related to size and to speed . However, both of those posts were working in a theoretical vacuum that was independent Read More...
No one ever accused the Universal Character Set of being simple. Just short of 100,000 characters, many different scripts and languages, all sorts of complex scripts. Unicode is downright hard, sometimes. If you asked me, that is the biggest reason for Read More...
Here it is -- I did mention that We're hiring . :-) That pointer is to a development postion. It turns out we also need a technical feature PM (program manager) too! The listing is right here , and it has a lot of good detail about the kind of person Read More...
" I have a sneaking suspicion that if there were a way to make movies without actors, George [Lucas] would do it. " -- Most often attributed Mark Hamill I saw Star Wars Episode III (Revenge of the Sith) on Saturday, and like just about everybody else Read More...
I have talked about Chris Walker before. He is one of guys behind Notepad.exe for several versions, watching this uber-layer around a Win32 EDIT control be morphed into what some consider to be the most-used plain text editor on the planet. Often when Read More...
How does the old saying go? For every expert there is an equal and opposite expert . Never did that anonymous quote seem more accurate to me than my first UTC (Unicode Technical Committee) meeting. Peter Vogel and I were having a conversation about it Read More...
In the first post in this series, I talked about size considerations , but even then I hinted that there is more to the decision than just size. I mean, I am not saying that size does not matter, but there are other facts that can matter more.... In this Read More...
If you look at version 1.0 and 1.1 of the .NET Framework, they both have a Guid structure and a Guid.Equals() method which takes an object o and returns " true if o is a Guid that has the same value as this instance; otherwise, false ." Since every type Read More...
Ivan Leo posted the following note in the suggestion box: Why isn't RtlCreateUnicodeString() documented? I mean, it's a damn simple interface that has no reason to change, and is useful is one needs lots of UNICODE_STRINGs I cannot speak for the owners Read More...
Now this is a question that I would make into an interview question, if only there were some way to do all the setup work in time. Unfortunately, unless the candidate is very knowledgable about internationalization on the way in, there is no way to get Read More...
It all started so innocently, with John Jenkins of Apple asking a simple question on the Unicode List.... Is it kosher for U+0478 CYRILLIC CAPITAL LETTER UK (Ѹ) and U+0479 CYRILLIC SMALL LETTER UK (ѹ) to have basically the same shapes as U+0222 LATIN Read More...
The Globalization Services team inside of GIFT is expanding again, and we are looking for a Software Development Enginner (developer)! As a team, we are responsible for many of the underlying technologies and features that I have been blogging about every Read More...
The decision of which UTF (Unicode Transformation Format) to use might be driven by many different factors, so I will be doing a few different posts on this topic, looking at the factors individually. This first post will talk about the size considerations. Read More...
Almost a week ago, I talked about What it means to be in the default install . And about a week ago, Richie Rump talked about Windows Mobile 5.0 Unveiled . And then also a week ago, I got an email from a developer who asked the following. I’m figuring Read More...
Not too long ago, we picked up a new tester (new to the area, not to testing in general) to own collation for Windows and the .NET Framework. Now I will readily admit that is usually news that I take as well as getting a new hair stylist, which is to Read More...
People ask all the time how they can type in random Unicode data. Some people point out the vast array of supported Keyboard Layouts on Windows . Others point out how you can create your own keyboards with MSKLC . Still others talk about fancy things Read More...
A few days ago, I made some SQL server developers nervous when I explained that Not all SQL Server collations are created equal . But there is one point I must emphasize here. There is no data loss in moving between collations, as long as you use Unicode Read More...
You may have noticed the new graphic on the side there. As I mentioned previously , I will be in Amsterdam for TechEd Europe 2005 . The talks will be mostly the same as the ones in Orlando, but with two important differences -- (1) I will have all of Read More...
Someone just pointed this one out to me today: Probably more than you want to know about keyboarding in Windows Forms…. It is one of many posts on jfo's coding , a blog that I will definitely be adding to my list. Sometimes the best way to get smarter Read More...
One small change: DAT290 Designing Multilingual Databases Using SQL Server 2005 Old time: Wednesday, June 8 10:15 AM - 11:30 AM New time: Monday, June 6 3:15 PM - 4:30 PM I have been assured that this is for very important reasons to do with interesting Read More...
I think I may have said in the past that the SUBLANGID is an odd beast. They are defined in the winnt.h header file in the SDK (and ntdef.h in the DDK). Here is an excerpt of the ones there (which mostly have the value of 1 or 2): #define SUBLANG_DEFAULT Read More...
"One of my early managers once shared with me the following piece of advice: if you don't get your hand slapped at least twice a year, you aren't pushing the boundaries hard enough." -- Sam Ruby , via Robert Scoble I'll have to keep this one in mind. Read More...
Back in the end of March, when I posted Font substitution and linking #1 , that ordinal at the end certainly implied that there were going to be a few more posts in a series. Sorry it took so long to get back to the topic, I have been busy. :-) Now I Read More...
(Nothing technical in this post, sorry!) It was really sunny yesterday. This works well since I have a convertible and got to run with the top down. I do need to point out that only an optimist could ever live in Seattle and own a convertible. It's not Read More...
Not all of the collations in SQL Server 2005 (a.k.a. Yukon) and earlier versions have coverage over the same full sets of Unicode code points. (By the way, this topic will be covered during my TechEd sessions, particularly DAT290: Databases for the World: Read More...
Uwe Keim asked (in the suggestion box ) several questions about encoding support and strings in the .NET Framework: Michael, I need your knowledge about how all this encoding stuff on strings work on .NET (I searched hours without finding usful things). Read More...
A few years ago, the Microsoft Office XP marketing team had a big advertising campaign that they hired Gilbert Gottfried for. The news coverage (in a rare burst of accuracy and journalistic integrity that I wish I had seen around here lately) summarized Read More...
The Microsoft Layer for Unicode is usually just a very simple wrapper over the non-Unicode APIs on Window 95, 98, and Me, that uses the default system code page to do the conversions. There are of course exceptions to this; there are times when Win9x Read More...
Meikel Weber posted (in the suggestion box): Hi, Your article today reminds me of an old question. Why is there no "GetDateFormat" API that takes a CALID as parameter? Thank you for this really cool blog!!! Well, thanks for the kind words. :-) Some questions Read More...
A few days ago, CornedBee left a comment to my post Raymond's Chinese dictionary : > by some reports over 60,000 of the over 70,000 ideographs in Unicode/GB18030 Suddenly the 65000 characters in a Windows WCHAR or a VC++ wchar_t seem so little ... Read More...
Larry Osterman posted earlier today about Turning the blog around - End of Life issues and I thought about the issue. I think I have talked about the backcompat issues enough in the past that it is pretty clear where I stand on the issue of when we say Read More...
The original version of the Microsoft Layer for Unicode's DLL only ran on Win9x. Ever. Now it had the loader , mind you, which would properly call the operating system or MSLU depending on where you were running, but there was no code in place to handle Read More...
Rasqual asked, in the suggestion box: Windows associates the idea of Unicode with 'Wide char', that is a 2-byte long character (currently). A comment on Raymond Chen's blog stated that Windows 2000 uses the UCS-2 representation of Unicode and Windows Read More...
A few minutes ago, I posted about Achieving case insensitivity , to deal with how best to make things case sensitive in an expression in SQL Server. Now looking beyond the issue with that one expression, and to the wider issue of collations and what to Read More...
Julia Lerman said just a few hours ago that she is a SQL Server luddite . I think that is not really true. I've read her blog, and I know people who have heard her speak in public. I plan to see her speak soon, in Orlando, because of that! But when I Read More...
That's right, the well known Dr. International now has a blog, subtitled Thoughts, tips, questions and suggestions – all things international . Definitely a good add to the list of blogs I read ! The Doctor starts of strong with a post entitled Hello Read More...
Rico Mariani decided to try a managed version of the dictionary I talked about earlier today. According to Rico, the question of the hour is: How does Raymond's program fare vs. the equivalent managed code below? But as far as I am concerned, the question Read More...
Let's create the following filenames: αβγδεζηθ.txt АБВГДЕЖЗ.txt אבגדהוזח.txt กขฃคฅฆจ.txt (they can be empty or have data in them) And then try to zip them up with your favorite program (I'll use WinZip, you can use anything you like here). The zip will Read More...
In my earlier post about Raymond's Chinese dictionary , I mentioned how in some cases code pages can be treated not so much an encoding technique as a 'repetoire fence'. The idea is not all that novel. In this age where most major software products are Read More...
For the 'I laugh so I will not cry' file: Over in The Language Log , Bill Poser reports in LOST IN TRANSLATION about an interesting step that the government of Gaeltacht in Ireland (a region where Irish is still common) has taken to help assure the continuing Read More...
Raymond Chen is going to be developing a Chinese dictionary over the next while. This is a really cool project that I am definitely keeping an eye on, for a lot of reasons, one of which is related to my prior blather about how IMEs have it easy . Dictionaries Read More...
In Slate , Sydney Spiesel reports in a piece entitled Blame Canada about the decision in Canada to ban Adderall XR, and what. Now I am not a huge fan of what many intelligent people consider the over-mediation and misdiagnosis of children with Attention Read More...
(No technical content in this post) I was asked a question in email about why research on MS seemed to be so difficult to get done. It is an interesting question, so I thought I would post an excerpt from the FAQ entitled Frequently Asked Questions (FAQ) Read More...
Patrick Andries started the thread off back in July of 2002: 06A4 ARABIC LETTER VEH * Arabic for foreign words Is it true that this letter is sometimes used in Algeria to represent G (as in guide, gone) in Proper Nouns (of Berber origin). Source : Yannis