Is it Macau or is it Macao?

Sorting it all Out
Michael Kaplan's random stuff of dubious value
Be sure to read the disclaimer here first!


People get confused sometimes about the name of this place, but whether you call it:

  • Haojing'ao (壕鏡澳 "Trench Inlet")
  • Liandao (蓮島 "Lotus Island")
  • Xiangshan'ao (香山澳 "Fragrant Mountain Inlet")

According to some sites (like the U.S. Library of Congress), its official name used to be Macao when it was a Portuguese territory, but after the reversion to China the official name became Macau (China : Special Administrative Region), and in conversations about it in its former territorial status the name Macau is now preferred in most contexts.

But other sources (such as the Macao SAR Government Portal) seem to prefer Macao and Macao Special Administrative Region.

At the risk of disrespecting Congress I am going to side with the actual Macao government site, and not just because they have updated more recently. :-)

Last year, while looking into what we had for East Asian collation support in Windows, I noticed a curious fact. The default system code page for the Macao locale is 950, which is the Traditional Chinese code page, yet the collation choices were:

  • 0x00001404 uses the PRC Pinyin-based pronunciation sorting tables
    As if it were using MAKELCID(MAKELANGID(LANG_CHINESE, SUBLANG_CHINESE_MACAU), SORT_CHINESE_PRCP)
  • 0x00021404 uses the PRC-based stroke order
    As if it were using MAKELCID(MAKELANGID(LANG_CHINESE, SUBLANG_CHINESE_MACAU), SORT_CHINESE_PRC)
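
As a rough sketch of how those two values decompose: the Win32 MAKELANGID and MAKELCID macros just pack bit fields, with the sublanguage sitting above the low 10 bits of the language ID and the sort ID sitting above the low 16 bits of the LCID. The Python below mirrors that arithmetic. The function names are made up for illustration, and the constant values are the winnt.h ones as I understand them, so treat it as a sketch rather than a substitute for the real headers.

```python
# Constant values as defined in winnt.h (assumed here, not read from the header).
LANG_CHINESE = 0x04
SUBLANG_CHINESE_MACAU = 0x05
SORT_CHINESE_PRCP = 0x0  # PRC Chinese phonetic (Pinyin) order
SORT_CHINESE_PRC = 0x2   # PRC Chinese stroke count order


def make_langid(primary, sub):
    """Mirror of MAKELANGID: sublanguage in the bits above the low 10."""
    return (sub << 10) | primary


def make_lcid(langid, sort_id):
    """Mirror of MAKELCID: sort ID in the bits above the low 16."""
    return (sort_id << 16) | langid


def sort_id_from_lcid(lcid):
    """Mirror of SORTIDFROMLCID: pull the 4-bit sort ID back out."""
    return (lcid >> 16) & 0xF


macau = make_langid(LANG_CHINESE, SUBLANG_CHINESE_MACAU)
print(f"0x{make_lcid(macau, SORT_CHINESE_PRCP):08x}")
print(f"0x{make_lcid(macau, SORT_CHINESE_PRC):08x}")
```

The two printed values should match the two LCIDs in the list above, which is all the "as if it were using" lines are saying.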

I figured this was a bug, but if it were, it seemed odd that no one had ever reported it. Hmmm, strange....

So, although I was confused, I decided to see if I could find out what was going on. I talked to several native speakers and people now living in the US who were either from Macao or had visited there for an extended period. I learned that even though the Traditional forms are still used more often than the Simplified ones, the Simplified forms have seen more usage in recent years.

More importantly, however, people in Macao often do use a pronunciation sort, and it is Pinyin-esque (English is a productive language, so I can make that word up!). Many people in Macao learn the Bopomofo pronunciations, but they do not use them in their daily lives. Thus the PRC Simplified Pinyin tables may not be perfect, but they will be closer to the order that a native speaker would expect than the Bopomofo order, even if not all of the ideographs are on the list.
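
To make the idea concrete, here is a toy sketch of a pronunciation sort that still behaves sensibly for ideographs missing from the table. The table is a hypothetical, hand-made one with toneless Pinyin for a handful of characters, not the actual Windows or PRC data, and pinyin_key is a made-up helper name. A real table would also have to deal with tones and with characters that have more than one reading.

```python
# Hypothetical, hand-made table of toneless Pinyin readings (illustration only).
PINYIN = {
    "澳": "ao",
    "島": "dao",
    "蓮": "lian",
    "山": "shan",
    "香": "xiang",
    "序": "xu",
}


def pinyin_key(text):
    """Build a sort key: known characters order by reading, unknown ones
    sort after every known character, by code point."""
    return [(0, PINYIN[ch]) if ch in PINYIN else (1, ch) for ch in text]


words = ["序", "香", "澳", "鏡", "山"]  # "鏡" is deliberately not in the table
print(sorted(words, key=pinyin_key))
print(sorted(words))  # plain code point order, for comparison
```

The first list comes out in reading order (ao, shan, xiang, xu) with the unlisted character trailing at the end, while the second is plain code point order, which has nothing to do with pronunciation.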

I was unable to find any data on a Macao-specific Pinyin ordering, but I do know that the PRC government has expressed interest in getting Pinyin pronunciations for many more ideographs, including Traditional forms. Perhaps one of the motivations behind such a move is indeed to help support people in Hong Kong and Macao! Certainly it is the case that a Pinyin-based IME is more useful to most native speakers in Macao than a Bopomofo one would be.

Maybe something Cantonese would be most useful, but that is a story for another day....

 

This post brought to you by "序" (U+5E8F, a.k.a. an ideograph meaning sequence or series)

Blog - Comment List
  • You've probably discussed this before, but doesn't it make you wonder if text sorting is not really something that needs to be sensitive to regional norms everywhere? Or maybe what I am asking is whether you've discussed the linguistic effect that Microsoft has -- having that one word where people can't find it can lead to far-reaching consequences. Then again, important software should not rely on an OS sort. As computers spread to the remote corners of every country, they naturally impose their own effect on the way business is done there. In places where people are familiar with multiple languages and competing ways of organizing text, the default settings on the most common computers and software will become known as the "computer" way of doing things. Then there are aspects of sorting like whether or not to use the word "The" at the beginning of a book title, which generally requires an awareness of the issue at the time of data entry. I expect similar issues in other languages, and then the interplay when you are sorting multiple languages. It is a wonderful line of work you do, but this Macao article certainly gives a flavor of a very esoteric aspect. Just some random thoughts...
  • It actually does need to be somewhat sensitive to the norms that the user expects -- any time it does not match exactly for CJK it may just be some other pronunciation?

    This article may be esoteric, I had not thought of it that way.... it was just an issue that interested me, that's all. :-)
  • "According to some sites (like the U.S. Library of Congress), its official name used to be Macao when it was a Portuguese territory, but after the reversion to China the official name became Macau (China : Special Administrative Region), and in conversations about it in its former territorial status the name Macau is now preferred in most contexts."

    In Portugal, its official name was (before the reversion) and still is Macau...
  • Hey ncampos!

    I would expect that -- the names of languages when translated into other languages do not always match what they are to people who speak the original language. For example, the word for Nederlands may be Holandski, Holland dili, Belanda, holandština, Olandè, Hollandsk, Olandese, Holandès, holland, holländska, Nederlansk, Holandês, holenderski, Olandeză, Golland, gоllandča, golandaca, or Holandés.

    But it is still Dutch to me. :-)
  • Previous posts in this series: Part 0: The empty string sorts the same in every language Part 1: The

  • It has been at least a good 29 months since I posted Is it Macau or is it Macao? , which (among other

  • I have been asking the same question, and didn't notice you blogged about it.

    Go visit http://www.gov.mo/egi/Portal/index.htm

    It is now using both spellings on the same page!

    Please take a look at

    http://hk.knowledge.yahoo.com/question/?qid=7007012200058

    (if you can read Chinese), it says the reply is official from the Macao government.

  • Well, someone posting on Yahoo answers quoting what they claim is the official answer may or may not constitute proof for others, and whether any country or region has the right to decide the only way that OTHER languages would spell a word is a questionable notion for any language....

  • Yes, the author only says his friend claimed to have got an official answer. OTOH, you can see on Macao's official web site that both spellings exist.

    Just as much right as when the PRC started to spell Peking as Beijing.

  • Leaves me with flashbacks to Farsi/Persian and Uighur/Uyghur issues. :-)

  • Yes, the end of the title is an allusion to a late 80s Poison power ballad based on a Bret Michaels love

  • It might remind you a bit of the whole Uighur/Uyghur issue (ref: here ), but it is not really quite that.

  • Please read the disclaimer ; content of Michael Kaplan's blog not approved by Microsoft! Now it isn't

  • Please read disclaimer ; content of Michael Kaplan's blog not approved by Microsoft! Pronunciation based sorting for Traditional Chinese

  • I don't know if you read Chinese, but this article mentions an incident of Peking/Beijing, and I like the reply by the governor:

    http://plastichk.blogspot.com/2008/03/peking.html
