<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Loneliness of the long-distance linguist. : localization</title><link>http://blogs.msdn.com/kierans/archive/tags/localization/default.aspx</link><description>Tags: localization</description><dc:language>en</dc:language><generator>CommunityServer 2.1 SP1 (Build: 61025.2)</generator><item><title>Found in translation.</title><link>http://blogs.msdn.com/kierans/archive/2006/09/06/743540.aspx</link><pubDate>Thu, 07 Sep 2006 09:46:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:743540</guid><dc:creator>KieranS</dc:creator><slash:comments>4</slash:comments><comments>http://blogs.msdn.com/kierans/comments/743540.aspx</comments><wfw:commentRss>http://blogs.msdn.com/kierans/commentrss.aspx?PostID=743540</wfw:commentRss><description>&lt;P&gt;Though languages can be interestingly different from one another, the majority of work in modern linguistics has been dedicated to showing that the differences are only cosmetic; at heart, all languages are expressively equivalent. I know this is true, because every time someone from&amp;nbsp;[big name theoretical linguistics department]&amp;nbsp;comes up with a new model to describe this, a whole bunch of people who used to be at&amp;nbsp;[big name theoretical linguistics department]&amp;nbsp;run out and apply it to their favorite languages and show how it works. :)&lt;/P&gt;
&lt;P&gt;But in software, things work differently. The kinds of linguistic differences that are meaningful to linguists are not the same as the kinds of differences that are meaningful to software developers. And for a linguist working&amp;nbsp;in the software industry, that's where things really start to get interesting.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;All languages are expressively equivalent.&lt;/STRONG&gt; What do linguists actually mean by that? They mean that although some things can be difficult to translate, there is no semantic content that can be expressed in one language but not in another. You may have to say it a different way, using different vocabulary, syntax, gesture, intonation, and so on, but if it can be said in some language, then you can say it in yours. From the point of view of a theoretical linguist, this is a pretty deep contention, because it shows something about how the brain works regardless of the language that someone happens to speak. &lt;/P&gt;
&lt;P&gt;Well. Full disclosure:&amp;nbsp;From the point of view of some linguistic anthropologists, the above contention sounds pretty wacky. But let's pretend that theoretical linguists are right and the body&amp;nbsp;of work supporting this from the last half-century is really on the mark. That's the kind of background I come from and I find the work pretty convincing. So what's the problem? The problem is that in software, it doesn't actually matter if it's true. It's true, but it's not usefully true. Because in software, you have to design stuff, and when you start designing stuff you run into all kinds of practical constraints that theoretical linguists don't need to care about.&lt;/P&gt;
&lt;P&gt;Let's look at localization as one example. On the one hand, it works out pretty well that all languages can figure some way to express opening, saving, printing, and closing files (even if sometimes there's some discussion&amp;nbsp;about the &lt;EM&gt;best&lt;/EM&gt; way to say it).But how far does that take us? If you're developing an application, then you still need to figure out how to design your UI so that all relevant localized strings fit within it. It's great that text can be localized into several expressively equivalent languages, but if you're an application&amp;nbsp;developer, you're not interested in the extent to which languages are the same in this regard.&amp;nbsp;You care about&amp;nbsp;the range of differences between them. Because your application needs to plan around exactly that set of differences.&lt;/P&gt;
&lt;P&gt;You can find analogous examples all over the software space. It is crucial to grasp the range of sounds across languages in order to do good work in speech. You need to understand the ways in which different languages can segment written text into morphemes, words, sentences, and discourse segments before you can really plan great content rendering support. In all of these cases, really good multilingual work starts by documenting the range of variation as closely as possible; it is only by delineating the differences that the necessary design generalizations start to emerge.&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=743540" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/kierans/archive/tags/linguistics/default.aspx">linguistics</category><category domain="http://blogs.msdn.com/kierans/archive/tags/localization/default.aspx">localization</category></item><item><title>Lexical diffusion, software style.</title><link>http://blogs.msdn.com/kierans/archive/2006/08/17/703220.aspx</link><pubDate>Thu, 17 Aug 2006 10:53:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:703220</guid><dc:creator>KieranS</dc:creator><slash:comments>2</slash:comments><comments>http://blogs.msdn.com/kierans/comments/703220.aspx</comments><wfw:commentRss>http://blogs.msdn.com/kierans/commentrss.aspx?PostID=703220</wfw:commentRss><description>&lt;P&gt;The &lt;A href="http://g.msn.com/9SE/1?http://members.microsoft.com/wincg&amp;amp;&amp;amp;DI=6244&amp;amp;IG=027b25a1643148e9b2278e14ce5ab6c9&amp;amp;POS=1&amp;amp;CM=WPU&amp;amp;CE=1&amp;amp;CS=OTH&amp;amp;SR=1"&gt;&lt;FONT color=#1f5ba4&gt;Microsoft &lt;STRONG&gt;Community&lt;/STRONG&gt; &lt;STRONG&gt;Glossary&lt;/STRONG&gt; - Linguistic Partnership Project&lt;/FONT&gt;&lt;/A&gt;&amp;nbsp;allows members of a speech community to work together to create and standardize technical vocabulary for their language. Community Glossaries can be an important step on the road to software localization for customers in regions that are newer to technology. Thus far there are online Community Glossary projects for languages all around the world, including &lt;A href="http://members.microsoft.com/wincg/home.aspx?langid=1135"&gt;Greenlandic&lt;/A&gt;, &lt;A href="http://members.microsoft.com/wincg/home.aspx?langid=1099"&gt;Kannada&lt;/A&gt;, &lt;A href="http://members.microsoft.com/wincg/home.aspx?langid=1153"&gt;Māori&lt;/A&gt;, &lt;A href="http://members.microsoft.com/wincg/home.aspx?langid=1054"&gt;Thai&lt;/A&gt;,&amp;nbsp;&lt;A href="http://members.microsoft.com/wincg/home.aspx?langid=1066"&gt;Vietnamese&lt;/A&gt;, and several others. Interested participants work with a local moderator to decide on the appropriate terminology to be used in the software space.&amp;nbsp; &lt;/P&gt;
&lt;P&gt;So does it work? That's an interesting question.&lt;/P&gt;
&lt;P&gt;New vocabulary is introduced into a language in several ways. Sometimes, rarely, someone well-known or otherwise influential coins a word or phrase that happens to stick. This can happen on many scales, ranging from the popular kid in a group of friends all the way up to celebrities who get media attention. Sometimes a new technology is introduced and some brand name associated with it becomes the generic term, as with &lt;EM&gt;kleenex, xerox&lt;/EM&gt;, and more recently and much to their apparent chagrin, &lt;EM&gt;google&lt;/EM&gt;.&lt;/P&gt;
&lt;P&gt;More often, though, new vocabulary gets introduced when speakers of one language come into contact with speakers of another. English has borrowed vocabulary from virtually every language with which it has come into contact. No matter how government language authorities&amp;nbsp;try to impose standards around vocabulary in advertising, media, and school curricula, such efforts have not generally been successful in achieving results in the real speech of real people. Mexican kids still speak Spanish during&amp;nbsp;Little League games and the French still end the work week with &lt;EM&gt;le weekend. &lt;/EM&gt;It's just how language works.&lt;/P&gt;
&lt;P&gt;That's not to say that all standards efforts are ineffective. When a particular term catches on and its usage is popularized, it becomes a de facto standard for the speech community that uses it, and its very popularity both propels it forward and also catalyzes its eventual decline. Vocabulary choice is one way that speakers define themselves either as part of or against the speech communities they inhabit.&lt;/P&gt;
&lt;P&gt;What is true for colloquial vocabulary is also true for terminology in the technology space. Terminology persists when people use it. Which brings us back to the Community Glossary efforts.&lt;/P&gt;
&lt;P&gt;If efforts to identify and standardize technical vocabulary&amp;nbsp;are driven solely by official language authorities, I don't think they will be successful in the long run. A word is only as present as the community that uses it. It is only by involving the target user community that standardization efforts can hope to identify relevant and enduring terminology, and the most successful products will be the ones that&amp;nbsp;create a user experience with that&amp;nbsp;terminology in appropriate ways. I'm very excited about the Community Glossary&amp;nbsp;project because it lets real speakers identify real vocabulary that actually works for them, and it creates a lexical resource that can be valuable in language preservation efforts for endangered languages.&lt;/P&gt;
&lt;P&gt;Anyone who believes that a Community Glossary effort would be beneficial for their language can work with their local &lt;A href="http://www.microsoft.com/worldwide/" target=_blank&gt;Microsoft subsidiary office&lt;/A&gt;&amp;nbsp;to request one and get the process started.&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=703220" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/kierans/archive/tags/linguistics/default.aspx">linguistics</category><category domain="http://blogs.msdn.com/kierans/archive/tags/community+solutions/default.aspx">community solutions</category><category domain="http://blogs.msdn.com/kierans/archive/tags/localization/default.aspx">localization</category></item><item><title>Lights, camera, action!</title><link>http://blogs.msdn.com/kierans/archive/2006/08/03/687991.aspx</link><pubDate>Fri, 04 Aug 2006 01:44:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:687991</guid><dc:creator>KieranS</dc:creator><slash:comments>3</slash:comments><comments>http://blogs.msdn.com/kierans/comments/687991.aspx</comments><wfw:commentRss>http://blogs.msdn.com/kierans/commentrss.aspx?PostID=687991</wfw:commentRss><description>&lt;P&gt;For everything you ever wanted to know about Windows globalization and localization, or at least for&amp;nbsp;one or two&amp;nbsp;things, you might like to check out this &lt;A title=http://channel9.msdn.com/Showpost.aspx?postid=222513 href="http://channel9.msdn.com/Showpost.aspx?postid=222513"&gt;&lt;FONT color=#0000ff&gt;Channel 9&lt;/FONT&gt;&lt;/A&gt;&amp;nbsp;interview. It features &lt;A id=bp____ctl0___bs___lcl___Categories__ctl0_Links__ctl3_Link href="http://blogs.msdn.com/michkap"&gt;&lt;FONT color=#0000ff&gt;Michael Kaplan&lt;/FONT&gt;&lt;/A&gt;, whose blog you may know; Jeff Allen, our director of localization services; Alp Turkmen, who is driving our business direction in the Windows international space; Claus Juhl, one of our localization engineers; and me. Michael and I speak about globalization issues, with a heavy focus on the Locale Builder and associated ways that we'd like to open up our infrastructure to accomodate all kinds of community solutions, and we're about 45 minutes in. But don't skip! There's great information throughout.&lt;/P&gt;
&lt;P&gt;If you watch, please let us know what you think! And yes -- there are closer to 7,000 languages in the world than 2,000, for those of you playing along at home.&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=687991" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/kierans/archive/tags/NLS/default.aspx">NLS</category><category domain="http://blogs.msdn.com/kierans/archive/tags/locales/default.aspx">locales</category><category domain="http://blogs.msdn.com/kierans/archive/tags/community+solutions/default.aspx">community solutions</category><category domain="http://blogs.msdn.com/kierans/archive/tags/localization/default.aspx">localization</category><category domain="http://blogs.msdn.com/kierans/archive/tags/Local+Language+Program/default.aspx">Local Language Program</category></item></channel></rss>