The Translator Add-in can be added to any web page and makes it really easy for visitors to quickly translate the page into any one of our supported languages. The translation is conveniently displayed side-by-side with the original page so that both original and translated languages are available to the reader. There are also other display options available to try out.
This is what visitors to your website will see:
There’s also a cool auto-detect feature that will pick up your visitor’s browser language and offer up the word “Translator” in their native language.
Have you implemented this on your web sites? What do you think? How can we improve it?
If you haven’t, try it out and let us know your feedback!
Every now and then I look at visitor logs on the various personal and professional sites/blogs that I administer. It makes for a fascinating experience to see the many places worldwide that visitors come from. I have often wondered about non English speakers and how I could make my writing more accessible to them. While some professional and company web sites have translated versions available, in many user forums and communities across the web there have been requests for a translated version of the pages/posts. Today, on many sites, I have to copy the text on the site, paste it into a translator and look at the translation. It is cumbersome and not very seamless in an otherwise smooth navigation experience.
I am very pleased to say Windows Live Translator solved this problem with the latest feature addition that rolled out this week. Now on the Live Translator home page you will find a new link "Add the web page Translator to your site". By clicking on this link you go to a page that offers snippets of code that can be added to individual web pages for which you wish to offer translations.
The code generator will create the appropriate widget depending on the source language of your site. Refer to the Live Translator introduction post where Andrea listed the language pairs that we currently support.
So here is what you do to have a link on your web page to translate it:
Step 1: Click on the Add the web page Translator to your site link
Step 2: Select the language your web page is written in (source language)
For example: Since all the articles on my blog are in English, I choose English as the source language
Step 3: The code that you need to copy and paste into your web page's HTML is generated in the box
For example: Since I chose English, the code that is generated looks like this
Step 4: Copy that code and paste it into the page that should offer translation.
For example: On my blog say I want the blog post I wrote about Live Translator to be translated, I go into the blog editor and paste it like so:
If the blog or web page uses templates, one could also paste the code into a template - thereby providing the Translate This Page widget on all pages
Step 5: Enjoy an expanded (and hopefully more appreciative) audience!
The end result on my blog looks like this in the case of a single post translation:
The end result looks like this if I put it in the template (this allows for translation of every post):
For the more technically minded here is some more information on the parameters that the Live Translator accepts:
where lp is the language pair (such as en_fr for english to french) for source and target languages. a is the URL you want translated.
where lp is the language pair (such as en_fr for english to french) for source and target languages. a is the URL you want translated.
The Windows Live focused community site ViaWindowsLive is making creative use of the Live Translator to make their site available in multiple languages (look on the left bottom of the page). I would love to check out how you might be utilize this new feature. Feel free to post a link to your site in the comments.
Edit: Updating the parameters link
Will Lewis is a program manager on the Microsoft Translator team, working on language quality and data acquisition. Today's guest blog is a high level explanation of how the engine works:
As many of you know, under the hood Microsoft Translator is powered by a Statistical Machine Translation (SMT) engine. Statistical systems are different than rule-based ones in that the “rules” mapping words and phrases from one language to another are learned by the system rather than being hand-coded. Training an SMT requires amassing a large amount of parallel training data—hopefully of good quality and from heterogeneous sources—and training the engine on that data. (By parallel, we mean a source of data where the content for one language is the same as the content for the other.) The engine learns the correspondences between words and phrases in one language and those in another, which are often reinforced by repeated occurrences of the same words and phrases throughout the input. For instance, in training the English-German system let’s say, if the engine sees the phrase All rights reserved on the English side and also notices Alle Rechte vorbehalten on the German side, it may align these two phrases, and assign some probability to this alignment. Repeated occurrences of the source and target phrases in the training data will only reinforce this alignment.
Generally, having parallel data for a language pair means we can train engines in both directions (i.e., both the English-German and the German-English systems can be trained on the same input sentences). Some of you had some questions regarding why it was that we released the English-Spanish system before we released Spanish-English. There were really two reasons. First, English-Spanish was the first general domain language pair we released. Releasing one language pair allowed us to test the infrastructure before we started releasing more. Second, the technology for Spanish-English was slightly different than that used for English-Spanish, and we need some additional time to do the necessary infrastructural changes to accommodate. In the future, we plan to release new translation systems in pairs (with a couple of exceptions). I can’t reveal what languages we have planned next, but do expect some new ones soon!
For those of you interested in technical discussions regarding our engines and how they work, please refer to some of the papers by the researchers who developed them. Three recent papers of note are:
Chris Quirk, Arul Menezes. Do we need phrases? Challenging the conventional wisdom in Statistical Machine Translation May 2006 New York, New York, USA Proceedings of HLT-NAACL 2006
Chris Quirk, Arul Menezes. Dependency Treelet Translation: The convergence of statistical and example-based machine translation? March 2006 Machine Translation 43-65 (Attached file)
Chris Quirk, Arul Menezes. Using Dependency Order Templates to Improve Generality in Translation July 2007 Association for Computational Linguistics
There has been some great coverage recently on the beta release of our new online translation service. In this post I would like to provide you with information regarding Microsoft’s entry into the free online machine translation field – straight from the horse’s mouth, figuratively speaking.
The URL of the translation home page is http://www.microsofttranslator.com, where you can issue requests for text and web page translations:
Note that a check box option labeled Computer-related content allows you to get better-adapted translations for (computer-) technical text, provided by Microsoft Research’s own statistical machine translation engine. This service is available for the following language pairs:
English - Chinese Simplified English - Chinese Traditional English - French English - German English - Italian English - Japanese English - Portuguese English - Spanish
English - Chinese Simplified
English - Chinese Traditional
English - French
English - German
English - Italian
English - Japanese
English - Portuguese
English - Spanish
Non-technical translations and additional language pairs are currently provided by the latest version of Systran. The additional language pairs available are:
Arabic - English Chinese Simplified - English Chinese Traditional - English Dutch - English English - Arabic English - Dutch English - Korean French - English French - German German - English German - French Italian - English Japanese - English Korean - English Portuguese - English Russian - English Spanish - English
Arabic - English
Chinese Simplified - English
Chinese Traditional - English
Dutch - English
English - Arabic
English - Dutch
English - Korean
French - English
French - German
German - English
German - French
Italian - English
Japanese - English
Korean - English
Portuguese - English
Russian - English
Spanish - English
Our innovative approach to web page translation includes a user interface we refer to as the Bilingual Viewer. It offers 4 types of bilingual views from which users may choose depending on their preference or screen size. The side-by-side and top/bottom views offer synchronized scrolling, highlighting, and navigation (and yes, we still have some wrinkles to iron out there). In the two single language views, you can hover your mouse pointer over a sentence in one language and the corresponding passage in the other language is automatically displayed nearby for ease of reference. Finally, we render the translated text progressively on a web page in order to make it more quickly available for the user to read, while other page elements are still being translated in the background.
Original with hover translation view:
Translation with hover original view:
Live Search will soon be exposing “Translate this page” links on the results page for search results which are in a language that is different from the user’s system language (provided that the required language pair is available from our service). When you click on a “Translate this page” link, the web page will be opened in the Bilingual Viewer (in side-by-side view or the view you selected during your last viewing session).
Language translation is extremely difficult, as the meaning of words and phrases often depends on the context and specialized knowledge of the domain area or culture. Sentence structures and grammatical rules vary significantly between two languages, adding to the complexity of the translation challenge. Currently, it still requires human skills to translate sentences without errors. The quality of today's most advanced translation software is well below the accuracy and fluency of a professional translator, and many sentences are simply not understandable. Researchers are continuously working on improvements, but it may be many years before high quality translation can be consistently offered by a computer. For this reason, we display both the original text and its translation, anticipating that you will find it easier to understand the translation, comparing it with the original content if needed. Some of our translation results (usually for computer-related content) are based on training our translation system on large amounts of bilingual text. The more bilingual or multilingual text we can train our system on, the better our translation quality will become. If you have large amounts of translated text in any subject domain, which you would be willing to share with us, please click here to let us know.
Please refer to our FAQ section for more answers to questions that have reached us, and please do make use of the option to send us Feedback. We have released our first version of this translation service as a Beta, so we can listen to and learn from you how to best meet your needs. Expect to see continuous improvements to the Windows Live Translator Beta.
The Translator team is excited to announce the availability of the English to Russian language pair on MicrosoftTranslator.com. This language pair is now available across all implementations of the translator technology, including Live Search and (in the next few days) the Windows Live Messenger TBot.
You have probably noticed that the Russian to English language pair has been available for some time on our site. As always, translation quality is a top focus for our team. Sometimes reaching quality takes longer for a particular direction – this can be for many reasons. For example, if you are translating between a simple language and a complex language, the translations will be better going to the simple language than they will be going to the complex language. If you are interested in learning more about the technology behind our machine translation engine, see Will Lewis’ blog post on statistical machine translation.
While machine translation is certainly never perfect, for this new language pair we have now hit our quality bar for release. How do we determine the quality bar? In general, when the translation can be considered “useful”. We consistently receive feedback from our users that imperfect translation which is useful is better than no translation. So we have to balance user demand with translation quality. With that in mind, we test our language pairs with human evaluations, until we have reached “useful” translation.
We are always open to your constructive feedback and help – please continue to help us so that we can keep improving quality! We are always very grateful for good feedback.
Some other updates in this release you may notice:
· We have officially migrated our domain to www.microsofttranslator.com
· Improved quality across several language pairs, due to improvements in training data quality
· Improvements in Japanese to English, due to an improved method of parsing the training data