Every now and then I look at visitor logs on the various personal and professional sites/blogs that I administer. It makes for a fascinating experience to see the many places worldwide that visitors come from. I have often wondered about non English speakers and how I could make my writing more accessible to them. While some professional and company web sites have translated versions available, in many user forums and communities across the web there have been requests for a translated version of the pages/posts. Today, on many sites, I have to copy the text on the site, paste it into a translator and look at the translation. It is cumbersome and not very seamless in an otherwise smooth navigation experience.
I am very pleased to say Windows Live Translator solved this problem with the latest feature addition that rolled out this week. Now on the Live Translator home page you will find a new link "Add the web page Translator to your site". By clicking on this link you go to a page that offers snippets of code that can be added to individual web pages for which you wish to offer translations.
The code generator will create the appropriate widget depending on the source language of your site. Refer to the Live Translator introduction post where Andrea listed the language pairs that we currently support.
So here is what you do to have a link on your web page to translate it:
Step 1: Click on the Add the web page Translator to your site link
Step 2: Select the language your web page is written in (source language)
For example: Since all the articles on my blog are in English, I choose English as the source language
Step 3: The code that you need to copy and paste into your web page's HTML is generated in the box
For example: Since I chose English, the code that is generated looks like this
Step 4: Copy that code and paste it into the page that should offer translation.
For example: On my blog say I want the blog post I wrote about Live Translator to be translated, I go into the blog editor and paste it like so:
If the blog or web page uses templates, one could also paste the code into a template - thereby providing the Translate This Page widget on all pages
Step 5: Enjoy an expanded (and hopefully more appreciative) audience!
The end result on my blog looks like this in the case of a single post translation:
The end result looks like this if I put it in the template (this allows for translation of every post):
For the more technically minded here is some more information on the parameters that the Live Translator accepts:
where lp is the language pair (such as en_fr for english to french) for source and target languages. a is the URL you want translated.
where lp is the language pair (such as en_fr for english to french) for source and target languages. a is the URL you want translated.
The Windows Live focused community site ViaWindowsLive is making creative use of the Live Translator to make their site available in multiple languages (look on the left bottom of the page). I would love to check out how you might be utilize this new feature. Feel free to post a link to your site in the comments.
Edit: Updating the parameters link
Have you tried the new Windows Live Search? I have been using it as the default search on all my computers at work and home (10+ of them) and am very impressed by the improvements. While all the great new features are quite excellent, I am very pleased with the quality (relevance) of the search results themselves. The rollout was gradual, and most of you should be seeing the new search by now. If not, both the Live Search blogs have direct links to the various features.
On that note, here is some search related news for you from Andrea, your friendly neighborhood program manager for the Windows Live Translator Beta:
Windows Live Translator Beta is now directly integrated in the newest version of Live Search. What does this mean? When a search result (i.e. web page) has been found in a language which is different than the user’s language, and the Live Translator can translate from the web page’s into the user’s language, the search result is accompanied by a new link: “Translate this page”
(User language = English; web page language of all 4 search results = Spanish; translation offered: from Spanish to English)
A click on this link opens the found web page in Bilingual Viewer mode, allowing the user to see the original web page and its translations with all enhancements described in our previous blog entry.
You may ask: how does Live Search know “my” language? Generally, your system settings provide this information. You can change your language settings either in your browser which will influence the behavior of all language(or “market)-sensitive web sites, or you can define a “Display Language” just for your Live Search experience. Live Search’s “Options” menu allows you to select your preferred language in which the Live Search user interface will be displayed to you.
If you choose to select a display language there, it will henceforth be considered “your” language. Any web page found by Live Search that is in a different language from yours may be shown with a “Translate this page” link, provided that translations from the page language into your (selected) language are available. (Please see our introduction blog entry for a list of translation languages we currently offer).
That was Andrea, giving you the scoop on what to expect with the new search integration.
Many times, I find myself searching for deeper meanings for cryptic error codes that programs often throw up. Some other times I find myself searching for any information that might be gleaned about the latest Tablet PCs. These searches tend to turn up sites in other languages with potentially useful information. The new "translate this" integration into search is now a feature that I cannot live without.
Have fun searching and translating!
1) Where can I find help with the translator service?
Help and FAQ are here.
2) How is the text translated?
Text is translated by computer software automatically and without human involvement. Web pages about computer-related topics are translated by Microsoft’s own state-of-the-art statistical machine translation technology which has been trained on large amounts of computer-related data. Web pages about other topics or into languages that are not included in Microsoft’s eight currently supported languages are translated by translation software from Systran.
3) Why is the quality of the translation not as good as I would like it to be?
Language translation is extremely difficult, as the meaning of words and phrases often depends on the context and specialized knowledge of the domain area or culture. Sentence structures and grammatical rules vary significantly between two languages, adding to the complexity of the translation challenge. Currently, it still requires human skills to translate sentences without errors. The quality of today's most advanced translation software is well below the accuracy and fluency of a professional translator, and many sentences are simply not understandable. Researchers are continuously working on improvements, but it may be many years before high quality translation can be consistently offered by a computer. For this reason, we display both the original text and its translation, anticipating that you will find it easier to understand the translation, comparing it with the original content if needed.
Some of our translation results (usually for computer-related content) are based on training our translation system on large amounts of bilingual text. The more bilingual or multilingual text we can train our system on, the better our translation quality will become. If you have large amounts of translated text in any subject domain, which you would be willing to share with us, please click here to let us know.
4) Where can I report translation problems? Can I make corrections to the translation you offer?
Use the Feedback link in the Bilingual Viewer or the Microsoft Translator Beta home page to report problems. We are currently working on additional ways to collect your corrections.
5) What is the difference between rule-based machine translation and statistical machine translation?
1. Rule-based MT systems require extensive dictionaries containing syntactic, semantic and morphological data, and large sets of rules to translate a word or phrase. Of multi-language, broad domain systems, Systran is the industry standard and benchmark for automatic translation, and relies on rule-based technology developed by a large team of linguists over many years. Other best-of-breed rule-based engines exist for specific language pairs, but their quality in a particular language pair does not necessarily scale to other pairs.
2. Statistical MT is based on machine-learning technologies, and relies on large volumes of parallel human-translated texts from which the MT engine can learn. This data must be obtained in every language pair and domain that the machine will be asked to translate in. While the quality of current systems is limited by the availability of parallel data, the potential for language coverage and quality improvements is very promising, as multilingual content on the web increases, and new techniques for mining parallel data are discovered.
6) Which language translations does Microsoft’s own statistical machine translation engine support?
Microsoft’s own translation system is used for translation from English into: German, Spanish, French, Italian, Portuguese (Brazilian), Chinese (simplified and traditional), and Japanese. Other languages are translated using software from Systran.
7) How long has Microsoft been working on the machine translation engine?
Microsoft has been developing statistical machine translation technology in its Research division for over 3 years, and other natural language and machine translation technologies for over 12 years. While the original focus of this work was on Microsoft’s own localization needs, where this technology has been very successful enabling customers from many countries to access help files, knowledge base articles and MSDN articles, we are now pleased to bring it to you for general use. Research and development are ongoing.
8) Why aren't all web pages translated?
There are a number of reasons why we may not be able to translate content on a certain web page:
· System requirements may not be met
· We do not transmit any https (secure web pages) content to our translation server, as this could be considered a phishing attempt. You still have the opportunity to navigate to an https site yourself and copy/paste paragraphs into the translator.live.com translation box, but in order to respect secure information 100%, we will not automatically send content that is protected by https to our server.
· Flash and text on images cannot be translated.
· The Bilingual Viewer uses frames to display translations to you. Pages containing scripts which enforce that a web page is not displayed in a frame will not be translated.
9) When do I see a “Translate this page” link next to a Live Search result?
This link is available when Live Search has found a web page which is in a different language than the default (or chosen) language for your browser, and if we can offer a translation between the web page language and your default language.
10) How do I set my preferred translation language?
You can select a translation language on the Microsoft Translator Beta home page (www.microsofttranslator.com) for your text or web page translations.
Once you see web page translations in the Bilingual Viewer, you may change your original and translation language.
The Language tab under Search Options (accessible through the “Options” link on the Live Search page) allows you to select a translation language into which pages are translated. This may be other than the default language of your browser. It also allows you to tell the system to return results only in certain languages.
11) Can I set a preferred translation view within the bilingual viewer?
You can select a view from within the bilingual viewer. If you use a compliant browser, this preference will persist between viewing sessions.
12) How is this service similar to others in the market, and how is it different?
Like other systems in the market, the Microsoft system offers free text and web page translation. Microsoft’s web page translator differs from the competitors in its use of a unique Bilingual Viewer, which shows the original and translation side by side, and which customers have said they prefer over other viewing modes. It also offers higher quality translations of computer-related technical text through the use of Microsoft’s own statistical MT technology.
Microsoft’s MT portfolio is focused on meeting customer needs through intuitive integration of MT into products and services that are part of our users’ everyday workflow, and creating as user-friendly an experience as possible. As such, we welcome customer feedback and look forward to hearing suggestions.
13) Will I be able to translate documents within Word?
Currently, Microsoft Office allows you to translate words, phrases or documents. For some languages, Office offers a screen-tip dictionary lookup for individual words. A “Translation” option that is available in the Office ribbon-UI as well as in the right-click menu allows the user to request translations of words, entire phrases, and even Word documents via the Research & Reference pane. There, users may select a translation language and enter sentences for translation into a query box, or they may click on a “Translate the whole document” button which will provide translations for short documents.
14) I’ve read that there are five views offered with the bilingual viewer. Where is the fifth view?
This even had the creators of the Bilingual Viewer surprised :). They share two competing theories about what the mysterious 5th view might be, and you can help solve this mystery. Please send your discovery or guesses here. We will publish all unique findings on this blog … and who knows—maybe you will uncover a secret 6th or 7th view?
15) When can I use the translator for language X?
We are constantly working to improve the quality of existing translations as well as those languages that are not ready yet. Keep an eye on this blog for announcements as to when new languages are added to the service.
16) I have a feature suggestion, what do I do?
Use the Feedback link in the Bilingual Viewer or the Microsoft Translator Beta home page to send feature requests. While we might not be able to respond personally, be assured that the team will be looking at each request that comes in.
17) What are the system requirements for the service?
There has been some great coverage recently on the beta release of our new online translation service. In this post I would like to provide you with information regarding Microsoft’s entry into the free online machine translation field – straight from the horse’s mouth, figuratively speaking.
The URL of the translation home page is http://www.microsofttranslator.com, where you can issue requests for text and web page translations:
Note that a check box option labeled Computer-related content allows you to get better-adapted translations for (computer-) technical text, provided by Microsoft Research’s own statistical machine translation engine. This service is available for the following language pairs:
English - Chinese Simplified English - Chinese Traditional English - French English - German English - Italian English - Japanese English - Portuguese English - Spanish
English - Chinese Simplified
English - Chinese Traditional
English - French
English - German
English - Italian
English - Japanese
English - Portuguese
English - Spanish
Non-technical translations and additional language pairs are currently provided by the latest version of Systran. The additional language pairs available are:
Arabic - English Chinese Simplified - English Chinese Traditional - English Dutch - English English - Arabic English - Dutch English - Korean French - English French - German German - English German - French Italian - English Japanese - English Korean - English Portuguese - English Russian - English Spanish - English
Arabic - English
Chinese Simplified - English
Chinese Traditional - English
Dutch - English
English - Arabic
English - Dutch
English - Korean
French - English
French - German
German - English
German - French
Italian - English
Japanese - English
Korean - English
Portuguese - English
Russian - English
Spanish - English
Our innovative approach to web page translation includes a user interface we refer to as the Bilingual Viewer. It offers 4 types of bilingual views from which users may choose depending on their preference or screen size. The side-by-side and top/bottom views offer synchronized scrolling, highlighting, and navigation (and yes, we still have some wrinkles to iron out there). In the two single language views, you can hover your mouse pointer over a sentence in one language and the corresponding passage in the other language is automatically displayed nearby for ease of reference. Finally, we render the translated text progressively on a web page in order to make it more quickly available for the user to read, while other page elements are still being translated in the background.
Original with hover translation view:
Translation with hover original view:
Live Search will soon be exposing “Translate this page” links on the results page for search results which are in a language that is different from the user’s system language (provided that the required language pair is available from our service). When you click on a “Translate this page” link, the web page will be opened in the Bilingual Viewer (in side-by-side view or the view you selected during your last viewing session).
Language translation is extremely difficult, as the meaning of words and phrases often depends on the context and specialized knowledge of the domain area or culture. Sentence structures and grammatical rules vary significantly between two languages, adding to the complexity of the translation challenge. Currently, it still requires human skills to translate sentences without errors. The quality of today's most advanced translation software is well below the accuracy and fluency of a professional translator, and many sentences are simply not understandable. Researchers are continuously working on improvements, but it may be many years before high quality translation can be consistently offered by a computer. For this reason, we display both the original text and its translation, anticipating that you will find it easier to understand the translation, comparing it with the original content if needed. Some of our translation results (usually for computer-related content) are based on training our translation system on large amounts of bilingual text. The more bilingual or multilingual text we can train our system on, the better our translation quality will become. If you have large amounts of translated text in any subject domain, which you would be willing to share with us, please click here to let us know.
Please refer to our FAQ section for more answers to questions that have reached us, and please do make use of the option to send us Feedback. We have released our first version of this translation service as a Beta, so we can listen to and learn from you how to best meet your needs. Expect to see continuous improvements to the Windows Live Translator Beta.
Microsoft Research’s Machine Translation (MSR-MT) group has been among the leading research organizations in the machine translation space for over 8 years, and some of the foundational work in natural language processing at MSR began over 16 years ago. The team’s approach to machine translation integrates linguistic features with state-of-the-art statistical machine translation algorithms. The team’s focus has always been on automatically acquiring translation knowledge from bilingual corpora, i.e., parallel data consisting of original source language sentences and their corresponding translations by human translators. About 3 years ago, the team’s focus shifted from a purely rule-based approach to this task toward a hybrid approach that includes extensive statistical processing, allowing for greater scalability across domains and into new languages.
Microsoft’s Machine Translation technology was first developed for in-house localization purposes, to allow our Customer Support organization to publish technical support documents with a frequency and language breadth that would have been prohibitively expensive using human translators. With all of Microsoft’s previously human-translated documents and localized software at its disposal, the MT team was able automatically to train its statistical MT engine to achieve quite good quality in the technical domain. This technology was extended to support the Windows localization team, the Developer Division, MSDN, and several other groups within Microsoft. It has also allowed Microsoft to reach many more customers than would have ever been possible using human translation alone.
After focusing on Microsoft’s own translation needs, the team began to build a scalable web service that would allow it to provide translation services to the general public, as a standalone tool on the web, and as a feature within other products. Given that the Microsoft MT engine has been trained most heavily on technical data, it has not yet been tuned for translating text in other subject domains. However, we hope to continue improving the quality and breadth of the engine. We look forward to sharing our developments with you over the coming months on this blog.