In the current crisis in Haiti there are a number of initiatives to rapidly build software to assist in humanitarian aid. Responding to community requests for a machine translation (MT) system to translate between English and Haitian Creole, our team has been hard at work over the last few days. I am glad to announce that an experimental Haitian Creole MT system is now publicly available via several services and APIs powered by Microsoft Translator technologies. We will continue working on improving the system, but we hope meanwhile that in spite of the experimental nature – it will be of use in the relief efforts.
1) What is being announced today?
Responding to requests from the community involved in Haitian relief efforts, Microsoft Research is making available today an experimental machine translation system for translating to and from Haitian Creole. You can try it at http://translate.bing.com or http://www.microsofttranslator.com.
2) How is it significant?
With the devastating disaster that struck Haiti, we have all been individually pitching in to help the efforts. This is our effort, as a team, to respond to the needs of communities such as Crisis Commons by delivering a Haitian Creole translator which can be of help to individual users, as well as other technology projects that could use a scalable translation system in their relief endeavors. Further, the usage of our API is completely free and it can be built into any application or website for immediate use. We hope that this might help the many applications being developed (such as those on crisiscommons.org) to aid the humanitarian efforts.
3) How can I use this system?
The Haitian Creole translator is now part of the Microsoft Translator web service enabling many of the user scenarios powered by the service. Users can access the service through the Microsoft Translator web site. Developers would be interested in looking at our APIs – and choose from SOAP or HTTP (Support for Haitian in our AJAX API will be rolled out in the coming days).
4) How is it different from other efforts?
There have been some great efforts in quickly building dictionary and rule-based Haitian Creole translation tools. The statistical machine translation system behind Microsoft Translator allows for a continuous improvement in the quality of translations (by adding more training data). Also, by delivering this as part of our web service we can ensure scale and performance and open up the possibility of using our many scenarios (Bing Translator, Internet Explorer 8, Messenger Bot etc.) with Haitian Creole, as well as using our extensive API set to add such support to other software and web sites at no cost.
5) What was involved in getting this out of the door in record time?
The process involved identifying parallel (translated) data between English and Haitian Creole, and training the MT engine to create the requisite language models. We would also like to acknowledge the great work being done the Crisis Commons folks, the dictionary builders at haitisurf.com, the folks at CMU that made available parallel data and the Microsoft volunteers who challenged our team to action.
6) What should I expect in terms of quality?
This is an experimental system put together in record time. While our typical approach to adding new languages involves significantly larger amounts of training, a higher threshold for quality testing – we decided that the upside warranted making the system available to the community at the earliest, and continue improving it subsequently. We are working diligently to keep improving the quality, but bear with us if you encounter problems. You can always contact us at firstname.lastname@example.org with feedback. Our user and developer forums are also available to discuss any issues you encounter.
7) How can I help improve the system?
The best way you can help improve the system is by helping us find more training data. This is typically sentences or words translated between English and Haitian Creole. We intend to make available to the larger community (via tausdata.org) data that we collect (as license restrictions permit) for training purposes. If you know of dictionaries, translated sentences, or websites that have such translations we urge you to contribute it to TDA’s TAUS data sharing initiative. TDA is a non-profit organization providing a neutral and secure platform for sharing language data. If you have any concerns or questions feel free to contact us at email@example.com.
8) How can I help the broader Haiti relief efforts?
Go here to learn more about how you can help those devastated by the earthquake.
9) Where can I get more information?
Please stay tuned to our blog for further announcements. You can learn more about Microsoft Translator and the services we offer here.
10) What can we expect next?
In the coming days expect to see support for Haitian Creole added to even more of our scenarios (Translation Bot, Translator widget, Office etc) as well as the AJAX API. Known issues and announcements can also be found on our forums.
We hope that this contribution proves useful to the various humanitarian efforts underway, and please stay tuned to this blog for further news on the Haitian Creole language support. If you have any questions feel free to contact us at firstname.lastname@example.org.
Update (2:53 PM PST): The Messenger Translation Bot can now speak Haitian Creole. Add email@example.com to your messenger buddy list. Try the group conversation feature with a Kreyol speaker!
-Vikram Dendi, Senior Product Manager, Microsoft Translator
Hallo aus Berlin!
Thanks to the great feedback from the early adopters of the Microsoft Translator widget and APIs, we are pleased to remove the invite requirement and move the widget and APIs to public beta. Anyone can now generate a snippet for their site or application from the widget and AJAX API adoption portals.
Thank you for all those who attended today’s session at TechEd Europe. Here is a recap:
We also greatly appreciate all the great feedback on what languages you would like to see, and we hope to satisfy many of the requests within the next few months. Stay tuned and keep the feedback coming!
In my last update I had asked about what languages you wanted Microsoft Translator service to support. Thank you for taking the time to respond. We are pleased to announce that last week we added Czech (CSY), Danish (DAN), Greek (ELL), Swedish (SVE) and Thai (THA), taking our language count to a nice round 20.
Here is the complete list as of today:
You will be able to translate between these languages in all Microsoft Translator powered services including Bing Translator, Internet Explorer Accelerator, Office, Widget as well as in our APIs. Feel free to send in your feedback on the new languages via the forum. We do keenly follow your recommendations and requests as we prioritize new languages – so please do keep them coming in the comments section!
I am pleased to announce that we just added Hebrew to the list of languages that we support. You can immediately use it in Bing Translator, in IE8, with the widget, with the messenger bot, inside Office and of course with the API.
I would like to congratulate our language quality and coverage team on the progress they have been making with new languages. Over the next few months you will see more languages added to the mix, and also continue to see quality improvements for existing languages. Feel free to leave a comment on this thread about any languages you would particularly like to see.
We have also had many of you contacting us about helping find data sources that can be useful to train the machine translation system on – we appreciate your help! Our email address is firstname.lastname@example.org. Do stay in touch!
Use Bing to instantly translate queries from one language to another with our translation Instant Answer! Starting today, when you are looking for a translation of a word or phrase, go to Bing.com and kick off an instant translation, powered by Microsoft Translator. Instant translation is another way that Bing helps you complete tasks faster by presenting better organized and more relevant content.
What to expect?
Example query: translate I love you
Example query: translate I love you to Japanese
Example query: how do you say apple juice in Spanish