<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/atom.xsl" media="screen"?><feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-US"><title type="html">Chris Wendt</title><subtitle type="html">News and info about Machine Translation at Microsoft</subtitle><id>http://blogs.msdn.com/christw/atom.xml</id><link rel="alternate" type="text/html" href="http://blogs.msdn.com/christw/default.aspx" /><link rel="self" type="application/atom+xml" href="http://blogs.msdn.com/christw/atom.xml" /><generator uri="http://communityserver.org" version="2.1.61025.2">Community Server</generator><updated>2007-02-18T23:30:00Z</updated><entry><title>Optimizing a statistical MT system using shared data</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/christw/archive/2009/09/05/optimizing-a-statistical-mt-system-using-shared-data.aspx" /><id>http://blogs.msdn.com/christw/archive/2009/09/05/optimizing-a-statistical-mt-system-using-shared-data.aspx</id><published>2009-09-05T17:28:00Z</published><updated>2009-09-05T17:28:00Z</updated><content type="html">A very substantial feature of statistical machine translation (SMT) systems is the ability to train the system on different bodies of text, and this way customizing it and optimizing it for a certain style and vocabulary. In MT land we call documents of related style and terminology a "domain". The most valuable training data for an SMT system is what we call "parallel data": the same document in two languages. The format that makes parallel data immediately useful is a translation memory (TM) file...(&lt;a href="http://blogs.msdn.com/christw/archive/2009/09/05/optimizing-a-statistical-mt-system-using-shared-data.aspx"&gt;read more&lt;/a&gt;)&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9891792" width="1" height="1"&gt;</content><author><name>christw</name><uri>http://blogs.msdn.com/members/christw.aspx</uri></author><category term="Machine Translation" scheme="http://blogs.msdn.com/christw/archive/tags/Machine+Translation/default.aspx" /><category term="SMT" scheme="http://blogs.msdn.com/christw/archive/tags/SMT/default.aspx" /><category term="MT" scheme="http://blogs.msdn.com/christw/archive/tags/MT/default.aspx" /><category term="Translation Memory" scheme="http://blogs.msdn.com/christw/archive/tags/Translation+Memory/default.aspx" /><category term="TAUS Data Association" scheme="http://blogs.msdn.com/christw/archive/tags/TAUS+Data+Association/default.aspx" /></entry><entry><title>Notes from the MT Summit in Ottawa</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/christw/archive/2009/09/05/notes-from-the-mt-summit-in-ottawa.aspx" /><id>http://blogs.msdn.com/christw/archive/2009/09/05/notes-from-the-mt-summit-in-ottawa.aspx</id><published>2009-09-05T16:41:00Z</published><updated>2009-09-05T16:41:00Z</updated><content type="html">Last week I attended the MT Summit in Ottawa ( http://summitxii.amtaweb.org/),mostly in the commercial user track. Compared to the previous two AMTA conferences I attended, it was striking to see the attention on MT-aided translation and translator productivity, in all 3 tracks at the conference (research, commercial users, and government users). What was also striking is that nobody has actual solid data on post-editing productivity in statistically significant numbers. Memorable papers related...(&lt;a href="http://blogs.msdn.com/christw/archive/2009/09/05/notes-from-the-mt-summit-in-ottawa.aspx"&gt;read more&lt;/a&gt;)&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9891777" width="1" height="1"&gt;</content><author><name>christw</name><uri>http://blogs.msdn.com/members/christw.aspx</uri></author><category term="Machine Translation" scheme="http://blogs.msdn.com/christw/archive/tags/Machine+Translation/default.aspx" /><category term="conferences" scheme="http://blogs.msdn.com/christw/archive/tags/conferences/default.aspx" /></entry><entry><title>Microsoft Knowledge Base</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/christw/archive/2007/02/18/microsoft-knowledge-base.aspx" /><id>http://blogs.msdn.com/christw/archive/2007/02/18/microsoft-knowledge-base.aspx</id><published>2007-02-19T10:54:00Z</published><updated>2007-02-19T10:54:00Z</updated><content type="html">&lt;P&gt;My team is responsible for the MT engine that creates the foreign language translations of knowledge base (KB) articles on &lt;A href="http://support.microsoft.com/" mce_href="http://support.microsoft.com/"&gt;http://support.microsoft.com&lt;/A&gt;. Each KB article has at&amp;nbsp;the bottom a feeedback field allowing the reader to rate the article. I pay close attention to this feedback to determine the usefulness of the MT engine for a task like understanding a technical document.&lt;/P&gt;
&lt;P&gt;Feel free to let me know right here of any significant issues or recurring problems you see with the KB translation.&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=1712817" width="1" height="1"&gt;</content><author><name>christw</name><uri>http://blogs.msdn.com/members/christw.aspx</uri></author><category term="KB" scheme="http://blogs.msdn.com/christw/archive/tags/KB/default.aspx" /></entry><entry><title>Citation in the French press</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/christw/archive/2007/02/18/citation-in-the-french-press.aspx" /><id>http://blogs.msdn.com/christw/archive/2007/02/18/citation-in-the-french-press.aspx</id><published>2007-02-19T10:30:00Z</published><updated>2007-02-19T10:30:00Z</updated><content type="html">&lt;P class=MsoPlainText style="MARGIN: 0in 0in 0pt"&gt;I am cited in last weekend's issue of the French newspaper “Libération”: &lt;A href="http://www.liberation.fr/transversales/weekend/234277.FR.php" mce_href="http://www.liberation.fr/transversales/weekend/234277.FR.php"&gt;http://www.liberation.fr/transversales/weekend/234277.FR.php&lt;/A&gt; under the title "Babelweb". Pretty positive about MT in general. &lt;/P&gt;
&lt;P class=MsoPlainText style="MARGIN: 0in 0in 0pt" mce_keep="true"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class=MsoPlainText style="MARGIN: 0in 0in 0pt"&gt;Thierry Fontenelle blogged about the article in his post&amp;nbsp; &lt;SPAN style="FONT-SIZE: 10pt; COLOR: navy; FONT-FAMILY: 'Arial','sans-serif'; mso-fareast-font-family: Calibri; mso-fareast-theme-font: minor-latin; mso-ansi-language: EN-US; mso-fareast-language: EN-US; mso-bidi-language: AR-SA"&gt;&lt;A href="http://blogs.msdn.com/correcteurorthographiqueoffice/"&gt;http://blogs.msdn.com/correcteurorthographiqueoffice/&lt;/A&gt;. &lt;/SPAN&gt;&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=1712708" width="1" height="1"&gt;</content><author><name>christw</name><uri>http://blogs.msdn.com/members/christw.aspx</uri></author><category term="Press" scheme="http://blogs.msdn.com/christw/archive/tags/Press/default.aspx" /></entry></feed>