<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>All the Cool Developers use Speech APIs : Speech Recognition</title><link>http://blogs.msdn.com/chuckop/archive/tags/Speech+Recognition/default.aspx</link><description>Tags: Speech Recognition</description><dc:language>en</dc:language><generator>CommunityServer 2.1 SP1 (Build: 61025.2)</generator><item><title>Our Users Are Leading Authorities</title><link>http://blogs.msdn.com/chuckop/archive/2008/08/26/our-users-are-leading-authorities.aspx</link><pubDate>Tue, 26 Aug 2008 08:26:53 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:8896675</guid><dc:creator>Charles Oppermann</dc:creator><slash:comments>1</slash:comments><comments>http://blogs.msdn.com/chuckop/comments/8896675.aspx</comments><wfw:commentRss>http://blogs.msdn.com/chuckop/commentrss.aspx?PostID=8896675</wfw:commentRss><description>&lt;p&gt;Throughout my career at Microsoft, I've eagerly participated in mailing lists, newsgroups, and web forums to engage customers and learn more about their needs and foster direct communication.&lt;/p&gt;  &lt;p&gt;One of the better forums for speech recognition is run by &lt;strong&gt;Professor Itamar Even-Zohar &lt;/strong&gt;of Tel Aviv University, where he teaches Culture Research.&amp;#160; Itamar has been a long time user of speech recognition and vocal in feedback regarding Windows Speech Recognition.&amp;#160; His &lt;a href="http://www.tau.ac.il/~itamarez/sr/" target="_blank"&gt;web site on speech recognition&lt;/a&gt; contains useful information on WSR and speech recognition included in Office XP and Office 2003.&amp;#160; In particular, his &lt;a href="http://tech.groups.yahoo.com/group/ms-speech" target="_blank"&gt;ms-speech forum&lt;/a&gt; is invaluable.&lt;/p&gt;  &lt;p&gt;Recently when David Pogue of the New York Times wrote about the newest version of NaturallySpeaking, Itamar was quick to write David and set him straight on a few matters, including a plug about Windows Speech Recognition Macros!&lt;/p&gt;  &lt;p&gt;David &lt;a href="http://pogue.blogs.nytimes.com/2008/08/22/windows-speech-recognition-does-more/" target="_blank"&gt;wrote of Itamar&lt;/a&gt;, &amp;quot;&lt;strong&gt;&lt;em&gt;Clearly, I&amp;#8217;ve unearthed the world&amp;#8217;s leading authority on speech-recognition foreign-language versions,&lt;/em&gt;&lt;/strong&gt;&amp;quot;&lt;/p&gt;  &lt;p&gt;If you read the links I'm providing, you'll see that Professor Even-Zohar is &lt;strong&gt;not&lt;/strong&gt; enamored of all that we do.&amp;#160; He's critical of several aspects of WSR and while he &amp;quot;gets it&amp;quot; regarding &lt;a href="http://www.microsoft.com/downloads/details.aspx?FamilyID=fad62198-220c-4717-b044-829ae4f7c125" target="_blank"&gt;WSR Macros&lt;/a&gt;, he's quick to point out flaws and features.&lt;/p&gt;  &lt;p&gt;It's users like this that we need more of; people who are highly experienced and unafraid to share their opinions.&amp;#160; The information provided is valuable to me and the rest of the product teams.&amp;#160; On the flip side, we have to be careful regarding users expectations.&amp;#160; Bending our ear doesn't mean you'll get whatever feature you asked for, and within a particular timeframe.&lt;/p&gt;  &lt;p&gt;Oftentimes we'll have more features than time or people available.&amp;#160; We have to be very choosy about where to spend our resources.&amp;#160; Even things that are a number #1 priority sometimes have to take a backseat to a lesser feature because it was one that we could do in the time or resources available.&lt;/p&gt;  &lt;p&gt;Having the feedback from experienced users though help us make the most of the resources we have.&amp;#160; We can prioritize better and have confidence that what we're doing will have the greatest impact.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;To everyone who writes us at &lt;/strong&gt;&lt;a href="mailto:listen@microsoft.com" target="_blank"&gt;&lt;strong&gt;listen&lt;/strong&gt;&lt;/a&gt;&lt;strong&gt;, &lt;/strong&gt;&lt;a href="mailto:speak@microsoft.com" target="_blank"&gt;&lt;strong&gt;speak&lt;/strong&gt;&lt;/a&gt;&lt;strong&gt; and &lt;/strong&gt;&lt;a href="mailto:sapitech@microsoft.com" target="_blank"&gt;&lt;strong&gt;sapitech&lt;/strong&gt;&lt;/a&gt;&lt;strong&gt; - we thank you and keep the feedback rolling!&lt;/strong&gt;&lt;/p&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=8896675" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/chuckop/archive/tags/Microsoft/default.aspx">Microsoft</category><category domain="http://blogs.msdn.com/chuckop/archive/tags/Windows+Speech+Recognition/default.aspx">Windows Speech Recognition</category><category domain="http://blogs.msdn.com/chuckop/archive/tags/Speech+Recognition/default.aspx">Speech Recognition</category><category domain="http://blogs.msdn.com/chuckop/archive/tags/Users/default.aspx">Users</category></item><item><title>Speech Content in the Windows SDK</title><link>http://blogs.msdn.com/chuckop/archive/2008/02/26/speech-content-in-the-windows-sdk.aspx</link><pubDate>Tue, 26 Feb 2008 09:46:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:7901471</guid><dc:creator>Charles Oppermann</dc:creator><slash:comments>9</slash:comments><comments>http://blogs.msdn.com/chuckop/comments/7901471.aspx</comments><wfw:commentRss>http://blogs.msdn.com/chuckop/commentrss.aspx?PostID=7901471</wfw:commentRss><description>&lt;P mce_keep="true"&gt;I'm happy to announce the availability of the RTM release of the Windows SDK.&amp;nbsp; This release - the first RTM one since Vista - contains the following speech-related items:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;I&gt;Updated&lt;/I&gt;: SAPI 5.3 documentation&lt;/LI&gt;
&lt;LI&gt;&lt;I&gt;Updated&lt;/I&gt;: System.Speech documentation&lt;/LI&gt;
&lt;LI&gt;&lt;I&gt;Updated&lt;/I&gt;: Sample source code&lt;/LI&gt;
&lt;UL&gt;
&lt;LI&gt;8 C++ projects&lt;/LI&gt;
&lt;LI&gt;3 C# projects&lt;/LI&gt;
&lt;LI&gt;2 sample engines - TTS and SR&lt;/LI&gt;&lt;/UL&gt;
&lt;LI&gt;&lt;I&gt;New&lt;/I&gt;: Grammar Compiler (GC.EXE) tool now part of the tool binaries included in the SDK&lt;/LI&gt;&lt;/UL&gt;
&lt;P mce_keep="true"&gt;The Windows SDK completely replaces the older SAPI 5.1 SDK and supports development on Windows XP, Windows Server 2003, Windows Vista, and Windows Server 2008.&lt;/P&gt;
&lt;P mce_keep="true"&gt;Customers can download this SDK as a DVD image (1,330MB ISO file) from this location:&lt;BR&gt;&lt;A href="http://www.microsoft.com/downloads/details.aspx?FamilyId=F26B1AA4-741A-433A-9BE5-FA919850BDBF"&gt;http://www.microsoft.com/downloads/details.aspx?FamilyId=F26B1AA4-741A-433A-9BE5-FA919850BDBF&lt;/A&gt;&lt;/P&gt;
&lt;P mce_keep="true"&gt;Or go through a guided setup process where only the components they need are downloaded.&amp;nbsp; Speech is part of the base install.&lt;BR&gt;&lt;A href="http://www.microsoft.com/downloads/details.aspx?FamilyId=E6E1C3DF-A74F-4207-8586-711EBE331CDC"&gt;http://www.microsoft.com/downloads/details.aspx?FamilyId=E6E1C3DF-A74F-4207-8586-711EBE331CDC&lt;/A&gt;&lt;/P&gt;
&lt;P mce_keep="true"&gt;&amp;nbsp;I'm particularly interested in &lt;A class="" title="Email Charles Oppermann" href="mailto:chuckop@microsoft.com" mce_href="mailto:chuckop@microsoft.com"&gt;your feedback&lt;/A&gt; regarding the Windows SDK as a whole and in particular getting speech information.&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=7901471" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/chuckop/archive/tags/Speech+-+APIs/default.aspx">Speech - APIs</category><category domain="http://blogs.msdn.com/chuckop/archive/tags/Text+to+Speech/default.aspx">Text to Speech</category><category domain="http://blogs.msdn.com/chuckop/archive/tags/Speech+Recognition/default.aspx">Speech Recognition</category><category domain="http://blogs.msdn.com/chuckop/archive/tags/Speech/default.aspx">Speech</category><category domain="http://blogs.msdn.com/chuckop/archive/tags/SDK/default.aspx">SDK</category><category domain="http://blogs.msdn.com/chuckop/archive/tags/SAPI+5.3/default.aspx">SAPI 5.3</category></item><item><title>InterSpeech 2006</title><link>http://blogs.msdn.com/chuckop/archive/2006/09/22/InterSpeech-2006.aspx</link><pubDate>Fri, 22 Sep 2006 08:45:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:769866</guid><dc:creator>Charles Oppermann</dc:creator><slash:comments>0</slash:comments><comments>http://blogs.msdn.com/chuckop/comments/769866.aspx</comments><wfw:commentRss>http://blogs.msdn.com/chuckop/commentrss.aspx?PostID=769866</wfw:commentRss><description>&lt;P&gt;[This posting originally appeared on my &lt;A href="http://chuckop.spaces.live.com/" target=_blank mce_href="http://chuckop.spaces.live.com/"&gt;personal blog&lt;/A&gt;.&amp;nbsp; I'm copying all my speech related blogging to this new MSDN hosted blog.&amp;nbsp; I'll be doing an introduction post soon.]&lt;/P&gt;
&lt;P&gt;I'm in Pittsburgh this week, attending the &lt;A href="http://www.interspeech2006.org/" mce_href="http://www.interspeech2006.org/"&gt;InterSpeech 2006&lt;/A&gt; conference.&amp;nbsp; Actually, I shouldn't say I'm attending it; I'm just staffing the Microsoft booth, giving demonstrations of &lt;STRONG&gt;&lt;A href="http://www.microsoft.com/windowsvista/features/foreveryone/speech.mspx" target=_blank mce_href="http://www.microsoft.com/windowsvista/features/foreveryone/speech.mspx"&gt;Windows Speech Recognition&lt;/A&gt;&lt;/STRONG&gt;.&amp;nbsp; This is an academic conference,&amp;nbsp;mainly for speech scientists and researchers to present their published papers.&amp;nbsp; For example, one of the poster sessions is entitled &lt;EM&gt;"A Novel Framework of Text-Independent Speaker Verification Based on Utterance Transform and Iterative Cohort Modeling"&lt;/EM&gt; which has Microsoft's own Zhengyou Zhang as one of the authors.&amp;nbsp; The poster sessions which remind me of some early science fair projects because it's posted on a wall, with the research data and conclusions neatly shown.&lt;/P&gt;
&lt;P&gt;Since &lt;A href="http://research.microsoft.com/srg" mce_href="http://research.microsoft.com/srg"&gt;&lt;STRONG&gt;Microsoft Research&lt;/STRONG&gt;&lt;/A&gt; is one of the sponsors, they get a booth in which to demonstrate technology and products.&amp;nbsp; A week ago, the Speech Research Group asked my group, Speech Components, if one of the program managers could come out and give demonstrations.&amp;nbsp; I volunteered.&amp;nbsp; The demos went well, and for the most part were trouble-free.&amp;nbsp; I choose to use the Release Candidate 1 of Windows Vista for the demo machines, because I didn't want to risk problems with an unknown, random build.&amp;nbsp; There was a small issue with the audio gain on the microphone that would set the gain at the maximum after the computer resumed from standby, or the USB headset unplugged and plugged back in.&amp;nbsp; The gain is supposed to be set at 15, so when it went to a 100, recognition accuracy would plummet, but not too badly.&lt;/P&gt;
&lt;P&gt;Usually, it was difficult at times to show the correction dialog, used when some phrase was dictated incorrectly.&amp;nbsp; Even when there were hundreds of people milling about the vendor booths, and the ambient noise level very high, the system did very well.&lt;/P&gt;
&lt;P&gt;The most often comment was similar to "this is a amazing".&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=769866" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/chuckop/archive/tags/Travel/default.aspx">Travel</category><category domain="http://blogs.msdn.com/chuckop/archive/tags/Microsoft/default.aspx">Microsoft</category><category domain="http://blogs.msdn.com/chuckop/archive/tags/Speech+Recognition/default.aspx">Speech Recognition</category><category domain="http://blogs.msdn.com/chuckop/archive/tags/Research/default.aspx">Research</category><category domain="http://blogs.msdn.com/chuckop/archive/tags/Conferences/default.aspx">Conferences</category></item></channel></rss>