Browse by Tags

All Tags » Speech API
New Speech Server video on Channel 9
http://channel9.msdn.com/Showpost.aspx?postid=208891 Albert, Mithun and Dave talk us through some of the great new capabilities of the next version of Speech Server. I worked on the initial version of Speech Server (2004), as well as some of the plumbing Read More...
Updating speech samples for beta 2
When beta 1 shipped, we published this article in MSDN Magazine. We've made some minor tweaks to the API for beta 2, and I figure what better way to illustrate them than to walk through the samples in that article and update them for beta 2. First, the Read More...
MSDN Magazine article on speech in Vista
Okay, so I haven't been completely idle: http://msdn.microsoft.com/msdnmag/issues/06/01/speechinWindowsVista/#void (Thanks to Robert Stumberger, Rob Chambers, and the other folks here at Microsoft who helped put this together). Read More...
Building a speech telephony app
Bosky wrote: "basically I want my speech server to be able to accept a call, quickly port it say to another 'something'. So is this possible? I hear MS offices themselves at some places have replaced the phone operator with speech recognition based Read More...
WinFX speech API changes in PDC CTP
Joseph Kilada wrote "Robert, can you give any hints as to whether the WinFX Speech APIs have changed at all in build 5219 (or whatever the PDC build is) compared to Beta 1?" Sometime soon I'll post updates to all the samples I've put in this blog, to Read More...
Introducing SAPI 5.3
Vista (a.k.a. Longhorn) has a new version of SAPI: 5.3. SAPI 5.3 is an incremental update to SAPI 5.1. The core mission and architecture are unchanged across all 5.x releases. Among a variety of tweaks, SAPI 5.3 has these overall improvements: Support Read More...
RSS reader using SAPI
Michael Carbenay's built an RSS client on the .NET runtime 1.1, using COM interop to have SAPI read the feed to you: http://michael.carbenay.info/post-41.htm Read More...
Speech chess
Valentin Iliescu's written a WinFX chess application that you can play by speaking to it. Very nice. Thanks Valentin! http://www.valil.com/winfx/ Read More...
Follow-up to speech processing of audio files
I was asked some questions about my Speech processing of audio files post. Jonathan Tregear asks "Is there a way to get SAPI to convert and recognize any of the WMA audio formats or other compressed formats beyond the Windows OS standard formats?" No, Read More...
Using SAPI with C#
Want to use SAPI from C#? Take a look at Matt Harrington's article about using SAPI from C# to do speech synthesis. I've done a lot of talking about the upcoming managed speech API in WinFX. But I also get a lot of messages from people who are using Read More...
Which SDK gives me access to the System.Speech namespace?
Josh posted: "I installed the Speech SDK v 5.1 but don't have access to the System.Speech.Synthesis namespace. Do I need to add a reference to it?" You'll need the WinFX SDK for Avalon & Indigo beta 1 RC 1. Here are some instructions for how to get Read More...
Speech processing of audio files
There are a couple of FAQs I often hear about using speech with audio files: How do I recognize speech that's been recorded to an audio file? How do I write synthesized speech to an audio file? This can be done with SAPI, but judging by the frequency Read More...
Speech recognition of interviews & videos
Jonathan Tregear recently posted some comments/questions on speech recognition of interviews (in response to a brief discussion I had with Scoble in my Channel 9 interview a couple of months back). I looped in Frank Seide from Microsoft Research Asia Read More...
Another speech blogger
Chris Schindler from the Speech Server team started his blog yesterday! Today's speech API tidbit: the synthesizer API can also splice audio files into the output stream. Try this: Dim pb As New PromptBuilder pb.AppendAudio( "c:\windows\media\notify.wav" Read More...
XML from previous speech samples
The last few samples I've posted have used APIs to construct the input to the speech engines. Under the covers, you can conceptualize these as creating XML files (or at least, they correspond to XML files and can be serialized as such). The PromptBuilder Read More...
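To make the "corresponds to XML" idea concrete, here is a rough sketch of the kind of document a prompt with spliced-in audio could serialize to. It is not the System.Speech API: it is plain Python using the standard library, and it assumes the XML in question is W3C SSML, with a `speak` root and an `audio` element. The function name `build_prompt_xml` and the sample sentence are made up for illustration; the audio path is the one from the tidbit above.

```python
import xml.etree.ElementTree as ET

# Namespace of the W3C Speech Synthesis Markup Language (SSML) 1.0.
SSML_NS = "http://www.w3.org/2001/10/synthesis"


def build_prompt_xml(text, audio_path):
    """Return an SSML string that speaks `text`, then plays `audio_path`.

    Hypothetical helper: it mimics what a prompt-building API might
    serialize to, and is not the actual PromptBuilder output.
    """
    speak = ET.Element(
        "speak",
        {"version": "1.0", "xmlns": SSML_NS, "xml:lang": "en-US"},
    )
    speak.text = text  # text to be synthesized
    ET.SubElement(speak, "audio", {"src": audio_path})  # spliced-in audio file
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_prompt_xml("You have new mail.", r"c:\windows\media\notify.wav"))
```

The point is only that text and audio end up as sibling pieces of one XML tree, so a prompt built through an API and a hand-written XML file can describe the same output.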