Speech Content in the Windows SDK

Published 26 February 08 01:46 AM | Charles Oppermann 

I'm happy to announce the availability of the RTM release of the Windows SDK.  This release - the first RTM one since Vista - contains the following speech-related items:

  • Updated: SAPI 5.3 documentation
  • Updated: System.Speech documentation
  • Updated: Sample source code
    • 8 C++ projects
    • 3 C# projects
    • 2 sample engines - TTS and SR
  • New: Grammar Compiler (GC.EXE) tool now part of the tool binaries included in the SDK

The Windows SDK completely replaces the older SAPI 5.1 SDK and supports development on Windows XP, Windows Server 2003, Windows Vista, and Windows Server 2008.

Customers can download this SDK as a DVD image (1,330MB ISO file) from this location:
http://www.microsoft.com/downloads/details.aspx?FamilyId=F26B1AA4-741A-433A-9BE5-FA919850BDBF

Or go through a guided setup process where only the components they need are downloaded.  Speech is part of the base install.
http://www.microsoft.com/downloads/details.aspx?FamilyId=E6E1C3DF-A74F-4207-8586-711EBE331CDC

 I'm particularly interested in your feedback regarding the Windows SDK as a whole and in particular getting speech information.

Comments

# ascend slowly, breathing normally said on February 27, 2008 12:41 AM:

Charles Oppermann on the Speech team wrote a post on the new Speech-related content that ships in the

# karl1406 said on February 27, 2008 6:21 AM:

Any reason the examples provided are distributed

C++ (8)

C#  (3)

VB (0)

???

# Microsoft Windows SDK Blog said on February 27, 2008 1:25 PM:

Charles Oppermann on the Speech team wrote a post on the new Speech-related content that ships in the

# Moti said on March 1, 2008 11:06 AM:

I don't know if this is the perfect place to ask this question, but there's a crucial piece of information I can't find anywhere: How do I register a managed TTS engine with SAPI?

All the documentation says is "A TtsEngineSsml must be register[ed] under HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Speech\Voices\Tokens" - but I already know that from plain unmanaged SAPI. What goes into this key instead of a CLSID to identify my .NET class?

# alphaomega said on April 25, 2008 8:54 PM:

Neat! I spent days looking for why "tag-format" was not supported in my grammar (working on speech recognition). Turns out XP only had SAPI 5.1 and 5.3 was only on Vista. I read this post and saw chance to get the new SDK. Spent 7 hours downloading. Installed beautifully, but for some reason the System.Speech hasn't seemed to have updated? Cause apparently its still 'not supported'. Any help would be much appreciated. I'm a newbie to VS2008 and still in the process of migrating from VB6 to VB.NET. But thanks for the links! Very much appreciated!

New Comments to this post are disabled
Page view tracker