Welcome to MSDN Blogs Sign in | Join | Help

February 2005 - Posts

TTS navigation

Haha. Sushant, this is a fascinating problem. SAPI has some inherent TTS navigation capabilities, but it looks like you need a little bit more. The problem's certainly soluble (I have one approach in mind). I'll try to post my thoughts later in the week
Posted by RobertBrown | 1 Comments

Text Navigation

Sushant, s orry I missed your comment . I’ve re-posted it on your behalf – hope you don’t mind. Can you post another comment to explain what you mean by text navigation? (e.g. do you mean VCR-style controls for TTS, selection of insertion points for dictation,
Posted by RobertBrown | 3 Comments

Choosing speech API features

James makes a good point that limiting API features to known examples of applications potentially lowers the ceiling on how well an API can adapt to unanticipated needs. I think he's right. But it's only one factor. When multiple new features are competing
Posted by RobertBrown | 5 Comments

Speech platforms

Since a few different speech technologies were mentioned in the comments, I thought it might be useful to summarize the speech platform technology we currently provide. Windows Windows XP, Tablet and 2003 all include: SAPI 5.1: the COM API for use by
Posted by RobertBrown | 10 Comments

Phoneme segmentation

In a recent comment, James Salsman wrote “SAPI 4.0a had phoneme segmentation” and he asks that we put it back into our newer APIs. (You can see more about SAPI 4 here ). It’s been a long time since we made an API with this functionality. I’m curious to
Posted by RobertBrown | 7 Comments

Thoughts on speech API topics

I've been chatting with quite a few people on the team this week about what categories of posts would be interesting and relevant (and inserting "that's a great idea, you should blog it" into each conversation :-). I'm thinking these three broad categories
Posted by RobertBrown | 7 Comments

about me

Like the title says, my name's Robert Brown. I'm a Program Manager at Microsoft, working on speech technologies - in particular, the APIs. I expect this log to be about both program management and speech. We'll see if I predict correctly. What's speech?
Posted by RobertBrown | 2 Comments
 
Page view tracker