Welcome to MSDN Blogs
Sign in
|
Join
|
Help
Search
Rob's Response Point
Home
Email
RSS 2.0
Atom 1.0
Recent Posts
Free Webinar on Response Point & Related Stuff
RP Extension Ninja Skills
Quick reference for making phone calls
Forwarding Voicemail to Email
Voice Mail Capacity on Response Point
Tags
Home Server
Response Point
News
Archives
November 2008 (5)
October 2008 (3)
July 2008 (1)
May 2007 (1)
April 2007 (2)
March 2007 (2)
August 2006 (1)
June 2006 (6)
December 2005 (1)
November 2005 (1)
September 2005 (5)
August 2005 (6)
July 2005 (6)
June 2005 (5)
May 2005 (5)
April 2005 (1)
March 2005 (5)
February 2005 (7)
Speech interview on Channel 9
Channel 9
just posted an
interview with me
.
Some links to stuff I talk about:
The new speech API I
posted about a few days ago
The app I demo when I dial 0 is running on
Speech Server
, and the
case study I mention is here
.
You may also be interested in some of our research web pages, since I mention them at one stage in the interview:
Synthesis research
;
Speech research in Redmond
;
Speech research in Asia
.
I also mentioned the
Voice Command
app for Windows Mobile.
Posted:
Sunday, May 29, 2005 12:21 AM by
RobertBrown
Comments
Jonathan Tregear
said:
Sorry about the late comment to this story. I've been falling behind on my blog reading lately.
Late in your interview with Robert Scoble, he asked you about the possibility of using speech rocognition to produce transcripts of his interviews for example. Your answer was that the results he would get would not be very good unless the the speech engine was trained to each of the spearkers voices.
I've heard about ASR engines produced by companies like Autonomy/Virage that claim to be able to do a decent job of speaker independent and unconstrained domain voice recognition for similar uses like indexing and and searching newscasts etc. Do you have experience with or an opinion about how good those engines are?
Related to this is another question I've been wondering about: Suppose you pointed an engine at a video like the interview example above, but instead of using it to produce a transcript of the interview you were only interested in finding instances of a well defined list of keywords. This would be useful in indexing and searching libraries of audio content also. Would that be an easier problem to solve for speaker independent (i.e. untrained) speech recognition?
Thanks if you find this and have the time to respond.
#
June 24, 2005 1:55 AM
New Comments to this post are disabled