Jay's blog on text-to-speech (Now Defunct)

What's up with text-to-speech (TTS) and Microsoft? Heck, what's up with TTS in general these days? Speech, language, and technology. Cool stuff, indeed.

The Blizzard Challenge: Rate text-to-speech samples

What's the Blizzard Challenge? In a shell's nut, it's a competition that provides researchers and speech labs with the same set of originally recorded waves (in this case, 5 hours of a male speaker), and then challenges teams to create the "best" (i.e., most natural and intelligible) TTS voice. If it were an architecture challenge, it would be like giving the same set of building materials to many different architects and asking each of them to come up with the most functional and beautiful building.

And now rating the results is open to the public here:

http://www.speech.cs.cmu.edu/blizzard2006/register-R.html

It took me about 20 minutes to complete the study. There are five tasks: the first three involve scoring waves, and the last two involve transcription. The results are probably a good indication of how good a TTS system you can get with 5 hours of recordings. I can't tell who all of the participating groups are at this point. Festvox has a little more information on its site.

 

Published Friday, June 30, 2006 5:43 PM by jaywaltm

About jaywaltm

Jay is a Program Manager in the Speech and Natural Language Group at Microsoft. He has a Ph.D. in Linguistics from the University of Washington.
