What's up with text-to-speech (TTS) and Microsoft? Heck, what's up with TTS in general these days? Speech, language, and technology. Cool stuff, indeed.
What's the Blizzard Challenge? In a shell's nut, it's a competition that gives researchers and speech labs the same set of original recordings (i.e., 5 hours of a male speaker) and then challenges each team to create the "best" (i.e., most natural and intelligible) TTS voice. If it were an architecture challenge, it would be like giving the same set of building materials to many different architects and asking each to come up with the most functional and beautiful building.
And rating the results is now open to the public here:
http://www.speech.cs.cmu.edu/blizzard2006/register-R.html
It took me about 20 minutes to complete the study. There are five tasks: the first three ask you to score waves, and the last two ask for a transcription. The results are probably a good indication of how good a TTS system you can get with 5 hours of recordings. I can't tell who all of the participating groups are at this point. Festvox has a little more information on its site.
About jaywaltm
Jay is a Program Manager in the Speech and Natural Language Group at Microsoft. He has a Ph.D. in Linguistics from the University of Washington.