Welcome to MSDN Blogs Sign in | Join | Help

Writing with speech

Speaking in last Sunday's New York Times, novelist Richard Powers tells how he has used speech recognition on his TabletPC to compose all his writings of recent years.

It's a fine article - you could quote him almost anywhere on the obstacles of the keyboard, on SR speed and accuracy, on getting accustomed to talking to a machine, or on "speakos and mondegreens"... And it says so much about speech interface technology - it would have been inconceivable only 5 years ago.

One factor behind this is the productivity of the TabletPC correction UI:

I speak untethered, without a headset, into the slate’s microphone array. The words appear as fast as I can speak, or they wait out my long pauses. I touch them up with a stylus, scribbling or re-speaking as needed. Whole phrases die and revive, as quickly as I could have hit the backspace.

Stylus vs. Keyboard. Editing is more direct and efficient with the stylus than with mouse and keyboard, given the word-on-word nature of transcript editing. (And assuming Mr Powers is using TabletPC's built-in speech recognition UI - hats off to Rob and team who put it there!)

Having read, years ago, Galatea 2.2, I almost wish this article had gone behind the lines of the human interface and into the SR technology itself - the hum of the decoders, the scatter-charts of the acoustic models, the forced couplings of the words in the language models... If anyone can surface the poetry of a speech recognition engine, it's the writer who brought to life the neural networks of a fictional machine that is taught to listen and (heartbreakingly) to speak, in that book.

Published Tuesday, January 16, 2007 7:45 PM by Stephen Potter
Filed under:

Comments

Tuesday, January 16, 2007 11:46 PM by Tom

# re: Writing with speech

>>it would have been inconceivable only 5 years ago<<

I disagree - I used Dragon NaturallySpeaking in this way five years ago.  It was fast and accurate, even on machines that are slow by today's standards.

So I don't see what the big deal is now.

Wednesday, January 17, 2007 1:34 PM by Stephen Potter

# re: Writing with speech

Hi Tom, yes my statement is a little brash, let me clarify.

If I'd read in a major newspaper in early 2002 a novelist's ode to high productivity over several years with speech recognition, I'd have been very surprised. I don't mean to suggest that many other people haven't been as productive even for longer. But I think that the incremental gains in speed and accuracy over recent years, coupled with the ease of touch-editing with a stylus (helped by broader adoption of Tablet PCs) are only now helping us reach the tipping point where the technology is emerging into the mainstream. And Richard Powers' article signals this gathering momentum.

Stephen

Thursday, January 18, 2007 9:35 PM by Working the Spoken Word

# The novelist's method

IBM engineer and blogger John Tolva contacted Richard Powers in disbelief about the dictation of his

New Comments to this post are disabled
 
Page view tracker