HISTORY OF SPEECH RECOGNITION

INDEX         ANALOG TO DIGITAL        DIGITAL TO TEXT       SOFTWARE APPLICATIONS        FUTURE         REFERENCES


A toy breaks the ice into speech recognition

"Radio Rex", was the first success story in the field of speech recognition.  It was a toy dog that came in a house, when the name "Rex" was spoken the dog would pop out of his house.  The dog was held within its house by an electromagnet, as current flowed through a circuit bridge, the magnet was energized.  The bridge was sensitive to 500 cps of acoustic energy.  The energy of the vowel sound of the word "Rex" caused the bridge to vibrate, breaking the electrical circuit, and allowing a spring to push Rex out of his house.  Rex was the pioneer into the field of speech recognition.

To War with Mother Russia

The U.S. Department of Defense sponsored the first academic pursuits in speech recognition in the late 1940's.  In an attempt to intercept and decode Russian messages, the U.S. sought the development of an automatic language translator.  The first, and most difficult, step was to solve the problem of creating a program that could recognize speech.  The project was a dismal failure. Phrases were typically mistranslated and included errors such as:

"The spirit is willing but the flesh is weak."

to

"The vodka is strong but the meat is disgusting."

 

Despite the dismal failure, appreciation and interest for the field began to grow. As a result, the government funded the Speech Understanding Research (SUR)  program at Carnegie Mellon University, MIT, and some select commercial institutions.  The agency that funded the research became known as the Defense Advanced Research Project Agency (DARPA).

EARLY KEY ADVANCES

Up to this point there are only three major obstacles standing in the way of commercial use.

  1. Computing Power, lots of power required, but little available

  2. The ability to recognize speech from any person (not just the particular voices the system has been designed around).

  3. A continuity of speech capability (so that the person speaking did not have to break after every word).

The successes from the 50's to the 80's gained more attention and interest, eventually continuous speech became imaginable.

 

PHONEME'S RECOGNIZED AS KEY TO SPEECH

In the 1960's linguistic researchers examine inherent structure of language, results of research lead developers to concentrate speech recognition technology at the level of phonemes, the sound fragments that make up comprehensible words.  By the 1980's programmers were using more powerful hardware to implement statistical phoneme-chain recognition routines.  However, computing power still inhibits speech recognition.

COMMERCE TAKES OVER

Speechworks and Dragon Systems take over as major producers of speech recognition technology.  As these two compete in the field, eventually a point is reached where computation required gets low enough and computation available became high enough for wide spread commercial use.

At the same time, the task difficulty increased coupled with the decrease in error rate made for wide spread use.

 

 


INDEX         ANALOG TO DIGITAL        DIGITAL TO TEXT       SOFTWARE APPLICATIONS        FUTURE         REFERENCES