Speech Recognition HOWTO

Stephen Cook

scook@gear21.com

Revision History
Revision v2.0 April 19, 2002 Revised by: scc
Changed license information (now GFDL) and added a new publication.
Revision v1.2 February 5, 2002 Revised by: scc
Added more commercial software listings (sent by Mayur Patel).
Revision v1.1 October 5, 2001 Revised by: scc
Added info for Vocalis Speechware. Fixed/Updated various other items.
Revision v1.0 November 20, 2000 Revised by: scc
Added info on L&H and HTK.
Revision v0.5 September 13, 2000 Revised by: scc
Initial HOWTO submission.

Automatic Speech Recognition (ASR) on Linux is becoming easier. Several packages are available for users as well as developers. This document explains the basics of speech recognition and surveys some of the available software.


Table of Contents
1. Legal Notices
1.1. Copyright/License
1.2. Disclaimer
1.3. Trademarks
2. Foreword
2.1. About This Document
2.2. Acknowledgements
2.3. Comments/Updates/Feedback
2.4. ToDo
2.5. Revision History
3. Introduction
3.1. Speech Recognition Basics
3.2. Types of Speech Recognition
3.3. Uses and Applications
4. Hardware
4.1. Sound Cards
4.2. Microphones
4.3. Computers/Processors
5. Speech Recognition Software
5.1. Free Software
5.1.1. XVoice
5.1.2. CVoiceControl/kVoiceControl
5.1.3. Open Mind Speech
5.1.4. GVoice
5.1.5. ISIP
5.1.6. CMU Sphinx
5.1.7. Ears
5.1.8. NICO ANN Toolkit
5.1.9. Myers' Hidden Markov Model Software
5.1.10. Jialong He's Speech Recognition Research Tool
5.1.11. More Free Software?
5.2. Commercial Software
5.2.1. IBM ViaVoice
5.2.2. Vocalis Speechware
5.2.3. Babel Technologies
5.2.4. SpeechWorks
5.2.5. Nuance
5.2.6. Abbot/AbbotDemo
5.2.7. Entropic
5.2.8. More Commercial Products
6. Inside Speech Recognition
6.1. How Recognizers Work
6.2. Digital Audio Basics
7. Publications
7.1. Books
7.2. Internet