
Context Dependent Speech Recognition Using VITERBI Algorithm

V. Swarna Priya, B. Amutha

Abstract


This research addresses the conversion of English speech into English text through an efficient speaker-independent speech recognition system based on a neural network approach and the Viterbi algorithm. To recognize English words, the system considers all the accents in which the same word may be spoken, so that matching an utterance against the actual word does not cause difficulties. The English alphabet has 26 letters, and it is well known that the pronunciation of a word depends heavily on the background of the speaker. A speech-dependent, speaker-independent technique can therefore be used to recognize English words. An HMM n-to-1 encoder and an HMM 1-to-n decoder convert speech to text with the help of the Viterbi algorithm. Because the same word may have two or more pronunciations, a database is maintained that holds every expected accent for each word. The main focus is to identify the spoken word correctly across its different pronunciations and accents. Compared with other approaches, this one makes identification of the spoken word easier.
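The abstract does not give the model parameters, so the following is only a minimal sketch of how Viterbi decoding picks the most likely hidden-state sequence in an HMM. The phone-like states (`k`, `ae`, `t`), the acoustic symbols (`x`, `y`, `z`), and all probabilities are hypothetical values chosen for illustration; they are not taken from the paper.

```python
import math


def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return (best_path, log_probability) for an observation sequence."""

    def log(p):
        # Work in log space; a zero probability becomes -infinity.
        return math.log(p) if p > 0 else float("-inf")

    # scores[t][s]: best log-probability of any path ending in state s at time t.
    scores = [{s: log(start_p[s]) + log(emit_p[s][observations[0]]) for s in states}]
    back = [{}]  # back[t][s]: predecessor of s on that best path

    for t in range(1, len(observations)):
        scores.append({})
        back.append({})
        for s in states:
            prev, best = max(
                ((p, scores[t - 1][p] + log(trans_p[p][s])) for p in states),
                key=lambda item: item[1],
            )
            scores[t][s] = best + log(emit_p[s][observations[t]])
            back[t][s] = prev

    # Trace back from the best final state.
    last = max(states, key=lambda s: scores[-1][s])
    path = [last]
    for t in range(len(observations) - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path, scores[-1][last]


# Hypothetical left-to-right model for a single word (illustrative values only).
states = ["k", "ae", "t"]
start_p = {"k": 1.0, "ae": 0.0, "t": 0.0}
trans_p = {
    "k": {"k": 0.5, "ae": 0.5, "t": 0.0},
    "ae": {"k": 0.0, "ae": 0.5, "t": 0.5},
    "t": {"k": 0.0, "ae": 0.0, "t": 1.0},
}
emit_p = {
    "k": {"x": 0.8, "y": 0.1, "z": 0.1},
    "ae": {"x": 0.1, "y": 0.8, "z": 0.1},
    "t": {"x": 0.1, "y": 0.1, "z": 0.8},
}

path, logp = viterbi(["x", "y", "y", "z"], states, start_p, trans_p, emit_p)
print(path)  # ['k', 'ae', 'ae', 't']
```

In a setup like the one the abstract describes, each pronunciation variant of a word could plausibly be given its own model of this kind, with the highest-scoring variant selecting the word; that mapping is an assumption here, not a detail the abstract states.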

Keywords


Viterbi Algorithm, HMM, Speech to Text Conversion, English Words


