On Voice Activated Information Retrieval System

B. Indhuja; S. Mala Devi; Dr. S. Murugavalli

On Voice Activated Information Retrieval System

B. Indhuja, S. Mala Devi, Dr. S. Murugavalli

Abstract

Speech Recognition (SR) is a process that transcribes speech into text using a computer. Speech recognition system is a speech-to-text conversion wherein the output of the system displays text corresponding to the recognized speech. A step towards a more natural, “human-like” communication between machines and users in need of information is represented by the introduction of speech language technologies into Information Retrieval system. The integration of the Information Retrieval system (IR) and the Automatic speech recognition (ASR) system degrades the performance. The failure to recognize a keyword in continuous speech may drastically affect the performance of the IR system. The proposal is to build an ASR system that is trained to identify the word we pronounce in restricted domain. Then for each of the recognized word, the system is expected to find the match which is then extracted by the IR system.

Keywords

Automatic Speech Recognition (ASR), Information Retrieval (IR), Word Error Rate (WER)

Full Text:

PDF

References

Paolo Rosso, Llu´ıs-F. Hurtado and Encarna Segarra,Emilio Sanchis,” On the Voice-Activated Question Answering”, IEEE Transactions on Systems, Man, and Cybernetics—part c: applications and reviews, vol. 42, no. 1,January 2012 75.

L. Bahl, F. Jelinek, and R. Mercer, “A maximum likelihood approach to continuous speech recognition,” IEEE J. Pattern Anal. Mach. Intell., vol. PAMI-5, no. 2, pp. 179–190, Mar. 1983.

Wiqas Ghai and Navdeep Singh, “Literature Review on Automatic Speech Recognition,” International Journal of Computer Applications (0975 – 8887) Volume 41– No.8, March 2012.

Y. Wang, D. Yu, Y. Ju and A. Acero, ‘‘An introduction to voice search,’’ IEEE Signal Process. Mag., vol. 25, no. 3, pp. 28–38, May 2008.

Niladri Sekhar Dey,Ramakanta Mohanty, K.L.chugh,”Speech and Speaker Recognition System using Artificial Neural Networks and Hidden Markov Model”,2012 International Conference on Communication Systems and Network Technologies.

Santosh K.Gaikwad, Bharti W.Gawali and Pravin Yannawar, “A Review on Speech Recognition Technique,” International Journal of Computer Applications | Vol. 10-No.3, November 2010.

Mark Sanderson and W. Bruce Croft, “The History of Information Retrieval Research”

NIST. Speech recognition scoring package (score). [Online]. Available: http://www.nist.gov/speech/tools

S. J. Young, N. H. Russell, and J. H. S. Russell, “Token passing: A simple conceptual model for connected speech recognition systems,” Cambridge University Engineering Dept, UK, Tech. Rep. CUED/F-INFENG/TR38, 1989.

Gerhard Weikum, Gjergji Kas neci, Maya Ramanath, and Fab ian Suchanek, “Database and Information-Retrieval Methods for Knowledge Discovery,” Communications of the ACM | april 2009 | vol. 52 | no. 4.

Chadawan Ittichaichareon, Siwat Suksri and Thaweesak Yingthawornsuk, “Speech Recognition using MFCC,” International Conference on Computer Graphics, Simulation and Modeling (ICGSM'2012) July 28-29, 2012 Pattaya (Thailand)

DOUGLAS O’SHAUGHNESSY, “InteractingWith Computers by Voice: Automatic Speech Recognition and Synthesis,” PROCEEDINGS OF THE IEEE, VOL. 91, NO. 9, SEPTEMBER 2003

F. Jelinek, “Continuous speech recognition by statistical methods,” Proc. IEEE, vol. 64, pp. 532–556, Apr. 1976.

L. Rabiner and B. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.

R. Reddy, “Speech recognition by machine: A review,” Proc. IEEE, vol. 64, pp. 501–531, Apr. 1976.

Richard M. Stern and Nelson Morgan, “Hearing is believing,” IEEE signal processing magazine, November 2012.

Michelle Cutajar, Edward Gatt, Ivan Grech, Owen Casha and Joseph Micallef,” Comparative study of automatic speech recognition techniques”, IET Signal Process., 2013, Vol. 7, Iss. 1, pp. 25–46 25.

Hung-yi Lee, Chia-ping Chen, and Lin-shan Lee, “Integrating Recognition and Retrieval With Relevance Feedback for Spoken Term Detection,” IEEE transactions on Audio, Speech and Language processing, 2012.

Xiaodong He and Li Deng, “Speech-Centric Information Processing: An Optimization-Oriented Approach,” Proceedings of the IEEE | Vol. 101, No. 5, May 2013.

Haizhou Li, Bin Ma, and Kong Aik Lee, “Spoken Language Recognition: From Fundamentals to Practice,” Proceedings of the IEEE | Vol. 101, No. 5, May 2013

Refbacks

There are currently no refbacks.

Username
Password
Remember me