Open Access Open Access  Restricted Access Subscription or Fee Access

Spectral Processing Methods for Degraded Speech Signal

U. Purushotham, Dr.K. Suresh

Abstract


Assessment of clean speech from a noisy speech signal has been a research topic for a long time. This research finds its variety of applications, which includes the present mobile communication also. The most important outcome of this research is the improved quality and reduced listening effort in the presence of an interfering noise signal. In this paper the performance of various noise reduction techniques namely spectral subtraction, wavelet transforms, iterative subtraction, MMSE and Wiener filtering is done. This paper proposes a time-frequency estimator for enhancement of noisy speech signals in the discrete frequency transform domain. In the proposed method the estimation is based on modeling and filtering frequency components of noisy speech signal using Kalman filters. Experimental outcome show that the proposed method provides the better performance as compared to the other Spectral processing approaches.


Keywords


MMSE, Wiener, Autoregressive and Kalman Filter

Full Text:

PDF

References


S.F.Boll, “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoust., Speech, Signal Process., Vol 27,pp. 113-120,1979.

P. Lockwood and J. Boudy, “Experiments with a nonlinear Spectral subtractor (NSS), hidden markov models and the projection, for robust speech recognition in cars,” Speech Communication, Vol. 11, Nos. 2-3, pp. 215-228, 1992.

I. Soon, S. Koh and C. Yeo, “Selective magnitude subtraction for speech enhancement,” Proceedings. The Fourth International Conference/Exhibition on High Performance Computing in the Asia-Pacific Region, vol.2, pp. 692-695, 2000.

K. Wu and P. Chen, “Efficient speech enhancement using spectral subtraction for car hands-free application,” International Conference on Consumer Electronics, vol. 2, pp. 220-221, 2001.

C. He and G. Zweig, “Adaptive two-band spectral Subtraction with multi-window spectral estimation,” ICASSP, vol.2, pp. 793-796, 1999.

O. Cappe, “Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor,” IEEE Trans. Speech Audio Process., Vol. 2, pp. 345–349, 1994.

Z. Goh, K.-C. Tan, and B. Tan, “Post processing method for suppressing musical noise generated by spectral subtraction,” IEEE Trans. Speech Audio Process., vol. 6, no. 3, pp. 287–292, May 1998.

K. Yamashita, S. Ogata, and T. Shimamura, “Spectral subtraction iterated with weighting factors,” in Proc. IEEE Speech Coding Workshop, 2002, pp. 138–140.

K. Yamashita, S. Ogata, and T. Shimamura, “Improved spectral subtraction utilizing iterative processing,” (in Japanese) IEICE Trans. A, vol. J88-A, no. 11, pp. 1246–1257, 2005.

M. R. Khan and T. Hasan, “Iterative noise power subtraction technique for improved speech quality,” in Proc. Int. Conf. Elect. Comput. Eng. (ICECE2008), 2008, pp. 391–394.

X. Li, G. Li, and X. Li, “Improved voice activity detection based on iterative spectral subtraction and double thresholds for CVR,” in Proc. 2008 Workshop Power Electron. Intell. Transport. Syst., 2008, pp. 153–156.

T. Fukumori, M. Morise, T. Nishiura, and H. Nanjo, “Musical tone reduction on iterative spectral subtraction based on optimum flooring parameters,” IEICE Tech. Rep., Vol. 110, pp. 43–48, 2010.

S. Li, J.-Q.Wang, M. Niu, X.-J. Jing, and T. Liu, “Iterative spectral subtraction method for millimeter-wave conducted speech enhancement,” J. Biomed. Sci. Eng., vol. 2010, no. 3, pp. 187–192, 2010.

M. Berouti, R. Schwartz and J. Makhoul, “Enhancement of speech corrupted by acoustic noise,” Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., pp. 208-211, 1979.

N. Virag, “Single channel speech enhancement based on masking properties of the human auditory system,” IEEE Trans. Speech and Audio Processing., Vol 7, pp 126- 137, 1999.

Ephraim.Y and Malah.D, “Speech enhancement using a minimum mean square error short-time spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Processing, Vol.32, pp. 1109–1121, 1984.

Ephraim.Y and Malah.D, “Speech enhancement using a minimum mean square error log-spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Processing, Vol. 32, pp. 443–5, 1985.

Ephraim.Y, Malah.D, & B. H. Juang, “On the application of hidden Markov models for enhancing noisy speech,” IEEE Trans. Acoust., Speech, Signal Processing, Vol. 37, pp. 1846–1856, 1989.

J.S. Erkelens, R.C. Hendriks, R. Heusdens, and J. Jensen, "Minimum mean-square error estimation of discrete Fourier coefficients with generalized gamma priors", IEEE Trans. on Audio, Speech and Language Proc., Vol. 15, no. 6, pp. 1741 - 1752, August 2007.

Erkelens.J.S, Hendriks.R.C and Heusdens.R "On the Estimation of Complex Speech DFT Coefficients without Assuming Independent Real and Imaginary Parts", IEEE Signal Processing Letters, 2008.

R.C. Hendriks, J.S. Erkelens and R. Heusdens "Comparison of complex-DFT estimators with and without the independence assumption of real and imaginary parts", ICASSP, 2008.

A. Mertins, “Signal Analysis: Wavelets, Filter Banks, Time- Frequency Transforms and Application”. John Wiley & Sons, Chichester, 1999.

D. L. Donoho, “De-noising by soft-thresholding,” IEEE Trans. on inform. Theory, vol. 41, no. 3, pp. 613-627, May 1995.

D. Veselinovic and D. Graupe, “A wavelet transform approach to blind adaptive filtering of speech from unknown noises,” IEEE Trans. on Circuits and Syst.-II, vol. 50, no. 3, pp. 150–154, Mar. 2003.

E. Zwicker and H. Fastl, Psychoacoustics: Facts and Models. Springer-Verlog, New York, 1990.

M. Bahoura and J. Rouat, “Wavelet speech enhancementbased on the Teager energy operator,” IEEE Signal Process.Letter, vol. 8, no. 1, pp. 10−12, 2001.

K. K. Paliwal and A. Basu, “A speech enhancement method based on Kalman filtering,” IEEE Int. Conf. Acoust., Speech, Signal Processing, pp. 177–180, 1987.

J. D. Gibson and S. D. Grey, “Filtering of colored noise for speech enhancement and coding,” IEEE Trans. Signal Processing, vol. 39, pp. 1732–1741, 1991.

IEEE Recommended Practice for Speech Quality Measurements. IEEE Trans. Audio and Electro acoustics, AU-17(3), 225-246, 1969.

“Perceptual evaluation of speech quality and objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codec’s,” ITU-T Recommendation P. 862

Hu, Y. and Loizou, P, “Subjective evaluation and comparison of speech enhancement algorithms,” Speech Communication, 49, 588-601, 2007.

Hu, Y. and Loizou, P, “Evaluation of objective Quality measures for speech enhancement,” IEEE Transactions on Speech and Audio Processing, 16(1), 229-238, 2008.

Ma, J., Hu, Y. and Loizou, P, "Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions", Journal of the Acoustical Society of America, 125(5), 3387-3405,2009.

H. Hirsch, and D. Pearce, “The Aurora Experimental Framework for the Performance Evaluation of Speech Recognition Systems under Noisy Conditions.” ISCA ITRW ASR2000, France, 2000.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.