Open Access Open Access  Restricted Access Subscription or Fee Access

Linear Prediction Residual Error Based Assessment of Dysarthric Speech

M. Gandhimathy, P. Vijayalakshmi

Abstract


Dysarthria is a neuromotor impairment of speech that affects one or more subsystems like laryngeal, velopharyngeal and articulatory. The acoustic characteristics of dysarthric speakers deviate from that of the normal speakers. For the assessment of velopharyngeal dysfunction (hypernasality) inclusion of additional formants and deletion of actual formants are considered as acoustic cues. In this work, a novel approach is proposed for the assessment of dysarthric speech, using linear prediction residual error. Speech signal of dysarthric speakers is passed through inverse filter, designed using the system parameters estimated using LP analysis of a normal speaker, which is the LP residual error, is expected to contain information about the additional resonance frequency introduced due to hypernasality. On the other hand, the reverse process is expected to give the deletion of formants. In order to perform this analysis, an acoustically closer (in terms of formant frequency) normal speaker is to be found. For this comparison, the distance metrics like accumulated residual energy, Itakura-Saito distance and COSH distance measures are used. The LP-based magnitude spectra of the residual error, derived as mentioned above, contain information about the deletion and addition of resonant frequencies. Acoustic analysis is performed on 18 hypernasal speaker‟s speech data collected from AIISH, Mysore and 15 normal speakers‟ speech data collected in the laboratory environment and 10 dysarthric speakers‟ data from Nemours database of dysarthric speech. It is observed that out of 10 dysarthric speakers of Nemours database, four speakers are found to be hypernasal.

Keywords


Dysarthria, Hypernasality, LP Residue, Formant Analysis

Full Text:

PDF

References


Vijayalakshmi P., Nagarajan T. and Reddy M.R., “Assessment of articulatory and velopharyngeal sub-systems of dysarthric speech”, Biomedical soft computing and Human sciences (2009), Vol.14, No.2, pp.87-94.

Vijayalakshmi P., Ramasubba Reddy M., and Douglas O‟Shaughnessy, “Acoustic analysis and detection of hypernasality using a group delay function”, IEEE Trans. on Biomedical Engg. Vol.54, No.4, Apr. 2007, pp 621-629.

Vijayalakshmi P., Nagarajan T. and Jayanthan Ra. V, “Selective pole modification-based technique for the analysis and detection of hypernasality”, IEEE TENCON, 2009,pp.1-5.

Dong Kyun Rah, Young Il Ko, Chulhee Lee, and Deok Won Kim, “A noninvasive estimation of hypernasality using a linear predictive model”, Annals of Biomedical Engineering, Vol. 29, 2001,pp. 587–594.

Douglas A. Cairns, John H. L. Hansen,and John E. Riski, “A noninvasive technique for detecting hypernasal speech using a nonlinear operator”, IEEE Trans. on Biomedical Engg. Vol. 43, No. 1, Jan 1996, pp.35-45.

Pidal X.M, Polikoff J.B., Peters S.M., Leonzio J.E., Bunnell H.T., (1996) “The Nemours database of dysarthric speech”, in the proceedings of ICSLP in Philadelphia,P.A, USA pp.1962-1965.

Rabiner L.R. and R.W.Schafer, “Digital Processing of Speech Signals”, Pearson Education,2004.


Refbacks

  • There are currently no refbacks.