• Volume/Page
  • Keyword
  • DOI
  • Citation
  • Advanced
   
 
 
 

Journal of the Acoustical Society of America

Year Range: 
Search Issue | RSS Feeds RSS
Previous Issue

Jun 2011

Volume 129, Issue 6, pp. EL217-4101

back to top
RSS Feeds

Processing speech signal using auditory-like filterbank provides least uncertainty about articulatory gestures

Prasanta Kumar Ghosh, Louis M. Goldstein, and Shrikanth S. Narayanan

J. Acoust. Soc. Am. Volume 129, Issue 6, pp. 4014-4022 (2011); (9 pages)

Full Text: Read Online (HTML) | Download PDF

Show Abstract
Understanding how the human speech production system is related to the human auditory system has been a perennial subject of inquiry. To investigate the production–perception link, in this paper, a computational analysis has been performed using the articulatory movement data obtained during speech production with concurrently recorded acoustic speech signals from multiple subjects in three different languages: English, Cantonese, and Georgian. The form of articulatory gestures during speech production varies across languages, and this variation is considered to be reflected in the articulatory position and kinematics. The auditory processing of the acoustic speech signal is modeled by a parametric representation of the cochlear filterbank which allows for realizing various candidate filterbank structures by changing the parameter value. Using mathematical communication theory, it is found that the uncertainty about the articulatory gestures in each language is maximally reduced when the acoustic speech signal is represented using the output of a filterbank similar to the empirically established cochlear filterbank in the human auditory system. Possible interpretations of this finding are discussed.
Show PACS
43.72.Ar Speech analysis and analysis techniques; parametric representation of speech

Development of a temporal fundamental frequency coding strategy for cochlear implants

Andrew E. Vandali and Richard J. M. van Hoesel

J. Acoust. Soc. Am. Volume 129, Issue 6, pp. 4023-4036 (2011); (14 pages) | Cited 5 times

Full Text: Read Online (HTML) | Download PDF

Show Abstract
A sound-coding strategy for users of cochlear implants, named enhanced-envelope-encoded tone (eTone), was developed to improve coding of fundamental frequency (F0) in the temporal envelopes of the electrical stimulus signals. It is based on the advanced combinational encoder (ACE) strategy and includes additional processing that explicitly applies F0 modulation to channel envelope signals that contain harmonics of prominent complex tones. Channels that contain only inharmonic signals retain envelopes normally produced by ACE. The strategy incorporates an F0 estimator to determine the frequency of modulation and a harmonic probability estimator to control the amount of modulation enhancement applied to each channel. The F0 estimator was designed to provide an accurate estimate of F0 with minimal processing lag and robustness to the effects of competing noise. Error rates for the F0 estimator and accuracy of the harmonic probability estimator were compared with previous approaches and outcomes demonstrated that the strategy operates effectively across a range of signals and conditions that are relevant to cochlear implant users.
Show PACS
43.72.Ar Speech analysis and analysis techniques; parametric representation of speech
43.66.Ts Auditory prostheses, hearing aids
43.75.Yy Instrumentation and measurement methods for musical acoustics
43.66.Hg Pitch
Close

close