• Volume/Page
  • Keyword
  • DOI
  • Citation
  • Advanced
   
 
 
 

Journal of the Acoustical Society of America

Year Range: 
Search Issue | RSS Feeds RSS
Previous Issue

Dec 2009

Volume 126, Issue 6, pp. EL171-3387

back to top
RSS Feeds

The dynamic range of speech, compression, and its effect on the speech reception threshold in stationary and interrupted noise

Koenraad S. Rhebergen, Niek J. Versfeld, and Wouter. A. Dreschler

J. Acoust. Soc. Am. Volume 126, Issue 6, pp. 3236-3245 (2009); (10 pages) | Cited 2 times

Online Publication Date: 14 Dec 2009

Full Text: Read Online (HTML) | Download PDF

Show Abstract
Changes in the speech reception threshold (SRT) after amplitude compression of speech or speech in noise may be due to changes in the dynamic range of the speech signal. However, current models set up to predict the speech intelligibility consider the dynamic range of speech to be fixed regardless of the type of compression. The present paper describes two experiments with normal-hearing subjects to examine the effect of the dynamic range on the SRT in stationary and interrupted noise after wide dynamic range compression. The dynamic range has been varied by compression or expansion of only the speech signal, leaving the masking noise unaltered, or by compression or expansion of the mixed speech-in-noise signal. The results show that compression affects the SRT, both in a positive or a negative direction, not only due to dynamic range but also due to distortion of the speech signal.
Show PACS
43.71.An Models and theories of speech perception
43.66.Ba Models and theories of auditory processes
43.71.Gv Measures of speech perception (intelligibility and quality)
43.72.Kb Speech communication systems and dialogue systems

Speech identification in noise: Contribution of temporal, spectral, and visual speech cues

Jeesun Kim, Chris Davis, and Christopher Groot

J. Acoust. Soc. Am. Volume 126, Issue 6, pp. 3246-3257 (2009); (12 pages) | Cited 1 time

Online Publication Date: 14 Dec 2009

Full Text: Read Online (HTML) | Download PDF

Show Abstract
This study investigated the degree to which two types of reduced auditory signals (cochlear implant simulations) and visual speech cues combined for speech identification. The auditory speech stimuli were filtered to have only amplitude envelope cues or both amplitude envelope and spectral cues and were presented with/without visual speech. In Experiment 1, IEEE sentences were presented in quiet and noise. For in-quiet presentation, speech identification was enhanced by the addition of both spectral and visual speech cues. Due to a ceiling effect, the degree to which these effects combined could not be determined. In noise, these facilitation effects were more marked and were additive. Experiment 2 examined consonant and vowel identification in the context of CVC or VCV syllables presented in noise. For consonants, both spectral and visual speech cues facilitated identification and these effects were additive. For vowels, the effect of combined cues was underadditive, with the effect of spectral cues reduced when presented with visual speech cues. Analysis indicated that without visual speech, spectral cues facilitated the transmission of place information and vowel height, whereas with visual speech, they facilitated lip rounding, with little impact on the transmission of place information.
Show PACS
43.71.Es Vowel and consonant perception; perception of words, sentences, and fluent speech
43.66.Ts Auditory prostheses, hearing aids

Effects of electrode separation between speech and noise signals on consonant identification in cochlear implants

Bom Jun Kwon

J. Acoust. Soc. Am. Volume 126, Issue 6, pp. 3258-3267 (2009); (10 pages)

Online Publication Date: 14 Dec 2009

Full Text: Read Online (HTML) | Download PDF

Show Abstract
The aim of the present study was to examine cochlear implant (CI) users’ perceptual segregation of speech from background noise with differing degrees of electrode separation between speech and noise. Eleven users of the nucleus CI system were tested on consonant identification using an experimental processing scheme called “multi-stream processing” in which speech and noise stimuli were processed separately and interleaved. Speech was presented to either ten (every other electrode) or six electrodes (every fourth electrode). Noise was routed to either the same (the “overlapped” condition) or a different set of electrodes (the “interlaced” condition), where speech and noise electrodes were separated by one- and two-electrode spacings for ten- and six-electrode presentations, respectively. Results indicated a small but significant improvement in consonant recognition (5%–10%) in the interlaced condition with a two-electrode spacing (approximately 1.1 mm) in two subjects. It appears that the results were influenced by peripheral channel interactions, partially accounting for individual variability. Although the overall effect was small and observed from a small number of subjects, the present study demonstrated that CI users’ performance on segregating the target from the background might be improved if these sounds were presented with sufficient peripheral separation.
Show PACS
43.71.Ky Speech perception by the hearing impaired
43.66.Ts Auditory prostheses, hearing aids
43.71.Es Vowel and consonant perception; perception of words, sentences, and fluent speech
43.71.Gv Measures of speech perception (intelligibility and quality)
Close

close