• Volume/Page
  • Keyword
  • DOI
  • Citation
  • Advanced
   
 
 
 

Journal of the Acoustical Society of America

Year Range: 
Search Issue | RSS Feeds RSS
Previous Issue

Jun 1972

Volume 51, Issue 6B, pp. 1839-2065

Page 1 of 2 Pages Next Page | Jump to Page

Direct Estimation of Multidimensional Tonal Dissimilarity

Thomas Carvellas and Bruce Schneider

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1839-1848 (1972); (10 pages)

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
Magnitude estimates were obtained for the dissimilarity of pure tones varying in frequency and intensity. In Expt. 1, a spatial embedding for a set of seven tones was determined by assuming the magnitude estimates represented distances in a two‐dimensional space whose coordinate axes were pitch and loudness. Embeddings were determined for both Euclidean and “city‐block” distance functions. A linear relation was consistently observed between the projections of the tones on dimensions of pitch and loudness and their values on Stevens's mel and sone scales, with the city‐block projections usually yielding higher correlations. In Expt. 2, 11 tones were used and the magnitude estimates of dissimilarity were compared to the tightly constrained “monotone” distances derived from the rank‐order properties of the dissimilarities. The agreement observed between the magnitude estimates and the monotone predicted distances strongly supports a distance interpretation for the dissimilarities.

Auditory Temporal Integration at Relatively High Masked‐Threshold Levels

Roy W. Gengel

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1849-1851 (1972); (3 pages) | Cited 5 times

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
The clinical literature indicates that some persons with sensorineural hearing impairment show temporal integration functions that have relatively flat slopes when compared to subjects (Ss) with normal sensitivity. It is not clear whether temporal integration functions with reduced slopes are due to sensorineural hearing impairment per se, or to the relatively high levels of signal required to measure the thresholds of these hearing‐impaired Ss. Therefore, temporal integration functions were obtained from four normal Ss in the presence of a masking noise of sufficient over‐all level (87 dB SPL re 0.0002 dyn/cm2) to simulate moderate to severe hearing loss. The results indicate that temporal integration functions with a reduced slope are a consequence of sensorineural hearing impairment and are not due to the relatively high levels of signal required to measure threshold.

Binaural Interaction in Low‐Frequency Stimuli: The Inability to Trade Time and Intensity Completely

Ervin R. Hafter and Samuel C. Carrier

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1852-1862 (1972); (11 pages) | Cited 11 times

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
Subjects were presented with low‐frequency tonal signals, leading in time to one ear but more intense to the other. For differing values of interaural time (Δt) and intensity (ΔI), the listener's task was to detect the difference between the dichotic signals and diotic tones of the same frequency. The shapes of the psychometric functions demonstrate that time and intensity do trade but that the trade is incomplete—a result shown to be in accord with other experiments where listeners presented with such signals report the presence of two distinct “images.” A model of time‐intensity trading is presented in which it is assumed that differences of intensity are converted to time by two different trading mechanisms and that the two sums of Δt and converted ΔI act as different loci on a single interaural dimension. It is further suggested that the inner and outer hair cells may represent the site of the two trades.

Auditory Distortion Products f2f1 and 2f1f2

J. L. Hall

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1863-1871 (1972); (9 pages) | Cited 6 times

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
Two primary tones of frequency f1 = 368–1475 Hz and f2 = 1.1–1.5 times f1 were presented monaurally at 60–80 dB sound‐pressure level. Nonlinear distortion products at both f2f1 and 2f1f2 were made to beat against low‐intensity probe tones slightly mistuned from the frequency of the distortion product. Subjects adjusted the amplitude and phase of cancellation tones at the frequency of the distortion product to eliminate the beating. The amplitude and phase of the 2f1f2 cancellation tone agree with results already in the literature. The amplitude of the f2f1 cancellation tone relative to the primary tones is insensitive to amplitude of the primary tones. Both the amplitude and phase angle of the f2f1 cancellation tone decrease with an increase of f1 or f2/f1. Differences between properties of the 2f1f2 and f2f1 cancellation tones are consistent with a model for generation of distortion products proposed by Goldstein.

Monaural Phase Effect: Cancellation and Reinforcement of Distortion Products f2f1 and 2f1f2

J. L. Hall

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1872-1881 (1972); (10 pages) | Cited 1 time

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
When two primary tones in the frequency ratio f1:f2  =  2:3 are presented to human listeners, a combination tone is audible at f1/2. The loudness of this combination tone depends on the phase angle between the two primary tones. We measured the amplitude and phase of the combination tone using a cancellation‐ plus probe‐tone technique. The phase effect can be explained by interaction of a second‐order distortion product f2f1 and a third‐order distortion product 2f1f2 of approximately equal intensities. The intensity relative to the primary tones of both distortion products is influenced strongly by frequency of the primary tones and weakly by intensity of the primary tones.

Monaural Phase Effects for Two‐Tone Signals

J. L. Hall and M. R. Schroeder

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1882-1884 (1972); (3 pages)

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
Effects of relative phase on subjective quality were found for signals with two components having a frequency ratio of 1:2 (octave). The data were analyzed by means of INDSCAL, a multidimensional scaling program. The subjective space corresponding to different phase conditions is a circle, and equal phase differences correspond to equal distances along the circle. These results confirm the sensitivity of the auditory system to waveform even when only two frequency components are involved, and they demonstrate that perception of the two‐component signal falls into an orderly pattern.

Psychophysical Evidence for Lateral Inhibition in Hearing

T. Houtgast

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1885-1894 (1972); (10 pages) | Cited 27 times

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
Although there are some indications of the possible role of lateral inhibition in hearing, there has been no clear demonstration of it in psychophysical experiments. Either the phenomenon plays only a minor role, or it has escaped psychophysical verification. Accepting for a moment the second possibility, is argued in this paper that the threshold of a test tone presented simultaneously with a masker does not reflect clear lateral‐inhibition effects since the inhibition affects both the test tone and the masker. Two different methods, in which the test tone and the masker were presented succsesively, give clear psychophysical evidence of lateral inhibition in hearing. Firstly, the threshold curve of short test‐tone bursts presented in the gaps between repeated masker bursts (noise with a steep negative or positive gradient at a particular frequency) shows marked edge effects. Secondly, the results of psychophysical measurements on two‐tone suppression indicate that the nervous activity due to one frequency component may be suppressed by another component. The effect at the edges of the frequency spectrum are comparable with visual Mach bands, and the interaction of two tones is suggestive of the two‐tone inhibition found in auditory‐nerve fibers.

A Hybrid‐Computer Model of the Cochlear Partition

Allyn E. Hubbard and C. Daniel Geisler

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1895-1903 (1972); (9 pages) | Cited 3 times

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
A hybrid‐computer simulation of the Peterson‐Bogert model of the cochlea is reported. The present simulation is shown to be a good representation of the original model except near 10 kHz. The output of the model at one point has many similarities to recent experimental data. In particular, the low‐frequency slope of the amplitude function is +6 dB/oct and the high‐frequency slope is steeply negative. The phase function shows a break from linearity near the resonance frequency and approaches +90° at low frequencies. Quantitative discrepancies between model and experimental data exist, but they could be appreciably reduced by changes in parameter values. Incorporating nonlinear membrane damping in the simulation produces nonlinear effects similar to those observed in the cochlea. With increasing intensity, the displacement peak becomes relatively reduced and occurs at a somewhat lower frequency.

Tympanic Membrane Vibrations in Cats Studied by Time‐Averaged Holography

Shyam M. Khanna and Juergen Tonndorf

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1904-1920 (1972); (17 pages) | Cited 10 times

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
The vibratory characteristics of the cat's tympanic membrane (TM) were studied in fresh cadavers and living animals with the aid of time‐averaged holography. Initially, technical difficulties were presented by the TM's small size, low stiffness, and its transparency. Time‐averaged holograms of the TM were obtained for various sound‐pressure levels at a number of frequencies between 600 and 6000 Hz. Reconstructions of such holograms show an image of the TM superimposed by isoamplitude contour lines giving the absolute displacement amplitude at any point on the TM. Vibratory patterns remain essentially unchanged in their first mode up to a frequency of 2000 Hz, with higher modes occurring above that value. At all frequencies, the TM vibrates maximally in the posterior‐superior quadrant and less in the anterior and inferior ones. These results show that, even in its first mode of operation, i.e., below 2000 Hz, the TM does not vibrate like a stiff plate as observed by von Békésy in human cadavers. Rather, the present findings support the curved‐membrane concept of Helmholtz. Contributions to the total transformer action of the middle ear by the area ratio: TM/stapes footplate, the ossicular lever ratio, and that due to the curved membrane action are reevaluated.

Masking with Continuous and Gated Sinusoids

Barry Leshowitz and Edward Cudahy

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1921-1929 (1972); (9 pages) | Cited 1 time

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
The detection of a 5‐msec tonal signal presented at various delays after the onset of a 500‐msec tonal masker was investigated in several transient‐masking experiments. Changes in the amount of masking were measured as a function of the delay between onset of the signal and the onset of the longer‐duration masker. In agreement with previous investigations of transient masking there was a large decrease in masked threshold, 26 dB in some conditions, with an increase in signal delay. A “transient overshoot” at masker termination was also noted. These data are consistent with a short‐time energy‐sensing model of detection. In a second transient‐masking study, the masking produced by a gated sinusoid in the presence of a continuous tonal pedestal was investigated. The pedestal results indicate that a simple, linear, time‐invariant, energy‐detection scheme cannot account for the results of two‐tone masking experiments.

Exploratory Studies of Zwicker's “Negative Afterimage” in Hearing

Robert C. Lummis and Newman Guttman

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1930-1944 (1972); (15 pages) | Cited 3 times

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
This paper describes several exploratory experiments on a subjective tone that Zwicker discovered and referred to as the “negative afterimage.” Zwicker described the tone as audible upon cessation of a band‐rejected noise or band‐rejected pulse train stimulus, and as having a pitch corresponding to a frequency within the rejected band. We worked exclusively with noise stimuli. Our principal finding is that the minimum adequate stimulus appears to be a spectral edge rather than a rejected band, and that a low‐pass edge is more effective than a high‐pass edge. Other findings are as follows. Most people hear the tone, but susceptibility varies widely and the effect is labile. With favorable stimulus parameters, precision of matching to its pitch is on the order of 5% to 10% of the mean. The principal determinant of pitch is the location of the edge of the low‐pass component of the stimulus; the pitch corresponds to a frequency about ⅔ oct above that edge. Energy in the high‐pass component of the inducing stimulus enhances perception of the tone but has little effect on its pitch.

Effect of Signal Frequency on the MLD for Uncorrelated Noise

Donald E. Robinson and Terrence R. Dolan

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1945-1946 (1972); (2 pages)

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
Rabiner, Laurence, and Durlach [J. Acoust. Soc. Amer. 40, 62–70 (1966)] reported that the masking‐level difference (MLD) for uncorrelated noise with signals in‐phase and reversed in‐phase at the ears was independent of signal frequency for frequencies between 150 and 1000 Hz. Since all other “MLD conditions” show marked signal‐frequency effects, and since the E‐C model predicts that such effects will also occur with uncorrelated noise, the present study reinvestigates the effect of signal frequency on the MLD for uncorrelated noise. The results indicate that the MLD for NUS0 decreases with increasing signal frequency.

Changes in Frequency Discrimination Caused by Leading and Trailing Tones

Don A. Ronken

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1947-1850 (1972); (-96 pages) | Cited 2 times

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
Frequency discrimination for a tone of short duration may be strongly affected by the temporal proximity of another tone. One of the effects is unlike either masking or memory interference and may be related to the time required by the auditory system for frequency organization.

Equal Loudness Contours Derived from Sensory Magnitude Judgments

Bruce Schneider, Anthony A. Wright, William Edelheit, Peggy Hock, and Christopher Humphrey

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1951-1959 (1972); (9 pages)

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
Magnitude estimates were obtained for the loudness of 77 pure tones varying in frequency (11 values) and intensity (seven values) when (1) all 77 tones were presented intermixed within a session, and (2) all tones of one frequency were presented before presenting tones of another frequency. A power function described the growth of loudness for each frequency for both conditions 1 and 2, and the exponent of the power function varied with frequency. Equal loudness contours were derived from the power functions of Condition 1 and were compared to those obtained from other experiments using different procedures.

Simultaneous Two‐Channel Signal Detection. II. Correlated and Uncorrelated Signals

Robert D. Sorkin, Richard E. Pastore, and Lawrence D. Pohlmann

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1960-1965 (1972); (6 pages)

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
Two‐channel auditory signal detection was investigated with narrow‐band noise signals masked by independent, binaurally uncorrelated noise. The two‐channel tasks included conditions where the signals in each channel were correlated, uncorrelated, or opposite in phase. No significant differences were observed between the single‐channel conditions and the two‐channel correlated signal condition, but performance in the uncorrelated and 180° phase conditions was significantly poorer. A comprehensive analysis of cross‐channel interaction indicated a cross‐channel inhibitory or masking effect, which may be counteracted by the presence of lateralization cues. Constraints on signal detectability due to a limited processing capacity appeared to be negligible.

Signal Detection as a Function of Contralateral Sinusoid‐to‐Noise Ratio

William A. Yost, M. J. Penner, and L. L. Feth

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1966-1970 (1972); (5 pages) | Cited 1 time

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
Detection of a masked signal (500 Hz) at one ear was measured while a sinusoid of the same frequency and a noise were presented to the contralateral ear. Both the signal and the contralateral sinusoids were pulsed, and they were either in phase or inverted. The noise at the signal ear was presented continuously at a spectrum level of 40 dB. In Expt. 1, the same continuous noise was presented to the contralateral ear at a spectrum level of 40 dB and in phase with the noise in the signal ear. The contralateral sinusoid‐to‐noise ratio was varied by changing the level of the sinusoid. In Expt. 2, the contralateral sinusoid‐to‐noise ratio was held constant while the levels of both the contralateral sinusoid and the contralateral noise were varied. When the contralateral sinusoid and noise are intense, a 50‐dB increase in the contralateral sinusoid‐to‐noise ratio causes a 25‐ to 35‐dB change in the signal‐to‐noise ratio required for detection. The results show that contralateral stimulation can either enhance or degrade detection of a monaural signal. The results are compared to data from investigations of masking‐level difference, central masking, and monarual detection with contralateral cue.

Measuring the Resonances of Treble Viol Plates by Hologram Interferometry and Designing an Improved Instrument

Carl‐Hugo Ågren and Karl A. Stetson

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1971-1983 (1972); (13 pages)

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
This paper is a presentation of the first application of the techniques of vibration analysis by hologram interferometry to the study of stringed instrument resonances. The instrument studied was the treble viol, which was a convenient size for a hologram system and allowed approximate simulation of the actual instrument by the clamped top plate. After study of an instrument of classical proportions by hologram interferometry, a new instrument was designed that was longer, wider, and flatter. Holographic study of the top plate of this new instrument verified that it should function well on a finished instrument since its resonance frequencies suited the instrument range. The rest of the instrument was built and tested holographically after assembly to confirm the previous simulation. The instrument was completed and played with very good results, i.e., more volume, easier playing, and more agreeable tone than the classically designed treble viol. The primary experimental difficulty encountered was the swelling of the wooden plates with humidity which adversely affected real‐time interferometry, and the steps taken to overcome this were only partially successful.

Research Potentials in Auditory Characteristics of Violin Tone

Paul C. Boomsliter and Warren Creel

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1984-1993 (1972); (10 pages)

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
Sensations of tone, including violin tone, come partly from evoked matrices generated within the brain. Cited evidence from electrodes permanently implanted in the brains of conditioned cats shows that an incoming signal evokes generation of temporally patterned recurrences. The cat acts appropriately for the signal when coincidence exists between the input and the internally generated pattern. An adequately functioning nervous system experiences tone from input of regularly repeating waves, such as are compatible with neural temporal recurrence, but human patients with some neural handicaps, including vertebral artery insufficiency, commonly experience noise instead of tone from smooth sine wave input, presumably because of inadequate internal generation or coincidence detection. The normal human nervous system, with many neurons responding only to change, is not a steady‐state apparatus, yet it uses temporal recurrence with an appetite for regularity. Human preference for regularity and change can be observed when a master plays a violin which is thus physiologically controlled.

Directivity of the Bowed Stringed Instruments and Its Effect on Orchestral Sound in Concert Halls

Jürgen Meyer

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 1994-2009 (1972); (16 pages) | Cited 5 times

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
The directional characteristics of sound radiation were investigated for violins, violas, cellos, and contrabasses. Each instrument group has a nondirectional characteristic in its lowest frequency range except the contrabass. At higher frequencies, there are regions of preferred radiation, which change their direction and anglewidth with frequency. These results suggest that different seating arrangements for the strings would be optimum for different concert halls and different styles of musical compositions. The European seating system is optimum for the radiation of the cellos; it has the advantage that the sound character of the first and second violins may be distinguished by the audience, but the sound of the second violins has no brilliance. The American system makes the second violins sound as good as the first violins, but it is difficult to distinguish both groups in contrapuntal passages of classical symphonies. The sound of the cellos is poor and without clarity. The validity of these findings is confirmed by measurements in concert halls, and they have been used for guidance in changing the seating arrangement of an orchestra to improve the instrumental balance with successful results.

New Electronic Tuning Device for Pianos

A. C. van der Woerd

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 2010-2017 (1972); (8 pages)

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
In the tuning device being described a reference oscillator generates 84 crystal‐controlled frequencies, which follow a stretched scale whose deviations are programmed in an interchangeable diode matrix. Three different diode matrices have been realized, intended for tuning small, medium‐size, and grand piano's, respectively. A bandpass filter is used, the transmission range of which automatically follows the reference frequency. An electromagnetic device is used as a sound transducer. In consequence of this, tuning of all strings on their fundamentals becomes fully feasible. With one medium‐size piano, the achieved accuracy was measured. The accuracy of the tuning with respect to the reference frequency was <10−4 in all cases. The total accuracy (including the inaccuracy of the reference oscillator) varies from 10−4 for the lowest note to 5×10−4 for the highest note.

The Timing of Utterances and Linguistic Boundaries

Ilse Lehiste

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 2018-2024 (1972); (7 pages) | Cited 9 times

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
This paper is concerned with the effect of morphological and syntactic boundaries on the temporal structure of spoken utterances. Two speakers produced 20 tokens each of four sets of words consisting of a mono‐syllabic base form, disyllabic and trisyllabic words derived from the base by the addition of suffixes, and three short sentences in which the base form was followed by a syntactic boundary; this in turn was followed by a stressed syllable, one unstressed syllable, and two unstressed syllables. The sentences thus reproduced the syllabic sequences of the derived words. The duration of words and segments was measured from oscillograms. The manifestation of morphological and syntactic boundaries is discussed, and some implications of the findings relative to the temporal programming of spoken utterances are considered.

Bilabial Closure Durations for /p/, /b/, and /m/ in Voiced and Whispered Vowel Environments

Martin F. Schwartz

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 2025-2029 (1972); (5 pages)

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
A study was made of the duration of bilabial closures for /p/, /b/, and /m/ under voiced and whispered speaking conditions. The results, obtained from 12 normal‐speaking young adult males, indicated significantly greater whisper durations for /p/ and /b/ but not /m/. An explanation for this finding is offered in terms of the hypothesis that individuals conserve air when whispering by prolonging air‐arresting articulatory gestures.

Experiment on Voice Identification

Oscar Tosi, Herbert Oyer, William Lashbrook, Charles Pedrey, Julie Nicol, and Ernest Nash

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 2030-2043 (1972); (14 pages) | Cited 1 time

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
A two‐year experiment on voice identification through visual inspection of spectrograms was performed with the twofold goal of checking Kersta's claims in this matter [Nature 196, 1253–1257 (1962)] and testing models including variables related to forensic tasks. The 250 speakers used in this experiment were randomly selected from a homogeneous population of 25 000 males speaking general American English, all students at Michigan State University. A total of 34 996 experimental trials of identification were performed by 29 trained examiners. Each trial involved up to 40 known voices, in various conditions: with closed and open trials, contemporary and noncontemporary spectrograms, nine or six clue words spoken in isolation, in a fixed context and in a random context, etc. The examiners were forced to reach a positive decision (identification or elimination) in each instance, taking an average time of 15 minutes. Their decisions were based solely on inspection of spectrograms; listening to the identification by voices was excluded from this experiment. The examiners graded their self‐confidence in their judgments on a 4‐point scale (1 and 2, uncertain; 3 and 4, certain). Results of this experiment confirmed Kersta's experimental data, which involved only closed trials of contemporary spectrograms and clue words spoken in isolation. Experimental trials of this study, correlated with forensic models (open trials, fixed and random contexts, noncontemporary spectrograms), yielded an error of approximately 6% false identifications and approximately 13% false eliminations. The examiners judged approximately 60% of their wrong answers and 20% of their right answers as “uncertain.” This suggests that if the examiners had been able to express no opinion when in doubt, only 74% of the total number of tasks would have had a positive answer, with approximately 2% errors of false identification and 5% errors of false elimination. The different conditions existing between experimental trials of identification or elimination performed by the examiners of this study and the tasks performed by a professional examiner in real cases are discussed.

Efficient Acoustic Parameters for Speaker Recognition

Jared J. Wolf

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 2044-2056 (1972); (13 pages) | Cited 1 time

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
In a scheme for the mechanical recognition of speakers, it is desirable to use acoustic parameters that are closely related to voice characteristics that distinguish speakers. This paper describes an investigation of an efficient approach to selecting such parameters, which are movitated by known relations between the voice signal and vocal‐tract shapes and gestures. Rather than general measurements over the extent of an utterance, only significant features of selected segments are used. A simulation of a speaker recognition system was performed by manually locating speech events within utterances and using parameters measured at these locations to classify the speakers. Useful parameters were found in fundamental frequency, features of vowel and nasal consonant spectra, estimation of glottal source spectrum slope, word duration, and voice onset time. These parameters were tested in speaker recognition paradigms using simple linear classification procedures. When only 17 such parameters were used, no errors were made in speaker identification from a set of 21 adult male speakers. Under the same conditions, speaker verification errors of the order of 2% were also obtained.
back to top
RSS Feeds

Significance of Resistance Measurements in the Scala Media

Y. Cern and H. Fischler

J. Acoust. Soc. Am. Volume 51, Issue 6B, pp. 2057-2059 (1972); (3 pages)

Online Publication Date: 12 Aug 2005

Full Text: | Download PDF

Show Abstract
Resistance measurements of the cochlear membranes are analyzed regarding the effective length of the cochlea actually sensed. It is shown that only short segments are represented by the measurements, as confirmed by experimental results. It is asserted that both the longitudinal and transverse dimensions must be considered simultaneously in cochlear modeling, as facilitated by a transmission line or core conductor approach.
Page 1 of 2 Pages Next Page | Jump to Page
Close

close