Auditory‐nerve fiber spike trains were recorded in response to spoken English stop consonant–vowel syllables, both voiced (/b,d,g/) and unvoiced (/p,t,k/), in the initial position of syllables with the vowels /i,a,u/. Temporal properties of the neural responses and stimulus spectra are displayed in a spectrographic format. The responses were categorized in terms of the fibers’ characteristic frequencies (CF) and spontaneous rates (SR). High‐CF, high‐SR fibers generally synchronize to formants throughout the syllables. High‐CF, low/medium‐SR fibers may also synchronize to formants; however, during the voicing, there may be sufficient low‐frequency energy present to suppress a fiber’s synchronized response to a formant near its CF. Low‐CF fibers, from both SR groups, synchronize to energy associated with voicing. Several proposed acoustic correlates to perceptual features of stop consonant–vowel syllables, including the initial spectrum, formant transitions, and voice‐onset time, are represented in the temporal properties of auditory‐nerve fiber responses. Nonlinear suppression affects the temporal features of the responses, particularly those of low/medium‐spontaneous‐rate fibers.