• Volume/Page
  • Keyword
  • DOI
  • Citation
  • Advanced
   
 
 
 

Perceptually based head-related transfer function database optimization

J. Acoust. Soc. Am. Volume 131, Issue 2, pp. EL99-EL105 (2012); (7 pages)

Brian F. G. Katz and Gaëtan Parseihian

LIMSI-CNRS, BP 133, Universit Paris Sud, Orsay, France Brian.Katz@limsi.fr, Gaetan.Parseihian@limsi.fr

Full Text: Read Online (HTML) | Download PDF FREE | View Cart
In the context of binaural audio rendering, choosing the best head-related transfer function (HRTF) for an individual from large databases poses several problems. This study proposes a method to reduce the size of a given HRTF database. Participants, 45 in total, were asked to rate the quality of binaural synthesis for 46 HRTFs. The lack of reciprocity in the ratings was noted. Results were used to create a perceptually optimized HRTF subset which satisfied all participants’ judgments. The subset was validated using localization tests on a separate group of subjects with results showing reduced errors when subjects were given their best choice, rather than their worst choice HRTF.

© 2012 Acoustical Society of America

Acknowledgments

This work was supported in part by the French National Research Agency (ANR) through the TecSan program (Grant No. NAVIG ANR-08TECS-011) and the Midi-Pyrénées region through the APRRTT program.

Article Outline

  1. Introduction
  2. HRTF database
  3. Perceptual evaluation
  4. Database reduction
  5. Database subset validation
    1. HRTF subset selection
    2. Localization task
    3. Results
  6. Discussion

KEYWORDS and PACS

Keywords

hearing

PACS

ARTICLE DATA

History
Received 30 Sep 2011
Accepted 01 Dec 2011
Published online 13 Jan 2012

PUBLICATION DATA

ISSN

0001-4966 (print)  

  1. V. R. Algazi, R. O. Duda, D. P. Thompson, and C. Avendano, “The CIPIC HRTF database,” in 2001 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY (October 21–24, 2001), pp. 99–102.
  2. IRCAM LISTEN HRTF database. http://recherche.ircam.fr/equipes/salles/listen/ (Last viewed December 17, 2011).
  3. Tohoku HRTF database available at http://www.ais.riec.tohoku.ac.jp/lab/db-hrtf/ (Last viewed December 17, 2011).
  4. Y. Iwaya, “Individualization of head-related transfer functions with tournament-style listening test: Listening with other's ears,” Acoust. Sci. Tech. 27(6), 340–343, (2006).
  5. H. Jo, W. Martens, and Y. Park, “Evaluating candidate sets of head-related transfer functions for control of virtual source elevation,” in Audio Engineering Society Conference: 40th International Conference: Spatial Audio: Sense the Sound of Space, Tokyo, Japan (October 8–10, 2010), 12 pp.
  6. J. C. Middlebrooks, E. A. Macpherson, and Z. A. Onsan, “Psychophysical customization of directional transfer functions for virtual sound localization,” J. Acoust. Soc. Am. 108(6), 3088–3091 (2000).
  7. F. Wightman and D. Kistler, “Multidimensional scaling analysis of head-related transfer functions,” in IEEE Digital Audio Workshop, New Paltz, NY (October 17–20, 1993), pp. 98–101.
  8. P. Minnaar, J. Plogsties, S. Krarup Olesen, F. Christensen, and H. Mller, “The interaural time difference in binaural synthesis,” in Audio Engineering Society Convention, Paris, France (February 19–22, 2000), Preprint 5133, 20 pp.
  9. LIMSI Spatialisation Engine, InterDeposit Digital No. IDDN.FR. 001.340014.000.S.P. 2010.000.31235.
  10. J.-M. Pernaux, “Spatialisation du son par les techniques binaurales: application aux services de telecommunications (Sound spatialization using binaural sound synthesis: Application to telecommuncation services),” Ph.D. dissertation, INGP Grenoble, 2003.
  11. F. Dramas, B. F. G. Katz, and C. Jouffrais, “Auditory-guided reaching movements in the peripersonal frontal space,” Acoustics, 123, 3723 (2008).
  12. J. C. Middlebrooks, “Narrow-band sound localization related to external ear acoustics,” J. Acoust. Soc. Am. 92(5), 2607–2624 (1992).
  13. F. L. Wightman and D. J. Kistler, “Headphone simulation of free-field listening. II: Psy-chophysical validation,” J. Acoust. Soc. Am. 85(2), 868–878 (1989).
  14. R. L. Martin, K. I. McAnally, and M. A. Senova. Free-field equivalent localization of virtual audio. J. Audio Eng. Soc. 49(1/2), 14–22 (2001).
  15. E. M. Wenzel, M. Arruda, D. J. Kistler, and F. L. Wightman, “Localization using non-individualized headrelated transfer functions,” J. Acoust. Soc. Am. 94(1), 111–123 (1993).


Figures (1) Tables (1)

Figures (click on thumbnails to view enlargements)

FIG.1
(Color online) (a) Subjective HRTF rating test results: (×) bad; (·) ok; (•) excellent. (Ο) = trans-pose of excellent (•T). Lowercase identifiers represent subjects while uppercase represents the corresponding subject’s HRTF. Note: subject aj did not participate but HRTF AJ was included in the test. Split boxplot-histogram of the magnitude of (b) lateral angular error and (c) polar angular error, by group. Boxplot and angular error scale at the left; histogram value legend color bar on the right. Mean values are represented by •.

FIG.1 Download High Resolution Image (.zip file) | Export Figure to PowerPoint

Tables

Table I. Percentage of error according to type and mean linear regression analysis. Variances shown in parentheses.

View Table


Close

close