Applications of 3-Dimensional Spherical Transforms to Acoustics and Personalization of Head-related Transfer Functions (HRTFs)
The spherical harmonic transform (SHT), which returns spatial frequency components of data or distributions determined on the unit sphere, has found many applications in acoustics, such as spatial sound capture and reproduction, beamforming with spherical arrays, analysis of transducer radiation patterns, interpolation of head-related transfer functions (HRTFs) and others. However the SHT is a 2-dimensional transform, not suited for 3-dimensional data that vary not only with angle but additionally across the radial dimension. This work examines two 3-dimensional spherical transforms, namely the spherical Fourier-Bessel transform (SFBT) and the spherical harmonic oscillator transform (SHOT), and considers their potential uses in acoustics. The study presents preliminary results on their application to personalization of HRTFs, avoiding the cumbersome task of measuring them directly. Assuming that head-shape similarity correlates to some extent with HRTF similarity, we employ the aforementioned transforms to get a spectral representation of the user’s head scan, and determine its distance from the spectra of head scans associated with the HRTF database.
发言人详细信息
Archontis Politis obtained his M.Eng. degree in civil engineering at Aristotle’s University of Thessaloniki, Greece, and his M.Sc. degree in sound & vibration studies at ISVR, University of Southampton, UK, in 2006 and 2008 respectively. From 2008 to 2010 he worked as a graduate acoustic consultant at Arup Acoustics, Glasgow, UK, and as a researcher in a joint collaboration between Arup Acoustics and the Glasgow School of Arts, on interactive auralization of architectural spaces using 3D sound techniques. He is currently pursuing a doctoral degree at Aalto University, Finland, in the field of parametric spatial sound recording, analysis and reproduction.
- 日期:
- 演讲者:
- Archontis Politis
- 所属机构:
- Aalto University
-
-
Casey Anderson
-
-
系列: Microsoft Research Talks
-
Decoding the Human Brain – A Neurosurgeon’s Experience
Speakers:- Pascal Zinn,
- Ivan Tashev
-
-
-
-
Galea: The Bridge Between Mixed Reality and Neurotechnology
Speakers:- Eva Esteban,
- Conor Russomanno
-
Current and Future Application of BCIs
Speakers:- Christoph Guger
-
Challenges in Evolving a Successful Database Product (SQL Server) to a Cloud Service (SQL Azure)
Speakers:- Hanuma Kodavalla,
- Phil Bernstein
-
Improving text prediction accuracy using neurophysiology
Speakers:- Sophia Mehdizadeh
-
-
DIABLo: a Deep Individual-Agnostic Binaural Localizer
Speakers:- Shoken Kaneko
-
-
Recent Efforts Towards Efficient And Scalable Neural Waveform Coding
Speakers:- Kai Zhen
-
-
Audio-based Toxic Language Detection
Speakers:- Midia Yousefi
-
-
From SqueezeNet to SqueezeBERT: Developing Efficient Deep Neural Networks
Speakers:- Sujeeth Bharadwaj
-
Hope Speech and Help Speech: Surfacing Positivity Amidst Hate
Speakers:- Monojit Choudhury
-
-
-
-
-
'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project
Speakers:- Peter Clark
-
Checkpointing the Un-checkpointable: the Split-Process Approach for MPI and Formal Verification
Speakers:- Gene Cooperman
-
Learning Structured Models for Safe Robot Control
Speakers:- Ashish Kapoor
-