Re: [AUDITORY] Seeking advice on improving localization clarity in static binaural playback with non-individualized HRTFs

To: AUDITORY@xxxxxxxxxxxxxxx

Subject: Re: [AUDITORY] Seeking advice on improving localization clarity in static binaural playback with non-individualized HRTFs

From: "Picinali, Lorenzo" <l.picinali@xxxxxxxxxxxxxx>

Date: Thu, 7 Aug 2025 12:36:58 +0000

Hello Dingding,

Here are my two cents on the matter. It is known that the spectral cues our hearing system uses to determine whether a source is in front or behind lie mainly between 4 and 8 kHz, meaning that sources behind the listener might have 3-5 dB less energy in that range (other bands are usually affected in a similar way, for example around 12 and 14/15 kHz, although the front-back difference in dB is smaller there). Similarly, elevated sources are likely to have a ~5-7 dB boost around 7-9 kHz, together with attenuation between 12 and 14 kHz.
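
To make these figures concrete, here is a minimal Python/NumPy sketch that imposes such band-level changes on white noise. It is only an illustration under stated assumptions: the helper names (`apply_band_gain`, `band_level_db`), the brick-wall FFT-domain gain, the 48 kHz sample rate, and the exact dB values (picked from within the ranges above; the 12-14 kHz attenuation depth is not stated and is assumed here) are all hypothetical. Real HRTF cues are smooth, direction-dependent filters, not rectangular band gains.

```python
import numpy as np

def apply_band_gain(x, fs, f_lo, f_hi, gain_db):
    """Scale the f_lo..f_hi band of a mono signal by gain_db (brick-wall, FFT domain)."""
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    band = (freqs >= f_lo) & (freqs <= f_hi)
    spectrum[band] *= 10.0 ** (gain_db / 20.0)  # dB -> linear amplitude factor
    return np.fft.irfft(spectrum, n=len(x))

def band_level_db(x, fs, f_lo, f_hi):
    """Total energy of the signal in the f_lo..f_hi band, in dB (arbitrary reference)."""
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    band = (freqs >= f_lo) & (freqs <= f_hi)
    return 10.0 * np.log10(np.sum(np.abs(spectrum[band]) ** 2))

fs = 48000                                   # assumed sample rate
rng = np.random.default_rng(0)
noise = rng.standard_normal(fs)              # one second of white noise

# "Rear-like" colouring: 4 dB less energy between 4 and 8 kHz (within the 3-5 dB range).
rear_like = apply_band_gain(noise, fs, 4000.0, 8000.0, -4.0)

# "Elevated-like" colouring: +6 dB around 7-9 kHz, plus an ASSUMED -6 dB at 12-14 kHz.
elevated_like = apply_band_gain(noise, fs, 7000.0, 9000.0, 6.0)
elevated_like = apply_band_gain(elevated_like, fs, 12000.0, 14000.0, -6.0)

rear_change_db = (band_level_db(rear_like, fs, 4000.0, 8000.0)
                  - band_level_db(noise, fs, 4000.0, 8000.0))
```

Comparing `band_level_db` before and after is a quick way to check that the intended level difference was actually imposed on the band.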

I remember seeing some conference papers on reducing front-back confusion (e.g. Balan, O., Moldoveanu, A., & Moldoveanu, F. (2018). A Systematic Review of the Methods and Experiments Aimed to Reduce Front-Back Confusions in the Free-Field and Virtual Auditory Environments. RoCHI, 24-29), but I'm not particularly convinced such methods would work without training. Training is a relevant point in my opinion: if you, for example, enhance the front-back and up-down spectral differences and create a "superhuman HRTF", you can surely train someone to use those enhanced cues and significantly improve their discrimination performance. I'd consider this approach more a form of "sonification" than spatialisation, though, and I don't know whether it would work consistently and repeatedly with untrained individuals.
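
As a rough illustration of what "enhancing the front-back spectral differences" could mean numerically, the sketch below scales the dB difference between a front and a back magnitude response around their mean. This is an assumed, simplified scheme, not a method taken from the cited review; the function name `exaggerate_difference`, the `factor` parameter, and the three-point toy responses are all hypothetical.

```python
import numpy as np

def exaggerate_difference(front_db, back_db, factor):
    """Scale the front-minus-back dB difference by `factor`, keeping the mean response."""
    front_db = np.asarray(front_db, dtype=float)
    back_db = np.asarray(back_db, dtype=float)
    mean_db = 0.5 * (front_db + back_db)
    half_diff_db = 0.5 * (front_db - back_db)
    return mean_db + factor * half_diff_db, mean_db - factor * half_diff_db

# Toy magnitude responses (dB) at three frequencies; the values are made up.
freqs_hz = np.array([2000.0, 6000.0, 13000.0])
front_db = np.array([0.0, 0.0, 0.0])
back_db = np.array([0.0, -4.0, -2.0])

# factor=2.0 doubles the front-back contrast in every band.
front_enh_db, back_enh_db = exaggerate_difference(front_db, back_db, factor=2.0)
```

With `factor=1.0` the responses are returned unchanged, so the parameter acts as a single "how superhuman" control. Whether listeners can use such exaggerated cues without training is exactly the open question raised above.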

Best

Lorenzo

--
Lorenzo Picinali
Professor in Spatial Acoustics and Immersive Audio
Dyson School of Design Engineering
Imperial College London
Dyson Building
Imperial College Road
South Kensington, SW7 2DB, London
E: l.picinali@xxxxxxxxxxxxxx

https://profiles.imperial.ac.uk/l.picinali
https://www.axdesign.co.uk/

https://www.sonicom.eu/

From: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx> on behalf of Dingding Yao <simon.ydd@xxxxxxxxx>
Sent: 07 August 2025 7:41 AM
To: AUDITORY@xxxxxxxxxxxxxxx <AUDITORY@xxxxxxxxxxxxxxx>
Subject: [AUDITORY] Seeking advice on improving localization clarity in static binaural playback with non-individualized HRTFs

Dear list,

I hope this message finds you well.

I am reaching out to seek your advice on a question related to binaural reproduction. As we all know, localization ambiguities—especially front-back and up-down confusions—are a common challenge when using HRTF-based binaural playback. Previous literature has pointed out several influencing factors, such as dynamic cues (e.g., head rotation), individualized HRTFs, and headphone equalization.

However, I am particularly interested in whether it is still possible to achieve a clear sense of directional perception under static listening conditions with non-individualized HRTFs. Specifically, even if precise localization is not attained, might there be techniques or strategies that allow listeners to clearly and reliably distinguish between front and back, as well as between above and below?

Any insights, relevant experiences, or useful references would be greatly appreciated. I would also welcome any discussion or perspectives on this topic.

Best regards,
Dingding Yao