Re: [AUDITORY] Seeking advice on improving localization clarity in static binaural playback with non-individualized HRTFs (Brian FG Katz)


Subject: Re: [AUDITORY] Seeking advice on improving localization clarity in static binaural playback with non-individualized HRTFs
From:    Brian FG Katz  <brian.katz@xxxxxxxx>
Date:    Fri, 8 Aug 2025 08:38:30 +0200

Hello Dingding,

There are various elements which improve static binaural rendering. Rather than repeat them here, I invite you to check some more of the literature, such as the chapter that Rozenn and I wrote for this book:

    B. F. G. Katz and R. Nicol, “Binaural spatial reproduction,” in Sensory Evaluation of Sound (N. Zacharov, ed.), pp. 349–388, Boca Raton: CRC Press, 2019, <http://www.crcpress.com/Sensory-Evaluation-of-Sound/Zacharov/p/book/9781498751360> (url). ISBN 978-1-4987-5136-0.

I can also say (as another small element of self-promotion) that we have incorporated many of the techniques in our free binaural VST plug-in (http://anaglyph.dalembert.upmc.fr/) with this goal in mind. I invite you to try it out.

From your text, I have the impression that you regard “clear and reliable” responses (repeatability of subjects) as distinct from the actual desired rendering location. This is not really compatible with your stated goal of distinguishing between front and back, as well as between above and below, unless you mean in their personal remapping of the perceived HRTF space. So, are you looking for “reliable” or “correct”? This is not exactly clear in your statements.

Best of luck,
-Brian
--
Brian FG Katz, Research Director, CNRS
Groupe Lutheries - Acoustique – Musique
Sorbonne Université, CNRS, UMR 7190, Institut Jean Le Rond ∂'Alembert
http://www.dalembert.upmc.fr/home/katz

From: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxx> On behalf of Dingding Yao
Sent: Thursday, 7 August 2025 08:41
To: AUDITORY@xxxxxxxx
Subject: [AUDITORY] Seeking advice on improving localization clarity in static binaural playback with non-individualized HRTFs

Dear list,

I hope this message finds you well.

I am reaching out to seek your advice on a question related to binaural reproduction. As we all know, localization ambiguities, especially front-back and up-down confusions, are a common challenge when using HRTF-based binaural playback. Previous literature has pointed out several influencing factors, such as dynamic cues (e.g., head rotation), individualized HRTFs, and headphone equalization.

However, I am particularly interested in whether it is still possible to achieve a clear sense of directional perception under static listening conditions with non-individualized HRTFs. Specifically, even if precise localization is not attained, might there be techniques or strategies that allow listeners to clearly and reliably distinguish between front and back, as well as between above and below?

Any insights, relevant experiences, or useful references would be greatly appreciated. I would also welcome any discussion or perspectives on this topic.

Best regards,
Dingding Yao


This message came from the mail archive
postings/2025/
maintained by:
Dan Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University