
Re: [AUDITORY] Seeking advice on using ANF firing rate to resolve front-back confusion in sound localization model



Dear Dick,

Thank you for your suggestions.

I am a little confused about the concept of "rate-vs-place profiles." Could you please provide more references or explain it a bit more? I believe spectral cues will definitely help, but I haven't found a proper way to manipulate them yet.

I've attempted to use head rotation to determine whether the sound source is in front or behind, based on the MSO firing rate, but this requires turning the head by +/- 90 degrees each time. That is why I am seeking alternative methods to address this issue.

Best regards,

Qin


From: Richard F. Lyon <dicklyon@xxxxxxx>
Sent: Thursday, 27 February 2025 12:08:01
To: Qin Liu
Cc: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: Seeking advice on using ANF firing rate to resolve front-back confusion in sound localization model
 
Qin,

The rate-vs-place profiles from the two ears may have most of what you need to supplement the MSO's output that represents ITD, which is mostly a left-right cue.  The cues for elevation, including front-back, are generally thought to be more subtle spectral features, related to the individual's HRTF, and are not as robust as the ITD cues.  ILD cues are of intermediate robustness, I think, but still primarily left-right.
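One way to picture a rate-vs-place profile is as a vector of per-band levels along the cochlear frequency axis. Below is a crude sketch, assuming log energy in log-spaced FFT bands as a stand-in for an auditory-nerve excitation pattern (the band count, edges, and the tone-based example are illustrative assumptions, not part of the suggestion above):

```python
import numpy as np

def band_profile(x, fs, n_bands=32, fmin=200.0, fmax=16000.0):
    """Log energy in log-spaced frequency bands: a crude stand-in for a
    rate-vs-place (excitation-pattern) profile from one ear."""
    spec = np.abs(np.fft.rfft(x)) ** 2
    freqs = np.fft.rfftfreq(len(x), 1.0 / fs)
    edges = np.geomspace(fmin, fmax, n_bands + 1)
    return np.array([10.0 * np.log10(spec[(freqs >= lo) & (freqs < hi)].sum() + 1e-12)
                     for lo, hi in zip(edges[:-1], edges[1:])])

# Comparing left and right profiles gives a per-band level difference,
# a coarse spectral/ILD cue that can supplement the MSO's ITD output.
fs = 48000
t = np.arange(fs) / fs
left = np.sin(2 * np.pi * 1000 * t)   # hypothetical 1 kHz tone, louder at the left ear
right = 0.5 * left
profile_diff = band_profile(left, fs) - band_profile(right, fs)
```

In this toy example, the band containing 1 kHz shows the expected 20*log10(2) ≈ 6 dB interaural level difference, while near-empty bands show none; a real model would replace the FFT bands with a cochlear filterbank.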

I hadn't thought about doing what Jan Schnupp suggested, looking at slightly different cones of confusion for different frequencies, but that sounds like another way to conceptualize the subtle HRTF-dependent spectral cues.

So you don't have to use "HRTF template matching", but you do have to use HRTFs.

If you want to do this in anything like the real world, as opposed to an anechoic environment, you'll need a strong precedence effect to pay attention to the first arrival and ignore echoes, or something along those lines.

Also, in the real world, we usually resolve front-back confusion quickly and easily by rotating our heads a little.  The effect of rotation on ITD is opposite for front vs back, so this gives a very robust front-back cue; up-down is still hard.
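That sign flip can be sketched with a toy far-field spherical-head ITD model (the formula ITD = (2a/c)*sin(azimuth) and the head radius are illustrative assumptions, not a claim about any particular model):

```python
import math

def itd(azimuth_deg, head_radius=0.0875, c=343.0):
    # Toy far-field spherical-head approximation: ITD = (2a/c)*sin(azimuth).
    # head_radius (m) and c (m/s) are assumed illustrative values.
    return (2.0 * head_radius / c) * math.sin(math.radians(azimuth_deg))

# A front source at 30 deg and its back mirror at 150 deg give the same ITD
# (the front-back confusion), since sin(30 deg) == sin(150 deg).
front_world, back_world = 30.0, 150.0

# Rotate the head 10 deg toward the source side: every source's head-relative
# azimuth drops by 10 deg, but the ITD moves in opposite directions.
delta = 10.0
d_front = itd(front_world - delta) - itd(front_world)  # ITD magnitude shrinks
d_back = itd(back_world - delta) - itd(back_world)     # ITD magnitude grows
```

The opposite signs of d_front and d_back are the robust front-back cue described above; elevation along the cone of confusion is untouched by this model, which is why up-down remains hard.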

Dick


On Wed, Feb 26, 2025 at 4:21 PM Qin Liu <000003c563e12bd3-dmarc-request@xxxxxxxxxxxxxxx> wrote:

Dear auditory list,


I am currently working on a project involving sound localization using firing rates from auditory nerve fibers (ANFs) and the medial superior olive (MSO). However, I have encountered an issue: I cannot distinguish front from back sound sources using MSO firing rates alone; they only resolve left from right.

I am considering whether auditory nerve fiber (ANF) firing rates might provide a solution, but I am uncertain how to utilize them effectively. For instance, I have experimented with analyzing the positive gradients of ANF firing rates but have not yet achieved meaningful results.

Could anyone suggest an auditory metric derived from binaural signals, ANF firing rates, or MSO that could classify front/back sources without relying on HRTF template matching? Any insights or alternative approaches would be invaluable to my work.

Thank you in advance. I sincerely appreciate any guidance you can offer.

Best regards,

Qin Liu
Doctoral Student
Laboratory of Wave Engineering, École Polytechnique Fédérale de Lausanne (EPFL)
Email: qin.liu@xxxxxxx