Dear all,
Thank you all very much for your responses.
It seems that there is plenty of literature on the effect
of visual stimuli on auditory localisation. For anyone
interested, relevant keywords for this topic include:
'visual capture', 'visual dominance', 'visual bias' and
'cross-modal bias'. Relevant papers may also be found
under: 'multimodal integration', 'multisensory
integration' and 'cross-modal plasticity'.
I have found that a common practice is to present only one
visual cue and one auditory cue at a time. If the two
stimuli are close to being spatially congruent, the subject
will probably bind them together unconsciously, producing
the 'visual capture' effect, in which the visual stimulus
dominates the auditory one. This may not happen if the
spatial discrepancy between the two stimuli is
noticeable [1, 2].
However, in the scenario that I proposed originally there
are two auditory stimuli: one of them is explicitly
associated with the visual cue and would act as an
'anchor', while the other one has to be located.
Intuitively, one might expect that if the two auditory cues
are perceived as separate sources, the risk of visual
dominance should be small.
As has been pointed out, another part of the question
concerns 'relative localisation' and comparative judgements,
particularly in multimodal scenarios. How good are we at
estimating the location of two sound sources with respect to
each other? And what happens if we introduce visual cues?
All suggestions are welcome! Thank you all again for your
contributions.
Kind regards,
Isaac Engel
References:
[1] Bosen, Adam K., et al. 2016. “Comparison of Congruence
Judgment and Auditory Localization Tasks for Assessing the
Spatial Limits of Visual Capture.” Biological Cybernetics
110(6): 455–71.
[2] Berger, Christopher C., et al. 2018. “Generic HRTFs May
Be Good Enough in Virtual Reality. Improving Source
Localization through Cross-Modal Plasticity.” Frontiers in
Neuroscience 12: 21.
From: Engel Alonso-Martinez, Isaac
Sent: 24 February 2018 19:08
To: auditory@xxxxxxxxxxxxxxx
Subject: Visual references in sound localisation
I am interested in the impact of audiovisual
references in sound localisation tasks.
For instance, let's say that you are
presented with two different continuous sounds
(e.g., speech) coming from sources A and B,
which are in different locations. While source
A is clearly visible to you, B is invisible and
you are asked to estimate its location. Will
source A act as a spatial reference, helping
you make a more accurate estimate, or will it
be a distraction that makes the task more
difficult?
If anyone can point to some literature on
this, it would be greatly appreciated.