Re: [AUDITORY] converting masking thresholds to masker levels of speech sounds

Subject: Re: [AUDITORY] converting masking thresholds to masker levels of speech sounds

From: Thibaud Necciari <thibaud.necciari@xxxxxxxxxxxxxxxx>

Date: Thu, 30 Jan 2020 14:24:20 +0000

Accept-language: fr-FR, en-US

Approved-by: thibaud.necciari@xxxxxxxxxxxxxxxx

Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=lewitt-audio.com; dmarc=pass action=none header.from=lewitt-audio.com; dkim=pass header.d=lewitt-audio.com; arc=none

Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=MYAZi72NrycSe1vgJ9om8EKKYR36sg/mBMnVUrioxz4=; b=Qel+kEtBkHcXH2Yof+liLKqPi/b5jOPYecZYAHLIF0zpFK9yOjFmTmn5TvqZH9RoO9ykdMVJLR3ifJr4xmZ9fcvlUry1TuVFKsoWN0oQzFzXXPSPziROBfL2IrP+xeG63RiUSf0JHyTlVedpggShqBu3cCxBvirGJnDg9ZgL++Gv97YyTBjULsXE4wOjyNqavy1Mk8K28bJvgzgsFE6FBvT+Lu5kqFYfx8ZrbNbBe+7d8ldGQ66h3jr1WMUrEwjtXCwcVAFO/j0ohn7H5ZEMt6SOmmY8p2SAgmOSt5xPWDcaot135BWHEPrxdd4s3UuFt8vzb7KhYFdI9mPP/gSoYg==

Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=IKMVxBfuRmipupNMQfMF5aeDaQk3Z7RmBJY02gJvCWEob63/uEFcE8MFVsty6HuV/kkcuuButYixCmKFjAYrIa/EWHOiS63TTPVwT2qLAJjQ9KKo7E6J6leGyk43xcYMWQTJ3f7dVwmfdtffU9UdYb8hGW9gDNOsm/hlvZFnInhbj0tueDQ6pnJtg+YWyUyU+MzflBhWc/W74lKqIzXyD2AOvgGkMu2CwRrL+8q3qR51WXZeGRvYgIpgcMABZwluESLFoNeQqnhXBy3BAa8p/1pqbpUOjHFIYxbOo0CotipJik52f1REKHRAl5c0Q1FtkTITlFt/pDqLUi5QhbFkPw==

Authentication-results: mx.google.com; arc=fail (signature failed); spf=pass (google.com: domain of owner-auditory@xxxxxxxxxxxxxxx designates 132.206.27.102 as permitted sender) smtp.mailfrom=owner-auditory@xxxxxxxxxxxxxxx

Delivered-to: dan.ellis@xxxxxxxxx

In-reply-to: <AM0PR0102MB3524861E269ECB69F60B694ED8050@AM0PR0102MB3524.eurprd01.prod.exchangelabs.com>

List-archive: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

List-help: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>, <mailto:LISTSERV@LISTS.MCGILL.CA?body=INFO%20AUDITORY>

List-owner: <mailto:AUDITORY-request@LISTS.MCGILL.CA>

List-subscribe: <mailto:AUDITORY-subscribe-request@LISTS.MCGILL.CA>

List-unsubscribe: <mailto:AUDITORY-unsubscribe-request@LISTS.MCGILL.CA>

References: <AM0PR0102MB352490918EA4AF38C55FA31DD80B0@AM0PR0102MB3524.eurprd01.prod.exchangelabs.com>,<CACPGB==N3jV5DzRYVta1AENQyRRoH60OtQ_Wxo3gvZgWyPrTYg@mail.gmail.com> <AM0PR0102MB3524861E269ECB69F60B694ED8050@AM0PR0102MB3524.eurprd01.prod.exchangelabs.com>

Reply-to: Thibaud Necciari <thibaud.necciari@xxxxxxxxxxxxxxxx>

Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

Thread-index: AQHV1UlE5+/WUr0PE0W1pfEccTNwuagBkO0AgAAMKaWAAajpgA==

Thread-topic: [AUDITORY] converting masking thresholds to masker levels of speech sounds

Dear Mengli,

If you are interested in accounting for both spectral and temporal auditory masking effects, you could have a look at this paper: https://doi.org/10.1371/journal.pone.0166937

It is about a joint time-frequency masking model I’ve worked on in the past. A Matlab/Octave script that allows to compute the masking function is provided as supplementary material. I thought it may help!

Best regards,

Thibaud.

From: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx> On Behalf Of Feng, Mengli (2018)
Sent: Wednesday, 29 January 2020 14:09
To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: [AUDITORY] converting masking thresholds to masker levels of speech sounds

Hi Frederico,

Thanks very much for the code!

I did the same thing using ISO psychoacoustic model 2. Was thinking about using models to account for temporal effect. Have you tried more advanced auditory models?

Best wishes,

Mengli

Mengli Feng

PhD Student

PGR Collective EPMS School Convenor

Audio, Biosignals and Machine Learning Group

Department of Electronic Engineering

Royal Holloway, University of London

Research Interest:

Speech/ voice production and perception

Ongoing Project:

the perceptual effect of Bone-conducted sound of own voice

>>> Pure Page

From: Frederico Pereira <pereira.frederico@xxxxxxxxx>
Sent: Wednesday, January 29, 2020 12:15 pm
To: Feng, Mengli (2018)
Cc: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: [AUDITORY] converting masking thresholds to masker levels of speech sounds

Hi Mengli,

I´m currently working on something similar and I´ve been developing on top of the code and psychoacoustic models based on:

ISO/IEC 11172-3:1993, Information technology – Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s – Part 3: Audio

https://ieeexplore.ieee.org/abstract/document/1296956

and Matlab code provided by:

https://www.petitcolas.net/fabien/software/mpeg/#references

Hoping this is of some help to you.

regards,

Frederico

On Tue, Jan 28, 2020 at 5:19 AM Feng, Mengli (2018) <Mengli.Feng.2018@xxxxxxxxxxxxxxx> wrote:

Dear All,

I am trying to convert masking curves into the frequency responses of the original maskers (single speech sounds). The maskees I am using are narrow band noises at different frequencies.

It has taken me enormous effort to find an auditory model to make accurate predictions, considering the maskers are complex tones with multiple harmonics in high frequency region. Might anyone provide some guidance or advice on finding a suitable model?

Is it even possible to do such prediction knowing only the frequency responses of the maskees and the masking thresholds given that temporal effects would inevitably appear because of the higher harmonics in human speech sounds? Any opinions?

Any suggestion would be greatly appreciated!

Best Regards,

Mengli

--

Mengli Feng

PhD Student

PGR Collective EPMS School Convenor

Audio, Biosignals and Machine Learning Group

Department of Electronic Engineering

Royal Holloway, University of London

Research Interest:

Speech/ voice production and perception

Ongoing Project:

the perceptual effect of Bone-conducted sound of own voice

>>> Pure Page

Frederico Pereira
Mobile:+351937356301
Email:pereira.frederico@xxxxxxxxx