[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [AUDITORY] converting masking thresholds to masker levels of speech sounds



Dear Mengli,

 

If you are interested in accounting for both spectral and temporal auditory masking effects, you could have a look at this paper: https://doi.org/10.1371/journal.pone.0166937

It is about a joint time-frequency masking model I’ve worked on in the past. A Matlab/Octave script that allows to compute the masking function is provided as supplementary material. I thought it may help!

 

Best regards,

 

Thibaud.

 

From: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx> On Behalf Of Feng, Mengli (2018)
Sent: Wednesday, 29 January 2020 14:09
To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: [AUDITORY] converting masking thresholds to masker levels of speech sounds

 

Hi Frederico,

 

Thanks very much for the code!

 

I did the same thing using ISO psychoacoustic model 2. Was thinking about using models to account for temporal effect. Have you tried more advanced auditory models?

 

Best wishes,

Mengli

 

-- 

Mengli Feng

PhD Student

PGR Collective EPMS School Convenor

 

Audio, Biosignals and Machine Learning Group

Department of Electronic Engineering

Royal Holloway, University of London 

 

Research Interest:

Speech/ voice production and perception

Ongoing Project:

the perceptual effect of Bone-conducted sound of own voice

>>> Pure Page

 


From: Frederico Pereira <pereira.frederico@xxxxxxxxx>
Sent: Wednesday, January 29, 2020 12:15 pm
To: Feng, Mengli (2018)
Cc: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: [AUDITORY] converting masking thresholds to masker levels of speech sounds

 

Hi Mengli,

 

I´m currently working on something similar and I´ve been developing on top of the code and psychoacoustic models based on:

ISO/IEC 11172-3:1993, Information technology – Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s – Part 3: Audio

 

Hoping this is of some help to you.

 

regards,

 

Frederico

 

On Tue, Jan 28, 2020 at 5:19 AM Feng, Mengli (2018) <Mengli.Feng.2018@xxxxxxxxxxxxxxx> wrote:

Dear All,

 

I am trying to convert masking curves into the frequency responses of the original maskers (single speech sounds). The maskees I am using are narrow band noises at different frequencies.

 

It has taken me enormous effort to find an auditory model to make accurate predictions, considering the maskers are complex tones with multiple harmonics in high frequency region. Might anyone provide some guidance or advice on finding a suitable model? 

 

Is it even possible to do such prediction knowing only the frequency responses of the maskees and the masking thresholds given that temporal effects would inevitably appear because of the higher harmonics in human speech sounds? Any opinions?

 

Any suggestion would be greatly appreciated!

 

Best Regards,

Mengli

 

 

-- 

Mengli Feng

PhD Student

PGR Collective EPMS School Convenor

 

Audio, Biosignals and Machine Learning Group

Department of Electronic Engineering

Royal Holloway, University of London 

 

Research Interest:

Speech/ voice production and perception

Ongoing Project:

the perceptual effect of Bone-conducted sound of own voice

>>> Pure Page

 



--

Frederico Pereira
Mobile:+351937356301
Email:pereira.frederico@xxxxxxxxx