Re: [AUDITORY] converting masking thresholds to masker levels of speech sounds ("Feng, Mengli (2018)")

Subject: Re: [AUDITORY] converting masking thresholds to masker levels of speech sounds
From: "Feng, Mengli (2018)" <Mengli.Feng.2018@xxxxxxxx>
Date: Wed, 29 Jan 2020 13:08:40 +0000
List-Archive: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

Hi Frederico,

Thanks very much for the code!

I did the same thing using ISO psychoacoustic model 2. I was thinking about using models that account for temporal effects. Have you tried more advanced auditory models?

Best wishes,
Mengli

--
Mengli Feng
PhD Student
PGR Collective EPMS School Convenor

Audio, Biosignals and Machine Learning Group
Department of Electronic Engineering
Royal Holloway, University of London

Research Interest:
Speech/voice production and perception
Ongoing Project:
the perceptual effect of bone-conducted sound of own voice

________________________________
From: Frederico Pereira <pereira.frederico@xxxxxxxx>
Sent: Wednesday, January 29, 2020 12:15 pm
To: Feng, Mengli (2018)
Cc: AUDITORY@xxxxxxxx
Subject: Re: [AUDITORY] converting masking thresholds to masker levels of speech sounds

Hi Mengli,

I'm currently working on something similar, and I've been building on the code and psychoacoustic models based on:

ISO/IEC 11172-3:1993, Information technology – Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s – Part 3: Audio
https://ieeexplore.ieee.org/abstract/document/1296956

and Matlab code provided by:
https://www.petitcolas.net/fabien/software/mpeg/#references

Hoping this is of some help to you.

regards,

Frederico

On Tue, Jan 28, 2020 at 5:19 AM Feng, Mengli (2018) <Mengli.Feng.2018@xxxxxxxx> wrote:

Dear All,

I am trying to convert masking curves into the frequency responses of the original maskers (single speech sounds). The maskees I am using are narrow-band noises at different frequencies.

It has taken me enormous effort to find an auditory model that makes accurate predictions, considering that the maskers are complex tones with multiple harmonics in the high-frequency region. Might anyone provide some guidance or advice on finding a suitable model?

Is it even possible to make such a prediction knowing only the frequency responses of the maskees and the masking thresholds, given that temporal effects would inevitably appear because of the higher harmonics in human speech sounds? Any opinions?

Any suggestion would be greatly appreciated!

Best Regards,
Mengli

--
Frederico Pereira
Mobile: +351937356301
Email: pereira.frederico@xxxxxxxx

This message came from the mail archive
src/postings/2020/
maintained by:

DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University