Re: Reference for typical SNRs is public spaces (John Culling )


Subject: Re: Reference for typical SNRs is public spaces
From:    John Culling  <CullingJ@xxxxxxxx>
Date:    Thu, 23 Jan 2014 10:44:14 +0000
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

--_000_48f0d89a520148ff9deb3a6744b3a723DB3PR02MB091eurprd02pro_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Hi Andy. This is a very tricky question. I am not aware of any definitive data that = really addresses the issue adequately. Laboratory studies tend to use single sound sources in anechoic conditions. The auditory system copes very well in these conditions. The results indica= te that listeners can cope with very low SNRs (e.g. -10 dB for spatialised spe= ech- shaped noise interference, and lower for speech interferers). Moreover, som= e studies have used several interferers (e.g. Peissig and Kollmeier '97, Hawl= ey et al. '04), and shown a gradual elevation in SRT with increasing numbers of inter= ferers. Simulating a more complex scene, like a restaurant with multiple interferer= s and reverberation produces progressive degradation, though. We have been simulating up to eight interfering voices from a variety of speakers with r= everb based on real-room binaural room impulse responses. SRTs are around -2 to -= 3 dB with eigth interfering voices. I haven't begun to write this work up yet= , but the results are not disimilar to those from a cruder preliminary study publ= ished here. Culling, J. F. (2013). "Energetic and informational masking in a simulated = restaurant environment" in Moore, B. C. J., Carlyon, R. P., and Gockel, H.,= Patterson, R. D. and Winter, I. M.. (eds) Basic Aspects of Hearing: Physio= logy and Perception (Springer, New York) There remain limitations to this approach, of course. The technique remains dependent on standard target speech materials (IEEE/Harvard sentences) that are not very typical of normal conversation - particularly lacking a conversational context. It is also unclear whether 50% keyword intelligibil= ity is a tolerable level of comprehension for conversation. Karolina's study has other limitations. If I remember correctly, the materi= al was recorded from hearing impaired individuals, who may avoid the more severe listening conditions into which normally hearing people thrust themselves. Also, the method of establishing the SNR from the recordings would probably become impossible below a certain SNR, as it relies on a researcher judging from the recordings alone whether or not target speech is present. Noise level is collected from epochs without target speech, and speech level is derived by subtraction. Nonetheless, both approaches indicate that real-world SNRs are unlikely to be very near -10 dB, but be somewhere around 0 dB. Karolina's work suggests a bit above, mine a bit below. I guess what is really needed is for pairs of interlocutors to be wired up = with close microphones at the mouth (to establish reliably who is talking when) and at their ears, and then to go out for the night and try to produce norm= al speaking and listening behaviour. Perhaps after a few nights of this they would habituate to all the kit, and produce data that will get us closer to= a true answer. John From: AUDITORY - Research in Auditory Perception [mailto:AUDITORY@xxxxxxxx= ILL.CA] On Behalf Of Andy Sabin Sent: 22 January 2014 17:53 To: AUDITORY@xxxxxxxx Subject: Reference for typical SNRs is public spaces Hi List, Can anyone point me to a reference showing SNRs that are typically observed= in public spaces (e.g., restaurants, bars ...etc)? I can find this info fo= r overall SPL, but am having a hard time finding it for SNR. Thanks Andy Sabin --_000_48f0d89a520148ff9deb3a6744b3a723DB3PR02MB091eurprd02pro_ Content-Type: text/html; charset="us-ascii" Content-Transfer-Encoding: quoted-printable <html xmlns:v=3D"urn:schemas-microsoft-com:vml" xmlns:o=3D"urn:schemas-micr= osoft-com:office:office" xmlns:w=3D"urn:schemas-microsoft-com:office:word" = xmlns:m=3D"http://schemas.microsoft.com/office/2004/12/omml" xmlns=3D"http:= //www.w3.org/TR/REC-html40"> <head> <meta http-equiv=3D"Content-Type" content=3D"text/html; charset=3Dus-ascii"= > <meta name=3D"Generator" content=3D"Microsoft Word 15 (filtered medium)"> <style><!-- /* Font Definitions */ @xxxxxxxx {font-family:"Cambria Math"; panose-1:2 4 5 3 5 4 6 3 2 4;} @xxxxxxxx {font-family:Calibri; panose-1:2 15 5 2 2 2 4 3 2 4;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {margin:0cm; margin-bottom:.0001pt; font-size:12.0pt; font-family:"Times New Roman","serif";} a:link, span.MsoHyperlink {mso-style-priority:99; color:#0563C1; text-decoration:underline;} a:visited, span.MsoHyperlinkFollowed {mso-style-priority:99; color:#954F72; text-decoration:underline;} span.EmailStyle17 {mso-style-type:personal-reply; font-family:"Calibri","sans-serif"; color:#1F497D;} .MsoChpDefault {mso-style-type:export-only; font-family:"Calibri","sans-serif"; mso-fareast-language:EN-US;} @xxxxxxxx WordSection1 {size:612.0pt 792.0pt; margin:72.0pt 72.0pt 72.0pt 72.0pt;} div.WordSection1 {page:WordSection1;} --></style><!--[if gte mso 9]><xml> <o:shapedefaults v:ext=3D"edit" spidmax=3D"1026" /> </xml><![endif]--><!--[if gte mso 9]><xml> <o:shapelayout v:ext=3D"edit"> <o:idmap v:ext=3D"edit" data=3D"1" /> </o:shapelayout></xml><![endif]--> </head> <body lang=3D"EN-GB" link=3D"#0563C1" vlink=3D"#954F72"> <div class=3D"WordSection1"> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">Hi Andy.<o:p= ></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US"><o:p>&nbsp;<= /o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">This is a ve= ry tricky question. I am not aware of any definitive data that really<o:p><= /o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">addresses th= e issue adequately. <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US"><o:p>&nbsp;<= /o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">Laboratory s= tudies tend to use single sound sources in anechoic conditions.<o:p></o:p><= /span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">The auditory= system copes very well in these conditions. The results indicate <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">that listene= rs can cope with very low SNRs (e.g. -10 dB for spatialised speech-<o:p></o= :p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">shaped noise= interference, and lower for speech interferers). Moreover, some <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">studies have= used several interferers (e.g. Peissig and Kollmeier '97, Hawley et al. <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">'04), and sh= own a gradual elevation in SRT with increasing numbers of interferers.<o:p>= </o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">Simulating a= more complex scene, like a restaurant with multiple interferers<o:p></o:p>= </span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">and reverber= ation produces progressive degradation, though. We have been<o:p></o:p></sp= an></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">simulating u= p to eight interfering voices from a variety of speakers with reverb<o:p></= o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">based on rea= l-room binaural room impulse responses. SRTs are around -2 to -3<o:p></o:p>= </span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">dB with eigt= h interfering voices. I haven't begun to write this work up yet, but <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">the results = are not disimilar to those from a cruder preliminary study published<o:p></= o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US">here.<o:p></= o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US"><o:p>&nbsp;<= /o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;">Culling, J. F<b>. </b>(2013). &#8220;Energetic and informational masking in a simulated resta= urant environment&#8221; in Moore, B. C. J., Carlyon, R. P., and Gockel, H.= , Patterson, R. D. and Winter, I. M.. (eds) <i>Basic Aspects of Hearing: Physiology and Perception</i> (Springer, New Y= ork)<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">There remain limitations to this approach, of course. Th= e technique remains<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">dependent on standard target speech materials (IEEE/Harv= ard sentences)<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">that are not very typical of normal conversation - parti= cularly lacking a <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">conversational context. It is also unclear whether 50% k= eyword intelligibility<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">is a tolerable level of comprehension for conversation.<= o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">Karolina's study has other limitations. If I remember co= rrectly, the material<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">was recorded from hearing impaired individuals, who may = avoid the more<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">severe listening conditions into which normally hearing = people thrust <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">themselves. Also, the method of establishing the SNR fro= m the recordings<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">would probably become impossible below a certain SNR, as= it relies on<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">a researcher judging from the recordings alone whether o= r not target <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">speech is present. Noise level is collected from epochs = without target<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">speech, and speech level is derived by subtraction.<o:p>= </o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">Nonetheless, both approaches indicate that real-world SN= Rs are unlikely<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">to be very near -10 dB, but be somewhere around 0 dB. Ka= rolina's work<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">suggests a bit above, mine a bit below.<o:p></o:p></span= ></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">I guess what is really needed is for pairs of interlocut= ors to be wired up with <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">close microphones at the mouth (to establish reliably wh= o is talking when) <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">and at their ears, and then to go out for the night and = try to produce normal<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">speaking and listening behaviour. Perhaps after a few ni= ghts of this they <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">would habituate to all the kit, and produce data that wi= ll get us closer to a <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">true answer. <o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-family:&quot;Calibri&quot;,&quot= ;sans-serif&quot;">John<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca= libri&quot;,&quot;sans-serif&quot;;mso-fareast-language:EN-US"><o:p>&nbsp;<= /o:p></span></p> <p class=3D"MsoNormal"><b><span lang=3D"EN-US" style=3D"font-size:11.0pt;fo= nt-family:&quot;Calibri&quot;,&quot;sans-serif&quot;">From:</span></b><span= lang=3D"EN-US" style=3D"font-size:11.0pt;font-family:&quot;Calibri&quot;,&= quot;sans-serif&quot;"> AUDITORY - Research in Auditory Perception [mailto:= AUDITORY@xxxxxxxx <b>On Behalf Of </b>Andy Sabin<br> <b>Sent:</b> 22 January 2014 17:53<br> <b>To:</b> AUDITORY@xxxxxxxx<br> <b>Subject:</b> Reference for typical SNRs is public spaces<o:p></o:p></spa= n></p> <p class=3D"MsoNormal"><o:p>&nbsp;</o:p></p> <div> <p class=3D"MsoNormal">Hi List,&nbsp;<o:p></o:p></p> <div> <p class=3D"MsoNormal"><o:p>&nbsp;</o:p></p> </div> <div> <p class=3D"MsoNormal">Can anyone point me to a reference showing SNRs that= are typically observed in public spaces (e.g., restaurants, bars ...etc)? = I can find this info for overall SPL, but am having a hard time finding it = for SNR.&nbsp;<o:p></o:p></p> </div> <div> <p class=3D"MsoNormal"><o:p>&nbsp;</o:p></p> </div> <div> <p class=3D"MsoNormal">Thanks<o:p></o:p></p> </div> <div> <p class=3D"MsoNormal">Andy Sabin<o:p></o:p></p> </div> </div> </div> </body> </html> --_000_48f0d89a520148ff9deb3a6744b3a723DB3PR02MB091eurprd02pro_--


This message came from the mail archive
/var/www/postings/2014/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University