Subject: Re: [AUDITORY] High fidelity cocktail party recordings From: Philip Robinson <philrob22@xxxxxxxx> Date: Mon, 14 Sep 2020 17:37:29 -0700--0000000000000db53805af4f5c21 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable I really should to write a more detailed post to the Auditory list about this, but we just released an A/V speech corpus that may be suitable. Details and link in JASA letter to the editor: https://asa.scitation.org/doi/full/10.1121/10.0001670 Best, Philip Robinson Audio Research Manager *facebook* Reality Labs Philip Robinson On Sun, Sep 13, 2020 at 9:37 PM elif kaplan <elifjk@xxxxxxxx> wrote: > > > ---------- Forwarded message --------- > From: Carol Chermaz <c.chermaz@xxxxxxxx> > Date: Sun, 13 Sep 2020 at 14:20 > Subject: Re: [AUDITORY] High fidelity cocktail party recordings > To: elif kaplan <elifjk@xxxxxxxx> > > > Binaural cafeteria noise with impulse responses, 48 khz: > > http://medi.uni-oldenburg.de/hrir/ > > cocktail party: look into the CHiME Challenge data. Various editions of > the challenge have different type of recordings with multiple microphone > configurations. Not sure if it=E2=80=99s realistic (I remember the data b= eing > recorded with like 6 microphones attached to a tablet). Better than > nothing... > > I am not aware of other corpora, I have searched for that myself 2-3 year= s > ago and that was the best I could find. > Cheers > > > On 13 Sep 2020, at 11:53, elif kaplan <elifjk@xxxxxxxx> wrote: > > > > > > > >> Begin forwarded message: > >> > >> From: "Monson, Brian" <monson@xxxxxxxx> > >> Subject: [AUDITORY] High fidelity cocktail party recordings > >> Date: July 29, 2020 at 8:44:28 PM GMT+3 > >> To: AUDITORY@xxxxxxxx > >> Reply-To: "Monson, Brian" <monson@xxxxxxxx> > >> > >> Dear Colleagues, > >> > >> I am looking for high-fidelity recordings of natural cocktail party or > other complex acoustic background scenes. > >> > >> By =E2=80=9Cnatural=E2=80=9D I mean recorded in actual settings (cockt= ail parties, > restaurants, hospitals, subways/trains/buses, etc.), preferably with a > microphone location that could represent where a human might actually be > listening to the scene (rather than, say, a mic suspended from the ceilin= g > or something similar). > >> > >> By =E2=80=9Cbackground=E2=80=9D I mean true background scenes with no = near-field > talkers speaking directly into the microphone. > >> > >> By =E2=80=9Chigh fidelity=E2=80=9D I mean: > >> Original recording sampling rate at least 44.1 kHz > >> Flat microphone response to 20 kHz > >> At least 16-bit precision preferred > >> > >> COVID is preventing me from making my own recordings, so I=E2=80=99d g= reatly > appreciate it if anyone has any you=E2=80=99d be willing to share (or kno= w of any > publicly available) that meet, or nearly meet, these criteria. > >> > >> Many thanks, > >> > >> Brian > >> > >> > >> Brian B. Monson, PhD > >> > >> Assistant Professor > >> Department of Speech and Hearing Science > >> Neuroscience Program > >> University of Illinois at Urbana-Champaign > >> 901 S Sixth St, Rm 223 > >> Champaign, IL 61820 > >> 217-300-6212 | monson@xxxxxxxx > >> anexlab.shs.illinois.edu > >> > >> > >> <Illinois logo.png> > >> > >> Under the Illinois Freedom of Information Act any written communicatio= n > to or from university employees regarding university business is a public > record and may be subject to public disclosure. > >> > > > > Carol Chermaz > > Centre for Speech Technology Research > The University of Edinburgh > > > > > -- > The University of Edinburgh is a charitable body, registered in > Scotland, with registration number SC005336. > > --0000000000000db53805af4f5c21 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable <div dir=3D"ltr">I really should to write a more detailed post to the Audit= ory list about this, but we just released an A/V speech corpus that may be = suitable. <br>Details and link in JASA letter to the editor: <a href=3D"htt= ps://asa.scitation.org/doi/full/10.1121/10.0001670">https://asa.scitation.o= rg/doi/full/10.1121/10.0001670</a><div><div><br></div><div>Best,<br><p clas= s=3D"MsoNormal" style=3D"background-image:initial;background-position:initi= al;background-size:initial;background-repeat:initial;background-origin:init= ial;background-clip:initial;margin:0in 0in 0.0001pt;font-size:11pt;font-fam= ily:Calibri,sans-serif"><a name=3D"_MailAutoSig"><span style=3D"font-size:1= 0.5pt;font-family:inherit;color:rgb(68,84,106)">Philip Robinson</span></a><= /p> <p class=3D"MsoNormal" style=3D"background-image:initial;background-positio= n:initial;background-size:initial;background-repeat:initial;background-orig= in:initial;background-clip:initial;margin:0in 0in 0.0001pt;font-size:11pt;f= ont-family:Calibri,sans-serif"><span style=3D"font-size:10.5pt;font-family:= inherit;color:rgb(68,84,106)">Audio Research Manager</span></p> <p class=3D"MsoNormal" style=3D"margin:0in 0in 0.0001pt;font-size:11pt;font= -family:Calibri,sans-serif"><b><span style=3D"font-size:18pt;font-family:&q= uot;Microsoft Sans Serif",sans-serif;color:rgb(0,112,192)">facebook</s= pan></b><span style=3D"font-size:18pt;font-family:"Segoe UI Light"= ;,sans-serif;color:rgb(0,112,192)"> </span><span style=3D"font-size:16pt;fo= nt-family:"Segoe UI Light",sans-serif;color:rgb(0,112,192)">Reali= ty Labs</span></p> <p class=3D"MsoNormal" style=3D"margin:0in 0in 0.0001pt;font-size:11pt;font= -family:Calibri,sans-serif">=C2=A0</p><br></div></div></div><br clear=3D"al= l"><div><div dir=3D"ltr" class=3D"gmail_signature">Philip Robinson<br><br><= br><br></div></div><br><br><div class=3D"gmail_quote"><div dir=3D"ltr" clas= s=3D"gmail_attr">On Sun, Sep 13, 2020 at 9:37 PM elif kaplan <<a href=3D= "mailto:elifjk@xxxxxxxx">elifjk@xxxxxxxx</a>> wrote:<br></div><blockqu= ote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px= solid rgb(204,204,204);padding-left:1ex"><div><br></div><div><br><div clas= s=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">---------- Forwarde= d message ---------<br>From: <strong class=3D"gmail_sendername" dir=3D"auto= ">Carol Chermaz</strong> <span dir=3D"auto"><<a href=3D"mailto:c.chermaz= @xxxxxxxx" target=3D"_blank">c.chermaz@xxxxxxxx</a>></span><br>D= ate: Sun, 13 Sep 2020 at 14:20<br>Subject: Re: [AUDITORY] High fidelity coc= ktail party recordings<br>To: elif kaplan <<a href=3D"mailto:elifjk@xxxxxxxx= l.com" target=3D"_blank">elifjk@xxxxxxxx</a>><br></div><br><br>Binaural= cafeteria noise with impulse responses, 48 khz:<br> <br> <a href=3D"http://medi.uni-oldenburg.de/hrir/" rel=3D"noreferrer" target=3D= "_blank">http://medi.uni-oldenburg.de/hrir/</a><br> <br> cocktail party: look into the CHiME Challenge data. Various editions of the= challenge have different type of recordings with multiple microphone confi= gurations. Not sure if it=E2=80=99s realistic (I remember the data being re= corded with like 6 microphones attached to a tablet). Better than nothing..= .<br> <br> I am not aware of other corpora, I have searched for that myself 2-3 years = ago and that was the best I could find.<br> Cheers<br> <br> > On 13 Sep 2020, at 11:53, elif kaplan <<a href=3D"mailto:elifjk@xxxxxxxx= il.com" target=3D"_blank">elifjk@xxxxxxxx</a>> wrote:<br> > <br> > <br> > <br> >> Begin forwarded message:<br> >> <br> >> From: "Monson, Brian" <<a href=3D"mailto:monson@xxxxxxxx= OIS.EDU" target=3D"_blank">monson@xxxxxxxx</a>><br> >> Subject: [AUDITORY] High fidelity cocktail party recordings<br> >> Date: July 29, 2020 at 8:44:28 PM GMT+3<br> >> To: <a href=3D"mailto:AUDITORY@xxxxxxxx" target=3D"_blank">= AUDITORY@xxxxxxxx</a><br> >> Reply-To: "Monson, Brian" <<a href=3D"mailto:monson@xxxxxxxx= LLINOIS.EDU" target=3D"_blank">monson@xxxxxxxx</a>><br> >> <br> >> Dear Colleagues,<br> >> <br> >> I am looking for high-fidelity recordings of natural cocktail part= y or other complex acoustic background scenes.<br> >> <br> >> By =E2=80=9Cnatural=E2=80=9D I mean recorded in actual settings (c= ocktail parties, restaurants, hospitals, subways/trains/buses, etc.), prefe= rably with a microphone location that could represent where a human might a= ctually be listening to the scene (rather than, say, a mic suspended from t= he ceiling or something similar).<br> >> <br> >> By =E2=80=9Cbackground=E2=80=9D I mean true background scenes with= no near-field talkers speaking directly into the microphone.<br> >> <br> >> By =E2=80=9Chigh fidelity=E2=80=9D I mean:<br> >> Original recording sampling rate at least 44.1 kHz<br> >> Flat microphone response to 20 kHz<br> >> At least 16-bit precision preferred<br> >> <br> >> COVID is preventing me from making my own recordings, so I=E2=80= =99d greatly appreciate it if anyone has any you=E2=80=99d be willing to sh= are (or know of any publicly available) that meet, or nearly meet, these cr= iteria.<br> >> <br> >> Many thanks,<br> >> <br> >> Brian<br> >> <br> >> <br> >> Brian B. Monson, PhD<br> >> <br> >> Assistant Professor<br> >> Department of Speech and Hearing Science<br> >> Neuroscience Program<br> >> University of Illinois at Urbana-Champaign<br> >> 901 S Sixth St, Rm 223<br> >> Champaign, IL 61820<br> >> 217-300-6212 | <a href=3D"mailto:monson@xxxxxxxx" target=3D"_b= lank">monson@xxxxxxxx</a><br> >> <a href=3D"http://anexlab.shs.illinois.edu" rel=3D"noreferrer" tar= get=3D"_blank">anexlab.shs.illinois.edu</a><br> >> <br> >> <br> >> <Illinois logo.png><br> >> <br> >> Under the Illinois Freedom of Information Act any written communic= ation to or from university employees regarding university business is a pu= blic record and may be subject to public disclosure.<br> >> <br> > <br> <br> Carol Chermaz<br> <br> Centre for Speech Technology Research<br> The University of Edinburgh<br> <br> <br> <br> <br> -- <br> The University of Edinburgh is a charitable body, registered in<br> Scotland, with registration number SC005336.<br> <br> </div></div> </blockquote></div> --0000000000000db53805af4f5c21--