[AUDITORY] Announcing WHAM!48kHz - A dataset of high-fidelity ambient backgrounds (Jonathan Le Roux )


Subject: [AUDITORY] Announcing WHAM!48kHz - A dataset of high-fidelity ambient backgrounds
From:    Jonathan Le Roux  <Jonathan.Le-Roux@xxxxxxxx>
Date:    Thu, 8 Oct 2020 17:34:51 -0400

--000000000000be916305b12f9bc4 Content-Type: text/plain; charset="UTF-8" [Apologies for cross posting] Hi all, As a Last Christmas present in the WHAM! series, we have decided to Make It Big and release WHAM!48kHz, a high-fidelity version of the ambient background noise recordings originally used for the WSJ0 Hipster Ambient Mixtures (WHAM!) dataset. The noise audio was collected by our definitely not Careless Whisper.ai collaborators at various urban locations throughout the San Francisco Bay Area in late 2018. The environments primarily consist of restaurants, cafes, bars, and parks. Audio was recorded using an Apogee Sennheiser binaural microphone (without a dummy head) on a tripod between 1.0 and 1.5 meters off the ground, at a sampling rate of 48 kHz and with 24 bit precision. Unlike the original WHAM! noise, which was segmented into clips with lengths corresponding to specific WSJ0 utterances, WHAM!48kHz contains approximately 78 hours of raw noise recordings that were only segmented to remove any regions containing intelligible speech. WHAM! is a joint effort between Mitsubishi Electronics Research Laboratories (MERL) and Whisper.ai. The data is available at: http://wham.whisper.ai/ Cheers, Jonathan, on behalf of The WHAM! Team Jonathan Le Roux <Jonathan.Le-Roux@xxxxxxxx> Senior Principal Research Scientist, Speech & Audio Senior Team Leader MERL - Mitsubishi Electric Research Laboratories 201 Broadway, 8th Floor, Cambridge, MA 02139 Tel.: +1-617-621-7547 Fax: +1-617-621-7550 --000000000000be916305b12f9bc4 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable <div dir=3D"ltr"><div>[Apologies for cross posting] <br><br>Hi all,<br><br>= As a Last Christmas present in the WHAM! series, we have decided to Make It= Big and release WHAM!48kHz, a high-fidelity version of the ambient backgro= und noise recordings originally used for the WSJ0 Hipster Ambient Mixtures = (WHAM!) dataset. <br><br>The noise audio was collected by our definitely no= t Careless Whisper.ai collaborators at various urban locations throughout t= he San Francisco Bay Area in late 2018. The environments primarily consist = of restaurants, cafes, bars, and parks. Audio was recorded using an Apogee = Sennheiser binaural microphone (without a dummy head) on a tripod between 1= .0 and 1.5 meters off the ground, at a sampling rate of 48 kHz and with 24 = bit precision. <br><br>Unlike the original WHAM! noise, which was segmented= into clips with lengths corresponding to specific WSJ0 utterances, WHAM!48= kHz contains approximately 78 hours of raw noise recordings that were only = segmented to remove any regions containing intelligible speech. <br><br>WHA= M! is a joint effort between Mitsubishi Electronics Research Laboratories (= MERL) and Whisper.ai.=C2=A0 <br></div><div>The data is available at: <a hre= f=3D"http://wham.whisper.ai/">http://wham.whisper.ai/</a> <br><br>Cheers, <= br>Jonathan, on behalf of The WHAM! Team <br></div><div><br></div><br clear= =3D"all"><div><div dir=3D"ltr" class=3D"gmail_signature" data-smartmail=3D"= gmail_signature"><div dir=3D"ltr"><div><div dir=3D"ltr"><div><div dir=3D"lt= r"><font color=3D"#888888">Jonathan Le Roux &lt;<a href=3D"mailto:Jonathan.= Le-Roux@xxxxxxxx" target=3D"_blank">Jonathan.Le-Roux@xxxxxxxx</= a>&gt;<br>Senior Principal Research Scientist, Speech &amp; Audio Senior Te= am Leader<br>MERL - Mitsubishi Electric Research Laboratories<br>201 Broadw= ay, 8th Floor, Cambridge, MA 02139<br></font><font color=3D"#888888"><font = color=3D"#888888">Tel.: <a value=3D"+16176217547">+1-617-621-7547</a>=C2=A0= </font></font><font color=3D"#888888"><font color=3D"#888888">Fax: +1-617-= 621-7550</font></font><br><font color=3D"#888888"><font color=3D"#888888"><= br></font></font></div></div></div></div></div></div></div></div> --000000000000be916305b12f9bc4--


This message came from the mail archive
src/postings/2020/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University