Release of UrbanSound: a dataset and taxonomy for urban sound research (Justin Salamon)


Subject: Release of UrbanSound: a dataset and taxonomy for urban sound research
From:    Justin Salamon  <justin.salamon@xxxxxxxx>
Date:    Thu, 23 Oct 2014 10:29:40 -0400
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

*** Apologies for any cross-posting ***

Dear list,

We are pleased to announce the release of UrbanSound <http://serv.cusp.nyu.edu/projects/urbansounddataset/>, a dataset containing 27 hours of field recordings with over 3000 labelled sound source occurrences from 10 sound classes. The dataset focuses on sounds that occur in urban acoustic environments.

To facilitate comparable research on urban sound source classification, we are also releasing a second version of this dataset, UrbanSound8K <http://serv.cusp.nyu.edu/projects/urbansounddataset/>, with 8732 excerpts limited to 4 seconds (also with source labels) and pre-sorted into 10 stratified folds. In addition to the source ID, both datasets include a (subjective) salience label for each source occurrence: foreground / background.

The datasets are released for research purposes under a Creative Commons Attribution Noncommercial License, and are available online at the dataset companion website:

http://serv.cusp.nyu.edu/projects/urbansounddataset/

This companion website also contains further information about each dataset, including the Urban Sound Taxonomy <http://serv.cusp.nyu.edu/projects/urbansounddataset/taxonomy.html> from which the 10 sound classes in this dataset were selected.

The datasets and taxonomy will be presented at the ACM Multimedia 2014 <http://acmmm.org/2014/> conference in Orlando in a couple of weeks. For those interested, please see our paper:

J. Salamon, C. Jacoby and J. P. Bello, "A Dataset and Taxonomy for Urban Sound Research <http://serv.cusp.nyu.edu/projects/urbansounddataset/salamon_urbansound_acmmm14.pdf>", in Proc. 22nd ACM International Conference on Multimedia, Orlando USA, Nov. 2014.

For those attending ISMIR 2014 next week, Justin will also be there if you would like to discuss the datasets and taxonomy.

We hope you find the datasets useful for your work and look forward to seeing some of you at ISMIR and ACM-MM in the coming weeks.

Best,

Justin, Christopher and Juan.

--
Justin Salamon
Post-doctoral researcher
Music and Audio Research Laboratory (MARL)
& Center for Urban Science and Progress (CUSP)
New York University, New York, NY
www.justinsalamon.com


This message came from the mail archive
http://www.auditory.org/postings/2014/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University