Subject: Re: request for dataset
From: "Richard F. Lyon" <dicklyon@xxxxxxxx>
Date: Sat, 18 Oct 2014 21:53:14 -0700
List-Archive: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

The "Noncommercial" clause in the CC BY-NC-SA 4.0 license will make it hard
for some of us to convince our lawyers that it's OK to do anything with this
dataset.

Dick

On Sat, Oct 18, 2014 at 3:11 PM, aberrian <aberrian@xxxxxxxx> wrote:

> Hi Alexander,
>
> I already emailed Shabih about the following new dataset,
> but I just realized I should've also sent it out to the rest of the
> Auditory list.
> You might be interested in adding it to the list on your website as well.
>
> It's called MedleyDB: http://medleydb.weebly.com/
> I think it just got released this month.
>
> MedleyDB contains 122 songs of diverse genres, each with all their
> separated tracks (i.e. separate instruments and voices) available.
> And it's all under a Creative Commons license. So if you
> wanted to create a classifier that could detect a certain instrument,
> or if you wanted to check the capacity of your classifier to detect
> a specific kind of music, this is a great dataset to use.
>
> Best wishes,
> Alex
>
> On 2014-10-16 14:21, alexander lerch wrote:
>
>> Dear Shabih,
>>
>> you can find a list of music datasets on my website here:
>> http://www.audiocontentanalysis.org/data-sets
>>
>> However, I don't remember seeing one containing broadcast streams, so I
>> am not sure how much help that can be.
>>
>> Alexander
>>
>> On 2014-10-15 11:04, Syed Shabih Hasan wrote:
>>
>>> Dear All
>>>
>>> I am working on creating a classifier that can identify live speech,
>>> music, media sounds (tv, radio etc). Can someone, please, point me to
>>> publicly available datasets of audio that are also annotated with the
>>> proper labels?
>>>
>>> Best Regards
>>> Shabih
>>>
>>> —
>>> *Syed Shabih Hasan*
>>> Graduate Student in CS
>>> University of Iowa
>>> http://shabih.hasan.net