[AUDITORY] free large-scale dataset of annotated musical notes from Google (Charalampos Saitis )


Subject: [AUDITORY] free large-scale dataset of annotated musical notes from Google
From:    Charalampos Saitis  <charalampos.saitis@xxxxxxxx>
Date:    Wed, 24 May 2017 16:43:42 +0200
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

--f4030436634ed52baf055046230e Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Dear list, You may know of this already, but Google has released a freely available large-scale dataset of annotated musical notes. https://magenta.tensorflow.org/datasets/nsynth#files NSynth is an audio dataset containing 305,979 musical notes, each with a > unique pitch, timbre, and envelope. For 1,006 instruments from commercial > sample libraries, we generated four second, monophonic 16kHz audio > snippets, referred to as notes, by ranging over every pitch of a standard > MIDI pian o (21-108) as well as five different velocities (25, 50, 75, 10= 0, > 127). The note was held for the first three seconds and allowed to decay > for the final second. > > Some instruments are not capable of producing all 88 pitches in this > range, resulting in an average of 65.4 pitches per instrument. Furthermor= e, > the commercial sample packs occasionally contain duplicate sounds across > multiple velocities, leaving an average of 4.75 unique velocities per pit= ch > . The dataset includes "unusual" instrument sounds synthesized (or morphed) using Google's neural networks. A paper describing the dataset and morphing algorithm can be found on arXiv= : https://arxiv.org/abs/1704.01279 Enjoy! Warm regards, Charis --=20 Dr. Charalampos Saitis Humboldt Research Fellow / Humboldt-Forschungsstipendiat Audio Communication Group / Fachgebiet Audiokommunikation Berlin Institute of Technology / Technische Universit=C3=A4t Berlin Research Collaborator, Sound of Vision ISI Foundation / Fondazione ISI www.ak.tu-berlin.de www.soundofvision.net www.music.mcgill.ca/~harry charalampos.saitis@xxxxxxxx --f4030436634ed52baf055046230e Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable <div dir=3D"ltr"><div>Dear list,</div><div><br></div><div><br></div><div>Yo= u may know of this already, but Google has released a freely available larg= e-scale dataset of annotated musical notes.=C2=A0</div><div><br></div><div>= <br></div><div><a href=3D"https://magenta.tensorflow.org/datasets/nsynth#fi= les">https://magenta.tensorflow.org/datasets/nsynth#files</a></div><div><br= ></div><div><br></div><div><blockquote style=3D"margin:0px 0px 0px 0.8ex;bo= rder-left:1px solid rgb(204,204,204);padding-left:1ex" class=3D"gmail_quote= ">NSynth is an audio dataset containing 305,979 musical notes, each with a = unique pitch, timbre, and envelope. For 1,006 instruments from commercial s= ample libraries, we generated four second, monophonic 16kHz audio snippets,= referred to as notes, by ranging over every pitch of a standard MIDI pian = o (21-108) as well as five different velocities (25, 50, 75, 100, 127). The= note was held for the first three seconds and allowed to decay for the fin= al second.<br></blockquote><div>=C2=A0</div><blockquote style=3D"margin:0px= 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex" cla= ss=3D"gmail_quote">Some instruments are not capable of producing all 88 pit= ches in this range, resulting in an average of 65.4 pitches per instrument.= Furthermore, the commercial sample packs occasionally contain duplicate so= unds across multiple velocities, leaving an average of 4.75 unique velociti= es per pitch<font color=3D"#111111" face=3D"Roboto, sans-serif"><span style= =3D"font-size:16px;background-color:rgb(253,253,253)">.</span></font></bloc= kquote></div><div><br></div><div><br></div><div>The dataset includes &quot;= unusual&quot; instrument sounds synthesized (or morphed) using Google&#39;s= neural networks.</div><div><br></div><div><br></div><div>A paper describin= g the dataset and morphing algorithm can be found on arXiv:</div><div><a hr= ef=3D"https://arxiv.org/abs/1704.01279">https://arxiv.org/abs/1704.01279</a= >=C2=A0<br></div><div><br></div><div><br></div><div>Enjoy!</div><div><br></= div><div><br></div><div>Warm regards,</div><div>Charis</div><div><br></div>= <div><br></div><div><br></div>-- <br><div class=3D"gmail_signature"><div di= r=3D"ltr"><div><div dir=3D"ltr"><div><div dir=3D"ltr"><div dir=3D"ltr"><div= dir=3D"ltr"><div dir=3D"ltr"><div dir=3D"ltr"><div dir=3D"ltr"><div style= =3D"font-size:12.8px"><div dir=3D"ltr"><div style=3D"font-size:12.8px"><div= dir=3D"ltr"><span style=3D"color:rgb(0,0,0);font-family:monospace,monospac= e;font-size:x-small">Dr. Charalampos Saitis</span><br style=3D"color:rgb(0,= 0,0);font-family:monospace,monospace;font-size:x-small"><br style=3D"color:= rgb(0,0,0);font-family:monospace,monospace;font-size:x-small"><span style= =3D"color:rgb(0,0,0);font-family:monospace,monospace;font-size:x-small">Hum= boldt Research Fellow / Humboldt-Forschungsstipendiat</span><br style=3D"co= lor:rgb(0,0,0);font-family:monospace,monospace;font-size:x-small"><span sty= le=3D"color:rgb(0,0,0);font-family:monospace,monospace;font-size:x-small">A= udio Communication Group / Fachgebiet Audiokommunikation</span><br style=3D= "color:rgb(0,0,0);font-family:monospace,monospace;font-size:x-small"><span = style=3D"color:rgb(0,0,0);font-family:monospace,monospace;font-size:x-small= ">Berlin Institute of Technology / Technische Universit=C3=A4t Berlin</span= ><br style=3D"color:rgb(0,0,0);font-family:monospace,monospace;font-size:x-= small"><br style=3D"color:rgb(0,0,0);font-family:monospace,monospace;font-s= ize:x-small"><span style=3D"color:rgb(0,0,0);font-family:monospace,monospac= e;font-size:x-small">Research Collaborator, Sound of Vision=C2=A0</span><br= style=3D"color:rgb(0,0,0);font-family:monospace,monospace;font-size:x-smal= l"><span style=3D"color:rgb(0,0,0);font-family:monospace,monospace;font-siz= e:x-small">ISI Foundation / Fondazione ISI</span><br style=3D"color:rgb(0,0= ,0);font-family:monospace,monospace;font-size:x-small"><br style=3D"color:r= gb(0,0,0);font-family:monospace,monospace;font-size:x-small"><a href=3D"htt= p://www.ak.tu-berlin.de/" style=3D"font-family:monospace,monospace;font-siz= e:x-small" target=3D"_blank">www.ak.tu-berlin.de</a><span style=3D"color:rg= b(0,0,0);font-family:monospace,monospace;font-size:x-small">=C2=A0</span><b= r style=3D"color:rgb(0,0,0);font-family:monospace,monospace;font-size:x-sma= ll"><a href=3D"http://www.soundofvision.net/" style=3D"font-family:monospac= e,monospace;font-size:x-small" target=3D"_blank">www.soundofvision.net</a><= br style=3D"color:rgb(0,0,0);font-family:monospace,monospace;font-size:x-sm= all"><a href=3D"http://www.music.mcgill.ca/~harry" style=3D"font-family:mon= ospace,monospace;font-size:x-small" target=3D"_blank">www.music.mcgill.ca/~= harry</a><br style=3D"color:rgb(0,0,0);font-family:monospace,monospace;font= -size:x-small"><a href=3D"mailto:charalampos.saitis@xxxxxxxx" st= yle=3D"font-family:monospace,monospace;font-size:x-small" target=3D"_blank"= >charalampos.saitis@xxxxxxxx</a><br></div></div></div></div></di= v></div></div></div></div></div></div></div></div></div></div> </div> --f4030436634ed52baf055046230e--


This message came from the mail archive
../postings/2017/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University