
[AUDITORY] free large-scale dataset of annotated musical notes from Google



Dear list,


You may know of this already, but Google has released a freely available large-scale dataset of annotated musical notes. 


https://magenta.tensorflow.org/datasets/nsynth#files

Quoting the dataset page:

NSynth is an audio dataset containing 305,979 musical notes, each with a unique pitch, timbre, and envelope. For 1,006 instruments from commercial sample libraries, we generated four second, monophonic 16kHz audio snippets, referred to as notes, by ranging over every pitch of a standard MIDI piano (21-108) as well as five different velocities (25, 50, 75, 100, 127). The note was held for the first three seconds and allowed to decay for the final second.
 
Some instruments are not capable of producing all 88 pitches in this range, resulting in an average of 65.4 pitches per instrument. Furthermore, the commercial sample packs occasionally contain duplicate sounds across multiple velocities, leaving an average of 4.75 unique velocities per pitch.
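
As a quick sanity check of the figures quoted above, here is a minimal Python sketch of the nominal note grid. It uses only the numbers given in the description; the constant and function names are my own and are not part of any NSynth tooling.

```python
# Nominal NSynth note grid, built only from the figures quoted above.
# All names here are illustrative assumptions, not part of the dataset's API.

SAMPLE_RATE_HZ = 16_000            # monophonic 16 kHz audio
NOTE_SECONDS = 4                   # each snippet lasts four seconds
HELD_SECONDS = 3                   # note held for three seconds, then decays

MIDI_PITCHES = range(21, 109)      # MIDI 21..108 inclusive (standard piano)
VELOCITIES = (25, 50, 75, 100, 127)

TOTAL_NOTES = 305_979
TOTAL_INSTRUMENTS = 1_006


def note_grid():
    """Return every (pitch, velocity) pair an instrument could nominally cover."""
    return [(pitch, velocity) for pitch in MIDI_PITCHES for velocity in VELOCITIES]


def samples_per_note():
    """Number of audio samples in one four-second snippet."""
    return SAMPLE_RATE_HZ * NOTE_SECONDS


grid = note_grid()
print(len(MIDI_PITCHES))                # 88 pitches
print(len(grid))                        # 440 pitch/velocity pairs per instrument
print(samples_per_note())               # 64000 samples per note
print(SAMPLE_RATE_HZ * HELD_SECONDS)    # 48000 samples in the held portion

# Mean number of notes actually present per instrument (roughly 304 of 440).
print(round(TOTAL_NOTES / TOTAL_INSTRUMENTS, 1))
```

The gap between the nominal 440 pairs and the roughly 304 notes actually present per instrument reflects the limited pitch ranges and duplicate velocities mentioned above.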


The dataset includes "unusual" instrument sounds synthesized (or morphed) using Google's neural networks.


A paper describing the dataset and morphing algorithm can be found on arXiv:
https://arxiv.org/abs/1704.01279 


Enjoy!


Warm regards,
Charis



--
Dr. Charalampos Saitis

Humboldt Research Fellow / Humboldt-Forschungsstipendiat
Audio Communication Group / Fachgebiet Audiokommunikation
Berlin Institute of Technology / Technische Universität Berlin

Research Collaborator, Sound of Vision 
ISI Foundation / Fondazione ISI

www.ak.tu-berlin.de 
www.soundofvision.net
www.music.mcgill.ca/~harry
charalampos.saitis@xxxxxxxxxxxxxxxxxxx