Dear list,
You may know of this already, but Google has released a freely available large-scale dataset of annotated musical notes.
NSynth is an audio dataset containing 305,979 musical notes, each with a unique pitch, timbre, and envelope. For 1,006 instruments from commercial sample libraries, we generated four-second, monophonic 16kHz audio snippets, referred to as notes, by ranging over every pitch of a standard MIDI piano (21-108) as well as five different velocities (25, 50, 75, 100, 127). The note was held for the first three seconds and allowed to decay for the final second.
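For anyone who wants to see the sampling grid spelled out, here is a minimal sketch of the pitch/velocity combinations described above. The constant names are my own and are not taken from the dataset or its tooling; only the numbers come from the description.

```python
from itertools import product

# Values taken from the dataset description; the names are illustrative only.
SAMPLE_RATE_HZ = 16_000
NOTE_SECONDS = 4
HELD_SECONDS = 3                      # note is held, then decays for the final second
PITCHES = range(21, 109)              # MIDI pitches 21..108 inclusive
VELOCITIES = (25, 50, 75, 100, 127)

# Every (pitch, velocity) pair that could be rendered for a single instrument.
grid = list(product(PITCHES, VELOCITIES))

samples_per_note = SAMPLE_RATE_HZ * NOTE_SECONDS
held_samples = SAMPLE_RATE_HZ * HELD_SECONDS

print(len(PITCHES), "pitches x", len(VELOCITIES), "velocities =", len(grid), "notes per instrument")
print(samples_per_note, "samples per note, of which", held_samples, "are held")
```

So a full-range instrument contributes at most 440 notes, each 64,000 samples long.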
Some instruments are not capable of producing all 88 pitches in this range, resulting in an average of 65.4 pitches per instrument. Furthermore, the commercial sample packs occasionally contain duplicate sounds across multiple velocities, leaving an average of 4.75 unique velocities per pitch.
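As a rough sanity check on these figures, the sketch below compares the theoretical maximum and a back-of-the-envelope estimate against the reported total. Multiplying the two averages is only an approximation (the velocity average is per pitch, and both averages are rounded), so the estimate should not be expected to match exactly. The variable names are my own.

```python
# Figures quoted in the dataset description.
NUM_INSTRUMENTS = 1006
AVG_PITCHES_PER_INSTRUMENT = 65.4
AVG_VELOCITIES_PER_PITCH = 4.75
REPORTED_NOTES = 305_979

# Upper bound if every instrument covered all 88 pitches at all 5 velocities.
upper_bound = NUM_INSTRUMENTS * 88 * 5

# Approximation only: product of averages, not an exact count.
rough_estimate = NUM_INSTRUMENTS * AVG_PITCHES_PER_INSTRUMENT * AVG_VELOCITIES_PER_PITCH
relative_gap = abs(rough_estimate - REPORTED_NOTES) / REPORTED_NOTES

print(f"upper bound:    {upper_bound}")
print(f"rough estimate: {rough_estimate:.0f}")
print(f"reported total: {REPORTED_NOTES}")
print(f"relative gap:   {relative_gap:.1%}")
```

The estimate lands within a few percent of the reported 305,979 notes, well below the upper bound of 442,640.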
The dataset includes "unusual" instrument sounds synthesized (or morphed) using Google's neural networks.
A paper describing the dataset and morphing algorithm can be found on arXiv:
Enjoy!
Warm regards,
Charis
--