I believe this could be relevant also for the Auditory
community...
-------- Forwarded Message --------
Dear ISMIR Community,
Over the past few years, this list has seen an increasingly
active discussion about publishing and accessing datasets for
reuse in academic research. Although sometimes driven by
concrete needs concerning a particular dataset or project, this
topic is not accessory. In a data-driven research community like
ours, it is very healthy to exchange ideas and perspectives on
how to devise flexible means for making our data and results
accessible--a valuable pursuit towards supporting research
reproducibility.
The Music Technology Group of UPF hosts and provides free access
to a number of datasets for music and audio research. As it
normally happens with other published datasets, one needs to
download the data files and procure means for exploring them
locally. If limited to audio files or annotations, such process
does not generally bring significant difficulties other than
data volume. However, as the number and nature of modalities,
extracted descriptors, and annotations increase (think of motion
capture, video, physiological signals, time series of different
sample rates, etc.), difficulties arise not only in the design
or adoption of formatting schemes, but also in the availability
of platforms that enable and facilitate exchange by providing
simple ways to remotely visualize or explore the data before
downloading.
In the context of several recent projects focused on music
performance analysis and multimodal interaction, we had to
collect, process, and annotate music performance recordings that
included dense data streams of different modalities. Envisioning
the future release of our dataset for the research community, we
realized the need for better means to explore and exchange data.
Since then, at UPF we have been developing Repovizz, a remote
hosting platform for multimodal data storage, visualization,
annotation, and selective retrieval via a web interface and a
dedicated API.
By way of the recently published article
E. Maestre, P. Papiotis, M. Marchini, Q. Llimona, O. Mayor,
A. Pérez, M. Wanderley,
Enriched Multimodal Representations of Music
Performances: Online Access and Visualization
IEEE MultiMedia, Vol 24:1, pp. 24-34, 2017 (http://ieeexplore.ieee.org/document/7849104/),
we introduce Repovizz to the ISMIR Community and open access to
the QUARTET dataset, a fully annotated collection of string
quartet multimodal recordings released through Repovizz.
For a short, unadorned video demonstrating Repovizz, please go
to http://www.youtube.com/watch?v=JcHbGtltuG4.
Although still under development, Repovizz can be used by anyone
in the academic community.
The Repovizz website can be found at http://repovizz.upf.edu
(enter as guest to check the documentation). The only
requirement is to use Chrome or Safari (no mobile app yet).
The QUARTET dataset comprises 96 recordings of string quartet
exercises involving solo and ensemble conditions, containing
multichannel audio (ambient microphones and piezoelectric
pickups), video, motion capture (optical and magnetic) of
instrumental gestures and of musician upper bodies, computed
bowing gesture signals, extracted audio descriptors, and
multitrack score-performance alignment. The dataset, processed
and curated over the past years partly in the context of the PhD
dissertation work of Panos Papiotis on ensemble interdependence
(http://www.tdx.cat/bitstream/10803/361107/2/tpp.pdf),
is now freely available for the research community. A
description of its contents, including links to the
corresponding Repovizz entries, can be explored at http://mtg.upf.edu/download/datasets/quartet-dataset.
Repovizz and the QUARTET dataset only represent our own attempt
at expanding the possibilities for exchange and online
visualization of multimodal music performance data. Although
there are many issues to resolve as we keep on developing
Repovizz, we hope that this initiative triggers further
discussion and serves as an inspiration for others in proposing
and developing future means for nourishing a digital library
ecosystem that enables rich interaction with multimodal music
resources.
Best regards,
Esteban Maestre
..also on behalf of Panos Papiotis, Quim Llimona, and Oscar
Mayor.
--
Esteban Maestre
Music Technology Group, Universitat Pompeu Fabra
Center for Interdisciplinary Research in Music Media and
Technology, McGill University