[AUDITORY] AVA Speech dataset now available

Subject: [AUDITORY] AVA Speech dataset now available

From: Sourish Chaudhuri <0000007fde242bbe-dmarc-request@xxxxxxxxxxxxxxx>

Date: Fri, 24 Aug 2018 10:25:00 -0700

Reply-to: Sourish Chaudhuri <sourc@xxxxxxxxxx>

Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

Hi Everyone,

I'm happy to announce the release of a new dataset, AVA Speech, which provides speech activity labels for v1.0 of the AVA dataset:

– It contains dense labels indicating when speech is present, along with the background condition: clean speech, speech with background music, or speech with background noise. Multiple raters annotated every instant of each 15-minute clip, and the ratings were merged by majority vote to produce the final labels that have been released.

– The dataset is available on the AVA Download page.

– This work is described in more detail in our paper (available on arXiv), which will be presented at Interspeech 2018 on September 4. In addition to the data itself, the paper provides baseline speech detection results in the various conditions, using audio-only and visual-only systems.

– Please use the ava-dataset-users Google group for discussions and questions about the dataset, and please feel free to forward this note to relevant lists.
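
For readers who want a concrete picture of the majority-vote merging mentioned above, here is a minimal sketch. It is an illustration only, not the procedure actually used to build the dataset: the label strings, the per-frame representation of each rater's annotations, the function name `majority_vote`, and the tie-breaking rule are all assumptions.

```python
from collections import Counter

# Hypothetical label names; the released files may use different strings.
LABELS = ("NO_SPEECH", "CLEAN_SPEECH", "SPEECH_WITH_MUSIC", "SPEECH_WITH_NOISE")


def majority_vote(ratings):
    """Merge per-frame labels from several raters into one label per frame.

    ratings: list of equal-length label sequences, one sequence per rater.
    Ties are broken by position in LABELS (an assumption made for this sketch).
    """
    if not ratings:
        return []
    n_frames = len(ratings[0])
    if any(len(r) != n_frames for r in ratings):
        raise ValueError("all raters must label the same number of frames")

    merged = []
    for frame_labels in zip(*ratings):
        for label in frame_labels:
            if label not in LABELS:
                raise ValueError(f"unknown label: {label!r}")
        counts = Counter(frame_labels)
        top = max(counts.values())
        # Among labels tied for the highest count, take the earliest in LABELS.
        merged.append(next(l for l in LABELS if counts.get(l, 0) == top))
    return merged


if __name__ == "__main__":
    rater_a = ["NO_SPEECH", "CLEAN_SPEECH", "SPEECH_WITH_MUSIC"]
    rater_b = ["NO_SPEECH", "CLEAN_SPEECH", "SPEECH_WITH_NOISE"]
    rater_c = ["CLEAN_SPEECH", "SPEECH_WITH_MUSIC", "SPEECH_WITH_NOISE"]
    print(majority_vote([rater_a, rater_b, rater_c]))
```

With an odd number of raters and two-way disagreements, ties cannot occur; the tie-break only matters when three or more distinct labels split the vote evenly or the rater count is even.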

Regards,

Sourish Chaudhuri

Google AI Perception