Hi Everyone,
I'm happy to announce the release of a new dataset, AVA Active Speaker, addressing the problem of identifying which, if any, of the visible faces in a video are speaking at any point in time. Labels are provided over continuous 15-minute segments of movies from v1.0 of the AVA dataset.
The dataset creation process and our initial audiovisual models for this task are described in this arXiv paper. The dataset is available on the AVA Download page, along with details on the dataset format.
Please use the ava-dataset-users Google group for discussions and questions around the dataset, and please feel free to forward this note to relevant lists.
Regards,
Sourish Chaudhuri & the AVA team