[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[AUDITORY] Hannah: dense audio-visual person annotation in "Hannah and her sisters"

To: AUDITORY@xxxxxxxxxxxxxxx
Subject: [AUDITORY] Hannah: dense audio-visual person annotation in "Hannah and her sisters"
From: Ozerov Alexey <Alexey.Ozerov@xxxxxxxxxxxxxxx>
Date: Mon, 14 Oct 2013 17:30:27 +0200
Accept-language: en-US
Acceptlanguage: en-US
Approved-by: Alexey.Ozerov@xxxxxxxxxxxxxxx
Authentication-results: mx.google.com; spf=neutral (google.com: 128.59.28.172 is neither permitted nor denied by best guess record for domain of owner-auditory@xxxxxxxxxxxxxxx) smtp.mail=owner-auditory@xxxxxxxxxxxxxxx
Delivered-to: dan.ellis@xxxxxxxxx
List-archive: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>
List-help: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>, <mailto:LISTSERV@LISTS.MCGILL.CA?body=INFO AUDITORY>
List-owner: <mailto:AUDITORY-request@LISTS.MCGILL.CA>
List-subscribe: <mailto:AUDITORY-subscribe-request@LISTS.MCGILL.CA>
List-unsubscribe: <mailto:AUDITORY-unsubscribe-request@LISTS.MCGILL.CA>
Reply-to: Ozerov Alexey <Alexey.Ozerov@xxxxxxxxxxxxxxx>
Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>
Thread-index: Ac7I8k718zNYbFQYTU2M3NSop4kKPw==
Thread-topic: Hannah: dense audio-visual person annotation in "Hannah and her sisters"

Dear list,

[Sorry for cross-posting]

We have created and made publicly available a dense audio-visual person-oriented ground-truth annotation of a feature movie (100 minutes long): “Hannah and her sisters” by Woody Allen.

The annotation includes

• Face tracks in video (densely annotated, i.e., in each frame, and person-labeled)

• Speech segments in audio (person-labeled)

• Shot boundaries in video

The annotation can be useful for evaluating

• Person-oriented video-based tasks (e.g., face tracking, automatic character naming, etc.)

• Person-oriented audio-based tasks (e.g., speaker diarization or recognition)

• Person-oriented multimodal-based tasks (e.g., audio-visual character naming)

Detail on Hannah dataset and access to it can be obtained there:

https://research.technicolor.com/rennes/hannah-home/

https://research.technicolor.com/rennes/hannah-download/

Acknowledgments:

This work is supported by AXES EU project: http://www.axes-project.eu/

Best regards,

Alexey Ozerov, Jean-Ronan Vigouroux, Louis Chevallier and Patrick Pérez

Alexey Ozerov
Technicolor Research & Innovation

Alexey.Ozerov@xxxxxxxxxxxxxxx

Prev by Date: [AUDITORY] Cognitive Science Arena for Beginners: Bressanone (IT), February 28 - March 01, 2014
Next by Date: Re: [AUDITORY] How to speak to people about hearing loss and more
Previous by thread: [AUDITORY] Cognitive Science Arena for Beginners: Bressanone (IT), February 28 - March 01, 2014
Next by thread: Re: [AUDITORY] How to speak to people about hearing loss and more
Index(es):
- Date
- Thread