Participants will work on a large dataset derived from TED talks to enhance speech in extremely challenging noisy environments and with competing speakers. Performance will be evaluated using human listening tests
as well as objective measures. We hope that the Challenge will create a benchmark for AVSEC research that will be useful for years to come. The challenge data and development
tools are now available - for details see the challenge website: https://challenge.cogmhear.org/#/
and our GitHub repository: https://github.com/cogmhear/avse_challenge
AVSEC has been accepted as an official challenge at the IEEE
Spoken Language Technology (SLT) Workshop (https://slt2022.org/)
to be held in Doha, Qatar, 9-12 Jan 2023, where a special session will be run.
Important Dates
1st May 2022: Challenge website launch
31st May 2022: Release of the full toolset, training/development data and baseline system
1st June 2022: Registration for challenge entrants opens
25th July 2022: Evaluation data released
1st Sept 2022: Submission deadline for evaluation (by objective and subjective measures)
9th Jan 2023: Results announced at IEEE SLT 2022
Background: Human performance in everyday noisy situations is known to be dependent upon both aural and visual senses that are contextually combined by the brain’s
multi-level integration strategies. The multimodal nature of speech is well established, with listeners known to unconsciously lip-read to improve the intelligibility of speech in a real noisy environment. It has been shown that the visual aspect of speech
has a potentially strong impact on the ability of humans to focus their auditory attention on a particular stimulus.
The aim of the first AVSEC is to bring together the wider computer vision, hearing and speech research communities to explore novel approaches
to multimodal speech-in-noise processing. Both raw and pre-processed AV datasets – derived from TED talk videos – will be made available to participants for training and development of audio-visual models to perform speech enhancement and speaker separation
at SNR levels that will be significantly more challenging than those typically used in audio-only scenarios. Baseline neural network models and a training recipe will be provided.
In addition to participation at IEEE SLT, Challenge participants will be invited to contribute to a Journal Special Issue on the topic of Audio-Visual Speech Enhancement that will be
announced later this year.
Registration/further information: If you are interested in participating and wish to receive further information,
please sign up here: https://challenge.cogmhear.org/#/getting-started/register
If you have questions, contact us directly at: cogmhear@xxxxxxxxxxxx
Organising Team:
Amir Hussain, Edinburgh Napier University, UK (co-Chair)
Peter Bell, University of Edinburgh, UK (co-Chair)
Mandar Gogate, Edinburgh Napier University, UK
Cassia Valentini Botinhao, University of Edinburgh, UK
Kia Dashtipour, Edinburgh Napier University, UK
Lorena Aldana, University of Edinburgh, UK
Evaluation Panel Chair: John Hansen, University of Texas at Dallas, USA
Scientific Committee Chair: Michael Akeroyd, University of Nottingham, UK
Industry co-ordinator: Peter Derleth, Sonova AG
Funded by the UK Engineering and Physical Sciences Research Council (EPSRC) programme grant: COG-MHEAR (http://cogmhear.org)
Supported by RNID (formerly Action on Hearing Loss), Deaf Scotland, Sonova AG
(3) Invitation to the COG-MHEAR Workshop at IEEE EMBC 2022, Glasgow, 11 July 2022
Don't miss our forthcoming COG-MHEAR Workshop on “Challenges and Opportunities in Developing Multi-Modal, Transformative Hearing Assistive Technologies” being held as part of the 44th Annual International Conference
of the IEEE Engineering in Medicine and Biology Society (EMBC) 2022, in Glasgow, Scotland, UK, 11-15 July 2022 (https://embc.embs.org/2022/)
Workshop Synopsis: Hearing loss affects at least 1.5 billion people globally and is associated with poorer
health and social outcomes. The World Health Organisation (WHO) estimates that 83% of people who could benefit from hearing aids (HAs) do not use them. Barriers to HA uptake are multifaceted and can include: social stigma; cost; a lack of understanding of hearing
loss interventions; and limitations of current HA technology. Few research advances in speech enhancement have been implemented in commercially available HAs. Moreover, even sophisticated aids remain ineffective in very noisy environments, where human
performance is known to depend upon input from both the aural and visual senses.
Creating multi-modal HAs that draw on the principles of normal, visually-assisted, hearing raises many technical and usability challenges which need to be tackled holistically. For example, making use of lip movements
or facial expressions traditionally requires a video camera filming the speaker, which introduces questions of privacy. Ambitious solutions to these challenges include exploring the use of radio signals for remote monitoring and designing all processing to
be done on the hearing device itself to minimise latency and increase privacy.
This interdisciplinary workshop will be facilitated by leading academic and industry experts, who will showcase a hands-on demonstration of a first-of-its-kind real-time, multi-modal speech enhancement prototype that
can exploit lip reading cues to effectively enhance speech in real noisy environments. The workshop will serve to stimulate discussions on current trends, future research and innovation, clinical evaluation and commercialisation challenges and opportunities
to transform the current HA landscape. The prototype demonstrator will be made freely available as an open testbed for further research, evaluation and benchmarking by participants, who will also benefit from interdisciplinary networking and collaboration
opportunities.
Once you click on the above link, you will see the 'Register Now' button.
If you are only registering for our Workshop, start by choosing 'Workshop Only' on the first screen (from the 'Registration Type' drop-down menu), then select workshop #29 (Amir Hussain)
from the list of workshops.
If you have already registered to attend EMBC and would like to add our workshop, simply enter your confirmation number and click Modify to add the
workshop.
If you need assistance for any reason, please reach out to EMBCReg@xxxxxxxxxxxxxxxx
We hope to see you soon.
Kindest regards
Amir
---
Professor Amir Hussain
Programme Director: EPSRC COG-MHEAR (http://cogmhear.org)
School of Computing, Engineering & Built Environment,
Edinburgh Napier University, Edinburgh EH10 5DT, Scotland, UK