[AUDITORY] Deadline extended: 4th COG-MHEAR Audio-Visual Speech Enhancement Challenge (AVSEC-4) (Lorena Aldana)


Subject: [AUDITORY] Deadline extended: 4th COG-MHEAR Audio-Visual Speech Enhancement Challenge (AVSEC-4)
From:    Lorena Aldana  <lorena.aldana@xxxxxxxx>
Date:    Thu, 26 Jun 2025 13:49:13 +0000

Dear all (with apologies for any cross-postings),

We are running a fourth edition of the COG-MHEAR International Audio-Visual Speech Enhancement Challenge (AVSEC-4) as a Satellite Workshop of Interspeech 2025 in Rotterdam, on 16th August 2025 (http://challenge.cogmhear.org).

The Audio-Visual Speech Enhancement Challenge (AVSEC) established the first benchmark in the field, providing a common framework for the evaluation of audio-visual speech enhancement and separation systems.

Building upon three successful editions of the Challenge (SLT 2022, ASRU 2023 and Interspeech 2024), AVSEC-4 aims to further advance system performance, create opportunities to reflect on the scope and limitations of current audio-visual speech technologies, and help transform the future of multimodal assistive hearing and speech communication systems.

As in previous editions of the challenge, systems will be ranked based on the results of listening tests with human participants.

In addition to a carefully curated audio-visual dataset, we provide facial landmarks for the train/dev datasets.

A new baseline model for AVSEC-4 has been released along with scripts for objective evaluation. Baseline models of previous AVSEC editions are also available.

This year's evaluation dataset includes an additional 'out-of-domain' corpus involving a small group, free-flowing conversation with a hearing-aid user in the loop.

To register for the challenge and access the AVSEC-4 dataset, please follow the guidelines on the website: https://challenge.cogmhear.org/#/getting-started/register

AVSEC scripts are available here: https://github.com/cogmhear/avse_challenge

Results - including prizes, generously sponsored by Sonova, for both winners and runners-up of the three evaluation tracks (regular, low-latency and out-of-domain) - will be announced at the AVSEC-4 Satellite Workshop during Interspeech 2025.

Important dates:

* 21st March 2025: Release of training and development data.
* 2nd April 2025: Release of low-latency baseline system.
* 6th June 2025: Evaluation data release.
* 9th June 2025: Leaderboard open for submissions.
* 12th June 2025: Paper submission opens.
* 24th June 2025: Additional "out-of-domain" evaluation corpus released.
* (Extended) 7th July 2025: Deadline for Challenge submissions and one-page system description submission.
* (Extended) 11th July 2025: Workshop paper submission closes.
* 14th July 2025: Early acceptance notification.
* 23rd July 2025: Early release of evaluation results.
* 1st August 2025: Camera-ready paper.

AVSEC-4 Workshop proceedings:

We invite prospective authors to submit, for peer review, either 2-page extended abstracts or 4-6 page full papers, following the Interspeech 2025 paper template.

As a follow-on to the IEEE Journal of Selected Topics in Signal Processing (JSTSP) special issue organised as part of the AVSEC-3 Workshop (which is currently in press), we plan to invite extended AVSEC-4 Workshop papers for submission to a new special issue (details to be confirmed).

We welcome Workshop submissions from participants of AVSEC-4 as well as previous editions: AVSEC-2 and AVSEC-3.
Papers are also welcome from researchers not participating in the Challenge but interested in related Workshop topics, including (but not limited to):

* Low-latency approaches to audio-visual speech enhancement and separation.
* Human auditory-inspired models of multi-modal speech perception and enhancement.
* Energy-efficient audio-visual speech enhancement and separation methods.
* Machine learning for diverse target listeners and diverse listening scenarios.
* Audio quality & intelligibility assessment of audio-visual speech enhancement systems.
* Objective metrics to predict quality & intelligibility from audio-visual stimuli.
* Understanding human speech perception in competing speaker scenarios in real world and virtual environments.
* Clinical applications of audio-visual speech enhancement and separation (e.g. multi-modal hearing assistive technologies for hearing-impaired listeners).
* Accessibility and human-centric factors in the design and evaluation of innovative multimodal technologies, including multimodal corpus development, public perceptions, ethics considerations, standards, and societal, economic and political impacts.

The call for papers is available here: https://challenge.cogmhear.org/#/getting-started/call-for-papers

Workshop registration:

Workshop registration costs:

* Regular/Retiree (ISCA Member and Non-member) registration: €40 EUR
* Student (ISCA Member and Non-member) registration: €25 EUR

Further information about the workshop registration process is available on the Challenge website and also via Interspeech: https://www.interspeech2025.org/registration

We look forward to seeing you in Rotterdam.

AVSEC organising team

The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. Is e buidheann carthannais a th’ ann an Oilthigh Dhùn Èideann, clàraichte an Alba, àireamh clàraidh SC005336.


This message came from the mail archive
postings/2025/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University