[AUDITORY] Call for papers for special sessions in speech and audio neural coding (Dennis Xiao )


Subject: [AUDITORY] Call for papers for special sessions in speech and audio neural coding
From:    Dennis Xiao  <xiao.dennis@xxxxxxxx>
Date:    Wed, 20 Dec 2023 08:38:53 +0800

--000000000000e2691b060ce635c1 Content-Type: text/plain; charset="UTF-8" ### Apologies for cross-posting, please distribute ### Dear all, We plan to propose a special session in Interspeech 2024 about speech and audio neural coding. Please feel free to contact us if you have any questions. Best regards, Wei ### Detail description of the special session proposed ### Multi-functional neural speech and audio coding Speech and Audio coding is one of the critical technologies in real-time communication. Traditional coding methods (i.e., signal processing-based ones) mostly rely on physical sound perception and production models as well as basic digital signal processing principles. Recently, deep learning and artificial intelligent (AI) based speech synthesis and audio compression methods were developed. In comparison with the SP-based methods, AI-based approaches bring more possibilities for audio compressioon and are able to achieve better performance with higher compression efficiency. However, the AI-based method (e.g., the neural speech and audio coding) still suffers from certain problems including but not limited to robustness and high computational complexity, which have attracted the attention from many academic and industrial organizations and researchers. This session proposal aims to collect new ideas and developments in neural coding techniques, including low bitrate and low latency neural coding. We are also looking for new solutions that enable the neural codec to work with different functions such as packet loss concealment, noise reduction, voice conversion, TTS, audio band extension, and AIGC-related topics, etc. Therefore, we propose to apply a special session in INTERSPEECH 2024. Please feel free to contact us if you have interest to contribute to this special session. Organizers Wei XIAO (denniswxiao@xxxxxxxx), Tencent Ethereal Audio Lab Prof. Jing WANG (wangjing@xxxxxxxx), Beijing Institute of Technology Prof. Jingdong CHEN, IEEE Fellow (jingdongchen@xxxxxxxx), Center of Intelligent Acoustics and Immersive Communications, Northwestern Polytechnical University Xuan ZHU (xuan.zhu@xxxxxxxx), Samsung Research China - Beijing --000000000000e2691b060ce635c1 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable <div dir=3D"ltr"><div>### Apologies for cross-posting, please distribute ##= #</div><div><br></div>Dear all,<div><br></div><div>We plan to propose a spe= cial session in Interspeech 2024 about speech and audio neural coding. Plea= se feel free to contact us if you have=C2=A0any questions.</div><div><br></= div><div>Best regards,</div><div><br></div><div>Wei</div><div><br></div><di= v>### Detail description of the special session proposed ###</div><div><br>= </div><div><div></div><div class=3D"gmail-document"><div class=3D"gmail-sec= tion"><p class=3D"gmail-paragraph gmail-text-align-type-left" style=3D"line= -height:100%;margin:3pt 0pt;font-family:=E7=AD=89=E7=BA=BF;font-size:12pt">= <span style=3D"font-size:16pt;font-weight:bold;color:rgb(51,51,51);letter-s= pacing:0pt;vertical-align:baseline">Multi-functional=C2=A0neural=C2=A0speec= h=C2=A0and=C2=A0audio=C2=A0coding</span><span lang=3D"EN-US"></span></p><p = class=3D"gmail-paragraph gmail-text-align-type-justify" style=3D"text-align= :justify;line-height:100%;margin:3pt 0pt;font-family:=E7=AD=89=E7=BA=BF;fon= t-size:12pt"><span lang=3D"EN-US"></span></p><p class=3D"gmail-paragraph gm= ail-text-align-type-justify" style=3D"text-align:justify;line-height:100%;m= argin:3pt 0pt;font-family:=E7=AD=89=E7=BA=BF;font-size:12pt"><span style=3D= "font-size:11pt;color:rgb(51,51,51);letter-spacing:0pt;vertical-align:basel= ine"><br></span></p><p class=3D"gmail-paragraph gmail-text-align-type-justi= fy" style=3D"text-align:justify;line-height:100%;margin:3pt 0pt;font-family= :=E7=AD=89=E7=BA=BF;font-size:12pt"><span style=3D"font-size:11pt;color:rgb= (51,51,51);letter-spacing:0pt;vertical-align:baseline">Speech=C2=A0and=C2= =A0Audio=C2=A0coding=C2=A0is=C2=A0one=C2=A0of=C2=A0the=C2=A0critical=C2=A0t= echnologies=C2=A0in=C2=A0real-time=C2=A0communication.=C2=A0=C2=A0Tradition= al=C2=A0coding=C2=A0methods=C2=A0=C2=A0(i.e.,=C2=A0signal=C2=A0processing-b= ased=C2=A0ones)=C2=A0mostly=C2=A0rely=C2=A0on=C2=A0physical=C2=A0sound=C2= =A0perception=C2=A0and=C2=A0production=C2=A0models=C2=A0as=C2=A0well=C2=A0a= s=C2=A0basic=C2=A0digital=C2=A0signal=C2=A0processing=C2=A0principles.=C2= =A0=C2=A0Recently,=C2=A0deep=C2=A0learning=C2=A0and=C2=A0artificial=C2=A0in= telligent=C2=A0(AI)=C2=A0based=C2=A0speech=C2=A0synthesis=C2=A0and=C2=A0aud= io=C2=A0compression=C2=A0methods=C2=A0were=C2=A0developed.=C2=A0In=C2=A0com= parison=C2=A0with=C2=A0the=C2=A0SP-based=C2=A0methods,=C2=A0=C2=A0AI-based= =C2=A0approaches=C2=A0bring=C2=A0more=C2=A0possibilities=C2=A0for=C2=A0audi= o=C2=A0compressioon=C2=A0and=C2=A0are=C2=A0able=C2=A0to=C2=A0achieve=C2=A0b= etter=C2=A0performance=C2=A0with=C2=A0higher=C2=A0compression=C2=A0efficien= cy.=C2=A0=C2=A0</span><span style=3D"font-size:11pt;color:rgb(51,51,51);let= ter-spacing:0pt;vertical-align:baseline">However,=C2=A0the=C2=A0AI-based=C2= =A0me</span><span style=3D"font-size:11pt;color:rgb(51,51,51);letter-spacin= g:0pt;vertical-align:baseline">thod</span><span style=3D"white-space:pre;fo= nt-size:11pt;color:rgb(51,51,51);letter-spacing:0pt;vertical-align:baseline= "> </span><span style=3D"font-size:11pt;color:rgb(51,51,51);letter-spacing:= 0pt;vertical-align:baseline">(e.g.,=C2=A0the=C2=A0neural=C2=A0speech=C2=A0a= nd=C2=A0audio=C2=A0coding)</span><span style=3D"font-size:11pt;color:rgb(51= ,51,51);letter-spacing:0pt;vertical-align:baseline">=C2=A0still=C2=A0</span= ><span style=3D"font-size:11pt;color:rgb(51,51,51);letter-spacing:0pt;verti= cal-align:baseline">suffers=C2=A0from=C2=A0</span><span style=3D"font-size:= 11pt;color:rgb(51,51,51);letter-spacing:0pt;vertical-align:baseline">certai= n=C2=A0problems=C2=A0</span><span style=3D"font-size:11pt;color:rgb(51,51,5= 1);letter-spacing:0pt;vertical-align:baseline">including=C2=A0but=C2=A0not= =C2=A0limited=C2=A0to=C2=A0robustness=C2=A0and=C2=A0high=C2=A0computational= =C2=A0complexity,=C2=A0which=C2=A0</span><span style=3D"font-size:11pt;colo= r:rgb(51,51,51);letter-spacing:0pt;vertical-align:baseline">have=C2=A0attra= cted=C2=A0the=C2=A0attention=C2=A0from=C2=A0many=C2=A0academic=C2=A0and=C2= =A0industrial=C2=A0organizations=C2=A0and=C2=A0researchers.=C2=A0=C2=A0</sp= an><span lang=3D"EN-US"></span></p><p class=3D"gmail-paragraph gmail-text-a= lign-type-justify" style=3D"text-align:justify;line-height:100%;margin:3pt = 0pt;font-family:=E7=AD=89=E7=BA=BF;font-size:12pt"><span style=3D"font-size= :11pt;color:rgb(51,51,51);letter-spacing:0pt;vertical-align:baseline">This= =C2=A0session=C2=A0proposal=C2=A0aims=C2=A0to=C2=A0collect=C2=A0new=C2=A0id= eas=C2=A0and=C2=A0developments=C2=A0in=C2=A0neural=C2=A0coding=C2=A0techniq= ues,=C2=A0including=C2=A0low=C2=A0bitrate=C2=A0and=C2=A0low=C2=A0latency=C2= =A0neural=C2=A0coding.=C2=A0We=C2=A0are=C2=A0also=C2=A0looking=C2=A0for=C2= =A0new=C2=A0solutions=C2=A0that=C2=A0enable=C2=A0the=C2=A0neural=C2=A0codec= =C2=A0to=C2=A0work=C2=A0with=C2=A0different=C2=A0functions=C2=A0such=C2=A0a= s=C2=A0packet=C2=A0loss=C2=A0concealment,=C2=A0noise=C2=A0reduction,=C2=A0v= oice=C2=A0conversion,=C2=A0TTS,=C2=A0audio=C2=A0band=C2=A0extension,=C2=A0a= nd=C2=A0AIGC-related=C2=A0topics,=C2=A0etc.</span><span lang=3D"EN-US"></sp= an></p><p class=3D"gmail-paragraph gmail-text-align-type-justify" style=3D"= text-align:justify;line-height:100%;margin:3pt 0pt;font-family:=E7=AD=89=E7= =BA=BF;font-size:12pt"><span style=3D"font-size:11pt;color:rgb(51,51,51);le= tter-spacing:0pt;vertical-align:baseline">Therefore,=C2=A0we=C2=A0propose= =C2=A0to=C2=A0apply=C2=A0a=C2=A0special=C2=A0session=C2=A0in=C2=A0INTERSPEE= CH=C2=A02024.=C2=A0Please=C2=A0feel=C2=A0free=C2=A0to=C2=A0contact=C2=A0us= =C2=A0if=C2=A0you=C2=A0have=C2=A0interest=C2=A0to=C2=A0contribute=C2=A0to= =C2=A0this=C2=A0special=C2=A0session.</span><span lang=3D"EN-US"></span></p= ><p class=3D"gmail-paragraph gmail-text-align-type-left" style=3D"line-heig= ht:100%;margin:3pt 0pt;font-family:=E7=AD=89=E7=BA=BF;font-size:12pt"><span= lang=3D"EN-US"></span></p><p class=3D"gmail-paragraph gmail-text-align-typ= e-left" style=3D"line-height:100%;margin:3pt 0pt;font-family:=E7=AD=89=E7= =BA=BF;font-size:12pt"><span style=3D"font-size:11pt;font-weight:bold;color= :rgb(51,51,51);letter-spacing:0pt;vertical-align:baseline"><br></span></p><= p class=3D"gmail-paragraph gmail-text-align-type-left" style=3D"line-height= :100%;margin:3pt 0pt;font-family:=E7=AD=89=E7=BA=BF;font-size:12pt"><span s= tyle=3D"font-size:11pt;font-weight:bold;color:rgb(51,51,51);letter-spacing:= 0pt;vertical-align:baseline">Organizers</span><span lang=3D"EN-US"></span><= /p><p class=3D"gmail-paragraph gmail-text-align-type-left" style=3D"line-he= ight:100%;margin:3pt 0pt;font-family:=E7=AD=89=E7=BA=BF;font-size:12pt"><sp= an style=3D"font-size:11pt;color:rgb(51,51,51);letter-spacing:0pt;vertical-= align:baseline">Wei=C2=A0XIAO=C2=A0(</span><a href=3D"denniswxiao@xxxxxxxx= om),"><span style=3D"font-size:11pt;color:rgb(30,111,255);letter-spacing:0p= t;vertical-align:baseline">denniswxiao@xxxxxxxx),</span></a><span style= =3D"white-space:pre;font-size:11pt;color:rgb(51,51,51);letter-spacing:0pt;v= ertical-align:baseline"> </span><span lang=3D"EN-US"></span></p><p class=3D= "gmail-paragraph gmail-text-align-type-left" style=3D"line-height:100%;marg= in:3pt 0pt;font-family:=E7=AD=89=E7=BA=BF;font-size:12pt"><span style=3D"fo= nt-size:11pt;color:rgb(51,51,51);letter-spacing:0pt;vertical-align:baseline= ">Tencent=C2=A0Ethereal=C2=A0Audio=C2=A0Lab</span><span lang=3D"EN-US"></sp= an></p><p class=3D"gmail-paragraph gmail-text-align-type-left" style=3D"lin= e-height:100%;margin:3pt 0pt;font-family:=E7=AD=89=E7=BA=BF;font-size:12pt"= ><span style=3D"font-size:11pt;color:rgb(51,51,51);letter-spacing:0pt;verti= cal-align:baseline">Prof.=C2=A0Jing=C2=A0WANG=C2=A0(</span><a href=3D"wangj= ing@xxxxxxxx),"><span style=3D"font-size:11pt;font-family:Calibri;color:r= gb(30,111,255);letter-spacing:0pt;vertical-align:baseline">wangjing@xxxxxxxx= .cn</span><span style=3D"font-size:11pt;color:rgb(30,111,255);letter-spacin= g:0pt;vertical-align:baseline">),</span></a><span style=3D"white-space:pre;= font-size:11pt;color:rgb(51,51,51);letter-spacing:0pt;vertical-align:baseli= ne"> </span><span lang=3D"EN-US"></span></p><p class=3D"gmail-paragraph gma= il-text-align-type-left" style=3D"line-height:100%;margin:3pt 0pt;font-fami= ly:=E7=AD=89=E7=BA=BF;font-size:12pt"><span style=3D"font-size:11pt;color:r= gb(51,51,51);letter-spacing:0pt;vertical-align:baseline">Beijing=C2=A0Insti= tute=C2=A0of=C2=A0Technology</span><span lang=3D"EN-US"></span></p><p class= =3D"gmail-paragraph gmail-text-align-type-left" style=3D"line-height:100%;m= argin:3pt 0pt;font-family:=E7=AD=89=E7=BA=BF;font-size:12pt"><span style=3D= "font-size:11pt;color:rgb(51,51,51);letter-spacing:0pt;vertical-align:basel= ine">Prof.=C2=A0Jingdong=C2=A0CHEN,=C2=A0IEEE=C2=A0Fellow=C2=A0(</span><a h= ref=3D"jingdongchen@xxxxxxxx)"><span style=3D"font-size:11pt;color:rgb(30,1= 11,255);letter-spacing:0pt;vertical-align:baseline">jingdongchen@xxxxxxxx)<= /span></a><span style=3D"font-size:11pt;color:rgb(51,51,51);letter-spacing:= 0pt;vertical-align:baseline">,=C2=A0</span><span lang=3D"EN-US"></span></p>= <p class=3D"gmail-paragraph gmail-text-align-type-left" style=3D"line-heigh= t:100%;margin:3pt 0pt;font-family:=E7=AD=89=E7=BA=BF;font-size:12pt"><span = style=3D"font-size:11pt;color:rgb(51,51,51);letter-spacing:0pt;vertical-ali= gn:baseline">Center=C2=A0of=C2=A0Intelligent=C2=A0Acoustics=C2=A0and=C2=A0I= mmersive=C2=A0Communications,=C2=A0</span><span lang=3D"EN-US"></span></p><= p class=3D"gmail-paragraph gmail-text-align-type-left" style=3D"line-height= :100%;margin:3pt 0pt;font-family:=E7=AD=89=E7=BA=BF;font-size:12pt"><span s= tyle=3D"font-size:11pt;color:rgb(51,51,51);letter-spacing:0pt;vertical-alig= n:baseline">Northwestern=C2=A0Polytechnical=C2=A0University</span><span lan= g=3D"EN-US"></span></p><p class=3D"gmail-paragraph gmail-text-align-type-le= ft" style=3D"line-height:100%;margin:3pt 0pt;font-family:=E7=AD=89=E7=BA=BF= ;font-size:12pt"><span style=3D"font-size:11pt;color:rgb(51,51,51);letter-s= pacing:0pt;vertical-align:baseline">Xuan=C2=A0ZHU=C2=A0(</span><a href=3D"x= uan.zhu@xxxxxxxx)"><span style=3D"font-size:11pt;color:rgb(30,111,255);l= etter-spacing:0pt;vertical-align:baseline">xuan.zhu@xxxxxxxx)</span></a>= <span style=3D"font-size:11pt;color:rgb(51,51,51);letter-spacing:0pt;vertic= al-align:baseline">,</span><span lang=3D"EN-US"></span></p><p class=3D"gmai= l-paragraph gmail-text-align-type-left" style=3D"line-height:130%;margin:3p= t 0pt;font-family:=E7=AD=89=E7=BA=BF;font-size:12pt"><span style=3D"font-si= ze:11pt;color:rgb(51,51,51);letter-spacing:0pt;vertical-align:baseline">Sam= sung=C2=A0Research=C2=A0China=C2=A0-=C2=A0Beijing</span><span lang=3D"EN-US= "></span></p></div></div></div></div> --000000000000e2691b060ce635c1--


This message came from the mail archive
postings/2023/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University