Subject: [AUDITORY] vacancy: Research Fellow in Machine Learning for Audio Captioning at University of Surrey, UK (deadline: 8th April 2021) From: Wenwu Wang <000000615c5e5fae-dmarc-request@xxxxxxxx> Date: Thu, 18 Mar 2021 16:03:20 +0000--_000_DB9PR06MB7724CF003C796DEFCE88B0DFBA699DB9PR06MB7724eurp_ Content-Type: text/plain; charset="Windows-1252" Content-Transfer-Encoding: quoted-printable RESEARCH FELLOW IN MACHINE LEARNING FOR AUDIO CAPTIONING Department of Electrical & Electronic Engineering Location: Guildford Salary: =A336,914 to =A339,152 Post Type: Full Time Advert Placed: Thursday 18 March 2021 Closing Date: Thursday 08 April 2021 Reference: 015821 Applications are invited for a Research Fellow (RF) position for 22 months = within the Centre for Vision Speech and Signal Processing (CVSSP), Universi= ty of Surrey, UK, to work on a project titled =93Automated Captioning of Im= age and Audio for Visually and Hearing Impaired=94, which is a collaborativ= e project between the University of Surrey and the Izmir Katip Celebi Unive= rsity (IKCU), Turkey, with project partners from charities and industrial s= ectors working with the hearing and visually impaired. This project aims to= address fundamental challenges in audio and image captioning, develop new = algorithms to improve performance of audio and image captioning algorithms,= and application tools that could be used by the hearing and visually impai= red to access audio and image content. The work at Surrey will focus on new methods and algorithms of automated au= dio captioning and natural language description of audio. This work is buil= t on the significant contributions of CVSSP in the area of acoustic scene a= nalysis, audio event detection, environmental sound recognition, and audio = tagging, together with preliminary results on audio captioning. This new pr= oject offers an opportunity to take this work to the next stages, and demon= strate the benefit of such technologies for the hearing and visually impair= ed. A smartphone based prototype will be developed for audio and visual cap= tioning jointly by Surrey and IKCU. New data will also be gathered, includi= ng audio-visual data for captioning, and user feedback for the prototype sy= stem. The postholder will be responsible for investigating and developing audio s= ignal processing, machine learning algorithms for natural language descript= ion of sound, and implementing software for prototyping the concept and alg= orithms. The postholder should have a doctoral level (or equivalent) resear= ch and development experience in electronic engineering, applied mathematic= s, computer science, artificial intelligence, machine learning, natural lan= guage processing, or related subjects. The postholder should ideally have e= xperience in one of the following areas: audio captioning, machine descript= ion of audio, audio classification, audio tagging, image captioning, video = captioning, translations between audio/image and texts, and/or translation = between audio and video. The post-holder will be based in CVSSP, and work under the direction of the= Principal Investigator Prof Wenwu Wang, with co-supervision by Prof Sabine= Braun, Director of the Centre for Translation Studies, at University of Su= rrey, and in collaboration with Dr Volkan Kilic, from the IKCU, Turkey. CVSSP is an International Centre of Excellence for research in Audio-Visual= Machine Perception, with over 150 researchers, a grant portfolio of =A324M= (=A317.5M EPSRC) from EPSRC, EU, InnovateUK, charity and industry, and a t= urnover of =A37M/annum. The Centre has state-of-the-art acoustic capture an= d analysis facilities and a Visual Media Lab with video and audio capture f= acilities supporting research in real-time video and audio processing and v= isualisation. CVSSP has a compute facility with 120 GPUs and >1PB of high-s= peed secure storage. For informal inquiries, please contact Prof Wenwu Wang (Email: w.wang@xxxxxxxx= y.ac.uk<mailto:w.wang@xxxxxxxx>; Web: http://personal.ee.surrey.ac.uk/P= ersonal/W.Wang/). https://jobs.surrey.ac.uk/vacancy.aspx?ref=3D015821 [https://jobs.surrey.ac.uk/org/images/socialmedia.png]<https://jobs.surrey.= ac.uk/vacancy.aspx?ref=3D015821> Job Vacancy at the University of Surrey: Research Fellow in Machine Learnin= g for Audio Captioning<https://jobs.surrey.ac.uk/vacancy.aspx?ref=3D015821> Applications are invited for a Research Fellow (RF) position for 22 months = within the Centre for Vision Speech and Signal Processing (CVSSP), Universi= ty of Surrey, UK, to work on a project titled =93Automated Captioning of Im= age and Audio for... jobs.surrey.ac.uk Please feel free to circulate the above vacancy to those who might be inter= ested. Apologies for cross-posting. Best wishes, Wenwu -- Professor Wenwu Wang Centre for Vision Speech and Signal Processing Department of Electronic Engineering University of Surrey Guildford GU2 7XH United Kingdom Phone: +44 (0) 1483 686039 Fax: +44 (0) 1483 686031 Email: w.wang@xxxxxxxx http://personal.ee.surrey.ac.uk/Personal/W.Wang/ --_000_DB9PR06MB7724CF003C796DEFCE88B0DFBA699DB9PR06MB7724eurp_ Content-Type: text/html; charset="Windows-1252" Content-Transfer-Encoding: quoted-printable <html> <head> <meta http-equiv=3D"Content-Type" content=3D"text/html; charset=3DWindows-1= 252"> <style type=3D"text/css" style=3D"display:none;"> P {margin-top:0;margin-bo= ttom:0;} </style> </head> <body dir=3D"ltr"> <div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size= : 12pt; color: rgb(0, 51, 0);"> <h1 style=3D"background-repeat:no-repeat;font-size:3.21429rem;color:rgb(0, = 0, 0);margin:0px 0px 0.15em;font-weight:normal;text-align:left;line-height:= 1em;font-family:corradine-neuron-black, neuron-extra-bold, HelveticaNeueBla= ckCondensed, HelveticaNeue-Black-Condensed, "Helvetica Neue Black Cond= ensed", HelveticaNeueBlack, HelveticaNeue-Black, HelveticaNeue-Condens= edBold, "Helvetica Neue Black", HelveticaNeue, "Helvetica Ne= ue", TeXGyreHerosCnBold, "Arial Narrow", Arial, sans-serif;f= ont-stretch:condensed;display:block;text-transform:uppercase;background-col= or:rgb(238, 238, 232)"> <span style=3D"font-size: 16pt; line-height: normal;">RESEARCH FELLOW IN MA= CHINE LEARNING FOR AUDIO CAPTIONING</span></h1> <h4 style=3D"background-repeat:no-repeat;margin:1em 0px 0.15em;line-height:= 0.95em;font-family:neuron-extra-bold, corradine-neuron-black, HelveticaNeue= BlackCondensed, HelveticaNeue-Black-Condensed, "Helvetica Neue Black C= ondensed", HelveticaNeueBlack, HelveticaNeue-Black, HelveticaNeue-Cond= ensedBold, "Helvetica Neue Black", HelveticaNeue, "Helvetica= Neue", TeXGyreHerosCnBold, "Arial Narrow", Arial, sans-seri= f;font-weight:normal;font-stretch:condensed;display:block;font-size:1.57143= rem;color:rgb(0, 0, 0);text-align:left;background-color:rgb(238, 238, 232)"= > <span style=3D"font-size: 14pt; line-height: normal;">Department of Electri= cal & Electronic Engineering</span></h4> <table style=3D"background-repeat:no-repeat;background-color:rgb(216, 215, = 202);border-collapse:collapse;font-size:0.9em;width:474.667px;margin-bottom= :0.5em;border:none;color:rgb(0, 0, 0);font-family:"Helvetica Neue"= ;, Helvetica, Arial, sans-serif;text-align:left"> <tbody style=3D"background-repeat:no-repeat;margin:5px"> <tr style=3D"background-repeat:no-repeat;border-bottom:1px solid rgba(200, = 198, 179, 0.2)"> <td style=3D"background-repeat:no-repeat;padding:0.4em;border-right:1px sol= id rgb(200, 198, 179);vertical-align:top"> <b style=3D"background-repeat:no-repeat">Location: </b></td> <td style=3D"background-repeat:no-repeat;padding:0.4em;border-right:none;ve= rtical-align:top"> Guildford</td> </tr> <tr style=3D"background-repeat:no-repeat;border-bottom:1px solid rgba(200, = 198, 179, 0.2)"> <td style=3D"background-repeat:no-repeat;padding:0.4em;border-right:1px sol= id rgb(200, 198, 179);vertical-align:top"> <b style=3D"background-repeat:no-repeat">Salary: </b></td> <td style=3D"background-repeat:no-repeat;padding:0.4em;border-right:none;ve= rtical-align:top"> =A336,914 to =A339,152</td> </tr> <tr style=3D"background-repeat:no-repeat;border-bottom:1px solid rgba(200, = 198, 179, 0.2)"> <td style=3D"background-repeat:no-repeat;padding:0.4em;border-right:1px sol= id rgb(200, 198, 179);vertical-align:top"> <b style=3D"background-repeat:no-repeat">Post Type: </b></td> <td style=3D"background-repeat:no-repeat;padding:0.4em;border-right:none;ve= rtical-align:top"> Full Time</td> </tr> <tr style=3D"background-repeat:no-repeat;border-bottom:1px solid rgba(200, = 198, 179, 0.2)"> <td style=3D"background-repeat:no-repeat;padding:0.4em;border-right:1px sol= id rgb(200, 198, 179);vertical-align:top"> <b style=3D"background-repeat:no-repeat">Advert Placed: </b></td> <td style=3D"background-repeat:no-repeat;padding:0.4em;border-right:none;ve= rtical-align:top"> Thursday 18 March 2021</td> </tr> <tr style=3D"background-repeat:no-repeat;border-bottom:1px solid rgba(200, = 198, 179, 0.2)"> <td style=3D"background-repeat:no-repeat;padding:0.4em;border-right:1px sol= id rgb(200, 198, 179);vertical-align:top"> <b style=3D"background-repeat:no-repeat">Closing Date: </b></td> <td style=3D"background-repeat:no-repeat;padding:0.4em;border-right:none;ve= rtical-align:top"> Thursday 08 April 2021</td> </tr> <tr style=3D"background-repeat:no-repeat;border-bottom:1px solid rgba(200, = 198, 179, 0.2)"> <td style=3D"background-repeat:no-repeat;padding:0.4em;border-right:1px sol= id rgb(200, 198, 179);vertical-align:top"> <b style=3D"background-repeat:no-repeat">Reference: </b></td> <td style=3D"background-repeat:no-repeat;padding:0.4em;border-right:none;ve= rtical-align:top"> 015821</td> </tr> </tbody> </table> <p style=3D"background-repeat:no-repeat;margin-top:0px;color:rgb(0, 0, 0);f= ont-family:"Helvetica Neue", Helvetica, Arial, sans-serif;font-si= ze:14px;text-align:left;background-color:rgb(238, 238, 232)"> Applications are invited for a Research Fellow (RF) position for 22 months = within the Centre for Vision Speech and Signal Processing (CVSSP), Universi= ty of Surrey, UK, to work on a project titled =93Automated Captioning of Im= age and Audio for Visually and Hearing Impaired=94, which is a collaborative project between the University of Su= rrey and the Izmir Katip Celebi University (IKCU), Turkey, with project par= tners from charities and industrial sectors working with the hearing and vi= sually impaired. This project aims to address fundamental challenges in audio and image captioning, develop n= ew algorithms to improve performance of audio and image captioning algorith= ms, and application tools that could be used by the hearing and visually im= paired to access audio and image content.</p> <p style=3D"background-repeat:no-repeat;margin-top:0px;color:rgb(0, 0, 0);f= ont-family:"Helvetica Neue", Helvetica, Arial, sans-serif;font-si= ze:14px;text-align:left;background-color:rgb(238, 238, 232)"> The work at Surrey will focus on new methods and algorithms of automated au= dio captioning and natural language description of audio. This work is buil= t on the significant contributions of CVSSP in the area of acoustic scene a= nalysis, audio event detection, environmental sound recognition, and audio tagging, together with prelimin= ary results on audio captioning. This new project offers an opportunity to = take this work to the next stages, and demonstrate the benefit of such tech= nologies for the hearing and visually impaired. A smartphone based prototype will be developed for audio and vis= ual captioning jointly by Surrey and IKCU. New data will also be gathered, = including audio-visual data for captioning, and user feedback for the proto= type system.</p> <p style=3D"background-repeat:no-repeat;margin-top:0px;color:rgb(0, 0, 0);f= ont-family:"Helvetica Neue", Helvetica, Arial, sans-serif;font-si= ze:14px;text-align:left;background-color:rgb(238, 238, 232)"> The postholder will be responsible for investigating and developing audio s= ignal processing, machine learning algorithms for natural language descript= ion of sound, and implementing software for prototyping the concept and alg= orithms. The postholder should have a doctoral level (or equivalent) research and development experience in el= ectronic engineering, applied mathematics, computer science, artificial int= elligence, machine learning, natural language processing, or related subjec= ts. The postholder should ideally have experience in one of the following areas: audio captioning, machine d= escription of audio, audio classification, audio tagging, image captioning,= video captioning, translations between audio/image and texts, and/or trans= lation between audio and video.</p> <p style=3D"background-repeat:no-repeat;margin-top:0px;color:rgb(0, 0, 0);f= ont-family:"Helvetica Neue", Helvetica, Arial, sans-serif;font-si= ze:14px;text-align:left;background-color:rgb(238, 238, 232)"> The post-holder will be based in CVSSP, and work under the direction of the= Principal Investigator Prof Wenwu Wang, with co-supervision by Prof Sabine= Braun, Director of the Centre for Translation Studies, at University of Su= rrey, and in collaboration with Dr Volkan Kilic, from the IKCU, Turkey.</p> <p style=3D"background-repeat:no-repeat;margin-top:0px;color:rgb(0, 0, 0);f= ont-family:"Helvetica Neue", Helvetica, Arial, sans-serif;font-si= ze:14px;text-align:left;background-color:rgb(238, 238, 232)"> CVSSP is an International Centre of Excellence for research in Audio-Visual= Machine Perception, with over 150 researchers, a grant portfolio of =A324M= (=A317.5M EPSRC) from EPSRC, EU, InnovateUK, charity and industry, and a t= urnover of =A37M/annum. The Centre has state-of-the-art acoustic capture and analysis facilities and a Visual Med= ia Lab with video and audio capture facilities supporting research in real-= time video and audio processing and visualisation. CVSSP has a compute faci= lity with 120 GPUs and >1PB of high-speed secure storage.</p> <p style=3D"background-repeat:no-repeat;margin-top:0px;color:rgb(0, 0, 0);f= ont-family:"Helvetica Neue", Helvetica, Arial, sans-serif;font-si= ze:14px;text-align:left;background-color:rgb(238, 238, 232)"> For informal inquiries, please contact Prof Wenwu Wang (Email:<span> <= /span><a href=3D"mailto:w.wang@xxxxxxxx" style=3D"background-repeat:no-= repeat;color:black;border-bottom:1px solid rgb(99, 184, 207);font-weight:no= rmal">w.wang@xxxxxxxx</a>; Web:<span> </span><a href=3D"http://per= sonal.ee.surrey.ac.uk/Personal/W.Wang/" style=3D"background-repeat:no-repea= t;color:black;border-bottom:1px solid rgb(99, 184, 207);font-weight:normal"= >http://personal.ee.surrey.ac.uk/Personal/W.Wang/</a>).</p> <br> </div> <div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size= : 12pt; color: rgb(0, 51, 0);"> <a href=3D"https://jobs.surrey.ac.uk/vacancy.aspx?ref=3D015821" id=3D"LPlnk= ">https://jobs.surrey.ac.uk/vacancy.aspx?ref=3D015821</a><br> </div> <div class=3D"_Entity _EType_OWALinkPreview _EId_OWALinkPreview _EReadonly_= 1"> <div id=3D"LPBorder_GTaHR0cHM6Ly9qb2JzLnN1cnJleS5hYy51ay92YWNhbmN5LmFzcHg:c= mVmPTAxNTgyMQ.." class=3D"LPBorder391164" style=3D"width: 100%; margin-top:= 16px; margin-bottom: 16px; position: relative; max-width: 800px; min-width= : 424px;"> <table id=3D"LPContainer391164" role=3D"presentation" style=3D"padding: 12p= x 36px 12px 12px; width: 100%; border-width: 1px; border-style: solid; bord= er-color: rgb(200, 200, 200); border-radius: 2px;"> <tbody> <tr valign=3D"top" style=3D"border-spacing: 0px;"> <td> <div id=3D"LPImageContainer391164" style=3D"position: relative; margin-righ= t: 12px; height: 160px; overflow: hidden;"> <a target=3D"_blank" id=3D"LPImageAnchor391164" href=3D"https://jobs.surrey= .ac.uk/vacancy.aspx?ref=3D015821"><img id=3D"LPThumbnailImageId391164" alt= =3D"" height=3D"160" style=3D"display: block;" width=3D"160" src=3D"https:/= /jobs.surrey.ac.uk/org/images/socialmedia.png"></a></div> </td> <td style=3D"width: 100%;"> <div id=3D"LPTitle391164" style=3D"font-size: 21px; font-weight: 300; margi= n-right: 8px; font-family: wf_segoe-ui_light, "Segoe UI Light", &= quot;Segoe WP Light", "Segoe UI", "Segoe WP", Taho= ma, Arial, sans-serif; margin-bottom: 12px;"> <a target=3D"_blank" id=3D"LPUrlAnchor391164" href=3D"https://jobs.surrey.a= c.uk/vacancy.aspx?ref=3D015821" style=3D"text-decoration: none; color: var(= --themePrimary);">Job Vacancy at the University of Surrey: Research Fellow = in Machine Learning for Audio Captioning</a></div> <div id=3D"LPDescription391164" style=3D"font-size: 14px; max-height: 100px= ; color: rgb(102, 102, 102); font-family: wf_segoe-ui_normal, "Segoe U= I", "Segoe WP", Tahoma, Arial, sans-serif; margin-bottom: 12= px; margin-right: 8px; overflow: hidden;"> Applications are invited for a Research Fellow (RF) position for 22 months = within the Centre for Vision Speech and Signal Processing (CVSSP), Universi= ty of Surrey, UK, to work on a project titled =93Automated Captioning of Im= age and Audio for...</div> <div id=3D"LPMetadata391164" style=3D"font-size: 14px; font-weight: 400; co= lor: rgb(166, 166, 166); font-family: wf_segoe-ui_normal, "Segoe UI&qu= ot;, "Segoe WP", Tahoma, Arial, sans-serif;"> jobs.surrey.ac.uk</div> </td> </tr> </tbody> </table> </div> </div> <div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size= : 12pt; color: rgb(0, 51, 0);"> <br> </div> <div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size= : 12pt; color: rgb(0, 51, 0);"> Please feel free to circulate the above vacancy to those who might be inter= ested. </div> <div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size= : 12pt; color: rgb(0, 51, 0);"> <br> </div> <div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size= : 12pt; color: rgb(0, 51, 0);"> Apologies for cross-posting. </div> <div> <div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size= : 12pt; color: rgb(0, 51, 0);"> <br> </div> <div id=3D"Signature"> <div> <meta content=3D"text/html; charset=3DUTF-8"> <div id=3D"divtagdefaultwrapper" dir=3D"ltr" style=3D"font-size:12pt; color= :#003300; font-family:Calibri,Arial,Helvetica,sans-serif"> <div name=3D"divtagdefaultwrapper" style=3D"font-family:Calibri,Arial,Helve= tica,sans-serif; font-size:; margin:0"> <div class=3D"BodyFragment"><font size=3D"2"> <div class=3D"PlainText"><font size=3D"3">Best wishes,<br> <br> Wenwu<br> <br> <br> <br> --<br> Professor Wenwu Wang<br> Centre for Vision Speech and Signal Processing <br> Department of Electronic Engineering <br> University of Surrey <br> Guildford GU2 7XH <br> United Kingdom<br> Phone: +44 (0) 1483 686039 <br> Fax: +44 (0) 1483 686031 <br> Email: w.wang@xxxxxxxx<br> <a href=3D"http://personal.ee.surrey.ac.uk/Personal/W.Wang/" class=3D"OWAAu= toLink">http://personal.ee.surrey.ac.uk/Personal/W.Wang/</a></font></div> <div class=3D"PlainText"><br> </div> </font></div> </div> </div> </div> </div> </div> <div> <div id=3D"appendonsend"></div> <div style=3D"font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12p= t; color:rgb(0,51,0)"> <br> </div> </div> </body> </html> --_000_DB9PR06MB7724CF003C796DEFCE88B0DFBA699DB9PR06MB7724eurp_--