Re: [AUDITORY] cleaning background noise (murmur, clatter, ...) (Wasmann, Jan-Willem)


Subject: Re: [AUDITORY] cleaning background noise (murmur, clatter, ...)
From:    Wasmann, Jan-Willem <"Wasmann, Jan-Willem">
Date:    Fri, 11 Apr 2025 07:21:04 +0000

--_000_DBBSPR01MB001282856939C48865279DE0DFB62DBBSPR01MB0012eu_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Hi Esra, A very easy tool for editing recordings containing speech is descript. It w= ill read in the recording and use speech to text to create a transcription.= You can then edit the transcript as if you are working in a word editor. R= emoving a line will remove the audio and video of that portion. In addition= , you can use filters (noise canceling etc) to improve sound quality. I hav= e used this a year ago for some podcast recordings and it worked well. Not = sure how the software developed in the meantime. I don't have the paid vers= ion anymore. You pay for the file size so video editing will be much more e= xpensive than audio-only editing. In what language did you record? It doesn= 't support all languages yet. Also my accent in English introduced many tra= nscription errors. Descript: AI-Powered Podcast & Audio Editor<https://www.descript.com/podcas= ting?utm_source=3Dgoogle&utm_medium=3Dpaid&utm_campaign=3D16259794211&utm_c= ampaignname=3DGoogle_Search_Brand_T1&utm_content=3D163844740293&utm_term=3D= descript%20audio%20tools&utm_kwid=3Dkwd-2317408635568&gclid=3DEAIaIQobChMI-= fLwubfPjAMV7LKDBx1nnAc6EAAYASAAEgKg4PD_BwE&utm_ad_id=3D705690567761&utm_cam= paigntype=3Dpaid_search&gad_source=3D1> All my best, Jan-Willem Van: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxx> = Namens Esra Mungan Verzonden: donderdag 10 april 2025 13:33 Aan: AUDITORY@xxxxxxxx Onderwerp: [AUDITORY] cleaning background noise (murmur, clatter, ...) dear list, since more than a year we are doing a series of weekly public science lectu= res (as part of a collective resistance against a political crash attempt a= gainst our university, hence we call our initiative "academy-in-resistance = for free, autonomous, and democratic universities") at a central public lib= rary in istanbul. we have wonderful attendance, yet, there are, of course, many people who ca= nnot attend these lectures in person. all lectures are recorded but because the sound and recording conditions ar= e not ideal (particularly on warm days we need to keep windows open hence w= e get some background city noise; but also our recording devices and the wh= ole bluetooth sound system are rather modest) we now want to improve the li= stening quality of our recorded lectures (https://www.youtube.com/@xxxxxxxx= kademi; last years' lectures were added under a more general academic youtu= be account of ours, https://www.youtube.com/@xxxxxxxx). a colleague brought these programs to our attention, * https://flixier.com/tools/audio-enhancer * https://www.kapwing.com/tools/audio-editor/audio-enhancer * https://www.geeksforgeeks.org/top-free-video-audio-enhancer-tools/ * https://www.canva.com/features/audio-enhancer/ * https://voice.ai/tools/audio-enhancer however, I understand that these can only be used once the videos are uploa= ded. what we need though is a way to do the cleaning beforehand (each video= recording is about 17-35GB in size). another supporter offered to do it o= n adobe premiere and the results were indeed great in terms of speaker clar= ity, however this time, the timbre of the speaker had quite changed. :) so= me of them are well-known scholars, hence such major changes in voice chara= cter feels really alienating. so we are stuck now. is there anything you can suggest us? many thanks, esra Esra Mungan, PhD Associate Professor (left Bogazici University since Dec. 2024) https://psychology.bogazici.edu.tr/content/esra-mungan (TK/EN) https://universitybogazici.wordpress.com/ (TK/EN) https://direnenakademi.wordpress.com/ (TK) De informatie in dit bericht is uitsluitend bestemd voor de geadresseerde. = Aan dit bericht en de bijlagen kunnen geen rechten worden ontleend. Heeft u= deze e-mail onbedoeld ontvangen? Dan verzoeken wij u het te vernietigen en= de afzender te informeren. Openbaar maken, kopi?ren en verspreiden van dez= e e-mail of informatie uit deze e-mail is alleen toegestaan met voorafgaand= e schriftelijke toestemming van de afzender. Het Radboudumc staat geregistr= eerd bij de Kamer van Koophandel in het handelsregister onder nummer 802627= 83. The content of this message is intended solely for the addressee. No rights= can be derived from this message or its attachments. If you are not the in= tended recipient, we kindly request you to delete the message and inform th= e sender. It is strictly prohibited to disclose, copy or distribute this em= ail or the information inside it, without a written consent from the sender= . Radboud university medical center is registered with the Dutch Chamber of= Commerce trade register with number 80262783. --_000_DBBSPR01MB001282856939C48865279DE0DFB62DBBSPR01MB0012eu_ Content-Type: text/html; charset="us-ascii" Content-Transfer-Encoding: quoted-printable <html xmlns:v=3D"urn:schemas-microsoft-com:vml" xmlns:o=3D"urn:schemas-micr= osoft-com:office:office" xmlns:w=3D"urn:schemas-microsoft-com:office:word" = xmlns:m=3D"http://schemas.microsoft.com/office/2004/12/omml" xmlns=3D"http:= //www.w3.org/TR/REC-html40"> <head> <meta http-equiv=3D"Content-Type" content=3D"text/html; charset=3Dus-ascii"= > <meta name=3D"Generator" content=3D"Microsoft Word 15 (filtered medium)"> <style><!-- /* Font Definitions */ @xxxxxxxx {font-family:Wingdings; panose-1:5 0 0 0 0 0 0 0 0 0;} @xxxxxxxx {font-family:"Cambria Math"; panose-1:2 4 5 3 5 4 6 3 2 4;} @xxxxxxxx {font-family:Calibri; panose-1:2 15 5 2 2 2 4 3 2 4;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {margin:0cm; font-size:11.0pt; font-family:"Calibri",sans-serif;} a:link, span.MsoHyperlink {mso-style-priority:99; color:#0563C1; text-decoration:underline;} .MsoChpDefault {mso-style-type:export-only; font-size:10.0pt; mso-ligatures:none;} @xxxxxxxx WordSection1 {size:612.0pt 792.0pt; margin:72.0pt 72.0pt 72.0pt 72.0pt;} div.WordSection1 {page:WordSection1;} /* List Definitions */ @xxxxxxxx l0 {mso-list-id:287394170; mso-list-template-ids:215400652;} @xxxxxxxx l0:level1 {mso-level-number-format:bullet; mso-level-text:\F0B7; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Symbol;} @xxxxxxxx l0:level2 {mso-level-number-format:bullet; mso-level-text:\F0B7; mso-level-tab-stop:72.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Symbol;} @xxxxxxxx l0:level3 {mso-level-number-format:bullet; mso-level-text:\F0B7; mso-level-tab-stop:108.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Symbol;} @xxxxxxxx l0:level4 {mso-level-number-format:bullet; mso-level-text:\F0B7; mso-level-tab-stop:144.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Symbol;} @xxxxxxxx l0:level5 {mso-level-number-format:bullet; mso-level-text:\F0B7; mso-level-tab-stop:180.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Symbol;} @xxxxxxxx l0:level6 {mso-level-number-format:bullet; mso-level-text:\F0B7; mso-level-tab-stop:216.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Symbol;} @xxxxxxxx l0:level7 {mso-level-number-format:bullet; mso-level-text:\F0B7; mso-level-tab-stop:252.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Symbol;} @xxxxxxxx l0:level8 {mso-level-number-format:bullet; mso-level-text:\F0B7; mso-level-tab-stop:288.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Symbol;} @xxxxxxxx l0:level9 {mso-level-number-format:bullet; mso-level-text:\F0B7; mso-level-tab-stop:324.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Symbol;} @xxxxxxxx l1 {mso-list-id:1587154470; mso-list-template-ids:-735388756;} @xxxxxxxx l1:level1 {mso-level-number-format:bullet; mso-level-text:\F0B7; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Symbol;} @xxxxxxxx l1:level2 {mso-level-number-format:bullet; mso-level-text:o; mso-level-tab-stop:72.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:"Courier New"; mso-bidi-font-family:"Times New Roman";} @xxxxxxxx l1:level3 {mso-level-number-format:bullet; mso-level-text:\F0A7; mso-level-tab-stop:108.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Wingdings;} @xxxxxxxx l1:level4 {mso-level-number-format:bullet; mso-level-text:\F0A7; mso-level-tab-stop:144.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Wingdings;} @xxxxxxxx l1:level5 {mso-level-number-format:bullet; mso-level-text:\F0A7; mso-level-tab-stop:180.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Wingdings;} @xxxxxxxx l1:level6 {mso-level-number-format:bullet; mso-level-text:\F0A7; mso-level-tab-stop:216.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Wingdings;} @xxxxxxxx l1:level7 {mso-level-number-format:bullet; mso-level-text:\F0A7; mso-level-tab-stop:252.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Wingdings;} @xxxxxxxx l1:level8 {mso-level-number-format:bullet; mso-level-text:\F0A7; mso-level-tab-stop:288.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Wingdings;} @xxxxxxxx l1:level9 {mso-level-number-format:bullet; mso-level-text:\F0A7; mso-level-tab-stop:324.0pt; mso-level-number-position:left; text-indent:-18.0pt; mso-ansi-font-size:10.0pt; font-family:Wingdings;} ol {margin-bottom:0cm;} ul {margin-bottom:0cm;} --></style><!--[if gte mso 9]><xml> <o:shapedefaults v:ext=3D"edit" spidmax=3D"1026" /> </xml><![endif]--><!--[if gte mso 9]><xml> <o:shapelayout v:ext=3D"edit"> <o:idmap v:ext=3D"edit" data=3D"1" /> </o:shapelayout></xml><![endif]--> </head> <body lang=3D"NL" link=3D"#0563C1" vlink=3D"#954F72" style=3D"word-wrap:bre= ak-word"> <div class=3D"WordSection1"> <p class=3D"MsoNormal"><span style=3D"mso-fareast-language:EN-US">Hi Esra,<= o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"mso-fareast-language:E= N-US">A very easy tool for editing recordings containing speech is descript= . It will read in the recording and use speech to text to create a transcri= ption. You can then edit the transcript as if you are working in a word editor. Removing a line will remove the au= dio and video of that portion. In addition, you can use filters (noise canc= eling etc) to improve sound quality. I have used this a year ago for some p= odcast recordings and it worked well. Not sure how the software developed in the meantime. I don&#8217;t h= ave the paid version anymore. You pay for the file size so video editing wi= ll be much more expensive than audio-only editing. In what language did you= record? It doesn&#8217;t support all languages yet. Also my accent in English introduced many transcription errors.<o:p><= /o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"mso-fareast-language:E= N-US"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><a href=3D"https://www.descript.com/podcasting?utm_s= ource=3Dgoogle&amp;utm_medium=3Dpaid&amp;utm_campaign=3D16259794211&amp;utm= _campaignname=3DGoogle_Search_Brand_T1&amp;utm_content=3D163844740293&amp;u= tm_term=3Ddescript%20audio%20tools&amp;utm_kwid=3Dkwd-2317408635568&amp;gcl= id=3DEAIaIQobChMI-fLwubfPjAMV7LKDBx1nnAc6EAAYASAAEgKg4PD_BwE&amp;utm_ad_id= =3D705690567761&amp;utm_campaigntype=3Dpaid_search&amp;gad_source=3D1"><spa= n lang=3D"EN-US">Descript: AI-Powered Podcast &amp; Audio Editor</span></a><o:p></o:p></p> <p class=3D"MsoNormal"><span lang=3D"EN-US"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US">All my best,<o:p></o:p></span><= /p> <p class=3D"MsoNormal"><span lang=3D"EN-US">Jan-Willem </span><span lang=3D= "EN-US" style=3D"mso-fareast-language:EN-US"><o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"mso-fareast-language:E= N-US"><o:p>&nbsp;</o:p></span></p> <div> <div style=3D"border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm = 0cm 0cm"> <p class=3D"MsoNormal"><b>Van:</b> AUDITORY - Research in Auditory Percepti= on &lt;AUDITORY@xxxxxxxx&gt; <b>Namens </b>Esra Mungan<br> <b>Verzonden:</b> donderdag 10 april 2025 13:33<br> <b>Aan:</b> AUDITORY@xxxxxxxx<br> <b>Onderwerp:</b> [AUDITORY] cleaning background noise (murmur, clatter, ..= .)<o:p></o:p></p> </div> </div> <p class=3D"MsoNormal"><o:p>&nbsp;</o:p></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black">dear list,<o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black">since more than a year we are doing a series of weekly public scien= ce lectures </span><i><span lang=3D"EN-US" style=3D"font-size:10.0pt;color:#548235">(as= part of a collective resistance against a political crash attempt against = our university, hence we call our initiative &#8220;academy-in-resistance f= or free, autonomous, and democratic universities&#8221;)</span></i><span la= ng=3D"EN-US" style=3D"font-size:12.0pt;color:#548235"> </span><span lang=3D"EN-US" style=3D"font-size:12.0pt;color:black">at a cen= tral public library in istanbul. <o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black">we have wonderful attendance, yet, there are, of course, many peopl= e who cannot attend these lectures in person.&nbsp; <o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black">all lectures are recorded but because the sound and recording condi= tions are not ideal </span><i><span lang=3D"EN-US" style=3D"font-size:10.0pt;color:#548235">(pa= rticularly on warm days we need to keep windows open hence we get some back= ground city noise; but also our recording devices and the whole bluetooth s= ound system are rather modest)</span></i><span lang=3D"EN-US" style=3D"font= -size:12.0pt;color:black"> we now want to improve the listening quality of our recorded lectures </sp= an><i><span lang=3D"EN-US" style=3D"font-size:10.0pt;color:#548235">(<a hre= f=3D"https://www.youtube.com/@xxxxxxxx"><span style=3D"color:#548235"= >https://www.youtube.com/@xxxxxxxx</span></a>; last years&#8217; lectures were added under a more general academic youtub= e account of ours, <a href=3D"https://www.youtube.com/@xxxxxxxx"><span style=3D"col= or:#548235">https://www.youtube.com/@xxxxxxxx</span></a>)</span>= </i><span lang=3D"EN-US" style=3D"font-size:12.0pt;color:black">.<o:p></o:p= ></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black">a colleague brought these programs to our attention,<o:p></o:p></sp= an></p> <ul style=3D"margin-top:0cm" type=3D"disc"> <li class=3D"MsoNormal" style=3D"color:black;mso-list:l1 level1 lfo3"><span= lang=3D"EN-US" style=3D"font-size:12.0pt"><a href=3D"https://flixier.com/t= ools/audio-enhancer" target=3D"_blank">https://flixier.com/tools/audio-enha= ncer</a><o:p></o:p></span></li><li class=3D"MsoNormal" style=3D"color:black= ;mso-list:l1 level1 lfo3"><span lang=3D"EN-US" style=3D"font-size:12.0pt"><= a href=3D"https://www.kapwing.com/tools/audio-editor/audio-enhancer" target= =3D"_blank">https://www.kapwing.com/tools/audio-editor/audio-enhancer</a><o= :p></o:p></span></li><li class=3D"MsoNormal" style=3D"color:black;mso-list:= l1 level1 lfo3"><span lang=3D"EN-US" style=3D"font-size:12.0pt"><a href=3D"= https://www.geeksforgeeks.org/top-free-video-audio-enhancer-tools/" target= =3D"_blank">https://www.geeksforgeeks.org/top-free-video-audio-enhancer-too= ls/</a><o:p></o:p></span></li><li class=3D"MsoNormal" style=3D"color:black;= mso-list:l1 level1 lfo3"><span lang=3D"EN-US" style=3D"font-size:12.0pt"><a= href=3D"https://www.canva.com/features/audio-enhancer/" target=3D"_blank">= https://www.canva.com/features/audio-enhancer/</a><o:p></o:p></span></li><l= i class=3D"MsoNormal" style=3D"color:black;mso-list:l1 level1 lfo3"><span l= ang=3D"EN-US" style=3D"font-size:12.0pt"><a href=3D"https://voice.ai/tools/= audio-enhancer" target=3D"_blank">https://voice.ai/tools/audio-enhancer</a>= <o:p></o:p></span></li></ul> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black">however, I understand that these can only be used once the videos a= re uploaded. what we need though is <b>a way to do the cleaning beforehand</b> (each video recording is about 1= 7-35GB in size). &nbsp;another supporter offered to do it on adobe premiere= and the results were indeed great in terms of speaker clarity, however thi= s time, the timbre of the speaker had quite changed. :)&nbsp; some of them are well-known scholars, hence such m= ajor changes in voice character feels really alienating.&nbsp; so we are st= uck now.<o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black">is there anything you can suggest us?<o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black">many thanks,<o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black">esra<o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black"><o:p>&nbsp;</o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:12.0pt;color= :black">Esra Mungan, PhD<o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:8.0pt;color:= black">Associate Professor<o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:8.0pt;color:= black">(left Bogazici University since Dec. 2024)<o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:8.0pt;color:= black"><a href=3D"https://psychology.bogazici.edu.tr/content/esra-mungan"><= span lang=3D"DE">https://psychology.bogazici.edu.tr/content/esra-mungan</sp= an></a></span><span lang=3D"DE" style=3D"font-size:8.0pt;color:black"> (TK/EN)<o:p></o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:8.0pt;color:= black"><a href=3D"https://universitybogazici.wordpress.com/"><span lang=3D"= DE">https://universitybogazici.wordpress.com/</span></a> </span><span lang=3D"DE" style=3D"font-size:8.0pt;color:black">(TK/EN)<o:p>= </o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"EN-US" style=3D"font-size:8.0pt;color:= black"><a href=3D"https://direnenakademi.wordpress.com/"><span lang=3D"DE">= https://direnenakademi.wordpress.com/</span></a> </span><span lang=3D"DE" style=3D"font-size:8.0pt;color:black">(TK) <o:p></= o:p></span></p> <p class=3D"MsoNormal"><span lang=3D"DE"><o:p>&nbsp;</o:p></span></p> </div> <p style=3D"font-size:13px;font-style:italic;font-family:arial;">De informa= tie in dit bericht is uitsluitend bestemd voor de geadresseerde. Aan dit be= richt en de bijlagen kunnen geen rechten worden ontleend. Heeft u deze e-ma= il onbedoeld ontvangen? Dan verzoeken wij u het te vernietigen en de afzender te informeren. Openbaar maken, kop= i&euml;ren en verspreiden van deze e-mail of informatie uit deze e-mail is = alleen toegestaan met voorafgaande schriftelijke toestemming van de afzende= r. Het Radboudumc staat geregistreerd bij de Kamer van Koophandel in het handelsregister onder nummer 80262783.<= br> <br> The content of this message is intended solely for the addressee. No rights= can be derived from this message or its attachments. If you are not the in= tended recipient, we kindly request you to delete the message and inform th= e sender. It is strictly prohibited to disclose, copy or distribute this email or the information inside it, w= ithout a written consent from the sender. Radboud university medical center= is registered with the Dutch Chamber of Commerce trade register with numbe= r 80262783. <br> </p> </body> </html> --_000_DBBSPR01MB001282856939C48865279DE0DFB62DBBSPR01MB0012eu_--


This message came from the mail archive
postings/2025/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University