Re: [AUDITORY] Spanish Speech Material (Male) (Cas)


Subject: Re: [AUDITORY] Spanish Speech Material (Male)
From:    Cas <"Smits, J.C.M. ">
Date:    Sun, 8 Feb 2026 19:47:41 +0000

Dear Jonathan,

TTS can indeed be a good option. We have compared synthetically generated speech material with the original material (using the MOS scale). This was also done for Spanish, and we found similar scores. Note that these systems generally have a limited bandwidth (up to 8 kHz).

Polspoel, S., Moore, D. R., Swanepoel, D. W., Kramer, S. E., & Smits, C. (2025). Automatic development of speech-in-noise hearing tests using machine learning. Scientific Reports, 15(1), 12878.

Best,

Cas Smits

From: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxx> On behalf of Massimo Grassi
Sent: Saturday 7 February 2026 09:23
To: AUDITORY@xxxxxxxx
Subject: Re: [AUDITORY] Spanish Speech Material (Male)

Dear Jonathan,

why don't you consider text-to-speech AI? The voices seem very good to my ears. Actually, I wanted to use these sounds in an experiment in which I want to present speech in various languages with the same voice timbre. In other words (and this is also a question I address to the list members, possibly experts in this topic), are these new AI voices bad?

All the best to everybody and greetings from the Alps, very close to the Olympics.

m

On Sat, 7 Feb 2026 at 06:18, Jonathan Regev (JTEG) <0000048aec16583c-dmarc-request@xxxxxxxx> wrote:

> Dear list members,
>
> We are currently preparing a study investigating speech-intelligibility scores in listeners with normal hearing.
> The study will be run in Spain and will use the Castilian Spanish HINT material as target speech (Huarte, 2008).
>
> We are looking for speech recordings in Castilian Spanish by male speakers, to be used as competing-talker maskers.
> For example, we are looking for recordings of male speakers reading news stories, books, etc.
>
> Any resource you may be aware of would be highly appreciated!
>
> Reference: Huarte, A. (2008). "The Castilian Spanish Hearing in Noise Test," Int J Audiol, 47, 369-370. doi:10.1080/14992020801908269
>
> Thanks and best,
> Jonathan Regev
> M.Sc., Ph.D.
> Scientist
>
> Eriksholm Research Centre
> Rørtangvej 20
> 3070 Snekkersten
> Denmark
> Email: jteg@xxxxxxxx
> Website: www.eriksholm.com

______________________________________________________
AmsterdamUMC disclaimer : www.amsterdamumc.org/nl/disclaimers.htm


This message came from the mail archive
postings/2026/
maintained by:
Dan Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University