Re: speech database (Mark Huckvale )


Subject: Re: speech database
From:    Mark Huckvale  <m.huckvale@xxxxxxxx>
Date:    Wed, 25 May 2016 10:48:38 +0100
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

--------------33C5D84E27BDDEDF46C48F7F Content-Type: text/plain; charset="windows-1252"; format=flowed Content-Transfer-Encoding: quoted-printable X-MIME-Autoconverted: from 8bit to quoted-printable by edgeum1.it.mcgill.ca id u4P9mhbl013297 Two useful databases of British English speakers: Accents of the British Isles (280 speakers of mixed ages) http://www.thespeechark.com/abi-1-page.html VCTK Corpus (109 young speakers) http://homepages.inf.ed.ac.uk/jyamagis/page3/page58/page58.html Regards Mark On 24/05/2016 10:55, Sollini, Joseph wrote: > > Dear list, > > I am looking for recommendations for speech databases of English=20 > speakers, containing male and female voices of different ages. =20 > Ideally the voices would cover large range of speakers with both a=20 > range of similar and dissimilar sounding speakers (i.e. across=20 > different pitch, vocal tract length, formant frequencies). =20 > Alternatively does anyone know of some realistic speech generation=20 > software that allows manipulation of these parameters and is suitable=20 > for generating large stimulus batteries? Basically I=92m hoping to hav= e=20 > a database of speakers that allows me to (as much as possible)=20 > parametrise the characteristics of speakers voices. > > I have found a few potential databases and speech generation packages=20 > but I=92d really love to know which ones people prefer (and to know=20 > which ones I=92ve missed). > > All help welcome! > > Joe > > Dr Joseph Sollini > > Post-doctoral researcher > > UCL Ear Institute > --=20 Prof. Mark Huckvale Speech, Hearing and Phonetic Sciences www.ucl.ac.uk/pals/research/shaps --------------33C5D84E27BDDEDF46C48F7F Content-Type: text/html; charset="windows-1252" Content-Transfer-Encoding: quoted-printable X-MIME-Autoconverted: from 8bit to quoted-printable by edgeum1.it.mcgill.ca id u4P9mhbl013297 <html> <head> <meta content=3D"text/html; charset=3Dwindows-1252" http-equiv=3D"Content-Type"> </head> <body bgcolor=3D"#FFFFFF" text=3D"#000000"> <p>Two useful databases of British English speakers:</p> <p>Accents of the British Isles (280 speakers of mixed ages)</p> <p><a class=3D"moz-txt-link-freetext" href=3D"http://www.thespeechark= .com/abi-1-page.html">http://www.thespeechark.com/abi-1-page.html</a></p> <p>VCTK Corpus (109 young speakers)<br> </p> <a class=3D"moz-txt-link-freetext" href=3D"http://homepages.inf.ed.ac= .uk/jyamagis/page3/page58/page58.html">http://homepages.inf.ed.ac.uk/jyam= agis/page3/page58/page58.html</a><br> <br> Regards<br> <br> Mark<br> <br> <div class=3D"moz-cite-prefix">On 24/05/2016 10:55, Sollini, Joseph wrote:<br> </div> <blockquote cite=3D"mid:13681_1464149334_57452555_13681_86_1_VI1PR0101MB2221B4796CA0E= 6C094E27BDEAA4F0@xxxxxxxx" type=3D"cite"> <meta http-equiv=3D"Content-Type" content=3D"text/html; charset=3Dwindows-1252"> <meta name=3D"Generator" content=3D"Microsoft Word 15 (filtered medium)"> <style><!-- /* Font Definitions */ @xxxxxxxx {font-family:"Cambria Math"; panose-1:2 4 5 3 5 4 6 3 2 4;} @xxxxxxxx {font-family:Calibri; panose-1:2 15 5 2 2 2 4 3 2 4;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {margin:0cm; margin-bottom:.0001pt; font-size:11.0pt; font-family:"Calibri",sans-serif; mso-fareast-language:EN-US;} a:link, span.MsoHyperlink {mso-style-priority:99; color:#0563C1; text-decoration:underline;} a:visited, span.MsoHyperlinkFollowed {mso-style-priority:99; color:#954F72; text-decoration:underline;} span.EmailStyle17 {mso-style-type:personal-compose; font-family:"Calibri",sans-serif; color:windowtext;} .MsoChpDefault {mso-style-type:export-only; font-family:"Calibri",sans-serif; mso-fareast-language:EN-US;} @xxxxxxxx WordSection1 {size:612.0pt 792.0pt; margin:72.0pt 72.0pt 72.0pt 72.0pt;} div.WordSection1 {page:WordSection1;} --></style><!--[if gte mso 9]><xml> <o:shapedefaults v:ext=3D"edit" spidmax=3D"1026" /> </xml><![endif]--><!--[if gte mso 9]><xml> <o:shapelayout v:ext=3D"edit"> <o:idmap v:ext=3D"edit" data=3D"1" /> </o:shapelayout></xml><![endif]--> <div class=3D"WordSection1"> <p class=3D"MsoNormal">Dear list,<o:p></o:p></p> <p class=3D"MsoNormal"><o:p>=A0</o:p></p> <p class=3D"MsoNormal">I am looking for recommendations for speec= h databases of English speakers, containing male and female voices of different ages.=A0 Ideally the voices would cover large range of speakers with both a range of similar and dissimilar sounding speakers (i.e. across different pitch, vocal tract length, formant frequencies).=A0 Alternatively does anyone know of some realistic speech generation software that allows manipulation of these parameters and is suitable for generating large stimulus batteries?=A0 Basically I=92m hoping = to have a database of speakers that allows me to (as much as possible) parametrise the characteristics of speakers voices.<o= :p></o:p></p> <p class=3D"MsoNormal"><o:p>=A0</o:p></p> <p class=3D"MsoNormal">I have found a few potential databases and speech generation packages but I=92d really love to know which ones people prefer (and to know which ones I=92ve missed).<o:p>= </o:p></p> <p class=3D"MsoNormal"><o:p>=A0</o:p></p> <p class=3D"MsoNormal">All help welcome! <o:p></o:p></p> <p class=3D"MsoNormal"><o:p>=A0</o:p></p> <p class=3D"MsoNormal">Joe<span style=3D"mso-fareast-language:EN-= GB"><o:p></o:p></span></p> <p class=3D"MsoNormal"><o:p>=A0</o:p></p> <div style=3D"mso-element:para-border-div;border:none;border-bottom:= solid windowtext 3.0pt;padding:0cm 0cm 1.0pt 0cm"> <p class=3D"MsoNormal" style=3D"border:none;padding:0cm"><span style=3D"mso-fareast-language:EN-GB"><o:p>=A0</o:p></span><= /p> </div> <p class=3D"MsoNormal"><span style=3D"mso-fareast-language:EN-GB"= ><o:p>=A0</o:p></span></p> <p class=3D"MsoNormal"><span style=3D"mso-fareast-language:EN-GB"= >Dr Joseph Sollini<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"mso-fareast-language:EN-GB"= >Post-doctoral researcher<o:p></o:p></span></p> <p class=3D"MsoNormal"><span style=3D"mso-fareast-language:EN-GB"= >UCL Ear Institute<o:p></o:p></span></p> <div style=3D"mso-element:para-border-div;border:none;border-bottom:= solid windowtext 3.0pt;padding:0cm 0cm 1.0pt 0cm"> <p class=3D"MsoNormal" style=3D"border:none;padding:0cm"><span style=3D"mso-fareast-language:EN-GB"><o:p>=A0</o:p></span><= /p> </div> <p class=3D"MsoNormal"><o:p>=A0</o:p></p> </div> </blockquote> <br> <pre class=3D"moz-signature" cols=3D"72">--=20 Prof. Mark Huckvale Speech, Hearing and Phonetic Sciences <a class=3D"moz-txt-link-abbreviated" href=3D"http://www.ucl.ac.uk/pals/r= esearch/shaps">www.ucl.ac.uk/pals/research/shaps</a> </pre> </body> </html> --------------33C5D84E27BDDEDF46C48F7F--


This message came from the mail archive
/var/www/html/postings/2016/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University