Re: speech database (Etienne Gaudrain )


Subject: Re: speech database
From:    Etienne Gaudrain  <egaudrain.cam@xxxxxxxx>
Date:    Wed, 25 May 2016 08:59:34 +0200
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

--001a113dc86253d14f0533a537fc Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Dear Joe, I second Dick on this. It all depends on what you want to do, but often, speech databases don't include much information about the speakers (no age, or height). This is one of the reasons why we (and others) have used STRAIGHT so much in our research. Cheers, -Etienne On 25 May 2016 at 06:13, Richard F. Lyon <dicklyon@xxxxxxxx> wrote: > For parametric synthesis, STRAIGHT is a good package. > http://www.wakayama-u.ac.jp/~kawahara/STRAIGHTadv/index_e.html > > Dick > > > > On Tue, May 24, 2016 at 2:55 AM, Sollini, Joseph <j.sollini@xxxxxxxx> > wrote: > >> Dear list, >> >> >> >> I am looking for recommendations for speech databases of English >> speakers, containing male and female voices of different ages. Ideally = the >> voices would cover large range of speakers with both a range of similar = and >> dissimilar sounding speakers (i.e. across different pitch, vocal tract >> length, formant frequencies). Alternatively does anyone know of some >> realistic speech generation software that allows manipulation of these >> parameters and is suitable for generating large stimulus batteries? >> Basically I=E2=80=99m hoping to have a database of speakers that allows = me to (as >> much as possible) parametrise the characteristics of speakers voices. >> >> >> >> I have found a few potential databases and speech generation packages bu= t >> I=E2=80=99d really love to know which ones people prefer (and to know wh= ich ones >> I=E2=80=99ve missed). >> >> >> >> All help welcome! >> >> >> >> Joe >> >> >> >> >> >> >> >> Dr Joseph Sollini >> >> Post-doctoral researcher >> >> UCL Ear Institute >> >> >> >> >> > > --001a113dc86253d14f0533a537fc Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable <div dir=3D"ltr"><p>Dear Joe,</p> <p>I second Dick on this. It all depends on what you want to do, but=20 often, speech databases don&#39;t include much information about the=20 speakers (no age, or height). This is one of the reasons why we (and=20 others) have used STRAIGHT so much in our research.</p> <p>Cheers,<br> -Etienne</p></div><div class=3D"gmail_extra"><br><div class=3D"gmail_quote"= >On 25 May 2016 at 06:13, Richard F. Lyon <span dir=3D"ltr">&lt;<a href=3D"= mailto:dicklyon@xxxxxxxx" target=3D"_blank">dicklyon@xxxxxxxx</a>&gt;</span> = wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;bord= er-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div>For parametr= ic synthesis, STRAIGHT is a good package.<br><a href=3D"http://www.wakayama= -u.ac.jp/~kawahara/STRAIGHTadv/index_e.html" target=3D"_blank">http://www.w= akayama-u.ac.jp/~kawahara/STRAIGHTadv/index_e.html</a><br><br></div>Dick<br= ><br><div><br></div></div><div class=3D"HOEnZb"><div class=3D"h5"><div clas= s=3D"gmail_extra"><br><div class=3D"gmail_quote">On Tue, May 24, 2016 at 2:= 55 AM, Sollini, Joseph <span dir=3D"ltr">&lt;<a href=3D"mailto:j.sollini@xxxxxxxx= l.ac.uk" target=3D"_blank">j.sollini@xxxxxxxx</a>&gt;</span> wrote:<br><bl= ockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #= ccc solid;padding-left:1ex"> <div link=3D"#0563C1" vlink=3D"#954F72" lang=3D"EN-GB"> <div> <p class=3D"MsoNormal">Dear list,<u></u><u></u></p> <p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p> <p class=3D"MsoNormal">I am looking for recommendations for speech database= s of English speakers, containing male and female voices of different ages.= =C2=A0 Ideally the voices would cover large range of speakers with both a r= ange of similar and dissimilar sounding speakers (i.e. across different pitch, vocal tract length, formant frequen= cies).=C2=A0 Alternatively does anyone know of some realistic speech genera= tion software that allows manipulation of these parameters and is suitable = for generating large stimulus batteries?=C2=A0 Basically I=E2=80=99m hoping to have a database of speakers that allows me= to (as much as possible) parametrise the characteristics of speakers voice= s.<u></u><u></u></p> <p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p> <p class=3D"MsoNormal">I have found a few potential databases and speech ge= neration packages but I=E2=80=99d really love to know which ones people pre= fer (and to know which ones I=E2=80=99ve missed).<u></u><u></u></p> <p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p> <p class=3D"MsoNormal">All help welcome! <u></u><u></u></p> <p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p> <p class=3D"MsoNormal">Joe<span><u></u><u></u></span></p> <p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p> <div style=3D"border:none;border-bottom:solid windowtext 3.0pt;padding:0cm = 0cm 1.0pt 0cm"> <p class=3D"MsoNormal" style=3D"border:none;padding:0cm"><span><u></u>=C2= =A0<u></u></span></p> </div> <p class=3D"MsoNormal"><span><u></u>=C2=A0<u></u></span></p> <p class=3D"MsoNormal"><span>Dr Joseph Sollini<u></u><u></u></span></p> <p class=3D"MsoNormal"><span>Post-doctoral researcher<u></u><u></u></span><= /p> <p class=3D"MsoNormal"><span>UCL Ear Institute<u></u><u></u></span></p> <div style=3D"border:none;border-bottom:solid windowtext 3.0pt;padding:0cm = 0cm 1.0pt 0cm"> <p class=3D"MsoNormal" style=3D"border:none;padding:0cm"><span><u></u>=C2= =A0<u></u></span></p> </div> <p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p> </div> </div> </blockquote></div><br></div> </div></div></blockquote></div><br></div> --001a113dc86253d14f0533a537fc--


This message came from the mail archive
/var/www/html/postings/2016/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University