Subject: Re: what's the difference between the voice of two people? From: Siping Tao <siping.tao@xxxxxxxx> Date: Wed, 14 Nov 2012 18:34:03 +0800 List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>--14dae9340901c68cad04ce721212 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Etienne, Thank you very much! I will read these publications. When using voip in office, sometimes other people in the office are talking but with small volume, what I want to do is suppressing this unwanted voice. I think the first step is to identify some features that can distinguish them. Thanks, Siping On Wed, Nov 14, 2012 at 5:25 PM, Etienne Gaudrain <e.p.c.gaudrain@xxxxxxxx>w= rote: > Hi Siping, > > There's many cues. We studied two obvious that are directly related to th= e > anatomy of the speaker: glottal-pulse rate and vocal-tract length. There'= s > many papers on the topic from Patterson's lab ( > http://www.pdn.cam.ac.uk/groups/cnbh/research/publications/). One that > may directly answer your question is: > > Gaudrain, Etienne, S Li, VS Ban, and RD Patterson. =93The Role of Glotta= l > Pulse Rate and Vocal Tract Length in the Perception of Speaker Identity.= =94 > *Interspeech 2009: 10th Annual Conference of the International Speech > Communication Association* 1-5 (2009): 152=96155. > > Regards, > > -Etienne > > > > On 14/11/2012 09:05, Siping Tao wrote: > > hi experts, > > l am curious of what's the key feature that human can identify who is > talking, what's the difference between the voice signal of different > speakers? pitch? > > I have little knowledge about that, any papers about this topic is reall= y > appreciated! > > Thanks, > Siping > > > -- > Etienne Gaudrain, PhD > UMCG, Afdeling KNO > BB20 > PO Box 30.001 > 9700 RB Groningen > Netherlands > > Room P3.236 > Phone +31 5036 13290 > Skype egaudrain > > Note: emails to this address are limited to 10 MB. To send larger attachm= ents, please use egaudrain.cam@xxxxxxxx > > --14dae9340901c68cad04ce721212 Content-Type: text/html; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Etienne,<br><br>Thank you very much! I will read these publications.<br><br= >When using voip in office, sometimes other people in the office are talkin= g but with small volume, what I want to do is suppressing this unwanted voi= ce. I think the first step is to identify some features that can distinguis= h them.<br> <br>Thanks,<br>Siping<br><br><div class=3D"gmail_quote">On Wed, Nov 14, 201= 2 at 5:25 PM, Etienne Gaudrain <span dir=3D"ltr"><<a href=3D"mailto:e.p.= c.gaudrain@xxxxxxxx" target=3D"_blank">e.p.c.gaudrain@xxxxxxxx</a>></span>= wrote:<br> <blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p= x #ccc solid;padding-left:1ex"> =20 =20 =20 <div bgcolor=3D"#FFFFFF" text=3D"#000000"> Hi Siping,<br> <br> There's many cues. We studied two obvious that are directly related to the anatomy of the speaker: glottal-pulse rate and vocal-tract length. There's many papers on the topic from Patterson's lab (<a href=3D"http://www.pdn.cam.ac.uk/groups/cnbh/research/publications/= " target=3D"_blank">http://www.pdn.cam.ac.uk/groups/cnbh/research/publicati= ons/</a>). One that may directly answer your question is:<br> <br> =20 <div style=3D"line-height:1.35;padding-left:2em"> <div>Gaudrain, Etienne, S Li, VS Ban, and RD Patterson. =93The Role of Glottal Pulse Rate and Vocal Tract Length in the Perception of Speaker Identity.=94 <i>Interspeech 2009: 10th Annual Conference of the International Speech Communication Association</i> 1-5 (2009): 152=96155.</div> <span title=3D"url_ver=3DZ39.88-2004&ctx_ver=3DZ39.88-2004&rf= r_id=3Dinfo%3Asid%2Fzotero.org%3A2&rft_val_fmt=3Dinfo%3Aofi%2Ffmt%3Akev= %3Amtx%3Ajournal&rft.genre=3Darticle&rft.atitle=3DThe%20role%20of%2= 0glottal%20pulse%20rate%20and%20vocal%20tract%20length%20in%20the%20percept= ion%20of%20speaker%20identity&rft.jtitle=3DInterspeech%202009%3A%2010th= %20Annual%20Conference%20of%20the%20International%20Speech%20Communication%= 20Association&rft.stitle=3DInterspeech%202009&rft.volume=3D1-5&= rft.aufirst=3DEtienne&rft.aulast=3DGaudrain&rft.au=3DEtienne%20Gaud= rain&rft.au=3DS%20Li&rft.au=3DVS%20Ban&rft.au=3DRD%20Patterson&= amp;rft.date=3D2009&rft.pages=3D152-155&rft.spage=3D152&rft.epa= ge=3D155"></span></div> <br> Regards,<div><div class=3D"h5"><br> -Etienne<br> <br> <br> <br> <div>On 14/11/2012 09:05, Siping Tao wrote:<br> </div> <blockquote type=3D"cite"> <p dir=3D"ltr">hi experts, </p> <p dir=3D"ltr">l am curious of what's the key feature that human = can identify who is talking, what's the difference between the voic= e signal of different speakers? pitch?</p> <p dir=3D"ltr">I have little knowledge about that, any papers about= =A0 this topic is really appreciated!</p> <p dir=3D"ltr">Thanks,<br> Siping</p> </blockquote> <br> <pre cols=3D"72">--=20 Etienne Gaudrain, PhD UMCG, Afdeling KNO BB20 PO Box 30.001 9700 RB Groningen Netherlands Room P3.236 Phone +31 5036 13290 Skype egaudrain Note: emails to this address are limited to 10 MB. To send larger attachmen= ts, please use <a href=3D"mailto:egaudrain.cam@xxxxxxxx" target=3D"_blank"= >egaudrain.cam@xxxxxxxx</a>.</pre> </div></div></div> </blockquote></div><br> --14dae9340901c68cad04ce721212--