Replies to inquiry re: modifying speech signal (David Schwartz )

Subject: Replies to inquiry re: modifying speech signal From: David Schwartz <schwartz@xxxxxxxx> Date: Fri, 8 Dec 2006 09:55:38 -0500 List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY> --Apple-Mail-1-451460928 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=WINDOWS-1252; delsp=yes; format=flowed Thank you to everyone who responded to my inquiry regarding how to =20 modify a speech signal so as to alter the apparent vocal tract length =20= of the speaker. I have compiled most of the responses below. =20 Apologies if I missed your reply. David *********************************************** Hi David, Check out Roy Patterson=92s web site for info (specifically the =20 publications related to =93size perception=94). http://www.mrc-cbu.cam.ac.uk/~roy/ Cheers, --Maria Chait *********************************************** Praat (www.praat.org) is a free speech analysis/synthesis program =20 that can do this, apparently quite easily. I've not used this =20 particular function myself (it's part of the "change gender" =20 mechanism) but the built-in help manual is quite good. Hope this helps, -Alex Francis *********************************************** Dear David, It's fairly straightforward acoustics. Longer tubes yield lower =20 resonances (formants). In general, you simply increase or decrease =20 formant center frequencies by some percentage. I must note, however, that such simple transformations are not quite =20 right because talkers can employ other articulatory maneuvers to =20 adjust effective formant frequencies, especially effective F1. For =20 example, F1 with female talkers (generally shorter vocal tracts) is =20 not as high as would be predicted on vocal tract length alone even =20 though F2, F3, etc. are higher. I think women do this by adjusting =20 the source spectrum (laryngeal control) in a way that makes the =20 acoustic spectral peak appear at a lower frequency than the resonant =20 frequency. But, that's a longer explanation. -- Keith Kluender *********************************************** Hi David, A longer vocal tract length will in general lower the frequency of =20 the formants, and a shorter one raise their frequency. A simplified =20 transform to achieve the effect of would be to multiply the frequency =20= of the formants by the the inverse of the amount you want to vary the =20= vocal tract by, e.g. multiply the formant frequency by 0.5 if you =20 want double the vocal tract length, or by 2 if you want to half the =20 vocal tract length. Typically the spectral shape of the voie signal =20 is attributed to the vocal tract, so how you do this depends on the =20 spectral shape estimation technique you are using. One way to do this would be to transpose a tone without formant =20 preservation (e.g time stretching follwoed by resampling), and use =20 the spectral shape of the transposed tone to shape a spectrally flat =20 source at the original pitch. For example to lengthen the vocal =20 tract by a factor of two, transpose the sample down one octave, then =20 use the spectral shape of the tranposed sample to shape a spectrally =20 flat source at the original pitch. Alternatively, if you are using a sinusoidal model you could multiply =20= the frequencies of each partial by the inverse of the amount you want =20= to vary the vocal tract length by (0.5 to double the length) and use =20 the shape defined by the resulting spectrum to define the spectral =20 shape. To get the amplitude of the transformed partials interpolate =20 the amplitudes at the original frequency using the new spectral shape. This should give you a first approximation of the effect of varying =20 the vocal tract length, and should work pretty well for relatively =20 small variations. Sean O'Leary *********************************************** Hello David, We use Kawahara-san's vocoder STRAIGHT. It works very well on clean =20 speech recordings. We describe its use in several papers where we =20 scaled speech sounds over a wide range. Smith, D. R. R., Patterson, R. D., Turner, R., Kawahara, H., and =20 Irino, T. (2005). "The processing and perception of size information =20= in speech sounds," J. Acoust. Soc. Am. 117, 305-318. Ives, D. T., Smith, D. R. R. and Patterson, R. D. (2005). =20 "Discrimination of speaker size from syllable phrases," J. Acoust. =20 Soc. Am. 118 (6), 3816-3822. Smith, D. R. R. and Patterson, R. D. (2005). "The interaction of =20 glottal-pulse rate and vocal-tract length in judgements of speaker =20 size, sex and age," J. Acoust. Soc. Am. 118, 3177-3186. It also works on some musical instruments by the way. Dinther, R. van, Patterson, R. D. (2006). =93Perception of acoustic =20 scale and size in musical instrument sounds,=94 J. Acoust. Soc. Am. =20 120, 2158-2176. You can also use PRAAT from Peter Boersema in Amsterdam. Another useful application is PSOLA from Eindhoven. Regards Roy Patterson *********************************************** It seems that you need "resynthesis". You can google it and will get you a couple of hundreds. Festival, Synthworks, HLSyn, Praat are some of the most popular =20 options out there. Heriberto Avelino *********************************************** Dear David, I am not sure that this is what you want to achieve, but Praat (available free at www.praat.org) has this interesting "Change gender" button, which allows you to shift all formants up or down by a given ratio independent of F0. Holger Mitterer *********************************************** STRAIGHT is a program that has been used to modify perceived vocal =20 tract length. See papers by Patterson RD and colleagues and the following pages: http://www.wakayama-u.ac.jp/~kawahara/index-e.html http://www.wakayama-u.ac.jp/~kawahara/STRAIGHTtrial/ It's available free to academics. Christopher Long ***********************************************= --Apple-Mail-1-451460928 Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=WINDOWS-1252 <HTML><BODY style=3D"word-wrap: break-word; -khtml-nbsp-mode: space; = -khtml-line-break: after-white-space; "><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: 13.3px;">Thank you to = everyone who responded to my inquiry regarding how to modify a speech = signal so as to alter the apparent vocal tract length of the speaker. I = have compiled most of the responses below. Apologies if I missed your = reply.</SPAN></FONT></DIV><DIV style=3D"margin-top: 0px; margin-right: = 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: 13.3px;"><BR = class=3D"khtml-block-placeholder"></SPAN></FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><FONT class=3D"Apple-style-span" face=3D"Arial" = size=3D"4"><SPAN class=3D"Apple-style-span" style=3D"font-size: = 13.3px;">David</SPAN></FONT></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: 13.3px;"><BR = class=3D"khtml-block-placeholder"></SPAN></FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><FONT class=3D"Apple-style-span" face=3D"Arial" = size=3D"4"><SPAN class=3D"Apple-style-span" style=3D"font-size: = 13.3px;">***********************************************</SPAN></FONT><FON= T class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: = 13.3px;"></SPAN></FONT></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: 13.3px;"><BR = class=3D"khtml-block-placeholder"></SPAN></FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><FONT class=3D"Apple-style-span" face=3D"Arial" = size=3D"4"><SPAN class=3D"Apple-style-span" style=3D"font-size: = 13.3px;"><FONT class=3D"Apple-style-span" color=3D"#000000">Hi = David,</FONT></SPAN></FONT></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: 13.3px;"><FONT = class=3D"Apple-style-span" color=3D"#000000">Check out Roy Patterson=92s = web site for info (specifically the publications related to =93size = perception=94).</FONT></SPAN></FONT></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" color=3D"#00137F" face=3D"Arial" = size=3D"4"><SPAN class=3D"Apple-style-span" style=3D"font-size: = 13.3px;">=A0</SPAN></FONT><A = href=3D"http://www.mrc-cbu.cam.ac.uk/~roy/"><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: 13.3px;"><FONT = class=3D"Apple-style-span" = color=3D"#0026FC">http://www.mrc-cbu.cam.ac.uk/~roy/</FONT></SPAN></FONT><= /A></DIV><P style=3D"margin: 0.0px 0.0px 0.0px 0.0px"><FONT = class=3D"Apple-style-span" color=3D"#00137F" face=3D"Arial" = size=3D"4"><SPAN class=3D"Apple-style-span" style=3D"font-size: = 13.3px;">=A0</SPAN></FONT></P><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: 13.3px;"><FONT = class=3D"Apple-style-span" = color=3D"#000000">Cheers,</FONT></SPAN></FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><FONT class=3D"Apple-style-span" face=3D"Arial" = size=3D"4"><SPAN class=3D"Apple-style-span" style=3D"font-size: = 13.3px;"><FONT class=3D"Apple-style-span" color=3D"#000000">--Maria = Chait</FONT></SPAN></FONT></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" color=3D"#00137F" face=3D"Arial" = size=3D"4"><SPAN class=3D"Apple-style-span" style=3D"font-size: = 13.3px;"><BR class=3D"khtml-block-placeholder"></SPAN></FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><FONT class=3D"Apple-style-span" face=3D"Arial" = size=3D"4"><SPAN class=3D"Apple-style-span" style=3D"font-size: = 13.3px;"><FONT class=3D"Apple-style-span" = color=3D"#000000">***********************************************</FONT></= SPAN></FONT></DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; "><FONT class=3D"Apple-style-span" = color=3D"#00137F" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: 13.3px;"><BR = class=3D"khtml-block-placeholder"></SPAN></FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; min-height: 14px; "><BR></DIV><DIV style=3D"margin-top: = 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">Praat = (<A href=3D"http://www.praat.org">www.praat.org</A>) is a free speech = analysis/synthesis program that can do this, apparently quite easily. = I've not used this particular function myself (it's part of the "change = gender" mechanism) but the built-in help manual is quite good.</DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; min-height: 14px; "><BR></DIV><DIV style=3D"margin-top: = 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">Hope = this helps,</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><BR></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">-Alex Francis</DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><BR = class=3D"khtml-block-placeholder"></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: = 13.3px;">***********************************************</SPAN></FONT></DI= V><DIV style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><BR class=3D"khtml-block-placeholder"></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><BR class=3D"khtml-block-placeholder"></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">Dear David,</DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; font: normal = normal normal 12px/normal Helvetica; min-height: 14px; "><BR></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">It's fairly straightforward acoustics. Longer tubes = yield lower resonances (formants). In general, you simply increase or = decrease formant center frequencies by some percentage.</DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; font: normal normal normal 12px/normal Helvetica; = min-height: 14px; "><BR></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">I must note, = however, that such simple transformations are not quite right because = talkers can employ other articulatory maneuvers to adjust effective = formant frequencies, especially effective F1. For example, F1 with = female talkers (generally shorter vocal tracts) is not as high as would = be predicted on vocal tract length alone even though F2, F3, etc. are = higher. I think women do this by adjusting the source spectrum = (laryngeal control) in a way that makes the acoustic spectral peak = appear at a lower frequency than the resonant frequency. But, that's a = longer explanation.=A0</DIV><DIV style=3D"margin-top: 0px; margin-right: = 0px; margin-bottom: 0px; margin-left: 0px; font: normal normal normal = 12px/normal Helvetica; min-height: 14px; "><BR></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">-- Keith Kluender</DIV><DIV style=3D"margin-top: = 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><BR = class=3D"khtml-block-placeholder"></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: = 13.3px;">***********************************************</SPAN></FONT></DI= V><DIV style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><BR class=3D"khtml-block-placeholder"></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><FONT class=3D"Apple-style-span" face=3D"Arial">Hi = David,</FONT></DIV><P style=3D"margin: 0.0px 0.0px 0.0px = 0.0px">=A0</P><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; "><FONT class=3D"Apple-style-span" = face=3D"Arial">A longer vocal tract length will in general lower the = frequency of the formants, and a shorter one raise their frequency.=A0 A = simplified transform to achieve the effect of would be to multiply the = frequency of the formants by the the inverse of the amount you want to = vary the vocal tract by, e.g. multiply the formant frequency=A0by 0.5 if = you want double the vocal tract length, or by 2 if you want to half the = vocal tract length.=A0 Typically the spectral shape of the voie signal = is attributed to the vocal tract, so how you do this depends on the = spectral shape estimation technique you are using.</FONT></DIV><P = style=3D"margin: 0.0px 0.0px 0.0px 0.0px">=A0</P><DIV style=3D"margin-top:= 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial">One way to do this would be to = transpose a tone without formant preservation (e.g time stretching = follwoed by resampling), and use the spectral shape of the transposed = tone to shape a spectrally flat source at the original pitch.=A0 For = example to lengthen the vocal tract by a factor of two, transpose the = sample down one octave, then use the spectral shape of the tranposed = sample=A0to shape a spectrally flat source at the original = pitch.</FONT></DIV><P style=3D"margin: 0.0px 0.0px 0.0px = 0.0px">=A0</P><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; "><FONT class=3D"Apple-style-span" = face=3D"Arial">Alternatively, if you are using a sinusoidal model you = could multiply the frequencies of each partial by the inverse of the = amount you want to vary the vocal tract length by (0.5 to double the = length) and use the shape defined by the resulting spectrum to define = the spectral shape.=A0 To get the amplitude of the transformed partials = interpolate the amplitudes at the original frequency using the new = spectral shape.</FONT></DIV><P style=3D"margin: 0.0px 0.0px 0.0px = 0.0px">=A0</P><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; "><FONT class=3D"Apple-style-span" = face=3D"Arial">This=A0should give you a first approximation of the = effect of varying the vocal tract length, and should work pretty well = for relatively small variations.</FONT></DIV><DIV style=3D"margin-top: = 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial"><BR = class=3D"khtml-block-placeholder"></FONT></DIV><DIV style=3D"margin-top: = 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial">Sean O'Leary</FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><FONT class=3D"Apple-style-span" face=3D"Arial" = size=3D"2"><SPAN class=3D"Apple-style-span" style=3D"font-size: = 10px;"><BR class=3D"khtml-block-placeholder"></SPAN></FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><FONT class=3D"Apple-style-span" face=3D"Arial" = size=3D"4"><SPAN class=3D"Apple-style-span" style=3D"font-size: = 13.3px;">***********************************************</SPAN></FONT><FON= T class=3D"Apple-style-span" face=3D"Arial" size=3D"2"><SPAN = class=3D"Apple-style-span" style=3D"font-size: = 10px;"></SPAN></FONT></DIV><DIV style=3D"margin-top: 0px; margin-right: = 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"2"><SPAN = class=3D"Apple-style-span" style=3D"font-size: 10px;"><BR = class=3D"khtml-block-placeholder"></SPAN></FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><FONT class=3D"Apple-style-span" face=3D"Arial" = size=3D"2"><SPAN class=3D"Apple-style-span" style=3D"font-size: = 10px;"><BR class=3D"khtml-block-placeholder"></SPAN></FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">Hello David,</DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: = 14px; "><BR></DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; ">We use Kawahara-san's vocoder = STRAIGHT. It works very well on clean speech recordings. We describe its = use in several papers where we scaled speech sounds over a wide = range.</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><BR></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">Smith, D. R. R., Patterson, R. D., Turner, R., = Kawahara, H., and Irino, T. (2005).=A0 "The processing and perception of = size information in speech sounds," J. Acoust. Soc. Am. 117,=A0 = 305-318.</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><BR></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">Ives, D. T., Smith, D. R. R. and Patterson, R. D. = (2005). "Discrimination of speaker size from syllable phrases," J. = Acoust. Soc. Am. 118 (6), 3816-3822.</DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: = 14px; "><BR></DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; ">Smith, D. R. R. and Patterson, = R. D. (2005). "The interaction of glottal-pulse rate and vocal-tract = length in judgements of speaker size, sex and age," J. Acoust. Soc. Am. = 118, 3177-3186.</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><BR></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">It also works on some musical instruments by the = way.</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><BR></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">Dinther, R. van, Patterson, R. D. (2006). = =93Perception of acoustic scale and size in musical instrument sounds,=94 = J. Acoust. Soc. Am. 120, 2158-2176.</DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: = 14px; "><BR></DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; ">You can also use PRAAT from = Peter Boersema in Amsterdam.</DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">Another = useful application is PSOLA from Eindhoven.</DIV><DIV style=3D"margin-top:= 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; = min-height: 14px; "><BR></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">Regards Roy = Patterson</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; "><BR = class=3D"khtml-block-placeholder"></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: = 13.3px;">***********************************************</SPAN></FONT></DI= V><DIV style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><BR class=3D"khtml-block-placeholder"></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><BR class=3D"khtml-block-placeholder"></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">It seems that you need "resynthesis".</DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">You can google it and will get you a couple of = hundreds.</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; ">Festival, Synthworks, HLSyn, = Praat are some of the most popular options out there.</DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><BR class=3D"khtml-block-placeholder"></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">Heriberto Avelino</DIV><DIV style=3D"margin-top: = 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><BR = class=3D"khtml-block-placeholder"></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: = 13.3px;">***********************************************</SPAN></FONT></DI= V><DIV style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><BR class=3D"khtml-block-placeholder"></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">Dear David,</DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: = 14px; "><BR></DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; ">I am not sure that this is what = you want</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; ">to achieve, but Praat (available = free at</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; ">www.praat.org) has this = interesting</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; ">"Change gender" button, which = allows you to</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; ">shift all formants up or down by = a given</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; ">ratio independent of = F0.</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; "><BR = class=3D"khtml-block-placeholder"></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">Holger = Mitterer</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; "><BR = class=3D"khtml-block-placeholder"></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: = 13.3px;">***********************************************</SPAN></FONT></DI= V><DIV style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><BR class=3D"khtml-block-placeholder"></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><BR class=3D"khtml-block-placeholder"></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">STRAIGHT is a program that has been used to modify = perceived vocal tract length.</DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: = 14px; "><BR></DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; ">See papers by Patterson RD and = colleagues</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><BR></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">and the following pages:</DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; min-height: 14px; "><BR></DIV><DIV style=3D"margin-top: = 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><A = href=3D"http://www.wakayama-u.ac.jp/~kawahara/index-e.html">http://www.wak= ayama-u.ac.jp/~kawahara/index-e.html</A></DIV><DIV style=3D"margin-top: = 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; = min-height: 14px; "><BR></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><A = href=3D"http://www.wakayama-u.ac.jp/~kawahara/STRAIGHTtrial/">http://www.w= akayama-u.ac.jp/~kawahara/STRAIGHTtrial/</A></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; min-height: 14px; "><BR></DIV><DIV style=3D"margin-top: = 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">It's = available free to academics.</DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><BR = class=3D"khtml-block-placeholder"></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">Christopher = Long</DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; "><BR = class=3D"khtml-block-placeholder"></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" face=3D"Arial" size=3D"4"><SPAN = class=3D"Apple-style-span" style=3D"font-size: = 13.3px;">***********************************************</SPAN></FONT></DI= V></BODY></HTML>= --Apple-Mail-1-451460928--

This message came from the mail archive
http://www.auditory.org/postings/2006/
maintained by:

DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University