Re: MDS distances (David Wessel )


Subject: Re: MDS distances
From:    David Wessel  <wessel@xxxxxxxx>
Date:    Fri, 23 Jun 2006 12:23:35 -0700

--Apple-Mail-1-984724139 Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed On Jun 22, 2006, at 12:36 PM, Malcolm Slaney wrote: On Jun 21, 2006, at 2:54 AM, Olivier Tache wrote: I have read a number of "classical" papers about MDS and auditory dissimilarity (by Gordon&Grey, Grey&Moorer, Wessel) (and was wondering if such experiments were still carried out). I think the Gray/Wessel approach has failed.. Failed at what? Malcolm, I think you have missed the point. My original goal in the early 70's was to develop representations of musical material that would help me reason about the composition of timbre sequences known as klangfarbenmelodies. In one of my early papers ( http://xenakis.ircam.fr/articles/textes/Wessel78a/ ), I demonstrate that one can obtain interpretable geometric representations of musically useful timbres and that such "timbre spaces" can be used to make predictions about the behavior of timbre sequences. Hardly a failure! Grey and Gordon demonstrated how such MDS spaces can be used to make interesting timbral interpolations or hybrid instrument sounds. Recently, I've taken a new look at the timbre space representations obtained by myself, Grey, McAdams, Wedin and Goude and am struck by how well Les Atlas's Modulation Spectrum describes what is a common feature of many of the 2-D spaces wherein one of the dimensions is related to the spectral envelope and the other the temporal envelope. I have no objection to your and Terasawa's approach of testing a pre- ordained model such as one based on MFCC's. However, such tests should be carried out in a direct manner as suggested by Krantz and Tversky in their work on the foundations of the geometric representation of perceptual data (see Suppes, Krantz, Luce, & Tversky's Foundations of Measurement Vol 2). I doubt that MFCC's will pass the straightforward qualitative test of "interdimensional additivity" essential to a geometric representation. David Wessel "The spectrum is not an interesting steady date." she said as I was enveloped. it's too hard to figure out what the results mean. (Just trying to be blunt to get your attention. ;-) You start with convenient sounds, measure perception and then try to figure out what the MDS dimensions mean. That hasn't worked. I think that is why people have not been pushing on it very hard lately. Hiroko Terasawa and I have been taking an opposite approach. We're *starting* with the dimensions, synthesizing sounds and then measuring the stress between human perception and the pre-ordained model. Several papers describing our initial results are online at http://ccrma.stanford.edu/~hiroko/timbre/ Sounds like Jim is doing something in between the two extremes. - Malcolm --Apple-Mail-1-984724139 Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=ISO-8859-1 <HTML><BODY style=3D"word-wrap: break-word; -khtml-nbsp-mode: space; = -khtml-line-break: after-white-space; "><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">On Jun 22, = 2006, at 12:36 PM, Malcolm Slaney wrote:</DIV><DIV style=3D"margin-top: = 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" color=3D"#002CD3">On Jun 21, 2006, at 2:54 = AM, Olivier Tache wrote:</FONT></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; font: normal = normal normal 12px/normal Helvetica; color: rgb(0, 44, 211); min-height: = 14px; "><FONT class=3D"Apple-style-span" = color=3D"#002CD3"><BR></FONT></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" color=3D"#325F20">I have read a number of = "classical" papers about MDS and auditory dissimilarity (by = Gordon&amp;Grey, Grey&amp;Moorer, Wessel) (and was wondering if such = experiments were still carried out).</FONT></DIV><DIV style=3D"margin-top:= 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; font: = normal normal normal 12px/normal Helvetica; color: rgb(0, 44, 211); = min-height: 14px; "><FONT class=3D"Apple-style-span" = color=3D"#002CD3"><BR></FONT></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; font: normal = normal normal 12px/normal Helvetica; color: rgb(0, 44, 211); min-height: = 14px; "><FONT class=3D"Apple-style-span" = color=3D"#002CD3"><BR></FONT></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" color=3D"#002CD3">I think the Gray/Wessel = approach has failed..</FONT></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; font: normal = normal normal 12px/normal Helvetica; min-height: 14px; "><BR></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">Failed at what?=A0 =A0Malcolm, I think you have = missed the point.=A0 My original goal in the early 70's was to develop = representations of musical material that would help me reason about the = composition of timbre sequences known as <I>klangfarbenmelodies.=A0 = </I>=A0 In one of my early papers (=A0<A = href=3D"http://xenakis.ircam.fr/articles/textes/Wessel78a/"><FONT = class=3D"Apple-style-span" = color=3D"#002FE3">http://xenakis.ircam.fr/articles/textes/Wessel78a/</FONT= ></A> ), I demonstrate that one can obtain interpretable geometric = representations of musically useful timbres and that such "timbre = spaces" can be used to make predictions about the behavior of timbre = sequences. =A0 Hardly a failure!=A0 Grey and Gordon demonstrated how = such MDS spaces can be used to make interesting timbral interpolations = or hybrid instrument sounds. =A0 =A0Recently, I've taken a new look at = the timbre space representations obtained by myself, Grey, McAdams, = Wedin and Goude and am struck by how well Les Atlas's Modulation = Spectrum describes what is a common feature of many of the 2-D spaces = wherein one of the dimensions is related to the spectral envelope and = the other the temporal envelope.=A0 =A0</DIV><DIV style=3D"margin-top: = 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; font: = normal normal normal 12px/normal Helvetica; min-height: 14px; = "><BR></DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; ">I have no objection to your and = Terasawa's=A0 approach of testing a pre-ordained model such as one based = on MFCC's.=A0 However, such tests should be carried out in a direct = manner as suggested by Krantz and Tversky in their work on the = foundations of the geometric representation of perceptual data (see=A0 = Suppes, Krantz, Luce, &amp; Tversky's=A0 <I>Foundations of Measurement = Vol 2).=A0 </I>=A0I doubt that MFCC's will pass the straightforward = qualitative test of=A0 "interdimensional additivity" essential to a = geometric representation.=A0=A0=A0=A0 =A0</DIV><DIV style=3D"margin-top: = 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; font: = normal normal normal 12px/normal Helvetica; min-height: 14px; = "><BR></DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; font: normal normal normal = 12px/normal Helvetica; min-height: 14px; "><BR></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">David Wessel</DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; font: normal = normal normal 12px/normal Helvetica; min-height: 14px; "><BR></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; ">"The spectrum is not an interesting steady date." = she said as I was enveloped.=A0</DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; font: normal = normal normal 12px/normal Helvetica; min-height: 14px; "><BR></DIV><P = style=3D"margin: 0.0px 0.0px 0.0px 0.0px">=A0</P><DIV style=3D"margin-top:= 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><FONT = class=3D"Apple-style-span" color=3D"#002CD3">it's too hard to figure out = what the results mean.=A0 (Just trying to be blunt to get your = attention. ;-)=A0 You start with convenient sounds, measure perception = and then try to figure out what the MDS dimensions mean.=A0 That hasn't = worked.=A0 I think that is why people have not been pushing on it very = hard lately.</FONT></DIV><DIV style=3D"margin-top: 0px; margin-right: = 0px; margin-bottom: 0px; margin-left: 0px; font: normal normal normal = 12px/normal Helvetica; color: rgb(0, 44, 211); min-height: 14px; "><FONT = class=3D"Apple-style-span" color=3D"#002CD3"><BR></FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><FONT class=3D"Apple-style-span" = color=3D"#002CD3">Hiroko Terasawa and I have been taking an opposite = approach.=A0 We're *starting* with the dimensions, synthesizing sounds = and then measuring the stress between human perception and the = pre-ordained model.=A0 Several papers describing our initial results are = online at</FONT></DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; "><SPAN class=3D"Apple-tab-span" = style=3D"white-space:pre"> </SPAN><A = href=3D"http://ccrma.stanford.edu/~hiroko/timbre/"><FONT = class=3D"Apple-style-span" = color=3D"#002FE3">http://ccrma.stanford.edu/~hiroko/timbre/</FONT></A></DI= V><DIV style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><FONT class=3D"Apple-style-span" = color=3D"#002CD3">Sounds like Jim is doing something in between the two = extremes.</FONT></DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; font: normal normal normal = 12px/normal Helvetica; color: rgb(0, 44, 211); min-height: 14px; "><FONT = class=3D"Apple-style-span" color=3D"#002CD3"><BR></FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; "><FONT class=3D"Apple-style-span" color=3D"#002CD3">- = Malcolm</FONT></DIV><DIV style=3D"margin-top: 0px; margin-right: 0px; = margin-bottom: 0px; margin-left: 0px; font: normal normal normal = 12px/normal Helvetica; color: rgb(0, 44, 211); min-height: 14px; "><FONT = class=3D"Apple-style-span" color=3D"#002CD3"><BR></FONT></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; font: normal normal normal 12px/normal Helvetica; = min-height: 14px; "><BR></DIV><DIV style=3D"margin-top: 0px; = margin-right: 0px; margin-bottom: 0px; margin-left: 0px; font: normal = normal normal 12px/normal Helvetica; min-height: 14px; "><BR></DIV><DIV = style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; = margin-left: 0px; font: normal normal normal 12px/normal Helvetica; = min-height: 14px; "><BR = class=3D"khtml-block-placeholder"></DIV></BODY></HTML>= --Apple-Mail-1-984724139--


This message came from the mail archive
http://www.auditory.org/postings/2006/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University