Subject: Re: Origin of the Mel frequency scale equation? From: Guillaume Lemaitre <Guillaume.Lemaitre@xxxxxxxx> Date: Mon, 10 Mar 2008 09:52:42 +0100 List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>This is a multi-part message in MIME format. --------------010504040507000704050002 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: quoted-printable X-MIME-Autoconverted: from 8bit to quoted-printable by torrent.cc.mcgill.ca id m2A8sEeH018199 Arturo Camacho a =E9crit : > Dear members of the list, > > I am looking for the reference of first use of the equation > > m =3D C log(1+f/700) > > known as mel frequency scale transformation. In Wikipedia says that the > scale was originated by Stevens, Volkman and Newman in 1937 (J. Acoust. > Soc. Am 8(3) 185--190), but the paper only has tabulated data and no > equation. The paper by S.B. Davis & P. Mermelstein (1980), "Comparison = of > parametric representations for monosyllabic word recognition in > continuously spoken sentences", IEEE Trans. on ASSP 28, 357-366 is usua= lly > cited in the speech recognition community as origin of MFCCs, but the > equation is absent there as well. > > Thanks, > > Arturo > > =20 Dear Arturo, As far as I I remember, this equation is given in Lawrence Rabiner and Biing-Hwang Juang, Fundamentals of speech recognition Prentice Hall 1993 The implementation of the Mel fitering is also discussed in Sirko Molau, Michael Pitz, Ralf Schl=FCter and Hermann Ney, Computing Mel-Frequency Cepstral Coefficients on the Power Spectrum, Proceedings of the International Conference on Acoustic, Speech and=20 Signal Processing, Salt Lake City, UT, USA, June 2001 It is also used in Malcom Slaney's auditory toolbox. Best regards Guillaume Lemaitre --=20 -------------------------------------- Guillaume Lemaitre, Ph.D. /Charg=E9 de recherches/Researcher/ Equipe Perception et Design Sonores / Sound Perception and Design Team IRCAM - 1, place Igor Stravinsky F-75004 Paris - FRANCE tel : (+33 1) 44.78.48.38 fax : (+33 1) 44.78.15.40 e-mail : lemaitre@xxxxxxxx -------------------------------------- --------------010504040507000704050002 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> <html> <head> <meta content="text/html;charset=ISO-8859-1" http-equiv="Content-Type"> </head> <body bgcolor="#ffffff" text="#000000"> Arturo Camacho a écrit : <blockquote cite="mid:4776.10.228.8.30.1205129390.squirrel@xxxxxxxx" type="cite"> <pre wrap="">Dear members of the list, I am looking for the reference of first use of the equation m = C log(1+f/700) known as mel frequency scale transformation. In Wikipedia says that the scale was originated by Stevens, Volkman and Newman in 1937 (J. Acoust. Soc. Am 8(3) 185--190), but the paper only has tabulated data and no equation. The paper by S.B. Davis & P. Mermelstein (1980), "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences", IEEE Trans. on ASSP 28, 357-366 is usually cited in the speech recognition community as origin of MFCCs, but the equation is absent there as well. Thanks, Arturo </pre> </blockquote> <font face="Century Gothic">Dear Arturo,<br> As far as I I remember, this equation is given in <br> Lawrence Rabiner and Biing-Hwang Juang,<br> Fundamentals of speech recognition<br> Prentice Hall<br> 1993<br> <br> The implementation of the Mel fitering is also discussed in <br> Sirko Molau, Michael Pitz, Ralf Schlüter and Hermann Ney,<br> Computing Mel-Frequency Cepstral Coefficients on the Power Spectrum,<br> Proceedings of the International Conference on Acoustic, Speech and Signal Processing, </font><font face="Century Gothic">Salt Lake City, UT, USA, June 2001</font><br> <font face="Century Gothic"><br> It is also used in Malcom Slaney's auditory toolbox.<br> Best regards<br> Guillaume Lemaitre<br> </font><br> <div class="moz-signature">-- <br> <meta http-equiv="Content-Type" content="text/html; "> <meta http-equiv="Content-Style-Type" content="text/css"> <title></title> <meta name="Generator" content="Cocoa HTML Writer"> <meta name="CocoaVersion" content="824.42"> <style type="text/css"> p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica} </style> <p class="p1">--------------------------------------</p> <p class="p1">Guillaume Lemaitre, Ph.D.</p> <p class="p1"><i>Chargé de recherches/Researcher</i></p> <p class="p1">Equipe Perception et Design Sonores /</p> <p class="p1">Sound Perception and Design Team</p> <p class="p1">IRCAM - 1, place Igor Stravinsky F-75004 Paris - FRANCE</p> <p class="p1">tel<span class="Apple-converted-space"> </span>: (+33 1) 44.78.48.38</p> <p class="p1">fax : (+33 1) 44.78.15.40</p> <p class="p1">e-mail<span class="Apple-converted-space"> </span>: <a class="moz-txt-link-abbreviated" href="mailto:lemaitre@xxxxxxxx">lemaitre@xxxxxxxx</a></p> <p class="p1">--------------------------------------<span class="Apple-converted-space"> <br> </span></p> </div> </body> </html> --------------010504040507000704050002--