Arturo Camacho a écrit :
Dear Arturo,Dear members of the list, I am looking for the reference of first use of the equation m = C log(1+f/700) known as mel frequency scale transformation. In Wikipedia says that the scale was originated by Stevens, Volkman and Newman in 1937 (J. Acoust. Soc. Am 8(3) 185--190), but the paper only has tabulated data and no equation. The paper by S.B. Davis & P. Mermelstein (1980), "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences", IEEE Trans. on ASSP 28, 357-366 is usually cited in the speech recognition community as origin of MFCCs, but the equation is absent there as well. Thanks, Arturo As far as I I remember, this equation is given in Lawrence Rabiner and Biing-Hwang Juang, Fundamentals of speech recognition Prentice Hall 1993 The implementation of the Mel fitering is also discussed in Sirko Molau, Michael Pitz, Ralf Schlüter and Hermann Ney, Computing Mel-Frequency Cepstral Coefficients on the Power Spectrum, Proceedings of the International Conference on Acoustic, Speech and Signal Processing, Salt Lake City, UT, USA, June 2001 It is also used in Malcom Slaney's auditory toolbox. Best regards Guillaume Lemaitre --
-------------------------------------- Guillaume Lemaitre, Ph.D. Chargé de recherches/Researcher Equipe Perception et Design Sonores / Sound Perception and Design Team IRCAM - 1, place Igor Stravinsky F-75004 Paris - FRANCE tel : (+33 1) 44.78.48.38 fax : (+33 1) 44.78.15.40 e-mail : lemaitre@xxxxxxxx -------------------------------------- |