[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Origin of the Mel frequency scale equation?



Arturo Camacho a écrit :
Dear members of the list,

I am looking for the reference of first use of the equation

m = C log(1+f/700)

known as mel frequency scale transformation. In Wikipedia says that the
scale was originated by Stevens, Volkman and Newman in 1937 (J. Acoust.
Soc. Am 8(3) 185--190), but the paper only has tabulated data and no
equation. The paper by S.B. Davis & P. Mermelstein (1980), "Comparison of
parametric representations for monosyllabic word recognition in
continuously spoken sentences", IEEE Trans. on ASSP 28, 357-366 is usually
cited in the speech recognition community as origin of MFCCs, but the
equation is absent there as well.

Thanks,

Arturo

  
Dear Arturo,
As far as I I remember,  this equation is given in
Lawrence Rabiner and Biing-Hwang Juang,
Fundamentals of speech recognition
Prentice Hall
1993

The implementation of the Mel fitering is also discussed in
Sirko Molau, Michael Pitz, Ralf Schlüter and Hermann Ney,
Computing Mel-Frequency Cepstral Coefficients on the Power Spectrum,
Proceedings of the International Conference on Acoustic, Speech  and Signal Processing,
Salt Lake City, UT, USA, June 2001

It is also used in Malcom Slaney's auditory toolbox.
Best regards
Guillaume Lemaitre

--

--------------------------------------

Guillaume Lemaitre, Ph.D.

Chargé de recherches/Researcher

Equipe Perception et Design Sonores /

Sound Perception and Design Team

IRCAM - 1, place Igor Stravinsky F-75004 Paris - FRANCE

tel  : (+33 1) 44.78.48.38

fax : (+33 1) 44.78.15.40

e-mail  : lemaitre@xxxxxxxx

--------------------------------------