Re: MFCC method (Arturo Camacho )


Subject: Re: MFCC method
From:    Arturo Camacho  <acamacho@xxxxxxxx>
Date:    Thu, 8 Jan 2009 23:28:07 -0800
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

Dear Dick, The Wikipedia page that you mention says that the Mel scale "approximates the human auditory system's response more closely than the linearly-spaced frequency bands used in the normal cepstrum." If that means that the Mel scale approximates better the tonotopic response of the cochlea than the linear scale, I wonder if it would not be an even better idea to use the Greenwood function (see entry in Wikipedia), which was explicitly created with that purpose. (Recall that the Mel scale was designed to represent equidistant steps in pitch, but that does not necessarily corresponds with equidistant tonotopic steps.) Regards, Arturo On Thu, Jan 8, 2009 at 8:46 PM, Richard F. Lyon <DickLyon@xxxxxxxx> wrote: > Thanks Malcolm; now that you've told us, it's in wikipedia: > http://en.wikipedia.org/wiki/Mel-frequency_cepstrum#History > Including the connection to earlier work by Pols; I can share > a copy of Plomp, Pols, and van de Geer (1967) on request. > > Dick > > At 2:07 PM -0800 1/7/09, Malcolm Slaney wrote: >> >> On Jan 7, 2009, at 12:40 PM, James W. Beauchamp wrote: >>> >>> I'm looking for a (the?) seminal article on the MFCC method of >>> coding spectral envelopes. It could be a journal paper or a chapter >>> in a book. Also, who was the first to publish on this idea? >> >> These are the usual references, especially the 1980 paper. >> >> P. Mermelstein, Distance measures for speech recognition, psychological >> and instrumental, in Pattern Recognition and Artificial Intelligence, C. H. >> Chen, Ed., pp. 374­388. Academic, New York, 1976. >> >> S.B. Davis, and P. Mermelstein, Comparison of Parametric Representations >> for Monosyllabic Word Recognition in Continuously Spoken Sentences, in IEEE >> Transactions on Acoustics, Speech, and Signal Processing, vol. 28(4), 1980, >> pp. 357­366. >> >> >> But Mermelstein usually credits John Bridle's work for the idea >> JSRU Report No. 1003 >> AN EXPERIMENTAL AUTOMATIC WORD·RECOGNITION SYSTEM: >> INTERIM REPORT >> J . S. Bridle and M. D. Brown >> >> >> I have copies of the early two if you need them. >> >> - Malcolm > -- __________________________________________________ Arturo Camacho, PhD Alumni Computer and Information Science and Engineering University of Florida E-mail: acamacho@xxxxxxxx Web page: www.cise.ufl.edu/~acamacho __________________________________________________


This message came from the mail archive
http://www.auditory.org/postings/2009/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University