
Re: Frequency to Mel Formula

To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: Frequency to Mel Formula
From: "Richard F. Lyon" <DickLyon@xxxxxxx>
Date: Sun, 9 Aug 2009 22:52:17 -0700
Approved-by: DickLyon@xxxxxxx
Comments: To: Donald D Greenwood <ddg@xxxxxxxxxxxxx>
Delivery-date: Mon Aug 10 02:03:29 2009
In-reply-to: <20090730183411.759DC49BD@xxxxxxxxxxxxxxxxxxxxxxx>
References: <95C3910C-83F5-46F1-A953-5C8F1D141D5D@xxxxxxx> <20090728204713.3E48F95F1@xxxxxxxxxxxxxxxxxxxxxxx> <20090729115312.D431C677C@xxxxxxxxxxxxxxxxxxxxxxx> <20090729232205.7FD8C9275@xxxxxxxxxxxxxxxxxxxxxxx> <20090730031526.4DAD98857@xxxxxxxxxxxxxxxxxxxxxxx> <20090730183411.759DC49BD@xxxxxxxxxxxxxxxxxxxxxxx>
Reply-to: "Richard F. Lyon" <DickLyon@xxxxxxx>
Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

Don,

Thanks again for your great explanations of this complicated stuff.

All that notwithstanding, I'm still poking around at why we have these two different mel scales, with breaks at 700 and 1000. So I got hold of Fant's book, which has Beranek's data table in it, and plotted up some comparisons.


See http://dicklyon.com/tech/Hearing/Mel-like_scales.svg

The "Mel 1000" curve comes pretty close to the Beranek table data up through about 4 kHz, then diverges far from it above that. The "Mel 700" curve misses pretty badly around 2-6 kHz, but fits better on average if you count the highest frequencies.
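For readers who want to reproduce the two curves, here is a minimal sketch. It assumes both are the usual log-type mel formula with a "corner" (break) frequency, scaled so that 1000 Hz maps to 1000 mel. The function name `mel_like` and the parameter `corner_hz` are illustrative, not taken from the plot's source. With a 700 Hz corner the scale factor works out to about 2595 per log10 unit, and with a 1000 Hz corner the curve is 1000*log2(1 + f/1000).

```python
import math

def mel_like(f_hz, corner_hz):
    """Log-type mel curve with the given corner, scaled so 1000 Hz -> 1000 mel."""
    # Scale factor chosen so that mel_like(1000, corner_hz) == 1000 (assumed constraint).
    scale = 1000.0 / math.log(1.0 + 1000.0 / corner_hz)
    return scale * math.log(1.0 + f_hz / corner_hz)

# Print the two curves side by side at a few frequencies.
for f in (500, 1000, 2000, 4000, 8000):
    print(f, round(mel_like(f, 1000.0)), round(mel_like(f, 700.0)))
```

The two curves agree at 1000 Hz by construction, so any difference between them shows up only away from that point.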

The "Umesh" curve, f / (0.741 + 0.00024*f), doesn't fit particularly well, but has a good shape, so I did a "fit" and got f / (0.759 + 0.000252*f).
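The two Umesh-type functions quoted above can be written as a single function with two coefficients. This is only a sketch: the name `umesh_like` and the parameter names `a` and `b` are illustrative. One property that follows directly from the formula is that the output levels off toward 1/b as f grows, which is a different high-frequency behavior from a log-type curve.

```python
def umesh_like(f_hz, a=0.741, b=0.00024):
    """Umesh-type warping f / (a + b*f); defaults are the original coefficients."""
    return f_hz / (a + b * f_hz)

# Compare the original coefficients with the refitted ones quoted in the post.
for f in (500, 1000, 2000, 4000, 8000):
    original = umesh_like(f)
    refit = umesh_like(f, a=0.759, b=0.000252)
    print(f, round(original, 1), round(refit, 1))
```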

I also did a mel-type fit, and found a broad optimum for the corner around 711.5 Hz (under the constraint that 1000 Hz maps to 1000, which I should probably have tried relaxing, but didn't).
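A constrained fit of this kind can be sketched as a one-parameter search over the corner frequency. The post does not say which fitting method was used, so the grid search below is an assumption, as are the names `fit_corner` and `sum_squared_error`. The data here are placeholders generated from the formula itself with a 650 Hz corner, not Beranek's table; substitute the table values to attempt the actual fit.

```python
import math

def mel_like(f_hz, corner_hz):
    """Log-type mel curve with the given corner, scaled so 1000 Hz -> 1000 mel."""
    scale = 1000.0 / math.log(1.0 + 1000.0 / corner_hz)
    return scale * math.log(1.0 + f_hz / corner_hz)

def sum_squared_error(corner_hz, freqs_hz, mels):
    """Sum of squared errors, measured in mel space."""
    return sum((mel_like(f, corner_hz) - m) ** 2 for f, m in zip(freqs_hz, mels))

def fit_corner(freqs_hz, mels, lo=300.0, hi=1500.0, step=0.5):
    """Grid search (an assumed method) for the corner that minimizes the error."""
    candidates = [lo + i * step for i in range(int((hi - lo) / step) + 1)]
    return min(candidates, key=lambda c: sum_squared_error(c, freqs_hz, mels))

# Placeholder data only: synthetic points from a 650 Hz corner, NOT Beranek's table.
freqs = [100, 200, 500, 1000, 2000, 4000, 8000]
mels = [mel_like(f, 650.0) for f in freqs]
best = fit_corner(freqs, mels)
print(best)
```

Because the 1000 Hz constraint is built into `mel_like`, the corner is the only free parameter. Relaxing the constraint would mean searching over the scale factor as well.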

Anyway, here's my theory: Fant fitted to the frequency range he cared about, which probably only went to 4 kHz or so. And then someone else probably did a fit to the same Beranek table over the whole range, and got the 700 number (the plot shows that the 711.5 points are pretty much right on the 700 curve). And that's why we see Beranek referenced so much, maybe?

I also looked at goodness of fit (sum squared error in mel space) including all the frequencies in the Fant/Beranek table. It turns out that the Umesh type fit has only 1/8 as much error as the mel-like fit, due to the Bark-like curvature at the high-frequency end.

So for people who like Beranek's table (assuming Fant has a true copy of it), the Umesh type function should be a win. But I don't think that function extends well to the larger log-like range that we find in the ERB and Greenwood type curves, which are the ones that make more sense in auditory-based applications.


That's my theory and I'm sticking to it.

Dick
