Re: [AUDITORY] Gammatone filter bank in MATLABr2019a ("Bernstein,Leslie" )


Subject: Re: [AUDITORY] Gammatone filter bank in MATLABr2019a
From:    "Bernstein,Leslie"  <lbernstein@xxxxxxxx>
Date:    Thu, 18 Apr 2019 13:30:32 +0000
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

Thanks so much, Volker.

Les

On 4/18/2019 1:49 AM, Volker Hohmann wrote:
> Dear Les and all,
>
> this is my (not complete) list:
>
> https://engineering.purdue.edu/~malcolm/interval/1998-010/
> https://code.soundsoftware.ac.uk/projects/aimmat
> http://legacy.spa.aalto.fi/software/HUTear/
> http://amtoolbox.sourceforge.net/
> https://uol.de/en/mediphysics/downloads/ (now also at
> https://doi.org/10.5281/zenodo.2643400)
>
> Best, Volker
>
> On 17.04.2019 15:10, Bernstein,Leslie wrote:
>> Thanks, Volker. A link to those implementations would be very helpful.
>>
>> Les
>>
>> On 4/17/2019 4:58 AM, Volker Hohmann wrote:
>>> Dear Dick and all,
>>>
>>> just want to add that the re-synthesis method they apply is not optimal. I would recommend using the Matlab implementations contributed by our community, which have been described properly in citable publications, are readily available and have been running flawlessly for many years under whatever Matlab version came out.
>>>
>>> Best regards,
>>>
>>> Volker
>>>
>>> On 17.04.2019 02:51, Richard F. Lyon wrote:
>>>> Bastian,
>>>>
>>>> That's an interesting distinction that needs to be made, between the peripheral and "whole system" auditory filter, whether gammatone or otherwise. In my book, I say this about that (in Part III – The Auditory Periphery):
>>>>
>>>>     13.1 What Is an Auditory Filter?
>>>>     The auditory filters that we consider here include both those motivated by psychoacoustic experiments, such as detection of tones in noise maskers, and those motivated by reproducing the observed mechanical response of the basilar membrane or neural response of the auditory nerve. One thesis of this work is that a single model can do a good job for both of these, and thereby provide a good basis for a machine hearing system. Since there are several stages of neural processing between the cochlea and our psychoacoustic perceptions, it would not be surprising if the best parameters were different between these types of models, but it seems likely that the linear and nonlinear filtering due to the cochlea plays a sufficient role in perception that we may find one set of parameters is adequate, at least for a range of machine hearing applications.
>>>>
>>>> And to be fair, the gammatone was originally proposed as a model of frog hearing physiology, and is widely used in cochlear models, even though Patterson popularized it in the psychoacoustic domain.
>>>>
>>>> So the MathWorks ought to be more careful what they say. I'd have several other quibbles with their docs (in the Audio Toolbox reference at https://www.mathworks.com/help/pdf_doc/audio/index.html).
>>>>
>>>> Quibbles:
>>>>
>>>> 1. "The gammatoneFilterBank follows the algorithm described in [1] and first proposed by [2]." [1] is Slaney's method, a simple filter cascade based on analyzing the Laplace transform of the gammatone. [2] is Patterson et al.'s "Complex Sounds and Auditory Images", a great paper but it doesn't say one word about how to implement the gammatone (they did have other implementation papers elsewhere, but not this method and not here).
>>>>
>>>> 2. Ref 2 says "the shape of the magnitude characteristic of the gammatone filter is very similar to that of the roex(p) filter commonly used to represent the magnitude characteristic of the human auditory filter." Mathworks says "The gammatone filter is similar to the roex filter derived from the notched-noise experiment." A cursory look at more recent literature on auditory filters, including Patterson's, would suggest omitting or at least tempering this claim. See my book Chapter 13 or this paper:
>>>> https://storage.googleapis.com/pub-tools-public-publication-data/pdf/36895.pdf
>>>>
>>>> 3. Error where it says b -- bandwidth, set to 1.019*erb2hz(fc). Either the documentation is wrong, or the functionality is wrong. Hopefully the former.
>>>>
>>>> 4. The parameterization by only FrequencyRange, NumFilters, and SampleRate is rather impoverished. It is not documented whether the filters match the ERB bandwidth if some of these parameters are changed, or whether adjacent filters continue to cross over about 3 dB down; you can't have both, but you might want one or the other, and there's not enough control to say what you want. With a few more parameters one could do useful comparisons, tradeoffs, and tunings of filter numbers, orders, bandwidths, and phases for example. With just a few more one could include better auditory filter variants (that differ only in the locations of the zeros of the cascaded second-order filters), including APGF and OZGF.
>>>>
>>>> R2019a also adds gtcc (gammatone cepstral coefficients). Their algorithm uses log(energy) before the DCT, instead of the cube root proposed by the Shao et al. reference, which also uses a slightly different acronym: GFCC (gammatone frequency cepstral coefficients). Not clear why. The referenced paper did not really investigate whether their improvement over mfcc was due to the different frequency scale (700 Hz mel vs 229 Hz ERB break point between linear and exponential spacing), or the filter shape (triangle vs gammatone), or the nonlinearity (log vs cube root), or the domain of implementation (frequency vs time). With the impoverished parameterizations of these functions in the audio toolboxes, it's hard to further compare such things (though the gtcc does allow some of that). The other gtcc ref (Rabiner and Schafer) has nothing on gammatone or gtcc or gfcc.
>>>>
>>>> I could go on...
>>>>
>>>> Dick
>>>>
>>>> On Tue, Apr 16, 2019 at 12:24 AM Bastian Epp <000000a94eb56441-dmarc-request@xxxxxxxx> wrote:
>>>>> Dear list,
>>>>>
>>>>> This morning I read through the release notes of MATLAB R2019a and was happy to find that there was an implementation of a Gammatone filter bank included:
>>>>>
>>>>> "Gammatone Filter Bank: Mimic the human auditory system"
>>>>>
>>>>> With the reference to (among others):
>>>>>
>>>>> Glasberg, Brian R., and Brian CJ Moore. "Derivation of Auditory Filter Shapes from Notched-Noise Data." Hearing Research. Vol. 47. Issue 1-2, 1990, pp. 103–138.
>>>>>
>>>>> This made me quite happy because it is a proper description of what Gammatone filter banks most often are used for - to model the frequency selectivity of the auditory system (as measured using psychoacoustics).
>>>>>
>>>>> However, in the DOC page, they show a picture of the Basilar membrane on top with the frequency response of the filter bank - suggesting that there exists a 1:1 correspondence.
>>>>>
>>>>> Everybody needs a topic to grow old and grumpy on - mine is this:
>>>>>
>>>>> From my point of view, this is only correct under the (overly strong?) assumption that the cochlea is the only place in the auditory system underlying the perceptually observed frequency selectivity. Measuring "auditory filters" means to evaluate the auditory system as a whole (the concept of a "neuron" also only makes sense when being embedded in its network). "Cochlear filters" are measured on/in the cochlea.
>>>>>
>>>>> Besides the common critiques (linearity, coarse approximation of the actual "filter" shape, etc), the main problem in my point of view is that we teach students that we can "measure" the function of a "subsystem" (the cochlea) using a method that assesses the function of the "whole" system. There are some data sets that suggest a strong link, but the "tool" of psychoacoustics simply does not allow such a statement.
>>>>>
>>>>> Even though I like the working hypothesis "The brain exists to keep the cochlea warm", I think equating cochlear frequency selectivity with auditory filters (without explicitly stating the assumption that no(!) element along the auditory pathway modifies this frequency selectivity) is a point where we could be more careful to avoid misconceptions and overly strong conclusions. In most publications and books, this point is not explicitly wrong, but not as precise as it could be in my opinion.
>>>>>
>>>>> I hope that someone from MATHWORKS follows this list and considers a more careful description in the DOCs. I would also be happy to compile all the constructive arguments that people might have for/against my point of view.
>>>>>
>>>>> Have a great day everybody!
>>>>>
>>>>> Bastian
>>>>>
>>>>> --
>>>>> Bastian Epp
>>>>> Associate Professor
>>>>>
>>>>> DTU Healthtech
>>>>> ------------------------------------
>>>>> Technical University of Denmark
>>>>> Ørsteds Plads
>>>>> Building 352, Room 118
>>>>> 2800 Kgs. Lyngby
>>>>> Direct +45 45253953
>>>>> bepp@xxxxxxxx
>>>>> http://www.dtu.dk/english
>>
>> --
>> Leslie R. Bernstein, Ph.D. | Professor
>> Depts. of Neuroscience and Surgery (Otolaryngology) | UConn School of Medicine
>> 263 Farmington Avenue, Farmington, CT 06030-3401
>> Office: 860.679.4622 | Fax: 860.679.2495

--
Leslie R. Bernstein, Ph.D. | Professor
Depts. of Neuroscience and Surgery (Otolaryngology) | UConn School of Medicine
263 Farmington Avenue, Farmington, CT 06030-3401
Office: 860.679.4622 | Fax: 860.679.2495
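
Quibbles 3 and 4 in Dick's message turn on the difference between an ERB bandwidth in Hz at a centre frequency and a position on the ERB-rate scale. A minimal Python sketch of that distinction follows, assuming the Glasberg and Moore (1990) formulas and the 1.019 factor quoted in the thread. All function names here are hypothetical illustrations, not the Audio Toolbox API, and the sketch makes no claim about what gammatoneFilterBank actually computes.

```python
import math


def erb_bandwidth_hz(fc_hz):
    """ERB (in Hz) of the auditory filter centred at fc_hz (Glasberg & Moore, 1990)."""
    return 24.7 * (4.37 * fc_hz / 1000.0 + 1.0)


def hz_to_erb_number(f_hz):
    """Position of f_hz on the ERB-rate scale (a dimensionless ERB number)."""
    return 21.4 * math.log10(1.0 + 0.00437 * f_hz)


def erb_number_to_hz(erb_number):
    """Inverse of hz_to_erb_number: frequency in Hz at a given ERB number."""
    return (10.0 ** (erb_number / 21.4) - 1.0) / 0.00437


def gammatone_b_hz(fc_hz, factor=1.019):
    """Gammatone bandwidth parameter b, taken here as factor * ERB(fc) in Hz.

    The default factor is the 1.019 quoted in the thread (assumed to refer to
    a 4th-order gammatone).
    """
    return factor * erb_bandwidth_hz(fc_hz)


def erb_spaced_centers(f_lo_hz, f_hi_hz, num_filters):
    """Centre frequencies equally spaced on the ERB-rate scale."""
    if num_filters < 2:
        raise ValueError("num_filters must be at least 2")
    lo = hz_to_erb_number(f_lo_hz)
    hi = hz_to_erb_number(f_hi_hz)
    step = (hi - lo) / (num_filters - 1)
    return [erb_number_to_hz(lo + i * step) for i in range(num_filters)]


def spacing_in_erbs(f_lo_hz, f_hi_hz, num_filters):
    """Distance between adjacent centre frequencies, in ERB numbers."""
    lo = hz_to_erb_number(f_lo_hz)
    hi = hz_to_erb_number(f_hi_hz)
    return (hi - lo) / (num_filters - 1)
```

In this sketch the bandwidth depends only on the centre frequency, while spacing_in_erbs changes whenever the frequency range or the number of filters changes. That is one way to see the point in quibble 4: with bandwidths tied to the ERB, the level at which adjacent filters cross over moves as the filter count changes, so only one of the two properties can be held fixed.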

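
Dick's remarks on gtcc list several factors that the referenced paper did not separate, among them the frequency scale (700 Hz mel vs 229 Hz ERB break point) and the nonlinearity (log vs cube root). The following is a rough Python sketch of just those two factors, assuming the common 2595*log10(1 + f/700) mel formula and the Glasberg and Moore (1990) ERB-rate formula. The function names are hypothetical and this is not the gtcc or mfcc implementation of any toolbox.

```python
import math

MEL_BREAK_HZ = 700.0
ERB_BREAK_HZ = 1000.0 / 4.37  # about 229 Hz, the break point cited in the thread


def hz_to_mel(f_hz):
    """Mel scale: roughly linear below the 700 Hz break point, logarithmic above."""
    return 2595.0 * math.log10(1.0 + f_hz / MEL_BREAK_HZ)


def hz_to_erb_number(f_hz):
    """ERB-rate scale: same functional form, with a break point near 229 Hz."""
    return 21.4 * math.log10(1.0 + f_hz / ERB_BREAK_HZ)


def fraction_below(scale, f_hz, f_max_hz):
    """Share of the warped axis from 0 to f_max_hz that lies below f_hz."""
    return scale(f_hz) / scale(f_max_hz)


def log_compress(energy, floor=1e-10):
    """Log nonlinearity; the floor (an assumption) avoids log(0)."""
    return math.log(max(energy, floor))


def cube_root_compress(energy):
    """Cube-root nonlinearity, the alternative mentioned in the thread."""
    return energy ** (1.0 / 3.0)
```

Both scales have the form log(1 + f/f0), so the break point f0 is the only difference in shape. With the lower break point, the ERB-rate scale devotes a larger share of a 0 to 8 kHz axis to frequencies below 1 kHz than the mel scale does, which fraction_below makes easy to check. Swapping log_compress for cube_root_compress while holding the scale fixed (or the reverse) is the kind of one-factor comparison the message says was not done.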

This message came from the mail archive
src/postings/2019/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University