Re: [AUDITORY] Gammatone filter bank in MATLABr2019a

Subject: Re: [AUDITORY] Gammatone filter bank in MATLABr2019a

From: Jihad Ibrahim <jibrahim@xxxxxxxxxxxxx>

Date: Mon, 20 May 2019 16:25:00 +0000

Accept-language: en-US

Arc-authentication-results: i=1; mx.google.com; spf=pass (google.com: domain of owner-auditory@xxxxxxxxxxxxxxx designates 132.206.27.102 as permitted sender) smtp.mailfrom=owner-auditory@xxxxxxxxxxxxxxx

Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-archive:list-owner:list-subscribe:list-unsubscribe:list-help :precedence:to:subject:from:sender:reply-to:date:message-id :mime-version:content-language:accept-language:thread-index :thread-topic:approved-by; bh=xOp1mjfuaabk0wl6ugDDg7soBGrjzuFAP8CWk/l61Xk=; b=J6+jlTtxpACrkRR3b8Ymc3V2/kmSsAX9GLondKgBIS7GyE5XBU69vrCVIQw5pvhQrC TEOZVnaJMxr6Yh5NRL+Yh4rg6mSwG4gTJ7WRsw/tk5mattcCIu7VMq7yOFn0aCUpDDWa HJOnZmxpUqG72gPEerwB+iACMkWg7Z6gQbrhoOaJMCWc03rMRBLlSTHE29n7n+bMA5yP LY3jsr2QbXWX87d7Pu0A6M1Ggu01j4+Z46b9UBzbK9aoce0qqPBhquju8s3mhOAUqDu6 +oDabOtb8z25OySKXRGa2CLAK1sWjMwZxhsDSxQ6K5j5c+VitmPbT5D+eTJXrCO8g1qW KPqA==

Arc-seal: i=1; a=rsa-sha256; t=1558412490; cv=none; d=google.com; s=arc-20160816; b=JYA0+sKEGhOEoyG0mTu+gIiXNeaWzFMegTiFGZUSNgNMgRm+tfy9VVB1Kay/jZ0ZLM m4wiBEcNju3L1KueNr35He3Yla/jWzYcW0BkVixbO8nTUpyLza3uTs7FVnSNz7+MiAg/ xkjU57wJpJPz/xd49wV7fiI9eZdLwHawoD0P34yPObt/KVqW9+OtD7vKAVxXWsrqgA+6 vM34mYdYvZ7ZXIxnGhjqJeSFiilc+V9OL5lfpWMHmHOOsf6u4PUsrADmKZoV0jRPyQAZ DhwLztoRaODJmRVmj7PekCTXPzVlRSy/HXqeGSgGACC/Jh5w7dSkD1U+F7aDRQyvT8wn Eehg==

Authentication-results: mx.google.com; spf=pass (google.com: domain of owner-auditory@xxxxxxxxxxxxxxx designates 132.206.27.102 as permitted sender) smtp.mailfrom=owner-auditory@xxxxxxxxxxxxxxx

Delivered-to: dan.ellis@xxxxxxxxx

List-archive: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

List-help: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>, <mailto:LISTSERV@LISTS.MCGILL.CA?body=INFO%20AUDITORY>

List-owner: <mailto:AUDITORY-request@LISTS.MCGILL.CA>

List-subscribe: <mailto:AUDITORY-subscribe-request@LISTS.MCGILL.CA>

List-unsubscribe: <mailto:AUDITORY-unsubscribe-request@LISTS.MCGILL.CA>

Reply-to: Jihad Ibrahim <jibrahim@xxxxxxxxxxxxx>

Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

Thread-index: AdUPKIgSgvn3cZgiR72IpzCzojkx5Q==

Thread-topic: Re: [AUDITORY] Gammatone filter bank in MATLABr2019a

Hi all,

I am a developer in Audio Toolbox at MathWorks, and just wanted to let everyone know that we are capturing your comments regarding new R2019a releases and really appreciate your feedback.

It will take us some time to digest this feedback and convert it into user-visible changes, but I thought I’d share a few notes in the meantime:

Regarding Bastian Epp’s initial post, he is right to point out that the image might be misleading and interpreted to indicate an equivalence between the cochlea and the gammatone filter bank. We will aim to remove the image of the basilar membrane in the next release to help avoid that incorrect interpretation.
Regarding Richard F. Lyon’s post: The confusion here is due to an ambiguously worded sentence. The gammatone filter bank implemented in Audio Toolbox followed the algorithm described in [1] (Slaney). [1] says the algorithm is an implementation of an idea proposed by [2] (Patterson et al). [2] is in general a good primer for understanding [1], which is why we thought it was good to reference. We think we should reword this more carefully.
The formula stating that the bandwidth is 1.019*erb2hz(fc) does indeed have a typo. We will fix this ASAP starting from the online documentation.
Regarding the limited parametrizations of the function(s): So far, Audio Toolbox has focused on providing simple and fast implementations of feature extractors. The idea is to find a balance between an expert in auditory science and someone looking to build a machine learning or deep learning application. That being said, if exposing more parameters would enable more workflows, then we would definitely consider adding more options on the functions. We plan to investigate alternative options and we may try to reach out to some of those who commented on this for additional feedback.
We agree that the cubic root is a very common implementation of GTCC. We will investigate offering the option of using a cubic root in the nonlinear rectification stage )along with the log option, which is used as well). Rabiner and Schafer are referenced because the computation of the deltas is implemented based on Theory and Applications of Digital Speech Processing.
Regarding Volker Hohmanns’ note on the re-synthesis method being non-optimal: The intention of the example was to showcase a straightforward and simple usage of the object rather than demonstrate how to best achieve reconstruction. We agree that the showcased method is not optimal, and we will reword the example to clarify this. We will also consider adding an optimal reconstruction example based on Dr. Hohmanns’ paper

Regards,

Jihad Ibrahim

Developer, Audio Toolbox, MathWorks