Re: [AUDITORY] Request for objective evaluation models based on temporal envelope

Subject: Re: [AUDITORY] Request for objective evaluation models based on temporal envelope

From: James Kates <james.kates@xxxxxxxxxxxx>

Date: Thu, 15 Apr 2021 14:51:04 +0000

In-reply-to: <CAM-3dAk8m4eLYdSE7RmVqJY2yuVc5sJv8a51E3O5-zY7Rz7DDQ@mail.gmail.com>

List-archive: <https://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

References: <CAM-3dAk8m4eLYdSE7RmVqJY2yuVc5sJv8a51E3O5-zY7Rz7DDQ@mail.gmail.com>

Reply-to: James Kates <james.kates@xxxxxxxxxxxx>

Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

Sohhom,

Our lab has been actively involved in developing intelligibility and quality indices for several years. The current version of HASPI (intelligibility) passes the envelope time-frequency modulation through a modulation filterbank. HASQI (speech quality) combines envelope time-frequency modulation analysis with temporal fine structure and long-term spectral changes. The MATLAB code is free to anyone interested: send me a request at James.Kates@xxxxxxxxxxxx if you would like a copy.
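For readers unfamiliar with envelope modulation analysis, here is a minimal Python sketch of the general idea. It is not the HASPI or HASQI code, and it omits the auditory periphery model those indices rely on. It assumes a single full-band envelope obtained by rectification and one-pole smoothing, and it probes a few modulation frequencies with a direct DFT in place of a true modulation filterbank. All function names and parameter values are hypothetical and chosen only for illustration.

```python
import math

def temporal_envelope(signal, fs, cutoff_hz=50.0):
    """Full-wave rectify, then smooth with a one-pole low-pass filter.

    Assumption: a crude envelope detector, not the one used in HASPI/HASQI.
    """
    alpha = math.exp(-2.0 * math.pi * cutoff_hz / fs)
    env, state = [], 0.0
    for x in signal:
        state = alpha * state + (1.0 - alpha) * abs(x)
        env.append(state)
    return env

def modulation_magnitude(env, fs, mod_hz):
    """Amplitude of the mean-removed envelope's DFT component at mod_hz."""
    n_samples = len(env)
    mean = sum(env) / n_samples
    w = 2.0 * math.pi * mod_hz / fs
    re = sum((e - mean) * math.cos(w * n) for n, e in enumerate(env))
    im = sum((e - mean) * math.sin(w * n) for n, e in enumerate(env))
    return 2.0 * math.hypot(re, im) / n_samples

def modulation_spectrum(signal, fs, mod_freqs=(2.0, 4.0, 8.0, 16.0)):
    """Envelope energy probed at a few illustrative modulation rates."""
    env = temporal_envelope(signal, fs)
    return {f: modulation_magnitude(env, fs, f) for f in mod_freqs}

# Demo: one second of a 1 kHz carrier, amplitude-modulated at 4 Hz.
fs = 8000
tone = [
    (1.0 + 0.8 * math.sin(2.0 * math.pi * 4.0 * n / fs))
    * math.sin(2.0 * math.pi * 1000.0 * n / fs)
    for n in range(fs)
]
spectrum = modulation_spectrum(tone, fs)
peak = max(spectrum, key=spectrum.get)
print(peak)  # modulation rate with the largest envelope component
```

In this demo the largest component falls at the 4 Hz modulation rate. An intrusive index would typically compare such envelope features between a clean reference and a processed signal, usually per auditory band rather than on the full-band signal as done here.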

Current versions:

J.M. Kates and K.H. Arehart (2020), “The hearing-aid speech perception index (HASPI) version 2,” to appear in Speech Comm. DOI: 10.1016/j.specom.2020.05.001.

J.M. Kates and K.H. Arehart (2016), “The hearing aid audio quality index (HAAQI),” IEEE Trans. Audio Speech and Lang. Proc., Vol. 24(2), pp. 354-365. DOI: 10.1109/TASLP.2015.2507858.

J.M. Kates and K.H. Arehart (2014), “The hearing aid speech quality index (HASQI) version 2,” J. Audio Eng. Soc., Vol. 62(3), pp. 99-117. DOI: 10.17743/jaes.2014.0006.

Older versions:

J.M. Kates and K.H. Arehart (2014), “The hearing aid speech perception index (HASPI),” Speech Comm., Vol. 65, pp. 75-93. DOI: 10.1016/j.specom.2014.06.002.

J.M. Kates and K.H. Arehart (2010), “The hearing aid speech quality index (HASQI),” J. Audio Eng. Soc., Vol. 58(5), pp. 363-381.

Jim Kates

<<>><<>><<>><<>><<>><<>><<>>

James M. Kates

Scholar in Residence / Prof. Hearing Engineering Research Practice

Dept. Speech, Language, and Hearing Sciences

409 UCB, University of Colorado, Boulder, CO 80309

Office (mobile): 720-226-1266

Home: 303-652-1523

<<>><<>><<>><<>><<>><<>><<>>

From: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx> On Behalf Of Sohhom Bandyopadhyay
Sent: Wednesday, April 14, 2021 5:44 AM
To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Request for objective evaluation models based on temporal envelope

Dear list,

I am looking for objective quality or intelligibility models (general audio or speech) that take into account the temporal envelope of the signal(s). Both intrusive and non-intrusive models are welcome.

Two examples of such models are:

* Falk, T. H., Zheng, C., & Chan, W. Y. (2010). A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech. IEEE Transactions on Audio, Speech, and Language Processing, 18(7), 1766-1774.

(implementation: https://github.com/MuSAELab/SRMRToolbox)

* van de Par, S., Disch, S., Niedermeier, A., Burdiel Pérez, E., & Edler, B. (2019, October). Temporal Envelope-Based Psychoacoustic Modelling for Evaluating Non-Waveform Preserving Audio Codecs. In Audio Engineering Society Convention 147. Audio Engineering Society.

(implementation not available)

I would strongly prefer models whose implementations are publicly available or available on request from the authors. Please let me know if you know of any such work.

Thanks and regards

Sohhom

Sohhom Bandyopadhyay

PhD Scholar | Center for Cognitive Science

Indian Institute of Technology Gandhinagar

http://cogs.iitgn.ac.in/member/sohhom-bandyopadhyay/