Hi Sohhom, as already mentioned by Raul, there exist some quality models that includes the Envelope Power Spectrum Model: Monaural audio quality predictions: Biberger, T., Fleßner, J.-H., Huber, R., and Ewert, S.D. (2018) An objective audio quality measure based on power and
envelope power cues. Journal of the Audio Engineering Society. DOI:
https://doi.org/10.17743/jaes.2018.0031 https://gitlab.uni-oldenburg.de/kuxo2262/GPSMq Combined predictions for monaural and binaural aspects of audio quality: Fleßner,J-H, Biberger, T., and Ewert, S.D. (2019) Subjective and objective assessment of monaural and binaural aspects
of audio quality. IEEE/ACM Tran. Audio, Speech and Language Processing. DOI:
https://doi.org/10.1109/TASLP.2019.2904850 https://gitlab.uni-oldenburg.de/kuxo2262/combinedaudioqualitymodel Biberger, T., Schepker, H., Denk, F., and Ewert, S.D. (2021) Instrumental quality predictions and analysis of auditory
cues for algorithms in modern headphone technology. Trends in Hearing. DOI:
https://doi.org/10.1177/23312165211001219 https://gitlab.uni-oldenburg.de/kuxo2262/mobiq_add I would like to add PEMO-Q and CASP-Q to the list, as their quality predictions are mainly based on AM cues: Huber, R. and Kollmeier, B. (2006) PEMO-Q – A new method for objective quality assessment
using a model of auditory perception. IEEE Tran. Audio, Speech and Language Processing. DOI:
https://doi.org/10.1109/TASL.2006.883259 Harlander, N., Huber, R., and Ewert, S.D. (2014).
Sound quality assessment using auditory models. Journal of the Audio Engineering Society. DOI:
https://doi.org/10.17743/jaes.2014.0020 As far as I know PEMO-Q is publicly available via Hörtech:
https://www.hoertech.de/de/produkte/pemo-q.html CASP-Q code has not yet been published, so let me know if you are interested.
BR, Thomas -- Dr. Thomas Biberger phone:+49-441-798 3557 Von: AUDITORY - Research in Auditory Perception [mailto:AUDITORY@xxxxxxxxxxxxxxx]
Im Auftrag von Raul Sanchez Lopez Hej Sohhom Jørgensen, S., Ewert, S. D., & Dau, T. (2013). A multi-resolution envelope-power based model for speech intelligibility. Journal of the Acoustical Society of America, 134(1),
436–446. https://doi.org/10.1121/1.4807563 Also Binaural Or correlation-based preditions
From: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>
on behalf of Sohhom Bandyopadhyay <sohhom.bandyopadhyay@xxxxxxxxxxx> Dear list, I am looking for objective quality or intelligibility models (general audio or speech) that take into account the temporal envelope of the signal(s). Both intrusive and non-intrusive models are welcome.
Two examples of such models are: * Falk, T. H., Zheng, C., & Chan, W. Y. (2010). A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech. IEEE Transactions on Audio, Speech, and Language Processing, 18(7), 1766-1774. (implementation:
https://github.com/MuSAELab/SRMRToolbox) * van de Par, S., Disch, S., Niedermeier, A., Burdiel Pérez, E., & Edler, B. (2019, October). Temporal Envelope-Based Psychoacoustic Modelling for Evaluating Non-Waveform Preserving Audio Codecs. In Audio Engineering Society Convention
147. Audio Engineering Society. (implementation not available) Would really prefer models that have publicly available implementations, or it is available upon request from the authors. Please let me know if you know of any such work. Thanks and regards Sohhom -- Sohhom Bandyopadhyay PhD Scholar | Center for Cognitive Science Indian Institute of Technology Gandhinagar |