Re: [AUDITORY] Why is it that joint speech-enhancement with ASR is not a popular research topic? (Laszlo Toth )

Subject: Re: [AUDITORY] Why is it that joint speech-enhancement with ASR is not a popular research topic? From: Laszlo Toth <tothl@xxxxxxxx> Date: Mon, 25 Jun 2018 09:15:35 +0200 List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY> On Sun, 24 Jun 2018, Samer Hijazi wrote: > It is easy to see that ASR would benefit from speech enhancement, and > speech enhancement would benefit from ASR. But there is very limited > research and publications in this direction vs the 100's of publications on > stand alone ASR, why is that? The currently dominant directon in ASR is "end-to-end learning". That is, to drop any hand-crafted feature extraction step from the processing chain, and let the deep learning algorithm solve the whole problem "as is". While many people doubt that this is the good direction (at least, with the current limited-capability learning algorithms), there is a strong pressure to prefer these end-to-end models over a two-step model (I mean enhancement+recognition). Laszlo Toth Hungarian Academy of Sciences * Research Group on Artificial Intelligence * "Failure only begins e-mail: tothl@xxxxxxxx * when you stop trying" http://www.inf.u-szeged.hu/~tothl *

This message came from the mail archive
src/postings/2018/
maintained by:

DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University