
Re: [AUDITORY] Why is it that joint speech-enhancement with ASR is not a popular research topic?

To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: [AUDITORY] Why is it that joint speech-enhancement with ASR is not a popular research topic?
From: Laszlo Toth <tothl@xxxxxxxxxxxxxxx>
Date: Mon, 25 Jun 2018 09:15:35 +0200
Approved-by: tothl@xxxxxxxxxxxxxxx
Comments: To: Samer Hijazi <hijazi@xxxxxxxxxxxxxx>
Reply-to: Laszlo Toth <tothl@xxxxxxxxxxxxxxx>
Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

On Sun, 24 Jun 2018, Samer Hijazi wrote:

>  It is easy to see that ASR would benefit from speech enhancement, and
> speech enhancement would benefit from ASR. But there is very limited
> research and publications in this direction vs the 100's of publications on
> stand alone ASR, why is that?

The currently dominant direction in ASR is "end-to-end learning":
dropping any hand-crafted feature extraction step from the processing
chain and letting the deep learning algorithm solve the whole problem
"as is". While many people doubt that this is the right direction
(at least with the current limited-capability learning algorithms), there
is strong pressure to prefer these end-to-end models over a two-step
model (I mean enhancement+recognition).

               Laszlo Toth
        Hungarian Academy of Sciences         *
  Research Group on Artificial Intelligence   *   "Failure only begins
     e-mail: tothl@xxxxxxxxxxxxxxx            *    when you stop trying"
     http://www.inf.u-szeged.hu/~tothl        *
