On 25/06/2018 17:00, Samer Hijazi
wrote:
Are, you mean multitask learning. That didn't come over at all in your first mail. An early paper on this, probably the first application to ASR, was Parveen & Green, Multitask Learning in Connectionist Robust ASR using Recurrent Neural Networks, Eurospeech 2003.
It would be wrong to start with clean speech, add noise, use that as input and clean speech + text as training targets, because in real life speech & other sound sources don't combine like that. That's why the spectacular results in the Parveen/Green paper are misleading.. HTH -- *** note email is now p.green@xxxxxxxxxx *** Professor Phil Green SPandH Dept of Computer Science University of Sheffield *** note email is now p.green@xxxxxxxxxx *** |