[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Features for robust speaker identification
One feature we proposed and found to be rather effective for robust
speaker identification is GFCC (gammatone frequency cepstral
coefficient). Its description and analysis are given below:
- Shao Y. and Wang D.L. (2008): "Robust speaker identification using
auditory features and computational auditory scene analysis." ICASSP-08,
pp. 1589-1592.
- Zhao X., Shao Y., and Wang D.L. (2012): "CASA-based robust speaker
identification," IEEE Transactions on Audio, Speech, and Language
Processing, vol. 20, pp. 1608-1616.
- Zhao X. and Wang D.L. (2013): "Analyzing noise robustness of MFCC and
GFCC features in speaker identification," ICASSP-13, pp. 7204-7208.
You can also find the Matlab code for GFCC extraction on my lab's website.
Cheers,
DeLiang
On 9/16/2014 12:23 PM, Celestino Alvarez wrote:
Dear list,
I was planning to build a speaker identification application, and I
was wondering what are the best features for a robust identification.
Any advise on the right papers to read, would help.
Best,
Tino
--
------------------------------------------------------------
DeLiang Wang, Professor
Co-Editor-in-Chief, Neural Networks
Department of Computer Science and Engineering
The Ohio State University
2015 Neil Ave.
Columbus, OH 43210-1277, U.S.A.
Phone: 614-292-6827 (OFFICE); 614-292-7402 (LAB)
http://www.cse.ohio-state.edu/~dwang
"Happiness = Reality - Expectation"