
Re: Features for robust speaker identification

To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: Features for robust speaker identification
From: DeLiang Wang <dwang@xxxxxxxxxxxxxxxxxx>
Date: Wed, 17 Sep 2014 21:21:46 -0400
Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

One feature we proposed and found to be rather effective for robust speaker identification is GFCC (gammatone frequency cepstral coefficient). Its description and analysis are given below:

- Shao Y. and Wang D.L. (2008): "Robust speaker identification using auditory features and computational auditory scene analysis," ICASSP-08, pp. 1589-1592.

- Zhao X., Shao Y., and Wang D.L. (2012): "CASA-based robust speaker identification," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, pp. 1608-1616.

- Zhao X. and Wang D.L. (2013): "Analyzing noise robustness of MFCC and GFCC features in speaker identification," ICASSP-13, pp. 7204-7208.


You can also find the Matlab code for GFCC extraction on my lab's website.

Cheers,
DeLiang

On 9/16/2014 12:23 PM, Celestino Alvarez wrote:

Dear list,
I am planning to build a speaker identification application, and I was wondering what the best features are for robust identification.
Any advice on the right papers to read would help.

Best,

Tino


--
------------------------------------------------------------
DeLiang Wang, Professor
Co-Editor-in-Chief, Neural Networks
Department of Computer Science and Engineering
The Ohio State University
2015 Neil Ave.
Columbus, OH 43210-1277, U.S.A.

Phone: 614-292-6827 (OFFICE); 614-292-7402 (LAB)
http://www.cse.ohio-state.edu/~dwang

"Happiness = Reality - Expectation"
