[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

PhD studentship at Stirling University (Scotland)



Dear all:

Bio-inspired Feature Detection for Sound Streaming and Interpretation

This Department has set aside funds to fund a 3 year PhD studentship (UK/EC students only) starting as soon as possible.

This project aims to develop biologically inspired techniques for the detection of spectro-temporal features in sound which may be used (i) for separating foreground sound streams from background sound streams and (ii) for the interpretation of the foreground sound streams. Most current work is speech recognition based, and assumes (incorrectly) that the speech is the only signal present. In recent years there has been work on Computer Audio Scene Analysis (CASA), which attempts to separate the sound into a number of sound streams, each from a different sound source. This normally starts off by band pass filtering the sound into multiple channels, using a biologically inspired filter bank, unlike the Fourier-transform approach taken in earlier speech recognition systems. We have followed this approach, and extended it by using an (auditory nerve like) spike-based coding approach which provides precise timing and can cope with the very wide dynamic range of sound signals, and have already used onset features detected for sound source direction finding and some basic interpretation [1, 2]. Recently, we have extended the types of (proto-) feature that can be discovered by using a two dimensional spectro-temporal window operator, implemented as a set synapses (whose weights and delays encode these window operators) from the spike-coded signal to a leaky integrate and fire neuron. This approach appears promising, since proto-feature detection can be made signal level independent, and proto-features which are invariant under the usual variation in listening environments can be chosen. Further, this approach can use greedy (parallel) processing, taking advantage of modern CPUs and signal processing technologies. The research will assess proto-features and combinations of these features which are useful in foreground/background sound signal separation, and which are useful in sound (including, but not only) speech. The techniques will recode the sound as a set of sequences of proto-features (and features combined from these proto-features), and these sequences will be interpreted, and the mapping from proto-features to features adapted (for example, using neural net technologies). The Department has a lab which can be used for multi-stream sound acquisition. It also enables the reverberance of the environment to be varied.

Further details: see http://www.cs.stir.ac.uk/~lss/research/proposedproject2008.html

--Leslie Smith

Professor Leslie S. Smith,
Head, Dept of Computing Science and Mathematics,
University of Stirling,
Stirling FK9 4LA, Scotland
l.s.smith@xxxxxxxxxxxxx
Tel (44) 1786 467435 Fax (44) 1786 464551
www http://www.cs.stir.ac.uk/~lss/






--
Academic Excellence at the Heart of Scotland.
The University of Stirling is a charity registered in Scotland, number SC 011159.