[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: About importance of "phase" in sound recognition

To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: About importance of "phase" in sound recognition
From: Joachim Thiemann <joachim.thiemann@xxxxxxxxx>
Date: Sat, 9 Oct 2010 08:51:14 -0400
Approved-by: joachim.thiemann@xxxxxxxxx
Comments: To: James Johnston <James.Johnston@xxxxxxx>
Delivery-date: Sat Oct 9 08:52:52 2010
In-reply-to: <20101008194629.9A4FD9857@xxxxxxxxxxxxxxxxxxxxxxx>
List-archive: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>
List-help: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>, <mailto:LISTSERV@LISTS.MCGILL.CA?body=INFO AUDITORY>
List-owner: <mailto:AUDITORY-request@LISTS.MCGILL.CA>
List-subscribe: <mailto:AUDITORY-subscribe-request@LISTS.MCGILL.CA>
List-unsubscribe: <mailto:AUDITORY-unsubscribe-request@LISTS.MCGILL.CA>
References: <20101007213645.637317165@xxxxxxxxxxxxxxxxxxxxxxx> <20101008094422.E19A78110@xxxxxxxxxxxxxxxxxxxxxxx> <20101008130215.358FE5685@xxxxxxxxxxxxxxxxxxxxxxx> <20101008133504.CC22085FA@xxxxxxxxxxxxxxxxxxxxxxx> <20101008160133.69E225590@xxxxxxxxxxxxxxxxxxxxxxx> <20101008194629.9A4FD9857@xxxxxxxxxxxxxxxxxxxxxxx>
Reply-to: Joachim Thiemann <joachim.thiemann@xxxxxxxxx>
Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

On Fri, Oct 8, 2010 at 15:44, James Johnston <James.Johnston@xxxxxxx> wrote:
> Do a 2^20th fft.
> In the bin corresponding to 500Hz, your choice of sampling frequencies, put a '1'.
> In the bins corresponding to 996 and 1004, put a .25.
[...]
> Repeat, using the same gain so as to avoid intensity differences.
[...]
> To me, at least, they are different sounds.

But the Fourier transform as used here is a 1-1 transform, without
redundancy.  All reconstruction from magnitude methods rely on
redundancy - Griffin & Lim use FFT blocks that overlap fully, and the
algorithms by Cassaza et al for polynomial time inversion rely on N^2
magnitude coefficients.

The Fourier transform is a projection of a signal onto infinite-length
sinusoids, (or in the case of the STFT, a circulant projection onto
short-time sinusoids) which is not very perceptually based.

Joe.
-- 
Joachim Thiemann :: http://www.tsp.ece.mcgill.ca/~jthiem

Prev by Date: Faculty position at the University of Maryland
Next by Date: Re: About importance of "phase" in sound recognition
Previous by thread: Re: About importance of "phase" in sound recognition
Next by thread: Re: About importance of "phase" in sound recognition
Index(es):
- Date
- Thread