[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: psychoacoustically driven temporal approximation

To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: psychoacoustically driven temporal approximation
From: Joachim Thiemann <joachim.thiemann@xxxxxxxxx>
Date: Wed, 5 Mar 2014 11:01:07 +0100
Approved-by: joachim.thiemann@xxxxxxxxx
Comments: To: JesterN Alberto Novello <jestern77@xxxxxxxx>
Delivery-date: Wed Mar 5 05:03:19 2014
In-reply-to: <11859_1393997752_5316B7B7_11859_165_17_1393931107.47255.YahooMailNeo@web171605.mail.ir2.yahoo.com>
List-archive: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>
List-help: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>, <mailto:LISTSERV@LISTS.MCGILL.CA?body=INFO AUDITORY>
List-owner: <mailto:AUDITORY-request@LISTS.MCGILL.CA>
List-subscribe: <mailto:AUDITORY-subscribe-request@LISTS.MCGILL.CA>
List-unsubscribe: <mailto:AUDITORY-unsubscribe-request@LISTS.MCGILL.CA>
References: <11859_1393997752_5316B7B7_11859_165_17_1393931107.47255.YahooMailNeo@web171605.mail.ir2.yahoo.com>
Reply-to: Joachim Thiemann <joachim.thiemann@xxxxxxxxx>
Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

Hello Alberto,

I'm not entirely sure, but this sounds a bit like my thoughts when I
started my Ph.D. many years ago: can I synthesize an audio waveform
from a perceptual representation of another audio signal?  (and what
does that imply about that particular perceptual representation?)

My answer was to so by iterative resynthesis: make a first good guess
of the inverse perceptual transform, then correct for the error.  Key
point is that the correction needs to go in the right direction.  My
perceptual transform was a set of sparsely sampled hilbert envelopes
of the outputs of a gammatone filterbank.

If you want you can have a look at my thesis "A Sparse Auditory
Envelope Representation with Iterative Reconstruction for Audio
Coding", linked from my homepage
(http://jthiem.bitbucket.org/research.html), you can find the MATLAB
code on that page too. Of course, in my thesis I refer to work that
others have done in a similar vein.

Cheers,
Joachim.

On 4 March 2014 12:05, JesterN Alberto Novello <jestern77@xxxxxxxx> wrote:
> Hi all,
> i'm trying to find a way to approximate the sample values of an audio
> waveform in time domain.
> I want a method that takes care of approximating perceptually-relevant audio
> bands better than others.
> Basically a spectral-weighted temporal approximation method.
> In my head it's not clear how to connect frequency components to specific
> samples in the time domain.
> Any DSP wizard out there with a good idea/papers ?
> Best regards
> Alberto
>

-- 
Joachim Thiemann :: http://jthiem.bitbucket.org ::
http://signalsprocessed.blogspot.co

Prev by Date: Book Anouncement: Computational Paralinguistics
Next by Date: Re: psychoacoustically driven temporal approximation
Previous by thread: psychoacoustically driven temporal approximation
Next by thread: Re: psychoacoustically driven temporal approximation
Index(es):
- Date
- Thread