[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [AUDITORY] Logan's theorem - a challenge

To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: [AUDITORY] Logan's theorem - a challenge
From: Malcolm Slaney <000001757ffb5fe1-dmarc-request@xxxxxxxxxxxxxxx>
Date: Sun, 26 Sep 2021 08:01:22 -0700
Approved-by: malcolm@xxxxxxxx
Arc-authentication-results: i=1; mx.google.com; spf=pass (google.com: domain of owner-auditory@xxxxxxxxxxxxxxx designates 132.206.27.104 as permitted sender) smtp.mailfrom=owner-auditory@xxxxxxxxxxxxxxx; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=mcgill.ca
Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-archive:list-owner:list-subscribe:list-unsubscribe:list-help :precedence:in-reply-to:to:comments:subject:from:sender:reply-to :date:message-id:references:content-transfer-encoding:mime-version :approved-by; bh=Qg9mlxCQxo84UQy3sTNd01LhsKfm1UVckgo8KV0vjo8=; b=wLpOV61pi3Fc7eDZR3xSTzZB1xFH4RRcn9T1UDRIybSg8OlCST8DuZlqYpzy3tCiAt r17a5gRZOjogLcyBw1ehhgHYk0n3F9E5wEjNL9CDnz6C+Iq8UZfpR3PE/OhqocZIxXRf cGlC9mU5cboJZgjzzg6bKPOPpBDAH7/6r+Loq5Tm2HJTCrpNvFgyz5rMCn4UljvZG5nc EiZplfL6rE4Ges+z8AXNomyp4d/FlefmKO8SWi19Q1WM6tmk+FTXBhJvkbX44ZyT7mQl 23sk+ohMqyh3rT8Lr84jmf1N/ygGg6/x/SUeHvoQAW8hbpTnU9h7OQSF/4fGRaiITOW0 P8kw==
Arc-seal: i=1; a=rsa-sha256; t=1632716523; cv=none; d=google.com; s=arc-20160816; b=YiKJ/dwu2Ub4eo10ZYIUC+kuiku3eyCgsyynmxiCcmYLSJ02D5Bd94/G4LW6T5ndRN qGcDgIT1HXn+WXNREEuqkx9hbBaJGPcU6s80dvMAMtPOyKojjCQGYtGIYBONTo8MI5+E dhosulSkH2WyaGZc6AcPjl9DIwGLu98/f56jflAzck0KhcjKLEkRUvuC7W/kI1SvvGZy CjZ4Y2ry3jt5EN4V9cQSGtS9qA0RsbIR/HOAihD96opnZcFzEQCxr4RU8U/Nkbd2JdCo D/qeWZwT14sooh5+UPEhCT4iO36SBqamC+Q+5WL9LdhM3FnOQZ3F08mqhTndyhnr4IPV 8UtQ==
Authentication-results: mx.google.com; spf=pass (google.com: domain of owner-auditory@xxxxxxxxxxxxxxx designates 132.206.27.104 as permitted sender) smtp.mailfrom=owner-auditory@xxxxxxxxxxxxxxx; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=mcgill.ca
Comments: To: Alain de Cheveigne <alain.de.cheveigne@xxxxxxxxxx>
Delivered-to: dan.ellis@xxxxxxxxx
In-reply-to: <6DB642F5-0E04-452F-B8F6-517301BEC405@ens.psl.eu>
List-archive: <https://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>
List-help: <https://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>, <mailto:LISTSERV@LISTS.MCGILL.CA?body=INFO%20AUDITORY>
List-owner: <mailto:AUDITORY-request@LISTS.MCGILL.CA>
List-subscribe: <mailto:AUDITORY-subscribe-request@LISTS.MCGILL.CA>
List-unsubscribe: <mailto:AUDITORY-unsubscribe-request@LISTS.MCGILL.CA>
References: <6DB642F5-0E04-452F-B8F6-517301BEC405@ens.psl.eu>
Reply-to: Malcolm Slaney <malcolm@xxxxxxxx>
Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

POCS.

Projections onto Convex Sets [1].

Dick Lyon and I used POCS to invert [2] our favorite auditory model.  A contemporaneous paper [3] from Shamma’s lab did the same. 

Both the band-limited constraint and the known positive values of the signal define convex sets.  We know in the frequency domain many parts of the spectrum are equal to zero.  And in the time domain we know the values that are positive.  We can iterate between the time and the frequency domain, each time projecting onto the appropriate constraint, to find the best solution.

I didn’t work out the theory, but since the auditory filter bank has a bandwidth of less than an octave, I think there must be only a single solution.  In practice, just a handful of back and forth iterations was sufficient to find the solution.

Piece of cake.  :-)

Our interest in this problem was not to generate audio, a cute parlor trick, but to show that the auditory representation we were working with did not lose any perceptually important information.

— Malcolm
P.S.  Reconstructions from zero crossing requires infinite resolution of the time of the zero crossing.  That would be hard to do with a spike representation. Fortunately, there is a LOT more information in the HWR signal.

[1] https://en.wikipedia.org/wiki/Projections_onto_convex_sets

[2] M. Slaney, D. Naar.  R. Lyon. Auditory model inversion for sound separation. Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing, 1994. https://engineering.purdue.edu/~malcolm/apple/icassp94/CorrelogramInversion.pdf

[3] X. Yang; K. Wang; S.A. Shamma. Auditory representations of acoustic signals. IEEE Transactions on Information Theory, Volume: 38, Issue: 2, March 1992. https://ieeexplore.ieee.org/document/119739

> On Sep 25, 2021, at 11:03 PM, Alain de Cheveigne <alain.de.cheveigne@xxxxxxxxxx> wrote:
> 
> Hi all,
> 
> Here’s a challenge for the young nimble minds on this list, and the old and wise.
> 
> Logan’s theorem states that a signal can be reconstructed from its zero crossings, to a scale, as long as the spectral representation of that signal is less than an octave wide.  It sounds like magic given that zero crossing information is so crude. How can the full signal be recovered from a sparse series of time values (with signs but no amplitudes)?  “Band-limited” is clearly a powerful assumption.
> 
> Why is this of interest in the auditory context?  The band-limited premise is approximately valid for each channel of the cochlear filterbank (sometimes characterized as a 1/3 octave filter).  While cochlear transduction is non-linear, Logan’s theorem suggests that any information lost due to that non-linearity can be restored, within each channel. If so, cochlear transduction is “transparent”, which is encouraging for those who like to speculate about neural models of auditory processing. An algorithm applicable to the sound waveform can be implemented by the brain with similar results, in principle.  
> 
> Logan’s theorem has been invoked by David Marr for vision and several authors for hearing (some refs below). The theorem is unclear as to how the original signal should be reconstructed, which is an obstacle to formulating concrete models, but in these days of machine learning it might be OK to assume that the system can somehow learn to use the information, granted that it’s there.  The hypothesis has far-reaching implications, for example it implies that spectral resolution of central auditory processing is not limited by peripheral frequency analysis (as already assumed by for example phase opponency or lateral inhibitory hypotheses).
> 
> Before venturing further along this limb, it’s worth considering some issues.  First, Logan made clear that his theorem only applies to a perfectly band-limited signal, and might not be “approximately valid” for a signal that is “approximately band-limited”.  No practical signal is band-limited, if only because it must be time limited, and thus the theorem might conceivably not be applicable at all.  On the other hand, half-wave rectification offers much richer information than zero crossings, so perhaps the end result is valid (information preserved) even if the theorem is not applicable stricto sensu.  Second, there are many other imperfections such as adaptation, stochastic sampling to a spike-based representation, and so on, that might affect the usefulness of the hypothesis.
> 
> The challenge is to address some of these loose ends. For example:
> (1) Can the theorem be extended to make use of a halfwave-rectified signal rather than zero crossings? Might that allow it to be applicable to practical time-limited signals?
> (2) What is the impact of real cochlear filter characteristics, adaptation, or stochastic sampling?  
> (3) In what sense can one say that the acoustic signal is "available” to neural signal processing?  What are the limits of that concept?
> (4) Can all this be formulated in a way intelligible by non-mathematical auditory scientists?
> 
> This is the challenge.  The reward is - possibly - a better understanding of how our brain hears the world.
> 
> Alain
> 
> ---
> Logan BF, JR. (1977) Information in the zero crossings of bandpass signals. Bell Syst. Tech. J. 56:487–510.
> 
> Marr, D. (1982) VISION - A Computational Investigation into the Human Representation and Processing of Visual Information. W.H. Freeman and Co, republished by MIT press 2010.
> 
> Heinz, M.G., Swaminathan J. (2009) Quantifying Envelope and Fine-Structure Coding in Auditory Nerve Responses to Chimaeric Speech, JARO 10: 407–423
> DOI: 10.1007/s10162-009-0169-8.
> 
> Shamma, S, Lorenzi, C (2013) On the balance of envelope and temporal fine structure in the encoding of speech in the early auditory system, J. Acoust. Soc. Am. 133, 2818–2833.
> 
> Parida S, Bharadwaj H, Heinz MG (2021) Spectrally specific temporal analyses of spike-train responses to complex sounds: A unifying framework. PLoS Comput Biol 17(2): e1008155. https://doi.org/10.1371/journal.pcbi.1008155
> 
> de Cheveigné, A. (in press) Harmonic Cancellation, a Fundamental of Auditory Scene Analysis. Trends in Hearing (https://psyarxiv.com/b8e5w/).

Follow-Ups:
- Re: [AUDITORY] Logan's theorem - a challenge
  - From: Prof Leslie Smith
- Re: [AUDITORY] Logan's theorem - a challenge
  - From: Alain de Cheveigne

References:
- [AUDITORY] Logan's theorem - a challenge
  - From: Alain de Cheveigne

Prev by Date: [AUDITORY] Research Assistant in prediction in conversation and hearing loss (location Glasgow)
Next by Date: Re: [AUDITORY] Logan's theorem - a challenge
Previous by thread: [AUDITORY] Logan's theorem - a challenge
Next by thread: Re: [AUDITORY] Logan's theorem - a challenge
Index(es):
- Date
- Thread