Re: [AUDITORY] Logan's theorem - a challenge (Neeraj Sharma )


Subject: Re: [AUDITORY] Logan's theorem - a challenge
From:    Neeraj Sharma  <neerajww@xxxxxxxx>
Date:    Tue, 28 Sep 2021 21:30:51 +0530

--00000000000004ed3f05cd10524f Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Hi Alain and members, I will attempt to give a bit of non-mathematical intuition behind information in the zeros. Zeros are cool! For example, if you know zeros (roots) of a polynomial you then know everything about it (upto a scaling factor). Hence, recovering a polynomial using just its zeros is not a surprise. Folks do it in high school too. A practical question is - do you have access to all its zeros (or roots)? If yes, excellent. If not, then we have to state the necessary and sufficient conditions which the polynomial should obey to enable its recovery from zeros. To give a hint, polynomials can have complex roots, and it is not search for the imaginary roots if all you see is your polynomial wiggling on the real axis! Now, let's switch gears to signals which are oscillatory. Polynomials can be oscillatory too but those will have many terms, with a huge mix of degrees. Let's instead consider sine waves as oscillatory signals. Often, they give a compact representation to represent oscillations. Can zeros (or zero-crossings, ZCs) of an oscillatory signal be sufficient for its recovery? B.F. Logan Jr addressed some aspects associated with this in http://languagelog.ldc.upenn.edu/myl/Logan1977.pdf (Theorem 8 and 9). Broadly, he presented the conditions under which the ZCs of the oscillatory signal can uniquely define the signal. Note that, he did present any reconstruction algorithms. =3D=3D=3D What are the conditions under which ZCs suffice to represent an oscillatory signal? He gave two conditions. This was very nice of him as it made the paper popular! Let's get an intuition in a few steps. - Firstly, what if the signal never crosses zero. It can have a DC making the oscillatory signal to wiggle above zero voltage all the time. No zeros hence, no recovery possible using ZCs! Hence, the signal should be bandpass (no DC). This will make sure that we get some ZCs. - Secondly, Nyquist sampling rate suggests that to sample an oscillatory signal at a rate at least twice the maximum frequency contained in its spectrum for enabling perfect recovery from just the samples. We must make sure the ZC rate satisfies the Nyquist rate. If the signal is bandpass with an octave bandwidth this will be satisfied. Under this condition, the ZCs rate no more below the Nyquist rate, and thus, we have hope for recovery just from ZCs. - Thirdly, we got the ZCs but do they uniquely define this oscillatory signal? Or, can there be another oscillatory signal which satisfies the above conditions and have the ZCs at the same instants? Here is the key contribution of Logan's paper. He presents a condition which if obeyed by the signal then the ZCs are unique. Interestingly, this condition links to the polynomials but in a more abstract way. I will not expand on it here. =3D=3D=3D So, nothing got solved as, like Alain suggested, we still don't have a reconstruction algorithm to get back the signal from ZCs. But we do now know some conditions which can guarantee the recovery and some day we will have an algorithm for recovery. Also, since Logan's paper, multiple researchers have worked on this topic. Like, what Malcolm et al. suggested is an iterative approach (projection on convex sets). These approaches are actually very interesting, and even more when they present the guarantees on convergence. Anyways, there is a much bigger picture here. Sampling ZCs implies sampling time instants instead of amplitude. I find this interesting as it breaks away from the uniform sampling (and analysis) framework employed in signal processing. Researchers continue to work on this aspect. I am also one of them :). One of our works which might be of interest to you is [1]. With new approaches, we might be able to reach out to a bigger class of oscillatory signals than what Logan considered. Further, also design algorithms (maybe DDN-style) to understand the information encoded in ZCs of arbitrary time limited (not bandlimited) signals. To summarize, ZCs might be the "dark matter" of signals and exploring these can potentially re-define the way we do DSP. [1] Neeraj Kumar Sharma, Time-Instant Sampling Based Encoding of Time-Varying Acoustic Spectrum, AIP Conference Proceedings 1703, 100003 (2015) https://aip.scitation.org/doi/pdf/10.1063/1.4939431 Cheers, Neeraj https://neerajww.github.io/ On Tue, Sep 28, 2021 at 5:27 PM Alain de Cheveigne < alain.de.cheveigne@xxxxxxxx> wrote: > Hi Malcom, > > Every time I come across a problem it invariably turns out that you, or > Dick, or Shihab, have figured it out in the 80s or 90s :-). > > POCS is very elegant. The fact that you implemented it in an auditory > model and got a good result suggests that the idea is indeed sound. And > yet, you say the outcome was not perfect, and it=E2=80=99s uncertain exac= tly how it > might depend on further imperfections of a biological implementation. It > would be nice to have a formal proof that relates imperfection of outcome > to imperfection of premises. And that is understandable by those who wou= ld > benefit from it. > > I see 3 categories of customer: (a) those, like you, or I, or Shihab, tha= t > trust that the result holds and think it's useful, (b) those who are > skeptical and/or put off by too much hand-waving, and (c) those who have = no > idea what we=E2=80=99re talking about, but should. Categories (b) and esp= ecially > (c) need catering for. > > The band-limited constraint is powerful, but brittle. A band-limited > signal extends over all time, allowing for LOTS of zero-crossings. A > time-limited signal has access to much less. The ~1/3 octave bandwidth of > an auditory filter might seem narrow enough, and yet its impulse response > (which determines the temporal span and thus the available zero-crossings= ) > extends for just a few cycles before falling below the noise floor. That= =E2=80=99s > VERY far from the ideal. On the other hand, as you say, a half-wave > rectified signal contains much more information, so intuitively the > consequence might hold even if the premise does not. It would be nice to > make that slightly more formal. > > The aim is to make things simple. If a Logan-like theorem holds, and > peripheral transduction is `transparent=E2=80=99, we don=E2=80=99t need t= o worry about it > when developing a neural model of auditory processing. We can conceptuali= ze > the model as having full access to the acoustic waveform, no information > lost. Implementation details are important, but we can work on them with > the trust that there is no obstacle of principle. That=E2=80=99s how I un= derstand > your ICASSP paper. > > This is potentially useful enough to justify the effort to get it right t= o > the satisfaction of all categories, particularly as the implications are > are non-trivial and possibly contentious. I=E2=80=99m not sure everyone = is happy > with the idea that spectral resolution is NOT limited by auditory filter > bandwidth... > > Alain > (please cc to Alain.de.Cheveigne@xxxxxxxx, as we=E2=80=99re having mail = server > problems) > > > > On 26 Sep 2021, at 16:01, Malcolm Slaney < > 000001757ffb5fe1-dmarc-request@xxxxxxxx> wrote: > > > > POCS. > > > > Projections onto Convex Sets [1]. > > > > Dick Lyon and I used POCS to invert [2] our favorite auditory model. A > contemporaneous paper [3] from Shamma=E2=80=99s lab did the same. > > > > Both the band-limited constraint and the known positive values of the > signal define convex sets. We know in the frequency domain many parts of > the spectrum are equal to zero. And in the time domain we know the value= s > that are positive. We can iterate between the time and the frequency > domain, each time projecting onto the appropriate constraint, to find the > best solution. > > > > I didn=E2=80=99t work out the theory, but since the auditory filter ban= k has a > bandwidth of less than an octave, I think there must be only a single > solution. In practice, just a handful of back and forth iterations was > sufficient to find the solution. > > > > Piece of cake. :-) > > > > Our interest in this problem was not to generate audio, a cute parlor > trick, but to show that the auditory representation we were working with > did not lose any perceptually important information. > > > > =E2=80=94 Malcolm > > P.S. Reconstructions from zero crossing requires infinite resolution o= f > the time of the zero crossing. That would be hard to do with a spike > representation. Fortunately, there is a LOT more information in the HWR > signal. > > > > [1] https://en.wikipedia.org/wiki/Projections_onto_convex_sets > > > > [2] M. Slaney, D. Naar. R. Lyon. Auditory model inversion for sound > separation. Proceedings of ICASSP '94. IEEE International Conference on > Acoustics, Speech and Signal Processing, 1994. > https://engineering.purdue.edu/~malcolm/apple/icassp94/CorrelogramInversi= on.pdf > > > > [3] X. Yang; K. Wang; S.A. Shamma. Auditory representations of acoustic > signals. IEEE Transactions on Information Theory, Volume: 38, Issue: 2, > March 1992. https://ieeexplore.ieee.org/document/119739 > > > >> On Sep 25, 2021, at 11:03 PM, Alain de Cheveigne < > alain.de.cheveigne@xxxxxxxx> wrote: > >> > >> Hi all, > >> > >> Here=E2=80=99s a challenge for the young nimble minds on this list, an= d the old > and wise. > >> > >> Logan=E2=80=99s theorem states that a signal can be reconstructed from= its zero > crossings, to a scale, as long as the spectral representation of that > signal is less than an octave wide. It sounds like magic given that zero > crossing information is so crude. How can the full signal be recovered fr= om > a sparse series of time values (with signs but no amplitudes)? > =E2=80=9CBand-limited=E2=80=9D is clearly a powerful assumption. > >> > >> Why is this of interest in the auditory context? The band-limited > premise is approximately valid for each channel of the cochlear filterban= k > (sometimes characterized as a 1/3 octave filter). While cochlear > transduction is non-linear, Logan=E2=80=99s theorem suggests that any inf= ormation > lost due to that non-linearity can be restored, within each channel. If s= o, > cochlear transduction is =E2=80=9Ctransparent=E2=80=9D, which is encourag= ing for those who > like to speculate about neural models of auditory processing. An algorith= m > applicable to the sound waveform can be implemented by the brain with > similar results, in principle. > >> > >> Logan=E2=80=99s theorem has been invoked by David Marr for vision and = several > authors for hearing (some refs below). The theorem is unclear as to how t= he > original signal should be reconstructed, which is an obstacle to > formulating concrete models, but in these days of machine learning it mig= ht > be OK to assume that the system can somehow learn to use the information, > granted that it=E2=80=99s there. The hypothesis has far-reaching implica= tions, for > example it implies that spectral resolution of central auditory processin= g > is not limited by peripheral frequency analysis (as already assumed by fo= r > example phase opponency or lateral inhibitory hypotheses). > >> > >> Before venturing further along this limb, it=E2=80=99s worth consideri= ng some > issues. First, Logan made clear that his theorem only applies to a > perfectly band-limited signal, and might not be =E2=80=9Capproximately va= lid=E2=80=9D for a > signal that is =E2=80=9Capproximately band-limited=E2=80=9D. No practica= l signal is > band-limited, if only because it must be time limited, and thus the theor= em > might conceivably not be applicable at all. On the other hand, half-wave > rectification offers much richer information than zero crossings, so > perhaps the end result is valid (information preserved) even if the theor= em > is not applicable stricto sensu. Second, there are many other > imperfections such as adaptation, stochastic sampling to a spike-based > representation, and so on, that might affect the usefulness of the > hypothesis. > >> > >> The challenge is to address some of these loose ends. For example: > >> (1) Can the theorem be extended to make use of a halfwave-rectified > signal rather than zero crossings? Might that allow it to be applicable t= o > practical time-limited signals? > >> (2) What is the impact of real cochlear filter characteristics, > adaptation, or stochastic sampling? > >> (3) In what sense can one say that the acoustic signal is "available= =E2=80=9D > to neural signal processing? What are the limits of that concept? > >> (4) Can all this be formulated in a way intelligible by > non-mathematical auditory scientists? > >> > >> This is the challenge. The reward is - possibly - a better > understanding of how our brain hears the world. > >> > >> Alain > >> > >> --- > >> Logan BF, JR. (1977) Information in the zero crossings of bandpass > signals. Bell Syst. Tech. J. 56:487=E2=80=93510. > >> > >> Marr, D. (1982) VISION - A Computational Investigation into the Human > Representation and Processing of Visual Information. W.H. Freeman and Co, > republished by MIT press 2010. > >> > >> Heinz, M.G., Swaminathan J. (2009) Quantifying Envelope and > Fine-Structure Coding in Auditory Nerve Responses to Chimaeric Speech, JA= RO > 10: 407=E2=80=93423 > >> DOI: 10.1007/s10162-009-0169-8. > >> > >> Shamma, S, Lorenzi, C (2013) On the balance of envelope and temporal > fine structure in the encoding of speech in the early auditory system, J. > Acoust. Soc. Am. 133, 2818=E2=80=932833. > >> > >> Parida S, Bharadwaj H, Heinz MG (2021) Spectrally specific temporal > analyses of spike-train responses to complex sounds: A unifying framework= . > PLoS Comput Biol 17(2): e1008155. > https://doi.org/10.1371/journal.pcbi.1008155 > >> > >> de Cheveign=C3=A9, A. (in press) Harmonic Cancellation, a Fundamental = of > Auditory Scene Analysis. Trends in Hearing (https://psyarxiv.com/b8e5w/). > --00000000000004ed3f05cd10524f Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable <div dir=3D"ltr">Hi Alain and members,<br><br>I will attempt to give a bit = of non-mathematical intuition behind information in the zeros.<div><br></di= v><div>Zeros are cool! For example, if you know zeros (roots) of a polynomi= al you then know everything about it (upto a scaling factor). Hence, recove= ring a polynomial using just its zeros=C2=A0 is not a surprise. Folks do it= in high school too.</div><div>A practical question is=C2=A0 - do you have = access to all its zeros (or roots)? If yes, excellent. If not, then we have= to state the necessary and sufficient conditions which the polynomial shou= ld obey to enable its=C2=A0recovery from zeros. To give a hint, polynomials= can have complex roots, and it is not search for the imaginary roots if al= l you see is your polynomial wiggling on the real axis!</div><div><br><div>= Now, let&#39;s switch gears to signals which are oscillatory. Polynomials c= an be oscillatory too but those will have many terms, with a huge mix of de= grees.<br>Let&#39;s instead consider sine waves as oscillatory signals. Oft= en, they give a compact representation to represent oscillations.<br>Can ze= ros (or zero-crossings, ZCs) of an oscillatory signal be sufficient for its= recovery? B.F. Logan Jr addressed some aspects associated with this in <a = href=3D"http://languagelog.ldc.upenn.edu/myl/Logan1977.pdf">http://language= log.ldc.upenn.edu/myl/Logan1977.pdf</a> (Theorem 8 and 9).<br>Broadly, he p= resented the conditions under which the ZCs of the oscillatory signal can u= niquely define the signal. Note that, he did present any reconstruction alg= orithms.<br>=3D=3D=3D<br>What are the conditions under which ZCs suffice to= represent an oscillatory signal? He gave two conditions. This was very nic= e of him as it made the paper popular! Let&#39;s get an intuition in a few = steps.<br>- Firstly, what if the signal never crosses zero. It can have a D= C making the oscillatory signal to wiggle above zero voltage all the time. = No zeros hence, no recovery possible using ZCs! Hence, the signal should be= bandpass (no DC). This will make sure that we get some ZCs.<br>- Secondly,= Nyquist sampling rate suggests that to sample an oscillatory signal at a r= ate at least twice the maximum frequency contained in its spectrum for enab= ling perfect recovery from just the samples.<br>We must make sure the ZC ra= te satisfies the Nyquist rate. If the signal is bandpass with an octave ban= dwidth this will be satisfied. Under this condition, the ZCs rate no more b= elow the Nyquist rate, and thus, we have hope for recovery just from ZCs.<b= r>- Thirdly, we got the ZCs but do they uniquely define this oscillatory si= gnal? Or, can there be another oscillatory signal which satisfies the above= conditions and have the ZCs at the same instants? Here is the key contribu= tion of Logan&#39;s paper. He presents a condition which if obeyed by the s= ignal then the ZCs are unique. Interestingly, this condition links to the p= olynomials but in a more abstract way. I will not expand on it here.<br>=3D= =3D=3D<br>So, nothing got solved as, like Alain suggested, we still don&#39= ;t have a reconstruction algorithm to get back the signal from ZCs. But we = do now know some conditions which can guarantee the recovery and some day w= e will have an algorithm for recovery. Also, since Logan&#39;s paper, multi= ple researchers have worked on this topic. Like, what Malcolm et al. sugges= ted is an iterative approach (projection on convex sets). These approaches = are actually very interesting, and even more when they present the guarante= es on convergence.</div><div><br>Anyways, there is a much bigger picture he= re. Sampling ZCs implies sampling time instants instead of amplitude.</div>= <div>I find this interesting as it breaks away from the uniform sampling (a= nd analysis) framework employed in signal processing. Researchers continue = to work on this aspect. I am also one of them :). One of our works which mi= ght be of interest=C2=A0to you is [1].<br><br></div><div>With new approache= s, we might be able to reach out to a bigger class of oscillatory signals t= han what Logan considered. Further, also design algorithms (maybe DDN-style= ) to understand the information encoded in ZCs of arbitrary=C2=A0time limit= ed=C2=A0 (not bandlimited) signals.</div><div>To summarize, ZCs might be th= e &quot;dark matter&quot; of signals and exploring these can potentially re= -define the way we do DSP.</div><div><br></div><div>[1] Neeraj Kumar Sharma= , Time-Instant Sampling Based Encoding of Time-Varying Acoustic Spectrum, A= IP Conference Proceedings 1703, 100003 (2015) <a href=3D"https://aip.scitat= ion.org/doi/pdf/10.1063/1.4939431">https://aip.scitation.org/doi/pdf/10.106= 3/1.4939431</a></div><div><br></div><div>Cheers,<br>Neeraj<br><a href=3D"ht= tps://neerajww.github.io/">https://neerajww.github.io/</a><br></div></div><= /div><br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">O= n Tue, Sep 28, 2021 at 5:27 PM Alain de Cheveigne &lt;<a href=3D"mailto:ala= in.de.cheveigne@xxxxxxxx">alain.de.cheveigne@xxxxxxxx</a>&gt; wrote:<br= ></div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;= border-left:1px solid rgb(204,204,204);padding-left:1ex">Hi Malcom, <br> <br> Every time I come across a problem it invariably turns out that you, or Dic= k, or Shihab, have figured it out in the 80s or 90s :-).=C2=A0 <br> <br> POCS is very elegant. The fact that you implemented it in an auditory model= and got a good result suggests that the idea is indeed sound.=C2=A0 And ye= t, you say the outcome was not perfect, and it=E2=80=99s uncertain exactly = how it might depend on further imperfections of a biological implementation= . It would be nice to have a formal proof that relates imperfection of outc= ome to imperfection of premises.=C2=A0 And that is understandable by those = who would benefit from it.<br> <br> I see 3 categories of customer: (a) those, like you, or I, or Shihab, that = trust that the result holds and think it&#39;s useful, (b) those who are sk= eptical and/or put off by too much hand-waving, and (c) those who have no i= dea what we=E2=80=99re talking about, but should. Categories (b) and especi= ally (c) need catering for.<br> <br> The band-limited constraint is powerful, but brittle.=C2=A0 A band-limited = signal extends over all time, allowing for LOTS of zero-crossings.=C2=A0 A = time-limited signal has access to much less. The ~1/3 octave bandwidth of a= n auditory filter might seem narrow enough, and yet its impulse response (w= hich determines the temporal span and thus the available zero-crossings) ex= tends for just a few cycles before falling below the noise floor. That=E2= =80=99s VERY far from the ideal.=C2=A0 On the other hand, as you say, a hal= f-wave rectified signal contains much more information, so intuitively the = consequence might hold even if the premise does not.=C2=A0 It would be nice= to make that slightly more formal.<br> <br> The aim is to make things simple.=C2=A0 If a Logan-like theorem holds, and = peripheral transduction is `transparent=E2=80=99, we don=E2=80=99t need to = worry about it when developing a neural model of auditory processing. We ca= n conceptualize the model as having full access to the acoustic waveform, n= o information lost. Implementation details are important, but we can work o= n them with the trust that there is no obstacle of principle. That=E2=80=99= s how I understand your ICASSP paper.<br> <br> This is potentially useful enough to justify the effort to get it right to = the satisfaction of all categories, particularly as the implications are ar= e non-trivial and possibly contentious.=C2=A0 I=E2=80=99m not sure everyone= is happy with the idea that spectral resolution is NOT limited by auditory= filter bandwidth...<br> <br> Alain<br> (please cc to <a href=3D"mailto:Alain.de.Cheveigne@xxxxxxxx" target=3D"_bl= ank">Alain.de.Cheveigne@xxxxxxxx</a>, as we=E2=80=99re having mail server = problems)<br> <br> <br> &gt; On 26 Sep 2021, at 16:01, Malcolm Slaney &lt;<a href=3D"mailto:0000017= 57ffb5fe1-dmarc-request@xxxxxxxx" target=3D"_blank">000001757ffb5fe1= -dmarc-request@xxxxxxxx</a>&gt; wrote:<br> &gt; <br> &gt; POCS.<br> &gt; <br> &gt; Projections onto Convex Sets [1].<br> &gt; <br> &gt; Dick Lyon and I used POCS to invert [2] our favorite auditory model.= =C2=A0 A contemporaneous paper [3] from Shamma=E2=80=99s lab did the same. = <br> &gt; <br> &gt; Both the band-limited constraint and the known positive values of the = signal define convex sets.=C2=A0 We know in the frequency domain many parts= of the spectrum are equal to zero.=C2=A0 And in the time domain we know th= e values that are positive.=C2=A0 We can iterate between the time and the f= requency domain, each time projecting onto the appropriate constraint, to f= ind the best solution.<br> &gt; <br> &gt; I didn=E2=80=99t work out the theory, but since the auditory filter ba= nk has a bandwidth of less than an octave, I think there must be only a sin= gle solution.=C2=A0 In practice, just a handful of back and forth iteration= s was sufficient to find the solution.<br> &gt; <br> &gt; Piece of cake.=C2=A0 :-)<br> &gt; <br> &gt; Our interest in this problem was not to generate audio, a cute parlor = trick, but to show that the auditory representation we were working with di= d not lose any perceptually important information.<br> &gt; <br> &gt; =E2=80=94 Malcolm<br> &gt; P.S.=C2=A0 Reconstructions from zero crossing requires infinite resolu= tion of the time of the zero crossing.=C2=A0 That would be hard to do with = a spike representation. Fortunately, there is a LOT more information in the= HWR signal.<br> &gt; <br> &gt; [1] <a href=3D"https://en.wikipedia.org/wiki/Projections_onto_convex_s= ets" rel=3D"noreferrer" target=3D"_blank">https://en.wikipedia.org/wiki/Pro= jections_onto_convex_sets</a><br> &gt; <br> &gt; [2] M. Slaney, D. Naar.=C2=A0 R. Lyon. Auditory model inversion for so= und separation. Proceedings of ICASSP &#39;94. IEEE International Conferenc= e on Acoustics, Speech and Signal Processing, 1994. <a href=3D"https://engi= neering.purdue.edu/~malcolm/apple/icassp94/CorrelogramInversion.pdf" rel=3D= "noreferrer" target=3D"_blank">https://engineering.purdue.edu/~malcolm/appl= e/icassp94/CorrelogramInversion.pdf</a><br> &gt; <br> &gt; [3] X. Yang; K. Wang; S.A. Shamma. Auditory representations of acousti= c signals. IEEE Transactions on Information Theory, Volume: 38, Issue: 2, M= arch 1992. <a href=3D"https://ieeexplore.ieee.org/document/119739" rel=3D"n= oreferrer" target=3D"_blank">https://ieeexplore.ieee.org/document/119739</a= ><br> &gt; <br> &gt;&gt; On Sep 25, 2021, at 11:03 PM, Alain de Cheveigne &lt;<a href=3D"ma= ilto:alain.de.cheveigne@xxxxxxxx" target=3D"_blank">alain.de.cheveigne@xxxxxxxx= S.PSL.EU</a>&gt; wrote:<br> &gt;&gt; <br> &gt;&gt; Hi all,<br> &gt;&gt; <br> &gt;&gt; Here=E2=80=99s a challenge for the young nimble minds on this list= , and the old and wise.<br> &gt;&gt; <br> &gt;&gt; Logan=E2=80=99s theorem states that a signal can be reconstructed = from its zero crossings, to a scale, as long as the spectral representation= of that signal is less than an octave wide.=C2=A0 It sounds like magic giv= en that zero crossing information is so crude. How can the full signal be r= ecovered from a sparse series of time values (with signs but no amplitudes)= ?=C2=A0 =E2=80=9CBand-limited=E2=80=9D is clearly a powerful assumption.<br= > &gt;&gt; <br> &gt;&gt; Why is this of interest in the auditory context?=C2=A0 The band-li= mited premise is approximately valid for each channel of the cochlear filte= rbank (sometimes characterized as a 1/3 octave filter).=C2=A0 While cochlea= r transduction is non-linear, Logan=E2=80=99s theorem suggests that any inf= ormation lost due to that non-linearity can be restored, within each channe= l. If so, cochlear transduction is =E2=80=9Ctransparent=E2=80=9D, which is = encouraging for those who like to speculate about neural models of auditory= processing. An algorithm applicable to the sound waveform can be implement= ed by the brain with similar results, in principle.=C2=A0 <br> &gt;&gt; <br> &gt;&gt; Logan=E2=80=99s theorem has been invoked by David Marr for vision = and several authors for hearing (some refs below). The theorem is unclear a= s to how the original signal should be reconstructed, which is an obstacle = to formulating concrete models, but in these days of machine learning it mi= ght be OK to assume that the system can somehow learn to use the informatio= n, granted that it=E2=80=99s there.=C2=A0 The hypothesis has far-reaching i= mplications, for example it implies that spectral resolution of central aud= itory processing is not limited by peripheral frequency analysis (as alread= y assumed by for example phase opponency or lateral inhibitory hypotheses).= <br> &gt;&gt; <br> &gt;&gt; Before venturing further along this limb, it=E2=80=99s worth consi= dering some issues.=C2=A0 First, Logan made clear that his theorem only app= lies to a perfectly band-limited signal, and might not be =E2=80=9Capproxim= ately valid=E2=80=9D for a signal that is =E2=80=9Capproximately band-limit= ed=E2=80=9D.=C2=A0 No practical signal is band-limited, if only because it = must be time limited, and thus the theorem might conceivably not be applica= ble at all.=C2=A0 On the other hand, half-wave rectification offers much ri= cher information than zero crossings, so perhaps the end result is valid (i= nformation preserved) even if the theorem is not applicable stricto sensu.= =C2=A0 Second, there are many other imperfections such as adaptation, stoch= astic sampling to a spike-based representation, and so on, that might affec= t the usefulness of the hypothesis.<br> &gt;&gt; <br> &gt;&gt; The challenge is to address some of these loose ends. For example:= <br> &gt;&gt; (1) Can the theorem be extended to make use of a halfwave-rectifie= d signal rather than zero crossings? Might that allow it to be applicable t= o practical time-limited signals?<br> &gt;&gt; (2) What is the impact of real cochlear filter characteristics, ad= aptation, or stochastic sampling?=C2=A0 <br> &gt;&gt; (3) In what sense can one say that the acoustic signal is &quot;av= ailable=E2=80=9D to neural signal processing?=C2=A0 What are the limits of = that concept?<br> &gt;&gt; (4) Can all this be formulated in a way intelligible by non-mathem= atical auditory scientists?<br> &gt;&gt; <br> &gt;&gt; This is the challenge.=C2=A0 The reward is - possibly - a better u= nderstanding of how our brain hears the world.<br> &gt;&gt; <br> &gt;&gt; Alain<br> &gt;&gt; <br> &gt;&gt; ---<br> &gt;&gt; Logan BF, JR. (1977) Information in the zero crossings of bandpass= signals. Bell Syst. Tech. J. 56:487=E2=80=93510.<br> &gt;&gt; <br> &gt;&gt; Marr, D. (1982) VISION - A Computational Investigation into the Hu= man Representation and Processing of Visual Information. W.H. Freeman and C= o, republished by MIT press 2010.<br> &gt;&gt; <br> &gt;&gt; Heinz, M.G., Swaminathan J. (2009) Quantifying Envelope and Fine-S= tructure Coding in Auditory Nerve Responses to Chimaeric Speech, JARO 10: 4= 07=E2=80=93423<br> &gt;&gt; DOI: 10.1007/s10162-009-0169-8.<br> &gt;&gt; <br> &gt;&gt; Shamma, S, Lorenzi, C (2013) On the balance of envelope and tempor= al fine structure in the encoding of speech in the early auditory system, J= . Acoust. Soc. Am. 133, 2818=E2=80=932833.<br> &gt;&gt; <br> &gt;&gt; Parida S, Bharadwaj H, Heinz MG (2021) Spectrally specific tempora= l analyses of spike-train responses to complex sounds: A unifying framework= . PLoS Comput Biol 17(2): e1008155. <a href=3D"https://doi.org/10.1371/jour= nal.pcbi.1008155" rel=3D"noreferrer" target=3D"_blank">https://doi.org/10.1= 371/journal.pcbi.1008155</a><br> &gt;&gt; <br> &gt;&gt; de Cheveign=C3=A9, A. (in press) Harmonic Cancellation, a Fundamen= tal of Auditory Scene Analysis. Trends in Hearing (<a href=3D"https://psyar= xiv.com/b8e5w/" rel=3D"noreferrer" target=3D"_blank">https://psyarxiv.com/b= 8e5w/</a>).<br> </blockquote></div> --00000000000004ed3f05cd10524f--


This message came from the mail archive
src/postings/2021/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University