Re: [AUDITORY] On 3D audio rendering for signals with the low sampling frequency (Junfeng Li )


Subject: Re: [AUDITORY] On 3D audio rendering for signals with the low sampling frequency
From:    Junfeng Li  <junfeng.li.1979@xxxxxxxx>
Date:    Tue, 16 Aug 2022 13:46:16 +0800

--00000000000003ecca05e655458c Content-Type: text/plain; charset="UTF-8" Dear Leslie, Possibly, the compression approach that you examined is the traditional ones for audio or speech signals. To keep the spatial cues after compression, I think we should examine the spatial audio coding methods, as described in MEPG-H, which should helps. Best regards, Junfeng On Tue, Aug 16, 2022 at 12:16 PM Prof Leslie Smith <l.s.smith@xxxxxxxx> wrote: > Dear all: > > Some years ago, I worked on using sound at onsets for calculating source > direction in reverberant environments [1]. It's kind-of obvious, because > after the onset, the sound at the ear/microphone is made up of energy both > from the source and from reflections. > > Sampling rates are normally constant, and techniques for compression are > aimed at recreating the percept of the original sound: I am under the > impression that this doesn't extend to the percept of precise location of > the sound. Perhaps we need novel compression/decompression techniques > that include the relevant data for source location. > > [1] L.S. Smith, S. Collins Determining ITDs using two microphones on a > flat panel during onset intervals with a biologically inspired spike based > technique > IEEE Transactions of Audio, Speech and Language Processing, 15, 8, > 2278-2286, (2007). > > --Leslie Smith > > Adam Weisser wrote: > > > 1. Compressed sensing - This heavily researched signal-processing method > > uses signal sparsity to faithfully reconstruct undersampled signals [1]. > > > ..... > > Neural adaptation can be thought of as dense > > sampling of the signal around its onset / transient portion, which > becomes > > more sparsely sampled quickly after the onset. Because of adaptation, > this > > effect is very illusive, but I believe that it is measurable > > notwithstanding. I tried to demonstrate it psychoacoustically in Appendix > > E of [4]. While I don't know how it relates to binaural processing > > directly, there may be instantaneous effects that may be detectable there > > too, given that the input to both processing types is the same. > > > > All the best, > > Adam. > > > ... > > > -- > Prof Leslie Smith (Emeritus) > Computing Science & Mathematics, > University of Stirling, Stirling FK9 4LA > Scotland, UK > Tel +44 1786 467435 > Web: http://www.cs.stir.ac.uk/~lss > Blog: http://lestheprof.com > --00000000000003ecca05e655458c Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable <div dir=3D"ltr">Dear Leslie,=C2=A0<div><br></div><div>Possibly, the compre= ssion approach that you examined=C2=A0is the traditional ones for audio or = speech signals. To keep the spatial cues after compression, I think we shou= ld examine the spatial audio coding methods, as described in MEPG-H, which = should helps.=C2=A0</div><div><br></div><div>Best regards,</div><div>Junfen= g=C2=A0</div></div><br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D= "gmail_attr">On Tue, Aug 16, 2022 at 12:16 PM Prof Leslie Smith &lt;<a href= =3D"mailto:l.s.smith@xxxxxxxx">l.s.smith@xxxxxxxx</a>&gt; wrote:<= br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8e= x;border-left:1px solid rgb(204,204,204);padding-left:1ex">Dear all:<br> <br> Some years ago, I worked on using sound at onsets for calculating source<br= > direction in reverberant environments [1]. It&#39;s kind-of obvious, becaus= e<br> after the onset, the sound at the ear/microphone is made up of energy both<= br> from the source and from reflections.<br> <br> Sampling rates are normally constant, and techniques for compression are<br= > aimed at recreating the percept of the original sound: I am under the<br> impression that this doesn&#39;t extend to the percept of precise location = of<br> the sound. Perhaps we need novel compression/decompression=C2=A0 techniques= <br> that include the relevant data for source location.<br> <br> [1] L.S. Smith, S. Collins Determining ITDs using two microphones on a<br> flat panel during onset intervals with a biologically inspired spike based<= br> technique<br> IEEE Transactions of Audio, Speech and Language Processing, 15, 8,<br> 2278-2286, (2007).<br> <br> --Leslie Smith<br> <br> Adam Weisser wrote:<br> <br> &gt; 1. Compressed sensing - This heavily researched signal-processing meth= od<br> &gt; uses signal sparsity to faithfully reconstruct undersampled signals [1= ].<br> &gt;<br> .....<br> &gt; Neural adaptation can be thought of as dense<br> &gt; sampling of the signal around its onset / transient portion, which bec= omes<br> &gt; more sparsely sampled quickly after the onset. Because of adaptation, = this<br> &gt; effect is very illusive, but I believe that it is measurable<br> &gt; notwithstanding. I tried to demonstrate it psychoacoustically in Appen= dix<br> &gt; E of [4]. While I don&#39;t know how it relates to binaural processing= <br> &gt; directly, there may be instantaneous effects that may be detectable th= ere<br> &gt; too, given that the input to both processing types is the same.<br> &gt;<br> &gt; All the best,<br> &gt; Adam.<br> &gt;<br> ...<br> <br> <br> -- <br> Prof Leslie Smith (Emeritus)<br> Computing Science &amp; Mathematics,<br> University of Stirling, Stirling FK9 4LA<br> Scotland, UK<br> Tel +44 1786 467435<br> Web: <a href=3D"http://www.cs.stir.ac.uk/~lss" rel=3D"noreferrer" target=3D= "_blank">http://www.cs.stir.ac.uk/~lss</a><br> Blog: <a href=3D"http://lestheprof.com" rel=3D"noreferrer" target=3D"_blank= ">http://lestheprof.com</a><br> </blockquote></div> --00000000000003ecca05e655458c--


This message came from the mail archive
src/postings/2022/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University