[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Blind Source Separation by Sparse Decomposition

To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: Blind Source Separation by Sparse Decomposition
From: "Richard J. Fabbri" <fabbri@xxxxxxxxxxx>
Date: Tue, 7 Sep 1999 10:58:28 -0400
Comments: To: Al Bregman <bregman@HEBB.PSYCH.MCGILL.CA>, michael@cs.unm.edu
In-reply-to: <Pine.SUN.3.95.990906134309.29516C-100000@hebb.psych.mcgill .ca>
References: <Pine.SGI.4.05.9909061051250.18335-100000@infinity.cs.unm.edu>
Reply-to: "Richard J. Fabbri" <fabbri@xxxxxxxxxxx>
Sender: AUDITORY Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

Al, Michael -

"The physics of the problem" as Michael called it, require, first, a
model through which to pass ("clipped") signals and, later,
inspect the resultant Source data.

However, without first proposing that model, and, in an effort to
promote dialogue, I propose a somewhat more severe form of
clipping, i.e., high gain amplification (a voltage comparator) that
conveniently "clips" @ the usual GND and +VS ends of the supply
voltage.

In this extreme clipping case the only thing remaining of the original
audio are the zero-crossings and, a very high speed (usually nsec)
has been introduced at the zero-crossing transitions.  Yet, upon
audition we perceive intelligible speech under these circumstances.
And, in the spirit of Al's comments, all frequency components other
than those related to zero-crossings have been removed.

Returning to the specific question of Source "segregation", it would
be interesting to experiment with binaural recordings by clipping the
Left/Right audio with voltage comparators before passing the L/R
channels to the usual "stereo" headset for binaural audition.

Again, as per Al's comment, "...the human ability to (imperfectly) deal
with many sources with only 2 ears [is] even more intriguing."

Anyone who tries this proposed, severe (binaural) clipping will be
further intrigued by our 2 ear processing abilities.

Of course, an A/B audio switch should also be employed so as to
allow a ready way to compare the original binaural sound to the
clipped sound.

Given the non-commercial nature of this forum, I would not initially
supply the web addresses of binaural cassettes recordings but, if
asked, would privately supply a website I've found useful.

Regards,
Rich Fabbri
Scarce Ideas
____________________Reply Separator__________________
>Dear Michael,
>
>Thanks for your response about the number of receivers versus the number
>of sources It makes the human ability to (imperfectly) deal with many
>sources with only 2 ears even more intriguing.  Somehow humans are trading
>off perfection in a narrow set of circumstances for flexibility.  I
>suspect _heuristic_ approaches to CASA (computational auditory scene
>analysis) would work more like people do.
>
>Here is why I asked about the clipping problem.  I'm no physicist so I
>can't give you an exact physical formulation of the problem.  However, it
>seems to me that clipping destroys the linear additivity of the frequency
>components in the signal.  Here is a simple example: mix a low amplitude
>high frequency component with a high amplitude, low frequency one.  In the
>waveform, the high frequency seems to be riding on top of the low
>frequency at all points in the signal.  Now clip the signal.  Now the high
>frequency signal is missing in the segments that exceed the clipping
>threshold.  It could have changed in frequency (and then back again) for
>all we know.
>
>I wanted to know whether, by destroying the additivity of the signals,
>clipping ruled out any mathematical methods for separation that are based
>on this additivity.  I'm also not sure what echos and reverberation would
>do to such mathematical methods.
>
>- Al
>-----------------------------------------------
>On Mon, 6 Sep 1999, Zibulevsky Michael wrote:
>
>> Al,
>>
>> You wrote:
>> > - How many receivers of the signal would it require to segregate 4
>> > sources?  Is there any fixed relation between the number of receivers
>> > required and the number of underlying signals in the mixture?
>>
>> in general you need less sensors, than sources (say, 2 or 3 sensors for
>> 4 sources), but it leads you to the  computationaly difficult and not
>> very stable procedure). So, if you have a choice, it would be better to
>> have
>> the  number of sensors at least the same as the number of sources.
>>
>> You wrote:
>> > - What would happen if the signal were clipped at some arbitrary
>> > amplitude?
>>
>> It's an interesting question. I never heard, that somebody was solving
>> such a problem, but there might be in principle few possibilities to
>> treat it. Could you say a bit more about the physics of this problem?
>>
>>   --Michael
>>
>>
>> ------------------------------------------------------------------------
>> | Michael Zibulevsky, Ph.D.             Email: michael@cs.unm.edu      |
>> | Brain Computation Laboratory          Phone: 505/265-6448 (home)     |
>> | Computer Science Dept., FEC 313              505/265-5313 (home)     |
>> | University of New Mexico                     505/277-9426 (work)     |
>> | Albuquerque, NM  87131  USA           FAX:   505/277-6927 (shared)   |
>> |                 http://iew3.technion.ac.il:8080/~mcib/               |
>> ------------------------------------------------------------------------

Prev by Date: Re: Blind Source Separation by Sparse Decomposition
Next by Date: Research Opportunity in Audio Classification
Previous by thread: Re: Blind Source Separation by Sparse Decomposition
Next by thread: Re: Blind Source Separation by Sparse Decomposition
Index(es):
- Date
- Thread