I need help in my masters thesis -- please read (gehan kamel )


Subject: I need help in my masters thesis -- please read
From:    gehan kamel  <gehan84@xxxxxxxx>
Date:    Wed, 5 Apr 2006 04:53:00 -0700

--0-291740417-1144237980=:46733 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable X-MIME-Autoconverted: from 8bit to quoted-printable by drizzle.cc.mcgill.ca id k35BrEIN004804 =20 Dear List, My Name is Gehan Mustafa, I am a masters student and a teacher assistan= t in Cairo University (Egypt), Faculty of Computers and Information, IT D= epartment. My Masters Thesis is a about Cocktail Party Speech Segregation= ( CASA and BSS models). If you don=92t mind I have some questions concern= ing the topic: 1. The TDOA (Time Delay Of Arrival) of each time/ frequency region= : how is it calculated ? What is TDOA, i.e. what does it represent when = calculated for each time/ frequency region? 2. How is the local relative level or SNR calculated a priori for = each time frequency regions? 3. If we use in CASA models the relationship between TDOA and the = local relative level (RL, which is known a priori), this means we need to= have the 2 sources unmixed to calculate the local relative level. Does t= his mean that such an algorithm is non-realistic to be tried out since in= real world applications, no one has a priori the unmixed sources? 4. In my masters thesis, is it more realistic to work on the ST-Nu= mbers Database or should I use a database which takes into account the ec= hoing and reverberation effects, for the sake of being more realistic and= more applicable to real world situations which normally does contain ech= o and reverberation. 5. Do you advise me to work with CASA models or with BSS (Blind So= urce Separation) models? Which is more realistic for real-world applicati= ons? I think that BSS models are more realistic and more appropriate for = me to use in my masters thesis. Do you agree with me or do you have anoth= er opinion? =20 I would really appreciate it if you could help me out in the formerly s= tated points. Thank you very much. Yours, Gehan Mustafa =20 =09 --------------------------------- Yahoo! Messenger with Voice. Make PC-to-Phone Calls to the US (and 30+ co= untries) for 2=A2/min or less. --0-291740417-1144237980=:46733 Content-Type: text/html; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable X-MIME-Autoconverted: from 8bit to quoted-printable by drizzle.cc.mcgill.ca id k35BrEIN004804 <div class=3DMsoNormal style=3D"MARGIN: 0cm 0cm 0pt"><FONT face=3D"Times = New Roman" size=3D3></FONT>&nbsp;</div> <div class=3DMsoNormal style=3D"= MARGIN: 0cm 0cm 0pt"><FONT face=3D"Times New Roman" size=3D3>Dear List,</= FONT></div> <div class=3DMsoNormal style=3D"MARGIN: 0cm 0cm 0pt"><FONT f= ace=3D"Times New Roman" size=3D3>My Name is Gehan Mustafa, I am a masters= student and a teacher assistant in <?xml:namespace prefix =3D st1 ns =3D= "urn:schemas-microsoft-com:office:smarttags" /><st1:PlaceName w:st=3D"on= ">Cairo</st1:PlaceName> <st1:PlaceType w:st=3D"on">University</st1:PlaceT= ype> (<st1:place w:st=3D"on"><st1:country-region w:st=3D"on">Egypt</st1:c= ountry-region></st1:place>), Faculty of Computers and Information, IT Dep= artment. My Masters Thesis is a about Cocktail Party Speech Segregation( = CASA and BSS models). If you don=92t mind I have some questions concernin= g the topic:</FONT></div> <div class=3DMsoNormal style=3D"MARGIN: 0cm 0c= m 0pt 39pt; TEXT-INDENT: -18pt; mso-list: l0 level1 lfo1; tab-stops: list 39.0pt"><FONT face=3D"Times New Roman"><SPAN style=3D"mso-list: Ignore">= <FONT size=3D3>1.</FONT><SPAN style=3D"FONT: 7pt 'Times New Roman'">&nbsp= ;&nbsp;&nbsp;&nbsp;&nbsp; </SPAN></SPAN><FONT size=3D3>The TDOA (Time Del= ay Of Arrival) of each time/ frequency region : how is it calculated ? Wh= at is TDOA, i.e. what does it represent when calculated for each time/ fr= equency region?</FONT></FONT></div> <div class=3DMsoNormal style=3D"MARG= IN: 0cm 0cm 0pt 39pt; TEXT-INDENT: -18pt; mso-list: l0 level1 lfo1; tab-s= tops: list 39.0pt"><FONT face=3D"Times New Roman"><SPAN style=3D"mso-list= : Ignore"><FONT size=3D3>2.</FONT><SPAN style=3D"FONT: 7pt 'Times New Rom= an'">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </SPAN></SPAN><FONT size=3D3>How is t= he local relative level or SNR calculated a priori for each time frequenc= y regions?</FONT></FONT></div> <div class=3DMsoNormal style=3D"MARGIN: 0= cm 0cm 0pt 39pt; TEXT-INDENT: -18pt; mso-list: l0 level1 lfo1; tab-stops:= list 39.0pt"><FONT face=3D"Times New Roman"><SPAN style=3D"mso-list: Ignore"><FONT size=3D3>3.</FONT><SPAN style=3D"FONT: 7pt 'Times New Roma= n'">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </SPAN></SPAN><FONT size=3D3>If we use= in CASA models the relationship between TDOA and the local relative leve= l (RL, which is known a priori), this means we need to have the 2 sources= unmixed to calculate the local relative level. Does this mean that such = an algorithm is non-realistic to be tried out since in real world applica= tions, no one has a priori the unmixed sources?</FONT></FONT></div> <div= class=3DMsoNormal style=3D"MARGIN: 0cm 0cm 0pt 39pt; TEXT-INDENT: -18pt;= mso-list: l0 level1 lfo1; tab-stops: list 39.0pt"><FONT face=3D"Times Ne= w Roman"><SPAN style=3D"mso-list: Ignore"><FONT size=3D3>4.</FONT><SPAN s= tyle=3D"FONT: 7pt 'Times New Roman'">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </SPA= N></SPAN><FONT size=3D3>In my masters thesis, is it more realistic to wor= k on the ST-Numbers Database or should I use a database which takes into = account the echoing and reverberation effects, for the sake of being more realistic and more applicable to real world situations whi= ch normally does contain echo and reverberation.</FONT></FONT></div> <di= v class=3DMsoNormal style=3D"MARGIN: 0cm 0cm 0pt 39pt; TEXT-INDENT: -18pt= ; mso-list: l0 level1 lfo1; tab-stops: list 39.0pt"><FONT face=3D"Times N= ew Roman"><SPAN style=3D"mso-list: Ignore"><FONT size=3D3>5.</FONT><SPAN = style=3D"FONT: 7pt 'Times New Roman'">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </SP= AN></SPAN><FONT size=3D3>Do you advise me to work with CASA models or wit= h BSS (Blind Source Separation) models? Which is more realistic for real-= world applications? I think that BSS models are more realistic and more a= ppropriate for me to use in my masters thesis. Do you agree with me or do= you have another opinion?</FONT></FONT></div> <div class=3DMsoNormal st= yle=3D"MARGIN: 0cm 0cm 0pt"><?xml:namespace prefix =3D o ns =3D "urn:sche= mas-microsoft-com:office:office" /><o:p><FONT face=3D"Times New Roman" si= ze=3D3>&nbsp;</FONT></o:p></div> <div class=3DMsoNormal style=3D"MARGIN: 0cm 0cm 0pt 39pt; TEXT-INDENT: -18pt; tab-stops: list 3= 9.0pt"><FONT face=3D"Times New Roman" size=3D3>I would really appreciate = it if you could help me out in the formerly stated points. Thank you very= much.</FONT></div> <div class=3DMsoNormal style=3D"MARGIN: 0cm 0cm 0pt = 39pt; TEXT-INDENT: -18pt; tab-stops: list 39.0pt"><FONT face=3D"Times New= Roman" size=3D3>Yours,</FONT></div> <div class=3DMsoNormal style=3D"MAR= GIN: 0cm 0cm 0pt 39pt; TEXT-INDENT: -18pt; tab-stops: list 39.0pt"><FONT = face=3D"Times New Roman" size=3D3>&nbsp;&nbsp;&nbsp; Gehan Mustafa</FONT>= </div> <div class=3DMsoNormal style=3D"MARGIN: 0cm 0cm 0pt"><o:p><FONT f= ace=3D"Times New Roman" size=3D3>&nbsp;</FONT></o:p></div><p> <hr size=3D1>Yahoo! Messenger with Voice. <a href=3D"http://us.rd.yahoo= .com/mail_us/taglines/postman1/*http://us.rd.yahoo.com/evt=3D39663/*http:= //voice.yahoo.com">Make PC-to-Phone Calls</a> to the US (and 30+ countrie= s) for 2=A2/min or less. --0-291740417-1144237980=:46733--


This message came from the mail archive
http://www.auditory.org/postings/2006/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University