Re: 60 Hz hum causing problems with speech recognition (Lloyd Watts )


Subject: Re: 60 Hz hum causing problems with speech recognition
From:    Lloyd Watts  <lwatts@xxxxxxxx>
Date:    Tue, 16 Jun 2009 11:29:47 -0700
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

--_000_A73A509F70C65245B2B010D00825C402070254FA23EXMaudiencelo_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Hello Tarun, I've been using my USBpre since 2001 and have been extremely happy with it. http://www.sounddevices.com/products/usbpremaster.htm Lloyd ________________________________ From: AUDITORY - Research in Auditory Perception [mailto:AUDITORY@xxxxxxxx= ILL.CA] On Behalf Of Tarun Pruthi Sent: Tuesday, June 16, 2009 10:02 AM To: AUDITORY@xxxxxxxx Subject: Re: [AUDITORY] 60 Hz hum causing problems with speech recognition Hi Linda: I am trying to make a speech command recognizer using a standard open-sourc= e speech recognizer on my laptop. Most of the work I do, is either on my of= fice desk or home. I guess the sources of noise would be the usual things: = lights, unclean power, wireless signals etc. Tarun On Tue, Jun 16, 2009 at 11:57 AM, Linda Seltzer <lseltzer@xxxxxxxx= u<mailto:lseltzer@xxxxxxxx>> wrote: What exactly is the experiment you are trying to do or the test that you are conducting and where is the recording being done? > Hi all: > I have been having a lot of problems lately with 60 Hz humming noise in m= y > speech recordings (I will confess that most of my tests have been on > windows/linux laptops - and my guess is that they probably have much wors= e > recording quality than desktops on average - but we live in a mobile worl= d > now and I have to make it work on laptops). Whenever it is there, it > causes > a serious drop in speech recognition accuracy, especially since it comes > in > very randomly at times giving for example clean recordings during speaker > enrollment and noisy recordings in live testing. The problem is even wors= e > on linux which has given me all kinds of artifacts in the recordings > besides > the 60 Hz hum - my guess is that this is most likely because of driver > problems. > > So, I have 3 questions: > 1. What is the source of this 60 Hz noise? Powerline? RF interference? I > have tried disconnecting the power cord - it works sometimes, but not > always. > 2. Is there a standard way to get rid of the 60 Hz hum and ensure clean > recordings for the purposes of speech recognition? The only way I can > think > of is to use notch filters at 60 Hz, and some of the harmonics, say 120 H= z > and 180 Hz, since most of the times it doesn't manifest as a single tone > at > 60 Hz, but has harmonics all over the frequency range. > 3. If this is a well-known problem with sound cards, then why aren't we, > speech recognition community, trying to come up with a mandatory > compliance > standard for sound card manufacterers which ensures that the microphone > input is not corrupted by 60/50 Hz noise? That should provide a big boost > to > speech recognition accuracies, and a wider acceptance of speech > recognition. > > Tarun > Senior Research Engineer > Think A Move, Ltd > --_000_A73A509F70C65245B2B010D00825C402070254FA23EXMaudiencelo_ Content-Type: text/html; charset="us-ascii" Content-Transfer-Encoding: quoted-printable <html xmlns:v=3D"urn:schemas-microsoft-com:vml" xmlns:o=3D"urn:schemas-micr= osoft-com:office:office" xmlns:w=3D"urn:schemas-microsoft-com:office:word" = xmlns=3D"http://www.w3.org/TR/REC-html40"> <head> <META HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; charset=3Dus-ascii"= > <meta name=3DGenerator content=3D"Microsoft Word 11 (filtered medium)"> <!--[if !mso]> <style> v\:* {behavior:url(#default#VML);} o\:* {behavior:url(#default#VML);} w\:* {behavior:url(#default#VML);} .shape {behavior:url(#default#VML);} </style> <![endif]--> <style> <!-- /* Font Definitions */ @xxxxxxxx {font-family:"MS Mincho"; panose-1:2 2 6 9 4 2 5 8 3 4;} @xxxxxxxx {font-family:Tahoma; panose-1:2 11 6 4 3 5 4 4 2 4;} @xxxxxxxx {font-family:"\@xxxxxxxx Mincho"; panose-1:2 2 6 9 4 2 5 8 3 4;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {margin:0in; margin-bottom:.0001pt; font-size:12.0pt; font-family:"Times New Roman";} a:link, span.MsoHyperlink {color:blue; text-decoration:underline;} a:visited, span.MsoHyperlinkFollowed {color:blue; text-decoration:underline;} span.EmailStyle17 {mso-style-type:personal-reply; font-family:Arial; color:navy;} @xxxxxxxx Section1 {size:8.5in 11.0in; margin:1.0in 1.25in 1.0in 1.25in;} div.Section1 {page:Section1;} --> </style> <!--[if gte mso 9]><xml> <o:shapedefaults v:ext=3D"edit" spidmax=3D"1026" /> </xml><![endif]--><!--[if gte mso 9]><xml> <o:shapelayout v:ext=3D"edit"> <o:idmap v:ext=3D"edit" data=3D"1" /> </o:shapelayout></xml><![endif]--> </head> <body lang=3DEN-US link=3Dblue vlink=3Dblue> <div class=3DSection1> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span style= =3D'font-size: 10.0pt;font-family:Arial;color:navy'>Hello Tarun,<o:p></o:p></span></font><= /p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span style= =3D'font-size: 10.0pt;font-family:Arial;color:navy'><o:p>&nbsp;</o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span style= =3D'font-size: 10.0pt;font-family:Arial;color:navy'>I&#8217;ve been using my USBpre since = 2001 and have been extremely happy with it.<o:p></o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span style= =3D'font-size: 10.0pt;font-family:Arial;color:navy'><a href=3D"http://www.sounddevices.com/products/usbpremaster.htm">http://www.s= ounddevices.com/products/usbpremaster.htm</a><o:p></o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span style= =3D'font-size: 10.0pt;font-family:Arial;color:navy'><o:p>&nbsp;</o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span style= =3D'font-size: 10.0pt;font-family:Arial;color:navy'>Lloyd<o:p></o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span style= =3D'font-size: 10.0pt;font-family:Arial;color:navy'><o:p>&nbsp;</o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span style= =3D'font-size: 10.0pt;font-family:Arial;color:navy'><o:p>&nbsp;</o:p></span></font></p> <div> <div class=3DMsoNormal align=3Dcenter style=3D'text-align:center'><font siz= e=3D3 face=3D"Times New Roman"><span style=3D'font-size:12.0pt'> <hr size=3D2 width=3D"100%" align=3Dcenter tabindex=3D-1> </span></font></div> <p class=3DMsoNormal><b><font size=3D2 face=3DTahoma><span style=3D'font-si= ze:10.0pt; font-family:Tahoma;font-weight:bold'>From:</span></font></b><font size=3D2 face=3DTahoma><span style=3D'font-size:10.0pt;font-family:Tahoma'> AUDITORY= - Research in Auditory Perception [mailto:AUDITORY@xxxxxxxx <b><span style=3D'font-weight:bold'>On Behalf Of </span></b>Tarun Pruthi<br> <b><span style=3D'font-weight:bold'>Sent:</span></b> Tuesday, June 16, 2009= 10:02 AM<br> <b><span style=3D'font-weight:bold'>To:</span></b> AUDITORY@xxxxxxxx= <br> <b><span style=3D'font-weight:bold'>Subject:</span></b> Re: [AUDITORY] 60 H= z hum causing problems with speech recognition</span></font><o:p></o:p></p> </div> <p class=3DMsoNormal><font size=3D3 face=3D"Times New Roman"><span style=3D= 'font-size: 12.0pt'><o:p>&nbsp;</o:p></span></font></p> <p class=3DMsoNormal><font size=3D3 face=3D"Times New Roman"><span style=3D= 'font-size: 12.0pt'>Hi Linda:<o:p></o:p></span></font></p> <div> <p class=3DMsoNormal><font size=3D3 face=3D"Times New Roman"><span style=3D= 'font-size: 12.0pt'><o:p>&nbsp;</o:p></span></font></p> </div> <div> <p class=3DMsoNormal><font size=3D3 face=3D"Times New Roman"><span style=3D= 'font-size: 12.0pt'>I am trying to make a speech command recognizer using a standard open-source speech recognizer on my laptop. Most of the work I do, is eithe= r on my office desk or home. I guess the sources of noise would be the usual thi= ngs: lights, unclean power, wireless signals etc.<o:p></o:p></span></font></p> </div> <div> <p class=3DMsoNormal><font size=3D3 face=3D"Times New Roman"><span style=3D= 'font-size: 12.0pt'><o:p>&nbsp;</o:p></span></font></p> </div> <div> <p class=3DMsoNormal style=3D'margin-bottom:12.0pt'><font size=3D3 face=3D"Times New Roman"><span style=3D'font-size:12.0pt'>Tarun<o:p></o:p><= /span></font></p> <div> <p class=3DMsoNormal><font size=3D3 face=3D"Times New Roman"><span style=3D= 'font-size: 12.0pt'>On Tue, Jun 16, 2009 at 11:57 AM, Linda Seltzer &lt;<a href=3D"mailto:lseltzer@xxxxxxxx">lseltzer@xxxxxxxx</a>= &gt; wrote:<o:p></o:p></span></font></p> <p class=3DMsoNormal><font size=3D3 face=3D"Times New Roman"><span style=3D= 'font-size: 12.0pt'>What exactly is the experiment you are trying to do or the test tha= t you<br> are conducting and where is the recording being done?<o:p></o:p></span></fo= nt></p> <div> <div> <p class=3DMsoNormal style=3D'margin-bottom:12.0pt'><font size=3D3 face=3D"Times New Roman"><span style=3D'font-size:12.0pt'>&gt; Hi all:<br> &gt; I have been having a lot of problems lately with 60 Hz humming noise i= n my<br> &gt; speech recordings (I will confess that most of my tests have been on<b= r> &gt; windows/linux laptops - and my guess is that they probably have much w= orse<br> &gt; recording quality than desktops on average - but we live in a mobile w= orld<br> &gt; now and I have to make it work on laptops). Whenever it is there, it<b= r> &gt; causes<br> &gt; a serious drop in speech recognition accuracy, especially since it com= es<br> &gt; in<br> &gt; very randomly at times giving for example clean recordings during spea= ker<br> &gt; enrollment and noisy recordings in live testing. The problem is even w= orse<br> &gt; on linux which has given me all kinds of artifacts in the recordings<b= r> &gt; besides<br> &gt; the 60 Hz hum - my guess is that this is most likely because of driver= <br> &gt; problems.<br> &gt;<br> &gt; So, I have 3 questions:<br> &gt; 1. What is the source of this 60 Hz noise? Powerline? RF interference?= I<br> &gt; have tried disconnecting the power cord - it works sometimes, but not<= br> &gt; always.<br> &gt; 2. Is there a standard way to get rid of the 60 Hz hum and ensure clea= n<br> &gt; recordings for the purposes of speech recognition? The only way I can<= br> &gt; think<br> &gt; of is to use notch filters at 60 Hz, and some of the harmonics, say 12= 0 Hz<br> &gt; and 180 Hz, since most of the times it doesn't manifest as a single to= ne<br> &gt; at<br> &gt; 60 Hz, but has harmonics all over the frequency range.<br> &gt; 3. If this is a well-known problem with sound cards, then why aren't w= e,<br> &gt; speech recognition community, trying to come up with a mandatory<br> &gt; compliance<br> &gt; standard for sound card manufacterers which ensures that the microphon= e<br> &gt; input is not corrupted by 60/50 Hz noise? That should provide a big bo= ost<br> &gt; to<br> &gt; speech recognition accuracies, and a wider acceptance of speech<br> &gt; recognition.<br> &gt;<br> &gt; Tarun<br> &gt; Senior Research Engineer<br> &gt; Think A Move, Ltd<br> &gt;<o:p></o:p></span></font></p> </div> </div> </div> <p class=3DMsoNormal><font size=3D3 face=3D"Times New Roman"><span style=3D= 'font-size: 12.0pt'><o:p>&nbsp;</o:p></span></font></p> </div> </div> </body> </html> --_000_A73A509F70C65245B2B010D00825C402070254FA23EXMaudiencelo_--


This message came from the mail archive
http://www.auditory.org/postings/2009/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University