Re: AUDITORY Digest - 13 Mar 2009 to 14 Mar 2009 (#2009-60) (Matthew Reynolds )


Subject: Re: AUDITORY Digest - 13 Mar 2009 to 14 Mar 2009 (#2009-60)
From:    Matthew Reynolds  <matthew.reynolds@xxxxxxxx>
Date:    Mon, 16 Mar 2009 13:08:08 +0000
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

--MCBoundary=_109031613081100801 Content-Type: text/plain; charset=WINDOWS-1252 Content-Transfer-Encoding: quoted-printable Hi Nick, There are a few common ways of presenting audio data in a printed form each with their advantages and disadvantages depending on what you want the data to convey. I hope I'm not being patronising in describing the three most common by far.=20 The most simplest is merely providing a time-history, i.e. A picture of the waveform as you would see in a wave editing program such as audacity or Adobe. This is a graph with time as the x-axis and wave amplitude as the y-axis. This is useful is you want to see unusual physical features of the sound, such as clicks and pops, or to visualise tonal elements or periodic modulations. While this is the simplest it is often not used, as waveforms of recorded sounds often look like random garbage to the naked eye. Another common tool is to take a fourier transform or calculate the power spectrum of a signal. These both provide a way of looking at sound in the frequency domain. So now the X-axis becomes frequency and the y-axis is amplitude or power. So if a sound has a loud 1kHz tone in it, the graph wil= l have a large peak at 1000Hz. This is an extremely powerful way of analysing signals, but does not provide any indication of how the signal changes over time, but of a total of the whole signal (or a snapshot of the part of the signal where the fourier transform or power spectrum is applied). You must also be careful of drawing false conclusions from results as processing the data in this way can lead to information being lost or clouded due to the windowing that is required, or spurious peaks can sometimes be seen that don't necessarily represent a useful part of your signal. The third tool is called a spectrogram, which is essentially a combination of the two tools described above. It is a three dimensional graph (x-axis, y-axis, colour) where the x-axis is normally time, the y-axis frequency and colour represents the amplitude/power of the signal. Think of it as many fourier transforms taken over a period of time, and then stacked up next to each other. So you can see the content of the signal as a function of frequency but can also watch how this changes over time. The disadvantage o= f this is that with so much information represented on a single graph, any small changes, or fine information will be hard to detect. I'm sure wikipedia or google will be full of information about these techniques (along with some complicated looking maths for the uninitiated). I hope this has helped, and good luck. Matt Reynolds On 15/03/2009 04:18, "AUDITORY automatic digest system" <LISTSERV@xxxxxxxx> wrote: > There is 1 message totalling 39 lines in this issue. >=20 > Topics of the day: >=20 > 1. Representing Sound in non-audio formats, specifically Journal Artic= les >=20 > ---------------------------------------------------------------------- >=20 > Date: Sat, 14 Mar 2009 14:16:00 +0000 > From: Nick Bearman <nick.bearman@xxxxxxxx> > Subject: Representing Sound in non-audio formats, specifically Journal > Articles >=20 > I'm in the process of writing a journal article about the research I > have done on using sound to represent spatial data. As I'm sure you are > all aware, there are many issues when trying to demonstrate research > involving sound in a written journal paper, particularly when the > journal is not used to receiving these types of submissions. >=20 > Does anyone know of a good text exploring the issues of writing an > article trying to represent sounds in a very non-sonic way? >=20 > Obviously there are ways to make the sound available via the Internet > such as sound clips or examples of the program in question. However I'm > specifically interested in ways sound can be represented in a written or > visual medium for the article itself. >=20 > Thanks, > Nick. >=20 > P.S. I've specifically been ambiguous about the type of sound to be > represented, as even though a particular method may not work for my > data, it might be useful for someone else or prompt ideas for my data in > a very different way! >=20 > -- > Nick Bearman > PhD Student > School of Environmental Science > University of East Anglia > Norwich, NR4 7TJ > E-mail: n.bearman@xxxxxxxx >=20 > ------------------------------ >=20 > End of AUDITORY Digest - 13 Mar 2009 to 14 Mar 2009 (#2009-60) > ************************************************************** >=20 The information in this email is confidential. It is intended solely for the addressee. Access to this email by anyone else is unauthorised. If you are not the intended recipient, any disclosure, copying, or distribution is prohibited and may be unlawful. If you have received this email in error please delete it immediately and contact commercial@xxxxxxxx Sepura plc. Registered Office: Radio House, St Andrew=92s Road, Cambridge, = CB4 1GR, England. Registered in England and Wales. Registration Number 4353= 801 =20 --MCBoundary=_109031613081100801 Content-Type: text/html; charset=WINDOWS-1252 Content-Transfer-Encoding: quoted-printable <HTML><BODY> <BR> <BR> Hi Nick,<BR> <BR> There are a few common ways of presenting audio data in a printed form each= <BR> with their advantages and disadvantages depending on what you want the data= <BR> to convey. I hope I'm not being patronising in describing the three most<BR= > common by far. <BR> <BR> The most simplest is merely providing a time-history, i.e. A picture of the= <BR> waveform as you would see in a wave editing program such as audacity or<BR> Adobe. This is a graph with time as the x-axis and wave amplitude as the<BR= > y-axis. This is useful is you want to see unusual physical features of the<= BR> sound, such as clicks and pops, or to visualise tonal elements or periodic<= BR> modulations. While this is the simplest it is often not used, as waveforms<= BR> of recorded sounds often look like random garbage to the naked eye.<BR> <BR> Another common tool is to take a fourier transform or calculate the power<B= R> spectrum of a signal. These both provide a way of looking at sound in the<B= R> frequency domain. So now the X-axis becomes frequency and the y-axis is<BR> amplitude or power. So if a sound has a loud 1kHz tone in it, the graph wil= l<BR> have a large peak at 1000Hz. This is an extremely powerful way of analysing= <BR> signals, but does not provide any indication of how the signal changes over= <BR> time, but of a total of the whole signal (or a snapshot of the part of the<= BR> signal where the fourier transform or power spectrum is applied). You must<= BR> also be careful of drawing false conclusions from results as processing the= <BR> data in this way can lead to information being lost or clouded due to the<B= R> windowing that is required, or spurious peaks can sometimes be seen that<BR= > don't necessarily represent a useful part of your signal.<BR> <BR> The third tool is called a spectrogram, which is essentially a combination<= BR> of the two tools described above. It is a three dimensional graph (x-axis,<= BR> y-axis, colour) where the x-axis is normally time, the y-axis frequency and= <BR> colour represents the amplitude/power of the signal. Think of it as many<BR= > fourier transforms taken over a period of time, and then stacked up next to= <BR> each other. So you can see the content of the signal as a function of<BR> frequency but can also watch how this changes over time. The disadvantage o= f<BR> this is that with so much information represented on a single graph, any<BR= > small changes, or fine information will be hard to detect.<BR> <BR> I'm sure wikipedia or google will be full of information about these<BR> techniques (along with some complicated looking maths for the uninitiated).= <BR> I hope this has helped, and good luck.<BR> <BR> Matt Reynolds<BR> <BR> <BR> <BR> <BR> On 15/03/2009 04:18, &quot;AUDITORY automatic digest system&quot;<BR> &lt;<a href=3D"mailto:LISTSERV@xxxxxxxx">LISTSERV@xxxxxxxx</a= >&gt; wrote: <BR> <BR> &gt; There is 1 message totalling 39 lines in this issue.<BR> &gt; <BR> &gt; Topics of the day:<BR> &gt; <BR> &gt; 1. Representing Sound in non-audio formats, specifically Journal Ar= ticles<BR> &gt; <BR> &gt; ----------------------------------------------------------------------= <BR> &gt; <BR> &gt; Date: Sat, 14 Mar 2009 14:16:00 +0000<BR> &gt; From: Nick Bearman &lt;<a href=3D"mailto:nick.bearman@xxxxxxxx">ni= ck.bearman@xxxxxxxx</a>&gt; <BR> &gt; Subject: Representing Sound in non-audio formats, specifically Journa= l<BR> &gt; Articles<BR> &gt; <BR> &gt; I'm in the process of writing a journal article about the research I<B= R> &gt; have done on using sound to represent spatial data. As I'm sure you ar= e<BR> &gt; all aware, there are many issues when trying to demonstrate research<B= R> &gt; involving sound in a written journal paper, particularly when the<BR> &gt; journal is not used to receiving these types of submissions.<BR> &gt; <BR> &gt; Does anyone know of a good text exploring the issues of writing an<BR> &gt; article trying to represent sounds in a very non-sonic way?<BR> &gt; <BR> &gt; Obviously there are ways to make the sound available via the Internet<= BR> &gt; such as sound clips or examples of the program in question. However I'= m<BR> &gt; specifically interested in ways sound can be represented in a written = or<BR> &gt; visual medium for the article itself.<BR> &gt; <BR> &gt; Thanks,<BR> &gt; Nick.<BR> &gt; <BR> &gt; P.S. I've specifically been ambiguous about the type of sound to be<BR= > &gt; represented, as even though a particular method may not work for my<BR= > &gt; data, it might be useful for someone else or prompt ideas for my data = in<BR> &gt; a very different way!<BR> &gt; <BR> &gt; --<BR> &gt; Nick Bearman<BR> &gt; PhD Student<BR> &gt; School of Environmental Science<BR> &gt; University of East Anglia<BR> &gt; Norwich, NR4 7TJ<BR> &gt; E-mail: <a href=3D"mailto:n.bearman@xxxxxxxx">n.bearman@xxxxxxxx</a>= <BR> &gt; <BR> &gt; ------------------------------<BR> &gt; <BR> &gt; End of AUDITORY Digest - 13 Mar 2009 to 14 Mar 2009 (#2009-60)<BR> &gt; **************************************************************<BR> &gt; <BR> <BR> <BR> <span style=3D"font-family:Times New Roman; Font-size:12.0pt"> =20 <hr width=3D"100%"> The information in this email is confidential. It is intended<BR> solely for the addressee. Access to this email by anyone else<BR> is unauthorised. If you are not the intended recipient, any<BR> disclosure, copying, or distribution is prohibited and may be<BR> unlawful. If you have received this email in error please delete<BR> it immediately and contact commercial@xxxxxxxx<BR><BR> Sepura plc. Registered Office: Radio House, St Andrew=92s Road, Cambridge, = CB4 1GR, England. Registered in England and Wales. Registration Number 4353= 801 <hr width=3D"100%"> =20 <BR> This email message has been scanned for viruses by Mimecast.<BR> Mimecast delivers a complete managed email solution from a single we= b based platform.<BR> =20 </span> </BODY></HTML> --MCBoundary=_109031613081100801--


This message came from the mail archive
http://www.auditory.org/postings/2009/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University