Subject: Re: Talking to Computers From: Prof Roger K Moore <r.k.moore@xxxxxxxx> Date: Mon, 6 Apr 2009 19:49:58 +0100 List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>This is a multi-part message in MIME format. ------=_NextPart_000_004D_01C9B6F0.DE7B7950 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit Dear Paul, There is a huge amount published on this topic - and many good textbooks much more recent than those you cite. A couple of texts that address the points that I think you are interested in are . 1. Nass, C., & Brave, S. (2005). Wired for Speech: How Voice Activates and Advances the Human-computer Relationship. Cambridge, MA: MIT Press. 2. Balentine, B. (2007). It's Better to Be a Good Machine Than a Bad Person: Speech Recognition and Other Exotic User Interfaces at the Twilight of the Jetsonian Age: ICMI Press. To find out more about the basic underlying technologies, I particularly recommend . Holmes, J. N., & Holmes, W. (2002). Speech Synthesis and Recognition: Taylor & Francis. . and to find out about 'expressive' (i.e. emotional) speech, take a look at the HUMAINE website - http://emotion-research.net/ Good luck Roger Moore ________________________________________________________________ Prof ROGER K MOORE BA(Hons) MSc PhD FIOA MIET Chair of Spoken Language Processing Speech and Hearing Research Group (SPandH) Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello, Sheffield, S1 4DP, UK e-mail: r.k.moore@xxxxxxxx web: http://www.dcs.shef.ac.uk/~roger/ tel: +44 (0) 11422 21807 fax: +44 (0) 11422 21810 mobile: +44 (0) 7910 073631 General Chair: INTERSPEECH-2009 http://www.interspeech2009.org/ ________________________________________________________________ _____ From: AUDITORY - Research in Auditory Perception [mailto:AUDITORY@xxxxxxxx On Behalf Of Paul Grant Sent: 06 April 2009 17:59 To: AUDITORY@xxxxxxxx Subject: [AUDITORY] Talking to Computers Dear Auditory list, I am currently writing my dissertation on computer speech, or more specifically human reaction, response and feelings when engaging in conversation with a machine. The study is focusing on systems like "expressive speech synthesis" (the computer analysing the incoming human voice and responding accordingly) and asking whether we are ready and willing to engage with technology like this. I am having trouble finding literature that covers this subject; I have found a lot of theory-based writing, such as Principles of Computer Speech, (Witten, 1982) and Electronic Synthesis of Speech, (Linggard 1985) covering speech synthesis itself but not much on what happens to humans when using it. If anybody can suggest any books, journals or articles that would be a great help. Thank you. Paul Grant _____ Surfing the web just got more rewarding. Download the New Internet Explorer 8 <http://extras.uk.msn.com/internet-explorer-8/?ocid=T010MSN07A0716U> ------=_NextPart_000_004D_01C9B6F0.DE7B7950 Content-Type: text/html; charset="US-ASCII" Content-Transfer-Encoding: quoted-printable <html xmlns:v=3D"urn:schemas-microsoft-com:vml" = xmlns:o=3D"urn:schemas-microsoft-com:office:office" = xmlns:w=3D"urn:schemas-microsoft-com:office:word" = xmlns:st1=3D"urn:schemas-microsoft-com:office:smarttags" = xmlns=3D"http://www.w3.org/TR/REC-html40"> <head> <meta http-equiv=3DContent-Type content=3D"text/html; = charset=3Dus-ascii"> <meta name=3DGenerator content=3D"Microsoft Word 11 (filtered medium)"> <!--[if !mso]> <style> v\:* {behavior:url(#default#VML);} o\:* {behavior:url(#default#VML);} w\:* {behavior:url(#default#VML);} .shape {behavior:url(#default#VML);} </style> <![endif]--><o:SmartTagType namespaceuri=3D"urn:schemas-microsoft-com:office:smarttags" = name=3D"PlaceName"/> <o:SmartTagType = namespaceuri=3D"urn:schemas-microsoft-com:office:smarttags" name=3D"PlaceType"/> <o:SmartTagType = namespaceuri=3D"urn:schemas-microsoft-com:office:smarttags" name=3D"country-region"/> <o:SmartTagType = namespaceuri=3D"urn:schemas-microsoft-com:office:smarttags" name=3D"PostalCode"/> <o:SmartTagType = namespaceuri=3D"urn:schemas-microsoft-com:office:smarttags" name=3D"place"/> <o:SmartTagType = namespaceuri=3D"urn:schemas-microsoft-com:office:smarttags" name=3D"City"/> <o:SmartTagType = namespaceuri=3D"urn:schemas-microsoft-com:office:smarttags" name=3D"State"/> <o:SmartTagType = namespaceuri=3D"urn:schemas-microsoft-com:office:smarttags" name=3D"PersonName"/> <!--[if !mso]> <style> st1\:*{behavior:url(#default#ieooui) } </style> <![endif]--> <style> <!-- /* Font Definitions */ @xxxxxxxx {font-family:Tahoma; panose-1:2 11 6 4 3 5 4 4 2 4;} @xxxxxxxx {font-family:Verdana; panose-1:2 11 6 4 3 5 4 4 2 4;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {margin:0cm; margin-bottom:.0001pt; font-size:12.0pt; font-family:"Times New Roman";} a:link, span.MsoHyperlink {color:blue; text-decoration:underline;} a:visited, span.MsoHyperlinkFollowed {color:blue; text-decoration:underline;} p {mso-margin-top-alt:auto; margin-right:0cm; mso-margin-bottom-alt:auto; margin-left:0cm; font-size:12.0pt; font-family:"Times New Roman";} span.EmailStyle18 {mso-style-type:personal-reply; font-family:Arial; color:navy;} /* Page Definitions */ @xxxxxxxx Section1 {size:21.0cm 842.0pt; margin:70.9pt 53.85pt 89.85pt 53.85pt;} div.Section1 {page:Section1;} /* List Definitions */ @xxxxxxxx l0 {mso-list-id:1592094; mso-list-type:hybrid; mso-list-template-ids:697842240 67698703 67698713 67698715 67698703 = 67698713 67698715 67698703 67698713 67698715;} @xxxxxxxx l0:level1 {mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt;} ol {margin-bottom:0cm;} ul {margin-bottom:0cm;} --> </style> </head> <body lang=3DEN-US link=3Dblue vlink=3Dblue> <div class=3DSection1> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'>Dear = Paul,<o:p></o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'><o:p> </o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'>There is a huge amount published on = this topic – and many good textbooks much more recent than those you = cite. A couple of texts that address the points that I think you are = interested in are …<o:p></o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'><o:p> </o:p></span></font></p> <ol style=3D'margin-top:0cm' start=3D1 type=3D1> <li class=3DMsoNormal style=3D'color:navy;mso-list:l0 level1 = lfo1'><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size:10.0pt;font-family:Arial'>Nass, C., & Brave, S. (2005). Wired for Speech: How Voice Activates = and Advances the Human-computer Relationship. <st1:place = w:st=3D"on"><st1:City w:st=3D"on">Cambridge</st1:City>, <st1:State = w:st=3D"on">MA</st1:State></st1:place>: MIT Press.<o:p></o:p></span></font></li> <li class=3DMsoNormal style=3D'color:navy;mso-list:l0 level1 = lfo1'><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size:10.0pt;font-family:Arial'>Balentine, B. (2007). It's Better to Be a Good Machine Than a Bad Person: = Speech Recognition and Other Exotic User Interfaces at the Twilight of the Jetsonian Age: ICMI Press.<o:p></o:p></span></font></li> </ol> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'><o:p> </o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'>To find out more about the basic underlying technologies, I particularly recommend = …<o:p></o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'>Holmes, J. N., & Holmes, W. = (2002). Speech Synthesis and Recognition: Taylor & = Francis.<o:p></o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'><o:p> </o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'>… and to find out about = ‘expressive’ (i.e. emotional) speech, take a look at the HUMAINE website - <a href=3D"http://emotion-research.net/">http://emotion-research.net/</a><o:= p></o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'><o:p> </o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'><o:p> </o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'>Good = luck<o:p></o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'><o:p> </o:p></span></font></p> <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span = style=3D'font-size: 10.0pt;font-family:Arial;color:navy'>Roger = Moore<o:p></o:p></span></font></p> <div> <p><font size=3D2 color=3Dnavy face=3D"Times New Roman"><span = style=3D'font-size:10.0pt; color:navy'>_____________________________________________________________= ___<br> <br> Prof ROGER K MOORE BA(Hons) MSc PhD FIOA MIET<br> <br> Chair of Spoken Language Processing<br> Speech and Hearing Research Group (SPandH)<br> Department of Computer Science, <st1:place w:st=3D"on"><st1:PlaceType = w:st=3D"on">University</st1:PlaceType> of <st1:PlaceName = w:st=3D"on">Sheffield</st1:PlaceName></st1:place>,<br> Regent Court, 211 Portobello,<br> <st1:place w:st=3D"on"><st1:City w:st=3D"on">Sheffield</st1:City>, = <st1:PostalCode w:st=3D"on">S1 4DP</st1:PostalCode>, <st1:country-region = w:st=3D"on">UK</st1:country-region></st1:place><br> <br> e-mail: <st1:PersonName = w:st=3D"on">r.k.moore@xxxxxxxx</st1:PersonName><br> web: <a = href=3D"http://www.dcs.shef.ac.uk/~roger/">http://www.dcs.shef.ac.uk/~rog= er/</a><br> tel: +44 (0) 11422 21807<br> fax: +44 (0) 11422 21810<br> mobile: +44 (0) 7910 073631<br> <br> General Chair: INTERSPEECH-2009 <a = href=3D"http://www.interspeech2009.org/">http://www.interspeech2009.org/<= /a><br> ________________________________________________________________</span></= font><font color=3Dnavy><span style=3D'color:navy'> </span></font><o:p></o:p></p> </div> <div style=3D'border:none;border-left:solid blue 1.5pt;padding:0cm 0cm = 0cm 4.0pt'> <div> <div class=3DMsoNormal align=3Dcenter style=3D'text-align:center'><font = size=3D3 face=3D"Times New Roman"><span style=3D'font-size:12.0pt'> <hr size=3D3 width=3D"100%" align=3Dcenter tabindex=3D-1> </span></font></div> <p class=3DMsoNormal><b><font size=3D2 face=3DTahoma><span = style=3D'font-size:10.0pt; font-family:Tahoma;font-weight:bold'>From:</span></font></b><font = size=3D2 face=3DTahoma><span style=3D'font-size:10.0pt;font-family:Tahoma'> = AUDITORY - Research in Auditory Perception [mailto:<st1:PersonName = w:st=3D"on">AUDITORY@xxxxxxxx</st1:PersonName>] <b><span style=3D'font-weight:bold'>On Behalf Of </span></b>Paul = Grant<br> <b><span style=3D'font-weight:bold'>Sent:</span></b> 06 April 2009 = 17:59<br> <b><span style=3D'font-weight:bold'>To:</span></b> <st1:PersonName = w:st=3D"on">AUDITORY@xxxxxxxx</st1:PersonName><br> <b><span style=3D'font-weight:bold'>Subject:</span></b> [AUDITORY] = Talking to Computers</span></font><o:p></o:p></p> </div> <p class=3DMsoNormal><font size=3D3 face=3D"Times New Roman"><span = style=3D'font-size: 12.0pt'><o:p> </o:p></span></font></p> <p class=3DMsoNormal style=3D'margin-bottom:12.0pt'><font size=3D2 = face=3DVerdana><span style=3D'font-size:10.0pt;font-family:Verdana'>Dear Auditory list,<br> <br> I am currently writing my dissertation on computer speech, or more = specifically human reaction, response and feelings when engaging in conversation with = a machine. The study is focusing on systems like "expressive speech synthesis" (the computer analysing the incoming human voice and = responding accordingly) and asking whether we are ready and willing to engage with technology like this.<br> <br> I am having trouble finding literature that covers this subject; I have = found a lot of theory-based writing, such as Principles of Computer Speech, = (Witten, 1982) and Electronic Synthesis of Speech, (Linggard 1985) covering = speech synthesis itself but not much on what happens to humans when using = it.<br> <br> If anybody can suggest any books, journals or articles that would be a = great help.<br> <br> Thank you.<br> <br> Paul Grant<br> <br> <o:p></o:p></span></font></p> <div class=3DMsoNormal align=3Dcenter style=3D'text-align:center'><font = size=3D2 face=3DVerdana><span style=3D'font-size:10.0pt;font-family:Verdana'> <hr size=3D3 width=3D"100%" align=3Dcenter> </span></font></div> <p class=3DMsoNormal><font size=3D2 face=3DVerdana><span = style=3D'font-size:10.0pt; font-family:Verdana'>Surfing the web just got more rewarding. <a href=3D"http://extras.uk.msn.com/internet-explorer-8/?ocid=3DT010MSN07A07= 16U" target=3D"_new">Download the New Internet Explorer = 8</a><o:p></o:p></span></font></p> </div> </div> </body> </html> ------=_NextPart_000_004D_01C9B6F0.DE7B7950--