PhD position in diagnostic method for voice quality at Orange Labs in Lannion (France) (adrien leman )


Subject: PhD position in diagnostic method for voice quality at Orange Labs in Lannion (France)
From:    adrien leman  <leman.adrien@xxxxxxxx>
Date:    Tue, 8 Jun 2010 12:02:19 +0200
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

--0015174733ce8323f2048881e2bb Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline *Campaign 2010 * *Description of the PHD* Orange Labs Supervisor: Vincent Barriac *Supervisor email: * Vincent.barriac@xxxxxxxx Location: Lannion (France) PHD title: Development of a technical diagnostic method for voice quality impairments perceived in telephone communications, based on an analysis of speech signal. Global context and state of the art The assessment of perceived quality of voice communications can be performed thanks to two rather distinct families of tools: - Signal analysis techniques, from simple measures like signal level or spectrum until complex =93psycho-acoustical=94 models combining si= gnal analysis and modelling of human perception and judgment (like PESQ, ITU-T P.862). - Parametric techniques based on an interpretation of technical factors not linked to the signal itself, but rather to the way it has been processed and transported inside the network. Inside these two families, methods have been recently developed, which are particularly accurate, allowing pertinent prediction and estimation of perceived voice quality. More recently, new approaches have been developed, called =93hybrid=94 beca= use they combine measurement on signal and parametric indications, in particular in the context of voice over IP. The complementarities of signal-based and parametric families of methods make it possible (in theory) to envisage a combination of their respective advantages: accuracy for signal based techniques, and capacity of parametric tools to be implemented without constraint on CPU or on signal decoding. Furthermore, parametric methods bring elements of understanding about the technical underlying causes (e.g. packet losses may explain cuts in the signal). But all these methods have a common drawback: they do not allow a link between the perceived impairments and their origins. Some academic studies can be quoted on this, but without real result until now. From an operational point of view, this is however the real goal of any assessment technique to find the causes for issues and propose fixes. PHD objectives / Expected results / Scientific challenges / Key Issues The basic idea behind this new study is that it is now realistic to envisage providing operational supervision teams with powerful diagnostic tools able to give them an expert view of the perceived voice quality impairments on telephone communications and to troubleshoot these impairments deep in detail. The objective of this study is therefore the development of such a tool, combining analysis of the audio signal and interpretation of parametric data. This study will specifically focus on VoIP services and architectures. These are based on IMS solutions (SIP protocol) provided by a few technology vendors to France Telecom/Orange. The extrapolation of the results of this study to general (and even standardisable) rules of diagnostic (in general highly dependent on specific characteristics of services and networks) is not easy to foresee, and therefore we won=92t try to work on that direction. This work will be undertaken in close cooperation with (in a first time) our searchers specialised in the development of algorithms and models for voice signal processing (voice quality measurement, speech coding, voice enhancements) and (afterwards) with operational teams having the knowledge of network equipments characteristics and able to provide data on real incidents necessary to set up diagnostic rules. Methodological approach proposed by the supervisor We foresee two steps: - detection in speech signal of perceived and annoying degradations, classified in general categories: o cuts in the signal, loss of information o distortion of the audio signal o different types of noises o signal level modifications o various impairments linked with interaction issues (e.g. echo) - determination of more detailed sub-categories (e.g. for noise: distinction according to spectral content and level), linked with known and identified technical causes The first step is clearly and purely signal processing oriented. We must mention that recent PhD. studies (e.g. M. W=E4ltermann at DT, N. C=F4t=E9 a= nd A. Leman at FT) started this work and already determined degradation categories (for listening-only contexts), as well as first (still perfectible) detection algorithms. The second step is more the focus of the current study. It will combine the existing algorithms (or enhancements of them) with the analysis of IP parametric information (packet loss ratio and its time repartition, network equipments counters or trouble tickets, measurements performed on terminals, etc.) Global schedule The time schedule will follow the distinction in two steps exposed before: - Enhancement of existing algorithms, to allow detection of sub-categories as well as =93recognition=94 of the signature of some signal processing features (noise reduction, coding and transcoding, etc.). This is the hardest and longest part of the study. - Setting up of diagnostic rules to link these new sub-categories to real technical issues, thanks to a combination of measurements on signal and of parametric data. An expert system based on neuronal networks is foreseen, but other approaches can be envisaged as well. Additional contributions None foreseen until now. %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%= %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% *Campagne 2010* *Fiche descriptive de la th=E8se* Encadrant Orange Labs: Vincent Barriac *Adresse =E9lectronique de l'encadrant : * Vincent.barriac@xxxxxxxx Site: Lannion Sujet de la th=E8se (*Titre*): D=E9veloppement d=92une m=E9thode de diagnos= tic technique des d=E9gradations de qualit=E9 vocale per=E7ue des communications t=E9l=E9phoniques =E0 partir d=92une analyse du signal de parole Contexte= global de l'=E9tude et =E9tat de l'art L=92=E9valuation de la qualit=E9 per=E7ue des communications vocales fait a= ppel =E0 deux familles de techniques assez distinctes : - Les techniques d=92analyse du signal, depuis des mesures simples sur le niveau de signal ou son spectre, jusqu=92=E0 des mod=E8les dits psycho-acoustiques, c'est-=E0-dire combinant analyse du signal et mod=E9lis= ation de la perception et du jugement. Le plus connu de ces mod=E8les est PESQ (UIT-T P.862). - Les techniques d=92interpr=E9tation des indicateurs techniques n= on li=E9s au signal lui-m=EAme, mais plut=F4t =E0 la fa=E7on dont il a =E9t=E9= trait=E9 et transport=E9 par le r=E9seau. On parle alors de m=E9thodes param=E9triques. Dans ces deux familles on a abouti =E0 des m=E9thodes particuli=E8rement pr= =E9cises, permettant d=E9sormais de pr=E9dire ou d=92estimer la qualit=E9 per=E7ue de= fa=E7on pertinente. Plus r=E9cemment, on a vu appara=EEtre des approches hybrides, combinant me= sures sur le signal et indications param=E9triques, notamment dans le domaine du transport sur IP. La compl=E9mentarit=E9 des deux approches doit permettre = de combiner les avantages des deux familles : la pr=E9cision des mesures sur le signal, et la capacit=E9 des m=E9thodes param=E9triques =E0 =EAtre utilis= =E9es sans contrainte de CPU ou de d=E9codage du signal. Les m=E9thodes param=E9triques apportent de plus des =E9l=E9ments de compr=E9hension de d=E9fauts techniqu= es (par exemple, une mesure de pertes de paquets pour comprendre des coupures dans le signal). Toutes ces m=E9thodes ont cependant un d=E9faut : elles ne permettent pas de faire le lien entre une d=E9gradation per=E7ue et sa cause. Quelques =E9tud= es ont commenc=E9 =E0 aborder ce lien, mais elles en sont encore =E0 un stade peu = avanc=E9. Or, d=92un point de vue op=E9rationnel, c=92est =E0 cela que doivent avant = tout servir des m=E9thodes et outils de mesure de qualit=E9 : trouver l=92origin= e des d=E9fauts constat=E9s, voire m=EAme proposer des actions correctives. Objectifs de la th=E8se/ R=E9sultats attendus/ D=E9fis scientifiques/techni= ques =E0 relever. L=92id=E9e =E0 la base du lancement de cette =E9tude est qu=92il est possib= le et r=E9aliste d=92envisager de doter les =E9quipes op=E9rationnelles en charge= de superviser les r=E9seaux et services de t=E9l=E9communications d=92outils de diagnostic puissants capables d=92expertiser les d=E9fauts de qualit=E9 per= =E7ue sur des communications t=E9l=E9phoniques et d=92en d=E9duire les causes techniq= ues sous-jacentes (et a fortiori les solutions =E0 apporter). L=92objectif est donc la r=E9alisation d=92un tel outil, combinant analyse = du signal audio et interpr=E9tation de donn=E9es param=E9triques. Cette =E9tude sera restreinte aux architectures des r=E9seaux VoIP de France T=E9l=E9com / Orange, bas=E9es sur IMS (protocole SIP), et aux seuls fourni= sseurs de France T=E9l=E9com / Orange de ces architectures. . L=92extrapolation de= ces travaux vers une g=E9n=E9ralisation des r=E8gles de diagnostic (qui seront fortement d=E9pendantes d'architectures particuli=E8res) ou la normalisatio= n ne nous semble donc pas ais=E9ment envisageable (et pas non plus forc=E9ment souhaitable). Ce travail va =EAtre r=E9alis=E9 en =E9troite collaboration avec (premi=E8r= e partie) les chercheurs en charge du d=E9veloppement d=92algorithmes et mod=E8les de traitement du signal vocal (mesure de qualit=E9 vocale, codage de parole, am=E9lioration du signal), mais aussi (seconde partie) avec les =E9quipes op=E9rationnelles connaissant les =E9quipements du r=E9seau et en mesure de fournir des donn=E9es r=E9elles d=92incidents pour =E9laborer des r=E8gles = de diagnostic. Approche m=E9thodologique propos=E9e par le responsable technique (Pr=E9ciser les comp=E9tences recherch=E9es n=E9cessaires =E0 l'approche) L=92approche que nous privil=E9gions comporte deux =E9tapes : - la d=E9tection dans le signal de parole de d=E9gradations perceptibles et g=EAnantes, parmi des cat=E9gories g=E9n=E9rales : o coupures dans le signal, pertes d=92information o distorsion, d=E9formation du signal o pr=E9sence de bruit de fond o modification du niveau de signal o d=E9fauts li=E9s =E0 des difficult=E9s d=92interaction (=E9cho, no= tamment) - la d=E9termination de sous-cat=E9gories plus pr=E9cises (par exe= mple pour le bruit : distinction du type et de l=92amplitude du bruit), li=E9es = =E0 des causes techniques pr=E9visibles identifi=E9es. La premi=E8re =E9tape est purement ax=E9e sur du traitement de signal. Il f= aut mentionner que des travaux r=E9cents (th=E8ses de M. W=E4ltermann =E0 DT, d= e N. C=F4t=E9 et d=92A. Leman =E0 FT) ont bien d=E9broussaill=E9 le terrain (dans le cont= exte d=92=E9coute, pas en contexte conversationnel, qui reste encore =E0 =E9tudi= er), puisque les principales dimensions sont connues et que des algorithmes de d=E9tection (perfectibles) ont =E9t=E9 d=E9velopp=E9s. - La seconde =E9tape, qui constitue le sujet de cette =E9tude, va combiner ces algorithmes (ou plut=F4t des raffinements de ces algorithmes) = et l=92analyse d=92informations de type param=E9trique IP (taux de pertes de p= aquets et leur r=E9partition dans le temps, =E9v=E9nements sur =E9quipement du r= =E9seau et accessibles via des compteurs ou des CDR, r=E9sultats de mesures effectu=E9= s par les terminaux et renvoy=E9s par eux sur le r=E9seau, etc.). Planning Global du d=E9roulement de la th=E8se (*grandes lignes*) La th=E8se propos=E9e se d=E9compose donc en deux sous-parties bien distin= ctes : - L=92am=E9lioration des algorithmes existants, pour leur permettr= e de d=E9tecter des sous-cat=E9gories ainsi que d=92=EAtre capables de =AB recon= na=EEtre =BB la signature de certains traitements (notamment : d=E9bruitage, codage et transcodage). Il s=92agit de la partie la plus ardue et longue de cette =E9= tude. - L=92=E9tablissement de r=E8gles de diagnostic permettant de reli= er ces sous-cat=E9gories =E0 des d=E9fauts r=E9els, gr=E2ce =E0 une combinaison de= mesures sur le signal et d=92informations param=E9triques. Un syst=E8me expert bas=E9 s= ur des r=E9seaux de neurones est envisag=E9, mais d=92autres solutions peuvent =EA= tre imagin=E9es. Contributions secondaires si pr=E9vues (*participation =E0 des projets collaboratifs*) Aucune pour l=92instant ** *Adrien Leman* *Orange Labs - *Doctorant de l'Unit=E9 de R&D Mod=E9lisation et =E9valuation Objective de la qualit=E9 Vocale FT/RD/TECH/OPERA/MOV Bat. LD 230 2 avenue Pierre Marzin 22307 Lannion cedex t=E9l. +33 2 96 05 30 02 adrien.leman@xxxxxxxx --0015174733ce8323f2048881e2bb Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline <meta http-equiv=3D"Content-Type" content=3D"text/html; charset=3Dutf-8"><m= eta name=3D"ProgId" content=3D"Word.Document"><meta name=3D"Generator" cont= ent=3D"Microsoft Word 11"><meta name=3D"Originator" content=3D"Microsoft Wo= rd 11"><!-- <link rel=3D"File-List" href=3D"file:///C:%5CDOCUME%7E1%5Caful8= 265%5CLOCALS%7E1%5CTemp%5Cmsohtml1%5C06%5Cclip_filelist.xml"> --><!-- <link= rel=3D"Edit-Time-Data" href=3D"file:///C:%5CDOCUME%7E1%5Caful8265%5CLOCALS= %7E1%5CTemp%5Cmsohtml1%5C06%5Cclip_editdata.mso"> --><style> <!-- /* Font Definitions */ @xxxxxxxx {font-family:Wingdings; panose-1:5 0 0 0 0 0 0 0 0 0; mso-font-charset:2; mso-generic-font-family:auto; mso-font-pitch:variable; mso-font-signature:0 268435456 0 0 -2147483648 0;} @xxxxxxxx {font-family:SimSun; panose-1:2 1 6 0 3 1 1 1 1 1; mso-font-alt:=E5=AE=8B=E4=BD=93; mso-font-charset:134; mso-generic-font-family:auto; mso-font-pitch:variable; mso-font-signature:3 135135232 16 0 262145 0;} @xxxxxxxx {font-family:"\@xxxxxxxx"; panose-1:2 1 6 0 3 1 1 1 1 1; mso-font-charset:134; mso-generic-font-family:auto; mso-font-pitch:variable; mso-font-signature:3 135135232 16 0 262145 0;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {mso-style-parent:""; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; mso-bidi-font-size:12.0pt; font-family:Arial; mso-fareast-font-family:"Times New Roman"; mso-bidi-font-family:"Times New Roman"; mso-fareast-language:EN-US;} h1 {mso-style-next:Normal; margin:0cm; margin-bottom:.0001pt; text-align:center; mso-line-height-alt:12.0pt; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:1; font-size:14.0pt; mso-bidi-font-size:12.0pt; font-family:Arial; mso-fareast-font-family:"Times New Roman"; mso-font-kerning:0pt; mso-fareast-language:EN-US; mso-bidi-font-weight:normal;} p.MsoHeader, li.MsoHeader, div.MsoHeader {margin:0cm; margin-bottom:.0001pt; text-align:justify; text-justify:inter-ideograph; line-height:14.0pt; mso-pagination:widow-orphan; tab-stops:center 8.0cm right 16.0cm; font-size:10.0pt; font-family:Arial; mso-fareast-font-family:"Times New Roman"; mso-bidi-font-family:"Times New Roman"; mso-fareast-language:EN-US;} p.MsoFooter, li.MsoFooter, div.MsoFooter {margin:0cm; margin-bottom:.0001pt; text-align:justify; text-justify:inter-ideograph; line-height:14.0pt; mso-pagination:widow-orphan; tab-stops:center 8.0cm right 16.0cm; font-size:10.0pt; font-family:Arial; mso-fareast-font-family:"Times New Roman"; mso-bidi-font-family:"Times New Roman"; mso-fareast-language:EN-US;} @xxxxxxxx Section1 {size:595.3pt 841.9pt; margin:2.0cm 2.0cm 2.0cm 2.0cm; mso-header-margin:1.0cm; mso-footer-margin:1.0cm; mso-forms-protection:no; mso-paper-source:0;} div.Section1 {page:Section1;} /* List Definitions */ @xxxxxxxx l0 {mso-list-id:1813331279; mso-list-type:hybrid; mso-list-template-ids:1251393632 -897570060 67895299 67895301 67895297 678= 95299 67895301 67895297 67895299 67895301;} @xxxxxxxx l0:level1 {mso-level-start-at:0; mso-level-number-format:bullet; mso-level-text:-; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Arial; mso-fareast-font-family:SimSun;} @xxxxxxxx l0:level2 {mso-level-number-format:bullet; mso-level-text:o; mso-level-tab-stop:72.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:"Courier New";} ol {margin-bottom:0cm;} ul {margin-bottom:0cm;} --> </style> <div style=3D"border: 1pt solid windowtext; padding: 1pt 4pt;"> <p class=3D"MsoNormal" style=3D"text-align: center; border: medium none; pa= dding: 0cm;" align=3D"center"><b style=3D""><span style=3D"font-size: 14pt;= " lang=3D"EN-GB">Campaign 2010 </span></b></p> <p class=3D"MsoNormal" style=3D"text-align: center; border: medium none; pa= dding: 0cm;" align=3D"center"><b style=3D""><span style=3D"font-size: 14pt;= " lang=3D"EN-GB">Description of the PHD</span></b></p> </div> <p class=3D"MsoNormal" style=3D"text-align: justify;"><span style=3D"color:= black;" lang=3D"EN-GB">=C2=A0</span></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;" lang=3D"EN-GB">Orange Labs Supervisor: </span></h1> <p class=3D"MsoNormal"><span style=3D"font-size: 12pt;" lang=3D"EN-GB">Vinc= ent Barriac</span><span style=3D"" lang=3D"EN-GB"></span></p> <p class=3D"MsoNormal"><b style=3D""><span style=3D"font-size: 12pt;" lang= =3D"EN-GB">Supervisor email: </span></b></p> <p class=3D"MsoNormal"><span style=3D"" lang=3D"EN-GB"><a href=3D"mailto:Vi= ncent.barriac@xxxxxxxx">Vincent.barriac@xxxxxxxx</a></s= pan></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"" lang=3D"EN-= GB">Location: </span></h1> <p class=3D"MsoNormal"><span style=3D"" lang=3D"EN-GB">Lannion (France)</sp= an></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"" lang=3D"EN-= GB">PHD title: </span></h1> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; font-family: &quot;Times New Roman&quot;;" lang=3D"EN-GB">Development o= f a technical diagnostic method for voice quality impairments perceived in telephone communications, based on an analysis of speech signal.</span></h1> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 10= pt; color: black; font-weight: normal;" lang=3D"EN-GB">=C2=A0</span></h1> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;" lang=3D"EN-GB">Global context and state of the art</span= ></h1> <p class=3D"MsoNormal"><span style=3D"" lang=3D"EN-GB">=C2=A0</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt;"><span style=3D"" lang=3D"= EN-GB">The assessment of perceived quality of voice communications can be performed thanks to two rather disti= nct families of tools:</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-indent:= -18pt;"><span style=3D"" lang=3D"EN-GB"><span style=3D"">-<span style=3D"f= ont: 7pt &quot;Times New Roman&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0= =C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"" lang=3D"EN-GB">Signal analysis techni= ques, from simple measures like signal level or spectrum until complex =E2=80=9Cp= sycho-acoustical=E2=80=9D models combining signal analysis and modelling of human perception and judg= ment (like PESQ, ITU-T P.862).</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-indent:= -18pt;"><span style=3D"" lang=3D"EN-GB"><span style=3D"">-<span style=3D"f= ont: 7pt &quot;Times New Roman&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0= =C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"" lang=3D"EN-GB">Parametric techniques based on an interpretation of technical factors not linked to the signal itself, but rather to the way it has been processed and transported inside the network.</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt;"><span style=3D"" lang=3D"= EN-GB">Inside these two families, methods have been recently developed, which are particularly accurate, allowing pertinent prediction and estimation of perceived voice quality.</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt;"><span style=3D"" lang=3D"= EN-GB">More recently, new approaches have been developed, called =E2=80=9Chybrid=E2=80=9D because the= y combine measurement on signal and parametric indications, in particular in the cont= ext of voice over IP. The complementarities of signal-based and parametric fami= lies of methods make it possible (in theory) to envisage a combination of their respective advantages: accuracy for signal based techniques, and capacity of parametric tools to be implemented without constraint on CPU or on signal decoding. Furthermore, parametric methods bring elements of understanding a= bout the technical underlying causes (e.g. packet losses may explain cuts in the signal).</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt;"><span style=3D"" lang=3D"= EN-GB">But all these methods have a common drawback: they do not allow a link between the percei= ved impairments and their origins. Some academic studies can be quoted on this,= but without real result until now. From an operational point of view, this is h= owever the real goal of any assessment technique to find the causes for issues and propose fixes.</span></p> <p class=3D"MsoNormal" style=3D""><span style=3D"color: black;" lang=3D"EN-= GB">=C2=A0</span></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;" lang=3D"EN-GB">PHD objectives / Expected results / Scien= tific challenges / Key Issues</span></h1> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"" lang=3D"EN-GB">The basic idea behind this new study is that it is now realistic to envisage providing operational supervision teams with powerful diagnostic tools able to give them an expert view of the perceived voice quality impairments on telephone communications and to troubleshoot t= hese impairments deep in detail.</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"" lang=3D"EN-GB">The objective of this study is therefore the development of such a tool, combining analysis of the audio signal and interpretation of parametric data.</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"" lang=3D"EN-GB">This study will specifically focus on VoIP services and architectures. These are based on IMS solutions (SIP protocol) provided by a few technology vendors to France Telecom/Orange. The extrapolation of the results of this study to general (and even standardisa= ble) rules of diagnostic (in general highly dependent on specific characteristic= s of services and networks) is not easy to foresee, and therefore we won=E2=80= =99t try to work on that direction.</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"" lang=3D"EN-GB">This work will be undertaken in close cooperati= on with (in a first time) our searchers specialised in the development of algorithms and models for voice signal processing (voice quality measuremen= t, speech coding, voice enhancements) and (afterwards) with operational teams having the knowledge of network equipments characteristics and able to prov= ide data on real incidents necessary to set up diagnostic rules.</span></p> <p class=3D"MsoNormal" style=3D""><span style=3D"color: black;" lang=3D"EN-= GB">=C2=A0</span></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;" lang=3D"EN-GB">Methodological approach proposed by the s= upervisor </span></h1> <p class=3D"MsoNormal"><span style=3D"" lang=3D"EN-GB">=C2=A0</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"" lang=3D"EN-GB">We foresee two steps:</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-align: = justify; text-indent: -18pt;"><span style=3D"" lang=3D"EN-GB"><span style= =3D"">-<span style=3D"font: 7pt &quot;Times New Roman&quot;;">=C2=A0=C2=A0= =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"" lang=3D"EN-GB">detection in speech si= gnal of perceived and annoying degradations, classified in general categories:</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 72pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Courier New= &quot;;" lang=3D"EN-GB"><span style=3D"">o<span style=3D"font: 7pt &quot;Ti= mes New Roman&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"" lang=3D"EN-GB">cuts in the signal, lo= ss of information</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 72pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Courier New= &quot;;" lang=3D"EN-GB"><span style=3D"">o<span style=3D"font: 7pt &quot;Ti= mes New Roman&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"" lang=3D"EN-GB">distortion of the audi= o signal</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 72pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Courier New= &quot;;" lang=3D"EN-GB"><span style=3D"">o<span style=3D"font: 7pt &quot;Ti= mes New Roman&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"" lang=3D"EN-GB">different types of noi= ses</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 72pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Courier New= &quot;;" lang=3D"EN-GB"><span style=3D"">o<span style=3D"font: 7pt &quot;Ti= mes New Roman&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"" lang=3D"EN-GB">signal level modificat= ions</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 72pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Courier New= &quot;;" lang=3D"EN-GB"><span style=3D"">o<span style=3D"font: 7pt &quot;Ti= mes New Roman&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"" lang=3D"EN-GB">various impairments li= nked with interaction issues (e.g. echo)</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-align: = justify; text-indent: -18pt;"><span style=3D"" lang=3D"EN-GB"><span style= =3D"">-<span style=3D"font: 7pt &quot;Times New Roman&quot;;">=C2=A0=C2=A0= =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"" lang=3D"EN-GB">determination of more = detailed sub-categories (e.g. for noise: distinction according to spectral content and level), link= ed with known and identified technical causes</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"" lang=3D"EN-GB">The first step is clearly and purely signal processing oriented. We must mention that recent PhD. studies (e.g. M. W=C3=A4ltermann at DT, N. C=C3=B4t=C3=A9 and A. Leman at FT) started this w= ork and already determined degradation categories (for listening-only contexts), as well as first (still perfectible) detection algorithms.</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"" lang=3D"EN-GB">The second step is more the focus of the current study. It will combine the existing algorithms (or enhancements of them) wi= th the analysis of IP parametric information (packet loss ratio and its time repartition, network equipments counters or trouble tickets, measurements performed on terminals, etc.)</span></p> <p class=3D"MsoNormal"><span style=3D"" lang=3D"EN-GB">=C2=A0</span></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;" lang=3D"EN-GB">Global schedule</span></h1> <p class=3D"MsoNormal" style=3D""><span style=3D"color: black;" lang=3D"EN-= GB">=C2=A0</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"" lang=3D"EN-GB">The time schedule will follow the distinction in two steps exposed before:</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-align: = justify; text-indent: -18pt;"><span style=3D"" lang=3D"EN-GB"><span style= =3D"">-<span style=3D"font: 7pt &quot;Times New Roman&quot;;">=C2=A0=C2=A0= =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"" lang=3D"EN-GB">Enhancement of existin= g algorithms, to allow detection of sub-categories as well as =E2=80=9Crecognition=E2=80=9D of the= signature of some signal processing features (noise reduction, coding and transcoding, etc.).= This is the hardest and longest part of the study.</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-align: = justify; text-indent: -18pt;"><span style=3D"" lang=3D"EN-GB"><span style= =3D"">-<span style=3D"font: 7pt &quot;Times New Roman&quot;;">=C2=A0=C2=A0= =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"" lang=3D"EN-GB">Setting up of diagnost= ic rules to link these new sub-categories to real technical issues, thanks to a combination of measurements on signal and of parametric data. An expert system based on neuronal networks is foreseen, but other approaches can be envisaged as wel= l.</span></p> <p class=3D"MsoNormal"><span style=3D"" lang=3D"EN-GB">=C2=A0</span></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;" lang=3D"EN-GB">Additional contributions</span></h1> <p class=3D"MsoNormal" style=3D""><span style=3D"color: black;" lang=3D"EN-= GB">=C2=A0</span></p> <p class=3D"MsoNormal" style=3D""><span style=3D"color: black;" lang=3D"EN-= GB">None foreseen until now.</span></p><p class=3D"MsoNormal" style=3D""><br></p><p class=3D"= MsoNormal" style=3D""><br></p><p class=3D"MsoNormal" style=3D"">%%%%%%%%%%%= %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%= %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%</p> <p class=3D"MsoNormal" style=3D""><br></p><p class=3D"MsoNormal" style=3D""= ><meta http-equiv=3D"Content-Type" content=3D"text/html; charset=3Dutf-8"><= meta name=3D"ProgId" content=3D"Word.Document"><meta name=3D"Generator" con= tent=3D"Microsoft Word 11"><meta name=3D"Originator" content=3D"Microsoft W= ord 11"><!-- <link rel=3D"File-List" href=3D"file:///C:%5CDOCUME%7E1%5Caful= 8265%5CLOCALS%7E1%5CTemp%5Cmsohtml1%5C07%5Cclip_filelist.xml"> --><!-- <lin= k rel=3D"Edit-Time-Data" href=3D"file:///C:%5CDOCUME%7E1%5Caful8265%5CLOCAL= S%7E1%5CTemp%5Cmsohtml1%5C07%5Cclip_editdata.mso"> --><style> <!-- /* Font Definitions */ @xxxxxxxx {font-family:Wingdings; panose-1:5 0 0 0 0 0 0 0 0 0; mso-font-charset:2; mso-generic-font-family:auto; mso-font-pitch:variable; mso-font-signature:0 268435456 0 0 -2147483648 0;} @xxxxxxxx {font-family:SimSun; panose-1:2 1 6 0 3 1 1 1 1 1; mso-font-alt:=E5=AE=8B=E4=BD=93; mso-font-charset:134; mso-generic-font-family:auto; mso-font-pitch:variable; mso-font-signature:3 135135232 16 0 262145 0;} @xxxxxxxx {font-family:"\@xxxxxxxx"; panose-1:2 1 6 0 3 1 1 1 1 1; mso-font-charset:134; mso-generic-font-family:auto; mso-font-pitch:variable; mso-font-signature:3 135135232 16 0 262145 0;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {mso-style-parent:""; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; mso-bidi-font-size:12.0pt; font-family:Arial; mso-fareast-font-family:"Times New Roman"; mso-bidi-font-family:"Times New Roman"; mso-fareast-language:EN-US;} h1 {mso-style-next:Normal; margin:0cm; margin-bottom:.0001pt; text-align:center; mso-line-height-alt:12.0pt; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:1; font-size:14.0pt; mso-bidi-font-size:12.0pt; font-family:Arial; mso-fareast-font-family:"Times New Roman"; mso-font-kerning:0pt; mso-fareast-language:EN-US; mso-bidi-font-weight:normal;} p.MsoHeader, li.MsoHeader, div.MsoHeader {margin:0cm; margin-bottom:.0001pt; text-align:justify; text-justify:inter-ideograph; line-height:14.0pt; mso-pagination:widow-orphan; tab-stops:center 8.0cm right 16.0cm; font-size:10.0pt; font-family:Arial; mso-fareast-font-family:"Times New Roman"; mso-bidi-font-family:"Times New Roman"; mso-fareast-language:EN-US;} p.MsoFooter, li.MsoFooter, div.MsoFooter {margin:0cm; margin-bottom:.0001pt; text-align:justify; text-justify:inter-ideograph; line-height:14.0pt; mso-pagination:widow-orphan; tab-stops:center 8.0cm right 16.0cm; font-size:10.0pt; font-family:Arial; mso-fareast-font-family:"Times New Roman"; mso-bidi-font-family:"Times New Roman"; mso-fareast-language:EN-US;} @xxxxxxxx Section1 {size:595.3pt 841.9pt; margin:2.0cm 2.0cm 2.0cm 2.0cm; mso-header-margin:1.0cm; mso-footer-margin:1.0cm; mso-forms-protection:no; mso-paper-source:0;} div.Section1 {page:Section1;} /* List Definitions */ @xxxxxxxx l0 {mso-list-id:1709986268; mso-list-type:hybrid; mso-list-template-ids:-1262979550 531001108 67895299 67895301 67895297 678= 95299 67895301 67895297 67895299 67895301;} @xxxxxxxx l0:level1 {mso-level-number-format:bullet; mso-level-text:-; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Arial; mso-fareast-font-family:SimSun;} @xxxxxxxx l1 {mso-list-id:1723943832; mso-list-type:hybrid; mso-list-template-ids:-1447373218 1537483794 67895299 67895301 67895297 67= 895299 67895301 67895297 67895299 67895301;} @xxxxxxxx l1:level1 {mso-level-start-at:2; mso-level-number-format:bullet; mso-level-text:-; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:"Times New Roman"; mso-fareast-font-family:SimSun;} @xxxxxxxx l1:level2 {mso-level-number-format:bullet; mso-level-text:o; mso-level-tab-stop:72.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:"Courier New";} @xxxxxxxx l1:level3 {mso-level-tab-stop:108.0pt; mso-level-number-position:left; text-indent:-18.0pt;} @xxxxxxxx l1:level4 {mso-level-tab-stop:144.0pt; mso-level-number-position:left; text-indent:-18.0pt;} @xxxxxxxx l1:level5 {mso-level-tab-stop:180.0pt; mso-level-number-position:left; text-indent:-18.0pt;} @xxxxxxxx l1:level6 {mso-level-tab-stop:216.0pt; mso-level-number-position:left; text-indent:-18.0pt;} @xxxxxxxx l1:level7 {mso-level-tab-stop:252.0pt; mso-level-number-position:left; text-indent:-18.0pt;} @xxxxxxxx l1:level8 {mso-level-tab-stop:288.0pt; mso-level-number-position:left; text-indent:-18.0pt;} @xxxxxxxx l1:level9 {mso-level-tab-stop:324.0pt; mso-level-number-position:left; text-indent:-18.0pt;} ol {margin-bottom:0cm;} ul {margin-bottom:0cm;} --> </style> </p><div style=3D"border: 1pt solid windowtext; padding: 1pt 4pt;"> <p class=3D"MsoNormal" style=3D"text-align: center; border: medium none; pa= dding: 0cm;" align=3D"center"><b style=3D""><span style=3D"font-size: 14pt;= ">Campagne 2010</span></b></p> <p class=3D"MsoNormal" style=3D"text-align: center; border: medium none; pa= dding: 0cm;" align=3D"center"><b style=3D""><span style=3D"font-size: 14pt;= ">Fiche descriptive de la th=C3=A8se</span></b></p> </div> <p class=3D"MsoNormal" style=3D"text-align: justify;"><span style=3D"color:= black;">=C2=A0</span></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;">Encadrant Orange Labs:</span></h1> <p class=3D"MsoNormal"><span style=3D"font-size: 12pt;">Vincent Barriac</sp= an><span style=3D""></span></p> <p class=3D"MsoNormal"><b style=3D""><span style=3D"font-size: 12pt;">Adres= se =C3=A9lectronique de l&#39;encadrant : </span></b></p> <p class=3D"MsoNormal"><span style=3D""><a href=3D"mailto:Vincent.barriac@xxxxxxxx= range-ftgroup.com">Vincent.barriac@xxxxxxxx</a></span></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;">Site:</span><span style=3D"font-size: 10pt; color: black= ; font-weight: normal;"></span></h1> <p class=3D"MsoNormal"><span style=3D"">Lannion</span></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;">Sujet de la th=C3=A8se (<i style=3D"">Titre</i>):</span></h1> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; font-family: &quot;Times New Roman&quot;;">D=C3=A9veloppement d=E2=80= =99une m=C3=A9thode de diagnostic technique des d=C3=A9gradations de qualit=C3=A9 vocale per=C3=A7ue des comm= unications t=C3=A9l=C3=A9phoniques =C3=A0 partir d=E2=80=99une analyse du signal de pa= role</span><span style=3D"font-size: 12pt;"></span></h1> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 10= pt; color: black; font-weight: normal;">=C2=A0</span></h1> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;">Contexte global de l&#39;=C3=A9tude et =C3=A9tat de l&#39;art</span></h1> <p class=3D"MsoNormal" style=3D""><span style=3D"color: black;">=C2=A0</spa= n></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt;"><span style=3D"color: bla= ck;">L=E2=80=99=C3=A9valuation de la qualit=C3=A9 per=C3=A7ue des communications vocales fait appel =C3=A0 deux familles de techniques as= sez distinctes=C2=A0:</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-indent:= -18pt;"><span style=3D"color: black;"><span style=3D"">-<span style=3D"fon= t: 7pt &quot;Times New Roman&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0= =C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"color: black;">Les techniques d=E2=80=99analyse du signal, depuis des mesures simples sur le n= iveau de signal ou son spectre, jusqu=E2=80=99=C3=A0 des mod=C3=A8les dits psycho-ac= oustiques, c&#39;est-=C3=A0-dire combinant analyse du signal et mod=C3=A9lisation de l= a perception et du jugement. Le plus connu de ces mod=C3=A8les est PESQ (UIT-T P.862).</span><= /p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-indent:= -18pt;"><span style=3D"color: black;"><span style=3D"">-<span style=3D"fon= t: 7pt &quot;Times New Roman&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0= =C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"color: black;">Les techniques d=E2=80=99interpr=C3=A9tation des indicateurs techniques non li= =C3=A9s au signal lui-m=C3=AAme, mais plut=C3=B4t =C3=A0 la fa=C3=A7on dont il a =C3=A9t=C3= =A9 trait=C3=A9 et transport=C3=A9 par le r=C3=A9seau. On parle alors de m=C3=A9thodes param=C3=A9triques.</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt;"><span style=3D"color: bla= ck;">Dans ces deux familles on a abouti =C3=A0 des m=C3=A9thodes particuli=C3=A8rement pr=C3=A9cises, permet= tant d=C3=A9sormais de pr=C3=A9dire ou d=E2=80=99estimer la qualit=C3=A9 per=C3=A7ue de fa=C3=A7on= pertinente.</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt;"><span style=3D"color: bla= ck;">Plus r=C3=A9cemment, on a vu appara=C3=AEtre des approches hybrides, combinant mesures sur le signal et indications param=C3=A9triques, notamment dans le domaine du transport sur = IP. La compl=C3=A9mentarit=C3=A9 des deux approches doit permettre de combiner les= avantages des deux familles=C2=A0: la pr=C3=A9cision des mesures sur le signal, et la cap= acit=C3=A9 des m=C3=A9thodes param=C3=A9triques =C3=A0 =C3=AAtre utilis=C3=A9es sans contr= ainte de CPU ou de d=C3=A9codage du signal. Les m=C3=A9thodes param=C3=A9triques apportent de plus des =C3= =A9l=C3=A9ments de compr=C3=A9hension de d=C3=A9fauts techniques (par exemple, une mesure de p= ertes de paquets pour comprendre des coupures dans le signal).</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt;"><span style=3D"color: bla= ck;">Toutes ces m=C3=A9thodes ont cependant un d=C3=A9faut=C2=A0: elles ne permettent pas de faire le lien entre une d= =C3=A9gradation per=C3=A7ue et sa cause. Quelques =C3=A9tudes ont commenc=C3=A9 =C3=A0 abor= der ce lien, mais elles en sont encore =C3=A0 un stade peu avanc=C3=A9. Or, d=E2=80=99un point de v= ue op=C3=A9rationnel, c=E2=80=99est =C3=A0 cela que doivent avant tout servir des m=C3=A9thodes et outils de me= sure de qualit=C3=A9=C2=A0: trouver l=E2=80=99origine des d=C3=A9fauts constat=C3= =A9s, voire m=C3=AAme proposer des actions correctives.</span></p> <p class=3D"MsoNormal" style=3D""><span style=3D"color: black;">=C2=A0</spa= n></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;">Objectifs de la th=C3=A8se/ R=C3=A9sultats attendus/ D=C3=A9fis scientifiques/techniq= ues =C3=A0 relever.</span></h1> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"">L=E2=80=99id=C3=A9e =C3=A0 la base du lancement de cette =C3= =A9tude est qu=E2=80=99il est possible et r=C3=A9aliste d=E2=80=99envisager de doter les =C3=A9quipes op=C3=A9rationn= elles en charge de superviser les r=C3=A9seaux et services de t=C3=A9l=C3=A9communications d= =E2=80=99outils de diagnostic puissants capables d=E2=80=99expertiser les d=C3=A9fauts de qualit=C3=A9 pe= r=C3=A7ue sur des communications t=C3=A9l=C3=A9phoniques et d=E2=80=99en d=C3=A9duire les cau= ses techniques sous-jacentes (et a fortiori les solutions =C3=A0 apporter).</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"">L=E2=80=99objectif est donc la r=C3=A9alisation d=E2=80=99un t= el outil, combinant analyse du signal audio et interpr=C3=A9tation de donn=C3=A9es param=C3=A9triques.</sp= an></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"">Cette =C3=A9tude sera restreinte aux architectures des r=C3=A9= seaux VoIP de France T=C3=A9l=C3=A9com / Orange, bas=C3=A9es sur IMS (protocole SIP), et aux seu= ls fournisseurs de France T=C3=A9l=C3=A9com / Orange de ces architectures. . L=E2=80=99extrapo= lation de ces travaux vers une g=C3=A9n=C3=A9ralisation des r=C3=A8gles de diagnostic (qui seront= fortement d=C3=A9pendantes d&#39;architectures particuli=C3=A8res) ou la normalisatio= n ne nous semble donc pas ais=C3=A9ment envisageable (et pas non plus forc=C3=A9ment souhait= able).</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"">Ce travail va =C3=AAtre r=C3=A9alis=C3=A9 en =C3=A9troite coll= aboration avec (premi=C3=A8re partie) les chercheurs en charge du d=C3=A9veloppement d=E2=80=99algorithmes et mod= =C3=A8les de traitement du signal vocal (mesure de qualit=C3=A9 vocale, codage de parole, am=C3=A9lioration du signal), mais aussi (seconde partie) avec les =C3=A9qu= ipes op=C3=A9rationnelles connaissant les =C3=A9quipements du r=C3=A9seau et en = mesure de fournir des donn=C3=A9es r=C3=A9elles d=E2=80=99incidents pour =C3=A9laborer des r= =C3=A8gles de diagnostic.</span></p> <p class=3D"MsoNormal" style=3D""><span style=3D"">=C2=A0</span></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;">Approche </span><span style=3D"font-size: 12pt;">m=C3=A9thodologique<span style=3D"c= olor: black;"> propos=C3=A9e par le responsable technique</span></span></h1> <p class=3D"MsoNormal" style=3D""><span style=3D"color: black;">(Pr=C3=A9ci= ser les comp=C3=A9tences recherch=C3=A9es n=C3=A9cessaires =C3=A0 l&#39;approche)</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"">L=E2=80=99approche que nous privil=C3=A9gions comporte deux = =C3=A9tapes=C2=A0:</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Times New R= oman&quot;;"><span style=3D"">-<span style=3D"font: 7pt &quot;Times New Rom= an&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"">la d=C3=A9tection dans le signal de p= arole de d=C3=A9gradations perceptibles et g=C3=AAnantes, parmi des cat=C3=A9gories g=C3=A9n=C3=A9rales : </span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 72pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Courier New= &quot;;"><span style=3D"">o<span style=3D"font: 7pt &quot;Times New Roman&q= uot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"">coupures dans le signal, pertes d=E2= =80=99information</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 72pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Courier New= &quot;;"><span style=3D"">o<span style=3D"font: 7pt &quot;Times New Roman&q= uot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"">distorsion, d=C3=A9formation du signa= l</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 72pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Courier New= &quot;;"><span style=3D"">o<span style=3D"font: 7pt &quot;Times New Roman&q= uot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"">pr=C3=A9sence de bruit de fond</span>= </p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 72pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Courier New= &quot;;"><span style=3D"">o<span style=3D"font: 7pt &quot;Times New Roman&q= uot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"">modification du niveau de signal</spa= n></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 72pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Courier New= &quot;;"><span style=3D"">o<span style=3D"font: 7pt &quot;Times New Roman&q= uot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"">d=C3=A9fauts li=C3=A9s =C3=A0 des dif= ficult=C3=A9s d=E2=80=99interaction (=C3=A9cho, notamment)</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Times New R= oman&quot;;"><span style=3D"">-<span style=3D"font: 7pt &quot;Times New Rom= an&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"">la d=C3=A9termination de sous-cat=C3= =A9gories plus pr=C3=A9cises (par exemple pour le bruit=C2=A0: distinction du type et de l=E2=80=99amplitude du bruit), li=C3= =A9es =C3=A0 des causes techniques pr=C3=A9visibles identifi=C3=A9es.</span></p> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"">La premi=C3=A8re =C3=A9tape est purement ax=C3=A9e sur du trai= tement de signal. Il faut mentionner que des travaux r=C3=A9cents (th=C3=A8ses de M. W=C3=A4ltermann = =C3=A0 DT, de N. C=C3=B4t=C3=A9 et d=E2=80=99A. Leman =C3=A0 FT) ont bien d=C3=A9broussaill=C3=A9 le terrain (= dans le contexte d=E2=80=99=C3=A9coute, pas en contexte conversationnel, qui reste encore =C3=A0 =C3=A9tudier), pui= sque les principales dimensions sont connues et que des algorithmes de d=C3=A9tection (perfectibles) ont =C3=A9t=C3=A9 d=C3=A9velopp=C3=A9s.</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Times New R= oman&quot;;"><span style=3D"">-<span style=3D"font: 7pt &quot;Times New Rom= an&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"">La seconde =C3=A9tape, qui constitue = le sujet de cette =C3=A9tude, va combiner ces algorithmes (ou plut=C3=B4t des raffinements de ces algorithmes) et l=E2=80= =99analyse d=E2=80=99informations de type param=C3=A9trique IP (taux de pertes de paqu= ets et leur r=C3=A9partition dans le temps, =C3=A9v=C3=A9nements sur =C3=A9quipement du= r=C3=A9seau et accessibles via des compteurs ou des CDR, r=C3=A9sultats de mesures effectu=C3=A9s par = les terminaux et renvoy=C3=A9s par eux sur le r=C3=A9seau, etc.).</span></p> <p class=3D"MsoNormal"><span style=3D"">=C2=A0</span></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;">Planning Global du d=C3=A9roulement de la th=C3=A8se (<i style=3D"">grandes lignes</i>)</span></h1> <p class=3D"MsoNormal" style=3D"margin-top: 6pt; text-align: justify;"><spa= n style=3D"">La th=C3=A8se propos=C3=A9e<span style=3D"">=C2=A0 </span>se d= =C3=A9compose donc en deux sous-parties bien distinctes=C2=A0:</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Times New R= oman&quot;;"><span style=3D"">-<span style=3D"font: 7pt &quot;Times New Rom= an&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"">L=E2=80=99am=C3=A9lioration des algor= ithmes existants, pour leur permettre de d=C3=A9tecter des sous-cat=C3=A9gories ainsi que d=E2=80=99=C3=AAtre capables de =C2=AB= =C2=A0reconna=C3=AEtre=C2=A0=C2=BB la signature de certains traitements (notamment=C2=A0: d=C3=A9bruitage, codage= et transcodage). Il s=E2=80=99agit de la partie la plus ardue et longue de cet= te =C3=A9tude.</span></p> <p class=3D"MsoNormal" style=3D"margin: 6pt 0cm 0.0001pt 36pt; text-align: = justify; text-indent: -18pt;"><span style=3D"font-family: &quot;Times New R= oman&quot;;"><span style=3D"">-<span style=3D"font: 7pt &quot;Times New Rom= an&quot;;">=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 </span></span></span><span style=3D"">L=E2=80=99=C3=A9tablissement de r=C3= =A8gles de diagnostic permettant de relier ces sous-cat=C3=A9gories =C3=A0 des d=C3=A9fauts r=C3=A9els, gr=C3=A2ce =C3=A0 = une combinaison de mesures sur le signal et d=E2=80=99informations param=C3=A9triques. Un syst=C3=A8me expert= bas=C3=A9 sur des r=C3=A9seaux de neurones est envisag=C3=A9, mais d=E2=80=99autres solutions peuvent =C3= =AAtre imagin=C3=A9es.</span></p> <p class=3D"MsoNormal" style=3D""><span style=3D"">=C2=A0</span></p> <h1 style=3D"text-align: left;" align=3D"left"><span style=3D"font-size: 12= pt; color: black;">Contributions secondaires si pr=C3=A9vues (<i style=3D"">participation =C3=A0 des projets collaboratifs</i>)</span></h1> <p class=3D"MsoNormal"><span style=3D"">=C2=A0</span></p> <p class=3D"MsoNormal"><span style=3D"">Aucune pour l=E2=80=99instant</span= ></p><p class=3D"MsoNormal"><br></p><p class=3D"MsoNormal"><br></p><p></p><= div align=3D"left"> <div align=3D"left"><span style=3D"font-size: 12pt; font-family: Verdana;">= <font><img alt=3D"" src=3D"http://www.francetelecom.com/sirius/logos_mail/o= range_logo.gif" border=3D"0" hspace=3D"0"></font></span></div> <div align=3D"left"><span style=3D"font-size: 12pt; font-family: Verdana;">= <font><font face=3D"Times New Roman"><strong></strong></font></font></span>= <font>=C2=A0</font></div> <div align=3D"left"><span style=3D"font-size: 12pt; font-family: Verdana;">= <font><font face=3D"Times New Roman"><strong>Adrien Leman</strong></font></= font></span></div> <div align=3D"left"> <p style=3D"margin: 0cm 0cm 0pt;"><font><font face=3D"Times New Roman"><str= ong>Orange Labs=20 - </strong>Doctorant de l&#39;Unit=C3=A9 de R&amp;D </font></font></p> <p style=3D"margin: 0cm 0cm 0pt;"><font><font face=3D"Times New Roman">Mod= =C3=A9lisation et=20 =C3=A9valuation Objective de la qualit=C3=A9 Vocale </font></font></p> <p style=3D"margin: 0cm 0cm 0pt;"><font><font face=3D"Times New Roman">FT/R= D/TECH/<span class=3D"786195112-10062009">OPERA</span>/MOV</font></font></p> <p style=3D"margin-top: 0pt; font-size: 10pt; margin-bottom: 0pt; color: rg= b(0, 0, 0); font-family: Arial,Helvetica,sans-serif;" align=3D"left"><span = style=3D"font-size: 12pt; font-family: &#39;Times New Roman&#39;;"><font>Ba= t.=20 LD 230</font></span><span style=3D"font-size: 12pt; font-family: Verdana;">= <font><br></font></span><span style=3D"font-size: 12pt; font-family: &#39;T= imes New Roman&#39;;"><font>2=20 avenue Pierre Marzin</font></span><span style=3D"font-size: 12pt; font-fami= ly: Verdana;"><font><br></font></span><span style=3D"font-size: 12pt; font-= family: &#39;Times New Roman&#39;;"><font>22307=20 Lannion cedex</font></span></p> <p style=3D"margin-top: 0pt; font-size: 10pt; margin-bottom: 0pt; color: rg= b(0, 0, 0); font-family: Arial,Helvetica,sans-serif;" align=3D"left"><span = style=3D"font-size: 12pt; font-family: &#39;Times New Roman&#39;;"></span><= span style=3D"font-size: 12pt; font-family: Verdana;"><font></font></span><= span style=3D"font-size: 12pt; font-family: &#39;Times New Roman&#39;;"><fo= nt>t=C3=A9l.=20 +33 2 96 05 30 02<br></font></span><span style=3D"font-size: 12pt; font-fam= ily: Verdana;"><font></font></span><font><a style=3D"font-size: 10pt; color= : rgb(255, 102, 0); font-family: Arial,Helvetica,sans-serif;" href=3D"mailt= o:adrien.leman@xxxxxxxx">adrien.leman@xxxxxxxx</a></fon= t></p> <p style=3D"margin-top: 10pt; font-size: 10pt; margin-bottom: 10pt; color: = rgb(0, 0, 0); font-family: Arial,Helvetica,sans-serif;" align=3D"left"><fon= t><img alt=3D"" src=3D"http://www.francetelecom.com/sirius/logos_mail/amper= sand.gif" border=3D"0" hspace=3D"0"></font></p> </div></div><p class=3D"MsoNormal" style=3D""></p><p class=3D"MsoNormal"><b= r><span style=3D""></span></p> <p></p><p class=3D"MsoNormal" style=3D""></p> --0015174733ce8323f2048881e2bb--


This message came from the mail archive
/home/empire6/dpwe/public_html/postings/2010/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University