[AUDITORY] Related Work on Sound Polyphony Estimation (Abeßer, Jakob)


Subject: [AUDITORY] Related Work on Sound Polyphony Estimation
From:    Abeßer, Jakob
Date:    Tue, 13 Sep 2022 12:16:56 +0000

Dear list,

we're currently conducting a literature review on methods for the task of sound polyphony estimation, i.e., counting the number of sound events in a short audio segment.
While there exist several works on related "counting" tasks such as speaker counting, music instrument counting, ensemble size estimation, and note polyphony estimation, we were not able to find many references with a focus on everyday / environmental sounds.
If you are aware of relevant literature, I would very much appreciate any pointers.

Thank you & best regards
Jakob Abeßer

--
Dr.-Ing. Jakob Abeßer
Senior Scientist
Semantic Music Technologies

Fraunhofer Institute for Digital Media Technology IDMT
Ehrenbergstr. 31
98693 Ilmenau, Germany

Phone +49 3677 467-288
Fax +49 3677 467-467

Email: jakob.abesser@xxxxxxxx
http://www.idmt.fraunhofer.de


This message came from the mail archive
src/postings/2022/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University