Re: [AUDITORY] Feedback on features for music similarity
- To: AUDITORY@xxxxxxxxxxxxxxx
- Subject: Re: [AUDITORY] Feedback on features for music similarity
- From: Paul Arzelier <paul.arzelier@xxxxxxx>
- Date: Wed, 31 Jul 2024 20:19:33 +0200
Hi!
Thanks for your answer, and I appreciate the kind words! You definitely
make some good points:
1. I did NOT think about windowing the songs into "meaningful" moments,
but the beginning and the end make a lot of sense since we want to make
the transitions smooth. I'll explore and see what that yields.
3. I'm also thinking of letting users choose for themselves, and seeing
what they like more.
5. It's super cool, I didn't know they exposed that! I'll definitely use
it to check that it matches at least e.g. the key the song is in.
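A minimal sketch of that key check, assuming the audio-features object has already been fetched and parsed into a dict. The `key` field (pitch class 0 to 11, with -1 meaning no key detected) and the `mode` field (1 for major, 0 for minor) follow Spotify's documented response; the function names and the "A minor" string format for the locally estimated key are hypothetical choices made for illustration.

```python
# Pitch-class names in the order used by standard pitch-class notation (0 = C).
PITCH_CLASSES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def spotify_key_name(features):
    """Turn the `key`/`mode` fields of an audio-features dict into e.g. 'A minor'.

    Returns None when no key was detected or a field is missing or invalid.
    """
    key = features.get("key")
    mode = features.get("mode")
    if not isinstance(key, int) or not 0 <= key <= 11:
        return None
    if mode not in (0, 1):
        return None
    return f"{PITCH_CLASSES[key]} {'major' if mode == 1 else 'minor'}"


def keys_agree(local_key, features):
    """Compare a locally estimated key (hypothetical format: 'A minor')
    with the reference key derived from the audio-features dict."""
    reference = spotify_key_name(features)
    return reference is not None and local_key.strip().lower() == reference.lower()
```

For example, `keys_agree("A minor", {"key": 9, "mode": 0})` holds, while a track for which no key was detected never counts as a match.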
@Pierre - do you have a hunch about what they might say? One problem is
that musicians are not really concerned with the harsh reality of
digital signal processing :D
Best Regards,
Paul
On 30/07/2024 at 17:54, Hem, Charles wrote:
Hi Paul,
Very cool project! I'm a PhD student at Harvard studying cochlear
implants on the human perception side of things, so I'm no audio
specialist myself. Trying to study music perception for cochlear
implant recipients brings me back to featurization of music
frequently, though, so I have a couple of non-expert thoughts for bullets
1, 3, and 5:
1: Maybe it's adding too much complexity, but I wonder if
splitting your feature analysis into more windows could be helpful if
you're most concerned about seamless transitions between songs. For
example, "start of song" (first 20 seconds, or something near that),
"whole song", and "end of song" (last 20 seconds) could be used to
give some extra weight to making sure that the "end of song" from the
first song matches the "start of song" of the following song. This would
mainly help for songs that are highly variable or have a drastically
different intro.
3: I haven't read the chroma paper you cited, but my gut reaction is
that these features shouldn't outweigh tempo and loudness so much. I
think your four-class division could be a good idea.
5: Spotify song features are pretty easy to get using this:
https://developer.spotify.com/documentation/web-api/reference/get-audio-features
The features are much more abstract than your features, but in my
experience, Spotify's smart playlists do a pretty good job. I'm
guessing these features are partially used in that, so they could be a
nice reference to compare your results to.
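The windowing idea from point 1 could be sketched roughly as follows. This is only an illustration under stated assumptions: each song is represented as a list of per-frame feature vectors at a known frame rate, each window is summarized by a plain mean, and windows are compared with Euclidean distance. The function names, the 20-second default, and the `edge_weight` parameter are all hypothetical and not taken from the project.

```python
import math


def window_means(frames, frame_rate, window_seconds=20.0):
    """Summarize a song as mean feature vectors for its start, whole, and end.

    frames: list of per-frame feature vectors (lists of floats).
    frame_rate: number of frames per second (assumed constant).
    """
    if not frames:
        raise ValueError("frames must not be empty")
    # Number of frames in one window, clamped to the song length.
    n = max(1, min(len(frames), int(window_seconds * frame_rate)))

    def mean(rows):
        return [sum(column) / len(rows) for column in zip(*rows)]

    return {
        "start": mean(frames[:n]),
        "whole": mean(frames),
        "end": mean(frames[-n:]),
    }


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def transition_distance(previous, following, edge_weight=0.7):
    """Distance for playing `following` right after `previous`.

    Mixes the end-to-start distance with the whole-song distance, so the
    seam between the two songs gets extra weight (edge_weight is arbitrary).
    """
    edge = euclidean(previous["end"], following["start"])
    overall = euclidean(previous["whole"], following["whole"])
    return edge_weight * edge + (1.0 - edge_weight) * overall
```

Note that this distance is deliberately asymmetric: a song that ends the way the next one begins scores well in that order, but not necessarily in the reverse order.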
Most of this probably isn't super helpful, but I'll keep an eye out
for any updates about the project! It could be a cool research tool
to quantify people's music preferences one day, which would be an
interesting use for me.
Best,
Charles Hem