[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [AUDITORY] Feedback on features for music similarity

To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: [AUDITORY] Feedback on features for music similarity
From: Paul Arzelier <paul.arzelier@xxxxxxx>
Date: Wed, 31 Jul 2024 20:19:33 +0200
Approved-by: paul.arzelier@xxxxxxx
Arc-authentication-results: i=1; mx.google.com; dkim=pass header.i=@LISTS.MCGILL.CA header.s=SELECTOR1 header.b=D5gJKYie; spf=pass (google.com: domain of owner-auditory@xxxxxxxxxxxxxxx designates 132.206.27.103 as permitted sender) smtp.mailfrom=owner-auditory@xxxxxxxxxxxxxxx; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=free.fr
Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-archive:list-owner:list-subscribe:list-unsubscribe:list-help :precedence:in-reply-to:to:subject:from:sender:reply-to:date :message-id:content-transfer-encoding:content-language:references :user-agent:mime-version:approved-by:dkim-signature; bh=CWXFns+qFQTFwA79T1B6WQM9M2ZjERJ0dWq0pg86KQk=; fh=5/42mu9FVmfuMp6n0xGXVcDar2H3ENcHt8Uv11Om8gY=; b=wsOogE//NtFZ7WO3qnGVEhu1J4aK3DlnBZZjFBRzupzbS1tvqjeJ+ifGw4oszHxtfV J2JbAVWXpFfOP6bD7/wia8m6ZEsCH3SCuGusdGudW6neEQoRJRerR8GWO7yfVOWcbwpw gsTcHoFqw7beIrXDIWkSe1HJR48Vp4GmqFgA5drI86yB6DAv2//TBwd7BsiZ9aEIAVC/ Of4ifA03gDteDsL1+w18S1QqvjfRd6AbilinZ0MOLX76XAhn5id8C+hmRLcAiZq3LluN M7gY7Yp5P2n9CXRCgYWeqpSsWxmRgzc3MejJH8z7AIwA0b6pD/hh3EjI+xdHAVt2P6C4 1KLQ==; dara=google.com
Arc-seal: i=1; a=rsa-sha256; t=1722485808; cv=none; d=google.com; s=arc-20160816; b=ILbtDeZvN/FLWRsSE5Ut9/zjdJxCo3lAjJbR4tpwalSbtXTYuUvYrYZB/J0+z2P4nR c3GMWAnZ4OoM3lr16lvbk6CT9TQjrOAFP/ffbaOfC0ji6LCrWcPv3vNLFLwh4zNKPDeb Bg8QPVcwzJCrIfWEw9j5q/DNno/QuFp7un79gYIc4sPUUNvjYiovtPBnGL/1ffi+8U/F nYfi+5CFDOkf0Nhgo9fuG/w1jorueodWJRnBVcXeh3QIB2MLvQq/rFHBg6V4qYVm7XnQ jcLieNI1J14zj2tjvdOKXKaPzt8A0NE6BIY7HwxumWU9xbylNFm6DwTRYaOXFVjzosz3 E4MA==
Authentication-results: mx.google.com; dkim=pass header.i=@LISTS.MCGILL.CA header.s=SELECTOR1 header.b=D5gJKYie; spf=pass (google.com: domain of owner-auditory@xxxxxxxxxxxxxxx designates 132.206.27.103 as permitted sender) smtp.mailfrom=owner-auditory@xxxxxxxxxxxxxxx; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=free.fr
Delivered-to: dan.ellis@xxxxxxxxx
Dkim-signature: v=1; a=rsa-sha256; d=LISTS.MCGILL.CA; s=SELECTOR1; c=relaxed/relaxed; bh=CWXFns+qFQTFwA79T1B6WQM9M2ZjERJ0dWq0pg86KQk=; i=@LISTS.MCGILL.CA; h=Approved-By:MIME-Version:User-Agent:References:Content-Language:Content-Type:Content-Transfer-Encoding:Message-ID:Date:Reply-To:Sender:From:Subject:To:In-Reply-To:List-Help:List-Unsubscribe:List-Subscribe:List-Owner:List-Archive; b=D5gJKYie6eJubJgGue3CHTz4s9R63nKicEGcKuag2jsxduUQv8wrsHemBEkKw7wzIt21CQ4r8m3qTbz795nkpd3F9uOzbAhTmSW7jO2nMX5FwCvc7KwGZRI6kZBLe1kfl8jtwx98CAaRycLQLXj0VI6LDkRH6i8EiS5XNCtNC2dvaLS26jKqR0/Akv1Kjf6QxNyTcCreY2TEsT8nePIBsfXh38yqT4apc9MR9zSSLJL3ivllPmTRMabHsjvIZdzOAZUUbIkoaQlcGXwbHxA0duGPUeYp56piobfe7/VpYcL6lQG8oeEw/u0wuXnjSQhIYI3ayu8QUg4DPiAPhdDI5A==
In-reply-to: <19C7F226-DF0C-4779-9F5B-AF324476CB5B@ccrma.stanford.edu>
List-archive: <https://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>
List-help: <https://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>, <mailto:LISTSERV@LISTS.MCGILL.CA?body=INFO%20AUDITORY>
List-owner: <mailto:AUDITORY-request@LISTS.MCGILL.CA>
List-subscribe: <mailto:AUDITORY-subscribe-request@LISTS.MCGILL.CA>
List-unsubscribe: <mailto:AUDITORY-unsubscribe-request@LISTS.MCGILL.CA>
References: <ee531ad8-9554-4612-99a7-5cb3ba3fd452@free.fr> <19C7F226-DF0C-4779-9F5B-AF324476CB5B@ccrma.stanford.edu>
Reply-to: Paul Arzelier <paul.arzelier@xxxxxxx>
Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>
User-agent: Mozilla Thunderbird

Hi!

Thanks for your answer, and appreciate the kind words! You definitelymake some good points:

1. I did NOT think about windowing the songs in "meaningful" moments,but the beginning and the end make a lot of sense since we wanna make itsmooth. I'll explore and see what that yield.3. I'm also thinking of letting users choose themselves, and see whatthey like more.5. It's super cool, I didn't know they exposed that! I'll definitely useit to check that it matches at least e.g. the key the song is in.

@Pierre - do you have a hunch of what they might say? One problem isthat musicians are not really concerned with the harsh reality ofdigital signal processing :D


Best Regards,
Paul

Le 30/07/2024 à 17:54, Hem, Charles a écrit :

Hi Paul,
Very cool project! I'm a PhD student at Harvard studying cochlearimplants on the human perception side of things, so I'm no audiospecialist myself. Trying to study music perception for cochlearimplanted recipients brings me back to featurization of musicfrequently, though, so I have a couple non-expert thoughts for bullets1, 3, and 5:
1: Maybe it's adding too much complexity, but I wonder ifsplitting your feature analysis into more windows could be helpful ifyou're most concerned about seamless transitions between songs. Forexample, "start of song" (first 20 seconds, or something near that),"whole song", and "end of song" (last 20 seconds) could be used togive some extra weight to making sure that the "end of song" from thefirst song matches the "start of song" of the following song. Mainlyfor songs that are highly variable or have a drastically different intro.
3: I haven't read the chroma paper you cited, but gut reaction feelslike these features shouldn't outweigh tempo and loudness so much. Ithink your four class division could be a good idea.
5: Spotify song features are pretty easy to get using this:https://developer.spotify.com/documentation/web-api/reference/get-audio-features.The features are much more abstract than your features, but in myexperience, Spotify's smart playlists do a pretty good job. I'mguessing these features are partially used in that, so could be a nicereference to compare your results to.
Most of this probably isn't super helpful, but I'll keep an eye outfor any updates about the project! It could be a cool research toolto quantify people's music preferences one day, which would be aninteresting use for me.
Best,
Charles Hem

References:
- [AUDITORY] Feedback on features for music similarity
  - From: Paul Arzelier

Prev by Date: [AUDITORY] Joint PhD position (Sydney, Australia and Dijon, France):
Next by Date: [AUDITORY] Deadline Extension to 01.09.2024: TISMIR Special Collection on Multi-Modal Music Information Retrieval
Previous by thread: [AUDITORY] Feedback on features for music similarity
Next by thread: [AUDITORY] Joint PhD position (Sydney, Australia and Dijon, France):
Index(es):
- Date
- Thread