«Recommended Frequencies» is a recommendation engine for playlists and currently works for Spotify. Given a selected playlist in a user's library, the app suggests songs from the user's liked songs that may make a good addition to it. The current goal of this app is to provide recommendations solely using information from a user's library. Thus, methods like collaborative filtering are outside of the current scope.
Specifically, under the current scope, song suggestions are based on a trained Catboost model (for more information refer to the model section) that uses audio features, song attributes and genre information in the form of embeddings provided by Every Noise at Once. The audio features used by the app are a subset of those provided by Spotify's API in addition to the year of the song's album release. All features used are listed here.
The app is developed using streamlit
.
The first step in the dashboard is to select a desired playlist from one's library. A selection of songs, the playlist profile as measured by the audio features of its songs and some album covers are visualised.
The second step is to select similarity metrics. The first option is to simply base similarity off of audio features. The second is to include genre information. Details on the available similarity metrics can be found in here.
These steps have been validated with the website's version as of 2021-01-17.
- Go to https://developer.spotify.com/dashboard/ and create an account
- Click on "CREATE AN APP"
- Provide the app name & description of your choice, tick the terms of service and click "CREATE"
- Click on "EDIT SETTINGS"
- Under "Redirect URIs" put
http://localhost:9000/callback/
andhttp://localhost:8090
- On the left side of the dashboard, underneath the description of your app, you will find your apps' "Client ID". Take note of it as you will need it in step 2.2.
- Below your "Client ID" you will find an option to "SHOW CLIENT SECRET", click on it and take note of the value as you you will need it in step 2.2.
The below instructions are for Linux or MacOS.
git clone git@github.com:calyptis/RecommendedFrequencies.git
cd RecommendedFrequencies
source prepare_env.sh
In the folder credentials
create a file named credentials.json
where you specify the configurations you obtained in step 1.6 & 1.7.
The file has the following structure:
{
"client_id": "{your_client_id}",
"client_secret": "{you_client_secret}",
"redirect_uri": "http://localhost:9000/callback/",
"redirect_flask_uri": "http://localhost:8090"
}
replace your client ID with value from step 1.6 and your client secret from step 1.7.
In order to host the dashboard locally, one must collect all relevant data. This can be done by running
python src/spotify/main.py
Note that this will open a Spotify login page in the browser were one must authenticate themselves.
Once logged in, a .spotify_caches
will be created in the root folder.
Subsequent API calls will not trigger a log-in page anymore unless this file is deleted.
Once all the data has been obtained, one can spin up the dashboard by running
streamlit run src/streamlit/main.py
Documentation of all the available audio features can be found on Spotify's official documentation page.
In this project, a subset of those are used and are listed below with their official description for convenience:
- Acousticness: A confidence measure from 0.0 to 1.0 of whether the track is acoustic. 1.0 represents high confidence the track is acoustic.
- Danceability: Danceability describes how suitable a track is for dancing based on a combination of musical elements including tempo, rhythm stability, beat strength, and overall regularity. A value of 0.0 is least danceable and 1.0 is most danceable.
- Energy: Energy is a measure from 0.0 to 1.0 and represents a perceptual measure of intensity and activity. Typically, energetic tracks feel fast, loud, and noisy. Perceptual features contributing to this attribute include dynamic range, perceived loudness, timbre, onset rate, and general entropy.
- Instrumentalness: Predicts whether a track contains no vocals. "Ooh" and "aah" sounds are treated as instrumental in this context. The closer the instrumentalness value is to 1.0, the greater likelihood the track contains no vocal content. Values above 0.5 are intended to represent instrumental tracks, but confidence is higher as the value approaches 1.0.
- Liveness: Detects the presence of an audience in the recording. Higher liveness values represent an increased probability that the track was performed live. A value above 0.8 provides strong likelihood that the track is live.
- Loudness: The overall loudness of a track in decibels (dB). Loudness values are averaged across the entire track and are useful for comparing relative loudness of tracks. Loudness is the quality of a sound that is the primary psychological correlate of physical strength (amplitude). Values typically range between -60 and 0 db.
- Speechiness: Speechiness detects the presence of spoken words in a track. The more exclusively speech-like the recording (e.g. talk show, audio book, poetry), the closer to 1.0 the attribute value. Values above 0.66 describe tracks that are probably made entirely of spoken words.
- Tempo: The overall estimated tempo of a track in beats per minute (BPM). In musical terminology, tempo is the speed or pace of a given piece and derives directly from the average beat duration.
- Valence: A measure from 0.0 to 1.0 describing the musical positiveness conveyed by a track. Tracks with high valence sound more positive (e.g. happy, cheerful, euphoric), while tracks with low valence sound more negative (e.g. sad, depressed, angry).
In addition, metadata on songs are considered. Specifically, the 10th audio attribute used in this project is Year of Album Release.
All these features are cast to a range of [0, 1] if they are not already.
2. Model
To measure the similarity between two songs, a Catboost
model is trained.
Positive song pairs are generated based on their occurrence in the same playlist.
Negative song pairs are generated based on user specified pairs of playlists that are very different.
For each song pair, the model uses the audio features as well as their location in the genre space as measured by
the Every Noise at Once project. Similar genres lie close to each other in this space, of which a visual representation is available on
the project's website.
Since this musical genre space is two-dimensional, each genre is represented by its (x, y)
location in the scatter plot.
A song's location in this space is simply the average of all its associated genres.
- All audio previews are sometimes played at the same time when updating the dashboard.
- Get the oldest year of release for a given song. For example, if a song appeared in a recent remastered album, track down the first album the song appears in and use that as a release date.
- https://dubolt.com
- https://newsroom.spotify.com/2021-09-09/get-perfect-song-recommendations-in-the-playlists-you-create-with-enhance/
- https://spotibot.com
- https://www.chosic.com/
- Spotify's song features
- Sample of data: https://www.kaggle.com/nadintamer/top-tracks-of-2017
- Description of features: https://developer.spotify.com/documentation/web-api/reference/#/operations/get-audio-features
- Analysis of music genres
- Genre similarity: https://everynoise.com/
- Music recommendation
- Articles on Spotify's music recommendation
- Pre-trained ML models: