Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Spotify Recommendation’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/bricevergnou/spotify-recommendation on 28 January 2022.
--- Dataset description provided by original source is as follows ---
( You can check how I used this dataset on my github repository )
I am basically a HUGE fan of music ( mostly French rap though with some exceptions but I love music ). And someday , while browsing stuff on Internet , I found the Spotify's API . I knew I had to use it when I found out you could get information like danceability about your favorite songs just with their id's.
https://user-images.githubusercontent.com/86613710/127216769-745ac143-7456-4464-bbe3-adc53872c133.png" alt="image">
Once I saw that , my machine learning instincts forced me to work on this project.
I collected 100 liked songs and 95 disliked songs
For those I like , I made a playlist of my favorite 100 songs. It is mainly French Rap , sometimes American rap , rock or electro music.
For those I dislike , I collected songs from various kind of music so the model will have a broader view of what I don't like
There is : - 25 metal songs ( Cannibal Corps ) - 20 " I don't like " rap songs ( PNL ) - 25 classical songs - 25 Disco songs
I didn't include any Pop song because I'm kinda neutral about it
From the Spotify's API "Get a playlist's Items" , I turned the playlists into json formatted data which cointains the ID and the name of each track ( ids/yes.py and ids/no.py ). NB : on the website , specify "items(track(id,name))" in the fields format , to avoid being overwhelmed by useless data.
With a script ( ids/ids_to_data.py ) , I turned the json data into a long string with each ID separated with a comma.
Now I just had to enter the strings into the Spotify API "Get Audio Features from several tracks" and get my data files ( data/good.json and data/dislike.json )
From Spotify's API documentation :
And the variable that has to be predicted :
--- Original source retains full ownership of the source dataset ---
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Spotify Song Attributes’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/geomack/spotifyclassification on 28 January 2022.
--- Dataset description provided by original source is as follows ---
A dataset of 2017 songs with attributes from Spotify's API. Each song is labeled "1" meaning I like it and "0" for songs I don't like. I used this to data to see if I could build a classifier that could predict whether or not I would like a song.
I wrote an article about the project I used this data for. It includes code on how to grab this data from the Spotipy API wrapper and the methods behind my modeling. https://opendatascience.com/blog/a-machine-learning-deep-dive-into-my-spotify-data/
Each row represents a song.
There are 16 columns. 13 of which are song attributes, one column for song name, one for artist, and a column called "target" which is the label for the song.
Here are the 13 track attributes: acousticness, danceability, duration_ms, energy, instrumentalness, key, liveness, loudness, mode, speechiness, tempo, time_signature, valence.
Information on what those traits mean can be found here: https://developer.spotify.com/web-api/get-audio-features/
I would like to thank Spotify for providing this readily accessible data.
I'm a music lover who's curious about why I love the music that I love.
--- Original source retains full ownership of the source dataset ---
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Spotify Past Decades Songs Attributes’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/cnic92/spotify-past-decades-songs-50s10s on 28 January 2022.
--- Dataset description provided by original source is as follows ---
Why do we like some songs more than others? Is there something about a song that pleases out subconscious, making us listening to it on repeat? To understand this I collected various attributes from a selection of songs available in the Spotify's playlist "All out ..s" starting from the 50s up to the newly ended 10s. Can you find the secret sauce to make a song popular?
This data repo contains 7 datasets (.csv
files), each representing a Spotify's "All out ..s" type of playlist. Those playlists collect the most popular/iconic songs from the decade. For each song, a set of attributes have been reported in order to perform some data analysis. The attributes have been scraped from this amazing website. In particular, according to the website the attributes are:
top genre
: genre of the songyear
: year of the song (due to re-releases, the year might not correspond to the release year of the original song)bpm
(beats per minute): beats per minutenrgy
(energy): energy of a song, the higher the value the more energetic the song isdnce
(danceability): the higher the value, the easier it is to dance to this song.dB
(loudness): the higher the value, the louder the song.live
(liveness): the higher the value, the more likely the song is a live recording.val
(valence): the higher the value, the more positive mood for the song.dur
(duration): the duration of the song.acous
(acousticness): the higher the value the more acoustic the song is.spch
(speechiness): the higher the value the more spoken word the song contains.pop
(popularity): the higher the value the more popular the song is.I got inspired by the top-notch work by Leonardo Henrique in this dataset. Thanks to him I discovered this website, from which all the data collected here have been scraped.
--- Original source retains full ownership of the source dataset ---
The dataset is composed by different musial genres and for each kind we have different features that caracterize it. Reference https://developer.spotify.com/documentation/web-api/reference/#endpoint-get-audio-features
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Spotify Top 2020 Songs’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/heminp16/spotify-top-2020-songs on 28 January 2022.
--- Dataset description provided by original source is as follows ---
The top 50 songs from the year 2020, totaling 50 songs in the dataset.
The attributes were scraped from this website. Top Genre - genre of the song Year - release year of the song BPM – Beats Per Minute - The tempo of the song. Energy - The energy of a song; the higher the value, the more energetic. Danceability – Describes how suitable a track is for dancing; the higher the value, the easier it is to dance. Loudness (dB) – The loudness level in decibels, higher the value, the louder the song Liveness - the higher the value, the more likely the song is a live recording. Valence - A measure of musical positiveness of the track. The tracks with the highest number give a sense of positive moods. Duration (sec) - The duration of the song in seconds. Acousticness - A measure of how acoustic the track is. Speechiness - The higher the value that tells how many spoken words were in the track. Popularity - The higher the value, the more popular the song is.
I got inspired by Leonardo Henrique in this dataset, and found the amazing website that helped me scrap this data!
--- Original source retains full ownership of the source dataset ---
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Spotify Top 2018 & 2019 Songs’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/heminp16/spotify-top-2018-2019-songs on 28 January 2022.
--- Dataset description provided by original source is as follows ---
The top 50 songs from the years 2018 and 2019, totaling 100 songs in the dataset. Note: Better Now by Post Malone repeats twice, as it was popular in both 2018 and 2019.
The attributes were scraped from this website. Top Genre - genre of the song Year - release year of the song BPM – Beats Per Minute - The tempo of the song. Energy - The energy of a song; the higher the value, the more energetic. Danceability – Describes how suitable a track is for dancing; the higher the value, the easier it is to dance. Loudness (dB) – The loudness level in decibels, higher the value, the louder the song Liveness - the higher the value, the more likely the song is a live recording. Valence - A measure of musical positiveness of the track. The tracks with the highest number give a sense of positive moods. Duration (sec) - The duration of the song in seconds. Acousticness - A measure of how acoustic the track is. Speechiness - The higher the value that tells how many spoken words were in the track. Popularity - The higher the value, the more popular the song is.
I got inspired by Leonardo Henrique in this dataset, and found the amazing website that helped me scrap this data!
--- Original source retains full ownership of the source dataset ---
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Spotify Recommendation’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/bricevergnou/spotify-recommendation on 28 January 2022.
--- Dataset description provided by original source is as follows ---
( You can check how I used this dataset on my github repository )
I am basically a HUGE fan of music ( mostly French rap though with some exceptions but I love music ). And someday , while browsing stuff on Internet , I found the Spotify's API . I knew I had to use it when I found out you could get information like danceability about your favorite songs just with their id's.
https://user-images.githubusercontent.com/86613710/127216769-745ac143-7456-4464-bbe3-adc53872c133.png" alt="image">
Once I saw that , my machine learning instincts forced me to work on this project.
I collected 100 liked songs and 95 disliked songs
For those I like , I made a playlist of my favorite 100 songs. It is mainly French Rap , sometimes American rap , rock or electro music.
For those I dislike , I collected songs from various kind of music so the model will have a broader view of what I don't like
There is : - 25 metal songs ( Cannibal Corps ) - 20 " I don't like " rap songs ( PNL ) - 25 classical songs - 25 Disco songs
I didn't include any Pop song because I'm kinda neutral about it
From the Spotify's API "Get a playlist's Items" , I turned the playlists into json formatted data which cointains the ID and the name of each track ( ids/yes.py and ids/no.py ). NB : on the website , specify "items(track(id,name))" in the fields format , to avoid being overwhelmed by useless data.
With a script ( ids/ids_to_data.py ) , I turned the json data into a long string with each ID separated with a comma.
Now I just had to enter the strings into the Spotify API "Get Audio Features from several tracks" and get my data files ( data/good.json and data/dislike.json )
From Spotify's API documentation :
And the variable that has to be predicted :
--- Original source retains full ownership of the source dataset ---