100+ datasets found

Playlist2vec: Spotify Million Playlist Dataset
zenodo.org
data.niaid.nih.gov
+1more
bin
Updated Jun 22, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Piyush Papreja; Piyush Papreja (2021). Playlist2vec: Spotify Million Playlist Dataset [Dataset]. http://doi.org/10.5281/zenodo.5002584
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.5002584
Dataset updated
Jun 22, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Piyush Papreja; Piyush Papreja
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset was created using Spotify developer API. It consists of user-created as well as Spotify-curated playlists.
The dataset consists of 1 million playlists, 3 million unique tracks, 3 million unique albums, and 1.3 million artists.
The data is stored in a SQL database, with the primary entities being songs, albums, artists, and playlists.
Each of the aforementioned entities are represented by unique IDs (Spotify URI).
Data is stored into following tables:

album

artist

track

playlist

track_artist1

track_playlist1

album

| id | name | uri |

id: Album ID as provided by Spotify
name: Album Name as provided by Spotify
uri: Album URI as provided by Spotify

artist

| id | name | uri |

id: Artist ID as provided by Spotify
name: Artist Name as provided by Spotify
uri: Artist URI as provided by Spotify

track

| id | name | duration | popularity | explicit | preview_url | uri | album_id |

id: Track ID as provided by Spotify
name: Track Name as provided by Spotify
duration: Track Duration (in milliseconds) as provided by Spotify
popularity: Track Popularity as provided by Spotify
explicit: Whether the track has explicit lyrics or not. (true or false)
preview_url: A link to a 30 second preview (MP3 format) of the track. Can be null
uri: Track Uri as provided by Spotify
album_id: Album Id to which the track belongs

playlist

| id | name | followers | uri | total_tracks |

id: Playlist ID as provided by Spotify
name: Playlist Name as provided by Spotify
followers: Playlist Followers as provided by Spotify
uri: Playlist Uri as provided by Spotify
total_tracks: Total number of tracks in the playlist.

track_artist1

| track_id | artist_id |

Track-Artist association table

track_playlist1

| track_id | playlist_id |

Track-Playlist association table

- - - - - SETUP - - - - -

The data is in the form of a SQL dump. The download size is about 10 GB, and the database populated from it comes out to about 35GB.

spotifydbdumpschemashare.sql contains the schema for the database (for reference):
spotifydbdumpshare.sql is the actual data dump.

Setup steps:
1. Create database

- - - - - PAPER - - - - -

The description of this dataset can be found in the following paper:

Papreja P., Venkateswara H., Panchanathan S. (2020) Representation, Exploration and Recommendation of Playlists. In: Cellier P., Driessens K. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019. Communications in Computer and Information Science, vol 1168. Springer, Cham
🎧 Spotify Global Streaming Data (2024)
kaggle.com
zip
Updated Apr 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Atharva Soundankar (2025). 🎧 Spotify Global Streaming Data (2024) [Dataset]. https://www.kaggle.com/datasets/atharvasoundankar/spotify-global-streaming-data-2024
Explore at:
zip(28022 bytes)Available download formats
Dataset updated
Apr 30, 2025
Authors
Atharva Soundankar
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
📊 About the Dataset

This dataset captures the global music streaming trends on Spotify for the year 2024. It provides valuable insights into user preferences across various countries, top-performing artists and albums, streaming hours, and listener behavior patterns. It is designed to support data analysis, machine learning models, and business intelligence dashboards in the music and media industry.

With over 500 rows of clean, non-duplicated, and realistic entries from countries around the world, this dataset is ideal for uncovering:

Global music popularity patterns

Listener engagement across genres and demographics

Artist performance across countries

Revenue forecasting and content recommendations

--
Spotify Dataset
brightdata.com
.json, .csv, .xlsx
Updated Apr 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bright Data (2024). Spotify Dataset [Dataset]. https://brightdata.com/products/datasets/spotify
Explore at:
.json, .csv, .xlsxAvailable download formats
Dataset updated
Apr 10, 2024
Dataset authored and provided by
Bright Datahttps://brightdata.com/
License
https://brightdata.com/licensehttps://brightdata.com/license
Area covered
Worldwide
Description
Gain valuable insights into music trends, artist popularity, and streaming analytics with our comprehensive Spotify Dataset. Designed for music analysts, marketers, and businesses, this dataset provides structured and reliable data from Spotify to enhance market research, content strategy, and audience engagement.

Dataset Features

Track Information: Access detailed data on songs, including track name, artist, album, genre, and release date. Streaming Popularity: Extract track popularity scores, listener engagement metrics, and ranking trends. Artist & Album Insights: Analyze artist performance, album releases, and genre trends over time. Related Searches & Recommendations: Track related search terms and suggested content for deeper audience insights. Historical & Real-Time Data: Retrieve historical streaming data or access continuously updated records for real-time trend analysis.

Customizable Subsets for Specific Needs Our Spotify Dataset is fully customizable, allowing you to filter data based on track popularity, artist, genre, release date, or listener engagement. Whether you need broad coverage for industry analysis or focused data for content optimization, we tailor the dataset to your needs.

Popular Use Cases

Market Analysis & Trend Forecasting: Identify emerging music trends, genre popularity, and listener preferences. Artist & Label Performance Tracking: Monitor artist rankings, album success, and audience engagement. Competitive Intelligence: Analyze competitor music strategies, playlist placements, and streaming performance. AI & Machine Learning Applications: Use structured music data to train AI models for recommendation engines, playlist curation, and predictive analytics. Advertising & Sponsorship Insights: Identify high-performing tracks and artists for targeted advertising and sponsorship opportunities.

Whether you're optimizing music marketing, analyzing streaming trends, or enhancing content strategies, our Spotify Dataset provides the structured data you need. Get started today and customize your dataset to fit your business objectives.
Spotify dataset
kaggle.com
zip
Updated Jun 17, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gati Ambaliya (2024). Spotify dataset [Dataset]. https://www.kaggle.com/datasets/ambaliyagati/spotify-dataset-for-playing-around-with-sql
Explore at:
zip(309669 bytes)Available download formats
Dataset updated
Jun 17, 2024
Authors
Gati Ambaliya
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Description for Spotify Songs Dataset on Kaggle

Dataset Title: Spotify Songs Dataset

Description: This dataset contains a collection of songs fetched from the Spotify API, covering various genres including "acoustic", "afrobeat", "alt-rock", "alternative", "ambient", "anime", "black-metal", "bluegrass", "blues", "bossanova", "brazil", "breakbeat", "british", "cantopop", "chicago-house", "children", "chill", "classical", "club", "comedy", "country", "dance", "dancehall", "death-metal", "deep-house", "detroit-techno", "disco", "disney", "drum-and-bass", "dub", "dubstep", "edm", "electro", "electronic", "emo", "folk", "forro", "french", "funk", "garage", "german", "gospel", "goth", "grindcore", "groove", "grunge", "guitar", "happy", "hard-rock", "hardcore", "hardstyle", "heavy-metal", "hip-hop", "holidays", "honky-tonk", "house", "idm", "indian", "indie", "indie-pop", "industrial", "iranian", "j-dance", "j-idol", "j-pop", "j-rock", "jazz", "k-pop", "kids", "latin", "latino", "malay", "mandopop", "metal", "metal-misc", "metalcore", "minimal-techno", "movies", "mpb", "new-age", "new-release", "opera", "pagode", "party", "philippines-opm", "piano", "pop", "pop-film", "post-dubstep", "power-pop", "progressive-house", "psych-rock", "punk", "punk-rock", "r-n-b", "rainy-day", "reggae", "reggaeton", "road-trip", "rock", "rock-n-roll", "rockabilly", "romance", "sad", "salsa", "samba", "sertanejo", "show-tunes", "singer-songwriter", "ska", "sleep", "songwriter", "soul", "soundtracks", "spanish", "study", "summer", "swedish", "synth-pop", "tango", "techno", "trance", "trip-hop", "turkish", "work-out", "world-music". Each entry in the dataset provides detailed information about a song, including its name, artists, album, popularity, duration, and whether it is explicit.

Data Collection Method: The data was collected using the Spotify Web API through a Python script. The script performed searches for different genres and retrieved the top tracks for each genre. The fetched data was then compiled and saved into a CSV file.

Columns Description: id: Unique identifier for the track on Spotify. name: Name of the track. genre: genre of the song. artists: Names of the artists who performed the track, separated by commas if there are multiple artists. album: Name of the album the track belongs to. popularity: Popularity score of the track (0-100, where higher is more popular). duration_ms: Duration of the track in milliseconds. explicit: Boolean indicating whether the track contains explicit content.

Potential Uses: This dataset can be used for a variety of purposes, including but not limited to:

Music Analysis: Analyze the popularity and characteristics of songs across different genres.

Recommendation Systems: Develop and test music recommendation algorithms.

Trend Analysis: Study trends in music preferences and popularity over time.

Machine Learning: Train machine learning models for tasks like genre classification or popularity prediction. _ Acknowledgements: This dataset was created using the Spotify Web API. Special thanks to Spotify for providing access to their extensive music library through their API. _ License: This dataset is made available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. You are free to use, modify, and distribute this dataset, provided you give appropriate credit to the original creator. _
Z
spotify data
data-staging.niaid.nih.gov
data.niaid.nih.gov
Updated Jul 5, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ryan Hulke (2023). spotify data [Dataset]. https://data-staging.niaid.nih.gov/resources?id=zenodo_8114617
Explore at:
Dataset updated
Jul 5, 2023
Dataset provided by
student
Authors
Ryan Hulke
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
from kaggle
My Spotify Data - Cleaned
kaggle.com
zip
Updated Jan 26, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Malinga Rajapaksha (2024). My Spotify Data - Cleaned [Dataset]. https://www.kaggle.com/datasets/malingarajapaksha/my-spotify-data-cleaned
Explore at:
zip(2952139 bytes)Available download formats
Dataset updated
Jan 26, 2024
Authors
Malinga Rajapaksha
Description
The dataset contains records of the user's Spotify streaming history, with each row representing a specific instance of a played track. The data includes various attributes providing insights into the user's music listening habits.

Columns:

ts (Timestamp):

The timestamp when the track was played.

platform:

The platform or device used for streaming (e.g., Windows 10).

ms_played:

The duration in milliseconds of how long the track was played.

conn_country:

The country code indicating the user's location during streaming (e.g., LK for Sri Lanka).

master_metadata_track_name:

The name of the track played.

master_metadata_album_artist_name:

The artist of the album to which the track belongs.

master_metadata_album_album_name:

The name of the album containing the track.

spotify_track_uri:

The unique Spotify URI for the track.

reason_start:

The reason for starting the track (e.g., play button clicked).

reason_end:

The reason for ending the track (e.g., track done).

shuffle:

Indicates whether shuffle mode was enabled (True/False).

offline:

Indicates whether the track was played offline (True/False).

offline_timestamp:

Timestamp indicating when the track was played offline (if applicable).

incognito_mode:

Indicates whether incognito mode was enabled (True/False).

Purpose:

This dataset is suitable for performing detailed Exploratory Data Analysis (EDA) to uncover patterns, trends, and insights into the user's music-listening behaviour. Potential analyses could include the distribution of listening durations, favourite artists and tracks, exploration of geographic listening patterns, and examination of usage patterns across different platforms.

Visualization tools such as Matplotlib and Seaborn could be utilized for a more in-depth analysis to create visual representations of the findings. This dataset aligns well with your interest in data science, offering opportunities to apply analytical techniques to real-world streaming data.
Data from: MusicOSet: An Enhanced Open Dataset for Music Data Mining
zenodo.org
data.niaid.nih.gov
+1more
bin, zip
Updated Jun 7, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mariana O. Silva; Mariana O. Silva; Laís Mota; Mirella M. Moro; Mirella M. Moro; Laís Mota (2021). MusicOSet: An Enhanced Open Dataset for Music Data Mining [Dataset]. http://doi.org/10.5281/zenodo.4904639
Explore at:
zip, binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.4904639
Dataset updated
Jun 7, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Mariana O. Silva; Mariana O. Silva; Laís Mota; Mirella M. Moro; Mirella M. Moro; Laís Mota
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
MusicOSet is an open and enhanced dataset of musical elements (artists, songs and albums) based on musical popularity classification. Provides a directly accessible collection of data suitable for numerous tasks in music data mining (e.g., data visualization, classification, clustering, similarity search, MIR, HSS and so forth). To create MusicOSet, the potential information sources were divided into three main categories: music popularity sources, metadata sources, and acoustic and lyrical features sources. Data from all three categories were initially collected between January and May 2019. Nevertheless, the update and enhancement of the data happened in June 2019.

The attractive features of MusicOSet include:

Integration and centralization of different musical data sources

Calculation of popularity scores and classification of hits and non-hits musical elements, varying from 1962 to 2018

Enriched metadata for music, artists, and albums from the US popular music industry

Availability of acoustic and lyrical resources

Unrestricted access in two formats: SQL database and compressed .csv files

| Data | # Records | |:-----------------:|:---------:| | Songs | 20,405 | | Artists | 11,518 | | Albums | 26,522 | | Lyrics | 19,664 | | Acoustic Features | 20,405 | | Genres | 1,561 |
Data from: Spotify Playlists Dataset
zenodo.org
data.niaid.nih.gov
+1more
zip
Updated Jan 24, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Martin Pichl; Eva Zangerle; Eva Zangerle; Martin Pichl (2020). Spotify Playlists Dataset [Dataset]. http://doi.org/10.5281/zenodo.2594557
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.2594557
Dataset updated
Jan 24, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Martin Pichl; Eva Zangerle; Eva Zangerle; Martin Pichl
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is based on the subset of users in the #nowplaying dataset who publish their #nowplaying tweets via Spotify. In principle, the dataset holds users, their playlists and the tracks contained in these playlists.

The csv-file holding the dataset contains the following columns: "user_id", "artistname", "trackname", "playlistname", where

user_id is a hash of the user's Spotify user name

artistname is the name of the artist

trackname is the title of the track and

playlistname is the name of the playlist that contains this track.

The separator used is , each entry is enclosed by double quotes and the escape character used is \.

A description of the generation of the dataset and the dataset itself can be found in the following paper:

Pichl, Martin; Zangerle, Eva; Specht, Günther: "Towards a Context-Aware Music Recommendation Approach: What is Hidden in the Playlist Name?" in 15th IEEE International Conference on Data Mining Workshops (ICDM 2015), pp. 1360-1365, IEEE, Atlantic City, 2015.
c
Spotify Tracks Dataset
cubig.ai
zip
Updated May 20, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CUBIG (2025). Spotify Tracks Dataset [Dataset]. https://cubig.ai/store/products/276/spotify-tracks-dataset
Explore at:
zipAvailable download formats
Dataset updated
May 20, 2025
Dataset authored and provided by
CUBIG
License
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
Measurement technique
Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
Description
1) Data Introduction • The Spotify Tracks Dataset contains information on tracks from over 125 music genres, including both audio features (e.g., danceability, energy, valence) and metadata (e.g., title, artist, genre).

2) Data Utilization (1) Characteristics of the Spotify Tracks Dataset: • The data is structured in a tabular format at the track level, where each column represents numerical or categorical features based on musical properties. This makes it suitable for recommendation systems, genre classification, and emotion analysis. • It includes multi-dimensional attributes grounded in music theory such as track duration, time signature, energy, loudness, tempo, and speechiness—enabling its use in music classification and clustering tasks.

(2) Applications of the Spotify Tracks Dataset: • Design of Music Recommendation Systems: It can be used to build content-based filtering systems or hybrid recommendation algorithms based on user preferences.
h
spotify-million-song-dataset
huggingface.co
Updated Jun 16, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vishnu Priya VR (2024). spotify-million-song-dataset [Dataset]. https://huggingface.co/datasets/vishnupriyavr/spotify-million-song-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 16, 2024
Authors
Vishnu Priya VR
License
https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Description
Dataset Card for Spotify Million Song Dataset

Dataset Summary

This is Spotify Million Song Dataset. This dataset contains song names, artists names, link to the song and lyrics. This dataset can be used for recommending songs, classifying or clustering songs.

Supported Tasks and Leaderboards

[More Information Needed]

Languages

[More Information Needed]

Dataset Structure Data Instances

[More Information Needed]

Data… See the full description on the dataset page: https://huggingface.co/datasets/vishnupriyavr/spotify-million-song-dataset.
Data from: Spotify Playlists
zenodo.org
csv
Updated Jan 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Francesco Cambria; Francesco Cambria (2025). Spotify Playlists [Dataset]. http://doi.org/10.5281/zenodo.14728731
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.14728731
Dataset updated
Jan 24, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Francesco Cambria; Francesco Cambria
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset was constructed based on the data found in Kaggle from Spotify.

The files here reported can be used to build a property graph in Neo4J:

song.csv - contains all the data for the Song nodes.

artist.csv - contains the data for the Artist nodes.

playlist.csv - contains the data for the Playlist nodes.

user.csv - contains the data for the Playlist nodes (those creating Playlists).

genre.csv - contains the data for the Genre nodes (a category for the Artists).

type.csv - contains the data for the Type nodes (a category for the Playlists).

sing.csv - contains the data for the SING relationship from Artist to Song nodes.

created.csv - contains the data for the CREATED relationship from User to Playlist nodes.

in.csv - contains the data for the IN relationship from Song to Playlist nodes.

of_type.csv - contains the data for the OFTYPE relationship from Playlist to Type nodes.

labelled.csv - contains the data for the LABELLED relationship from Artist to Genre nodes.

This data was used as test dataset in the paper "MINE GRAPH RULE: A New GQL Operator for Mining Association Rules in Property Graph Databases".
World's Spotify TOP-50 playlist musicality data
kaggle.com
zip
Updated Nov 26, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Miquel Neck (2023). World's Spotify TOP-50 playlist musicality data [Dataset]. https://www.kaggle.com/datasets/miquelneck/worlds-spotify-top-50-playlist-musicality-data
Explore at:
zip(175413 bytes)Available download formats
Dataset updated
Nov 26, 2023
Authors
Miquel Neck
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Area covered
World
Description
Every week, Spotify updates its Top-50 playlists for each country. This dataset includes every country list of the 45th week of 2023 (6th November - 12th November). There are 73 available countries.

The dataset has a column for every musical aspect of each song, and also the name, country, artist and publication date of the track.

Data extracted from the Spotify Official API.

Columns

These features are created by Spotify to analyze tracks. Here I copy the definition of each column, based on Spotify's API documentation.

Danceability: Danceability describes how suitable a track is for dancing based on a combination of musical elements including tempo, rhythm stability, beat strength, and overall regularity. A value of 0.0 is least danceable and 1.0 is most danceable.

Acousticness: A confidence measure from 0.0 to 1.0 of whether the track is acoustic. 1.0 represents high confidence the track is acoustic.

Duration_ms: The duration of the track in milliseconds.

Energy: Energy is a measure from 0.0 to 1.0 and represents a perceptual measure of intensity and activity. Typically, energetic tracks feel fast, loud, and noisy.

Instrumentalness: Predicts whether a track contains no vocals. "Ooh" and "aah" sounds are treated as instrumental in this context. Rap or spoken word tracks are clearly "vocal".

Key: The key the track is in. Integers map to pitches using standard Pitch Class notation. E.g. 0 = C, 1 = C♯/D♭, 2 = D, and so on. If no key was detected, the value is -1.

Liveness: Detects the presence of an audience in the recording. Higher liveness values represent an increased probability that the track was performed live. A value above 0.8 provides strong likelihood that the track is live.

Loudness: The overall loudness of a track in decibels (dB). Loudness values are averaged across the entire track and are useful for comparing relative loudness of tracks.

Mode: Mode indicates the modality (major or minor) of a track, the type of scale from which its melodic content is derived. Major is represented by 1 and minor is 0.

Speechiness: Speechiness detects the presence of spoken words in a track. The more exclusively speech-like the recording (e.g. talk show, audio book, poetry), the closer to 1.0 the attribute value.

Tempo: The overall estimated tempo of a track in beats per minute (BPM). In musical terminology, tempo is the speed or pace of a given piece and derives directly from the average beat duration.

Time_signature: An estimated time signature. The time signature (meter) is a notational convention to specify how many beats are in each bar (or measure). The time signature ranges from 3 to 7 indicating time signatures of "3/4", to "7/4".

Valence: A measure from 0.0 to 1.0 describing the musical positiveness conveyed by a track. Tracks with high valence sound more positive (e.g. happy, cheerful, euphoric), while tracks with low valence sound more negative (e.g. sad, depressed, angry).
Spotify dataset
kaggle.com
zip
Updated Jul 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sanjana chaudhari☑️ (2023). Spotify dataset [Dataset]. https://www.kaggle.com/datasets/sanjanchaudhari/spotify-dataset
Explore at:
zip(2045049 bytes)Available download formats
Dataset updated
Jul 20, 2023
Authors
Sanjana chaudhari☑️
License
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
Description
Introduction to the Spotify Dataset

Overview of the Dataset Source and Purpose Description of the Data Collection Process Data Exploration

Understanding the Structure and Size of the Dataset Overview of the Features and Columns Key Features in the Spotify Dataset

Explanation of Important Columns (e.g., track name, artist, album, duration, popularity) Genre and Music Category Analysis

Categorizing Songs by Genre and Music Type Most Popular Genres on Spotify **Artist Analysis ** Identifying Top Artists Based on Popularity and Number of Songs Relationship between Artist and Song Attributes Song Duration Analysis

Distribution of Song Durations Impact of Song Duration on Popularity and Listener Engagement Song Popularity and Listener Engagement

Analyzing the Popularity Scores of Songs Correlation between Popularity and Other Song Features Audio Features Analysis

Examination of Audio Features (danceability, energy, instrumentalness, etc.) Clustering Songs Based on Audio Features Time-Based Analysis

Seasonal Trends in Song Releases and Popularity Time Series Analysis of Listening Patterns Collaborations and Featured Artists

Frequency of Collaborations and Featured Artists Impact of Collaborations on Song Popularity Recommendation Systems

Overview of Spotify's Recommendation Algorithms Building Simple Recommendation Models User Behavior and Playlist Analysis

Analysis of User-Generated Playlists Common Song Additions and Removals
c
Spotify Playlist ORIGINS Dataset
cubig.ai
zip
Updated Jun 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CUBIG (2025). Spotify Playlist ORIGINS Dataset [Dataset]. https://cubig.ai/store/products/402/spotify-playlist-origins-dataset
Explore at:
zipAvailable download formats
Dataset updated
Jun 5, 2025
Dataset authored and provided by
CUBIG
License
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
Measurement technique
Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
Description
1) Data Introduction • The Spotify Playlist-ORIGINS Dataset is a dataset of Spotify playlists called ORIGINS, which individuals have made with their favorite songs since 2014.

2) Data Utilization (1) Spotify Playlist-ORIGINS Dataset has characteristics that: • This dataset contains detailed music information for each playlist, including song name, artist, album, genre, release year, track ID, and structured metadata such as name, description, and song order for each playlist. (2) Spotify Playlist-ORIGINS Dataset can be used to: • Playlist-based music recommendation and user preference analysis: It can be used to develop a machine learning/deep learning-based music recommendation system or to study user preference analysis using playlist and song information. • Music Trend and Genre Popularity Analysis: It analyzes release year, genre, and artist data and can be used to study the music industry and culture, including music trends by period and genre, and changes in popular artists and songs.
Spotify users in the U.S. 2018, by age
statista.com
Updated Nov 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Spotify users in the U.S. 2018, by age [Dataset]. https://www.statista.com/statistics/475821/spotify-users-age-usa/
Explore at:
Dataset updated
Nov 27, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Mar 2018
Area covered
United States
Description
As of March 2018, Spotify’s user base was dominated by Millennials, with ** percent of its users aged 25 to 34 and ** percent aged between 18 and 24 years old. The streaming giant has permanently altered how consumers discover, engage with and share music, and according to a 2018 survey, Spotify reaches almost **** of 16 to 24 year olds in the United States each week. The power of SpotifySpotify’s popularity is undeniable, accumulating millions of premium subscribers worldwide each quarter and hundreds of millions of unique visitors to Spotify.com every month. In the United States, Spotify is one of the most commonly used apps for listening to podcasts, and despite being in constant competition with Apple Music, remains a large part of U.S. music listeners’ lives. A survey revealed that Spotify is also the preferred music streaming service among 18 to 29-year-olds, which may seem unremarkable given the data on Spotify’s user base, but serves as further evidence of Spotify’s popularity among younger users. Whether Spotify’s growth will last forever, only time will tell, particularly as Apple Music continues to put up a good fight and smaller but increasingly popular services such as Deezer begin to make their mark. But with the company recording a profit in early 2019 for the first time since its inception, Spotify remains very much a market leader and firmly on the path to future success.
H
My Spotify Data
dataverse.harvard.edu
Updated Oct 7, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ty Mulholland (2022). My Spotify Data [Dataset]. http://doi.org/10.7910/DVN/FVCXKG
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/FVCXKG
Dataset updated
Oct 7, 2022
Dataset provided by
Harvard Dataverse
Authors
Ty Mulholland
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
My Spotify Data
e
spotify.com Traffic Analytics Data
analytics.explodingtopics.com
Updated Sep 1, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). spotify.com Traffic Analytics Data [Dataset]. https://analytics.explodingtopics.com/website/spotify.com
Explore at:
Dataset updated
Sep 1, 2025
Variables measured
Global Rank, Monthly Visits, Authority Score, US Country Rank, Online Services Category Rank
Description
Traffic analytics, rankings, and competitive metrics for spotify.com as of September 2025
Spotify - Beyoncé's Track Data
kaggle.com
Updated Mar 15, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
yuka_with_data (2024). Spotify - Beyoncé's Track Data [Dataset]. https://www.kaggle.com/datasets/yukawithdata/beyonce-track-attribute-data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 15, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
yuka_with_data
Description
💁‍♀️Please take a moment to carefully read through this description and metadata to better understand the dataset and its nuances before proceeding to the Suggestions and Discussions section.

Dataset Description:

This dataset compiles the tracks from all of Beyoncé's albums available on Spotify, showcasing the evolution of one of the most influential artists in the music industry. It represents a comprehensive array of genres, influences, and musical styles that Beyoncé has explored throughout her career. Each track in the dataset is detailed with a variety of features, popularity, and metadata. This dataset serves as an excellent resource for music enthusiasts, data analysts, and researchers aiming to explore the impact of Beyoncé's music, identify trends in her musical evolution, or develop music recommendation systems based on empirical data.

Scope of the Data:

The focus of this dataset is on providing a comprehensive view of Beyoncé's musical releases on Spotify, specifically tailored to showcase her creative output. To this end, the dataset includes tracks from the following album types: - Albums: Full-length albums released by Beyoncé, encapsulating a range of her musical styles and eras. - Singles: Standalone single releases, highlighting key songs that have been released independently of her full albums. It's important to note that this dataset deliberately excludes compilation albums. Compilations, which often contain a mixture of tracks from various artists or previously released tracks by Beyoncé, are not included to maintain a focus on her original releases and to provide a clearer picture of her artistic evolution.

Data Collection and Processing:

Obtaining the Data: The data was obtained directly from the Spotify Web API, specifically focusing on albums and tracks by Beyoncé. The Spotify API provides detailed information about tracks, artists, and albums through various endpoints.

Data Processing: To process and structure the data, Python scripts were developed using data science libraries such as pandas for data manipulation and spotipy for API interactions, specifically for Spotify data retrieval.

Workflow: - Authentication - API Requests - Data Cleaning and Transformation - Saving the Data

Attribute Descriptions:

artist_name: the name of the artist (Beyoncé and collaborators)

track_name: the title of the track

is_explicit: Indicates whether the track contains explicit content

album_release_date: The date when the track was released

genres: A list of genres associated with Beyoncé

danceability: A measure from 0.0 to 1.0 indicating how suitable a track is for - dancing based on a combination of musical elements

valence: A measure from 0.0 to 1.0 indicating the musical positiveness conveyed by a track

energy: A measure from 0.0 to 1.0 representing a perceptual measure of intensity and activity

loudness: The overall loudness of a track in decibels (dB)

acousticness: A measure from 0.0 to 1.0 whether the track is acoustic

instrumentalness: Predicts whether a track contains no vocals

liveness: Detects the presence of an audience in the recordings

speechiness: Detects the presence of spoken words in a track

key: The key the track is in. Integers map to pitches using standard Pitch Class notation

tempo: The overall estimated tempo of a track in beats per minute (BPM)

mode: Modality of the track

duration_ms: The length of the track in milliseconds

time_signature: An estimated overall time signature of a track

popularity: A score between 0 and 100, with 100 being the most popular

Possible Data Projects:

Trend Analysis in Beyonce's Musical Evolution

Mood and Musical Elements in Beyonce's Tracks

Beyonce's Influence on the Music Industry Analysis

Disclaimer and Responsible Use:

This dataset, derived from Spotify focusing on Beyoncé's albums and tracks, is intended for educational, research, and analysis purposes only. Users are urged to use this data responsibly, ethically, and within the bounds of legal stipulations. - Compliance with Terms of Service: Users should adhere to Spotify's Terms of Service and Developer Policies when utilizing this dataset. - Copyright Notice: The dataset presents music track information including names and artist details for analytical purposes and does not convey any rights to the music itself. Users must ensure that their use does not infringe on the copyright holders' rights. Any analysis, distribution, or derivative work should respect the intellectual property rights of all involved parties and comply with applicable laws. - No Warranty Disclaimer: The dataset is provided "as is," without warranty, and the creator disclaims any legal liability for its use by others. - Ethical Use: Users are encouraged to consider the ethical implications of their analyses and the potential impact...
Z
Data from: P4KxSpotify: A Dataset of Pitchfork Music Reviews and Spotify...
data-staging.niaid.nih.gov
data.niaid.nih.gov
+1more
Updated Jan 24, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pinter, Anthony T.; Paul, Jacob M.; Jessie Smith; Brubaker, Jed R. (2020). P4KxSpotify: A Dataset of Pitchfork Music Reviews and Spotify Musical Features [Dataset]. https://data-staging.niaid.nih.gov/resources?id=zenodo_3603329
Explore at:
Dataset updated
Jan 24, 2020
Dataset provided by
University of Colorado Boulder
Authors
Pinter, Anthony T.; Paul, Jacob M.; Jessie Smith; Brubaker, Jed R.
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
18,403 music reviews scraped from Pitchfork, including relevant metadata such as author, review date, record release year, score, and genre, along with those album's audio features pulled from Spotify's API.
T
Spotify | SPOT - Current Liabilities
tradingeconomics.com
csv, excel, json, xml
Updated Sep 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TRADING ECONOMICS (2025). Spotify | SPOT - Current Liabilities [Dataset]. https://tradingeconomics.com/spot:us:current-liabilities
Explore at:
xml, excel, csv, jsonAvailable download formats
Dataset updated
Sep 15, 2025
Dataset authored and provided by
TRADING ECONOMICS
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 2000 - Dec 2, 2025
Area covered
United States
Description
Spotify reported EUR6.24B in Current Liabilities for its fiscal quarter ending in September of 2025. Data for Spotify | SPOT - Current Liabilities including historical, tables and charts were last updated by Trading Economics this last December in 2025.

Facebook

Twitter

Click to copy link

Link copied

Cite

Piyush Papreja; Piyush Papreja (2021). Playlist2vec: Spotify Million Playlist Dataset [Dataset]. http://doi.org/10.5281/zenodo.5002584

Playlist2vec: Spotify Million Playlist Dataset

Explore at:

binAvailable download formats

Unique identifier

https://doi.org/10.5281/zenodo.5002584

Dataset updated

Jun 22, 2021

Dataset provided by

Zenodohttp://zenodo.org/

Authors

Piyush Papreja; Piyush Papreja

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

This dataset was created using Spotify developer API. It consists of user-created as well as Spotify-curated playlists.
The dataset consists of 1 million playlists, 3 million unique tracks, 3 million unique albums, and 1.3 million artists.
The data is stored in a SQL database, with the primary entities being songs, albums, artists, and playlists.
Each of the aforementioned entities are represented by unique IDs (Spotify URI).
Data is stored into following tables:

album
artist
track
playlist
track_artist1
track_playlist1

album

| id | name | uri |

id: Album ID as provided by Spotify
name: Album Name as provided by Spotify
uri: Album URI as provided by Spotify

artist

| id | name | uri |

id: Artist ID as provided by Spotify
name: Artist Name as provided by Spotify
uri: Artist URI as provided by Spotify

track

id: Track ID as provided by Spotify
name: Track Name as provided by Spotify
duration: Track Duration (in milliseconds) as provided by Spotify
popularity: Track Popularity as provided by Spotify
explicit: Whether the track has explicit lyrics or not. (true or false)
preview_url: A link to a 30 second preview (MP3 format) of the track. Can be null
uri: Track Uri as provided by Spotify
album_id: Album Id to which the track belongs

playlist

| id | name | followers | uri | total_tracks |

id: Playlist ID as provided by Spotify
name: Playlist Name as provided by Spotify
followers: Playlist Followers as provided by Spotify
uri: Playlist Uri as provided by Spotify
total_tracks: Total number of tracks in the playlist.

track_artist1

| track_id | artist_id |

Track-Artist association table

track_playlist1

| track_id | playlist_id |

Track-Playlist association table

- - - - - SETUP - - - - -

The data is in the form of a SQL dump. The download size is about 10 GB, and the database populated from it comes out to about 35GB.

spotifydbdumpschemashare.sql contains the schema for the database (for reference):
spotifydbdumpshare.sql is the actual data dump.

Setup steps:
1. Create database

- - - - - PAPER - - - - -

The description of this dataset can be found in the following paper:

Papreja P., Venkateswara H., Panchanathan S. (2020) Representation, Exploration and Recommendation of Playlists. In: Cellier P., Driessens K. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019. Communications in Computer and Information Science, vol 1168. Springer, Cham

Clear search

Close search

Google apps

Main menu

Playlist2vec: Spotify Million Playlist Dataset

🎧 Spotify Global Streaming Data (2024)

📊 About the Dataset

Spotify Dataset

Spotify dataset

Description for Spotify Songs Dataset on Kaggle

Dataset Title: Spotify Songs Dataset

spotify data

My Spotify Data - Cleaned

The dataset contains records of the user's Spotify streaming history, with each row representing a specific instance of a played track. The data includes various attributes providing insights into the user's music listening habits.

Columns:

Purpose:

Data from: MusicOSet: An Enhanced Open Dataset for Music Data Mining

Data from: Spotify Playlists Dataset

Spotify Tracks Dataset

spotify-million-song-dataset

Data from: Spotify Playlists

World's Spotify TOP-50 playlist musicality data

Columns

Spotify dataset

Spotify Playlist ORIGINS Dataset

Spotify users in the U.S. 2018, by age

My Spotify Data

spotify.com Traffic Analytics Data

Spotify - Beyoncé's Track Data

Dataset Description:

Scope of the Data:

Data Collection and Processing:

Attribute Descriptions:

Possible Data Projects:

Disclaimer and Responsible Use:

Data from: P4KxSpotify: A Dataset of Pitchfork Music Reviews and Spotify...

Spotify | SPOT - Current Liabilities

Playlist2vec: Spotify Million Playlist Dataset