Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
The summary statistics by North American Industry Classification System (NAICS) which include: operating revenue (dollars x 1,000,000), operating expenses (dollars x 1,000,000), salaries wages and benefits (dollars x 1,000,000), and operating profit margin (by percent), of record production and integrated record production/distribution (NAICS 512210 & 512220), music publishers (NAICS 512230), sound recording studios (NAICS 512240), and other sound recording industries (NAICS 512290), annual, for five years of data.
The statistic provides data on favorite music genres among consumers in the United States as of July 2018, sorted by age group. According to the source, 52 percent of respondents aged 16 to 19 years old stated that pop music was their favorite music genre, compared to 19 percent of respondents aged 65 or above. Country music in the United States – additional information
In 2012, country music topped the list; 27.6 percent of respondents picked it among their three favorite genres. A year earlier, the result was one percent lower, which allowed classic rock to take the lead. The figures show, however, the genre’s popularity across the United States is unshakeable and it has also been spreading abroad. This could be demonstrated by the international success of (among others) Shania Twain or the second place the Dutch country duo “The Common Linnets” received in the Eurovision Song Contest in 2014, singing “Calm after the storm.”
The genre is also widely popular among American teenagers, earning the second place and 15.3 percent of votes in a survey in August 2012. The first place and more than 18 percent of votes was awarded to pop music, rock scored 13.1 percent and landed in fourth place. Interestingly, Christian music made it to top five with nine percent of votes. The younger generation is also widely represented among country music performers with such prominent names as Taylor Swift (born in 1989), who was the highest paid musician in 2015, and Hunter Hayes (born in 1991).
Country music is also able to attract crowds (and large sums of money) to live performances. Luke Bryan’s tour was the most successful tour in North America in 2016 based on ticket sales as almost 1.43 million tickets were sold for his shows. Fellow country singer, Garth Brooks, came second on the list, selling 1.4 million tickets for his tour in North America in 2016.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
MusicOSet is an open and enhanced dataset of musical elements (artists, songs and albums) based on musical popularity classification. Provides a directly accessible collection of data suitable for numerous tasks in music data mining (e.g., data visualization, classification, clustering, similarity search, MIR, HSS and so forth). To create MusicOSet, the potential information sources were divided into three main categories: music popularity sources, metadata sources, and acoustic and lyrical features sources. Data from all three categories were initially collected between January and May 2019. Nevertheless, the update and enhancement of the data happened in June 2019.
The attractive features of MusicOSet include:
Integration and centralization of different musical data sources
Calculation of popularity scores and classification of hits and non-hits musical elements, varying from 1962 to 2018
Enriched metadata for music, artists, and albums from the US popular music industry
Availability of acoustic and lyrical resources
Unrestricted access in two formats: SQL database and compressed .csv files
Data | # Records |
---|---|
Songs | 20,405 |
Artists | 11,518 |
Albums | 26,522 |
Lyrics | 19,664 |
Acoustic Features | 20,405 |
Genres | 1,561 |
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset was designed to evaluate the effectiveness of music education by collecting data on student performance, physiological information, engagement, and other metrics in a classroom setting, enhanced by Internet of Things (IoT) devices and Artificial Intelligence (AI) algorithms. The data simulates a learning environment where IoT devices track various music performance metrics, physiological responses, and behavioral patterns of students during music lessons. The dataset is structured to enable evaluation of the impact of music education, focusing on skill development, engagement, and performance outcomes.
Features: Student Information:
Student_ID: A unique identifier for each student (e.g., "S001", "S002"). Age: The age of the student (integer). Gender: The gender of the student, either "Male" or "Female". Class_Level: The level of music education (e.g., "Beginner", "Intermediate", "Advanced"). Music Performance Metrics:
Accuracy: A score representing the student's accuracy in reproducing the music correctly (percentage). Rhythm: A metric representing the student's ability to keep rhythm consistently (percentage). Tempo: The tempo of the student's performance, measured in beats per minute (BPM). Pitch_Accuracy: A score representing the accuracy of the pitch in the student's performance (percentage). Duration: The total time (in seconds) that the student spent playing during the lesson. Volume: The loudness level of the student's performance, measured in decibels (dB). Physiological Data:
Heart_Rate: The heart rate of the student during the lesson, measured in beats per minute (BPM). Blood_Pressure: The systolic blood pressure of the student, measured in mmHg. Stress_Level: A subjective stress level rating from 1 to 10, indicating how stressed the student felt during the lesson. Engagement and Behavioral Data:
Engagement_Level: A rating (1-10) representing the student's engagement level during the lesson. Focus_Time: The total time (in seconds) the student was focused on the task. Behavioral_Patterns: A categorical variable indicating the level of distraction during the lesson: 0: No distractions 1: Mild distractions 2: Heavy distractions Learning Outcomes (Target Variables):
Performance_Score: The overall performance score for the student, ranging from 60 to 100 (percentage). Skill_Development: A rating (1-5) representing the student’s development in musical skills, where: 1: Poor 2: Fair 3: Good 4: Very good 5: Excellent Engagement_Score: A numeric score representing the level of engagement in the lesson, ranging from 1 to 10. Lesson Information:
Lesson_Type: The type of lesson the student is participating in, either "Theory" or "Practical". Instrument_Type: The type of musical instrument used in the lesson (e.g., "Piano", "Guitar", "Violin"). Timestamps:
Timestamp: The timestamp of the lesson session, recorded to track when each data point was collected.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
MGD: Music Genre Dataset
Over recent years, the world has seen a dramatic change in the way people consume music, moving from physical records to streaming services. Since 2017, such services have become the main source of revenue within the global recorded music market.
Therefore, this dataset is built by using data from Spotify. It provides a weekly chart of the 200 most streamed songs for each country and territory it is present, as well as an aggregated global chart.
Considering that countries behave differently when it comes to musical tastes, we use chart data from global and regional markets from January 2017 to December 2019, considering eight of the top 10 music markets according to IFPI: United States (1st), Japan (2nd), United Kingdom (3rd), Germany (4th), France (5th), Canada (8th), Australia (9th), and Brazil (10th).
We also provide information about the hit songs and artists present in the charts, such as all collaborating artists within a song (since the charts only provide the main ones) and their respective genres, which is the core of this work. MGD also provides data about musical collaboration, as we build collaboration networks based on artist partnerships in hit songs. Therefore, this dataset contains:
This dataset was originally built for a conference paper at ISMIR 2020. If you make use of the dataset, please also cite the following paper:
Gabriel P. Oliveira, Mariana O. Silva, Danilo B. Seufitelli, Anisio Lacerda, and Mirella M. Moro. Detecting Collaboration Profiles in Success-based Music Genre Networks. In Proceedings of the 21st International Society for Music Information Retrieval Conference (ISMIR 2020), 2020.
@inproceedings{ismir/OliveiraSSLM20,
title = {Detecting Collaboration Profiles in Success-based Music Genre Networks},
author = {Gabriel P. Oliveira and
Mariana O. Silva and
Danilo B. Seufitelli and
Anisio Lacerda and
Mirella M. Moro},
booktitle = {21st International Society for Music Information Retrieval Conference}
pages = {726--732},
year = {2020}
}
This statistic shows the public opinion on the music genres which are most representative of America today in the United States as of May 2018, by ethnicity. During the survey, 54 percent of White respondents stated that they considered country music to be representative of modern America.
https://market.biz/privacy-policyhttps://market.biz/privacy-policy
Introduction
Music Streaming Statistics: The music streaming sector has significantly transformed how people discover and enjoy music, providing seamless, on-demand access across various digital devices. As the industry moves away from physical formats and downloads toward cloud-based services, streaming has emerged as the primary method of music consumption worldwide.
Its continued expansion is fueled by mobile convenience, curated content, and intelligent recommendation algorithms that boost user interaction. Streaming platforms are also introducing features such as social sharing, live audio, and tailored user experiences to strengthen engagement and loyalty.
Moreover, increasing smartphone adoption and improved internet infrastructure in developing regions are driving broader access to these services. Monitoring shifts in user behavior, platform usage, and demographic engagement is essential to understanding the evolving landscape of the music streaming industry.
https://electroiq.com/privacy-policyhttps://electroiq.com/privacy-policy
Amazon Music Statistics: Amazon Music is a popular music streaming platform, offering a variety of services, including Amazon Music Unlimited, Amazon Prime Music, and Amazon Music HD. As of 2023, Amazon Music boasts over 80 million songs in its catalog, providing a wide range of music options across genres. The service is available in more than 50 countries and is integrated with Amazon's smart devices, like Echo and Fire TV. Amazon Music Unlimited, the premium version of the service, offers access to an even larger selection of over 90 million songs. The platform also supports high-definition audio for subscribers of Amazon Music HD, with tracks available in lossless, CD-quality audio.
Amazon Music has seen steady growth, with recent reports suggesting that it has gained a significant share of the global streaming market, though it still trails behind competitors like Spotify and Apple Music. Additionally, Amazon Music offers personalized playlists and radio stations, enhancing the user experience through tailored recommendations. This article will discuss the important Amazon Music statistics and key trends.
https://brightdata.com/licensehttps://brightdata.com/license
Gain valuable insights into music trends, artist popularity, and streaming analytics with our comprehensive Spotify Dataset. Designed for music analysts, marketers, and businesses, this dataset provides structured and reliable data from Spotify to enhance market research, content strategy, and audience engagement.
Dataset Features
Track Information: Access detailed data on songs, including track name, artist, album, genre, and release date. Streaming Popularity: Extract track popularity scores, listener engagement metrics, and ranking trends. Artist & Album Insights: Analyze artist performance, album releases, and genre trends over time. Related Searches & Recommendations: Track related search terms and suggested content for deeper audience insights. Historical & Real-Time Data: Retrieve historical streaming data or access continuously updated records for real-time trend analysis.
Customizable Subsets for Specific Needs Our Spotify Dataset is fully customizable, allowing you to filter data based on track popularity, artist, genre, release date, or listener engagement. Whether you need broad coverage for industry analysis or focused data for content optimization, we tailor the dataset to your needs.
Popular Use Cases
Market Analysis & Trend Forecasting: Identify emerging music trends, genre popularity, and listener preferences. Artist & Label Performance Tracking: Monitor artist rankings, album success, and audience engagement. Competitive Intelligence: Analyze competitor music strategies, playlist placements, and streaming performance. AI & Machine Learning Applications: Use structured music data to train AI models for recommendation engines, playlist curation, and predictive analytics. Advertising & Sponsorship Insights: Identify high-performing tracks and artists for targeted advertising and sponsorship opportunities.
Whether you're optimizing music marketing, analyzing streaming trends, or enhancing content strategies, our Spotify Dataset provides the structured data you need. Get started today and customize your dataset to fit your business objectives.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides a list of lyrics from 1950 to 2019 describing music metadata as sadness, danceability, loudness, acousticness, etc. We also provide some informations as lyrics which can be used to natural language processing.
The audio data was scraped using Echo Nest® API integrated engine with spotipy Python’s package. The spotipy API permits the user to search for specific genres, artists,songs, release date, etc. To obtain the lyrics we used the Lyrics Genius® API as baseURL for requesting data based on the song title and artist name.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset was studied on Temporal Analysis and Visualisation of Music paper, in the following link:
https://sol.sbc.org.br/index.php/eniac/article/view/12155
This dataset provides a list of lyrics from 1950 to 2019 describing music metadata as sadness, danceability, loudness, acousticness, etc. We also provide some informations as lyrics which can be used to natural language processing.
The audio data was scraped using Echo Nest® API integrated engine with spotipy Python’s package. The spotipy API permits the user to search for specific genres, artists,songs, release date, etc. To obtain the lyrics we used the Lyrics Genius® API as baseURL for requesting data based on the song title and artist name.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is a collection of mel-spectrogram features extracted from Indian folk music containing the following 15 folk styles:
Bauls, Bhavageethe, Garba, Kajri, Maand, Sohar, Tamang Selo, Veeragase, Bhatiali, Bihu, Gidha, Lavani, Naatupura Paatu, Sufi, Uttarakhandi.
The number of recordings varies from 16 to 50 in the mentioned folk styles representing the scarcity of availability of given folk styles on the Internet. There are at least 4 artists and a maximum of 22. Overall there are 125 artists (34 female + 91 male) in these 15 folk styles.
There is a total of 606 recordings in the dataset, with a total duration of 54.45 hrs.
Mel-spectrogram is extracted from a 3-second segment with each song's 1/2 second sliding window. Extracted mel-spectrogram for each segment is annotated with folk_style, state, artist, gender, song, source, no_of_artists, folk_style_id, state_id, artist_id, gender_id.
_
This project was funded under the grant number: ECR/2018/000204 by the Science & Engineering Research Board (SERB).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
MuMu is a Multimodal Music dataset with multi-label genre annotations that combines information from the Amazon Reviews dataset and the Million Song Dataset (MSD). The former contains millions of album customer reviews and album metadata gathered from Amazon.com. The latter is a collection of metadata and precomputed audio features for a million songs.
To map the information from both datasets we use MusicBrainz. This process yields the final set of 147,295 songs, which belong to 31,471 albums. For the mapped set of albums, there are 447,583 customer reviews from the Amazon Dataset. The dataset have been used for multi-label music genre classification experiments in the related publication. In addition to genre annotations, this dataset provides further information about each album, such as genre annotations, average rating, selling rank, similar products, and cover image url. For every text review it also provides helpfulness score of the reviews, average rating, and summary of the review.
The mapping between the three datasets (Amazon, MusicBrainz and MSD), genre annotations, metadata, data splits, text reviews and links to images are available here. Images and audio files can not be released due to copyright issues.
MuMu dataset (mapping, metadata, annotations and text reviews)
Data splits and multimodal feature embeddings for ISMIR multi-label classification experiments
These data can be used together with the Tartarus deep learning library https://github.com/sergiooramas/tartarus.
NOTE: This version provides simplified files with metadata and splits.
Scientific References
Please cite the following papers if using MuMu dataset or Tartarus library.
Oramas, S., Barbieri, F., Nieto, O., and Serra, X (2018). Multimodal Deep Learning for Music Genre Classification, Transactions of the International Society for Music Information Retrieval, V(1).
Oramas S., Nieto O., Barbieri F., & Serra X. (2017). Multi-label Music Genre Classification from audio, text and images using Deep Features. In Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR 2017). https://arxiv.org/abs/1707.04916
The sales based on format of musical recordings for the record production and integrated record production and distribution industries, sound recording and music publishing (NAICS 512210 and 512220), for two years of data.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is built by using data from Spotify. It provides a daily chart of the 200 most streamed songs for each country and territory it is present, as well as an aggregated global chart.
Considering that countries behave differently when it comes to musical tastes, we use chart data from global and regional markets from January 2017 to March 2022 (downloaded from CSV files), considering 68 distinct markets.
We also provide information about the hit songs and artists present in the charts, such as all collaborating artists within a song (since the charts only provide the main ones) and their respective genres, which is the core of this work. MGD+ also provides data about musical collaboration, as we build collaboration networks based on artist partnerships in hit songs. Therefore, this dataset contains:
Genre Networks: Success-based genre collaboration networks
Artist Networks: Success-based artist collaboration networks
Artists: Some artist data
Hit Songs: Hit Song data and features
Charts: Enhanced data from Spotify Daily Top 200 Charts
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Music is a volatile industry, where its dynamic nature can directly influence artist career behavior. That is, musical careers can suffer ups and downs depending on the current market moment. This dataset provides data about hot streak periods in musical careers, which are defined by high-impact bursts occurring in sequence.
Success in the music industry has a temporal structure, as the audience tastes change over time. Here, we use the Billboard Hot 100 charts with Spotify data to represent success over time. For musical careers, we build their time series from the debut date (i.e., date of the first release obtained from Spotify) to the last chart collected. Thus, each point in the time series represents the success of such an artist in a given week, according to the Hot 100 chart.
Therefore, we present MUHSIC (Music-oriented Hot Streak Information Collection), which contains:
Charts: enhanced data on all weekly Hot 100 Charts
Artists: artist success time series with hot streak information
Genres: genre success time series with hot streak information (the genre is the aggregated of all its artists)
Hot Streaks: summarized hot streak information
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
This table contains 76 series, with data for years 2005 - 2011 (not all combinations necessarily have data for all years), and was last released on 2015-08-12. This table contains data described by the following dimensions (Not all combinations are available): Geography (6 items: Canada; Ontario; Quebec; Atlantic provinces ...), North American Industry Classification System (NAICS) (4 items: Record production and integrated record production/distribution; Sound recording studios; Other sound recording industries; Music publishers ...), Summary statistics (4 items: Operating revenue; Operating expenses; Operating profit margin; Salaries; wages and benefits ...).
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
( You can check how I used this dataset on my github repository )
I am basically a HUGE fan of music ( mostly French rap though with some exceptions but I love music ). And someday , while browsing stuff on Internet , I found the Spotify's API . I knew I had to use it when I found out you could get information like danceability about your favorite songs just with their id's.
https://user-images.githubusercontent.com/86613710/127216769-745ac143-7456-4464-bbe3-adc53872c133.png" alt="image">
Once I saw that , my machine learning instincts forced me to work on this project.
I collected 100 liked songs and 95 disliked songs
For those I like , I made a playlist of my favorite 100 songs. It is mainly French Rap , sometimes American rap , rock or electro music.
For those I dislike , I collected songs from various kind of music so the model will have a broader view of what I don't like
There is : - 25 metal songs ( Cannibal Corps ) - 20 " I don't like " rap songs ( PNL ) - 25 classical songs - 25 Disco songs
I didn't include any Pop song because I'm kinda neutral about it
From the Spotify's API "Get a playlist's Items" , I turned the playlists into json formatted data which cointains the ID and the name of each track ( ids/yes.py and ids/no.py ). NB : on the website , specify "items(track(id,name))" in the fields format , to avoid being overwhelmed by useless data.
With a script ( ids/ids_to_data.py ) , I turned the json data into a long string with each ID separated with a comma.
Now I just had to enter the strings into the Spotify API "Get Audio Features from several tracks" and get my data files ( data/good.json and data/dislike.json )
From Spotify's API documentation :
https://brightdata.com/licensehttps://brightdata.com/license
Unlock powerful insights with our custom music datasets, offering access to millions of records from popular music platforms like Spotify, SoundCloud, Amazon Music, YouTube Music, and more. These datasets provide comprehensive data points such as track titles, artists, albums, genres, release dates, play counts, playlist details, popularity scores, user-generated tags, and much more, allowing you to analyze music trends, listener behavior, and industry patterns with precision. Use these datasets to optimize your music strategies by identifying trending tracks, analyzing artist performance, understanding playlist dynamics, and tracking audience preferences across platforms. Gain valuable insights into streaming habits, regional popularity, and emerging genres to make data-driven decisions that enhance your marketing campaigns, content creation, and audience engagement. Whether you’re a music producer, marketer, data analyst, or researcher, our music datasets empower you with the data needed to stay ahead in the ever-evolving music industry. Available in various formats such as JSON, CSV, and Parquet, and delivered via flexible options like API, S3, or email, these datasets ensure seamless integration into your workflows.
Financial overview and grant giving statistics of Music In The Park Series
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
The summary statistics by North American Industry Classification System (NAICS) which include: operating revenue (dollars x 1,000,000), operating expenses (dollars x 1,000,000), salaries wages and benefits (dollars x 1,000,000), and operating profit margin (by percent), of record production and integrated record production/distribution (NAICS 512210 & 512220), music publishers (NAICS 512230), sound recording studios (NAICS 512240), and other sound recording industries (NAICS 512290), annual, for five years of data.