43 datasets found
  1. 600 Billboard Hot 100 Tracks (with Spotify Data)

    • kaggle.com
    zip
    Updated Aug 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Bumpkin (2024). 600 Billboard Hot 100 Tracks (with Spotify Data) [Dataset]. https://www.kaggle.com/datasets/thebumpkin/600-billboard-hot-100-tracks-with-spotify-data
    Explore at:
    zip(31522 bytes)Available download formats
    Dataset updated
    Aug 23, 2024
    Authors
    The Bumpkin
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset offers a comprehensive glimpse into the evolution of contemporary music, featuring 620 tracks from 87 artists who have dominated the charts between 2000 and 2023. Representing the pulse of modern pop and R&B, this collection captures the diversity and dynamism of the Hot 100 hits over the past two decades. Each track is meticulously annotated with Spotify's audio features, providing a rich, data-driven perspective on the sonic characteristics that have shaped the soundscape of the 21st century. From tempo to energy levels, and from danceability to valence, this dataset is a treasure trove for anyone looking to explore the trends and transformations in popular music.

  2. Top 100 Billboard

    • kaggle.com
    zip
    Updated Sep 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sujay Kapadnis (2023). Top 100 Billboard [Dataset]. https://www.kaggle.com/datasets/sujaykapadnis/top-100-billboard
    Explore at:
    zip(28541119 bytes)Available download formats
    Dataset updated
    Sep 25, 2023
    Authors
    Sujay Kapadnis
    Description

    The data this week comes from Data.World by way of Sean Miller, Billboard.com and Spotify.

    Billboard Top 100 - Wikipedia

    The Billboard Hot 100 is the music industry standard record chart in the United States for songs, published weekly by Billboard magazine. Chart rankings are based on sales (physical and digital), radio play, and online streaming in the United States.

    Billboard Top 100 Article

    Drake rewrites the record for the most entries ever on the Billboard Hot 100, as he lands his 208th career title on the latest list, dated March 21

    Data Dictionary

    billboard.csv

    variableclassdescription
    urlcharacterBillboard Chart URL
    week_idcharacterWeek ID
    week_positiondoubleWeek position 1: 100
    songcharacterSong name
    performercharacterPerformer name
    song_idcharacterSong ID, combo of song/singer
    instancedoubleInstance (this is used to separate breaks on the chart for a given song. Example, an instance of 6 tells you that this is the sixth time this song has appeared on the chart)
    previous_week_positiondoublePrevious week position
    peak_positiondoublePeak position as of that week
    weeks_on_chartdoubleWeeks on chart as of that week

    audio_features.csv

    variableclassdescription
    song_idcharacterSong ID
    performercharacterPerformer name
    songcharacterSong
    spotify_genrecharacterGenre
    spotify_track_idcharacterTrack ID
    spotify_track_preview_urlcharacterSpotify URL
    spotify_track_duration_msdoubleDuration in ms
    spotify_track_explicitlogicalIs explicit
    spotify_track_albumcharacterAlbum name
    danceabilitydoubleDanceability describes how suitable a track is for dancing based on a combination of musical elements including tempo, rhythm stability, beat strength, and overall regularity. A value of 0.0 is least danceable and 1.0 is most danceable.
    energydoubleEnergy is a measure from 0.0 to 1.0 and represents a perceptual measure of intensity and activity. Typically, energetic tracks feel fast, loud, and noisy. For example, death metal has high energy, while a Bach prelude scores low on the scale. Perceptual features contributing to this attribute include dynamic range, perceived loudness, timbre, onset rate, and general entropy.
    keydoubleThe estimated overall key of the track. Integers map to pitches using standard Pitch Class notation . E.g. 0 = C, 1 = C♯/D♭, 2 = D, and so on. If no key was detected, the value is -1.
    loudnessdoubleThe overall loudness of a track in decibels (dB). Loudness values are averaged across the entire track and are useful for comparing relative loudness of tracks. Loudness is the quality of a sound that is the primary psychological correlate of physical strength (amplitude). Values typical range between -60 and 0 db.
    modedoubleMode indicates the modality (major or minor) of a track, the type of scale from which its melodic content is derived. Major is represented by 1 and minor is 0.
    speechinessdoubleSpeechiness detects the presence of spoken words in a track. The more exclusively speech-like the recording (e.g. talk show, audio book, poetry), the closer to 1.0 the attribute value. Values above 0.66 describe tracks that are probably made entirely of spoken words. Values between 0.33 and 0.66 describe tracks that may contain both music and speech, either in sections or layered, including such cases as rap music. Values below 0.33 most likely represent music and other non-speech-like tracks.
    acousticnessdoubleA confidence measure from 0.0 to 1.0 of whether the track is acoustic. 1.0 represents high confidence the track is acoustic.
    instrumentalnessdoublePredicts whether a track contains no vocals. "Ooh" and "aah" sounds are treated as instrumental in this context. Rap or spoken word tracks are clearly "vocal". The closer the instrumentalness value is to 1.0, the greater likelihood the track contains no vocal content. Values above 0.5 are intended to represent instrumental tracks, but confidence is higher as the value approaches 1.0.
    livenessdoubleDetects the presence of an audience in the recording. Higher liveness values represent an increased probability that t...
  3. Data from: MusicOSet: An Enhanced Open Dataset for Music Data Mining

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    bin, zip
    Updated Jun 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mariana O. Silva; Mariana O. Silva; Laís Mota; Mirella M. Moro; Mirella M. Moro; Laís Mota (2021). MusicOSet: An Enhanced Open Dataset for Music Data Mining [Dataset]. http://doi.org/10.5281/zenodo.4904639
    Explore at:
    zip, binAvailable download formats
    Dataset updated
    Jun 7, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Mariana O. Silva; Mariana O. Silva; Laís Mota; Mirella M. Moro; Mirella M. Moro; Laís Mota
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    MusicOSet is an open and enhanced dataset of musical elements (artists, songs and albums) based on musical popularity classification. Provides a directly accessible collection of data suitable for numerous tasks in music data mining (e.g., data visualization, classification, clustering, similarity search, MIR, HSS and so forth). To create MusicOSet, the potential information sources were divided into three main categories: music popularity sources, metadata sources, and acoustic and lyrical features sources. Data from all three categories were initially collected between January and May 2019. Nevertheless, the update and enhancement of the data happened in June 2019.

    The attractive features of MusicOSet include:

    • Integration and centralization of different musical data sources
    • Calculation of popularity scores and classification of hits and non-hits musical elements, varying from 1962 to 2018
    • Enriched metadata for music, artists, and albums from the US popular music industry
    • Availability of acoustic and lyrical resources
    • Unrestricted access in two formats: SQL database and compressed .csv files
    |    Data    | # Records |
    |:-----------------:|:---------:|
    | Songs       | 20,405  |
    | Artists      | 11,518  |
    | Albums      | 26,522  |
    | Lyrics      | 19,664  |
    | Acoustic Features | 20,405  |
    | Genres      | 1,561   |
  4. Billboard Hot weekly charts

    • kaggle.com
    zip
    Updated Dec 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). Billboard Hot weekly charts [Dataset]. https://www.kaggle.com/datasets/thedevastator/billboard-hot-100-audio-features/data
    Explore at:
    zip(14075422 bytes)Available download formats
    Dataset updated
    Dec 4, 2023
    Authors
    The Devastator
    Description

    Billboard Hot weekly charts

    Billboard Hot 100 Weekly Charts with Spotify Audio Features

    By Sean Miller [source]

    About this dataset

    The Billboard Hot 100 Weekly Charts with Audio dataset is a comprehensive collection that combines the historical data of the Billboard Hot 100 weekly singles charts with detailed audio features extracted from Spotify. The dataset provides valuable insights into the popularity and musical attributes of songs that have appeared on the Billboard charts.

    The primary dataset, Hot Stuff.csv, includes information about each song's position on the weekly charts. It contains columns such as the Billboard Chart URL, WeekID, Song name, Performer name, unique SongID (concatenation of song and performer), Current week on chart, Instance (indicating breaks in chart appearances), Previous week position, Peak Position (highest chart position reached), and Weeks on Chart.

    The second dataset, Hot 100 Audio Features.csv, provides in-depth audio features of each song sourced from Spotify's Web API. This includes various metrics such as danceability (suitability for dancing based on musical elements), energy level (intensity and activity), key (musical key signature), loudness (overall volume level in decibels dB), mode (major or minor key), speechiness rating (presence of spoken words in songs), acousticness rating (acoustic quality measure), instrumentalness rating (likelihood of a song being instrumental), liveness rating (presence of a live audience during recording/performance) valence rating(musical positiveness conveyed by a song). Additionally it provides tempo in BPM and time signature(e.g., 4/4 -the rhythm pattern).

    Furthermore , this comprehensive dataset encompasses Spotify-related features: track preview URL for audio samples before full streaming or purchase decisions; total duration measured in milliseconds; explicit content indication; album details for songs; genre details provided by Spotify.

    With this combined data set, researchers can analyze trends and patterns over time regarding how different audio features relate to a song's popularity and performance on the Billboard Hot 100. It offers endless possibilities for studying the influence of specific music attributes on commercial success and understanding the preferences of popular music audiences.

    Whether you are interested in exploring genre-based trends, discovering correlations between chart positions and audio features, or investigating how certain attributes contribute to a song's longevity on the charts, this dataset serves as a valuable resource for deep analysis and insights into Billboard Hot 100 songs

    How to use the dataset

    • Understanding the Datasets:

      • The dataset consists of two files: Hot Stuff.csv and Hot 100 Audio Features.csv.
      • The Hot Stuff.csv file contains the weekly Hot 100 singles chart data, including song names, performer names, chart positions, and other relevant information.
      • The Hot 100 Audio Features.csv file contains detailed audio features for each song extracted from Spotify, such as danceability, energy, instrumentalness, etc.
      • Both files can be merged using common attributes like Performer and Song to get a combined view of both datasets.
    • Exploring the Hot Stuff.csv File:

      • This file provides information about each song's position on that week's Hot 100 singles chart.
      • Important columns in this file are:
        • WeekID: The week identifier.
        • Song name: The name of the song.
        • Performer name: The name of the performer or artist.
        • Current week on chart: Represents how many weeks the song has been on the chart at that particular point in time.
        • Instance: Indicates whether it is a separate entry for an already listed song (for example, an instance value of 6 means it appeared for the sixth time).
        • Previous week position: The position of the song on the previous week's chart.
        • Peak Position: The highest position reached by a particular song on any given week's chart.
        • Weeks on Chart: Represents how many weeks a specific entry has spent on the chart so far.
    • Exploring the Hot 100 Audio Features.csv File:

      • This file provides detailed audio features for each song extracted from Spotify using the Spotify Web API.
      • It contains attributes like danceability, energy, instrumentalness, tempo, etc., which help capture different aspects of the song's musical characteristics.
      • Important columns in this file are:
        • Performer: The name of the performer or artist of the song.
        • Song: The name of the song.
        • spotify_genre: The genre(s) of the song according to Spotify....
  5. Billboard Hot 100(1958-2024)

    • kaggle.com
    zip
    Updated Jun 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Elizabeth Earhart (2024). Billboard Hot 100(1958-2024) [Dataset]. https://www.kaggle.com/datasets/elizabethearhart/billboard-hot-1001958-2024
    Explore at:
    zip(3359712 bytes)Available download formats
    Dataset updated
    Jun 7, 2024
    Authors
    Elizabeth Earhart
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    I downloaded this dataset from a UT Austin GitHub repository you can find here

    It contains the billboard hot 100 charts from 1958-2024.

  6. billboards_dataset

    • kaggle.com
    zip
    Updated Mar 31, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Saurav Sengupta (2020). billboards_dataset [Dataset]. https://www.kaggle.com/sausen7/billboards-dataset
    Explore at:
    zip(185887 bytes)Available download formats
    Dataset updated
    Mar 31, 2020
    Authors
    Saurav Sengupta
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Data of the Billboards Hot 100 chart for about 67 weeks using a Scrapy spider and BeautifulSoup.

    To get more data use the following spider: https://github.com/ssen7/billboards-crawler

    Requirements: Python3, scrapy, bs4

    Content

    Each file contains the Billboard Chart Hot 100 songs of that week including the artist's name, previous week's rank, change in rank (default) and peak rank.

    Acknowledgements

    Billboard Charts: https://www.billboard.com/charts/hot-100/ scrapy team and BeautifulSoup

    Inspiration

    To build a tool to gather chart data for pop songs that can crawl data according to user specifications.

    A secondary goal was to have a dataset that can track an artist's performance (to the extent a song on the Billboards Hot 100 can) across years.

  7. Main Dataset for "Evolution of Popular Music: USA 1960–2010"

    • figshare.com
    txt
    Updated Jan 19, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthias Mauch (2016). Main Dataset for "Evolution of Popular Music: USA 1960–2010" [Dataset]. http://doi.org/10.6084/m9.figshare.1309953.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jan 19, 2016
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Matthias Mauch
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is a large file (~20MB) called EvolutionPopUSA_MainData.csv, in comma-separated data format with column headers. Each row corresponds to a recording. The file is viewable in any text editor, and can also be opened in Excel or imported to other data processing programs. Below is a list of the column headers, with annotations. public_idunique ID of the recording artist_namename of the recording artist artist_name_cleanartist name all upper case, no spaces, with secondary artists ("featuring") removed. track_namename of the track, i.e. usually name of the song first_entrydate of the first entry into the Billboard Hot 100 quarter, year, fiveyear, decadetransformations of first_entry to coarser time periods eraera the track belongs to (1,...,4), as determined by Foote segmentation on the PC data (see below) clustercluster membership of the track, as derived by k-means clustering on the PC data (see below) hTopic_01, ... , hTopic_08harmonic Topic weights, see description in the paper tTopic_01, ... , tTopic_08timbral Topic weights, see description in the paper PC1, ... , PC14principal components of the harmonic and timbral Topics harm_…193 columns of chord change counts; the chord change is indicated in the column label (e.g. harm_M.2.M means major chord followed by another major chord 2 semitones up). timb_01, ... , timb_3535 columns of timbre class counts (see description in supplementary information)

  8. Billboard Hot 100 & more

    • kaggle.com
    zip
    Updated Oct 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Godefroy Lambert (2025). Billboard Hot 100 & more [Dataset]. https://www.kaggle.com/datasets/ludmin/billboard/code
    Explore at:
    zip(17064341 bytes)Available download formats
    Dataset updated
    Oct 22, 2025
    Authors
    Godefroy Lambert
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    🎵 Billboard Historical Charts (Auto-Updated Weekly)

    A curated, long-horizon collection of major Billboard music charts — from their beginnings (as far back as publicly accessible) up to the present. Data is refreshed every Wednesday at 02:00 (server time) so you always have up-to-date rankings for analysis, trending, and music data science projects.

    📦 Included Chart Files

    Each file is a time series of weekly chart positions:

    FileChartType
    billboard200.csvBillboard 200Albums
    hot100.csvHot 100Songs (flagship singles chart)
    radio.csvRadio SongsAirplay-driven ranking
    streaming_songs.csvStreaming SongsStreaming activity
    digital_songs.csvDigital Song SalesDownload activity

    📑 Data Fields

    All chart files share a consistent schema:

    ColumnDescription
    dateChart week (YYYY-MM-DD; represents the chart issue date)
    titleSong (or album) title
    artistPrimary credited artist(s)
    rankCurrent chart position for that week
    last_weekPosition in the previous published week (may be blank if new)
    peak_posBest (lowest number) rank achieved to date
    weeks_on_chartTotal number of charting weeks up to and including this row
    image_urlArtwork URL when available (see Notes)

    🔄 Update Schedule

    New data is added weekly:
    Every Wednesday at 02:00 (automated scraping + ingestion pipeline).
    Missed a week? Older weeks are retained, so you can still build complete time series.

    🛠 Data Formatting Notes

    • Embedded separators: Some titles and artist lists contain commas. To avoid CSV parsing conflicts:
      • Song titles containing commas → internal commas replaced by semicolons ;
      • Multiple artists → separated by pipe |
    • Missing artwork: If Billboard provides no image, the image_url field is set to #.

    💡 Use Cases

    • Tracking the evolution of music trends over time
    • Analyzing the popularity of artists and genres
    • Building data visualizations or dashboards
    • Creating machine learning models for hit prediction

    🛠 Related Project on GitHub

    This dataset is maintained by an automated pipeline available as open-source here:
    🔗 Billboard Scraper GitHub Repository

    The project includes: - A weekly Airflow DAG to scrape and upload fresh data - Backup manual scraping scripts - Configurable settings - Code to push updates directly to Kaggle

    Feel free to explore it, fork it, or contribute improvements!

    🤝 Contributions & Feedback

    If you have suggestions for additional charts or improvements, feel free to reach out or share your ideas in the Kaggle discussion section.

  9. Merged data

    • kaggle.com
    zip
    Updated Nov 11, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CH. Nithin Chakravarthy (2021). Merged data [Dataset]. https://www.kaggle.com/chnithinchakravarthy/merged-data
    Explore at:
    zip(36696 bytes)Available download formats
    Dataset updated
    Nov 11, 2021
    Authors
    CH. Nithin Chakravarthy
    Description

    The Billboard Hot 100 is a chart that ranks the best-performing singles of the United States. Its data, published by Billboard magazine and compiled by Nielsen Sound Scan, is based collectively on each single's weekly physical and digital sales, as well as airplay and streaming. At the end of a year, Billboard will publish an annual list of the 100 most successful songs throughout that year on the Hot 100 chart based on the information.

    Billboard year end chart works These charts are a cumulative measure of a single or album's performance in the United States, based upon the Billboard magazine charts during any given chart year. Other factors including the total weeks a song spent on the chart and at its peak position were calculated into its year-end total.

    Billboard Hot is determined The Hot 100 is ranked by radio airplay audience impressions as measured by Nielsen BDS, sales data compiled by Nielsen Sound scan (both at retail and digitally) and streaming activity provided by online music sources. There are several component charts that contribute to the overall calculation of the Hot 100.

    The Billboard Global 200 is a weekly record chart published by Billboard magazine. The chart ranks the top songs globally and is based on digital sales and online streaming from over 200 territories worldwide.

    Stories about the Billboard 200 albums chart generally post on Sunday afternoons, while stories about the Billboard Hot 100 generally post each Monday afternoon. Other stories, podcasts, videos and more covering our full menu of charts post throughout the week.

    In the US, Billboard represents the cream of all the objective data. And their efforts to collect all the data from all these various sources to create an objective, final tally of each artist's popularity in a given week, still has merit. ... This is why the billboard chart is important.

    So here we had collected list of Billboard Hot 100 singles from the year 1992 to 2014.

  10. Data for "Evolution of Popular Music: USA 1960–2010"

    • figshare.com
    txt
    Updated Jan 19, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthias Mauch (2016). Data for "Evolution of Popular Music: USA 1960–2010" [Dataset]. http://doi.org/10.6084/m9.figshare.1401981.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jan 19, 2016
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Matthias Mauch
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Combines two data tables and a PDF with extensive information on which artists and tags strongly feature the T-Topics and H-Topics from the paper.

  11. Z

    Data from: MUHSIC: An Open Dataset with Temporal Musical Success Information...

    • data.niaid.nih.gov
    • data-staging.niaid.nih.gov
    Updated Oct 22, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabriel P. Oliveira; Gabriel R. G. Barbosa; Bruna C. Melo; Mariana O. Silva; Danilo B. Seufitelli; Anisio Lacerda; Mirella M. Moro (2021). MUHSIC: An Open Dataset with Temporal Musical Success Information [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4779002
    Explore at:
    Dataset updated
    Oct 22, 2021
    Dataset provided by
    Universidade Federal de Minas Gerais
    Authors
    Gabriel P. Oliveira; Gabriel R. G. Barbosa; Bruna C. Melo; Mariana O. Silva; Danilo B. Seufitelli; Anisio Lacerda; Mirella M. Moro
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Music is a volatile industry, where its dynamic nature can directly influence artist career behavior. That is, musical careers can suffer ups and downs depending on the current market moment. This dataset provides data about hot streak periods in musical careers, which are defined by high-impact bursts occurring in sequence.

    Success in the music industry has a temporal structure, as the audience tastes change over time. Here, we use the Billboard Hot 100 charts with Spotify data to represent success over time. For musical careers, we build their time series from the debut date (i.e., date of the first release obtained from Spotify) to the last chart collected. Thus, each point in the time series represents the success of such an artist in a given week, according to the Hot 100 chart.

    Therefore, we present MUHSIC (Music-oriented Hot Streak Information Collection), which contains:

    Charts: enhanced data on all weekly Hot 100 Charts

    Artists: artist success time series with hot streak information

    Genres: genre success time series with hot streak information (the genre is the aggregated of all its artists)

    Hot Streaks: summarized hot streak information

  12. Best-selling artists worldwide as of 2025

    • statista.com
    Updated Jul 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Best-selling artists worldwide as of 2025 [Dataset]. https://www.statista.com/statistics/271174/top-selling-artists-in-the-united-states/
    Explore at:
    Dataset updated
    Jul 18, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    Drake reigns supreme as the best-selling artist of all time worldwide, with an impressive 298.5 million certified units sold. This Canadian rapper has dominated the music industry, surpassing legendary acts like The Beatles and Elvis Presley. Drake's success reflects the changing landscape of popular music, with hip-hop and contemporary R&B artists now occupying top spots alongside rock and pop icons. Hip-hop's growing influence The rise of hip-hop is evident in the list of best-selling artists, with Drake, Eminem, and Kanye West all ranking in the top 10. This trend is further supported by recent Billboard chart data, which shows Drake as the top songwriter from 2012 to 2023, with 52 songs in the Billboard Top 100. The rapper's dominance extends to his performance as a solo artist, where he also leads with 52 songs in the Top 100 during the same period.

    Diversity and representation in music While male artists still dominate the best-selling list, female artists like Rihanna, Beyoncé, and Taylor Swift have secured high positions. This aligns with recent trends showing increased representation of women in popular music. A study found that 35 percent of artists featuring on songs in the top 100 charts between 2012 and 2023 were women, up from 30 percent in the previous year. The industry's evolving landscape is further exemplified by the 2024 Grammy nominations, where artists like Taylor Swift, Billie Eilish, and SZA received multiple nods, highlighting the growing recognition of diverse talent.

  13. Billboard "The Hot 100" Songs

    • kaggle.com
    zip
    Updated Nov 9, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dhruvil Dave (2021). Billboard "The Hot 100" Songs [Dataset]. https://www.kaggle.com/datasets/dhruvildave/billboard-the-hot-100-songs/data
    Explore at:
    zip(3198022 bytes)Available download formats
    Dataset updated
    Nov 9, 2021
    Authors
    Dhruvil Dave
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Content

    The Billboard Hot 100 is the music industry standard record chart in the United States for songs, published weekly by Billboard magazine. Chart rankings are based on sales, radio play, and online streaming in the United States.

    Every week, Billboard releases "The Hot 100" chart of songs that were trending on sales and airplay for that week. This dataset is a collection of all "The Hot 100" charts released since its inception in 1958.

    Starter Notebook + Basic EDA

    Acknowledgements

    Image credits: Photo by Stas Knop from Pexels

  14. Secondary Dataset (tags) for "Evolution of Popular Music: USA 1960–2010"

    • figshare.com
    • search.datacite.org
    txt
    Updated Jan 19, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthias Mauch (2016). Secondary Dataset (tags) for "Evolution of Popular Music: USA 1960–2010" [Dataset]. http://doi.org/10.6084/m9.figshare.1309950.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jan 19, 2016
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Matthias Mauch
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We also provide this additional file with genre tags for every song, which we used for validation. Here, too, the rows correspond to recordings. The data sets can be joined via the recording_id field.

  15. Z

    Comusic: Good things come to those who collaborate

    • data-staging.niaid.nih.gov
    • data.niaid.nih.gov
    Updated Jun 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mariana O. Silva; Laís Mota; Mirella M. Moro (2021). Comusic: Good things come to those who collaborate [Dataset]. https://data-staging.niaid.nih.gov/resources?id=zenodo_4904675
    Explore at:
    Dataset updated
    Jun 7, 2021
    Dataset provided by
    Universidade Federal de Minas Gerais
    Authors
    Mariana O. Silva; Laís Mota; Mirella M. Moro
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comusic is an ongoing project that seeks to study the impact of collaboration networks' topological features on musical success. To that end, we analyze and identify such characterizations in a musical success-based network; that is, a network composed only of successful artists. Our findings offer a new perspective on success in the music industry, unraveling how collaboration profiles can contribute to an artist's popularity.

    Our Methodology

    Initially, using data from Billboard and the Spotify platform, we model a "successful" collaborative network and apply tools of network science to study its structure. By means of topological metrics, we defined four categories of collaboration profiles and, applying a clustering algorithm, we identified three communities with different collaboration patterns and notable discrepancies in musical success levels. Then, we conduct a statistical correlation analysis to evaluate the correlation between collaboration profiles and the artist's success.

    Our Findings

    By detecting cluster and their respective patterns of network collaboration, we focus on analyzing the impact of these profiles on successful musical artists. Considering topological metrics, we define four main categories of collaboration profiles: Interaction, Distance, Influence and Similarity. Among them, we find that the first three affect musical success more intensely than Similarity.

    Our Contributions

    Our findings provide evidence that:

    there are indeed distinct success factors for music collaboration profiles that are socially measurable, and

    there exist common factors to successful collaboration in the music market.

    Furthermore, our exploratory approach based on collaborative networks can easily be extended to other areas of knowledge (e.g., arts and science).

    Files

    Successfull Network: The successful musical collaboration network. (8,88 MB)

    Billboard Charts: Some Billboard Charts data. (3,04 MB)

    Ego Networks: All the 30 ego networks. (38 KB)

    Time Series: All the time series. (807 KB)

  16. Billboard Year-End Hot 100 Singles USA

    • kaggle.com
    zip
    Updated Nov 10, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    philsong (2022). Billboard Year-End Hot 100 Singles USA [Dataset]. https://www.kaggle.com/datasets/liquidgenius1/billboard-yearend-hot-100-singles-usa
    Explore at:
    zip(105051 bytes)Available download formats
    Dataset updated
    Nov 10, 2022
    Authors
    philsong
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    United States
    Description

    This dataset is a csv that contains the Billboard top 100 year-end songs in the US. Not every year (particularly the earlier years) had 100 top songs. Some years include ties. Data goes from 1946-2021. Songs can appear in multiple years. Source: Wikipedia.

  17. Billboard Top Songs 🎶

    • kaggle.com
    zip
    Updated Mar 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samay Ashar (2025). Billboard Top Songs 🎶 [Dataset]. https://www.kaggle.com/datasets/samayashar/billboard-top-songs
    Explore at:
    zip(147043 bytes)Available download formats
    Dataset updated
    Mar 19, 2025
    Authors
    Samay Ashar
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    🎵 Hit Predictor 5000: Dataset Overview

    📌 Description

    This dataset contains 5,000 songs, blending real-world data from Spotify charts with synthetic entries to create a diverse mix of genres, artists, and attributes. Designed for machine learning models, it helps predict a song’s Peak Chart Position based on key musical and popularity metrics.

    📊 Features

    FeatureDescription
    SongTitle of the track
    ArtistName of the performer/band
    StreamsTotal number of streams (lifetime)
    Daily StreamsStreams per day
    GenreMusic genre (Pop, Hip-Hop, Rock, etc.)
    Release YearYear the song was released
    Peak PositionHighest Billboard/Spotify chart rank achieved
    Weeks on ChartTotal weeks spent on the chart
    Lyrics SentimentSentiment analysis of lyrics (-1 to +1)
    TikTok ViralityPopularity score based on TikTok trends (0-100)
    DanceabilityHow danceable the song is (0-1)
    AcousticnessLevel of acoustic elements (0-1)
    EnergyOverall energy level of the song (0-1)

    📈 Use Cases

    • Predicting a song’s peak position on the charts 📊
    • Analyzing how factors like TikTok virality impact ranking 🎶
    • Exploring trends across genres and time

    🔗 Optimized for machine learning & data visualization! 🚀

  18. d

    Replication Data for: Beyond Views: Measuring and Predicting Engagement in...

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wu, Siqi; Rizoiu, Marian-Andrei; Xie, Lexing (2023). Replication Data for: Beyond Views: Measuring and Predicting Engagement in Online Videos [Dataset]. http://doi.org/10.7910/DVN/L3UWZT
    Explore at:
    Dataset updated
    Nov 22, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Wu, Siqi; Rizoiu, Marian-Andrei; Xie, Lexing
    Description

    The dataset is first introduced in the following paper: Siqi Wu, Marian-Andrei Rizoiu, and Lexing Xie. Beyond Views: Measuring and Predicting Engagement in Online Videos. In AAAI International Conference on Weblogs and Social Media (ICWSM), 2018. Tweeted videos dataset This dataset contains YouTube videos published between July 1st and August 31st, 2016. To be collected, the video needs (a) be mentioned on Twitter during aforementioned collection period; (b) have insight statistics available; (c) have at least 100 views within the first 30 days after upload. Quality videos datasets These datasets contain videos deemed of high quality by domain experts. Vevo videos: Videos of verified Vevo artists, as of August 31st, 2016. Billboard16 videos: Videos of 2016 Billboard Hot 100 chart. Top news videos: Videos of top 100 most viewed News channels. freebase_mid_type_name.csv It maps a freebase mid to a real-world entity. See more details in this data description.

  19. s

    Shazam Research Dataset - Offsets (SRD-O)

    • purl.stanford.edu
    Updated Mar 30, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shazam Entertainment, Ltd. (2017). Shazam Research Dataset - Offsets (SRD-O) [Dataset]. https://purl.stanford.edu/fj396zz8014
    Explore at:
    Dataset updated
    Mar 30, 2017
    Authors
    Shazam Entertainment, Ltd.
    License

    Attribution-NonCommercial-NoDerivs 3.0 (CC BY-NC-ND 3.0)https://creativecommons.org/licenses/by-nc-nd/3.0/
    License information was derived automatically

    Description

    This dataset contains Shazam query timings ('offsets') and query dates corresponding to 20 hit songs from the Billboard Year End Hot 100 2015 chart. Queries were aggregated from 1 January 2014 to 31 May 2016, inclusive. Number of queries per song range from 3,020,785 to 19,974,795, with a total of 188,271,243 queries across the 20 songs. Data are stored in .csv files (one file per song) ranging in size from 62.9MB to 416.1MB. The total size of the dataset is around 4GB.

  20. Gender of producers in the music industry in the U.S. 2023

    • statista.com
    Updated Jun 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Gender of producers in the music industry in the U.S. 2023 [Dataset]. https://www.statista.com/statistics/801248/share-producer-music-industry-us-gender/
    Explore at:
    Dataset updated
    Jun 23, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    United States
    Description

    According to a study on representation and equality in the music industry, only *** percent of producers were female while approximately **** percent were male. The share of female music producers has been increasing since 2017, despite the setback in 2020 and still leaving a significant gap in terms of proportionate representation. Gender inequality in the music industry Even though music audiences are as diverse as ever, and recent data has also indicated that male and female listeners account for similar shares of digital music users in the United States, there are still significant gaps when it comes to the representation of different groups. The share of female songwriters across the top 100 songs in 2020 stood at below ** percent - a figure that has pretty much remained unchanged in the past decade. But this disparity not only unfolds behind the scenes: In 2020, just over ** percent of artists on Billboard’s top 100 charts were female, and in genres like hip-hop or alternative, this share was even lower. Grammy Awards The fact that the music industry remains a male-dominated landscape is also reflected in the Grammy Awards. While the show made headlines by merging male and female categories back in 2012, the imbalances have remained. Data on the gender distribution of Grammy nominees collected between 2013 and 2021 shows that less than ** percent of nominees for awards like Record of the Year, Album of the Year, and Producer of the Year were female. And even though the playing field was much more balanced in the Best New Artist category, many artists still fail to get the spotlight they deserve.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
The Bumpkin (2024). 600 Billboard Hot 100 Tracks (with Spotify Data) [Dataset]. https://www.kaggle.com/datasets/thebumpkin/600-billboard-hot-100-tracks-with-spotify-data
Organization logo

600 Billboard Hot 100 Tracks (with Spotify Data)

620 Tracks From 87 Artists Spanning 2000-2023

Explore at:
zip(31522 bytes)Available download formats
Dataset updated
Aug 23, 2024
Authors
The Bumpkin
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

This dataset offers a comprehensive glimpse into the evolution of contemporary music, featuring 620 tracks from 87 artists who have dominated the charts between 2000 and 2023. Representing the pulse of modern pop and R&B, this collection captures the diversity and dynamism of the Hot 100 hits over the past two decades. Each track is meticulously annotated with Spotify's audio features, providing a rich, data-driven perspective on the sonic characteristics that have shaped the soundscape of the 21st century. From tempo to energy levels, and from danceability to valence, this dataset is a treasure trove for anyone looking to explore the trends and transformations in popular music.

Search
Clear search
Close search
Google apps
Main menu