24 datasets found
  1. TMDB 5000 Movie Dataset

    • kaggle.com
    zip
    Updated Feb 8, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sarika (2025). TMDB 5000 Movie Dataset [Dataset]. https://www.kaggle.com/datasets/sarikaa9/tmdb-5000-movie-dataset
    Explore at:
    zip(1659098 bytes)Available download formats
    Dataset updated
    Feb 8, 2025
    Authors
    Sarika
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    What can we say about the success of a movie before it is released? Are there certain companies (Pixar?) that have found a consistent formula? Given that major films costing over $100 million to produce can still flop, this question is more important than ever to the industry. Film aficionados might have different interests. Can we predict which films will be highly rated, whether or not they are a commercial success?

    This is a great place to start digging in to those questions, with data on the plot, cast, crew, budget, and revenues of several thousand films.

  2. TMDB 5000 Movie Dataset

    • kaggle.com
    zip
    Updated Nov 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lĩnh Trần476 (2024). TMDB 5000 Movie Dataset [Dataset]. https://www.kaggle.com/datasets/lnhtrn476/tmdb-5000-movie-dataset
    Explore at:
    zip(9317430 bytes)Available download formats
    Dataset updated
    Nov 21, 2024
    Authors
    Lĩnh Trần476
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Lĩnh Trần476

    Released under Apache 2.0

    Contents

  3. IMDB 5000 Movie Dataset

    • kaggle.com
    zip
    Updated Dec 16, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yueming (2017). IMDB 5000 Movie Dataset [Dataset]. https://www.kaggle.com/datasets/carolzhangdc/imdb-5000-movie-dataset
    Explore at:
    zip(567524 bytes)Available download formats
    Dataset updated
    Dec 16, 2017
    Authors
    Yueming
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    Dataset

    This dataset was created by Yueming

    Released under Database: Open Database, Contents: Database Contents

    Contents

  4. TMDB 5000 Movies Dataset

    • kaggle.com
    zip
    Updated Jul 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad_Nauman_k (2025). TMDB 5000 Movies Dataset [Dataset]. https://www.kaggle.com/datasets/muhammadnaumank/tmdb-5000-movies-dataset
    Explore at:
    zip(9317430 bytes)Available download formats
    Dataset updated
    Jul 30, 2025
    Authors
    Muhammad_Nauman_k
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Muhammad_Nauman_k

    Released under CC0: Public Domain

    Contents

  5. TMDB 5000 Credits

    • kaggle.com
    zip
    Updated Aug 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adem Yıldız (2024). TMDB 5000 Credits [Dataset]. https://www.kaggle.com/datasets/ademylz/tmdb-5000-credits
    Explore at:
    zip(7658354 bytes)Available download formats
    Dataset updated
    Aug 13, 2024
    Authors
    Adem Yıldız
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    This file contains detailed credit information on cast and crew members for more than 5,000 movies available on The Movie Database (TMDb). The data covers the names and roles of actors, directors, writers and other key crew members in each movie. It provides a comprehensive resource for film industry analysis and cinema history studies.

  6. TMDB 5000 Movie Dataset with Ratings

    • kaggle.com
    zip
    Updated Jan 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aayush Soni (2024). TMDB 5000 Movie Dataset with Ratings [Dataset]. https://www.kaggle.com/datasets/aayushsoni4/tmdb-5000-movie-dataset-with-ratings
    Explore at:
    zip(195820415 bytes)Available download formats
    Dataset updated
    Jan 29, 2024
    Authors
    Aayush Soni
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    TMDB 5000 Movie Dataset with Ratings

    Overview:

    Welcome to the TMDB 5000 Movie Dataset with Ratings, a comprehensive collection that merges the original TMDb 5000 Movie Dataset with additional user ratings. This dataset offers an extensive exploration of the cinematic world, providing valuable insights for data enthusiasts, researchers, and machine learning practitioners.

    Datasets Included:

    1. tmdb_movie_dataset:

      • Columns:
        • budget: The budget allocated for the movie.
        • genres: Genres associated with the movie.
        • homepage: The homepage URL of the movie.
        • tmdbId: The unique identifier assigned by TMDb.
        • keywords: Keywords or tags related to the movie.
        • original_language: The original language of the movie.
        • original_title: The original title of the movie.
        • overview: A brief overview or synopsis of the movie.
        • popularity: Popularity score of the movie.
        • production_companies: Companies involved in the movie's production.
        • production_countries: Countries where the movie was produced.
        • release_date: The date when the movie was released.
        • revenue: The revenue generated by the movie.
        • runtime: The duration of the movie in minutes.
        • spoken_languages: Languages spoken in the movie.
        • status: The production status of the movie.
        • tagline: A tagline associated with the movie.
        • title: The title of the movie.
        • vote_average: Average user rating.
        • vote_count: The number of votes the movie received.
        • ratingId: Unique identifier for ratings.
    2. tmdb_movie_credits:

      • Columns:
        • tmdbId: The unique identifier assigned by TMDb.
        • title: The title of the movie.
        • cast: Cast members of the movie.
        • crew: Crew members involved in the movie.
    3. tmdb_movie_ratings:

      • Columns:
        • userId: Unique identifier for users.
        • ratingId: Unique identifier for ratings.
        • rating: User rating for the movie.
        • timestamp: The timestamp when the rating was given.

    Key Features:

    1. Comprehensive Movie Details: Explore detailed information about 5000 movies, including budget, genres, production details, and more.
    2. User Ratings: Gain insights into audience reception with user ratings, facilitating sentiment analysis and audience preferences.
    3. Cast and Crew Information: Delve into the cast and crew details for each movie, providing a comprehensive understanding of the creative forces behind the scenes.

    Potential Use Cases:

    • Predictive Modeling: Develop machine learning models to predict user ratings based on various movie attributes.
    • Exploratory Data Analysis (EDA): Uncover trends and patterns in movie-related features, exploring correlations between budget, revenue, and user ratings.
    • Content Recommendation: Leverage user ratings to build recommendation systems for personalized movie suggestions.

    How to Use:

    • Merge datasets using common keys (TMDb ID, Rating ID) to unleash the full potential of this comprehensive dataset.
    • Apply data cleaning and preprocessing techniques to prepare the dataset for analysis or model building.
    • Utilize visualization tools and statistical techniques for in-depth exploratory data analysis.

    Acknowledgments:

    This dataset is a curated compilation, merging the original TMDb 5000 Movie Dataset with additional user ratings to provide a comprehensive resource for the data science and machine learning community. We express our gratitude to the TMDb community for their valuable contributions.

    Feedback and Contributions:

    We welcome feedback and contributions to enhance the dataset. Connect, collaborate, and contribute to make this resource even more valuable for the community.

    Explore the cinematic universe through data with the TMDB 5000 Movie Dataset with Ratings!

  7. TMDB 5000+ Movies dataset

    • kaggle.com
    zip
    Updated Feb 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sumit kr (2023). TMDB 5000+ Movies dataset [Dataset]. https://www.kaggle.com/datasets/sumitkkr/tmdb-movies-dataset
    Explore at:
    zip(1251127 bytes)Available download formats
    Dataset updated
    Feb 1, 2023
    Authors
    sumit kr
    Description

    Dataset

    This dataset was created by sumit kr

    Contents

  8. Tmdb-5000-movie-Dataset

    • kaggle.com
    zip
    Updated Apr 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nitin Kharade (2023). Tmdb-5000-movie-Dataset [Dataset]. https://www.kaggle.com/datasets/nitinkharade/tmdb-5000-movie-dataset
    Explore at:
    zip(9317430 bytes)Available download formats
    Dataset updated
    Apr 21, 2023
    Authors
    Nitin Kharade
    Description

    Dataset

    This dataset was created by Nitin Kharade

    Contents

  9. TMDb 5000 Movie Metadata

    • kaggle.com
    zip
    Updated Nov 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MD RAIHAN ALI (2025). TMDb 5000 Movie Metadata [Dataset]. https://www.kaggle.com/datasets/raihan63/tmdb-5000-movie-metadata
    Explore at:
    zip(9317430 bytes)Available download formats
    Dataset updated
    Nov 23, 2025
    Authors
    MD RAIHAN ALI
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by MD RAIHAN ALI

    Released under CC0: Public Domain

    Contents

  10. Simplified TMDB movies

    • kaggle.com
    zip
    Updated Nov 24, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aivar Annamaa (2017). Simplified TMDB movies [Dataset]. https://www.kaggle.com/aivarannamaa/movies
    Explore at:
    zip(1242375 bytes)Available download formats
    Dataset updated
    Nov 24, 2017
    Authors
    Aivar Annamaa
    Description

    Overview

    This is a simplified version of TMDB 5000 Movie Dataset (https://www.kaggle.com/tmdb/tmdb-movie-metadata). See that dataset for more info.

    Changes compared to the original

    • removed rows with status other than "Released".
    • removed columns id, status, popularity
    • simplified following columns: genres, keywords, production_companies, production_countries, spoken_languages (replaced json-like structures with comma separated list of name attributes)

    Photo by Felix Mooneeram on Unsplash

  11. Movies Dataset TMDB

    • kaggle.com
    zip
    Updated Oct 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sreyan Sundaray (2023). Movies Dataset TMDB [Dataset]. https://www.kaggle.com/datasets/sreyansundaray/5000-movies-dataset-tmdb
    Explore at:
    zip(1456283 bytes)Available download formats
    Dataset updated
    Oct 1, 2023
    Authors
    Sreyan Sundaray
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    "Introducing the Ultimate Movie Database: Delve into the magic of cinema with our meticulously curated dataset, sourced directly from The Movie Database (TMDb) website. This comprehensive collection is a testament to the art of storytelling, featuring a vast array of films with rating, original language, Popularity etc. Our inspiration behind this dataset was to create a valuable resource for film enthusiasts, researchers, and data scientists, fostering a deeper understanding of movie trends, audience preferences, and industry evolution. Whether you're analyzing box office hits, exploring directorial styles, or uncovering hidden gems, this dataset opens the door to a world of cinematic exploration. Lights, camera, data – let the analysis begin!"

  12. TMDB top rated movie dataset

    • kaggle.com
    zip
    Updated Apr 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arijit Pal (2025). TMDB top rated movie dataset [Dataset]. https://www.kaggle.com/datasets/arijit5122/tmdb-top-rated-movie-dataset
    Explore at:
    zip(1404366 bytes)Available download formats
    Dataset updated
    Apr 24, 2025
    Authors
    Arijit Pal
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset features a comprehensive compilation of over 5,000 top-rated movies scraped from The Movie Database (TMDb) using their official API. TMDb is one of the largest open movie databases on the internet, trusted by millions of users and developers worldwide. This dataset focuses specifically on their top-rated films, ranked by a global audience based on average votes and vote counts.

    The dataset was generated programmatically using Python and includes 500 full pages of TMDb’s /movie/top_rated endpoint. Each page returns 20 movies, resulting in 10,000 entries, from which duplicates or low-vote entries can be filtered based on project needs. 🔍 What’s Inside?

    For each movie entry, the dataset includes:

    🎬 Title — The original title of the film
    
    🗓️ Release Date — The official release date
    
    🌐 Original Language — The ISO language code (e.g., en, fr, ja)
    
    📄 Overview — A short synopsis of the film's plot
    
    ⭐ Vote Average — The average rating out of 10
    
    🗳️ Vote Count — Number of votes cast by TMDb users
    
    📈 Popularity — TMDb’s internal popularity score
    
    🆔 Movie ID — Unique TMDb identifier for the movie
    

    💡 Why Use This Dataset?

    This dataset is ideal for a wide range of projects in:

    📊 Data analysis (e.g., trends in top-rated movies over time)
    
    🧠 Machine learning & deep learning (e.g., recommendation systems)
    
    💬 Natural Language Processing (e.g., sentiment analysis on movie overviews)
    
    📈 Visualization (e.g., top genres, ratings by year/language)
    
    🎞️ Film industry insights (e.g., how vote count influences average rating)
    

    With a blend of metadata and user interaction data, it's a perfect dataset for anyone looking to combine storytelling with statistics. ✅ Highlights:

    Extracted using the TMDb API with robust error handling and pagination
    
    Clean format with no missing columns
    
    Ready for immediate use in Jupyter Notebooks, Kaggle kernels, or data pipelines
    
    Can be joined with external genre, actor, or production data via id
    

    Whether you're a film buff, a data scientist looking to build a movie recommender, or a developer training an NLP model — this dataset is your launchpad into the world of data-driven storytelling with cinema.

  13. TMDB 5000 movies details

    • kaggle.com
    zip
    Updated Feb 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SOURAV SAHOO (2024). TMDB 5000 movies details [Dataset]. https://www.kaggle.com/datasets/souravdatascience/tmdb-5000-movies-details/discussion
    Explore at:
    zip(9317430 bytes)Available download formats
    Dataset updated
    Feb 6, 2024
    Authors
    SOURAV SAHOO
    Description

    Dataset

    This dataset was created by SOURAV SAHOO

    Contents

  14. tmdb2023csv

    • kaggle.com
    zip
    Updated Feb 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ak30 (2023). tmdb2023csv [Dataset]. https://www.kaggle.com/datasets/baba30/tmdb2023csv
    Explore at:
    zip(440125 bytes)Available download formats
    Dataset updated
    Feb 1, 2023
    Authors
    Ak30
    Description

    The TMDB 5000 Movie Dataset is a database of information on over 5000 films including various features such as budget, revenue, cast, directors, production companies, and genre. It is a popular dataset for data analysis and machine learning projects, particularly for natural language processing and recommendation systems. The data is collected from The Movie Database (TMDb), a user-edited database of information on films, TV shows, and other media. The dataset includes information on both popular and lesser-known films and provides a comprehensive overview of the film industry. The data is available for public use, making it a great resource for both researchers and students to practice their data analysis and machine learning skills.

  15. tmdb_5000_movies

    • kaggle.com
    zip
    Updated Sep 30, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    yiyiyi (2018). tmdb_5000_movies [Dataset]. https://www.kaggle.com/datasets/zsx242030/tmdb-5000-movies
    Explore at:
    zip(1659098 bytes)Available download formats
    Dataset updated
    Sep 30, 2018
    Authors
    yiyiyi
    Description

    Dataset

    This dataset was created by yiyiyi

    Contents

  16. tmdb_5000_movies

    • kaggle.com
    Updated Jan 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Şükrü Yusuf Kaya (2024). tmdb_5000_movies [Dataset]. https://www.kaggle.com/datasets/kryusufkaya/tmdb-5000-movies
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 29, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Şükrü Yusuf Kaya
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Şükrü Yusuf Kaya

    Released under MIT

    Contents

  17. tmdb_5000_movies

    • kaggle.com
    zip
    Updated Feb 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chaitanya Sood (2024). tmdb_5000_movies [Dataset]. https://www.kaggle.com/datasets/chaitanyasood1/tmdb-5000-movies
    Explore at:
    zip(9317430 bytes)Available download formats
    Dataset updated
    Feb 14, 2024
    Authors
    Chaitanya Sood
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Chaitanya Sood

    Released under Apache 2.0

    Contents

  18. tmdb_5000_movies

    • kaggle.com
    zip
    Updated Sep 17, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marcelo Barbosa de Morais (2019). tmdb_5000_movies [Dataset]. https://www.kaggle.com/datasets/moraismcl/tmdb-5000-movies
    Explore at:
    zip(1659098 bytes)Available download formats
    Dataset updated
    Sep 17, 2019
    Authors
    Marcelo Barbosa de Morais
    Description

    Dataset

    This dataset was created by Marcelo Barbosa de Morais

    Contents

  19. Data from: Hollywood Movies Dataset

    • kaggle.com
    zip
    Updated Aug 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohd Muttalib (2022). Hollywood Movies Dataset [Dataset]. https://www.kaggle.com/mohdmuttalib/hollywood-movies-dataset
    Explore at:
    zip(9317390 bytes)Available download formats
    Dataset updated
    Aug 4, 2022
    Authors
    Mohd Muttalib
    Area covered
    Hollywood
    Description

    Background What can we say about the success of a movie before it is released? Are there certain companies (Pixar?) that have found a consistent formula? Given that major films costing over $100 million to produce can still flop, this question is more important than ever to the industry. Film aficionados might have different interests. Can we predict which films will be highly rated, whether or not they are a commercial success?

    This is a great place to start digging in to those questions, with data on the plot, cast, crew, budget, and revenues of several thousand films.

    Data Source Transfer Details Several of the new columns contain json. You can save a bit of time by porting the load data functions from this kernel.

    Even in simple fields like runtime may not be consistent across versions. For example, previous dataset shows the duration for Avatar's extended cut while TMDB shows the time for the original version.

    There's now a separate file containing the full credits for both the cast and crew.

    All fields are filled out by users so don't expect them to agree on keywords, genres, ratings, or the like. Your existing kernels will continue to render normally until they are re-run. If you are curious about how this dataset was prepared, the code to access TMDb's API is posted here.

    New columns: homepage id original_title overview popularity production_companies production_countries release_date spoken_languages status tagline vote_average Lost columns: actor1facebook_likes actor2facebook_likes actor3facebook_likes aspect_ratio casttotalfacebook_likes color content_rating directorfacebooklikesfacenumberinposter moviefacebooklikes movieimdblink numcriticfor_reviews numuserfor_reviews

  20. IMDB dataset of 5000 movie posters

    • kaggle.com
    zip
    Updated Nov 1, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nazima (2017). IMDB dataset of 5000 movie posters [Dataset]. https://www.kaggle.com/datasets/nazimamzz/imdb-dataset-of-5000-movie-posters
    Explore at:
    zip(564473 bytes)Available download formats
    Dataset updated
    Nov 1, 2017
    Authors
    Nazima
    Description

    Dataset

    This dataset was created by Nazima

    Contents

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sarika (2025). TMDB 5000 Movie Dataset [Dataset]. https://www.kaggle.com/datasets/sarikaa9/tmdb-5000-movie-dataset
Organization logo

TMDB 5000 Movie Dataset

Metadata on ~5,000 movies from TMDb

Explore at:
zip(1659098 bytes)Available download formats
Dataset updated
Feb 8, 2025
Authors
Sarika
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

What can we say about the success of a movie before it is released? Are there certain companies (Pixar?) that have found a consistent formula? Given that major films costing over $100 million to produce can still flop, this question is more important than ever to the industry. Film aficionados might have different interests. Can we predict which films will be highly rated, whether or not they are a commercial success?

This is a great place to start digging in to those questions, with data on the plot, cast, crew, budget, and revenues of several thousand films.

Search
Clear search
Close search
Google apps
Main menu