24 datasets found

TMDB 5000 Movie Dataset
kaggle.com
zip
Updated Feb 8, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sarika (2025). TMDB 5000 Movie Dataset [Dataset]. https://www.kaggle.com/datasets/sarikaa9/tmdb-5000-movie-dataset
Explore at:
zip(1659098 bytes)Available download formats
Dataset updated
Feb 8, 2025
Authors
Sarika
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
What can we say about the success of a movie before it is released? Are there certain companies (Pixar?) that have found a consistent formula? Given that major films costing over $100 million to produce can still flop, this question is more important than ever to the industry. Film aficionados might have different interests. Can we predict which films will be highly rated, whether or not they are a commercial success?

This is a great place to start digging in to those questions, with data on the plot, cast, crew, budget, and revenues of several thousand films.
TMDB 5000 Movie Dataset
kaggle.com
zip
Updated Nov 21, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lĩnh Trần476 (2024). TMDB 5000 Movie Dataset [Dataset]. https://www.kaggle.com/datasets/lnhtrn476/tmdb-5000-movie-dataset
Explore at:
zip(9317430 bytes)Available download formats
Dataset updated
Nov 21, 2024
Authors
Lĩnh Trần476
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Lĩnh Trần476

Released under Apache 2.0

Contents
IMDB 5000 Movie Dataset
kaggle.com
zip
Updated Dec 16, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yueming (2017). IMDB 5000 Movie Dataset [Dataset]. https://www.kaggle.com/datasets/carolzhangdc/imdb-5000-movie-dataset
Explore at:
zip(567524 bytes)Available download formats
Dataset updated
Dec 16, 2017
Authors
Yueming
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
Dataset

This dataset was created by Yueming

Released under Database: Open Database, Contents: Database Contents

Contents
TMDB 5000 Movies Dataset
kaggle.com
zip
Updated Jul 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Muhammad_Nauman_k (2025). TMDB 5000 Movies Dataset [Dataset]. https://www.kaggle.com/datasets/muhammadnaumank/tmdb-5000-movies-dataset
Explore at:
zip(9317430 bytes)Available download formats
Dataset updated
Jul 30, 2025
Authors
Muhammad_Nauman_k
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset

This dataset was created by Muhammad_Nauman_k

Released under CC0: Public Domain

Contents
TMDB 5000 Credits
kaggle.com
zip
Updated Aug 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Adem Yıldız (2024). TMDB 5000 Credits [Dataset]. https://www.kaggle.com/datasets/ademylz/tmdb-5000-credits
Explore at:
zip(7658354 bytes)Available download formats
Dataset updated
Aug 13, 2024
Authors
Adem Yıldız
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
This file contains detailed credit information on cast and crew members for more than 5,000 movies available on The Movie Database (TMDb). The data covers the names and roles of actors, directors, writers and other key crew members in each movie. It provides a comprehensive resource for film industry analysis and cinema history studies.
TMDB 5000 Movie Dataset with Ratings
kaggle.com
zip
Updated Jan 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aayush Soni (2024). TMDB 5000 Movie Dataset with Ratings [Dataset]. https://www.kaggle.com/datasets/aayushsoni4/tmdb-5000-movie-dataset-with-ratings
Explore at:
zip(195820415 bytes)Available download formats
Dataset updated
Jan 29, 2024
Authors
Aayush Soni
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
TMDB 5000 Movie Dataset with Ratings

Overview:

Welcome to the TMDB 5000 Movie Dataset with Ratings, a comprehensive collection that merges the original TMDb 5000 Movie Dataset with additional user ratings. This dataset offers an extensive exploration of the cinematic world, providing valuable insights for data enthusiasts, researchers, and machine learning practitioners.

Datasets Included:

tmdb_movie_dataset:

Columns:

budget: The budget allocated for the movie.

genres: Genres associated with the movie.

homepage: The homepage URL of the movie.

tmdbId: The unique identifier assigned by TMDb.

keywords: Keywords or tags related to the movie.

original_language: The original language of the movie.

original_title: The original title of the movie.

overview: A brief overview or synopsis of the movie.

popularity: Popularity score of the movie.

production_companies: Companies involved in the movie's production.

production_countries: Countries where the movie was produced.

release_date: The date when the movie was released.

revenue: The revenue generated by the movie.

runtime: The duration of the movie in minutes.

spoken_languages: Languages spoken in the movie.

status: The production status of the movie.

tagline: A tagline associated with the movie.

title: The title of the movie.

vote_average: Average user rating.

vote_count: The number of votes the movie received.

ratingId: Unique identifier for ratings.

tmdb_movie_credits:

Columns:

tmdbId: The unique identifier assigned by TMDb.

title: The title of the movie.

cast: Cast members of the movie.

crew: Crew members involved in the movie.

tmdb_movie_ratings:

Columns:

userId: Unique identifier for users.

ratingId: Unique identifier for ratings.

rating: User rating for the movie.

timestamp: The timestamp when the rating was given.

Key Features:

Comprehensive Movie Details: Explore detailed information about 5000 movies, including budget, genres, production details, and more.

User Ratings: Gain insights into audience reception with user ratings, facilitating sentiment analysis and audience preferences.

Cast and Crew Information: Delve into the cast and crew details for each movie, providing a comprehensive understanding of the creative forces behind the scenes.

Potential Use Cases:

Predictive Modeling: Develop machine learning models to predict user ratings based on various movie attributes.

Exploratory Data Analysis (EDA): Uncover trends and patterns in movie-related features, exploring correlations between budget, revenue, and user ratings.

Content Recommendation: Leverage user ratings to build recommendation systems for personalized movie suggestions.

How to Use:

Merge datasets using common keys (TMDb ID, Rating ID) to unleash the full potential of this comprehensive dataset.

Apply data cleaning and preprocessing techniques to prepare the dataset for analysis or model building.

Utilize visualization tools and statistical techniques for in-depth exploratory data analysis.

Acknowledgments:

This dataset is a curated compilation, merging the original TMDb 5000 Movie Dataset with additional user ratings to provide a comprehensive resource for the data science and machine learning community. We express our gratitude to the TMDb community for their valuable contributions.

Feedback and Contributions:

We welcome feedback and contributions to enhance the dataset. Connect, collaborate, and contribute to make this resource even more valuable for the community.

Explore the cinematic universe through data with the TMDB 5000 Movie Dataset with Ratings!
TMDB 5000+ Movies dataset
kaggle.com
zip
Updated Feb 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
sumit kr (2023). TMDB 5000+ Movies dataset [Dataset]. https://www.kaggle.com/datasets/sumitkkr/tmdb-movies-dataset
Explore at:
zip(1251127 bytes)Available download formats
Dataset updated
Feb 1, 2023
Authors
sumit kr
Description
Dataset

This dataset was created by sumit kr

Contents
Tmdb-5000-movie-Dataset
kaggle.com
zip
Updated Apr 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nitin Kharade (2023). Tmdb-5000-movie-Dataset [Dataset]. https://www.kaggle.com/datasets/nitinkharade/tmdb-5000-movie-dataset
Explore at:
zip(9317430 bytes)Available download formats
Dataset updated
Apr 21, 2023
Authors
Nitin Kharade
Description
Dataset

This dataset was created by Nitin Kharade

Contents
TMDb 5000 Movie Metadata
kaggle.com
zip
Updated Nov 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MD RAIHAN ALI (2025). TMDb 5000 Movie Metadata [Dataset]. https://www.kaggle.com/datasets/raihan63/tmdb-5000-movie-metadata
Explore at:
zip(9317430 bytes)Available download formats
Dataset updated
Nov 23, 2025
Authors
MD RAIHAN ALI
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset

This dataset was created by MD RAIHAN ALI

Released under CC0: Public Domain

Contents
Simplified TMDB movies
kaggle.com
zip
Updated Nov 24, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aivar Annamaa (2017). Simplified TMDB movies [Dataset]. https://www.kaggle.com/aivarannamaa/movies
Explore at:
zip(1242375 bytes)Available download formats
Dataset updated
Nov 24, 2017
Authors
Aivar Annamaa
Description
Overview

This is a simplified version of TMDB 5000 Movie Dataset (https://www.kaggle.com/tmdb/tmdb-movie-metadata). See that dataset for more info.

Changes compared to the original

removed rows with status other than "Released".

removed columns id, status, popularity

simplified following columns: genres, keywords, production_companies, production_countries, spoken_languages (replaced json-like structures with comma separated list of name attributes)

Photo by Felix Mooneeram on Unsplash
Movies Dataset TMDB
kaggle.com
zip
Updated Oct 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sreyan Sundaray (2023). Movies Dataset TMDB [Dataset]. https://www.kaggle.com/datasets/sreyansundaray/5000-movies-dataset-tmdb
Explore at:
zip(1456283 bytes)Available download formats
Dataset updated
Oct 1, 2023
Authors
Sreyan Sundaray
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
"Introducing the Ultimate Movie Database: Delve into the magic of cinema with our meticulously curated dataset, sourced directly from The Movie Database (TMDb) website. This comprehensive collection is a testament to the art of storytelling, featuring a vast array of films with rating, original language, Popularity etc. Our inspiration behind this dataset was to create a valuable resource for film enthusiasts, researchers, and data scientists, fostering a deeper understanding of movie trends, audience preferences, and industry evolution. Whether you're analyzing box office hits, exploring directorial styles, or uncovering hidden gems, this dataset opens the door to a world of cinematic exploration. Lights, camera, data – let the analysis begin!"
TMDB top rated movie dataset
kaggle.com
zip
Updated Apr 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Arijit Pal (2025). TMDB top rated movie dataset [Dataset]. https://www.kaggle.com/datasets/arijit5122/tmdb-top-rated-movie-dataset
Explore at:
zip(1404366 bytes)Available download formats
Dataset updated
Apr 24, 2025
Authors
Arijit Pal
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
This dataset features a comprehensive compilation of over 5,000 top-rated movies scraped from The Movie Database (TMDb) using their official API. TMDb is one of the largest open movie databases on the internet, trusted by millions of users and developers worldwide. This dataset focuses specifically on their top-rated films, ranked by a global audience based on average votes and vote counts.

The dataset was generated programmatically using Python and includes 500 full pages of TMDb’s /movie/top_rated endpoint. Each page returns 20 movies, resulting in 10,000 entries, from which duplicates or low-vote entries can be filtered based on project needs. 🔍 What’s Inside?

For each movie entry, the dataset includes:

🎬 Title — The original title of the film 🗓️ Release Date — The official release date 🌐 Original Language — The ISO language code (e.g., en, fr, ja) 📄 Overview — A short synopsis of the film's plot ⭐ Vote Average — The average rating out of 10 🗳️ Vote Count — Number of votes cast by TMDb users 📈 Popularity — TMDb’s internal popularity score 🆔 Movie ID — Unique TMDb identifier for the movie

💡 Why Use This Dataset?

This dataset is ideal for a wide range of projects in:

📊 Data analysis (e.g., trends in top-rated movies over time) 🧠 Machine learning & deep learning (e.g., recommendation systems) 💬 Natural Language Processing (e.g., sentiment analysis on movie overviews) 📈 Visualization (e.g., top genres, ratings by year/language) 🎞️ Film industry insights (e.g., how vote count influences average rating)

With a blend of metadata and user interaction data, it's a perfect dataset for anyone looking to combine storytelling with statistics. ✅ Highlights:

Extracted using the TMDb API with robust error handling and pagination Clean format with no missing columns Ready for immediate use in Jupyter Notebooks, Kaggle kernels, or data pipelines Can be joined with external genre, actor, or production data via id

Whether you're a film buff, a data scientist looking to build a movie recommender, or a developer training an NLP model — this dataset is your launchpad into the world of data-driven storytelling with cinema.
TMDB 5000 movies details
kaggle.com
zip
Updated Feb 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
SOURAV SAHOO (2024). TMDB 5000 movies details [Dataset]. https://www.kaggle.com/datasets/souravdatascience/tmdb-5000-movies-details/discussion
Explore at:
zip(9317430 bytes)Available download formats
Dataset updated
Feb 6, 2024
Authors
SOURAV SAHOO
Description
Dataset

This dataset was created by SOURAV SAHOO

Contents
tmdb2023csv
kaggle.com
zip
Updated Feb 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ak30 (2023). tmdb2023csv [Dataset]. https://www.kaggle.com/datasets/baba30/tmdb2023csv
Explore at:
zip(440125 bytes)Available download formats
Dataset updated
Feb 1, 2023
Authors
Ak30
Description
The TMDB 5000 Movie Dataset is a database of information on over 5000 films including various features such as budget, revenue, cast, directors, production companies, and genre. It is a popular dataset for data analysis and machine learning projects, particularly for natural language processing and recommendation systems. The data is collected from The Movie Database (TMDb), a user-edited database of information on films, TV shows, and other media. The dataset includes information on both popular and lesser-known films and provides a comprehensive overview of the film industry. The data is available for public use, making it a great resource for both researchers and students to practice their data analysis and machine learning skills.
tmdb_5000_movies
kaggle.com
zip
Updated Sep 30, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
yiyiyi (2018). tmdb_5000_movies [Dataset]. https://www.kaggle.com/datasets/zsx242030/tmdb-5000-movies
Explore at:
zip(1659098 bytes)Available download formats
Dataset updated
Sep 30, 2018
Authors
yiyiyi
Description
Dataset

This dataset was created by yiyiyi

Contents
tmdb_5000_movies
kaggle.com
Updated Jan 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Şükrü Yusuf Kaya (2024). tmdb_5000_movies [Dataset]. https://www.kaggle.com/datasets/kryusufkaya/tmdb-5000-movies
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 29, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Şükrü Yusuf Kaya
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset

This dataset was created by Şükrü Yusuf Kaya

Released under MIT

Contents
tmdb_5000_movies
kaggle.com
zip
Updated Feb 14, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Chaitanya Sood (2024). tmdb_5000_movies [Dataset]. https://www.kaggle.com/datasets/chaitanyasood1/tmdb-5000-movies
Explore at:
zip(9317430 bytes)Available download formats
Dataset updated
Feb 14, 2024
Authors
Chaitanya Sood
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Chaitanya Sood

Released under Apache 2.0

Contents
tmdb_5000_movies
kaggle.com
zip
Updated Sep 17, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Marcelo Barbosa de Morais (2019). tmdb_5000_movies [Dataset]. https://www.kaggle.com/datasets/moraismcl/tmdb-5000-movies
Explore at:
zip(1659098 bytes)Available download formats
Dataset updated
Sep 17, 2019
Authors
Marcelo Barbosa de Morais
Description
Dataset

This dataset was created by Marcelo Barbosa de Morais

Contents
Data from: Hollywood Movies Dataset
kaggle.com
zip
Updated Aug 4, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mohd Muttalib (2022). Hollywood Movies Dataset [Dataset]. https://www.kaggle.com/mohdmuttalib/hollywood-movies-dataset
Explore at:
zip(9317390 bytes)Available download formats
Dataset updated
Aug 4, 2022
Authors
Mohd Muttalib
Area covered
Hollywood
Description
Background What can we say about the success of a movie before it is released? Are there certain companies (Pixar?) that have found a consistent formula? Given that major films costing over $100 million to produce can still flop, this question is more important than ever to the industry. Film aficionados might have different interests. Can we predict which films will be highly rated, whether or not they are a commercial success?

This is a great place to start digging in to those questions, with data on the plot, cast, crew, budget, and revenues of several thousand films.

Data Source Transfer Details Several of the new columns contain json. You can save a bit of time by porting the load data functions from this kernel.

Even in simple fields like runtime may not be consistent across versions. For example, previous dataset shows the duration for Avatar's extended cut while TMDB shows the time for the original version.

There's now a separate file containing the full credits for both the cast and crew.

All fields are filled out by users so don't expect them to agree on keywords, genres, ratings, or the like. Your existing kernels will continue to render normally until they are re-run. If you are curious about how this dataset was prepared, the code to access TMDb's API is posted here.

New columns: homepage id original_title overview popularity production_companies production_countries release_date spoken_languages status tagline vote_average Lost columns: actor1facebook_likes actor2facebook_likes actor3facebook_likes aspect_ratio casttotalfacebook_likes color content_rating directorfacebooklikesfacenumberinposter moviefacebooklikes movieimdblink numcriticfor_reviews numuserfor_reviews
IMDB dataset of 5000 movie posters
kaggle.com
zip
Updated Nov 1, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nazima (2017). IMDB dataset of 5000 movie posters [Dataset]. https://www.kaggle.com/datasets/nazimamzz/imdb-dataset-of-5000-movie-posters
Explore at:
zip(564473 bytes)Available download formats
Dataset updated
Nov 1, 2017
Authors
Nazima
Description
Dataset

This dataset was created by Nazima

Contents

Facebook

Twitter

Click to copy link

Link copied

Cite

Sarika (2025). TMDB 5000 Movie Dataset [Dataset]. https://www.kaggle.com/datasets/sarikaa9/tmdb-5000-movie-dataset

TMDB 5000 Movie Dataset

Metadata on ~5,000 movies from TMDb

Explore at:

zip(1659098 bytes)Available download formats

Dataset updated

Feb 8, 2025

Authors

Sarika

License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

What can we say about the success of a movie before it is released? Are there certain companies (Pixar?) that have found a consistent formula? Given that major films costing over $100 million to produce can still flop, this question is more important than ever to the industry. Film aficionados might have different interests. Can we predict which films will be highly rated, whether or not they are a commercial success?

This is a great place to start digging in to those questions, with data on the plot, cast, crew, budget, and revenues of several thousand films.

Clear search

Close search

Google apps

Main menu

TMDB 5000 Movie Dataset

TMDB 5000 Movie Dataset

Dataset

Contents

IMDB 5000 Movie Dataset

Dataset

Contents

TMDB 5000 Movies Dataset

Dataset

Contents

TMDB 5000 Credits

TMDB 5000 Movie Dataset with Ratings

TMDB 5000 Movie Dataset with Ratings

Overview:

Datasets Included:

Key Features:

Potential Use Cases:

How to Use:

Acknowledgments:

Feedback and Contributions:

TMDB 5000+ Movies dataset

Dataset

Contents

Tmdb-5000-movie-Dataset

Dataset

Contents

TMDb 5000 Movie Metadata

Dataset

Contents

Simplified TMDB movies

Overview

Changes compared to the original

Movies Dataset TMDB

TMDB top rated movie dataset

TMDB 5000 movies details

Dataset

Contents

tmdb2023csv

tmdb_5000_movies

Dataset

Contents

tmdb_5000_movies

Dataset

Contents

tmdb_5000_movies

Dataset

Contents

tmdb_5000_movies

Dataset

Contents

Data from: Hollywood Movies Dataset

IMDB dataset of 5000 movie posters

Dataset

Contents

TMDB 5000 Movie DatasetSee More Versions

Metadata on ~5,000 movies from TMDb

TMDB 5000 Movie Dataset