100+ datasets found
  1. Full TMDB Movies Dataset 2024 (1M Movies)

    • kaggle.com
    zip
    Updated Nov 11, 2025
    Cite
    asaniczka (2025). Full TMDB Movies Dataset 2024 (1M Movies) [Dataset]. https://www.kaggle.com/datasets/asaniczka/tmdb-movies-dataset-2023-930k-movies
    Explore at:
    Available download formats: zip (239404730 bytes)
    Dataset updated
    Nov 11, 2025
    Authors
    asaniczka
    License

    Open Data Commons Attribution License (ODC-By) v1.0: https://www.opendatacommons.org/licenses/by/1.0/
    License information was derived automatically

    Description

    The TMDb (The Movie Database) is a comprehensive movie database that provides information about movies, including details like titles, ratings, release dates, revenue, genres, and much more.

    This dataset contains a collection of 1,000,000 movies from the TMDB database.

    Dataset is updated daily. If you find this dataset valuable, don't forget to hit the upvote button! 😊💝

    Interesting Task Ideas:

    1. Predict movie ratings based on features such as revenue, popularity, genre, and runtime.
    2. Identify trends in movie release dates and analyze their impact on revenue.
    3. Analyze the relationship between budget, revenue, and popularity to determine factors that contribute to a movie's success.
    4. Build a recommendation system that suggests similar movies based on genres, production companies, and language.
    5. Perform sentiment analysis on movie reviews to understand audience reactions.
    6. Explore the impact of movie genres on popularity and revenue.
    7. Investigate the correlation between runtime and audience engagement.
    8. Identify successful production companies and analyze their strategies.
    9. Utilize natural language processing techniques to extract meaningful insights from movie overviews.
    10. Visualize movie popularity over time and identify popular genres in different periods.
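
Several of the task ideas above (for example 3 and 7) come down to measuring how two numeric columns move together. The following is a minimal sketch of a Pearson correlation in plain Python. The rows, the field names `budget` and `revenue`, and the helper `pearson` are hypothetical illustrations, not values or column names confirmed by the dataset page.

```python
import math

# Hypothetical rows: values and field names are illustrative only,
# not taken from the dataset.
sample_movies = [
    {"title": "A", "budget": 10.0, "revenue": 25.0},
    {"title": "B", "budget": 20.0, "revenue": 45.0},
    {"title": "C", "budget": 30.0, "revenue": 70.0},
    {"title": "D", "budget": 40.0, "revenue": 85.0},
]


def pearson(xs, ys):
    """Return the Pearson correlation coefficient of two sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


budgets = [m["budget"] for m in sample_movies]
revenues = [m["revenue"] for m in sample_movies]
r = pearson(budgets, revenues)
print(f"budget/revenue correlation: {r:.3f}")
```

On real data, rows with a missing or zero budget or revenue would likely need to be filtered out first, since they would otherwise distort the coefficient.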

    Check out my other datasets

    Clash of Clans Clans Dataset 2023 (3.5M Clans)

    Black-White Wage Gap in the USA Dataset

    130K Kindle Books

    USA Unemployment Rates by Demographics & Race

    150K TMDb TV Shows

    Photo by Onur Binay on Unsplash

  2. Movie Dataset for ML

    • kaggle.com
    zip
    Updated Oct 2, 2023
    Cite
    Abhik Dhar (2023). Movie Dataset for ML [Dataset]. https://www.kaggle.com/datasets/abhikdhar/movie-dataset-random
    Explore at:
    Available download formats: zip (19713 bytes)
    Dataset updated
    Oct 2, 2023
    Authors
    Abhik Dhar
    Description

    Description: This dataset contains information about 616 movies spanning various genres, years of release, and creative talents involved in their production. The dataset is intended for use in data analysis, visualization, and machine learning projects related to the film industry. Each row represents a single movie entry, and the dataset includes the following columns:

    • Movie: The title of the movie.
    • Year: The year of release for the movie.
    • Genres: The genres or categories associated with the movie.
    • Certification/Rating: The film's certification or rating according to the relevant rating board or organization.
    • IMDb ID: The unique IMDb identifier for the movie.
    • Writer: The name(s) of the writer(s) or screenwriter(s) responsible for the movie's screenplay.
    • Director: The name of the movie's director.

    Potential Use Cases:

    • Film industry analysis: Analyze trends in movie genres and ratings over time.
    • Predicting movie success: Build predictive models to forecast a movie's success based on its features.
    • Recommender systems: Develop movie recommendation systems for users based on their preferences.
    • Creative insights: Explore relationships between directors, writers, and movie genres.

  3. movies-dataset

    • huggingface.co
    Cite
    Pablo Merchán-Rivera, movies-dataset [Dataset]. https://huggingface.co/datasets/Pablinho/movies-dataset
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Pablo Merchán-Rivera
    License

    https://choosealicense.com/licenses/cc0-1.0/

    Description

    +9000 Movie Dataset

      Overview
    

    This dataset is sourced from Kaggle and has been granted CC0 1.0 Universal (CC0 1.0) Public Domain Dedication by the original author. This means you can copy, modify, distribute, and perform the work, even for commercial purposes, all without asking permission. I would like to express my gratitude to the original author for their contribution to the data community.

      License
    

    This dataset is released under the CC0 1.0 Universal… See the full description on the dataset page: https://huggingface.co/datasets/Pablinho/movies-dataset.

  4. IMDB 5000 Movie Dataset

    • kaggle.com
    zip
    Updated Dec 16, 2017
    + more versions
    Cite
    Yueming (2017). IMDB 5000 Movie Dataset [Dataset]. https://www.kaggle.com/datasets/carolzhangdc/imdb-5000-movie-dataset
    Explore at:
    Available download formats: zip (567524 bytes)
    Dataset updated
    Dec 16, 2017
    Authors
    Yueming
    License

    http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    Dataset

    This dataset was created by Yueming

    Released under Database: Open Database, Contents: Database Contents

    Contents

  5. IMDB movie details dataset

    • crawlfeeds.com
    csv, zip
    Updated Nov 9, 2025
    Cite
    Crawl Feeds (2025). IMDB movie details dataset [Dataset]. https://crawlfeeds.com/datasets/imdb-movie-details-dataset
    Explore at:
    Available download formats: zip, csv
    Dataset updated
    Nov 9, 2025
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policy

    Description
    The IMDB Movie Details Dataset is a comprehensive collection of information about movies, TV shows, and streaming content listed on IMDB. It includes detailed data such as titles, release years, genres, cast, crew, ratings, and more, making it a useful resource for film and entertainment enthusiasts. The dataset is suited to data analysis, with applications spanning machine learning projects, predictive modeling, and insights into industry trends.
    Researchers can explore patterns in movie ratings and genre popularity, while developers can use the dataset to build recommendation systems or applications. Movie buffs can dive deep into historical and contemporary trends in the world of cinema. This dataset not only supports academic and professional pursuits but also opens doors for creative projects in storytelling, content creation, and audience engagement. Whether you’re a developer, researcher, or film enthusiast, the IMDB movie dataset is a powerful tool for uncovering trends and gaining deeper insights into the evolving entertainment landscape.
  6. IMDB Movie Ratings Dataset

    • kaggle.com
    zip
    Updated Jan 17, 2023
    Cite
    The Devastator (2023). IMDB Movie Ratings Dataset [Dataset]. https://www.kaggle.com/datasets/thedevastator/imdb-movie-ratings-dataset
    Explore at:
    Available download formats: zip (319960 bytes)
    Dataset updated
    Jan 17, 2023
    Authors
    The Devastator
    Description

    IMDB Movie Ratings Dataset

    Evaluating Directors, Actors, Genres, and Movie Titles

    By Himanshu Sekhar Paul [source]

    About this dataset

    This inspiring IMDB Movie Dataset is a comprehensive database of movie ratings, featuring director_name, duration, actor_2_name, genres, actor_1_name, movie title and more. Whether you're a fan of dramatic thrillers or nostalgic '90s classics from our childhoods; here you'll find information about the most voted movies from users across the world. Delve into num_voted_users trends and discover the language each movie was released in to craft your very own personal film library of country-specific titles released in any given year. With this dataset at your disposal comparing imdb scores will never be easier! Who will come out top when the votes have been tallied? Dive into data for a journey unparalleled!

    How to use the dataset

    This dataset offers a comprehensive overview of the movie ratings from IMDB. It includes data about director name, duration, actors, genres, movie title, number of votes, language, country of origin, year released and IMDB score.

    To use this dataset to get a deeper understanding of how movies are rated on IMDB you can take the following steps:

    • Look through each column of the data to get an overall understanding. This will help you identify trends or correlations to analyze further in later steps.
    • Explore relationships between columns such as 'Number Voted Users' and 'IMDB Score'; looking at how these numbers relate to each other can help you better understand rating trends on IMDB.
    • Analyze how particular sub-groups perform within categories such as genre or country; this could provide insight into preferences for certain types of movies, or show which countries have higher associated scores than others.
    • Through your analysis, try to answer questions about specific groups on IMDB: are there distinct preferences among age groups when it comes to what they watch? Are there clear correlations between rating and genre within certain countries?

    By starting with these questions and taking an initial 'big picture' view before diving into more detailed analysis, users should be able to find value in this dataset by uncovering useful insights about movie ratings on IMDB.

    Research Ideas

    • Movie Recommendation System: The dataset can be used to build a movie recommendation system using machine learning algorithms like k-nearest neighbors or collaborative filtering. Based on the user's past ratings, the system can suggest relevant movies with similar genres, actors and directors.
    • Movie Popularity Index: Using the data, a metric could be designed that provides an overall popularity index for movies released over the years. This index could be constructed by considering factors such as IMDb score, number of votes, and number of reviews collected.
    • Genre-based Over/Under Performance Analysis: Based on the genres of the movies released each year, this dataset can provide insight into which genres are performing well and which are not. This kind of analysis could inform decisions about allocating resources to production budgets or marketing campaigns for upcoming films in different genres across different regions or markets.
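
One possible way to build the popularity index described above is a vote-weighted (Bayesian) average, which pulls the score of a movie with few votes toward the overall mean. The sketch below is an assumption, not a method stated by the dataset author: the sample rows are hypothetical, and `min_votes` is an arbitrary tuning threshold. The number of votes corresponds to the `num_voted_users` column.

```python
def weighted_rating(score, votes, mean_score, min_votes):
    """Blend a movie's own score with the global mean, weighted by vote count."""
    total = votes + min_votes
    return (votes / total) * score + (min_votes / total) * mean_score


# Hypothetical (title, score, number of votes) rows for illustration.
sample_ratings = [
    ("Niche Film", 9.0, 50),
    ("Popular Film", 8.2, 500_000),
    ("Average Film", 6.0, 20_000),
]

mean_score = sum(score for _, score, _ in sample_ratings) / len(sample_ratings)
MIN_VOTES = 1_000  # assumed threshold; tune for the real data

ranked = sorted(
    ((title, weighted_rating(score, votes, mean_score, MIN_VOTES))
     for title, score, votes in sample_ratings),
    key=lambda pair: pair[1],
    reverse=True,
)
for title, index in ranked:
    print(f"{title}: {index:.2f}")
```

With this weighting, the 9.0 score backed by only 50 votes ranks below the 8.2 score backed by 500,000 votes, which is usually the desired behavior for a popularity index.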

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    See the dataset description for more information.

    Columns

    File: movie_data.csv

    | Column name          | Description                                        |
    |:---------------------|:---------------------------------------------------|
    | director_name        | Name of the director of the movie. (String)        |
    | duration             | Length of the movie in minutes. (Integer)          |
    | actor_2_name         | Name of the second actor in the movie. (String)    |
    | genres               | Genre of the movie. (String)                       |
    | actor_1_name         | Name of the first actor in the movie. (String)     |
    | movie_title          | Title of the movie. (String)                       |
    | num_voted_users      | Number of users who voted for the movie. (Integer) |
    | actor_3_name         | Name of the third actor in the movie. (String)     |
    | movie_imdb_link      | Link to the movie's IMDB page. (String)            |
    | num_user_for_reviews |...
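
As a rough illustration of working with the columns listed above, the sketch below reads CSV text with the standard library and averages `duration` per `director_name`. The inline sample rows are hypothetical, and the pipe-separated `genres` values are an assumption about the file's format. With the real file, `io.StringIO(SAMPLE_CSV)` would be replaced by an opened `movie_data.csv`.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows using a subset of the documented column names.
SAMPLE_CSV = """director_name,duration,genres,movie_title,num_voted_users
Director A,120,Action|Adventure,Film One,1000
Director A,100,Drama,Film Two,3000
Director B,90,Comedy,Film Three,500
Director B,,Comedy,Film Four,200
"""


def mean_duration_by_director(csv_file):
    """Return {director_name: mean duration in minutes}, skipping blank durations."""
    totals = defaultdict(lambda: [0.0, 0])  # director -> [sum, count]
    for row in csv.DictReader(csv_file):
        if not row["duration"]:
            continue  # missing value: leave it out of the average
        entry = totals[row["director_name"]]
        entry[0] += float(row["duration"])
        entry[1] += 1
    return {name: total / count for name, (total, count) in totals.items()}


averages = mean_duration_by_director(io.StringIO(SAMPLE_CSV))
print(averages)
```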

  7. Rotten Tomatoes Movie Dataset – Clean Movie Metadata

    • crawlfeeds.com
    csv, zip
    Updated Nov 9, 2025
    Cite
    Crawl Feeds (2025). Rotten Tomatoes Movie Dataset – Clean Movie Metadata [Dataset]. https://crawlfeeds.com/datasets/rotten-tomatoes-movie-dataset-clean-movie-metadata
    Explore at:
    Available download formats: csv, zip
    Dataset updated
    Nov 9, 2025
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policy

    Description

    We provide a high-quality Rotten Tomatoes movie dataset that includes key metadata for thousands of movies. This dataset is ideal for anyone working with movie-related platforms, entertainment analytics, content curation, or movie discovery tools.

    Our collection is structured, clean, and designed to support real-time apps, dashboards, and research use cases.

    What the Dataset Includes

    Each record in the dataset contains core information pulled directly from Rotten Tomatoes, including:

    • Movie Name – The official title of the movie.

    • Poster URL – High-resolution image link to the movie poster.

    • Trailer URL – Direct link to the official trailer (when available).

    • Genre – One or more genres associated with the movie, such as Action, Drama, Comedy, or Horror.

    • Release Date – The date the movie was released to the public.

    • Actors – Main cast members listed on Rotten Tomatoes.

    • Directors – Director(s) responsible for the movie.

    • Rating – Audience or critic scores, where available.

    Broad Coverage

    This dataset spans a wide range of movies across all major genres and decades. From modern releases to timeless classics, from Hollywood blockbusters to independent films — we’ve included movies of all types with relevant data points.

    You can expect data on:

    • U.S. theatrical releases

    • Netflix, Amazon, and other streaming exclusives

    • Festival films and limited releases

    • Animated and documentary films

    Use Cases

    Here are just a few ways this dataset can be useful:

    • Movie Recommendation Engines – Use metadata and genre info to power personalized movie suggestions.

    • Entertainment Search Tools – Build searchable movie listings with visual poster previews and trailer links.

    • Data Visualization Projects – Create dashboards showing trends by genre, release periods, or actor participation.

    • AI/ML Training – Use metadata to train classification models or sentiment prediction tools.

    • Research & Academic Use – Analyze patterns in movie releases, cast dynamics, and genre evolution.

    Why Use Our Dataset?

    • Clean & ready-to-use: No raw HTML, just clean structured data.

    • Minimal but meaningful fields: Focused on useful movie attributes without clutter.

    • Updated info: Covers both classic and current titles.

    • Simple integration: Easy to use for developers, analysts, and product teams.

    If you're working on a movie-based product or looking for reliable film metadata for your project, this dataset offers an ideal foundation.

    Let us know if you’d like to explore it further.

  8. imdb-genres

    • huggingface.co
    Updated Sep 18, 2024
    Cite
    Jack Quigley (2024). imdb-genres [Dataset]. https://huggingface.co/datasets/jquigl/imdb-genres
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 18, 2024
    Authors
    Jack Quigley
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Dataset Card for IMDb Movie Dataset: All Movies by Genre

      Dataset Summary
    

    This dataset is an adapted version of "IMDb Movie Dataset: All Movies by Genre" found at: https://www.kaggle.com/datasets/rajugc/imdb-movies-dataset-based-on-genre?select=history.csv. Within the dataset, the movie title and year columns were combined, the genre was extracted from the separate csv files, the pre-existing genre column was renamed to expanded-genres, any movies missing a description… See the full description on the dataset page: https://huggingface.co/datasets/jquigl/imdb-genres.

  9. movie-dataset

    • huggingface.co
    Updated Mar 30, 2025
    + more versions
    Cite
    Mustafa UZUMCU (2025). movie-dataset [Dataset]. https://huggingface.co/datasets/Musss0/movie-dataset
    Explore at:
    Dataset updated
    Mar 30, 2025
    Authors
    Mustafa UZUMCU
    Description

    Musss0/movie-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. Movies dataset from allmovie

    • crawlfeeds.com
    json, zip
    Updated Dec 26, 2024
    Cite
    Crawl Feeds (2024). Movies dataset from allmovie [Dataset]. https://crawlfeeds.com/datasets/movies-dataset-form-allmovie
    Explore at:
    Available download formats: json, zip
    Dataset updated
    Dec 26, 2024
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policy

    Description

    Movies Dataset from AllMovie is a comprehensive collection featuring over 430,000 records, encompassing a wide range of films across various genres and languages. This extensive dataset includes essential data points such as movie titles, genres, release dates, posters, languages, directors, durations, synopses, trailers, average ratings, cast information, and URLs. Such detailed metadata is invaluable for developers, researchers, and enthusiasts aiming to analyze trends, build recommendation systems, or conduct in-depth studies of the film industry.

    For those interested in alternative datasets, the IMDb Non-Commercial Datasets provide subsets of IMDb data accessible for personal and non-commercial use. These datasets allow users to hold local copies of movie information, facilitating various analytical projects.

    Additionally, the MovieLens datasets offer a range of movie rating data suitable for research purposes. For instance, the MovieLens 20M dataset comprises 20 million ratings and 465,000 tag applications applied to 27,000 movies by 138,000 users, making it a valuable resource for studies in user preferences and recommendation algorithms.
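
To put the MovieLens 20M figures quoted above in perspective, the short calculation below derives the average number of ratings per user and per movie, and the share of all possible user-movie pairs that have a rating. It uses only the rounded counts from the paragraph, so the results are approximate; the variable names are illustrative.

```python
# Rounded counts quoted in the description above.
n_ratings = 20_000_000
n_movies = 27_000
n_users = 138_000

ratings_per_user = n_ratings / n_users
ratings_per_movie = n_ratings / n_movies
# Fraction of the user x movie matrix that actually holds a rating.
density = n_ratings / (n_users * n_movies)

print(f"ratings per user:  {ratings_per_user:.0f}")   # about 145
print(f"ratings per movie: {ratings_per_movie:.0f}")  # about 741
print(f"matrix density:    {density:.2%}")            # about 0.54%
```

A density of roughly half a percent means the rating matrix is very sparse, which is why recommendation algorithms for this kind of data typically rely on sparse representations.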

    Incorporating these datasets into your projects can significantly enhance the quality and depth of your analyses, providing a solid foundation for exploring various aspects of the cinematic world.

    Why Choose Crawl Feeds for Your Data Needs?

    Crawl Feeds is your trusted partner in acquiring high-quality, curated datasets tailored to your specific requirements. With a vast repository that includes the Movies Dataset, we empower developers and businesses to drive innovation. Explore our easy-to-use platform and transform your ideas into actionable insights.

    Get Started with Crawl Feeds Today

  11. movie-posters-dataset

    • huggingface.co
    Updated Nov 30, 2024
    + more versions
    Cite
    Yashpreet Voladoddi (2024). movie-posters-dataset [Dataset]. https://huggingface.co/datasets/yashvoladoddi37/movie-posters-dataset
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 30, 2024
    Authors
    Yashpreet Voladoddi
    Description

    yashvoladoddi37/movie-posters-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. IMDB & TMDB Movie Metadata Big Dataset (over 1M)

    • kaggle.com
    zip
    Updated Aug 5, 2024
    Cite
    Shubham Chandra (2024). IMDB & TMDB Movie Metadata Big Dataset (over 1M) [Dataset]. https://www.kaggle.com/datasets/shubhamchandra235/imdb-and-tmdb-movie-metadata-big-dataset-1m
    Explore at:
    Available download formats: zip (416807108 bytes)
    Dataset updated
    Aug 5, 2024
    Authors
    Shubham Chandra
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Title: IMDB & TMDB Movie Metadata Big Dataset (>1M)

    Subtitle: A Comprehensive Dataset Featuring Detailed Metadata of Movies (IMDB, TMDB). Over 1M Rows & 42 Features: Metadata, Ratings, Genres, Cast, Crew, Sentiment Analysis and many more...

    Detailed Description:

    Overview: This comprehensive dataset merges the extensive film data available from both IMDB and TMDB, offering a rich resource for movie enthusiasts, data scientists, and researchers. With over 1 million rows and 42 detailed features, this dataset provides in-depth information about a wide variety of movies, spanning different genres, periods, and production backgrounds.

    File Information:

    1. File Size: ≈ 1GB
    2. Format: CSV (Comma-Separated Values)

    Column Descriptors/Key Features:

    1. ID: Unique identifier for each movie.
    2. Title: The official title of the movie.
    3. Vote Average: Average rating received by the movie.
    4. Vote Count: Number of votes the movie has received.
    5. Status: Current status of the movie (e.g., Released, Post-Production).
    6. Release Date: Official release date of the movie.
    7. Revenue: Box office revenue generated by the movie.
    8. Runtime: Duration of the movie in minutes.
    9. Adult: Indicates if the movie is for adults.
    10. Genres: List of genres the movie belongs to.
    11. Overview Sentiment: Sentiment analysis of the movie's overview text.
    12. Cast: List of main actors in the movie.
    13. Crew: List of key crew members, including directors, producers, and writers.
    14. Genres List: Detailed genres in list format.
    15. Keywords: List of relevant keywords associated with the movie.
    16. Director of Photography: Name of the cinematographer.
    17. Producers: Names of the producers.
    18. Music Composer: Name of the music composer.

    Additional Features:

    1. Unnamed 0: Index column.
    2. Star1, Star2, Star3, Star4: Names of the top-billed stars.
    3. Writer: Name(s) of the writer(s).
    4. Original Language: Original language of the movie.
    5. Original Title: Original title if different from the main title.
    6. Popularity: Popularity score of the movie.
    7. Budget: Budget allocated for the movie.
    8. Tagline: Promotional tagline of the movie.
    9. Production Companies: Companies involved in the production.
    10. Production Countries: Countries where the movie was produced.
    11. Spoken Languages: Languages spoken in the movie.
    12. Homepage: Official website of the movie.
    13. IMDB ID: Unique identifier on IMDB.
    14. TMDB ID: Unique identifier on TMDB.
    15. Video: Indicates if there is a video associated.
    16. Poster Path: Path to the movie poster image.
    17. Backdrop Path: Path to the backdrop image.
    18. Release Year: Year the movie was released.
    19. Collection Name: Name of the collection the movie belongs to.
    20. Collection ID: Unique identifier for the collection.
    21. Genres ID: Unique identifier for the genres.
    22. Original Language Code: Code for the original language.
    23. Overview: Brief summary of the movie.
    24. All Combined Keywords: Combined keywords in a single field.

    Potential Use Cases:

    • Sentiment Analysis: Analyze audience sentiment towards movies based on reviews and ratings.
    • Recommendation Systems: Build models to recommend movies based on user preferences and viewing history.
    • Market Analysis: Study trends in the movie industry, including genre popularity and revenue patterns.
    • Content Analysis: Investigate the thematic content and diversity of movies over time.
    • Data Visualization: Create visual representations of movie data to uncover hidden insights.
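
For the recommendation use case, one simple content-based starting point is to compare movies by the overlap of their genre sets (Jaccard similarity). The sketch below is an illustration under stated assumptions, not the dataset author's method: it assumes the Genres field is a comma-separated string, and the catalog entries and function names are hypothetical.

```python
def parse_genres(field):
    """Split a comma-separated genres string into a normalized set."""
    return {g.strip().lower() for g in field.split(",") if g.strip()}


def jaccard(a, b):
    """Jaccard similarity of two sets; defined as 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def most_similar(title, catalog, top_n=2):
    """Rank the other titles in `catalog` by genre overlap with `title`."""
    target = parse_genres(catalog[title])
    scores = [
        (other, jaccard(target, parse_genres(genres)))
        for other, genres in catalog.items()
        if other != title
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:top_n]


# Hypothetical catalog mapping title -> genres field (format is assumed).
sample_catalog = {
    "Film One": "Action, Adventure, Sci-Fi",
    "Film Two": "Action, Sci-Fi",
    "Film Three": "Romance, Drama",
}

recommendations = most_similar("Film One", sample_catalog)
print(recommendations)
```

Genre overlap alone is a coarse signal; fields such as Keywords, Cast, or Production Companies could be folded into the same set-based comparison.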

  13. IMDB-Dataset-of-50K-Movie-Reviews-Backup

    • huggingface.co
    Cite
    Q-b1t, IMDB-Dataset-of-50K-Movie-Reviews-Backup [Dataset]. https://huggingface.co/datasets/Q-b1t/IMDB-Dataset-of-50K-Movie-Reviews-Backup
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Q-b1t
    License

    https://choosealicense.com/licenses/other/

    Description

    Q-b1t/IMDB-Dataset-of-50K-Movie-Reviews-Backup dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. Websites using Movie Database

    • webtechsurvey.com
    csv
    Updated Oct 13, 2025
    + more versions
    Cite
    WebTechSurvey (2025). Websites using Movie Database [Dataset]. https://webtechsurvey.com/technology/movie-database
    Explore at:
    Available download formats: csv
    Dataset updated
    Oct 13, 2025
    Dataset authored and provided by
    WebTechSurvey
    License

    https://webtechsurvey.com/terms

    Time period covered
    2025
    Area covered
    Global
    Description

    A complete list of live websites using the Movie Database technology, compiled through global website indexing conducted by WebTechSurvey.

  15. IMDb Movies Metadata Dataset – 4.5M Records (Global Coverage)

    • crawlfeeds.com
    csv, zip
    Updated Nov 9, 2025
    Cite
    Crawl Feeds (2025). IMDb Movies Metadata Dataset – 4.5M Records (Global Coverage) [Dataset]. https://crawlfeeds.com/datasets/imdb-movies-metadata-dataset-4-5m-records-global-coverage
    Explore at:
    Available download formats: csv, zip
    Dataset updated
    Nov 9, 2025
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policy

    Description

    Unlock one of the most comprehensive movie datasets available—4.5 million structured IMDb movie records, extracted and enriched for data science, machine learning, and entertainment research.

    This dataset includes a vast collection of global movie metadata, including details on title, release year, genre, country, language, runtime, cast, directors, IMDb ratings, reviews, and synopsis. Whether you're building a recommendation engine, benchmarking trends, or training AI models, this dataset is designed to give you deep and wide access to cinematic data across decades and continents.

    Perfect for use in film analytics, OTT platforms, review sentiment analysis, knowledge graphs, and LLM fine-tuning, the dataset is cleaned, normalized, and exportable in multiple formats.

    What’s Included:

    • Genres: Drama, Comedy, Horror, Action, Sci-Fi, Documentary, and more

    • Delivery: Direct download

    Use Cases:

    • Train LLMs or chatbots on cinematic language and metadata

    • Build or enrich movie recommendation engines

    • Run cross-lingual or multi-region film analytics

    • Benchmark genre popularity across time periods

    • Power academic studies or entertainment dashboards

    • Feed into knowledge graphs, search engines, or NLP pipelines

  16. movie-dataset

    • huggingface.co
    Updated Jul 18, 2023
    Cite
    Vivek Eswaran (2023). movie-dataset [Dataset]. https://huggingface.co/datasets/veswaran/movie-dataset
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 18, 2023
    Authors
    Vivek Eswaran
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    veswaran/movie-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. movie

    • huggingface.co
    Updated Mar 30, 2025
    Cite
    Masterclass (2025). movie [Dataset]. https://huggingface.co/datasets/mc-ai/movie
    Explore at:
    Dataset updated
    Mar 30, 2025
    Dataset authored and provided by
    Masterclass
    Description

    mc-ai/movie dataset hosted on Hugging Face and contributed by the HF Datasets community

  18. Movie Data - X - Test - w2v

    • data.researchdatafinder.qut.edu.au
    Updated Apr 8, 2018
    Cite
    (2018). Movie Data - X - Test - w2v [Dataset]. https://data.researchdatafinder.qut.edu.au/dataset/survey-word-vector/resource/e638fc06-7ef3-4a41-85e2-21f7fad2dfb3
    Explore at:
    Dataset updated
    Apr 8, 2018
    License

    http://researchdatafinder.qut.edu.au/display/n15252

    Description

    This file contains the features for the test portion of the movie dataset. The data has been converted into an average word vector. This is 50% of the total movie results. QUT Research Data Repository Dataset Resource available for download

  19. Large Movie Review Dataset

    • ieee-dataport.org
    Updated Jul 17, 2025
    + more versions
    Cite
    Tasnim Akter Onisha (2025). Large Movie Review Dataset [Dataset]. https://ieee-dataport.org/documents/large-movie-review-dataset
    Explore at:
    Dataset updated
    Jul 17, 2025
    Authors
    Tasnim Akter Onisha
    Description

    contains 50

  20. imdb_reviews

    • tensorflow.org
    • kaggle.com
    Updated Sep 20, 2024
    + more versions
    Cite
    (2024). imdb_reviews [Dataset]. https://www.tensorflow.org/datasets/catalog/imdb_reviews
    Explore at:
    Dataset updated
    Sep 20, 2024
    Description

    Large Movie Review Dataset. This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well.

    To use this dataset:

    import tensorflow_datasets as tfds

    # Load the training split of the IMDB reviews dataset.
    ds = tfds.load('imdb_reviews', split='train')
    # Print the first four examples.
    for ex in ds.take(4):
        print(ex)
    

    See the guide for more information on tensorflow_datasets.


Full TMDB Movies Dataset 2024 (1M Movies)

Complete dataset containing movie data from TMDb. Updated Daily

Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
