100+ datasets found

Full TMDB Movies Dataset 2024 (1M Movies)
kaggle.com
zip
Updated Nov 11, 2025
asaniczka (2025). Full TMDB Movies Dataset 2024 (1M Movies) [Dataset]. https://www.kaggle.com/datasets/asaniczka/tmdb-movies-dataset-2023-930k-movies
Explore at:
Available download formats: zip (239404730 bytes)
Dataset updated
Nov 11, 2025
Authors
asaniczka
License
Open Data Commons Attribution License (ODC-By) v1.0: https://www.opendatacommons.org/licenses/by/1.0/
License information was derived automatically
Description
The TMDb (The Movie Database) is a comprehensive movie database that provides information about movies, including details like titles, ratings, release dates, revenue, genres, and much more.

This dataset contains a collection of 1,000,000 movies from the TMDB database.

Dataset is updated daily. If you find this dataset valuable, don't forget to hit the upvote button! 😊💝

Interesting Task Ideas:

Predict movie ratings based on features such as revenue, popularity, genre, and runtime.

Identify trends in movie release dates and analyze their impact on revenue.

Analyze the relationship between budget, revenue, and popularity to determine factors that contribute to a movie's success.

Build a recommendation system that suggests similar movies based on genres, production companies, and language.

Perform sentiment analysis on movie reviews to understand audience reactions.

Explore the impact of movie genres on popularity and revenue.

Investigate the correlation between runtime and audience engagement.

Identify successful production companies and analyze their strategies.

Utilize natural language processing techniques to extract meaningful insights from movie overviews.

Visualize movie popularity over time and identify popular genres in different periods.
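
One of the task ideas above, the relationship between budget, revenue, and popularity, can be explored with a plain Pearson correlation. The following is a minimal sketch on a tiny made-up sample; the column names `budget`, `revenue`, and `popularity` are assumptions for illustration and should be checked against the actual CSV header before use.

```python
import csv
import io
from math import sqrt

# Made-up sample rows; the column names are assumed, not confirmed by the listing.
SAMPLE_CSV = """title,budget,revenue,popularity
A,10,30,5.0
B,20,45,6.5
C,30,80,9.0
D,40,90,9.5
"""


def load_numeric_columns(text, columns):
    """Read CSV text and return {column: [float, ...]}, skipping rows with blanks."""
    data = {name: [] for name in columns}
    for row in csv.DictReader(io.StringIO(text)):
        if any(not row[name] for name in columns):
            continue
        for name in columns:
            data[name].append(float(row[name]))
    return data


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


cols = load_numeric_columns(SAMPLE_CSV, ["budget", "revenue", "popularity"])
matrix = {(a, b): pearson(cols[a], cols[b]) for a in cols for b in cols}

for (a, b), r in matrix.items():
    if a < b:  # print each unordered pair once
        print(f"{a} vs {b}: r = {r:.3f}")
```

On the real file, rows with zero or missing budget and revenue would likely need filtering first, since placeholder values can distort the coefficient.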

Check out my other datasets

Clash of Clans Clans Dataset 2023 (3.5M Clans)

Black-White Wage Gap in the USA Dataset

130K Kindle Books

USA Unemployment Rates by Demographics & Race

150K TMDb TV Shows

Photo by Onur Binay on Unsplash
Movie Dataset for ML
kaggle.com
zip
Updated Oct 2, 2023
Abhik Dhar (2023). Movie Dataset for ML [Dataset]. https://www.kaggle.com/datasets/abhikdhar/movie-dataset-random
Explore at:
Available download formats: zip (19713 bytes)
Dataset updated
Oct 2, 2023
Authors
Abhik Dhar
Description
Description: This dataset contains information about 616 movies spanning various genres, years of release, and creative talents involved in their production. The dataset is intended for use in data analysis, visualization, and machine learning projects related to the film industry. Each row represents a single movie entry, and the dataset includes the following columns:

Movie: The title of the movie.

Year: The year of release for the movie.

Genres: The genres or categories associated with the movie.

Certification/Rating: The film's certification or rating according to the relevant rating board or organization.

IMDb ID: The unique IMDb identifier for the movie.

Writer: The name(s) of the writer(s) or screenwriter(s) responsible for the movie's screenplay.

Director: The name of the movie's director.

Potential Use Cases:

Film industry analysis: Analyze trends in movie genres and ratings over time.

Predicting movie success: Build predictive models to forecast a movie's success based on its features.

Recommender systems: Develop movie recommendation systems for users based on their preferences.

Creative insights: Explore relationships between directors, writers, and movie genres.
movies-dataset
huggingface.co
Pablo Merchán-Rivera, movies-dataset [Dataset]. https://huggingface.co/datasets/Pablinho/movies-dataset
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Pablo Merchán-Rivera
License
https://choosealicense.com/licenses/cc0-1.0/
Description
+9000 Movie Dataset

Overview

This dataset is sourced from Kaggle and has been granted CC0 1.0 Universal (CC0 1.0) Public Domain Dedication by the original author. This means you can copy, modify, distribute, and perform the work, even for commercial purposes, all without asking permission. I would like to express my gratitude to the original author for their contribution to the data community.

License

This dataset is released under the CC0 1.0 Universal… See the full description on the dataset page: https://huggingface.co/datasets/Pablinho/movies-dataset.
IMDB 5000 Movie Dataset
kaggle.com
zip
Updated Dec 16, 2017
+ more versions
Yueming (2017). IMDB 5000 Movie Dataset [Dataset]. https://www.kaggle.com/datasets/carolzhangdc/imdb-5000-movie-dataset
Explore at:
Available download formats: zip (567524 bytes)
Dataset updated
Dec 16, 2017
Authors
Yueming
License
http://opendatacommons.org/licenses/dbcl/1.0/
Description
Dataset

This dataset was created by Yueming

Released under Database: Open Database, Contents: Database Contents

Contents
IMDB movie details dataset
crawlfeeds.com
csv, zip
Updated Nov 9, 2025
Crawl Feeds (2025). IMDB movie details dataset [Dataset]. https://crawlfeeds.com/datasets/imdb-movie-details-dataset
Explore at:
Available download formats: zip, csv
Dataset updated
Nov 9, 2025
Dataset authored and provided by
Crawl Feeds
License
https://crawlfeeds.com/privacy_policy
Description

The IMDB Movie Details Dataset is a collection of information about movies, TV shows, and streaming content listed on IMDB. It includes detailed data such as titles, release years, genres, cast, crew, ratings, and more, making it a useful resource for film and entertainment enthusiasts. Well suited to data analysis, the dataset supports machine learning projects, predictive modeling, and the study of industry trends.

Researchers can explore patterns in movie ratings and genre popularity, while developers can use the dataset to build recommendation systems or applications. Movie buffs can dive deep into historical and contemporary trends in the world of cinema. This dataset not only supports academic and professional pursuits but also opens doors for creative projects in storytelling, content creation, and audience engagement. Whether you’re a developer, researcher, or film enthusiast, the IMDB movie dataset is a powerful tool for uncovering trends and gaining deeper insights into the evolving entertainment landscape.
IMDB Movie Ratings Dataset
kaggle.com
zip
Updated Jan 17, 2023
The Devastator (2023). IMDB Movie Ratings Dataset [Dataset]. https://www.kaggle.com/datasets/thedevastator/imdb-movie-ratings-dataset
Explore at:
Available download formats: zip (319960 bytes)
Dataset updated
Jan 17, 2023
Authors
The Devastator
Description
IMDB Movie Ratings Dataset

Evaluating Directors, Actors, Genres, and Movie Titles

By Himanshu Sekhar Paul [source]

About this dataset

This IMDB Movie Dataset is a database of movie ratings featuring director_name, duration, actor_2_name, genres, actor_1_name, movie title, and more. Whether you're a fan of dramatic thrillers or nostalgic '90s classics, here you'll find information about the most-voted movies from users across the world. Explore num_voted_users trends and find the language each movie was released in to build your own film library of country-specific titles released in any given year. With this dataset at your disposal, comparing IMDb scores has never been easier.

How to use the dataset

This dataset offers a comprehensive overview of the movie ratings from IMDB. It includes data about director name, duration, actors, genres, movie title, number of votes, language, country of origin, year released and IMDB score.

To use this dataset to get a deeper understanding of how movies are rated on IMDB you can take the following steps:

Look through each column of the data to get an overall understanding. This will help you identify any specific trends or correlations in the data that you can then analyze further in later steps.

Take some time to explore relationships between columns such as 'Number Voted Users' and 'IMDB Score'; looking at how these numbers relate to each other can help you better understand rating trends on IMDB.

Analyze how particular sub-groups perform within categories such as genre or country; this could provide insight into preferences for certain types of movies, or reveal countries associated with higher scores than others.

Through your analysis, try to answer questions about specific demographic groups on IMDB: are there distinct preferences among age groups in what they watch? Are there clear correlations between rating and genre within certain countries?

By working through the questions above and taking an initial big-picture view before moving to more detailed analysis, users should be able to find value in this dataset by uncovering useful insights about movie ratings on IMDB.
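
The sub-group step above can be sketched as a per-genre average. This is only an illustration on made-up rows: `genres` and `num_voted_users` appear in the column list for this dataset, while the `imdb_score` column name and the `|` separator between genres are assumptions to verify against the actual file.

```python
from collections import defaultdict

# Hypothetical rows. `imdb_score` and the "|" genre separator are assumptions.
rows = [
    {"movie_title": "Film A", "genres": "Action|Thriller", "imdb_score": 7.0, "num_voted_users": 1000},
    {"movie_title": "Film B", "genres": "Drama", "imdb_score": 8.0, "num_voted_users": 500},
    {"movie_title": "Film C", "genres": "Action|Drama", "imdb_score": 6.0, "num_voted_users": 3000},
]


def mean_score_by_genre(rows, sep="|"):
    """Average score per genre; a multi-genre movie counts once for each of its genres."""
    totals = defaultdict(lambda: [0.0, 0])  # genre -> [sum of scores, movie count]
    for row in rows:
        for genre in row["genres"].split(sep):
            totals[genre][0] += row["imdb_score"]
            totals[genre][1] += 1
    return {genre: total / count for genre, (total, count) in totals.items()}


by_genre = mean_score_by_genre(rows)
for genre, score in sorted(by_genre.items(), key=lambda item: item[1], reverse=True):
    print(f"{genre}: {score:.2f}")
```

The same grouping works for country or language by swapping the key; with few movies per group the averages are noisy, so it may help to report the count alongside each mean.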

Research Ideas

Movie Recommendation System: The dataset can be used to build a movie recommendation system using machine learning algorithms like k-nearest neighbors or collaborative filtering. Based on the user's past ratings, the system can suggest relevant movies with similar genres, actors and directors.

Movie Popularity Index: Using the data, a metric could be designed that provides an overall popularity index for movies released over the years. This index could be constructed from factors such as IMDb score, number of votes, and number of reviews collected.

Genre-based Over/Under Performance Analysis: Based on the genres of each year's movies, this dataset can show which genres are performing well and which are not. This kind of analysis could inform decisions about allocating production budgets or marketing campaigns for upcoming films in different genres across different regions or markets.
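
The Movie Popularity Index idea above could be prototyped in several ways; the description does not specify a formula. One common choice, used here purely as an assumption, is a Bayesian-average (shrinkage) rating that pulls scores with few votes toward the global mean. The sample values and the `MIN_VOTES` threshold are made up.

```python
def weighted_rating(score, votes, global_mean, min_votes):
    """Shrink a movie's score toward the global mean when it has few votes."""
    return (votes * score + min_votes * global_mean) / (votes + min_votes)


# (title, score, number of votes): hypothetical values for illustration only.
movies = [
    ("Film A", 9.0, 10),
    ("Film B", 8.0, 10000),
    ("Film C", 6.0, 500),
]

global_mean = sum(score for _, score, _ in movies) / len(movies)
MIN_VOTES = 1000  # assumed weight of the prior; tune for the real data

index = {
    title: weighted_rating(score, votes, global_mean, MIN_VOTES)
    for title, score, votes in movies
}
ranked = sorted(index, key=index.get, reverse=True)

for title in ranked:
    print(f"{title}: {index[title]:.2f}")
```

With these numbers, the highly rated but barely voted "Film A" ranks below "Film B", which is the behaviour the shrinkage is meant to produce.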

Acknowledgements

If you use this dataset in your research, please credit the original authors.

License

See the dataset description for more information.

Columns

File: movie_data.csv

| Column name | Description |
|:-------------------------|:---------------------------------------------------|
| director_name | Name of the director of the movie. (String) |
| duration | Length of the movie in minutes. (Integer) |
| actor_2_name | Name of the second actor in the movie. (String) |
| genres | Genre of the movie. (String) |
| actor_1_name | Name of the first actor in the movie. (String) |
| movie_title | Title of the movie. (String) |
| num_voted_users | Number of users who voted for the movie. (Integer) |
| actor_3_name | Name of the third actor in the movie. (String) |
| movie_imdb_link | Link to the movie's IMDB page. (String) |
| num_user_for_reviews | ... |
Rotten Tomatoes Movie Dataset – Clean Movie Metadata
crawlfeeds.com
csv, zip
Updated Nov 9, 2025
Crawl Feeds (2025). Rotten Tomatoes Movie Dataset – Clean Movie Metadata [Dataset]. https://crawlfeeds.com/datasets/rotten-tomatoes-movie-dataset-clean-movie-metadata
Explore at:
Available download formats: csv, zip
Dataset updated
Nov 9, 2025
Dataset authored and provided by
Crawl Feeds
License
https://crawlfeeds.com/privacy_policy
Description
We provide a high-quality Rotten Tomatoes movie dataset that includes key metadata for thousands of movies. This dataset is ideal for anyone working with movie-related platforms, entertainment analytics, content curation, or movie discovery tools.

Our collection is structured, clean, and designed to support real-time apps, dashboards, and research use cases.

What the Dataset Includes

Each record in the dataset contains core information pulled directly from Rotten Tomatoes, including:

Movie Name – The official title of the movie.

Poster URL – High-resolution image link to the movie poster.

Trailer URL – Direct link to the official trailer (when available).

Genre – One or more genres associated with the movie, such as Action, Drama, Comedy, or Horror.

Release Date – The date the movie was released to the public.

Actors – Main cast members listed on Rotten Tomatoes.

Directors – Director(s) responsible for the movie.

Rating – Audience or critic scores, where available.

Broad Coverage

This dataset spans a wide range of movies across all major genres and decades. From modern releases to timeless classics, from Hollywood blockbusters to independent films — we’ve included movies of all types with relevant data points.

You can expect data on:

U.S. theatrical releases

Netflix, Amazon, and other streaming exclusives

Festival films and limited releases

Animated and documentary films

Use Cases

Here are just a few ways this dataset can be useful:

Movie Recommendation Engines – Use metadata and genre info to power personalized movie suggestions.

Entertainment Search Tools – Build searchable movie listings with visual poster previews and trailer links.

Data Visualization Projects – Create dashboards showing trends by genre, release periods, or actor participation.

AI/ML Training – Use metadata to train classification models or sentiment prediction tools.

Research & Academic Use – Analyze patterns in movie releases, cast dynamics, and genre evolution.

Why Use Our Dataset?

Clean & ready-to-use: No raw HTML, just clean structured data.

Minimal but meaningful fields: Focused on useful movie attributes without clutter.

Updated info: Covers both classic and current titles.

Simple integration: Easy to use for developers, analysts, and product teams.

If you're working on a movie-based product or looking for reliable film metadata for your project, this dataset offers an ideal foundation.

Let us know if you’d like to explore it further.
imdb-genres
huggingface.co
Updated Sep 18, 2024
Jack Quigley (2024). imdb-genres [Dataset]. https://huggingface.co/datasets/jquigl/imdb-genres
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 18, 2024
Authors
Jack Quigley
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
Dataset Card for IMDb Movie Dataset: All Movies by Genre

Dataset Summary

This dataset is an adapted version of "IMDb Movie Dataset: All Movies by Genre" found at: https://www.kaggle.com/datasets/rajugc/imdb-movies-dataset-based-on-genre?select=history.csv. Within the dataset, the movie title and year columns were combined, the genre was extracted from the separate csv files, the pre-existing genre column was renamed to expanded-genres, any movies missing a description… See the full description on the dataset page: https://huggingface.co/datasets/jquigl/imdb-genres.
movie-dataset
huggingface.co
Updated Mar 30, 2025
+ more versions
Mustafa UZUMCU (2025). movie-dataset [Dataset]. https://huggingface.co/datasets/Musss0/movie-dataset
Explore at:
Dataset updated
Mar 30, 2025
Authors
Mustafa UZUMCU
Description
Musss0/movie-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
Movies dataset from allmovie
crawlfeeds.com
json, zip
Updated Dec 26, 2024
Crawl Feeds (2024). Movies dataset from allmovie [Dataset]. https://crawlfeeds.com/datasets/movies-dataset-form-allmovie
Explore at:
Available download formats: json, zip
Dataset updated
Dec 26, 2024
Dataset authored and provided by
Crawl Feeds
License
https://crawlfeeds.com/privacy_policy
Description
Movies Dataset from AllMovie is a comprehensive collection featuring over 430,000 records, encompassing a wide range of films across various genres and languages. This extensive dataset includes essential data points such as movie titles, genres, release dates, posters, languages, directors, durations, synopses, trailers, average ratings, cast information, and URLs. Such detailed metadata is invaluable for developers, researchers, and enthusiasts aiming to analyze trends, build recommendation systems, or conduct in-depth studies of the film industry.

For those interested in alternative datasets, the IMDb Non-Commercial Datasets provide subsets of IMDb data accessible for personal and non-commercial use. These datasets allow users to hold local copies of movie information, facilitating various analytical projects.

Additionally, the MovieLens datasets offer a range of movie rating data suitable for research purposes. For instance, the MovieLens 20M dataset comprises 20 million ratings and 465,000 tag applications applied to 27,000 movies by 138,000 users, making it a valuable resource for studies in user preferences and recommendation algorithms.

Incorporating these datasets into your projects can significantly enhance the quality and depth of your analyses, providing a solid foundation for exploring various aspects of the cinematic world.

Why Choose Crawl Feeds for Your Data Needs?

Crawl Feeds is your trusted partner in acquiring high-quality, curated datasets tailored to your specific requirements. With a vast repository that includes the Movies Dataset, we empower developers and businesses to drive innovation. Explore our easy-to-use platform and transform your ideas into actionable insights.

Get Started with Crawl Feeds Today
movie-posters-dataset
huggingface.co
Updated Nov 30, 2024
+ more versions
Yashpreet Voladoddi (2024). movie-posters-dataset [Dataset]. https://huggingface.co/datasets/yashvoladoddi37/movie-posters-dataset
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 30, 2024
Authors
Yashpreet Voladoddi
Description
yashvoladoddi37/movie-posters-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
IMDB & TMDB Movie Metadata Big Dataset (over 1M)
kaggle.com
zip
Updated Aug 5, 2024
Shubham Chandra (2024). IMDB & TMDB Movie Metadata Big Dataset (over 1M) [Dataset]. https://www.kaggle.com/datasets/shubhamchandra235/imdb-and-tmdb-movie-metadata-big-dataset-1m
Explore at:
Available download formats: zip (416807108 bytes)
Dataset updated
Aug 5, 2024
Authors
Shubham Chandra
License
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Description
Title: IMDB & TMDB Movie Metadata Big Dataset (>1M)

Subtitle: A Comprehensive Dataset Featuring Detailed Metadata of Movies (IMDB, TMDB). Over 1M Rows & 42 Features: Metadata, Ratings, Genres, Cast, Crew, Sentiment Analysis and many more...

Detailed Description:

Overview: This comprehensive dataset merges the extensive film data available from both IMDB and TMDB, offering a rich resource for movie enthusiasts, data scientists, and researchers. With over 1 million rows and 42 detailed features, this dataset provides in-depth information about a wide variety of movies, spanning different genres, periods, and production backgrounds.

File Information:
1. File Size: ≈ 1GB
2. Format: CSV (Comma-Separated Values)

Column Descriptors/Key Features:
1. ID: Unique identifier for each movie.
2. Title: The official title of the movie.
3. Vote Average: Average rating received by the movie.
4. Vote Count: Number of votes the movie has received.
5. Status: Current status of the movie (e.g., Released, Post-Production).
6. Release Date: Official release date of the movie.
7. Revenue: Box office revenue generated by the movie.
8. Runtime: Duration of the movie in minutes.
9. Adult: Indicates if the movie is for adults.
10. Genres: List of genres the movie belongs to.
11. Overview Sentiment: Sentiment analysis of the movie's overview text.
12. Cast: List of main actors in the movie.
13. Crew: List of key crew members, including directors, producers, and writers.
14. Genres List: Detailed genres in list format.
15. Keywords: List of relevant keywords associated with the movie.
16. Director of Photography: Name of the cinematographer.
17. Producers: Names of the producers.
18. Music Composer: Name of the music composer.

Additional Features:

Unnamed 0: Index column.

Star1, Star2, Star3, Star4: Names of the top-billed stars.

Writer: Name(s) of the writer(s).

Original Language: Original language of the movie.

Original Title: Original title if different from the main title.

Popularity: Popularity score of the movie.

Budget: Budget allocated for the movie.

Tagline: Promotional tagline of the movie.

Production Companies: Companies involved in the production.

Production Countries: Countries where the movie was produced.

Spoken Languages: Languages spoken in the movie.

Homepage: Official website of the movie.

IMDB ID: Unique identifier on IMDB.

TMDB ID: Unique identifier on TMDB.

Video: Indicates if there is a video associated.

Poster Path: Path to the movie poster image.

Backdrop Path: Path to the backdrop image.

Release Year: Year the movie was released.

Collection Name: Name of the collection the movie belongs to.

Collection ID: Unique identifier for the collection.

Genres ID: Unique identifier for the genres.

Original Language Code: Code for the original language.

Overview: Brief summary of the movie.

All Combined Keywords: Combined keywords in a single field.

Potential Use Cases:
- Sentiment Analysis: Analyze audience sentiment towards movies based on reviews and ratings.
- Recommendation Systems: Build models to recommend movies based on user preferences and viewing history.
- Market Analysis: Study trends in the movie industry, including genre popularity and revenue patterns.
- Content Analysis: Investigate the thematic content and diversity of movies over time.
- Data Visualization: Create visual representations of movie data to uncover hidden insights.
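
The recommendation use case above can be approached in many ways. As a minimal sketch, and not the dataset author's method, the snippet below ranks movies by Jaccard similarity of their genre sets, roughly what the "Genres List" column could feed. The titles and genre lists are made up, and the helper names `jaccard` and `recommend` are hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity of two collections: |intersection| / |union|."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical catalog mapping a title to its list of genres.
catalog = {
    "Film A": ["Action", "Thriller"],
    "Film B": ["Action", "Thriller", "Crime"],
    "Film C": ["Romance", "Comedy"],
    "Film D": ["Action", "Comedy"],
}


def recommend(title, catalog, k=2):
    """Return the k titles whose genre sets overlap most with the given title."""
    target = catalog[title]
    scored = [
        (jaccard(target, genres), other)
        for other, genres in catalog.items()
        if other != title
    ]
    # Highest similarity first; ties broken alphabetically for stable output.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [other for _, other in scored[:k]]


print(recommend("Film A", catalog))
```

Genre overlap alone is a coarse signal; keywords, cast, or user ratings would usually be combined with it, and comparing every pair of titles does not scale to a million rows without indexing.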
IMDB-Dataset-of-50K-Movie-Reviews-Backup
huggingface.co
Q-b1t, IMDB-Dataset-of-50K-Movie-Reviews-Backup [Dataset]. https://huggingface.co/datasets/Q-b1t/IMDB-Dataset-of-50K-Movie-Reviews-Backup
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Q-b1t
License
https://choosealicense.com/licenses/other/
Description
Q-b1t/IMDB-Dataset-of-50K-Movie-Reviews-Backup dataset hosted on Hugging Face and contributed by the HF Datasets community
Websites using Movie Database
webtechsurvey.com
csv
Updated Oct 13, 2025
+ more versions
WebTechSurvey (2025). Websites using Movie Database [Dataset]. https://webtechsurvey.com/technology/movie-database
Explore at:
Available download formats: csv
Dataset updated
Oct 13, 2025
Dataset authored and provided by
WebTechSurvey
License
https://webtechsurvey.com/terms
Time period covered
2025
Area covered
Global
Description
A complete list of live websites using the Movie Database technology, compiled through global website indexing conducted by WebTechSurvey.
IMDb Movies Metadata Dataset – 4.5M Records (Global Coverage)
crawlfeeds.com
csv, zip
Updated Nov 9, 2025
Crawl Feeds (2025). IMDb Movies Metadata Dataset – 4.5M Records (Global Coverage) [Dataset]. https://crawlfeeds.com/datasets/imdb-movies-metadata-dataset-4-5m-records-global-coverage
Explore at:
Available download formats: csv, zip
Dataset updated
Nov 9, 2025
Dataset authored and provided by
Crawl Feeds
License
https://crawlfeeds.com/privacy_policy
Description
Unlock one of the most comprehensive movie datasets available—4.5 million structured IMDb movie records, extracted and enriched for data science, machine learning, and entertainment research.

This dataset includes a vast collection of global movie metadata, including details on title, release year, genre, country, language, runtime, cast, directors, IMDb ratings, reviews, and synopsis. Whether you're building a recommendation engine, benchmarking trends, or training AI models, this dataset is designed to give you deep and wide access to cinematic data across decades and continents.

Perfect for use in film analytics, OTT platforms, review sentiment analysis, knowledge graphs, and LLM fine-tuning, the dataset is cleaned, normalized, and exportable in multiple formats.

What’s Included:

Genres: Drama, Comedy, Horror, Action, Sci-Fi, Documentary, and more

Delivery: Direct download

Use Cases:

Train LLMs or chatbots on cinematic language and metadata

Build or enrich movie recommendation engines

Run cross-lingual or multi-region film analytics

Benchmark genre popularity across time periods

Power academic studies or entertainment dashboards

Feed into knowledge graphs, search engines, or NLP pipelines
movie-dataset
huggingface.co
Updated Jul 18, 2023
Vivek Eswaran (2023). movie-dataset [Dataset]. https://huggingface.co/datasets/veswaran/movie-dataset
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 18, 2023
Authors
Vivek Eswaran
License
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Description
veswaran/movie-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
movie
huggingface.co
Updated Mar 30, 2025
Masterclass (2025). movie [Dataset]. https://huggingface.co/datasets/mc-ai/movie
Explore at:
Dataset updated
Mar 30, 2025
Dataset authored and provided by
Masterclass
Description
mc-ai/movie dataset hosted on Hugging Face and contributed by the HF Datasets community
Movie Data - X - Test - w2v
data.researchdatafinder.qut.edu.au
Updated Apr 8, 2018
(2018). Movie Data - X - Test - w2v [Dataset]. https://data.researchdatafinder.qut.edu.au/dataset/survey-word-vector/resource/e638fc06-7ef3-4a41-85e2-21f7fad2dfb3
Explore at:
Dataset updated
Apr 8, 2018
License
http://researchdatafinder.qut.edu.au/display/n15252
Description
This file contains the features for the test portion of the movie dataset. The data has been converted into an average word vector. This is 50% of the total movie results. QUT Research Data Repository Dataset Resource available for download
Large Movie Review Dataset
ieee-dataport.org
Updated Jul 17, 2025
+ more versions
Tasnim Akter Onisha (2025). Large Movie Review Dataset [Dataset]. https://ieee-dataport.org/documents/large-movie-review-dataset
Explore at:
Dataset updated
Jul 17, 2025
Authors
Tasnim Akter Onisha
Description
contains 50
imdb_reviews
tensorflow.org
kaggle.com
Updated Sep 20, 2024
+ more versions
(2024). imdb_reviews [Dataset]. https://www.tensorflow.org/datasets/catalog/imdb_reviews
Explore at:
Dataset updated
Sep 20, 2024
Description
Large Movie Review Dataset. This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well.

To use this dataset:

    import tensorflow_datasets as tfds

    ds = tfds.load('imdb_reviews', split='train')
    for ex in ds.take(4):
        print(ex)

See the guide for more information on tensorflow_datasets.

Full TMDB Movies Dataset 2024 (1M Movies)

Complete dataset containing movie data from TMDb. Updated Daily

Explore at:

3 scholarly articles cite this dataset (View in Google Scholar)
