100+ datasets found

IMDB Movies Dataset
kaggle.com
Updated Feb 1, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Harshit Shankhdhar (2021). IMDB Movies Dataset [Dataset]. https://www.kaggle.com/datasets/harshitshankhdhar/imdb-dataset-of-top-1000-movies-and-tv-shows
Explore at:
Dataset updated
Feb 1, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Harshit Shankhdhar
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Context

IMDB Dataset of top 1000 movies and tv shows. You can find the EDA Process on - https://www.kaggle.com/harshitshankhdhar/eda-on-imdb-movies-dataset

Please consider UPVOTE if you found it useful.

Content

Data:- - Poster_Link - Link of the poster that imdb using - Series_Title = Name of the movie - Released_Year - Year at which that movie released - Certificate - Certificate earned by that movie - Runtime - Total runtime of the movie - Genre - Genre of the movie - IMDB_Rating - Rating of the movie at IMDB site - Overview - mini story/ summary - Meta_score - Score earned by the movie - Director - Name of the Director - Star1,Star2,Star3,Star4 - Name of the Stars - No_of_votes - Total number of votes - Gross - Money earned by that movie

Inspiration

Analysis of the gross of a movie vs directors.

Analysis of the gross of a movie vs different - different stars.

Analysis of the No_of_votes of a movie vs directors.

Analysis of the No_of_votes of a movie vs different - different stars.

Which actor prefer which Genre more?

Which combination of actors are getting good IMDB_Rating maximum time?

Which combination of actors are getting good gross?
IMDB Dataset of 50K Movie Reviews
kaggle.com
bbg33.com
Updated Mar 12, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lakshmipathi N (2019). IMDB Dataset of 50K Movie Reviews [Dataset]. https://www.kaggle.com/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews/data
Explore at:
Dataset updated
Mar 12, 2019
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Lakshmipathi N
Description
IMDB dataset having 50K movie reviews for natural language processing or Text analytics. This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training and 25,000 for testing. So, predict the number of positive and negative reviews using either classification or deep learning algorithms. For more dataset information, please go through the following link, http://ai.stanford.edu/~amaas/data/sentiment/
P
IMDb Movie Reviews Dataset
todaynewsss.org
paperswithcode.com
Updated Apr 15, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Andrew L. Maas; Raymond E. Daly; Peter T. Pham; Dan Huang; Andrew Y. Ng; Christopher Potts (2024). IMDb Movie Reviews Dataset [Dataset]. https://www.todaynewsss.org/?_=%2Fdataset%2Fimdb-movie-reviews%23KJWqMdlUlBn7Kv0QQg73jp%2F9e45kHhGipO4%3D
Explore at:
Dataset updated
Apr 15, 2024
Authors
Andrew L. Maas; Raymond E. Daly; Peter T. Pham; Dan Huang; Andrew Y. Ng; Christopher Potts
Description
The IMDb Movie Reviews dataset is a binary sentiment analysis dataset consisting of 50,000 reviews from the Internet Movie Database (IMDb) labeled as positive or negative. The dataset contains an even number of positive and negative reviews. Only highly polarizing reviews are considered. A negative review has a score ≤ 4 out of 10, and a positive review has a score ≥ 7 out of 10. No more than 30 reviews are included per movie. The dataset contains additional unlabeled data.
T
imdb_reviews
tensorflow.org
+1more
Updated Dec 10, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). imdb_reviews [Dataset]. https://www.tensorflow.org/datasets/catalog/imdb_reviews
Explore at:
Dataset updated
Dec 10, 2022
Description
Large Movie Review Dataset. This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well.

To use this dataset:

import tensorflow_datasets as tfds ds = tfds.load('imdb_reviews', split='train') for ex in ds.take(4): print(ex)

See the guide for more informations on tensorflow_datasets.
IMDb Movie Reviews Dataset
ieee-dataport.org
Updated Aug 2, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aditya Pal (2022). IMDb Movie Reviews Dataset [Dataset]. http://doi.org/10.21227/zm1y-b270
Explore at:
Unique identifier
https://doi.org/10.21227/zm1y-b270
Dataset updated
Aug 2, 2022
Dataset provided by
Institute of Electrical and Electronics Engineershttp://www.ieee.ro/
Authors
Aditya Pal
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains nearly 1 Million unique movie reviews from 1150 different IMDb movies spread across 17 IMDb genres - Action, Adventure, Animation, Biography, Comedy, Crime, Drama, Fantasy, History, Horror, Music, Mystery, Romance, Sci-Fi, Sport, Thriller and War. The dataset also contains movie metadata such as date of release of the movie, run length, IMDb rating, movie rating (PG-13, R, etc), number of IMDb raters, and number of reviews per movie.
h
IMDB-Dataset-of-50K-Movie-Reviews-Backup
huggingface.co
Updated Jul 15, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Q-b1t (2023). IMDB-Dataset-of-50K-Movie-Reviews-Backup [Dataset]. https://huggingface.co/datasets/Q-b1t/IMDB-Dataset-of-50K-Movie-Reviews-Backup
Explore at:
Dataset updated
Jul 15, 2023
Authors
Q-b1t
License
https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/
Description
Q-b1t/IMDB-Dataset-of-50K-Movie-Reviews-Backup dataset hosted on Hugging Face and contributed by the HF Datasets community
IMDB dataset (Sentiment analysis) in CSV format
kaggle.com
Updated Nov 28, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ziqi Yuan (2019). IMDB dataset (Sentiment analysis) in CSV format [Dataset]. https://www.kaggle.com/datasets/columbine/imdb-dataset-sentiment-analysis-in-csv-format
Explore at:
Dataset updated
Nov 28, 2019
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Ziqi Yuan
License
https://www.worldbank.org/en/about/legal/terms-of-use-for-datasetshttps://www.worldbank.org/en/about/legal/terms-of-use-for-datasets
Description
Context

A movie review dataset. NLP tasks Sentiment Analysis.

Note : all the movie review are long sentence(most of them are longer than 200 words.)

Content

two columns used (text : the review of the movie and label : the sentiment label of the movie review)
IMDB Dataset 2023
kaggle.com
Updated Feb 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Adrian Kieżuń (2024). IMDB Dataset 2023 [Dataset]. https://www.kaggle.com/datasets/adriankiezun/imdb-dataset-2023
Explore at:
Dataset updated
Feb 29, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Adrian Kieżuń
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset containing information about movies which appears on IMDB website. Data was obtained by means of a web scraping in Python and combined with repository shared by IMDB. Data was preprocessed to include only movies which were released after 1970 and currently have over 50 000 ratings. Additionally there were selected only these movies whose budgets and gross' are denominated in USD to avoid discrepancies. Dataset contains 3348 observations described by 12 attributes.

Attributes 1. id - movie's ID used by IMDB repository 2. primaryTitle - title in English 3. originalTitle - original title in native language 4. isAdult - parental guidance 5. runtimeMinutes - total runtime in minutes 6. genres - genres 7. averageRating - final rating, based on all the ratings 8. numVotes - total number of votes (ratings) 9. budget - total budget in USD 10. gross - total gross worldwide in USD 11. release_date - release date, first occurrence 12. directors - directors

LAST UPDATE: 12th November 2023
P
IMDB-MULTI Dataset
paperswithcode.com
opendatalab.com
+1more
Updated Sep 1, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pinar Yanardag; S. V. N. Vishwanathan (2021). IMDB-MULTI Dataset [Dataset]. https://paperswithcode.com/dataset/imdb-multi
Explore at:
Dataset updated
Sep 1, 2021
Authors
Pinar Yanardag; S. V. N. Vishwanathan
Description
IMDB-MULTI is a relational dataset that consists of a network of 1000 actors or actresses who played roles in movies in IMDB. A node represents an actor or actress, and an edge connects two nodes when they appear in the same movie. In IMDB-MULTI, the edges are collected from three different genres: Comedy, Romance and Sci-Fi.
h
imdb-javanese
huggingface.co
Updated Feb 12, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Wilson Wongso (2022). imdb-javanese [Dataset]. https://huggingface.co/datasets/w11wo/imdb-javanese
Explore at:
Dataset updated
Feb 12, 2022
Authors
Wilson Wongso
License
https://choosealicense.com/licenses/odbl/https://choosealicense.com/licenses/odbl/
Description
Large Movie Review Dataset translated to Javanese. This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well. We translated the original IMDB Dataset to Javanese using the multi-lingual MarianMT Transformer model from Helsinki-NLP/opus-mt-en-mul.
k
IMDb-Movie-Dataset--All-Movies-by-Genre
kaggle.com
Updated Feb 18, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). IMDb-Movie-Dataset--All-Movies-by-Genre [Dataset]. https://www.kaggle.com/datasets/rajugc/imdb-movies-dataset-based-on-genre
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 18, 2023
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
Context:

The movie industry is a vast and ever-growing landscape, with countless movies being produced each year. Keeping track of all these movies and their characteristics can be a daunting task for researchers, film enthusiasts, and data scientists alike. That's where a comprehensive dataset that lists all movies and their genre can come in handy.

Column Description:

movie_id - IMDB Movie ID

movie_name - Name of the movie

year - Release year

certificate - Certificate of the movie

run_time - Total movie run time

genre - Genre of the movie

rating - Rating of the movie

description - Description of the movie

director - Director of the movie

director_id - IMDB id of the director

star - Star of the movie

star_id - IMDB id of the star

votes - Number of votes in IMDB website

gross(in $) - Gross Box Office of the movie

Source:

The primary source for an IMDb all movies dataset based on genre would be IMDb, the world's most popular and authoritative source for movie, TV, and celebrity content. IMDb has an extensive database of movies that is constantly updated with new titles and information.

Inspiration:

Creating an IMDb all movies dataset based on genre can provide a wealth of insights and opportunities for analysis. For example, researchers could use the dataset to study trends in movie genres over time or compare the characteristics of different genres. Film enthusiasts could use the dataset to discover new movies in their favorite genres or explore movies outside of their usual comfort zone. Data scientists could use the dataset to build predictive models or recommend movies to users based on their genre preferences. Overall, an IMDb all movies dataset based on genre has the potential to unlock a wealth of knowledge and insights about the movie industry.
H
PostgreSQL Dump of IMDB Data for JOB Workload
dataverse.harvard.edu
search.dataone.org
bin
Updated Sep 24, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Harvard Dataverse (2019). PostgreSQL Dump of IMDB Data for JOB Workload [Dataset]. http://doi.org/10.7910/DVN/2QYZBT
Explore at:
bin(1277543282)Available download formats
Unique identifier
https://doi.org/10.7910/DVN/2QYZBT
Dataset updated
Sep 24, 2019
Dataset provided by
Harvard Dataverse
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This is a dump generated by pg_dump -Fc of the IMDb data used in the "How Good are Query Optimizers, Really?" paper. PostgreSQL compatible SQL queries and scripts to automatically create a VM with this dataset can be found here: https://git.io/imdb
c
IMDB movie details dataset
crawlfeeds.com
json, zip
Updated Apr 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Crawl Feeds (2024). IMDB movie details dataset [Dataset]. https://crawlfeeds.com/datasets/imdb-movie-details-dataset
Explore at:
json, zipAvailable download formats
Dataset updated
Apr 6, 2024
Dataset authored and provided by
Crawl Feeds
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
IMDb is an online database of information related to films, television programs, home videos, video games, and streaming content online. Crawl feeds team crawled more than 300K+ records for research and analysis purposes.

Contact crawl feeds team to customize dataset as per your needs like format changes, data frequency, and adding or removing fields.
d
IMDB Top 250 Lists and 5000 plus IMDB records
data.world
csv, zip
Updated Mar 22, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TheMitchWorksPro (2024). IMDB Top 250 Lists and 5000 plus IMDB records [Dataset]. https://data.world/studentoflife/imdb-top-250-lists-and-5000-or-so-data-records
Explore at:
csv, zipAvailable download formats
Dataset updated
Mar 22, 2024
Authors
TheMitchWorksPro
Time period covered
1916 - 2017
Area covered

Description
This dataset is a work in progress. It includes data that was screen scraped using jsonlite and XML libraries in R and an open API through OMDB from the IMDB website. Movie IDs to help gather much of this data come from one or two Kaggle projects. There is a workflow from original cobbled together spreadsheets to the final product with 27 variables and over 5000 observations.

More detail on this data will be provided later in the project this data was gathered for. Stay tuned ...
P
IMDB-BINARY Dataset
paperswithcode.com
opendatalab.com
Updated May 15, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pinar Yanardag; S. V. N. Vishwanathan (2021). IMDB-BINARY Dataset [Dataset]. https://paperswithcode.com/dataset/imdb-binary
Explore at:
Dataset updated
May 15, 2021
Authors
Pinar Yanardag; S. V. N. Vishwanathan
Description
IMDB-BINARY is a movie collaboration dataset that consists of the ego-networks of 1,000 actors/actresses who played roles in movies in IMDB. In each graph, nodes represent actors/actress, and there is an edge between them if they appear in the same movie. These graphs are derived from the Action and Romance genres.
a
IMDb Large Movie Review Dataset
academictorrents.com
bittorrent
Updated Oct 20, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Andrew L. Maas et al., 2011 (2018). IMDb Large Movie Review Dataset [Dataset]. https://academictorrents.com/details/fd24bc44d461b10288469e05a64a8344eb079f15
Explore at:
bittorrentAvailable download formats
Dataset updated
Oct 20, 2018
Dataset authored and provided by
Andrew L. Maas et al., 2011
License
https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified
Description
A dataset for binary sentiment classification containing 25,000 highly polarized movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well.
h
mini-imdb
huggingface.co
Updated Sep 30, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Daniel Vila (2022). mini-imdb [Dataset]. https://huggingface.co/datasets/dvilasuero/mini-imdb
Explore at:
Dataset updated
Sep 30, 2022
Authors
Daniel Vila
Description
dvilasuero/mini-imdb dataset hosted on Hugging Face and contributed by the HF Datasets community
IMDb Users' Ratings Dataset
ieee-dataport.org
Updated Dec 27, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vahid Baghi (2020). IMDb Users' Ratings Dataset [Dataset]. http://doi.org/10.21227/br41-bd49
Explore at:
Unique identifier
https://doi.org/10.21227/br41-bd49
Dataset updated
Dec 27, 2020
Dataset provided by
Institute of Electrical and Electronics Engineershttp://www.ieee.ro/
Authors
Vahid Baghi
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains 4669820 ratings from 1499238 users to 351109 movies on the imdb.com website. This data is collected from reviews (https://www.imdb.com/review/rw0000001/). Each row in this dataset is as follows:userID, movieID, rating, review dateFor example : ur18238764, tt2177461, 9, 22 January 2019
P
IMDb-Face Dataset
paperswithcode.com
Updated Jul 30, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fei Wang; Liren Chen; Cheng Li; Shiyao Huang; Yanjie Chen; Chen Qian; Chen Change Loy (2018). IMDb-Face Dataset [Dataset]. https://paperswithcode.com/dataset/imdb-face
Explore at:
Dataset updated
Jul 30, 2018
Authors
Fei Wang; Liren Chen; Cheng Li; Shiyao Huang; Yanjie Chen; Chen Qian; Chen Change Loy
Description
IMDb-Face is large-scale noise-controlled dataset for face recognition research. The dataset contains about 1.7 million faces, 59k identities, which is manually cleaned from 2.0 million raw images. All images are obtained from the IMDb website.
Sentiment analysis in Galaxy with IMDB movie review dataset
zenodo.org
tsv
Updated Aug 4, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kaivan Kamali; Kaivan Kamali (2022). Sentiment analysis in Galaxy with IMDB movie review dataset [Dataset]. http://doi.org/10.5281/zenodo.4477881
Explore at:
tsvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.4477881
Dataset updated
Aug 4, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Kaivan Kamali; Kaivan Kamali
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
IMDB movie review sentiment classification dataset (Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Y. Ng, and Christopher Potts. (2011). Learning Word Vectors for Sentiment Analysis. The 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011)). For more information please refer to: https://ai.stanford.edu/~amaas/data/sentiment/

The IMDB dataset was modified as follows to prepare it for use in a Galaxy Training Tutorial (https://training.galaxyproject.org/):

The top 50 words are excluded (mostly stop words). Included the next 10,000 top words. Reviews are limited to 500 words max (Longer reviews trimmed and shorter reviews are padded). 25,000 reviews are used for training and testing each. Files are in tsv (tab separated value) format to be consumed by Galaxy (www.usegalaxy.org).

Facebook

Twitter

Click to copy link

Link copied

Cite

Harshit Shankhdhar (2021). IMDB Movies Dataset [Dataset]. https://www.kaggle.com/datasets/harshitshankhdhar/imdb-dataset-of-top-1000-movies-and-tv-shows

IMDB Movies Dataset

Top 1000 Movies by IMDB Rating

Explore at:

490 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Feb 1, 2021

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Harshit Shankhdhar

License

CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically

Description

Context

IMDB Dataset of top 1000 movies and tv shows. You can find the EDA Process on - https://www.kaggle.com/harshitshankhdhar/eda-on-imdb-movies-dataset

Please consider UPVOTE if you found it useful.

Content

Data:- - Poster_Link - Link of the poster that imdb using - Series_Title = Name of the movie - Released_Year - Year at which that movie released - Certificate - Certificate earned by that movie - Runtime - Total runtime of the movie - Genre - Genre of the movie - IMDB_Rating - Rating of the movie at IMDB site - Overview - mini story/ summary - Meta_score - Score earned by the movie - Director - Name of the Director - Star1,Star2,Star3,Star4 - Name of the Stars - No_of_votes - Total number of votes - Gross - Money earned by that movie

Inspiration

Analysis of the gross of a movie vs directors.
Analysis of the gross of a movie vs different - different stars.
Analysis of the No_of_votes of a movie vs directors.
Analysis of the No_of_votes of a movie vs different - different stars.
Which actor prefer which Genre more?
Which combination of actors are getting good IMDB_Rating maximum time?
Which combination of actors are getting good gross?

Clear search

Close search

Google apps

Main menu

IMDB Movies Dataset

Context

Content

Inspiration

IMDB Dataset of 50K Movie Reviews

IMDb Movie Reviews Dataset

imdb_reviews

IMDb Movie Reviews Dataset

IMDB-Dataset-of-50K-Movie-Reviews-Backup

IMDB dataset (Sentiment analysis) in CSV format

Context

Content

IMDB Dataset 2023

LAST UPDATE: 12th November 2023

IMDB-MULTI Dataset

imdb-javanese

IMDb-Movie-Dataset--All-Movies-by-Genre

Context:

Column Description:

Source:

Inspiration:

PostgreSQL Dump of IMDB Data for JOB Workload

IMDB movie details dataset

IMDB Top 250 Lists and 5000 plus IMDB records

IMDB-BINARY Dataset

IMDb Large Movie Review Dataset

mini-imdb

IMDb Users' Ratings Dataset

IMDb-Face Dataset

Sentiment analysis in Galaxy with IMDB movie review dataset

IMDB Movies Dataset

Top 1000 Movies by IMDB Rating

Context

Content

Inspiration