100+ datasets found
  1. IMDB Movies Dataset

    • kaggle.com
    Updated Feb 1, 2021
    Cite
    Harshit Shankhdhar (2021). IMDB Movies Dataset [Dataset]. https://www.kaggle.com/datasets/harshitshankhdhar/imdb-dataset-of-top-1000-movies-and-tv-shows
    Dataset updated
    Feb 1, 2021
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Harshit Shankhdhar
    License

    CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
    License information was derived automatically

    Description

    Context

    IMDB Dataset of top 1000 movies and tv shows. You can find the EDA Process on - https://www.kaggle.com/harshitshankhdhar/eda-on-imdb-movies-dataset

    Please consider upvoting if you found it useful.

    Content

    Data:

    • Poster_Link - Link to the poster that IMDb uses
    • Series_Title - Name of the movie
    • Released_Year - Year in which the movie was released
    • Certificate - Certificate earned by the movie
    • Runtime - Total runtime of the movie
    • Genre - Genre of the movie
    • IMDB_Rating - Rating of the movie on the IMDb site
    • Overview - Mini story/summary
    • Meta_score - Score earned by the movie
    • Director - Name of the director
    • Star1, Star2, Star3, Star4 - Names of the stars
    • No_of_votes - Total number of votes
    • Gross - Money earned by the movie

    Inspiration

    • Analysis of the gross of a movie vs. directors.
    • Analysis of the gross of a movie vs. different stars.
    • Analysis of the No_of_votes of a movie vs. directors.
    • Analysis of the No_of_votes of a movie vs. different stars.
    • Which actors prefer which genres?
    • Which combinations of actors most often get a good IMDB_Rating?
    • Which combinations of actors get a good gross?
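
    As an illustration of the "gross vs. directors" idea, here is a minimal sketch that averages Gross per Director. The sample rows are invented for illustration, and the assumption that Gross is stored as a comma-separated string that may be blank is not confirmed by the listing; the helper names parse_gross and mean_gross_by_director are hypothetical.

```python
from collections import defaultdict

# Hypothetical sample rows using the column names listed above.
# The values are made up for illustration; they are not taken from the dataset.
rows = [
    {"Series_Title": "Film A", "Director": "Director X", "Gross": "1,000,000"},
    {"Series_Title": "Film B", "Director": "Director X", "Gross": "3,000,000"},
    {"Series_Title": "Film C", "Director": "Director Y", "Gross": ""},
    {"Series_Title": "Film D", "Director": "Director Y", "Gross": "500,000"},
]


def parse_gross(value):
    """Convert a gross string such as '1,000,000' to an int; return None if blank.

    Assumption: Gross is a comma-separated string and may be empty.
    """
    cleaned = value.replace(",", "").strip()
    return int(cleaned) if cleaned else None


def mean_gross_by_director(records):
    """Average the Gross column per director, skipping rows with no gross value."""
    grosses = defaultdict(list)
    for record in records:
        gross = parse_gross(record["Gross"])
        if gross is not None:
            grosses[record["Director"]].append(gross)
    return {director: sum(v) / len(v) for director, v in grosses.items()}


averages = mean_gross_by_director(rows)
for director, mean_gross in sorted(averages.items()):
    print(director, mean_gross)
```

    The same grouping pattern would apply to No_of_votes or to the Star1 through Star4 columns by changing the key and the value being aggregated.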
  2. IMDB Dataset of 50K Movie Reviews

    • kaggle.com
    • bbg33.com
    Updated Mar 12, 2019
    + more versions
    Cite
    Lakshmipathi N (2019). IMDB Dataset of 50K Movie Reviews [Dataset]. https://www.kaggle.com/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews/data
    Dataset updated
    Mar 12, 2019
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Lakshmipathi N
    Description

    An IMDB dataset of 50K movie reviews for natural language processing or text analytics. This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training and 25,000 for testing. The task is to predict whether each review is positive or negative using either classification or deep learning algorithms. For more information on the dataset, see http://ai.stanford.edu/~amaas/data/sentiment/

  3. IMDb Movie Reviews Dataset

    • todaynewsss.org
    • paperswithcode.com
    Updated Apr 15, 2024
    + more versions
    Cite
    Andrew L. Maas; Raymond E. Daly; Peter T. Pham; Dan Huang; Andrew Y. Ng; Christopher Potts (2024). IMDb Movie Reviews Dataset [Dataset]. https://www.todaynewsss.org/?_=%2Fdataset%2Fimdb-movie-reviews%23KJWqMdlUlBn7Kv0QQg73jp%2F9e45kHhGipO4%3D
    Dataset updated
    Apr 15, 2024
    Authors
    Andrew L. Maas; Raymond E. Daly; Peter T. Pham; Dan Huang; Andrew Y. Ng; Christopher Potts
    Description

    The IMDb Movie Reviews dataset is a binary sentiment analysis dataset consisting of 50,000 reviews from the Internet Movie Database (IMDb) labeled as positive or negative. The dataset contains an even number of positive and negative reviews. Only highly polarizing reviews are considered. A negative review has a score ≤ 4 out of 10, and a positive review has a score ≥ 7 out of 10. No more than 30 reviews are included per movie. The dataset contains additional unlabeled data.
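
    The labeling rule in this description can be sketched as follows. This is an illustration under stated assumptions, not the authors' actual pipeline: the function names and the (movie_id, score) input format are hypothetical, and applying the 30-per-movie cap after labeling is one possible reading of the description.

```python
from collections import Counter


def label_from_score(score):
    """Map a review score out of 10 to a sentiment label using the stated thresholds."""
    if score <= 4:
        return "negative"
    if score >= 7:
        return "positive"
    return None  # scores of 5 and 6 are not polar enough to be labeled


def select_reviews(reviews, limit=30):
    """Keep labeled reviews only, with at most `limit` reviews per movie.

    `reviews` is an iterable of (movie_id, score) pairs (a hypothetical format).
    """
    kept = []
    per_movie = Counter()
    for movie_id, score in reviews:
        label = label_from_score(score)
        if label is None or per_movie[movie_id] >= limit:
            continue
        per_movie[movie_id] += 1
        kept.append((movie_id, score, label))
    return kept


sample = [("m1", 2), ("m1", 5), ("m1", 9), ("m2", 7)]
print(select_reviews(sample, limit=2))
```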

  4. imdb_reviews

    • tensorflow.org
    • +1 more
    Updated Dec 10, 2022
    + more versions
    Cite
    (2022). imdb_reviews [Dataset]. https://www.tensorflow.org/datasets/catalog/imdb_reviews
    Dataset updated
    Dec 10, 2022
    Description

    Large Movie Review Dataset. This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well.

    To use this dataset:

    import tensorflow_datasets as tfds

    # Load the training split and print the first four examples.
    ds = tfds.load('imdb_reviews', split='train')
    for ex in ds.take(4):
        print(ex)
    

    See the guide for more information on tensorflow_datasets.

  5. IMDb Movie Reviews Dataset

    • ieee-dataport.org
    Updated Aug 2, 2022
    Cite
    Aditya Pal (2022). IMDb Movie Reviews Dataset [Dataset]. http://doi.org/10.21227/zm1y-b270
    Dataset updated
    Aug 2, 2022
    Dataset provided by
    Institute of Electrical and Electronics Engineers (http://www.ieee.ro/)
    Authors
    Aditya Pal
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    This dataset contains nearly 1 million unique movie reviews from 1150 different IMDb movies spread across 17 IMDb genres: Action, Adventure, Animation, Biography, Comedy, Crime, Drama, Fantasy, History, Horror, Music, Mystery, Romance, Sci-Fi, Sport, Thriller and War. The dataset also contains movie metadata such as the release date of the movie, run length, IMDb rating, movie rating (PG-13, R, etc.), number of IMDb raters, and number of reviews per movie.

  6. IMDB-Dataset-of-50K-Movie-Reviews-Backup

    • huggingface.co
    Updated Jul 15, 2023
    Cite
    Q-b1t (2023). IMDB-Dataset-of-50K-Movie-Reviews-Backup [Dataset]. https://huggingface.co/datasets/Q-b1t/IMDB-Dataset-of-50K-Movie-Reviews-Backup
    Dataset updated
    Jul 15, 2023
    Authors
    Q-b1t
    License

    https://choosealicense.com/licenses/other/

    Description

    Q-b1t/IMDB-Dataset-of-50K-Movie-Reviews-Backup dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. IMDB dataset (Sentiment analysis) in CSV format

    • kaggle.com
    Updated Nov 28, 2019
    Cite
    Ziqi Yuan (2019). IMDB dataset (Sentiment analysis) in CSV format [Dataset]. https://www.kaggle.com/datasets/columbine/imdb-dataset-sentiment-analysis-in-csv-format
    Dataset updated
    Nov 28, 2019
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Ziqi Yuan
    License

    https://www.worldbank.org/en/about/legal/terms-of-use-for-datasets

    Description

    Context

    A movie review dataset for the NLP task of sentiment analysis.

    Note: all the movie reviews are long (most of them are longer than 200 words).

    Content

    Two columns are used: text (the review of the movie) and label (the sentiment label of the movie review).

  8. IMDB Dataset 2023

    • kaggle.com
    Updated Feb 29, 2024
    Cite
    Adrian Kieżuń (2024). IMDB Dataset 2023 [Dataset]. https://www.kaggle.com/datasets/adriankiezun/imdb-dataset-2023
    Dataset updated
    Feb 29, 2024
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Adrian Kieżuń
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    Dataset containing information about movies that appear on the IMDB website. The data was obtained by web scraping in Python and combined with the repository shared by IMDB. It was preprocessed to include only movies released after 1970 that currently have over 50,000 ratings. Additionally, only movies whose budgets and grosses are denominated in USD were selected, to avoid discrepancies. The dataset contains 3348 observations described by 12 attributes.

    Attributes

    1. id - movie's ID used by the IMDB repository
    2. primaryTitle - title in English
    3. originalTitle - original title in native language
    4. isAdult - parental guidance
    5. runtimeMinutes - total runtime in minutes
    6. genres - genres
    7. averageRating - final rating, based on all the ratings
    8. numVotes - total number of votes (ratings)
    9. budget - total budget in USD
    10. gross - total gross worldwide in USD
    11. release_date - release date, first occurrence
    12. directors - directors
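
    The year and vote filters described above could look roughly like this. It is only a sketch: the records are invented, release_date is assumed to be available as a date object, "released after 1970" is read as a release year of 1971 or later, and the name passes_filters is hypothetical. The USD filter on budget and gross is not shown.

```python
from datetime import date

# Hypothetical records using attribute names from the list above.
# The values are invented for illustration only.
movies = [
    {"primaryTitle": "Example A", "release_date": date(1968, 4, 2), "numVotes": 700000},
    {"primaryTitle": "Example B", "release_date": date(1994, 9, 23), "numVotes": 2500000},
    {"primaryTitle": "Example C", "release_date": date(2015, 6, 1), "numVotes": 12000},
]


def passes_filters(movie, min_year=1971, min_votes=50000):
    """Keep movies released in `min_year` or later with more than `min_votes` ratings."""
    return movie["release_date"].year >= min_year and movie["numVotes"] > min_votes


selected = [m["primaryTitle"] for m in movies if passes_filters(m)]
print(selected)
```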

    LAST UPDATE: 12th November 2023

  9. IMDB-MULTI Dataset

    • paperswithcode.com
    • opendatalab.com
    • +1 more
    Updated Sep 1, 2021
    + more versions
    Cite
    Pinar Yanardag; S. V. N. Vishwanathan (2021). IMDB-MULTI Dataset [Dataset]. https://paperswithcode.com/dataset/imdb-multi
    Dataset updated
    Sep 1, 2021
    Authors
    Pinar Yanardag; S. V. N. Vishwanathan
    Description

    IMDB-MULTI is a relational dataset that consists of a network of 1000 actors or actresses who played roles in movies in IMDB. A node represents an actor or actress, and an edge connects two nodes when they appear in the same movie. In IMDB-MULTI, the edges are collected from three different genres: Comedy, Romance and Sci-Fi.
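
    The graph construction described here (one node per actor or actress, one edge per pair appearing in the same movie) can be sketched as below. The cast lists and the function name build_coappearance_edges are hypothetical; this shows the general technique rather than how the dataset files were actually produced.

```python
from itertools import combinations

# Hypothetical cast lists keyed by movie; the names are placeholders.
casts = {
    "movie_1": ["actor_a", "actor_b", "actor_c"],
    "movie_2": ["actor_b", "actor_d"],
}


def build_coappearance_edges(casts):
    """Return the set of undirected edges between people who share a movie.

    Each edge is stored once as a sorted (name, name) tuple.
    """
    edges = set()
    for cast in casts.values():
        for a, b in combinations(sorted(set(cast)), 2):
            edges.add((a, b))
    return edges


edges = build_coappearance_edges(casts)
print(sorted(edges))
```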

  10. imdb-javanese

    • huggingface.co
    Updated Feb 12, 2022
    Cite
    Wilson Wongso (2022). imdb-javanese [Dataset]. https://huggingface.co/datasets/w11wo/imdb-javanese
    Dataset updated
    Feb 12, 2022
    Authors
    Wilson Wongso
    License

    https://choosealicense.com/licenses/odbl/

    Description

    Large Movie Review Dataset translated to Javanese. This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well. We translated the original IMDB Dataset to Javanese using the multi-lingual MarianMT Transformer model from Helsinki-NLP/opus-mt-en-mul.

  11. IMDb-Movie-Dataset--All-Movies-by-Genre

    • kaggle.com
    Updated Feb 18, 2023
    Cite
    (2023). IMDb-Movie-Dataset--All-Movies-by-Genre [Dataset]. https://www.kaggle.com/datasets/rajugc/imdb-movies-dataset-based-on-genre
    Dataset updated
    Feb 18, 2023
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) (https://creativecommons.org/licenses/by-nc-sa/4.0/)
    License information was derived automatically

    Description

    Context:

    The movie industry is a vast and ever-growing landscape, with countless movies being produced each year. Keeping track of all these movies and their characteristics can be a daunting task for researchers, film enthusiasts, and data scientists alike. That's where a comprehensive dataset that lists all movies and their genre can come in handy.

    Column Description:

    • movie_id - IMDB Movie ID
    • movie_name - Name of the movie
    • year - Release year
    • certificate - Certificate of the movie
    • run_time - Total movie run time
    • genre - Genre of the movie
    • rating - Rating of the movie
    • description - Description of the movie
    • director - Director of the movie
    • director_id - IMDB id of the director
    • star - Star of the movie
    • star_id - IMDB id of the star
    • votes - Number of votes in IMDB website
    • gross(in $) - Gross Box Office of the movie

    Source:

    The primary source for an IMDb all movies dataset based on genre would be IMDb, the world's most popular and authoritative source for movie, TV, and celebrity content. IMDb has an extensive database of movies that is constantly updated with new titles and information.

    Inspiration:

    Creating an IMDb all movies dataset based on genre can provide a wealth of insights and opportunities for analysis. For example, researchers could use the dataset to study trends in movie genres over time or compare the characteristics of different genres. Film enthusiasts could use the dataset to discover new movies in their favorite genres or explore movies outside of their usual comfort zone. Data scientists could use the dataset to build predictive models or recommend movies to users based on their genre preferences. Overall, an IMDb all movies dataset based on genre has the potential to unlock a wealth of knowledge and insights about the movie industry.

  12. PostgreSQL Dump of IMDB Data for JOB Workload

    • dataverse.harvard.edu
    • search.dataone.org
    bin
    Updated Sep 24, 2019
    Cite
    Harvard Dataverse (2019). PostgreSQL Dump of IMDB Data for JOB Workload [Dataset]. http://doi.org/10.7910/DVN/2QYZBT
    Available download formats: bin (1277543282)
    Dataset updated
    Sep 24, 2019
    Dataset provided by
    Harvard Dataverse
    License

    CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
    License information was derived automatically

    Description

    This is a dump generated by pg_dump -Fc of the IMDb data used in the "How Good are Query Optimizers, Really?" paper. PostgreSQL compatible SQL queries and scripts to automatically create a VM with this dataset can be found here: https://git.io/imdb

  13. IMDB movie details dataset

    • crawlfeeds.com
    json, zip
    Updated Apr 6, 2024
    Cite
    Crawl Feeds (2024). IMDB movie details dataset [Dataset]. https://crawlfeeds.com/datasets/imdb-movie-details-dataset
    Available download formats: json, zip
    Dataset updated
    Apr 6, 2024
    Dataset authored and provided by
    Crawl Feeds
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    IMDb is an online database of information related to films, television programs, home videos, video games, and streaming content online. The Crawl Feeds team crawled more than 300K records for research and analysis purposes.

    Contact the Crawl Feeds team to customize the dataset to your needs, such as format changes, data frequency, and adding or removing fields.

  14. IMDB Top 250 Lists and 5000 plus IMDB records

    • data.world
    csv, zip
    Updated Mar 22, 2024
    Cite
    TheMitchWorksPro (2024). IMDB Top 250 Lists and 5000 plus IMDB records [Dataset]. https://data.world/studentoflife/imdb-top-250-lists-and-5000-or-so-data-records
    Available download formats: csv, zip
    Dataset updated
    Mar 22, 2024
    Authors
    TheMitchWorksPro
    Time period covered
    1916 - 2017
    Description

    This dataset is a work in progress. It includes data that was screen scraped using jsonlite and XML libraries in R and an open API through OMDB from the IMDB website. Movie IDs to help gather much of this data come from one or two Kaggle projects. There is a workflow from original cobbled together spreadsheets to the final product with 27 variables and over 5000 observations.

    More detail on this data will be provided later in the project this data was gathered for. Stay tuned ...

  15. IMDB-BINARY Dataset

    • paperswithcode.com
    • opendatalab.com
    Updated May 15, 2021
    + more versions
    Cite
    Pinar Yanardag; S. V. N. Vishwanathan (2021). IMDB-BINARY Dataset [Dataset]. https://paperswithcode.com/dataset/imdb-binary
    Dataset updated
    May 15, 2021
    Authors
    Pinar Yanardag; S. V. N. Vishwanathan
    Description

    IMDB-BINARY is a movie collaboration dataset that consists of the ego-networks of 1,000 actors/actresses who played roles in movies in IMDB. In each graph, nodes represent actors/actresses, and there is an edge between two of them if they appear in the same movie. These graphs are derived from the Action and Romance genres.
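
    For readers unfamiliar with the term, an ego-network is the subgraph around one focal node. The sketch below extracts a 1-hop ego-network from an undirected edge set; the 1-hop radius, the sample edges, and the function name ego_network are assumptions for illustration, since the description does not state how the dataset's graphs were extracted.

```python
def ego_network(edges, ego):
    """Return (nodes, edges) of the 1-hop ego-network around `ego`.

    `edges` is a set of undirected (a, b) tuples. The result contains the ego,
    its direct neighbours, and every edge whose endpoints are both in that set.
    """
    neighbours = {b if a == ego else a for a, b in edges if ego in (a, b)}
    nodes = neighbours | {ego}
    ego_edges = {(a, b) for a, b in edges if a in nodes and b in nodes}
    return nodes, ego_edges


# Hypothetical co-appearance edges between placeholder actor names.
sample_edges = {("a", "b"), ("a", "c"), ("b", "c"), ("c", "d"), ("d", "e")}
nodes, sub_edges = ego_network(sample_edges, "a")
print(sorted(nodes), sorted(sub_edges))
```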

  16. IMDb Large Movie Review Dataset

    • academictorrents.com
    bittorrent
    Updated Oct 20, 2018
    Cite
    Andrew L. Maas et al., 2011 (2018). IMDb Large Movie Review Dataset [Dataset]. https://academictorrents.com/details/fd24bc44d461b10288469e05a64a8344eb079f15
    Available download formats: bittorrent
    Dataset updated
    Oct 20, 2018
    Dataset authored and provided by
    Andrew L. Maas et al., 2011
    License

    https://academictorrents.com/nolicensespecified

    Description

    A dataset for binary sentiment classification containing 25,000 highly polarized movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well.

  17. mini-imdb

    • huggingface.co
    Updated Sep 30, 2022
    Cite
    Daniel Vila (2022). mini-imdb [Dataset]. https://huggingface.co/datasets/dvilasuero/mini-imdb
    Dataset updated
    Sep 30, 2022
    Authors
    Daniel Vila
    Description

    dvilasuero/mini-imdb dataset hosted on Hugging Face and contributed by the HF Datasets community

  18. IMDb Users' Ratings Dataset

    • ieee-dataport.org
    Updated Dec 27, 2020
    Cite
    Vahid Baghi (2020). IMDb Users' Ratings Dataset [Dataset]. http://doi.org/10.21227/br41-bd49
    Dataset updated
    Dec 27, 2020
    Dataset provided by
    Institute of Electrical and Electronics Engineers (http://www.ieee.ro/)
    Authors
    Vahid Baghi
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    This dataset contains 4669820 ratings from 1499238 users to 351109 movies on the imdb.com website. This data is collected from reviews (https://www.imdb.com/review/rw0000001/). Each row in this dataset is as follows: userID, movieID, rating, review date. For example: ur18238764, tt2177461, 9, 22 January 2019
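
    A row in the format shown in this description could be parsed as sketched below. The comma separator and the "day month-name year" date format are inferred from the single example row and may not hold for the actual files; the Rating type and the parse_rating_row function are hypothetical names.

```python
from datetime import date, datetime
from typing import NamedTuple


class Rating(NamedTuple):
    """One rating row: who rated which title, the score, and when."""
    user_id: str
    movie_id: str
    rating: int
    review_date: date


def parse_rating_row(line):
    """Parse 'userID, movieID, rating, review date' into a Rating.

    Assumes comma-separated fields and a date such as '22 January 2019'.
    """
    user_id, movie_id, rating, review_date = [field.strip() for field in line.split(",")]
    parsed_date = datetime.strptime(review_date, "%d %B %Y").date()
    return Rating(user_id, movie_id, int(rating), parsed_date)


row = parse_rating_row("ur18238764, tt2177461, 9, 22 January 2019")
print(row)
```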

  19. IMDb-Face Dataset

    • paperswithcode.com
    Updated Jul 30, 2018
    + more versions
    Cite
    Fei Wang; Liren Chen; Cheng Li; Shiyao Huang; Yanjie Chen; Chen Qian; Chen Change Loy (2018). IMDb-Face Dataset [Dataset]. https://paperswithcode.com/dataset/imdb-face
    Dataset updated
    Jul 30, 2018
    Authors
    Fei Wang; Liren Chen; Cheng Li; Shiyao Huang; Yanjie Chen; Chen Qian; Chen Change Loy
    Description

    IMDb-Face is a large-scale noise-controlled dataset for face recognition research. The dataset contains about 1.7 million faces of 59k identities, manually cleaned from 2.0 million raw images. All images are obtained from the IMDb website.

  20. Sentiment analysis in Galaxy with IMDB movie review dataset

    • zenodo.org
    tsv
    Updated Aug 4, 2022
    Cite
    Kaivan Kamali; Kaivan Kamali (2022). Sentiment analysis in Galaxy with IMDB movie review dataset [Dataset]. http://doi.org/10.5281/zenodo.4477881
    Available download formats: tsv
    Dataset updated
    Aug 4, 2022
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Kaivan Kamali; Kaivan Kamali
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    IMDB movie review sentiment classification dataset (Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Y. Ng, and Christopher Potts. (2011). Learning Word Vectors for Sentiment Analysis. The 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011)). For more information please refer to: https://ai.stanford.edu/~amaas/data/sentiment/

    The IMDB dataset was modified as follows to prepare it for use in a Galaxy Training Tutorial (https://training.galaxyproject.org/):

    • The top 50 words are excluded (mostly stop words).
    • The next 10,000 top words are included.
    • Reviews are limited to 500 words (longer reviews are trimmed and shorter reviews are padded).
    • 25,000 reviews each are used for training and testing.
    • Files are in TSV (tab-separated values) format to be consumed by Galaxy (www.usegalaxy.org).
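
    The preprocessing steps just described can be sketched on a toy corpus as follows. Several details are assumptions the description does not settle: out-of-vocabulary words are dropped, index 0 is used for padding, and both trimming and padding happen at the end of a review. The function names build_vocab and encode are hypothetical.

```python
from collections import Counter


def build_vocab(tokenized_reviews, skip_top=50, vocab_size=10000):
    """Rank words by frequency, skip the `skip_top` most common, keep the next `vocab_size`.

    Index 0 is reserved for padding, so word indices start at 1.
    """
    counts = Counter(word for review in tokenized_reviews for word in review)
    ranked = [word for word, _ in counts.most_common()]
    kept = ranked[skip_top:skip_top + vocab_size]
    return {word: index for index, word in enumerate(kept, start=1)}


def encode(review, vocab, max_len=500, pad=0):
    """Convert a tokenized review to exactly `max_len` integer ids.

    Words outside the vocabulary are dropped (an assumption); longer reviews
    are trimmed at the end and shorter ones are padded at the end.
    """
    ids = [vocab[word] for word in review if word in vocab][:max_len]
    return ids + [pad] * (max_len - len(ids))


# Toy corpus with small limits so the effect is visible.
toy_reviews = [
    ["the", "the", "the", "film", "film", "great"],
    ["the", "film", "bad"],
]
toy_vocab = build_vocab(toy_reviews, skip_top=1, vocab_size=2)
print(toy_vocab)
print(encode(toy_reviews[0], toy_vocab, max_len=4))
```

    With the defaults (skip_top=50, vocab_size=10000, max_len=500) the same two functions follow the numbers given in the list above.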

IMDB Movies Dataset

Top 1000 Movies by IMDB Rating

490 scholarly articles cite this dataset, according to Google Scholar.