100+ datasets found
  1. h

    Data from: imdb

    • huggingface.co
    Updated Aug 3, 2003
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford NLP (2003). imdb [Dataset]. https://huggingface.co/datasets/stanfordnlp/imdb
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 3, 2003
    Dataset authored and provided by
    Stanford NLP
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    Dataset Card for "imdb"

      Dataset Summary
    

    Large Movie Review Dataset. This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well.

      Supported Tasks and Leaderboards
    

    More Information Needed

      Languages
    

    More Information Needed… See the full description on the dataset page: https://huggingface.co/datasets/stanfordnlp/imdb.

  2. h

    Data from: imdb

    • huggingface.co
    Updated Sep 5, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    scikit-learn (2022). imdb [Dataset]. https://huggingface.co/datasets/scikit-learn/imdb
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 5, 2022
    Dataset authored and provided by
    scikit-learn
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    This is the sentiment analysis dataset based on IMDB reviews initially released by Stanford University. This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well. Raw text and already processed bag of words formats are provided. See the README file contained in the release for more… See the full description on the dataset page: https://huggingface.co/datasets/scikit-learn/imdb.

  3. IMDB Review Dataset

    • kaggle.com
    zip
    Updated Jan 16, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Utathya Ghosh (2018). IMDB Review Dataset [Dataset]. https://www.kaggle.com/datasets/utathya/imdb-review-dataset
    Explore at:
    zip(52989376 bytes)Available download formats
    Dataset updated
    Jan 16, 2018
    Authors
    Utathya Ghosh
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    Dataset

    This dataset was created by Utathya Ghosh

    Released under Database: Open Database, Contents: Database Contents

    Contents

  4. i

    IMDb Movie Reviews Dataset

    • ieee-dataport.org
    Updated Aug 2, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aditya Pal (2022). IMDb Movie Reviews Dataset [Dataset]. http://doi.org/10.21227/zm1y-b270
    Explore at:
    Dataset updated
    Aug 2, 2022
    Dataset provided by
    IEEE Dataport
    Authors
    Aditya Pal
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains nearly 1 Million unique movie reviews from 1150 different IMDb movies spread across 17 IMDb genres - Action, Adventure, Animation, Biography, Comedy, Crime, Drama, Fantasy, History, Horror, Music, Mystery, Romance, Sci-Fi, Sport, Thriller and War. The dataset also contains movie metadata such as date of release of the movie, run length, IMDb rating, movie rating (PG-13, R, etc), number of IMDb raters, and number of reviews per movie.

  5. IMDB Dataset - Sentiment Analysis

    • kaggle.com
    Updated Dec 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bhavik Jikadara (2023). IMDB Dataset - Sentiment Analysis [Dataset]. https://www.kaggle.com/datasets/bhavikjikadara/imdb-dataset-sentiment-analysis
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 19, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Bhavik Jikadara
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The IMDb dataset is a collection of 50,000 reviews from the Internet Movie Database (IMDb). The reviews are labeled as either positive or negative and are split into two sets of 25,000 reviews for training and testing. Each set contains an equal number of positive and negative reviews.

    The IMDb dataset is a binary sentiment analysis dataset for natural language processing or text analytics. It contains more data than previous benchmark datasets.

    IMDb is a rich source of film data that includes cast and crew lists, movie release dates, box office information, plot summaries, trailers, actor and director biographies, and other trivia. Information on IMDb comes from a variety of sources, such as filmmakers, film studios, on-screen credits, and other official sources.

  6. h

    IMDb-Dataset

    • huggingface.co
    Updated Nov 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sahil (2024). IMDb-Dataset [Dataset]. https://huggingface.co/datasets/labofsahil/IMDb-Dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 9, 2024
    Dataset authored and provided by
    Sahil
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    title.akas.csv

    titleId (string) - a tconst, an alphanumeric unique identifier of the title ordering (integer) – a number to uniquely identify rows for a given titleId title (string) – the localized title region (string) - the region for this version of the title language (string) - the language of the title types (array) - Enumerated set of attributes for this alternative title. One or more of the following: "alternative", "dvd", "festival", "tv", "video", "working", "original"… See the full description on the dataset page: https://huggingface.co/datasets/labofsahil/IMDb-Dataset.

  7. ImDb Movie Reviews Dataset

    • kaggle.com
    zip
    Updated Sep 12, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nidhi Mantri (2019). ImDb Movie Reviews Dataset [Dataset]. https://www.kaggle.com/datasets/mantri7/imdb-movie-reviews-dataset
    Explore at:
    zip(26921499 bytes)Available download formats
    Dataset updated
    Sep 12, 2019
    Authors
    Nidhi Mantri
    Description

    Context

    This is the IMDB dataset exactly same as ImDb Movie Reviews Dataset, contains the movie reviews.

    Content

    The real dataset contains text files for training and testing purpose, but I created two csv files from those text files to ease the task ✌️ . Now you only need to download and apply your model. Each file contains 25000 reviews with label 0 for negative and 1 for positive. Each file has two columns 0 and 1, 0 represents reviews and 1 represents labels.

  8. h

    Data from: imdb

    • huggingface.co
    Updated Nov 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2024). imdb [Dataset]. https://huggingface.co/datasets/mteb/imdb
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 20, 2024
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    Description

    mteb/imdb dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. P

    IMDB-MULTI Dataset

    • paperswithcode.com
    • opendatalab.com
    Updated Sep 1, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pinar Yanardag; S. V. N. Vishwanathan (2021). IMDB-MULTI Dataset [Dataset]. https://paperswithcode.com/dataset/imdb-multi
    Explore at:
    Dataset updated
    Sep 1, 2021
    Authors
    Pinar Yanardag; S. V. N. Vishwanathan
    Description

    IMDB-MULTI is a relational dataset that consists of a network of 1000 actors or actresses who played roles in movies in IMDB. A node represents an actor or actress, and an edge connects two nodes when they appear in the same movie. In IMDB-MULTI, the edges are collected from three different genres: Comedy, Romance and Sci-Fi.

  10. h

    IMDB-BINARY

    • huggingface.co
    • modeldatabase.com
    Updated Mar 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Graph Datasets (2023). IMDB-BINARY [Dataset]. https://huggingface.co/datasets/graphs-datasets/IMDB-BINARY
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 13, 2023
    Dataset authored and provided by
    Graph Datasets
    License

    https://choosealicense.com/licenses/unknown/https://choosealicense.com/licenses/unknown/

    Description

    Dataset Card for IMDB-BINARY (IMDb-B)

      Dataset Summary
    

    The IMDb-B dataset is "a movie collaboration dataset that consists of the ego-networks of 1,000 actors/actresses who played roles in movies in IMDB. In each graph, nodes represent actors/actress, and there is an edge between them if they appear in the same movie. These graphs are derived from the Action and Romance genres".

      Supported Tasks and Leaderboards
    

    IMDb-B should be used for graph… See the full description on the dataset page: https://huggingface.co/datasets/graphs-datasets/IMDB-BINARY.

  11. c

    IMDB movie details dataset

    • crawlfeeds.com
    csv, zip
    Updated Nov 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Crawl Feeds (2024). IMDB movie details dataset [Dataset]. https://crawlfeeds.com/datasets/imdb-movie-details-dataset
    Explore at:
    zip, csvAvailable download formats
    Dataset updated
    Nov 28, 2024
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy

    Description

    The IMDB Movie Details Dataset is a comprehensive collection of data about movies, TV shows, and streaming content listed on IMDB. It includes detailed information such as titles, release years, genres, cast, crew, ratings, and more. This dataset is ideal for data analysis, machine learning projects, and insights into the film and entertainment industry. Perfect for developers, researchers, and movie enthusiasts looking to explore trends and patterns in the world of cinema.

  12. h

    imdb-genres

    • huggingface.co
    Updated Sep 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jack Quigley (2024). imdb-genres [Dataset]. https://huggingface.co/datasets/jquigl/imdb-genres
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 18, 2024
    Authors
    Jack Quigley
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Dataset Card for IMDb Movie Dataset: All Movies by Genre

      Dataset Summary
    

    This dataset is an adapted version of "IMDb Movie Dataset: All Movies by Genre" found at: https://www.kaggle.com/datasets/rajugc/imdb-movies-dataset-based-on-genre?select=history.csv. Within the dataset, the movie title and year columns were combined, the genre was extracted from the seperate csv files, the pre-existing genre column was renamed to expanded-genres, any movies missing a… See the full description on the dataset page: https://huggingface.co/datasets/jquigl/imdb-genres.

  13. Z

    Sentiment analysis in Galaxy with IMDB movie review dataset

    • data.niaid.nih.gov
    • zenodo.org
    Updated Aug 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kaivan Kamali (2022). Sentiment analysis in Galaxy with IMDB movie review dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4477880
    Explore at:
    Dataset updated
    Aug 4, 2022
    Dataset authored and provided by
    Kaivan Kamali
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    IMDB movie review sentiment classification dataset (Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Y. Ng, and Christopher Potts. (2011). Learning Word Vectors for Sentiment Analysis. The 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011)). For more information please refer to: https://ai.stanford.edu/~amaas/data/sentiment/

    The IMDB dataset was modified as follows to prepare it for use in a Galaxy Training Tutorial (https://training.galaxyproject.org/):

    The top 50 words are excluded (mostly stop words). Included the next 10,000 top words. Reviews are limited to 500 words max (Longer reviews trimmed and shorter reviews are padded). 25,000 reviews are used for training and testing each. Files are in tsv (tab separated value) format to be consumed by Galaxy (www.usegalaxy.org).

  14. h

    IMDB-test

    • huggingface.co
    Updated Apr 28, 2002
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FAR AI (2002). IMDB-test [Dataset]. https://huggingface.co/datasets/AlignmentResearch/IMDB-test
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 28, 2002
    Dataset authored and provided by
    FAR AI
    Description

    AlignmentResearch/IMDB-test dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. P

    IMDB-BINARY Dataset

    • paperswithcode.com
    • opendatalab.com
    Updated Jun 8, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pinar Yanardag; S. V. N. Vishwanathan (2021). IMDB-BINARY Dataset [Dataset]. https://paperswithcode.com/dataset/imdb-binary
    Explore at:
    Dataset updated
    Jun 8, 2021
    Authors
    Pinar Yanardag; S. V. N. Vishwanathan
    Description

    IMDB-BINARY is a movie collaboration dataset that consists of the ego-networks of 1,000 actors/actresses who played roles in movies in IMDB. In each graph, nodes represent actors/actress, and there is an edge between them if they appear in the same movie. These graphs are derived from the Action and Romance genres.

  16. n

    Data from: IMDB

    • networkrepository.com
    csv
    Updated Aug 18, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Network Data Repository (2018). IMDB [Dataset]. https://networkrepository.com/ca-IMDB.php
    Explore at:
    csvAvailable download formats
    Dataset updated
    Aug 18, 2018
    Dataset authored and provided by
    Network Data Repository
    License

    https://networkrepository.com/policy.phphttps://networkrepository.com/policy.php

    Description

    IMDB movie/actor network - IMDB movie/actor network, www.imdb.com

  17. P

    IMDB-Clean Dataset

    • paperswithcode.com
    • opendatalab.com
    Updated Mar 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yiming Lin; Jie Shen; Yujiang Wang; Maja Pantic (2022). IMDB-Clean Dataset [Dataset]. https://paperswithcode.com/dataset/imdb-clean
    Explore at:
    Dataset updated
    Mar 4, 2022
    Authors
    Yiming Lin; Jie Shen; Yujiang Wang; Maja Pantic
    Description

    We have cleaned the noisy IMDB-WIKI dataset using a constrained clustering method, resulting this new benchmark for in-the-wild age estimation. The annotations also allow this dataset to use for some other tasks, like gender classification and face recognition/verification. For more details, please refer to our FPAge paper.

  18. Data from: IMDB dataset

    • kaggle.com
    zip
    Updated Jul 16, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sharad Goel (2019). IMDB dataset [Dataset]. https://www.kaggle.com/datasets/sharad1501/imdb-dataset
    Explore at:
    zip(137174 bytes)Available download formats
    Dataset updated
    Jul 16, 2019
    Authors
    Sharad Goel
    Description

    Dataset

    This dataset was created by Sharad Goel

    Contents

  19. imdb-extensive-dataset

    • kaggle.com
    zip
    Updated Aug 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    simhyunsu (2022). imdb-extensive-dataset [Dataset]. https://www.kaggle.com/datasets/simhyunsu/imdbextensivedataset
    Explore at:
    zip(28164287 bytes)Available download formats
    Dataset updated
    Aug 4, 2022
    Authors
    simhyunsu
    Description

    Dataset

    This dataset was created by simhyunsu

    Contents

  20. h

    Data from: imdb

    • huggingface.co
    Updated Dec 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    No (2024). imdb [Dataset]. https://huggingface.co/datasets/NoNONONONOO/imdb
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 2, 2024
    Authors
    No
    Description

    NoNONONONOO/imdb dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Stanford NLP (2003). imdb [Dataset]. https://huggingface.co/datasets/stanfordnlp/imdb

Data from: imdb

IMDB

stanfordnlp/imdb

Related Article
Explore at:
10 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 3, 2003
Dataset authored and provided by
Stanford NLP
License

https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

Description

Dataset Card for "imdb"

  Dataset Summary

Large Movie Review Dataset. This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well.

  Supported Tasks and Leaderboards

More Information Needed

  Languages

More Information Needed… See the full description on the dataset page: https://huggingface.co/datasets/stanfordnlp/imdb.

Search
Clear search
Close search
Google apps
Main menu