100+ datasets found
  1. P

    MovieLens Dataset

    • paperswithcode.com
    Updated Oct 25, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    F. Maxwell Harper; Joseph A. Konstan (2021). MovieLens Dataset [Dataset]. https://paperswithcode.com/dataset/movielens
    Explore at:
    Dataset updated
    Oct 25, 2021
    Authors
    F. Maxwell Harper; Joseph A. Konstan
    Description

    The MovieLens datasets, first released in 1998, describe people’s expressed preferences for movies. These preferences take the form of tuples, each the result of a person expressing a preference (a 0-5 star rating) for a movie at a particular time. These preferences were entered by way of the MovieLens web site1 — a recommender system that asks its users to give movie ratings in order to receive personalized movie recommendations.

  2. Movie Recommendation Dataset

    • kaggle.com
    Updated Jul 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abhay Ayare (2024). Movie Recommendation Dataset [Dataset]. https://www.kaggle.com/datasets/abhayayare/movie-recommendation-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 27, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Abhay Ayare
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset is a synthetic collection of data for movies, users, and ratings. It is intended for use in developing and testing recommendation algorithms, particularly those used in movie recommendation systems. The dataset includes:

    • Movies: Contains information about 100 movies, including their titles, genres, and release years.
    • Users: Contains information about 50 users, including their user IDs and names.
    • Ratings: Contains 500 ratings given by users to movies, with rating values ranging from 1 to 5
  3. H

    Movie and Music recommendation dataset and model codes

    • dataverse.harvard.edu
    Updated Feb 29, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NA Yang (2020). Movie and Music recommendation dataset and model codes [Dataset]. http://doi.org/10.7910/DVN/A5TLOZ
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 29, 2020
    Dataset provided by
    Harvard Dataverse
    Authors
    NA Yang
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    These datasets include Douban movies and NetEase songs with attributes such as actors, directors, singers, albums and so on. Furthermore, the source code of ACAM model is also provided, which is a feature-level co-attention based recommendation model.

  4. Z

    Data from: Video Recommendations Based on Visual Features Extracted with...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jun 2, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kvifte, Tord (2021). Video Recommendations Based on Visual Features Extracted with Deep Learning [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4889728
    Explore at:
    Dataset updated
    Jun 2, 2021
    Dataset authored and provided by
    Kvifte, Tord
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset contains visual features extracted from 12875 movie trailers. The visual features are extracted from key-frames of movie trailers with the VGG-19 CNN, pre-trained on ImageNet.

    Movies in the datset are identified by their MovieLens movieId.

    Features_sparse.zip contains the 4096-dimensional feature vectors of each key-frame from every movie.

    Visual labels.zip contains the1000 dimensional label feature vectors of each key-frame from every movie.

    DeepCineProp-f.p has combined the label features of each movie into a vector space model with the use of tf-idf.

    CineSub.p contains the subtitles of each movie represented in a vector space model pre-processed with various nlp techniques and produced using tf-idf.

    Abstract:

    When a movie is uploaded to a movie Recommender System (e.g., YouTube), the system can exploit various forms of descriptive features (e.g., tags and genre) in order to generate personalized recommendation for users. However, there are situations where the descriptive features are missing or very limited and the system may fail to include such a movie in the recommendation list, known as Cold-start problem. This thesis investigates recommendation based on a novel form of content features, extracted from movies, in order to generate recommendation for users. Such features represent the visual aspects of movies, based on Deep Learning models, and hence, do not require any human annotation when extracted. The proposed technique has been evaluated in both offline and online evaluations using a large dataset of movies. The online evaluation has been carried out in a evaluation framework developed for this thesis. Results from the offline and online evaluation (N=150) show that automatically extracted visual features can mitigate the cold-start problem by generating recommendation with a superior quality compared to different baselines, including recommendation based on human-annotated features. The results also point to subtitles as a high-quality future source of automatically extracted features.

  5. g

    MovieLens 1M

    • grouplens.org
    • meilu1.jpshuntong.com
    • +1more
    Updated Mar 19, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). MovieLens 1M [Dataset]. https://grouplens.org/datasets/movielens/1m/
    Explore at:
    Dataset updated
    Mar 19, 2016
    Description

    Stable benchmark dataset. 1 million ratings from 6000 users on 4000 movies. Released 2/2003.

  6. IMDB Dataset For Machine Learning

    • kaggle.com
    Updated Sep 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    KHUSHI YADAV (2023). IMDB Dataset For Machine Learning [Dataset]. https://www.kaggle.com/datasets/khushiyadav2022/imdb-dataset-for-machine-learning
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 25, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    KHUSHI YADAV
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    "Movie Recommendation on the IMDB Dataset: A Journey into Machine Learning" is an exciting project focused on leveraging the IMDB Dataset for developing an advanced movie recommendation system. This project aims to explore the vast potential of machine learning techniques in providing personalized movie recommendations to users.

    The IMDB Dataset, comprising a wealth of movie information including genres, ratings, and user reviews, serves as the foundation for this project. By harnessing the power of machine learning algorithms and data analysis, the project seeks to build a recommendation system that can accurately suggest movies tailored to each individual's preferences.

  7. h

    movie-recommendation-system

    • huggingface.co
    Updated Jun 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vatsal Gaur (2025). movie-recommendation-system [Dataset]. https://huggingface.co/datasets/vatsal1704/movie-recommendation-system
    Explore at:
    Dataset updated
    Jun 14, 2025
    Authors
    Vatsal Gaur
    Description

    vatsal1704/movie-recommendation-system dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. f

    Summary of the MovieLens 1M data set.

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xibin Wang; Fengji Luo; Ying Qian; Gianluca Ranzi (2023). Summary of the MovieLens 1M data set. [Dataset]. http://doi.org/10.1371/journal.pone.0165868.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xibin Wang; Fengji Luo; Ying Qian; Gianluca Ranzi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Summary of the MovieLens 1M data set.

  9. movie recommendation system

    • kaggle.com
    Updated Jan 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tanisha Saggar765 (2023). movie recommendation system [Dataset]. https://www.kaggle.com/datasets/tanishasaggar765/movie-recommendation-system
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 26, 2023
    Dataset provided by
    Kaggle
    Authors
    Tanisha Saggar765
    Description

    Dataset

    This dataset was created by Tanisha Saggar765

    Contents

  10. Movielens DataSet

    • figshare.com
    zip
    Updated Dec 7, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tracy Dong (2017). Movielens DataSet [Dataset]. http://doi.org/10.6084/m9.figshare.5677750.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 7, 2017
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Tracy Dong
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Movielens is a movie recommendation dataset widely used for benchmarking process. 385There are nearly 100,000 hard ratings on 19 different types of movies (Action, Comedy 386and so on).

  11. h

    Recommendation-System

    • huggingface.co
    Updated Aug 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sayan Ghosh (2024). Recommendation-System [Dataset]. https://huggingface.co/datasets/DivineSayan/Recommendation-System
    Explore at:
    Dataset updated
    Aug 9, 2024
    Authors
    Sayan Ghosh
    Description

    Movie Recommender Dataset

    This dataset contains the pickled files for a Streamlit-based movie recommendation system.

    Movies.pkl: Preprocessed movie metadata and tags Similarity.pkl: Cosine similarity matrix

    Uploaded for use in Hugging Face Spaces.

  12. MA14KD [AGGREGATED] Dataset: Visual Attraction of Movie Trailers

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Jan 24, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mehdi Elahi; Mehdi Elahi; Farshad Bakhshandegan Moghaddam; Reza Hosseini; Christoph Trattner; Marko Tkalcic; Farshad Bakhshandegan Moghaddam; Reza Hosseini; Christoph Trattner; Marko Tkalcic (2020). MA14KD [AGGREGATED] Dataset: Visual Attraction of Movie Trailers [Dataset]. http://doi.org/10.5281/zenodo.3266236
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Mehdi Elahi; Mehdi Elahi; Farshad Bakhshandegan Moghaddam; Reza Hosseini; Christoph Trattner; Marko Tkalcic; Farshad Bakhshandegan Moghaddam; Reza Hosseini; Christoph Trattner; Marko Tkalcic
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    MA14KD (Movie Attract 14K Dataset) provides a set of 181 aggregated VISUAL features extracted from 14074 movie and tv series trailers. The movie IDs are in agreement with the movie IDs provided by another rating dataset that also contains movie genres and tags (see the description within the file). More details can be found in the following publication:

    Farshad B. Moghaddam, Mehdi Elahi, Reza Hosseini, Christoph Trattner, Marko Tkalcic, Predicting Movie Popularity and Ratings with Visual Features, IEEE SMAP’19, 9-10 June 2019, Larnaca, Cyprus

  13. Enhancing MovieLens Dataset: Enriching Recommendations with Audio...

    • zenodo.org
    • data.niaid.nih.gov
    bin, zip
    Updated Jun 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Victor Botti-Cebria; Victor Botti-Cebria; Laura Sebastia; Laura Sebastia; Vanessa Moscardo; Vanessa Moscardo (2023). Enhancing MovieLens Dataset: Enriching Recommendations with Audio Information, Transcriptions, and Metadata [Dataset]. http://doi.org/10.5281/zenodo.8037433
    Explore at:
    zip, binAvailable download formats
    Dataset updated
    Jun 16, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Victor Botti-Cebria; Victor Botti-Cebria; Laura Sebastia; Laura Sebastia; Vanessa Moscardo; Vanessa Moscardo
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Nowadays, there are lots of datasets available for training and experimentation in the field of recommender systems. Specifically, in the recommendation of audiovisual content, the MovieLens dataset is a prominent example. It is focused on the user-item relationship, providing actual interaction data between users and movies. However, although movies can be described with several characteristics, this dataset only offers limited information about the movie genres.

    In this work, we propose enriching the MovieLens dataset by incorporating metadata available on the web (such as cast, description, keywords, etc.) and movie trailers. By leveraging the trailers, we extract audio information and generate transcriptions for each trailer, introducing a crucial textual dimension to the dataset. The audio information was extracted by the waveform and frequency analysis, followed by the application of dimensionality reduction techniques. For the transcription generation, the deep learning model Whisper was used. Finally, metadata was obtained from TMDB, and the BERT model was applied to extract embeddings.

    These additional attributes enrich the original dataset, providing deeper and more precise analysis. Then, the use of this extended and enhanced dataset could drive significant advancements in recommendation systems, enhancing user experiences by providing more relevant and tailored movie recommendations based on their tastes and preferences.

  14. h

    movie-recommendation-queries

    • huggingface.co
    Updated Jul 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MHosein mirzaaei (2024). movie-recommendation-queries [Dataset]. https://huggingface.co/datasets/MHMirzaei/movie-recommendation-queries
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 6, 2024
    Authors
    MHosein mirzaaei
    Description

    MHMirzaei/movie-recommendation-queries dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. S

    Recommended Algorithm Experiment Data Set

    • scidb.cn
    Updated Mar 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    liu rui (2023). Recommended Algorithm Experiment Data Set [Dataset]. http://doi.org/10.57760/sciencedb.07654
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 8, 2023
    Dataset provided by
    Science Data Bank
    Authors
    liu rui
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    This data set is user historical viewing record data crawled from Douban platform using crawler technology, including 27819 scoring data of 198 users, with a sparsity of 97.8%. The data set includes not only the basic attribute information of the movie, but also the user's interest value of short-term interest and long-term interest and the score after resetting.

  16. M

    Movie Rating Sites Report

    • marketreportanalytics.com
    doc, pdf, ppt
    Updated Apr 10, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Report Analytics (2025). Movie Rating Sites Report [Dataset]. https://www.marketreportanalytics.com/reports/movie-rating-sites-75754
    Explore at:
    pdf, ppt, docAvailable download formats
    Dataset updated
    Apr 10, 2025
    Dataset authored and provided by
    Market Report Analytics
    License

    https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global movie rating sites market is experiencing robust growth, driven by the increasing consumption of online streaming services and a surge in user-generated content. The market's expansion is fueled by several key factors. Firstly, the rising popularity of streaming platforms like Netflix, Hulu, and Amazon Prime Video has led to a greater demand for reliable movie rating and review information. Users rely on these sites to make informed decisions about which movies to watch, enhancing their overall viewing experience. Secondly, the proliferation of social media and online communities focused on film discussion fosters engagement with movie rating platforms, creating a network effect that increases usage and influence. The segmentation by application (movie promotion, research, audience choice) and type (user ratings, professional ratings) indicates a diverse market landscape with opportunities for both user-driven and expert-curated content. While established players like Rotten Tomatoes and IMDb dominate, newer platforms are emerging, offering specialized features and niche audiences. Geographic expansion, particularly in regions with rapidly growing internet penetration and a rising middle class, presents significant growth potential. However, challenges remain, including the need to manage fake reviews and maintain data accuracy to retain user trust. Furthermore, competition from within the streaming platforms themselves, which often integrate their own rating systems, presents an ongoing challenge. Despite these challenges, the market is projected for continued growth. A conservative estimate, considering a global CAGR of 15% (a reasonable figure based on the growth of the streaming industry and online movie engagement), predicts substantial market expansion over the forecast period (2025-2033). This growth will be driven by technological advancements that enhance user experience and the integration of AI-driven recommendation systems within movie rating platforms. The market is ripe for innovation, with opportunities for personalized recommendation engines and the incorporation of data analytics to provide more insightful reviews and audience sentiment analysis. The competitive landscape will likely see consolidation and further specialization, with platforms focusing on specific niches or geographical regions to gain a competitive edge.

  17. T

    movielens

    • tensorflow.org
    • opendatalab.com
    • +1more
    Updated Jul 8, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). movielens [Dataset]. https://www.tensorflow.org/datasets/catalog/movielens
    Explore at:
    Dataset updated
    Jul 8, 2020
    Description

    This dataset contains a set of movie ratings from the MovieLens website, a movie recommendation service. This dataset was collected and maintained by GroupLens, a research group at the University of Minnesota. There are 5 versions included: "25m", "latest-small", "100k", "1m", "20m". In all datasets, the movies data and ratings data are joined on "movieId". The 25m dataset, latest-small dataset, and 20m dataset contain only movie data and rating data. The 1m dataset and 100k dataset contain demographic data in addition to movie and rating data.

    • "25m": This is the latest stable version of the MovieLens dataset. It is recommended for research purposes.
    • "latest-small": This is a small subset of the latest version of the MovieLens dataset. It is changed and updated over time by GroupLens.
    • "100k": This is the oldest version of the MovieLens datasets. It is a small dataset with demographic data.
    • "1m": This is the largest MovieLens dataset that contains demographic data.
    • "20m": This is one of the most used MovieLens datasets in academic papers along with the 1m dataset.

    For each version, users can view either only the movies data by adding the "-movies" suffix (e.g. "25m-movies") or the ratings data joined with the movies data (and users data in the 1m and 100k datasets) by adding the "-ratings" suffix (e.g. "25m-ratings").

    The features below are included in all versions with the "-ratings" suffix.

    • "movie_id": a unique identifier of the rated movie
    • "movie_title": the title of the rated movie with the release year in parentheses
    • "movie_genres": a sequence of genres to which the rated movie belongs
    • "user_id": a unique identifier of the user who made the rating
    • "user_rating": the score of the rating on a five-star scale
    • "timestamp": the timestamp of the ratings, represented in seconds since midnight Coordinated Universal Time (UTC) of January 1, 1970

    The "100k-ratings" and "1m-ratings" versions in addition include the following demographic features.

    • "user_gender": gender of the user who made the rating; a true value corresponds to male
    • "bucketized_user_age": bucketized age values of the user who made the rating, the values and the corresponding ranges are:
      • 1: "Under 18"
      • 18: "18-24"
      • 25: "25-34"
      • 35: "35-44"
      • 45: "45-49"
      • 50: "50-55"
      • 56: "56+"
    • "user_occupation_label": the occupation of the user who made the rating represented by an integer-encoded label; labels are preprocessed to be consistent across different versions
    • "user_occupation_text": the occupation of the user who made the rating in the original string; different versions can have different set of raw text labels
    • "user_zip_code": the zip code of the user who made the rating

    In addition, the "100k-ratings" dataset would also have a feature "raw_user_age" which is the exact ages of the users who made the rating

    Datasets with the "-movies" suffix contain only "movie_id", "movie_title", and "movie_genres" features.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('movielens', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

  18. Movie Movie Database: With Imdb ratings and Genre

    • kaggle.com
    Updated Jan 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Orchid7 (2025). Movie Movie Database: With Imdb ratings and Genre [Dataset]. https://www.kaggle.com/datasets/ochid7/a-dataset-of-movie-information-and-rating/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 31, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Orchid7
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    The dataset has a total of 1000top English movies. Use it for building a movie recommendation system or classification based on genre, 1. Box office prediction: Predict movie revenue based on factors like budget, genre, and release date. 2. Rating prediction: Predict movie ratings based on attributes like genre, director, and cast.

  19. P

    Genre2Movies Dataset

    • paperswithcode.com
    Updated Jun 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shib Dasgupta; Andrew McCallum; Steffen Rendle; Li Zhang (2023). Genre2Movies Dataset [Dataset]. https://paperswithcode.com/dataset/genre2movies
    Explore at:
    Dataset updated
    Jun 6, 2023
    Authors
    Shib Dasgupta; Andrew McCallum; Steffen Rendle; Li Zhang
    Description

    Genre annotations for movies The file genre2movies.csv contains genre-movie tuples based on Wikidata annotations (https://www.wikidata.org/).

    Data Each line in genre2movies.csv represents one genre-movie tuple. The first entry is the genre. The second entry of each line is the movie name. There are 83,670 genre-movie tuples. Joining with the Movielens 20M dataset

    The movies considered are from the Movielens 20M corpus: https://grouplens.org/datasets/movielens/20m/ The movie names in genre2movies.csv match the movie 'titles' in Movielens 20M.

    Compositions The directory "compositions" contains movies assigned to compositions of genres. The compositions are of the form: "genre A and genre B", "genre A and not genre B", "genre A and genre B and genre C", "genre A and genre B and not genre C". These assignments have been automatically generated from genre2movies.csv. We try to generate genre-compositions that are useful, e.g., for a "genre A and genre B" composition we ensure that genre B is not a subgenre of genre A, because an interesection of a superset with a subset is identical to the subset and does not form a new concept.

  20. c

    IMDB movie details dataset

    • crawlfeeds.com
    csv, zip
    Updated Jul 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Crawl Feeds (2025). IMDB movie details dataset [Dataset]. https://crawlfeeds.com/datasets/imdb-movie-details-dataset
    Explore at:
    zip, csvAvailable download formats
    Dataset updated
    Jul 5, 2025
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy

    Description
    The IMDB Movie Details Dataset is a comprehensive collection of movie datasets that offers a treasure trove of information about movies, TV shows, and streaming content listed on IMDB. This dataset includes detailed data such as titles, release years, genres, cast, crew, ratings, and more, making it a go-to resource for film and entertainment enthusiasts. Ideal for data analysis, IMDB movie dataset applications span machine learning projects, predictive modeling, and insights into industry trends.
    Researchers can explore patterns in movie ratings and genre popularity, while developers can use the dataset to build recommendation systems or applications. Movie buffs can dive deep into historical and contemporary trends in the world of cinema. This dataset not only supports academic and professional pursuits but also opens doors for creative projects in storytelling, content creation, and audience engagement. Whether you’re a developer, researcher, or film enthusiast, the IMDB movie dataset is a powerful tool for uncovering trends and gaining deeper insights into the evolving entertainment landscape.
Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
F. Maxwell Harper; Joseph A. Konstan (2021). MovieLens Dataset [Dataset]. https://paperswithcode.com/dataset/movielens

MovieLens Dataset

Explore at:
2 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Oct 25, 2021
Authors
F. Maxwell Harper; Joseph A. Konstan
Description

The MovieLens datasets, first released in 1998, describe people’s expressed preferences for movies. These preferences take the form of tuples, each the result of a person expressing a preference (a 0-5 star rating) for a movie at a particular time. These preferences were entered by way of the MovieLens web site1 — a recommender system that asks its users to give movie ratings in order to receive personalized movie recommendations.

Search
Clear search
Close search
Google apps
Main menu