59 datasets found
  1. Anime Database for Recommendation system

    • kaggle.com
    zip
    Updated Jun 20, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    vishal mane (2020). Anime Database for Recommendation system [Dataset]. https://www.kaggle.com/vishalmane109/anime-recommendations-database
    Explore at:
    zip(3705416 bytes)Available download formats
    Dataset updated
    Jun 20, 2020
    Authors
    vishal mane
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    This dataset contains a total of 16737 unique animes. The reason for creating this dataset is the requirement of a clean dataset of Anime. I found a few datasets on anime, most of the datasets had the major anime but some dataset 1) doesn't have 'Genre' or 'Synopsis' of anime. For content-based recommendation, it is helpful if we have more information about anime 2) have duplicate data 3) missing data is represented by different notations.

    Content

    Anime_id :anime Id (as per myanimelist.net) Title : name of anime Genre :Main genre
    Synopsis :Brief Discription Type
    Producer Studio Rating :Rating of anime as pe myanimelist.net/ ScoredBy : Total no user scored given anime Popularity :Rank of anime based on popularity Members :No of members added given anime on their list Episodes : No. of episodes Source
    Aired Link

    Acknowledgements

    This dataset is a combination of 2 datasets

    1. https://docs.google.com/spreadsheets/d/1brguO5nGfXS-Fr1Xcf3pqPTQoBUPGLTYM_EMAA9yJFw/export?format=csv&id=1brguO5nGfXS-Fr1Xcf3pqPTQoBUPGLTYM_EMAA9yJFw&gid=0
    2. https://www.kaggle.com/CooperUnion/anime-recommendations-database
  2. MyAnimeList 2050 Animes and Users Dataset

    • kaggle.com
    zip
    Updated Oct 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cơm tấm sườn nướng (2025). MyAnimeList 2050 Animes and Users Dataset [Dataset]. https://www.kaggle.com/datasets/comtam/anime-dataset
    Explore at:
    zip(870677 bytes)Available download formats
    Dataset updated
    Oct 16, 2025
    Authors
    Cơm tấm sườn nướng
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Anime Recommendation Dataset (2,000 Titles + 250 Users)

    This dataset is designed for building and testing anime recommendation systems. It contains 2,000 anime titles and 250 virtual users with generated preference data, making it ideal for both content-based and collaborative filtering approaches.

    Files Included

    animes.csv

    The main metadata file containing information for each anime.

    Dataset Overview

    ColumnDescription
    uidUnique anime ID
    titleAnime title
    linkMyAnimeList link
    synopsisShort summary or plot description
    scoreAverage community rating
    rankedRanking based on score
    popularityPopularity index
    membersNumber of users who interacted with the anime
    episodesTotal number of episodes
    genreComma-separated genres for the anime

    anime_genre_binary.csv — One-hot encoded version of the dataset.

    This file is more convenient for training machine learning models.

    Dataset Overview

    ColumnDescription
    uidUnique anime ID
    titleAnime title
    scoreAverage community rating
    rankedRanking based on score
    popularityPopularity index
    genresComma-separated genres for the anime

    Genre Information

    The dataset covers 76 genres, offering rich diversity for modeling. Genres include but are not limited to:

    Action, Adventure, Comedy, Drama, Fantasy, Mecha, Romance, Sci-Fi, Slice of Life, Supernatural, and many more.

    (Full list of 76 genres is included in the dataset metadata.)

    list_of_users.csv

    This file represents synthetic user profiles, each describing their preference intensity for each genre. Useful for modeling user embeddings or computing user–genre similarity.

    ColumnDescription
    user_idUnique user ID (1–250)
    # Genre ColumnsEach genre’s preference score (0–10 scale)

    training_data.csv

    The user–anime rating matrix used for collaborative filtering or hybrid recommendation.

    ColumnDescription
    user_idReference to user in list_of_users.csv
    anime_idReference to anime in anime_genre_binary.csv
    scoreRating given by the user (0–10 scale)

    Use Cases

    -Content-based or hybrid anime recommendation systems -Clustering or similarity analysis based on genre and synopsis -NLP tasks such as synopsis embedding or sentiment classification -Exploratory Data Analysis (EDA) and visualization

    Notes

    -All data are cleaned and preprocessed for ease of use. -Missing values have been handled appropriately. -The one-hot encoded version is optimized for ML pipelines.

  3. Anime Dataset

    • kaggle.com
    zip
    Updated Feb 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    M Ahmad (2024). Anime Dataset [Dataset]. https://www.kaggle.com/datasets/itsnobita/anime-details
    Explore at:
    zip(607178 bytes)Available download formats
    Dataset updated
    Feb 21, 2024
    Authors
    M Ahmad
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Hello Everyone This dataset consist of detailed information about Animes. The dataset consist of different columns namely: - Name - Type - Rating - Rank - Description - Tags - NTags This dataset can be used for various purposes including Recommendation Engines and Search Engines. I have used the data for making a Search Engine that can be accessed here https://flask-production-16d0.up.railway.app/

  4. User Animelist Dataset

    • figshare.com
    csv
    Updated Jul 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ramazan Turan (2025). User Animelist Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.29538590.v1
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jul 20, 2025
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Ramazan Turan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset consists of user ratings for anime titles. Each user in the dataset has provided at least 5 ratings, ensuring a minimum level of engagement. The dataset includes user anime ratings and detailed information about anime, making it suitable for tasks such as recommendation systems and genre-based filtering. Dataset is freshly-created so it cover newer animes. Data is provided in the MovieLens format except timestamp column. With minor modifications, the dataset can be used in any recommendation project that utilizes the MovieLens dataset. I was able to train BERT model in https://github.com/jaywonchung/BERT4Rec-VAE-Pytorch project with some small modifications.Some StatisticsNumber of Users: 1,774,522Number of Animes: 20,237Total Ratings: 148,170,496BERT Anime Recommender GitHub repo : https://github.com/MRamazan/AnimeRecBERTDataset GitHub repo: https://github.com/MRamazan/User-Animelist-DatasetWeb demo: https://www.animerecbert.online (may be down)

  5. Top Anime Dataset 2024

    • kaggle.com
    zip
    Updated Apr 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bhavya Dhingra (2024). Top Anime Dataset 2024 [Dataset]. https://www.kaggle.com/datasets/bhavyadhingra00020/top-anime-dataset-2024
    Explore at:
    zip(469820 bytes)Available download formats
    Dataset updated
    Apr 29, 2024
    Authors
    Bhavya Dhingra
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This dataset offers a comprehensive overview of the top animes of 2024, and is useful for building recommendation systems, visualizing trends in anime popularity and score, predicting scores and popularity, and such.

    Contents

    The dataset contains 22 features:

    • Score: The rating or score assigned to each anime title.
    • Popularity: Measure of how popular each anime is among viewers.
    • Rank: Ranking of each anime title within the dataset.
    • Members: The number of members or viewers associated with each anime.
    • Description: A brief overview or summary of the plot and themes of each anime.
    • Synonyms: Alternative titles or synonyms used for each anime.
    • Japanese Title: Original title of the anime in Japanese.
    • English Title: English-translated title of the anime.
    • Type: Classification of anime type (e.g., TV series, movie, OVA, etc.).
    • Eps: Total number of episodes in each anime series.
    • Status: Current status of the anime (e.g., ongoing, completed, etc.).
    • Aired: Date range of when the anime was aired.
    • Premiered: Date when the anime premiered for the first time.
    • Broadcast: Information about the broadcasting platform or channel.
    • Producers: Companies or studios involved in producing the anime.
    • Licensors: Organizations or companies holding the licensing rights for the anime.
    • Studios: Animation studios responsible for producing the anime.
    • Source: Original source material for the anime (e.g., manga, novel, original).
    • Genres: Categories or genres that the anime belongs to.
    • Demographic: Target demographic audience for the anime (e.g., shounen, shoujo, seinen, josei).
    • Duration: Duration of each episode or movie.
    • Rating: Content rating assigned to each anime (e.g., G, PG, PG-13, R).

    Acknowledgements

    All of the information in this dataset has been gathered by scraping the MyAnimeList website, and is available under the Creative Commons License.

    Cover Photo by: Playground.ai

  6. Datasets to Evaluate Accuracy, Miscalibration and Popularity Lift in...

    • data.niaid.nih.gov
    Updated Sep 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kowald, Dominik (2023). Datasets to Evaluate Accuracy, Miscalibration and Popularity Lift in Recommendations [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7428434
    Explore at:
    Dataset updated
    Sep 12, 2023
    Dataset provided by
    Know-Centerhttp://know-center.at/
    Authors
    Kowald, Dominik
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains three datasets for evaluating accuracy, miscalibration and popularity lift in recommender systems. All datasets contain genre/category information in addition to different user group splits:

    Last.fm (lfm.zip), based on the LFM-1b dataset of JKU Linz (http://www.cp.jku.at/datasets/LFM-1b/)

    MovieLens (ml.zip), based on MovieLens-1M dataset (https://grouplens.org/datasets/movielens/1m/)

    MyAnimeList (anime.zip), based on the MyAnimeList dataset of Kaggle (https://www.kaggle.com/CooperUnion/anime-recommendations-database)

    'user_events_cats.txt' contains the users' rating/interaction data along with a list of genres/categories assigend to the rated items. The list of categories is given in 'categories.txt'. Additionally, assignments to three user groups that differ in their inclination to popular/mainstream items are provided: LowPop in 'low_main_users.txt', MedPop in 'med_main_users.txt', and HighPop in 'high_main_users.txt'.

    The format of the three user files are "user,mainstreaminess"

    The format of the user-events files are "user,item,preference,cats", where different categories are separated by '|'

    The format of the categories files are "category-name,index", where index refers to the category-id in the user-events files

    Example Python-code for analyzing the datasets as well as empirical results on calibration, popularity lift and accuracy can be found on GitHub: https://github.com/domkowald/FairRecSys

  7. Anime Content Based Recommendation System Datasets

    • kaggle.com
    zip
    Updated Mar 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mauli Patel 18 (2024). Anime Content Based Recommendation System Datasets [Dataset]. https://www.kaggle.com/datasets/maulipatel18/anime-content-based-recommendation-system-datasets/code
    Explore at:
    zip(26259955 bytes)Available download formats
    Dataset updated
    Mar 24, 2024
    Authors
    Mauli Patel 18
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Description

    Our dataset comprises comprehensive user preference data gathered from 73,516 avid anime enthusiasts, spanning across 12,294 diverse anime titles. Each individual user has the autonomy to curate their own completed anime list, supplemented with personal ratings reflecting their viewing experience. This rich compilation of user-generated ratings forms the backbone of our dataset, offering invaluable insights into the nuanced preferences and tastes of anime enthusiasts worldwide.

    Key Features

    User Profiles:-

    Explore the preferences and behaviors of over 73,000 users, each with their unique anime consumption habits and rating patterns.

    Anime Titles:-

    Dive into a vast collection of 12,000+ anime titles, ranging from timeless classics to contemporary releases across various genres and themes.

    Completed Lists:-

    Gain access to users' completed anime lists, providing a glimpse into the breadth and depth of their viewing history.

    Ratings:-

    Uncover users' subjective evaluations of anime titles, quantified through personalized ratings, offering a granular understanding of viewer satisfaction and engagement.

    Content

    Anime.csv

    anime_id - unique id identifying an anime. name - full name of anime. genre - comma separated list of genres for this anime. type - movie, TV, OVA, etc. episodes - how many episodes in this show. (1 if movie). rating - average rating out of 10 for this anime. members - number of community members that are in this anime's "group".

    Rating.csv

    user_id - non identifiable randomly generated user id. anime_id - the anime that this user has rated. rating - rating out of 10 this user has assigned (-1 if the user watched it but didn't assign a rating).

  8. c

    Top Popular Anime Dataset

    • cubig.ai
    zip
    Updated Jun 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Top Popular Anime Dataset [Dataset]. https://cubig.ai/store/products/539/top-popular-anime-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 30, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
    Description

    1) Data Introduction • The Top Popular Anime Dataset is a large, open-source dataset containing information about more than 22,000 animated series and movies collected through MyAnimeList's Jikan API.

    2) Data Utilization (1) Top Popular Anime Dataset has characteristics that: • This dataset includes unique identifiers, English/Japanese titles, genres, types (TVs, movies, etc.), number of episodes, airing status, start/end date, running time per episode, user rating, rating, age rating, production company, producer, image and trailer URL, synopsis, etc. • For some animations, some values may be missing, such as English titles, ratings, trailers, and end-of-air dates. (2) Top Popular Anime Dataset can be used to: • Development of a recommendation system: It can utilize a variety of information such as user ratings, genres, synopsis, etc. to build a personalized animation recommendation system. • Trend and Genre Analysis: By analyzing popularity and rating changes by time series, genre, and production company, trends and success factors in the animation industry can be derived.

  9. Fair RecSys Datasets

    • data.niaid.nih.gov
    Updated Feb 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kowald Dominik (2023). Fair RecSys Datasets [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6123878
    Explore at:
    Dataset updated
    Feb 22, 2023
    Dataset provided by
    Know-Centerhttp://know-center.at/
    Authors
    Kowald Dominik
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Four multimedia recommender systems datasets to study popularity bias and fairness:

    Last.fm (lfm.zip), based on the LFM-1b dataset of JKU Linz (http://www.cp.jku.at/datasets/LFM-1b/)

    MovieLens (ml.zip), based on MovieLens-1M dataset (https://grouplens.org/datasets/movielens/1m/)

    BookCrossing (book.zip), based on the BookCrossing dataset of Uni Freiburg (http://www2.informatik.uni-freiburg.de/~cziegler/BX/)

    MyAnimeList (anime.zip), based on the MyAnimeList dataset of Kaggle (https://www.kaggle.com/CooperUnion/anime-recommendations-database)

    Each dataset contains of user interactions (user_events.txt) and three user groups that differ in their inclination to popular/mainstream items: LowPop (low_main_users.txt), MedPop (med_main_users.txt), and HighPop (high_main_users.txt).

    The format of the three user files are "user,mainstreaminess"

    The format of the user-events files are "user,item,preference"

    Example Python-code for analyzing the datasets as well as more information on the user groups can be found on Github (https://github.com/domkowald/FairRecSys) and on Arxiv (https://arxiv.org/abs/2203.00376)

  10. h

    Anime_Characters

    • huggingface.co
    Updated Jun 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aditya Singh (2025). Anime_Characters [Dataset]. https://huggingface.co/datasets/adi2606/Anime_Characters
    Explore at:
    Dataset updated
    Jun 13, 2025
    Authors
    Aditya Singh
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    📚 Dataset Summary

    This dataset features 1,941 anime character images, neatly organized into 322 folders, each representing a different anime series 🎌.

    📦 Size of downloaded files: 152 MB 🪄 Size of auto-converted Parquet files: 151 MB 📊 Split: Train only 🎭 Classes: 322 unique anime titles

    Perfect for image classification, anime recommendation systems, and visual style analysis! 🎨✨

      🏆 Supported Tasks
    

    🖼️ Image Classification: Predict the anime title based on a… See the full description on the dataset page: https://huggingface.co/datasets/adi2606/Anime_Characters.

  11. h

    anime-titles-dataset

    • huggingface.co
    Updated Mar 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    realoperator42 (2025). anime-titles-dataset [Dataset]. https://huggingface.co/datasets/realoperator42/anime-titles-dataset
    Explore at:
    Dataset updated
    Mar 27, 2025
    Authors
    realoperator42
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Anime Dataset

      Dataset Description
    

    This dataset contains comprehensive information about anime series scraped from MyAnimeList (MAL). It includes detailed metadata about 871 (approx) anime series, making it valuable for various NLP tasks, recommendation systems, and cultural analysis. This dataset has NSFW content.

      Dataset Summary
    

    Anime Entries: 50 anime series with rich metadata Languages: English and Japanese (titles, descriptions) Format: JSONL (JSON Lines)… See the full description on the dataset page: https://huggingface.co/datasets/realoperator42/anime-titles-dataset.

  12. Anime-Planet Character Recommendation

    • kaggle.com
    zip
    Updated Aug 16, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hernan Valdivieso (2021). Anime-Planet Character Recommendation [Dataset]. https://www.kaggle.com/hernan4444/animeplanet-character-recommendation
    Explore at:
    zip(44839292 bytes)Available download formats
    Dataset updated
    Aug 16, 2021
    Authors
    Hernan Valdivieso
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Anime-Planet Character Recommendation

    Waifus and husbando dataset

    This dataset contains information about 132.028 characters and the preference from 72.629 different users of characters scrapped from anime-planet. In particular, this dataset contain:

    • Information about the character like Tags, description, genre, etc.
    • HTML with character information to do data scrapping. These files contain information such as name, alias, hate rank, tags, etc.
    • The character list per user. Include characters that love and hate.

    Warning: this dataset includes information on adult anime characters.

    Content

    The anime data was scrapped between June 29th and August 14th.

    • The "html" folder contain 1 html per character (132.028 different characters). I uploaded 2 files as example to don't increase the size of this dataset. All HTML files are in this link: https://drive.google.com/drive/folders/1Kg0OZ6dEsQuJZVqj1CcTGwDnwp4sNOnW?usp=sharing

    • user_characters.csv have the list of all character register by the user with the respective love boolean (means if the user love or hate the character). This dataset contains 12 Million row, 72.629 different animes and 132.028 different characters. The file have the following columns:

    1. user_id: non identifiable randomly generated user id.
    2. character_id: non identifiable randomly generated character id.
    3. loved: True if the user loves the character, False if he hates it.
    • characters_metadata.csv contain general information of every character (132.028 different character) like Tags, alias, name, gender, etc. This file have the following columns:
    1. ID: non identifiable randomly generated character id.
    2. Name: full name of this character.
    3. Alias: another way to call the character.
    4. Gender: gender of the this character.
    5. Hair Color: hair color of this character.
    6. Love Rank: love rank based in users preference.
    7. Hate Rank: hate rank based in users preference.
    8. Eye color: eye color of the character.
    9. Birthday: date of his birthday.
    10. Blood Type: blood type of the character.
    11. Tags: comma separated list of tags for this character.
    12. Love Count: how many users love this character.
    13. Hate Count: how many users hate this character.
    14. Description: short text with description if this character.
    15. url: url to the main page of character in Anime Planet.

    Acknowledgements

    Thanks to: 1. Anime Planet for providing anime data.

    Inspiration

    1. Experiment with different types of recommended. For instance, collaborative filtering or based on context like Tags, description, etc.

    2. Use this information to build a character recommended system.

    3. Build another dataset with anime topic.

    4. Try to Improve Anime Recommendation Database 2020 with more data of characters from the anime. This need to extract the anime id from every html.

  13. Anime Recommendation Dataset

    • kaggle.com
    zip
    Updated Sep 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    asel (2025). Anime Recommendation Dataset [Dataset]. https://www.kaggle.com/datasets/ylmzasel/anime-recommendation-dataset/suggestions
    Explore at:
    zip(80224 bytes)Available download formats
    Dataset updated
    Sep 13, 2025
    Authors
    asel
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains over 50,000 anime entries with key information useful for building recommendation systems.

    Data was collected via the AniList GraphQL API, cleaned, and formatted into a CSV file. This dataset is suitable for NLP tasks, recommendation engines, and data visualization projects.

    Inspired by the growing interest in anime recommendation models and the lack of comprehensive, high-quality datasets.

  14. h

    Anime-Viewers-Data

    • huggingface.co
    Updated Aug 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mikey (2025). Anime-Viewers-Data [Dataset]. https://huggingface.co/datasets/Mikey-TraceGod/Anime-Viewers-Data
    Explore at:
    Dataset updated
    Aug 31, 2025
    Authors
    Mikey
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    📺 Anime Watchers Dataset (1960–2025)

    📦 Dataset Size: 10,000 Records📄 Format: CSV📜 License: CC-BY-4.0

      📖 Overview
    

    This dataset contains 10,000 synthetic profiles of anime watchers, spanning from the Classic Era (1960s–1989) to the Modern Era (2010–2025).
    It is designed for:

    Data Analysis Machine Learning Recommendation Systems Trend Prediction in anime consumption.

      📂 Features
    

    Each record represents an individual anime watcher with detailed… See the full description on the dataset page: https://huggingface.co/datasets/Mikey-TraceGod/Anime-Viewers-Data.

  15. Anime Dataset for Recommendation Systems

    • kaggle.com
    zip
    Updated Feb 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dimple Bathija (2025). Anime Dataset for Recommendation Systems [Dataset]. https://www.kaggle.com/datasets/dimplebathija/anime-dataset-for-recommendation-systems
    Explore at:
    zip(314129 bytes)Available download formats
    Dataset updated
    Feb 26, 2025
    Authors
    Dimple Bathija
    Description

    Dataset

    This dataset was created by Dimple Bathija

    Contents

  16. h

    aimi-anime-rag-dataset-sample

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Divyanshu Singh, aimi-anime-rag-dataset-sample [Dataset]. http://doi.org/10.57967/hf/7055
    Explore at:
    Authors
    Divyanshu Singh
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    🎌 Ultimate Anime Dataset (8,248 Entries) | 1917-2025

      A meticulously curated collection spanning 108 years of anime history
    

    Love this dataset and the Anime Receipts concept? You can download the complete project via the links below:

      🚀 Unlock the Full Potential
    

    Product What You Get Get It Here

    Tier 1 8,248 Anime Dataset (Parquet)

    Tier 2 Full AiMi Recommendation System (Backend + UI)

    Tier 3 Ultimate AiMi Recommendation System + AiMi Anime… See the full description on the dataset page: https://huggingface.co/datasets/DivyanshuSingh96/aimi-anime-rag-dataset-sample.

  17. Anime MAL dataset

    • kaggle.com
    zip
    Updated Jul 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HOSSAM_AHMED_SALAH (2023). Anime MAL dataset [Dataset]. https://www.kaggle.com/datasets/hossamahmedsalah/anime-mal-dataset/code
    Explore at:
    zip(3172177 bytes)Available download formats
    Dataset updated
    Jul 20, 2023
    Authors
    HOSSAM_AHMED_SALAH
    License

    https://cdla.io/permissive-1-0/https://cdla.io/permissive-1-0/

    Description

    MyAnimeList is a popular online platform that allows users to create a list of anime and manga they have watched or read, rate them, and write reviews. The MyAnimeList dataset available on Kaggle is a collection of information about various anime titles and their corresponding attributes, such as title, genre, rating, popularity, and episode count. The columns include information about the anime title, the type of anime (TV show, movie, OVA, etc.), the genre(s) it belongs to, the studio that produced it, the source material (whether it is an original work or an adaptation), and the season and year of release.

    In addition to the basic information, the dataset also includes ratings and popularity metrics, such as the number of users who have rated the anime and the average rating score, as well as the number of members who have added the anime to their list and the number of favorites. Moreover, the dataset includes information about the anime's episodes, duration, and opening and ending themes.

    This dataset could be useful for various applications, such as building recommendation systems, conducting research on anime trends, and analyzing the relationship between various attributes (e.g., genre and popularity). Overall, the MyAnimeList dataset is an invaluable resource for anyone interested in anime and manga, and it provides a wealth of information that can be leveraged for various data-driven analyses.

  18. Anime Dataset 2024

    • kaggle.com
    zip
    Updated May 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Junaid Khan (2024). Anime Dataset 2024 [Dataset]. https://www.kaggle.com/datasets/junaidk0012/anime-dataset-2024
    Explore at:
    zip(11262184 bytes)Available download formats
    Dataset updated
    May 17, 2024
    Authors
    Junaid Khan
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This dataset is a comprehensive collection of anime data, fetched from the MyAnimeList website using the Jikan API. It includes information on the latest animes, making it a valuable resource for up-to-date recommendations.

    The dataset is primed for building an anime recommendation system. But you can also perform Exploratory data analysis , Data Cleaning .

    Overall, this dataset provides a rich source of information for anime enthusiasts and data scientists alike, offering a solid foundation for developing sophisticated recommendation systems and conducting insightful data analysis. It stands as a testament to the power of data in enhancing user experiences and driving innovation in the entertainment industry.

  19. Anime Recommendations Database vol.2

    • kaggle.com
    zip
    Updated Apr 25, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Noiru Chijimatsu (2021). Anime Recommendations Database vol.2 [Dataset]. https://www.kaggle.com/noiruuuu/anime-recommendations-database-vol2
    Explore at:
    zip(40672916 bytes)Available download formats
    Dataset updated
    Apr 25, 2021
    Authors
    Noiru Chijimatsu
    Description

    Context

    This data set contains information on user preference data from 108,024 users on 15,221 animes. Each user is able to add anime to their completed list and give it a rating and this data set is a compilation of those ratings. This data set includes animes up to 2020 winter. 108,024 users are targeted at any anime fan around the world between the ages of 14 to 34.

    Content

    Anime.csv - anime_id - myanimelist.net's unique id identifying an anime. - title - full title name of anime. - genres - comma separated list of genres for this anime. - media - movie, TV, OVA, etc. - episodes - how many episodes in this show. (1 if movie or ova). - rating - average rating out of 10 for this anime. - members - number of community members that are in this anime's "group". - start_date - when this anime started. - season - what season this anime started. - source - manga, light_novel, original, etc.

    Rating.csv - user_id - non identifiable randomly generated user id. - anime_id - the anime that this user has rated. - rating - rating out of 10 this user has assigned (0 if the user watched it but didn't assign a rating).

    Acknowledgements

    Thanks to myanimelist.net API for providing anime data and user ratings, and thanks to CooperUnion(https://www.kaggle.com/CooperUnion/anime-recommendations-database)

    Inspiration

    Building a better anime recommendation system based only on user viewing history.

  20. Anime List - CLEANED

    • kaggle.com
    zip
    Updated Aug 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    breadddDD (2025). Anime List - CLEANED [Dataset]. https://www.kaggle.com/datasets/breaddddd/anime-list-cleaned
    Explore at:
    zip(431541896 bytes)Available download formats
    Dataset updated
    Aug 1, 2025
    Authors
    breadddDD
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Based on : https://www.kaggle.com/datasets/tavuksuzdurum/user-animelist-dataset

    Cleaned_animelist: Consists of Anime ID, titles, type of content, year of release, score (average ratings from all the users), amount of episodes, the MyAnimeList URL, and sequel.

    ratings_df: unique UserID giving Ratings (/10) to each unique AnimeID, also includes the embeddings for users and animes.

    This dataset includes fresh and newer animes available, usable for data analysis and identifying trends.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
vishal mane (2020). Anime Database for Recommendation system [Dataset]. https://www.kaggle.com/vishalmane109/anime-recommendations-database
Organization logo

Anime Database for Recommendation system

anime information database for developing recommendation system or data analysis

Explore at:
zip(3705416 bytes)Available download formats
Dataset updated
Jun 20, 2020
Authors
vishal mane
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Context

This dataset contains a total of 16737 unique animes. The reason for creating this dataset is the requirement of a clean dataset of Anime. I found a few datasets on anime, most of the datasets had the major anime but some dataset 1) doesn't have 'Genre' or 'Synopsis' of anime. For content-based recommendation, it is helpful if we have more information about anime 2) have duplicate data 3) missing data is represented by different notations.

Content

Anime_id :anime Id (as per myanimelist.net) Title : name of anime Genre :Main genre
Synopsis :Brief Discription Type
Producer Studio Rating :Rating of anime as pe myanimelist.net/ ScoredBy : Total no user scored given anime Popularity :Rank of anime based on popularity Members :No of members added given anime on their list Episodes : No. of episodes Source
Aired Link

Acknowledgements

This dataset is a combination of 2 datasets

  1. https://docs.google.com/spreadsheets/d/1brguO5nGfXS-Fr1Xcf3pqPTQoBUPGLTYM_EMAA9yJFw/export?format=csv&id=1brguO5nGfXS-Fr1Xcf3pqPTQoBUPGLTYM_EMAA9yJFw&gid=0
  2. https://www.kaggle.com/CooperUnion/anime-recommendations-database
Search
Clear search
Close search
Google apps
Main menu