100+ datasets found
  1. u

    Amazon review data 2018

    • cseweb.ucsd.edu
    • nijianmo.github.io
    • +1more
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UCSD CSE Research Project, Amazon review data 2018 [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets/amazon_v2/
    Explore at:
    Dataset authored and provided by
    UCSD CSE Research Project
    Description

    Context

    This Dataset is an updated version of the Amazon review dataset released in 2014. As in the previous version, this dataset includes reviews (ratings, text, helpfulness votes), product metadata (descriptions, category information, price, brand, and image features), and links (also viewed/also bought graphs). In addition, this version provides the following features:

    • More reviews:

      • The total number of reviews is 233.1 million (142.8 million in 2014).
    • New reviews:

      • Current data includes reviews in the range May 1996 - Oct 2018.
    • Metadata: - We have added transaction metadata for each review shown on the review page.

      • Added more detailed metadata of the product landing page.

    Acknowledgements

    If you publish articles based on this dataset, please cite the following paper:

    • Jianmo Ni, Jiacheng Li, Julian McAuley. Justifying recommendations using distantly-labeled reviews and fined-grained aspects. EMNLP, 2019.
  2. d

    2005 - 2017 School Quality Review Ratings

    • catalog.data.gov
    • data.cityofnewyork.us
    • +2more
    Updated Nov 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.cityofnewyork.us (2024). 2005 - 2017 School Quality Review Ratings [Dataset]. https://catalog.data.gov/dataset/2005-2017-school-quality-review-ratings
    Explore at:
    Dataset updated
    Nov 29, 2024
    Dataset provided by
    data.cityofnewyork.us
    Description

    Yearly data of Quality Review ratings from 2005 to 2017

  3. Sentiment Analysis of movie review

    • kaggle.com
    Updated Nov 8, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mansi75 (2020). Sentiment Analysis of movie review [Dataset]. https://www.kaggle.com/mansi75/sentiment-analysis-of-movie-review-imdb/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 8, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Mansi75
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    I have always been a binge watcher and with so many movies and series to watch, a sentiment analysis of movie reviews is a good start to know more about them.

    Content

    This dataset contains the text of the reviews, together with a label that indi‐ cates whether a review is “positive” or “negative.” The IMDb website itself contains ratings from 1 to 10. To simplify the modeling, this annotation is summarized as a two-class classification dataset where reviews with a score of 6 or higher are labeled as positive, and the rest as negative.

    Acknowledgements

    author = {Maas, Andrew L. and Daly, Raymond E. and Pham, Peter T. and Huang, Dan and Ng, Andrew Y. and Potts, Christopher}, title = {Learning Word Vectors for Sentiment Analysis}, booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies}

    Inspiration

    One of the most simple but effective and commonly used ways to represent text for machine learning is using the bag-of-words representation. Classify the dataset with highest cross-validation accuracy with or without bag-of-words.

  4. o

    ChatGPT User Satisfaction Ratings

    • opendatabay.com
    .undefined
    Updated Jul 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Datasimple (2025). ChatGPT User Satisfaction Ratings [Dataset]. https://www.opendatabay.com/data/ai-ml/fd21bbf8-e5bf-4a34-93c2-57ae36ffbaf0
    Explore at:
    .undefinedAvailable download formats
    Dataset updated
    Jul 3, 2025
    Dataset authored and provided by
    Datasimple
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Reviews & Ratings
    Description

    This dataset provides user reviews for ChatGPT, offering valuable qualitative feedback, satisfaction ratings, and submission dates. It captures a diverse array of user sentiments, from concise remarks to more detailed feedback. The ratings are provided on a scale of 1 to 5, indicating different levels of user satisfaction. The dataset spans several months, which allows for temporal analysis of sentiment trends, as each review includes a timestamp. This data is ideal for gaining insights into user characteristics and for improving application features and services.

    Columns

    • Review Id: A unique identifier for each individual review. This is formatted as a String, typically in a UUID structure.
    • Review: The actual text of the user's feedback, offering qualitative insights into their experience with the application. This is a String data type.
    • Ratings: User-submitted numerical ratings, ranging from 1 (lowest satisfaction) to 5 (highest satisfaction), indicating their level of contentment. This is an Integer data type.
    • Review Date: The timestamp when the review was originally submitted, recorded in MM/DD/YYYY HH:MM format, serving as a Date_Time data type.

    Distribution

    The dataset is provided as a free resource. While a sample file will be updated separately to the platform, the data quality is assessed as 5 out of 5, with the current version being 1.0. It was listed on 08/06/2025, with 1 view and 0 downloads recorded so far. The dataset contains approximately 193,154 unique reviews.

    Usage

    This dataset is particularly useful for various analytical applications, including: * Sentiment Analysis: Developing models to predict the emotional tone or sentiment conveyed in user reviews. * Customer Feedback Analysis: Extracting actionable insights that can inform and guide improvements to application features and services. * Review Classification: Building machine learning models to categorise user reviews, for instance, as positive or negative. * Data Visualisation: Creating visual representations of review patterns and trends. * Exploratory Data Analysis: Investigating the characteristics and underlying patterns within the review data. * Natural Language Processing (NLP): Applying NLP techniques to understand and process the textual feedback. * Text Mining: Discovering patterns and insights from the large collection of text reviews. * Time-Series Analysis: Examining how sentiment and ratings evolve over time based on review timestamps.

    Coverage

    This dataset comprises user reviews for ChatGPT collected from 25th July 2023 to 24th August 2024. The data collection is global, reflecting feedback from users worldwide.

    License

    CCO

    Who Can Use It

    This dataset is ideal for a range of users interested in understanding user feedback and sentiment, including: * Data Scientists and Machine Learning Engineers for building and training sentiment analysis and classification models. * Product Managers and App Developers to gain actionable insights for product improvement and feature development. * Market Researchers to understand user satisfaction and market perception of AI applications. * Academic Researchers studying human-computer interaction, natural language processing, or user behaviour.

    Dataset Name Suggestions

    • ChatGPT User Reviews
    • GPT User Review Sentiment Data
    • AI App User Feedback Dataset
    • ChatGPT User Satisfaction Ratings

    Attributes

    Original Data Source: ChatGPT Users Reviews

  5. Independent Medical Review (IMR) Determinations, Trend

    • data.chhs.ca.gov
    csv, pdf, zip
    Updated Jul 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Managed Health Care (2025). Independent Medical Review (IMR) Determinations, Trend [Dataset]. https://data.chhs.ca.gov/dataset/independent-medical-review-imr-determinations-trend
    Explore at:
    csv(71304229), pdf(67720), zipAvailable download formats
    Dataset updated
    Jul 14, 2025
    Dataset provided by
    California Department of Managed Health Care
    Authors
    Department of Managed Health Care
    Description

    This data is from the California Department of Managed Health Care (DMHC). It contains all decisions from Independent Medical Reviews (IMR) administered by the DMHC since January 1, 2001. An IMR is an independent review of a denied, delayed, or modified health care service that the health plan has determined to be not medically necessary, experimental/investigational or non-emergent/urgent. If the IMR is decided in an enrollees favor, the health plan must authorize the service or treatment requested.

  6. Curated Email-Based Code Reviews Datasets

    • figshare.com
    bin
    Updated Feb 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mingzhao Liang; Ping Charoenwet; Patanamon Thongtanunam (2024). Curated Email-Based Code Reviews Datasets [Dataset]. http://doi.org/10.6084/m9.figshare.24679656.v1
    Explore at:
    binAvailable download formats
    Dataset updated
    Feb 7, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Mingzhao Liang; Ping Charoenwet; Patanamon Thongtanunam
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Code review is an important practice that improves the overall quality of a proposed patch (i.e. code changes). While much research focused on tool-based code reviews (e.g. a Gerrit code review tool, GitHub), many traditional open-source software (OSS) projects still conduct code reviews through emails. However, due to the nature of unstructured email-based data, it can be challenging to mine email-based code reviews, hindering researchers from delving into the code review practice of such long-standing OSS projects. Therefore, this paper presents large-scale datasets of email-based code reviews of 167 projects across three OSS communities (i.e. Linux Kernel, OzLabs, and FFmpeg). We mined the data from Patchwork, a web-based patch-tracking system for email-based code review, and curated the data by grouping a submitted patch and its revised versions and grouping email aliases. Our datasets include a total of 4.2M patches with 2.1M patch groups and 169K email addresses belonging to 141K individuals. Our published artefacts include the datasets as well as a tool suite to crawl, curate, and store Patchwork data. With our datasets, future work can directly delve into an email-based code review practice of large OSS projects without additional effort in data collection and curation.

  7. o

    Overwatch 2 - Steam Review Dataset

    • opendatabay.com
    .undefined
    Updated Jun 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Datasimple (2025). Overwatch 2 - Steam Review Dataset [Dataset]. https://www.opendatabay.com/data/ai-ml/2acc096a-d2df-4630-a0aa-ebda8024d61c
    Explore at:
    .undefinedAvailable download formats
    Dataset updated
    Jun 27, 2025
    Dataset authored and provided by
    Datasimple
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Entertainment & Media Consumption
    Description

    This dataset comprises user reviews and associated data for Overwatch 2, a popular video game title, sourced from the official Steam store. Overwatch 2 is the highly anticipated sequel to the original Overwatch game, developed by Blizzard Entertainment. As we know, it's renowned for its unfavorable reviews on Steam.

    I don't scrape many reviews because it would take a wicked amount of time and resources to do so.

    Disclaimer All data belongs to Valve Corporation and are not mine

    License

    CC0

    Original Data Source: Overwatch 2 - Steam Review Dataset

  8. Action Movies Review

    • kaggle.com
    Updated Jul 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sleep2sleep1 (2023). Action Movies Review [Dataset]. https://www.kaggle.com/datasets/sleep2sleep1/action-movies-review
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 5, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    sleep2sleep1
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset consist of 2300+ action movies reviews. It has most of the info. related to movies including ratings. Collecting from most active critics, very helpful for all NLP tasks and ML operations.

  9. Frequency of referring to Google reviews India 2022

    • statista.com
    Updated Jun 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Frequency of referring to Google reviews India 2022 [Dataset]. https://www.statista.com/statistics/1379991/india-frequency-of-referring-to-google-reviews/
    Explore at:
    Dataset updated
    Jun 19, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Dec 13, 2022
    Area covered
    India
    Description

    According to a survey conducted by in December 2022 in India, 25 percent of Indians referred to Google reviews 50 percent to 75 percent of the times they use the search engine to find out more about a business. Comparatively, less than 10 percent of respondents never referred to Google reviews. In response to increasing complaints about fake online reviews, the Bureau of Indian Standards (BIS) guidelines were brought into effect in November 2022.

  10. Data from: Paper Reviews Data Set

    • kaggle.com
    Updated Jan 22, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Felipe Navarro (2018). Paper Reviews Data Set [Dataset]. https://www.kaggle.com/fnbalves/paper-reviews-data-set/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 22, 2018
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Felipe Navarro
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    Dataset

    This dataset was created by Felipe Navarro

    Released under Database: Open Database, Contents: Database Contents

    Contents

  11. h

    reviews

    • huggingface.co
    Updated Apr 28, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ashish Kumar (2024). reviews [Dataset]. https://huggingface.co/datasets/ashishkgpian/reviews
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 28, 2024
    Authors
    Ashish Kumar
    License

    Attribution 2.0 (CC BY 2.0)https://creativecommons.org/licenses/by/2.0/
    License information was derived automatically

    Description

    ashishkgpian/reviews dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. Etsy Reviews

    • kaggle.com
    Updated Apr 17, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abdulelah (2020). Etsy Reviews [Dataset]. https://www.kaggle.com/datasets/csabdulelah/etsy-seller-reviews/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 17, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Abdulelah
    Description

    Context

    There's a story behind every dataset and here's your opportunity to share yours. Welcome there are 1000 rows of reviews of different products at https://www.etsy.com/?ref=lgo. Etsy is a global online marketplace, where people come together to make, sell, buy and collect unique items.

    Content

    There are 2 columns in the data set, the first column is the review text and the second is the number of stars out of 5.

    Acknowledgements

    Thank you for all members of DSI7, instructors, Ais, and students.

    Inspiration

    sentiment analysis.

  13. o

    Playstore Review Analytics Data

    • opendatabay.com
    .undefined
    Updated Jul 7, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Datasimple (2025). Playstore Review Analytics Data [Dataset]. https://www.opendatabay.com/data/ai-ml/a62f86b2-2039-45fa-8758-a78fbbcedf6a
    Explore at:
    .undefinedAvailable download formats
    Dataset updated
    Jul 7, 2025
    Dataset authored and provided by
    Datasimple
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Reviews & Ratings
    Description

    This dataset is a collection of user reviews for various Google Apps available on the Play Store. It provides detailed insights into user feedback, ratings, and engagement with different applications. The dataset's primary purpose is to offer a rich resource for understanding user sentiment, identifying app performance issues, and tracking user satisfaction over time. It is a valuable asset for analytics and natural language processing tasks related to app reviews.

    Columns

    • reviewId: A unique identifier for each individual user review.
    • userName: The name of the user who submitted the review.
    • userImage: The URL pointing to the user's profile image.
    • content: The textual review provided by the user about the app.
    • score: The numerical rating given by the user for the app, typically on a scale of 1 to 5.
    • thumbsUpCount: The total number of likes or "thumbs up" received by that specific review.
    • reviewCreatedVersion: The version of the app that was being reviewed at the time the review was created.
    • at: The date and time when the user's review was created.
    • replyContent: The textual content of the reply provided by the app developer to the user's review. A significant portion of reviews do not have a developer reply.
    • repliedAt: The date and time when the developer's reply was issued. Many entries in this column are null, indicating no developer response.

    Distribution

    The dataset contains over 90,000 app reviews. The score column shows a distribution across ratings, with substantial counts for scores like 1.00-1.20, 2.00-2.20, 3.00-3.20, 4.00-4.20, and 4.80-5.00. For thumbsUpCount, the majority of reviews have a relatively low number of likes (0-720), but there are instances with significantly higher counts, reaching up to over 14,000 likes. The reviewCreatedVersion column shows a variety of app versions, with some being more frequently reviewed than others. Review creation dates span a period from April 2014 to February 2021, with a notable increase in review volume towards the later years, particularly between May 2020 and February 2021.

    Usage

    This dataset is ideal for: * Sentiment analysis of app reviews. * Natural Language Processing (NLP) tasks, such as topic modelling, text classification, and entity recognition. * App performance monitoring and identifying user pain points. * Market research on user satisfaction and trends in app usage. * Developing AI and Machine Learning models for predicting app ratings or automatically classifying feedback.

    Coverage

    The dataset offers global coverage for app reviews. The time range for review creation spans from 10th April 2014 to 4th February 2021. While developer replies are included, the data on repliedAt primarily indicates a single latest date (4th February 2021) with the majority being null, suggesting that developer reply timestamps are not as broadly distributed across the dataset as review creation times.

    License

    CC0

    Who Can Use It

    • App Developers: To understand user feedback, identify bugs, and improve app features.
    • Data Analysts: For trends analysis, user behaviour insights, and reporting.
    • Researchers: In fields like computer science, internet studies, and data analytics for academic studies on online reviews.
    • Machine Learning Engineers: To train models for sentiment analysis, user support automation, or content moderation.
    • Product Managers: To gather insights for product iteration and strategic planning.

    Dataset Name Suggestions

    • Google Play Store App Reviews
    • Play Store User Feedback
    • Google Apps Ratings and Reviews
    • Mobile App Review Data
    • Playstore Review Analytics Data

    Attributes

    Original Data Source: Google Apps Playstore Reviews

  14. d

    State Review Framework Manager Database

    • catalog.data.gov
    Updated Jan 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OECA, Office of Compliance (2022). State Review Framework Manager Database [Dataset]. https://catalog.data.gov/dataset/state-review-framework-manager-database
    Explore at:
    Dataset updated
    Jan 24, 2022
    Dataset provided by
    OECA, Office of Compliance
    Description

    The State Review Framework is a primary means by which EPA conducts oversight of three core federal statutes: Clean Air Act, Clean Water Act, and Resource Conservation and Recovery Act. The routine, nationwide review provides a consistent process for evaluating the performance of state, local and EPA compliance and enforcement programs. The overarching goal of the reviews is to ensure fair and consistent enforcement necessary to protect human health and the environment.

  15. U.S. online review reading time prior to decision 2019, by age group

    • statista.com
    Updated Apr 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2022). U.S. online review reading time prior to decision 2019, by age group [Dataset]. https://www.statista.com/statistics/713110/us-online-review-usage-frequency-new-purchases-age-group/
    Explore at:
    Dataset updated
    Apr 28, 2022
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Nov 2019
    Area covered
    United States
    Description

    Younger U.S. online users were more likely to spend a longer time reading online reviews than older internet users. During the November 2019 survey, seven percent of respondents aged 18 to 34 years stated that on average they spend at least an hour reading reviews making reviews before making a purchase decision. Only two percent of respondents aged 35 to 54 years stated the same.

  16. P

    Data from: Towards a Data-Driven Requirements Engineering Approach:...

    • paperswithcode.com
    • data.niaid.nih.gov
    • +1more
    Updated Oct 31, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jialiang Wei; Anne-Lise Courbis; Thomas Lambolais; Binbin Xu; Pierre Louis Bernard; Gérard Dray (2022). Towards a Data-Driven Requirements Engineering Approach: Automatic Analysis of User Reviews Dataset [Dataset]. https://paperswithcode.com/dataset/towards-a-data-driven-requirements
    Explore at:
    Dataset updated
    Oct 31, 2022
    Authors
    Jialiang Wei; Anne-Lise Courbis; Thomas Lambolais; Binbin Xu; Pierre Louis Bernard; Gérard Dray
    Description

    6000 French user reviews from three applications on Google Play (Garmin Connect, Huawei Health, Samsung Health) are labelled manually. We selected four labels: rating, bug report, feature request and user experience.

    Ratings are simple text which express the overall evaluation to that app, including praise, criticism, or dissuasion. Bug reports show the problems that users have met while using the app, like loss of data, crash of app, connection error, etc. Feature requests reflect the demande of users on new function, new content, new interface, etc. In user experience, users describe their experience in relation to the functionality of the app, how does certain functions be helpful.

    As we can observe from the following table, that shows examples of labelled user reviews, each review belongs to one or more categories.

    AppTotalRatingBug reportFeature requestUser experience
    Garmin Connect20001260757170493
    Huawei Health20001068819384289
    Samsung Health20001324491486349
  17. w

    Websites using Google Reviews Widget

    • webtechsurvey.com
    csv
    Updated Oct 27, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    WebTechSurvey (2020). Websites using Google Reviews Widget [Dataset]. https://webtechsurvey.com/technology/google-reviews-widget
    Explore at:
    csvAvailable download formats
    Dataset updated
    Oct 27, 2020
    Dataset authored and provided by
    WebTechSurvey
    License

    https://webtechsurvey.com/termshttps://webtechsurvey.com/terms

    Time period covered
    2025
    Area covered
    Global
    Description

    A complete list of live websites using the Google Reviews Widget technology, compiled through global website indexing conducted by WebTechSurvey.

  18. h

    Indian_Street_Food_Reviews

    • huggingface.co
    Updated Jun 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rohit Jha (2025). Indian_Street_Food_Reviews [Dataset]. https://huggingface.co/datasets/rohitjha09/Indian_Street_Food_Reviews
    Explore at:
    Dataset updated
    Jun 24, 2025
    Authors
    Rohit Jha
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    🍽️ Indian Street Food Reviews Dataset

    A small but rich dataset of Indian street food reviews, categorized by dish, location, sentiment, and star rating. Perfect for projects involving sentiment analysis, regional food preference prediction, NLP, and recommendation systems.

    📦 Dataset Summary

    This dataset contains short user-generated reviews (1–2 sentences) of popular Indian street foods from different regions across India. Each review includes:

  19. IMDB review data

    • kaggle.com
    Updated May 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muaaz9922 (2024). IMDB review data [Dataset]. https://www.kaggle.com/muaaz9922/imdb-review-data/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 10, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Muaaz9922
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Muaz Tahir

    Released under CC0: Public Domain

    Contents

  20. amazon-all-categories-best-sellers-reviews

    • huggingface.co
    Updated Aug 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    fastai X Hugging Face Group 2022 (2023). amazon-all-categories-best-sellers-reviews [Dataset]. https://huggingface.co/datasets/hugginglearners/amazon-all-categories-best-sellers-reviews
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 19, 2023
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    fastai X Hugging Face Group 2022
    Description

    hugginglearners/amazon-all-categories-best-sellers-reviews dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
UCSD CSE Research Project, Amazon review data 2018 [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets/amazon_v2/

Amazon review data 2018

Explore at:
83 scholarly articles cite this dataset (View in Google Scholar)
Dataset authored and provided by
UCSD CSE Research Project
Description

Context

This Dataset is an updated version of the Amazon review dataset released in 2014. As in the previous version, this dataset includes reviews (ratings, text, helpfulness votes), product metadata (descriptions, category information, price, brand, and image features), and links (also viewed/also bought graphs). In addition, this version provides the following features:

  • More reviews:

    • The total number of reviews is 233.1 million (142.8 million in 2014).
  • New reviews:

    • Current data includes reviews in the range May 1996 - Oct 2018.
  • Metadata: - We have added transaction metadata for each review shown on the review page.

    • Added more detailed metadata of the product landing page.

Acknowledgements

If you publish articles based on this dataset, please cite the following paper:

  • Jianmo Ni, Jiacheng Li, Julian McAuley. Justifying recommendations using distantly-labeled reviews and fined-grained aspects. EMNLP, 2019.
Search
Clear search
Close search
Google apps
Main menu