100+ datasets found
  1. Amazon Reviews Dataset

    • kaggle.com
    Updated Sep 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dongre Laxman (2024). Amazon Reviews Dataset [Dataset]. https://www.kaggle.com/datasets/dongrelaxman/amazon-reviews-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 20, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Dongre Laxman
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This dataset comprises customer reviews for Amazon, an online retail giant, featuring insights into customer experiences, including ratings, review titles, texts, and metadata. It is valuable for analyzing customer satisfaction, sentiment, and trends.

    Column Descriptions:

    Reviewer Name: Identifies the reviewer. Profile Link: Links to the reviewer's profile for additional insights. Country: Indicates the reviewer's location. Review Count: Number of reviews by the same user, showing engagement level. Review Date: When the review was posted, useful for time analysis. Rating: Numerical satisfaction measure. Review Title: Summarizes the review sentiment. Review Text: Detailed customer feedback. Date of Experience: When the service/product was experienced.

    Prospective applications:

    Sentiment Analysis: Analyze review texts and titles to assess overall customer sentiment toward products, enabling the identification of strengths and weaknesses. Customer Satisfaction Tracking: Track and visualize rating trends over time to understand fluctuations in customer satisfaction. Product Improvement: Identify common themes in reviews to highlight areas for product enhancement or development. Market Segmentation: Use country and demographic information to customize marketing strategies and gain insights into regional preferences. Competitor Analysis: Evaluate customer feedback on Amazon products in comparison to competitors to determine market positioning. Recommendation Systems: Leverage review data to enhance recommendation algorithms, improving personalized shopping experiences. Trend Analysis: Investigate temporal patterns in reviews to link sentiment changes with marketing efforts or product launches.

    This extensive dataset serves as a valuable asset for various analyses focused on enhancing customer engagement and refining business strategies.

  2. Phishing and Legitimate URLS

    • kaggle.com
    Updated Sep 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hari sudhan411 (2023). Phishing and Legitimate URLS [Dataset]. https://www.kaggle.com/datasets/harisudhan411/phishing-and-legitimate-urls
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 21, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Hari sudhan411
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset encompasses a comprehensive collection of over 800,000 URLs, meticulously curated to provide a diverse representation of online domains. Within this extensive corpus, approximately 52% of the domains are identified as legitimate, reflective of established and trustworthy entities within the digital landscape. Conversely, the remaining 47% of domains are categorized as phishing domains, indicative of potential threats and malicious activities.

    Structured with precision, the dataset comprises two key columns: "url" and "status". The "url" column serves as the primary identifier, housing the uniform resource locators (URLs) for each respective domain. Meanwhile, the "status" column employs binary encoding, with values represented as 0 and 1. Herein lies a crucial distinction: a value of 0 designates domains flagged as phishing, signaling a potential risk to users, while a value of 1 signifies domains deemed legitimate, offering assurance and credibility. Additionally paramount importance is the careful balance maintained between these two categories. With an almost equal distribution of instances across phishing and legitimate domains, this dataset mitigates the risk of class imbalance, ensuring robustness and reliability in subsequent analyses and model development. This deliberate approach fosters a more equitable and representative dataset, empowering researchers and practitioners in their endeavors to understand, combat, and mitigate online threats.

  3. Amazon Reviews Dataset

    • kaggle.com
    Updated Jan 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniel Ihenacho (2023). Amazon Reviews Dataset [Dataset]. https://www.kaggle.com/datasets/danielihenacho/amazon-reviews-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 2, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Daniel Ihenacho
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset was created from the scraped reviews from products in Amazon for the purpose of text classification. The classes are three in number namely; - Negative Reviews - Neutral Reviews - Positive Reviews

    Data columns includes; - Sentiments - Cleaned Review - Cleaned Review Length - Review Score

    This dataset presents the problem of multiclass classification with the use of ML algorithms and also deep learning algorithms. Moreover, there is a class imbalance; negative reviews has the lowest number of reviews compared to positive and neutral reviews.

    For ML algo use a mapping of; negative--> -1, neutral--> 0, positive --> 1

    For Deep Learning algo use a mapping of; negative --> 0 neutral --> 1 positive --> 2

    Looking forward to your model discoveries on this dataset.

    Please leave an upvote if you find this relevant πŸ˜€.

  4. Online Education System - Review

    • kaggle.com
    • data.mendeley.com
    Updated Dec 30, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr. Sujatha R (2021). Online Education System - Review [Dataset]. https://www.kaggle.com/datasets/sujaradha/online-education-system-review
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 30, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Dr. Sujatha R
    Description

    Pandemic has influenced all spheres of the humanity. COVID-19 impacted the education vertical in larger manner. Traditional classroom environment plays a very vital role in molding the life of an individual. Bond nurtured in the early ages of the life acts as the great moral support in the latter stages of the journey. As the pandemic has forced us into online education, this data collection aims to analyze the impact of online education. To check out the satisfactory level of the learners, review was conducted.

    Gender – Male, Female Home Location – Rural, Urban Level of Education – Post Graduate, School, Under Graduate Age – Years Number of Subjects – 1- 20 Device type used to attend classes – Desktop, Laptop, Mobile Economic status – Middle Class, Poor, Rich Family size – 1 -10 Internet facility in your locality – Number scale (Very Bad to Very Good) Are you involved in any sports? – Yes, No Do elderly people monitor you? – Yes, No Study time – Hours Sleep time – Hours Time spent on social media – Hours Interested in Gaming? – Yes, No Have separate room for studying? – Yes, No Engaged in group studies? – Yes, No Average marks scored before pandemic in traditional classroom – range Your interaction in online mode - Number scale (Very Bad to Very Good) Clearing doubts with faculties in online mode - Number scale (Very Bad to Very Good) Interested in? – Practical, Theory, Both Performance in online - Number scale (Very Bad to Very Good) Your level of satisfaction in Online Education – Average, Bad, Good

    radhakrishnan, sujatha (2021), β€œOnline Education System - Review”, Mendeley Data, V1, doi: 10.17632/bzk9zbyvv7.1

  5. Amazon Fine Food Reviews

    • kaggle.com
    zip
    Updated May 1, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford Network Analysis Project (2017). Amazon Fine Food Reviews [Dataset]. https://www.kaggle.com/snap/Amazon-fine-food-reviews
    Explore at:
    zip(253873708 bytes)Available download formats
    Dataset updated
    May 1, 2017
    Dataset authored and provided by
    Stanford Network Analysis Project
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    This dataset consists of reviews of fine foods from amazon. The data span a period of more than 10 years, including all ~500,000 reviews up to October 2012. Reviews include product and user information, ratings, and a plain text review. It also includes reviews from all other Amazon categories.

    Contents

    • Reviews.csv: Pulled from the corresponding SQLite table named Reviews in database.sqlite
    • database.sqlite: Contains the table 'Reviews'

    Data includes:
    - Reviews from Oct 1999 - Oct 2012
    - 568,454 reviews
    - 256,059 users
    - 74,258 products
    - 260 users with > 50 reviews

    wordcloud

    Acknowledgements

    See this SQLite query for a quick sample of the dataset.

    If you publish articles based on this dataset, please cite the following paper:

  6. Customer Reviews of Top Speaker Brands

    • kaggle.com
    Updated Sep 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pattipati Sai Chandu (2024). Customer Reviews of Top Speaker Brands [Dataset]. https://www.kaggle.com/datasets/pattipatisaichandu/customer-reviews-of-top-speaker-brands
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 17, 2024
    Dataset provided by
    Kaggle
    Authors
    Pattipati Sai Chandu
    Description

    Speakers Reviews Dataset This dataset contains reviews of various speakers from different brands, giving insights into customer experiences, ratings, and feedback. Here’s what you’ll typically find in the dataset:

    What’s in it?

    Review ID: A unique code for each review.

    Title: The headline of the review, usually a short summary of the user's opinion.

    Author: The name (or username) of the person who wrote the review.

    Rating:The star rating given by the reviewer (usually out of 5).

    Content:The full text of the review, where people share what they liked or didn’t like about the speaker.

    Timestamp: When the review was posted (date and time).

    Verified Purchase:Whether the reviewer actually bought the speaker or not.

    Helpful Count:How many people found this review useful (it’s a thumbs-up count).

    Product Attributes: Details about the speaker itself, like brand, model, features, etc.

    Company :Name of the company.

  7. Amazon Reviews

    • kaggle.com
    Updated Jan 18, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jagdish Chavan (2023). Amazon Reviews [Dataset]. https://www.kaggle.com/datasets/jagdishchavan/amazon-reviews
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 18, 2023
    Dataset provided by
    Kaggle
    Authors
    Jagdish Chavan
    Description

    Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazon’s iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

  8. Steam Review&Games Dataset

    • kaggle.com
    Updated Dec 16, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Filip Kin (2024). Steam Review&Games Dataset [Dataset]. https://www.kaggle.com/datasets/filipkin/steam-reviews
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 16, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Filip Kin
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    The dataset provides a collection of game reviews from the Steam platform, making it suitable for natural language processing (NLP) and machine learning projects. The columns included are:

    • id: Unique identifier for the review.
    • app_id: Game identifier on Steam.
    • content: Review text.
    • author_id: Identifier for the review's author.
    • is_positive: Label indicating whether the review is positive (1) or negative (0).

    Potential Use Cases: - Sentiment classification (positive vs. negative). - Textual exploration (e.g., identifying frequently used words in positive reviews). - Training NLP models.

  9. 3.5M Tiktok Mobile App Reviews

    • kaggle.com
    Updated Sep 23, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shivam Bansal (2021). 3.5M Tiktok Mobile App Reviews [Dataset]. https://www.kaggle.com/datasets/shivamb/35-million-tiktok-mobile-app-reviews/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 23, 2021
    Dataset provided by
    Kaggle
    Authors
    Shivam Bansal
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    This dataset contains reviews for one of the most popular mobile app - tiktok. All the publicly posted reviews are scraped from the google play store.

    Inspiration

    • The dataset can be used to identify key insights related to the app, key problems/issues people have raised.
    • Perform sentiment analysis of the reviews and find what people are talking about.
    • Perform topic modeling to identify key topics mentioned in the review over time
    • Generate visualizations of different worlds / n-grams / topics extracted from the reviews.
  10. Italian Reviews From Amazon

    • kaggle.com
    Updated May 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Giuseppe Cristofaro (2024). Italian Reviews From Amazon [Dataset]. https://www.kaggle.com/datasets/giuseppecristofaro/italian-reviews-from-amazon/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 24, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Giuseppe Cristofaro
    Description

    The dataset used for this project consists of customer reviews with the following columns:

    • **score: **The rating given by the customer.
    • **sentiment: **The sentiment label (positive, neutral, negative).
    • ** review: **The text of the customer review.

    The dataset contains 17,340 entries with three columns. The data is loaded from a CSV file.

    The dataset was generated from a shared Kaggle dataset of Amazon reviews and fully translated into Italian using a Python script with Google APIs. The dataset is very rich, containing over 17,000 reviews.

    However, it has one issue: it is highly imbalanced. This imbalance influenced the decision to work with this dataset for experiments during the model training phase.

  11. review-chekpoints--2025-06-27--12796-12797

    • kaggle.com
    Updated Jun 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marietta Kiser (2025). review-chekpoints--2025-06-27--12796-12797 [Dataset]. https://www.kaggle.com/datasets/mariettakiser/review-chekpoints--2025-06-27--12796-12797/versions/1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 27, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Marietta Kiser
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Afer files review-chekpoints--2025-06-27--12796-12797

  12. review-chekpoints--2025-07-06--12814-12815

    • kaggle.com
    Updated Jul 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marietta Kiser (2025). review-chekpoints--2025-07-06--12814-12815 [Dataset]. https://www.kaggle.com/datasets/mariettakiser/review-chekpoints--2025-07-06--12814-12815
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 6, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Marietta Kiser
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Afer files review-chekpoints--2025-07-06--12814-12815

  13. Amazon review dataset

    • kaggle.com
    Updated May 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MOHAMED AMINE SABBAHI (2024). Amazon review dataset [Dataset]. https://www.kaggle.com/datasets/mohamedaminesabbahi/amazon-review-dataset/versions/1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 25, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    MOHAMED AMINE SABBAHI
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by MOHAMED AMINE SABBAHI

    Released under Apache 2.0

    Contents

  14. Yelp Review with Sentiments and Features

    • kaggle.com
    Updated Feb 1, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Naveed Hussain (2021). Yelp Review with Sentiments and Features [Dataset]. http://doi.org/10.34740/kaggle/dsv/1898501
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 1, 2021
    Dataset provided by
    Kaggle
    Authors
    Naveed Hussain
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Yelp hotels and restaurants reviews ( spam and not spam) with sentiments ( positive, negative, and neutral) and review features. Please cite following our published works, when used this dataset. 1. Naveed Hussain, Hamid Turab Mirza, Faiza Iqbal, Ibrar Hussain, and Mohammad Kaleem. "Detecting Spam Product Reviews in Roman Urdu Script." The Computer Journal (2020).
    2. Naveed Hussain, Hamid Turab Mirza, Abid Ali, Faiza Iqbal, Ibrar Hussain, and Mohammad Kaleem. " Spammer group detection and diversification of customers’ reviews ". PeerJ Computer Science 7:e472 https://doi.org/10.7717/peerj-cs.472 (2021).

  15. review-chekpoints--2025-02-03--12377-12378

    • kaggle.com
    Updated Feb 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marietta Kiser (2025). review-chekpoints--2025-02-03--12377-12378 [Dataset]. https://www.kaggle.com/datasets/mariettakiser/review-chekpoints--2025-02-03--12377-12378
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 3, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Marietta Kiser
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Afer files review-chekpoints--2025-02-03--12377-12378

  16. Google Apps Playstore Reviews

    • kaggle.com
    zip
    Updated Feb 4, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    radioactive (2021). Google Apps Playstore Reviews [Dataset]. https://www.kaggle.com/tiquasar/playstore-reviews-google-apps
    Explore at:
    zip(18415789 bytes)Available download formats
    Dataset updated
    Feb 4, 2021
    Authors
    radioactive
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    This Dataset is a collection of Reviews of Google Apps available on playstore. Contains more than 90,000 cumulative App reviews on various Google Apps.

    Please Upvote the Dataset if you find it useful!

    Content

    This Dataset contains: 1.) The basic description of apps(for e.g. App Title,App Description,Number of Installs,etc.) 2.) ReviewID 3.) Score and Review by the User and thumbsUp count on the reviews. 4.) Review creation and reply by developer date and time. 5.) The App's Review by the Users

    Inspiration

    Not many datasets are available on app reviews on Kaggle

  17. Amazon Data Science Book Reviews

    • kaggle.com
    zip
    Updated Aug 26, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vladimir (2020). Amazon Data Science Book Reviews [Dataset]. https://www.kaggle.com/vvorotnikov/amazon-data-science-book-reviews
    Explore at:
    zip(4025971 bytes)Available download formats
    Dataset updated
    Aug 26, 2020
    Authors
    Vladimir
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Content

    This dataset contains 20647 amazon reviews for 836 data-science related books. Every review consists of review text and score (number of stars from 1 to 5).

    Acknowledgements

    Thanks to all the people who write reviews.

  18. Amazon-Reviews-2023

    • kaggle.com
    Updated Feb 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Afaq0456 (2025). Amazon-Reviews-2023 [Dataset]. https://www.kaggle.com/datasets/afaq0456/amazon-reviews-2023
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 24, 2025
    Dataset provided by
    Kaggle
    Authors
    Afaq0456
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset, raw_review_Toys_and_Games, contains 100,000 Amazon product reviews from the Toys & Games category, sampled from 2023. It includes ratings, review text, product identifiers, user details, timestamps, helpful votes, and purchase verification status.

  19. πŸ“± Google Play App Reviews Dataset πŸ“Š

    • kaggle.com
    Updated Jan 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hassaan Mustafavi (2025). πŸ“± Google Play App Reviews Dataset πŸ“Š [Dataset]. https://www.kaggle.com/datasets/hassaanmustafavi/google-play-app-reviews-dataset/versions/1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 26, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Hassaan Mustafavi
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Don't forget to hit the upvoteπŸ™πŸ™

    πŸ”– Overview

    The Google Play App Reviews dataset contains valuable feedback from users who have reviewed apps on the Google Play Store. This dataset includes both user ratings and detailed comments, making it ideal for sentiment analysis, user experience evaluation, and app performance research.

    πŸ“š Columns Description

    Column NameDescription
    review_idUnique identifier for each review. πŸ†”
    user_nameName of the user who submitted the review. πŸ‘€
    review_titleTitle of the review (may be empty in some cases). πŸ“
    review_descriptionThe content or feedback given by the user about the app. πŸ’¬
    ratingRating given by the user, ranging from 1 (low) to 5 (high). ⭐
    thumbs_upNumber of thumbs up the review received. πŸ‘
    review_dateDate and time the review was submitted. πŸ“…
    developer_responseResponse from the app developer (if provided). πŸ’¬πŸ‘¨β€πŸ’»
    developer_response_dateDate when the developer responded to the review. πŸ“…πŸ’»
    appVersionThe version of the app when the review was submitted. πŸ“±πŸ”’
    language_codeThe language in which the review was written (e.g., 'en' for English). πŸ—£οΈ
    country_codeThe country of the user based on their review (e.g., 'us' for United States). 🌍

    πŸ“Š Key Features

    • βœ… Rich Feedback: Includes both ratings and textual feedback from users.
    • 🌍 Global Reach: Reviews are collected from users worldwide, providing diverse insights.
    • πŸ”’ Anonymized Data: No personally identifiable information is included.
    • βš™οΈ Ready for Analysis: Cleaned and pre-processed for immediate use in sentiment analysis and app performance evaluation.

    🎯 Potential Use Cases

    • Sentiment Analysis: Analyze user sentiment based on reviews and ratings.
    • Customer Feedback: Measure user satisfaction and discover areas for improvement.
    • App Version Comparison: Evaluate how different versions of the app perform based on user feedback.
    • Geographic Insights: Analyze regional differences in app usage and reviews.
    • Developer Interaction: Assess the effectiveness of developer responses to user reviews.

    πŸš€ Get Started!

    Ready to dive into the world of app feedback and sentiment analysis? Explore the dataset, build models to understand user sentiments, and enhance app experiences based on real feedback.

    Happy coding! ✨

  20. Product Reviews Data

    • kaggle.com
    Updated Oct 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vishnupriya (2023). Product Reviews Data [Dataset]. https://www.kaggle.com/datasets/vishnupriyagarige/product-reviews-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 19, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Vishnupriya
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Vishnupriya

    Released under Apache 2.0

    Contents

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Dongre Laxman (2024). Amazon Reviews Dataset [Dataset]. https://www.kaggle.com/datasets/dongrelaxman/amazon-reviews-dataset
Organization logo

Amazon Reviews Dataset

A Comprehensive Review Dataset for E-Commerce Analysis

Explore at:
2 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 20, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Dongre Laxman
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

This dataset comprises customer reviews for Amazon, an online retail giant, featuring insights into customer experiences, including ratings, review titles, texts, and metadata. It is valuable for analyzing customer satisfaction, sentiment, and trends.

Column Descriptions:

Reviewer Name: Identifies the reviewer. Profile Link: Links to the reviewer's profile for additional insights. Country: Indicates the reviewer's location. Review Count: Number of reviews by the same user, showing engagement level. Review Date: When the review was posted, useful for time analysis. Rating: Numerical satisfaction measure. Review Title: Summarizes the review sentiment. Review Text: Detailed customer feedback. Date of Experience: When the service/product was experienced.

Prospective applications:

Sentiment Analysis: Analyze review texts and titles to assess overall customer sentiment toward products, enabling the identification of strengths and weaknesses. Customer Satisfaction Tracking: Track and visualize rating trends over time to understand fluctuations in customer satisfaction. Product Improvement: Identify common themes in reviews to highlight areas for product enhancement or development. Market Segmentation: Use country and demographic information to customize marketing strategies and gain insights into regional preferences. Competitor Analysis: Evaluate customer feedback on Amazon products in comparison to competitors to determine market positioning. Recommendation Systems: Leverage review data to enhance recommendation algorithms, improving personalized shopping experiences. Trend Analysis: Investigate temporal patterns in reviews to link sentiment changes with marketing efforts or product launches.

This extensive dataset serves as a valuable asset for various analyses focused on enhancing customer engagement and refining business strategies.

Search
Clear search
Close search
Google apps
Main menu