64 datasets found
  1. b

    Amazon reviews Dataset

    • brightdata.com
    .json, .csv, .xlsx
    Updated Mar 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2023). Amazon reviews Dataset [Dataset]. https://brightdata.com/products/datasets/amazon/reviews
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    Mar 21, 2023
    Dataset authored and provided by
    Bright Data
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    Utilize our Amazon reviews dataset for diverse applications to enrich business strategies and market insights. Analyzing this dataset can aid in understanding customer behavior, product performance, and market trends, empowering organizations to refine their product and marketing strategies. Access the entire dataset or tailor a subset to fit your requirements. Popular use cases include: Product Performance Analysis: Analyze Amazon reviews to assess product performance, uncovering customer satisfaction levels, common issues, and highly praised features to inform product improvements and marketing messages. Customer Behavior Insights: Gain insights into customer behavior, purchasing patterns, and preferences, enabling more personalized marketing and product recommendations. Demand Forecasting: Leverage Amazon reviews to predict future product demand by analyzing historical review data and identifying trends, helping to optimize inventory management and sales strategies. Accessing and analyzing the Amazon reviews dataset supports market strategy optimization by leveraging insights to analyze key market trends and customer preferences, enhancing overall business decision-making.

  2. Amazon Product Review Sentiment Analysis Project

    • kaggle.com
    Updated Jul 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sanjana Murthy (2024). Amazon Product Review Sentiment Analysis Project [Dataset]. https://www.kaggle.com/datasets/sanjanamurthy392/amazon-product-review-sentiment-analysis-project
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 9, 2024
    Dataset provided by
    Kaggle
    Authors
    Sanjana Murthy
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    About Datasets: - Domain : Marketing - Project: Amazon Product Review Sentiment Analysis - Datasets: Reviews.csv - Dataset Type: Excel Data - Dataset Size: 56L+ records

    KPI's: 1. Distribution of Amazon Product Ratings 2. How most people rated the products they bought from Amazon 3. Total of all sentiment scores

    Process: 1. Understanding the problem 2. Data Collection 3. Data Cleaning 4. Exploring and analyzing the data 5. Interpreting the results

    This data contains pandas, seaborn, matplotlib, nltk.sentiment.vader, SentimentIntensityAnalyzer, value_counts(), custom_colors, figsize, pie, sentiment_score

  3. A

    ‘Amazon Product Reviews Dataset’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Feb 13, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com), ‘Amazon Product Reviews Dataset’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-amazon-product-reviews-dataset-7933/latest
    Explore at:
    Dataset updated
    Feb 13, 2022
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘Amazon Product Reviews Dataset’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/yamqwe/amazon-product-reviews-datasete on 13 February 2022.

    --- Dataset description provided by original source is as follows ---

    About this dataset

    This dataset contains 30K records of product reviews from amazon.com.

    This dataset was created by PromptCloud and DataStock

    Content

    This dataset contains the following:

    • Total Records Count: 43729

    • Domain Name: amazon.com

    • Date Range: 01st Jan 2020 - 31st Mar 2020

    • File Extension: CSV

    • Available Fields:
      -- Uniq Id,
      -- Crawl Timestamp,
      -- Billing Uniq Id,
      -- Rating,
      -- Review Title,
      -- Review Rating,
      -- Review Date,
      -- User Id,
      -- Brand,
      -- Category,
      -- Sub Category,
      -- Product Description,
      -- Asin,
      -- Url,
      -- Review Content,
      -- Verified Purchase,
      -- Helpful Review Count,
      -- Manufacturer Response

    Acknowledgements

    We wouldn't be here without the help of our in house teams at PromptCloud and DataStock. Who has put their heart and soul into this project like all other projects? We want to provide the best quality data and we will continue to do so.

    Inspiration

    The inspiration for these datasets came from research. Reviews are something that is important wit everybody across the globe. So we decided to come up with this dataset that shows us exactly how the user reviews help companies to better their products.

    This dataset was created by PromptCloud and contains around 0 samples along with Billing Uniq Id, Verified Purchase, technical information and other features such as: - Crawl Timestamp - Manufacturer Response - and more.

    How to use this dataset

    • Analyze Helpful Review Count in relation to Sub Category
    • Study the influence of Review Date on Product Description
    • More datasets

    Acknowledgements

    If you use this dataset in your research, please credit PromptCloud

    Start A New Notebook!

    --- Original source retains full ownership of the source dataset ---

  4. Amazon Reviews Dataset

    • kaggle.com
    Updated Sep 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dongre Laxman (2024). Amazon Reviews Dataset [Dataset]. https://www.kaggle.com/datasets/dongrelaxman/amazon-reviews-dataset/versions/1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 20, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Dongre Laxman
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This dataset comprises customer reviews for Amazon, an online retail giant, featuring insights into customer experiences, including ratings, review titles, texts, and metadata. It is valuable for analyzing customer satisfaction, sentiment, and trends.

    Column Descriptions:

    Reviewer Name: Identifies the reviewer. Profile Link: Links to the reviewer's profile for additional insights. Country: Indicates the reviewer's location. Review Count: Number of reviews by the same user, showing engagement level. Review Date: When the review was posted, useful for time analysis. Rating: Numerical satisfaction measure. Review Title: Summarizes the review sentiment. Review Text: Detailed customer feedback. Date of Experience: When the service/product was experienced.

    Prospective applications:

    Sentiment Analysis: Analyze review texts and titles to assess overall customer sentiment toward products, enabling the identification of strengths and weaknesses. Customer Satisfaction Tracking: Track and visualize rating trends over time to understand fluctuations in customer satisfaction. Product Improvement: Identify common themes in reviews to highlight areas for product enhancement or development. Market Segmentation: Use country and demographic information to customize marketing strategies and gain insights into regional preferences. Competitor Analysis: Evaluate customer feedback on Amazon products in comparison to competitors to determine market positioning. Recommendation Systems: Leverage review data to enhance recommendation algorithms, improving personalized shopping experiences. Trend Analysis: Investigate temporal patterns in reviews to link sentiment changes with marketing efforts or product launches.

    This extensive dataset serves as a valuable asset for various analyses focused on enhancing customer engagement and refining business strategies.

  5. Amazon Customer Review Data

    • zenodo.org
    pdf
    Updated Jul 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Akash Shashikant Vaykar; Abhishek Kaushik; Abhishek Kaushik; Akash Shashikant Vaykar (2024). Amazon Customer Review Data [Dataset]. http://doi.org/10.5281/zenodo.3549704
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jul 22, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Akash Shashikant Vaykar; Abhishek Kaushik; Abhishek Kaushik; Akash Shashikant Vaykar
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset: Amazon Customer Review Data for sentiment analysis

    Size: 60889 appox.

    Format: .CSV

    Period: 2013 to 2019

    Categories: 5…… (Mobiles, Smart TV, Books, Mobile Accessories, Refrigerator)

    Unique_ID: Customized (Primary Key)

    Review_Header: user’s comment in few words

    Review_Text: User’s comment in details (3-4 lines)

    Rating: (1- Very Low, 2 🡪 Low, 3🡪 Avg, 4 🡪 Good, 5 - Excellent)

    Posting Period: 2013 to 2019

    Own_Rating: for 1-2 🡪 Negative, 3🡪 Neutral, 4-5 🡪 Positive

  6. Datasets for Sentiment Analysis

    • zenodo.org
    csv
    Updated Dec 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Julie R. Repository creator - Campos Arias; Julie R. Repository creator - Campos Arias (2023). Datasets for Sentiment Analysis [Dataset]. http://doi.org/10.5281/zenodo.10157504
    Explore at:
    csvAvailable download formats
    Dataset updated
    Dec 10, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Julie R. Repository creator - Campos Arias; Julie R. Repository creator - Campos Arias
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository was created for my Master's thesis in Computational Intelligence and Internet of Things at the University of Córdoba, Spain. The purpose of this repository is to store the datasets found that were used in some of the studies that served as research material for this Master's thesis. Also, the datasets used in the experimental part of this work are included.

    Below are the datasets specified, along with the details of their references, authors, and download sources.

    ----------- STS-Gold Dataset ----------------

    The dataset consists of 2026 tweets. The file consists of 3 columns: id, polarity, and tweet. The three columns denote the unique id, polarity index of the text and the tweet text respectively.

    Reference: Saif, H., Fernandez, M., He, Y., & Alani, H. (2013). Evaluation datasets for Twitter sentiment analysis: a survey and a new dataset, the STS-Gold.

    File name: sts_gold_tweet.csv

    ----------- Amazon Sales Dataset ----------------

    This dataset is having the data of 1K+ Amazon Product's Ratings and Reviews as per their details listed on the official website of Amazon. The data was scraped in the month of January 2023 from the Official Website of Amazon.

    Owner: Karkavelraja J., Postgraduate student at Puducherry Technological University (Puducherry, Puducherry, India)

    Features:

    • product_id - Product ID
    • product_name - Name of the Product
    • category - Category of the Product
    • discounted_price - Discounted Price of the Product
    • actual_price - Actual Price of the Product
    • discount_percentage - Percentage of Discount for the Product
    • rating - Rating of the Product
    • rating_count - Number of people who voted for the Amazon rating
    • about_product - Description about the Product
    • user_id - ID of the user who wrote review for the Product
    • user_name - Name of the user who wrote review for the Product
    • review_id - ID of the user review
    • review_title - Short review
    • review_content - Long review
    • img_link - Image Link of the Product
    • product_link - Official Website Link of the Product

    License: CC BY-NC-SA 4.0

    File name: amazon.csv

    ----------- Rotten Tomatoes Reviews Dataset ----------------

    This rating inference dataset is a sentiment classification dataset, containing 5,331 positive and 5,331 negative processed sentences from Rotten Tomatoes movie reviews. On average, these reviews consist of 21 words. The first 5331 rows contains only negative samples and the last 5331 rows contain only positive samples, thus the data should be shuffled before usage.

    This data is collected from https://www.cs.cornell.edu/people/pabo/movie-review-data/ as a txt file and converted into a csv file. The file consists of 2 columns: reviews and labels (1 for fresh (good) and 0 for rotten (bad)).

    Reference: Bo Pang and Lillian Lee. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05), pages 115–124, Ann Arbor, Michigan, June 2005. Association for Computational Linguistics

    File name: data_rt.csv

    ----------- Preprocessed Dataset Sentiment Analysis ----------------

    Preprocessed amazon product review data of Gen3EcoDot (Alexa) scrapped entirely from amazon.in
    Stemmed and lemmatized using nltk.
    Sentiment labels are generated using TextBlob polarity scores.

    The file consists of 4 columns: index, review (stemmed and lemmatized review using nltk), polarity (score) and division (categorical label generated using polarity score).

    DOI: 10.34740/kaggle/dsv/3877817

    Citation: @misc{pradeesh arumadi_2022, title={Preprocessed Dataset Sentiment Analysis}, url={https://www.kaggle.com/dsv/3877817}, DOI={10.34740/KAGGLE/DSV/3877817}, publisher={Kaggle}, author={Pradeesh Arumadi}, year={2022} }

    This dataset was used in the experimental phase of my research.

    File name: EcoPreprocessed.csv

    ----------- Amazon Earphones Reviews ----------------

    This dataset consists of a 9930 Amazon reviews, star ratings, for 10 latest (as of mid-2019) bluetooth earphone devices for learning how to train Machine for sentiment analysis.

    This dataset was employed in the experimental phase of my research. To align it with the objectives of my study, certain reviews were excluded from the original dataset, and an additional column was incorporated into this dataset.

    The file consists of 5 columns: ReviewTitle, ReviewBody, ReviewStar, Product and division (manually added - categorical label generated using ReviewStar score)

    License: U.S. Government Works

    Source: www.amazon.in

    File name (original): AllProductReviews.csv (contains 14337 reviews)

    File name (edited - used for my research) : AllProductReviews2.csv (contains 9930 reviews)

    ----------- Amazon Musical Instruments Reviews ----------------

    This dataset contains 7137 comments/reviews of different musical instruments coming from Amazon.

    This dataset was employed in the experimental phase of my research. To align it with the objectives of my study, certain reviews were excluded from the original dataset, and an additional column was incorporated into this dataset.

    The file consists of 10 columns: reviewerID, asin (ID of the product), reviewerName, helpful (helpfulness rating of the review), reviewText, overall (rating of the product), summary (summary of the review), unixReviewTime (time of the review - unix time), reviewTime (time of the review (raw) and division (manually added - categorical label generated using overall score).

    Source: http://jmcauley.ucsd.edu/data/amazon/

    File name (original): Musical_instruments_reviews.csv (contains 10261 reviews)

    File name (edited - used for my research) : Musical_instruments_reviews2.csv (contains 7137 reviews)

  7. h

    Amazon-Reviews-2023

    • huggingface.co
    Updated Sep 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    McAuley-Lab (2023). Amazon-Reviews-2023 [Dataset]. https://huggingface.co/datasets/McAuley-Lab/Amazon-Reviews-2023
    Explore at:
    Dataset updated
    Sep 15, 2023
    Dataset authored and provided by
    McAuley-Lab
    Description

    Amazon Review 2023 is an updated version of the Amazon Review 2018 dataset. This dataset mainly includes reviews (ratings, text) and item metadata (desc- riptions, category information, price, brand, and images). Compared to the pre- vious versions, the 2023 version features larger size, newer reviews (up to Sep 2023), richer and cleaner meta data, and finer-grained timestamps (from day to milli-second).

  8. amazon-reviews-sentiment-analysis

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    fastai X Hugging Face Group 2022, amazon-reviews-sentiment-analysis [Dataset]. https://huggingface.co/datasets/hugginglearners/amazon-reviews-sentiment-analysis
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    fastai X Hugging Face Group 2022
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Dataset Card for amazon reviews for sentiment analysis

      Dataset Summary
    

    One of the most important problems in e-commerce is the correct calculation of the points given to after-sales products. The solution to this problem is to provide greater customer satisfaction for the e-commerce site, product prominence for sellers, and a seamless shopping experience for buyers. Another problem is the correct ordering of the comments given to the products. The prominence of misleading… See the full description on the dataset page: https://huggingface.co/datasets/hugginglearners/amazon-reviews-sentiment-analysis.

  9. d

    Amazon Product Reviews and Ratings | Amazon Data Extraction Services

    • datarade.ai
    .json, .csv, .sql
    Updated Aug 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Marketing (2023). Amazon Product Reviews and Ratings | Amazon Data Extraction Services [Dataset]. https://datarade.ai/data-products/amazon-product-reviews-and-ratings-amazon-data-extraction-s-growth-marketing
    Explore at:
    .json, .csv, .sqlAvailable download formats
    Dataset updated
    Aug 21, 2023
    Dataset authored and provided by
    Growth Marketing
    Area covered
    Congo (Democratic Republic of the), Tanzania, Guernsey, Croatia, Poland, Brazil, Jamaica, Rwanda, El Salvador, Niger
    Description

    Get instant access to Amazon customer reviews. Fully customizable datasets based on your requirements.

    Gather product reviews, including fields like product name, product URL, reviewer name, review rating, review text and description, and data points and fields that look interesting for market insights analysis.

    Pricing (no order minimums): • <5000 reviews: $0.05 per row • 5001-50000 leads: $0.04 per row • 50000+ rows: $0.03 per row

    Fields: • country • countryCode • date • isVerified • position • productAsin • ratingScore • reviewCategoryUrl • reviewDescription • reviewImages/0 • reviewImages/1 • reviewImages/2 • reviewImages/3 • reviewImages/4 • reviewImages/5 • reviewImages/6 • reviewReaction • reviewTitle • reviewUrl • reviewedIn • totalCategoryRatings • totalCategoryReviews • variant

  10. T

    amazon_us_reviews

    • tensorflow.org
    • huggingface.co
    Updated Dec 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). amazon_us_reviews [Dataset]. https://www.tensorflow.org/datasets/catalog/amazon_us_reviews
    Explore at:
    Dataset updated
    Dec 6, 2022
    Description

    Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

    Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

    Each Dataset contains the following columns : marketplace - 2 letter country code of the marketplace where the review was written. customer_id - Random identifier that can be used to aggregate reviews written by a single author. review_id - The unique ID of the review. product_id - The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id. product_parent - Random identifier that can be used to aggregate reviews for the same product. product_title - Title of the product. product_category - Broad product category that can be used to group reviews (also used to group the dataset into coherent parts). star_rating - The 1-5 star rating of the review. helpful_votes - Number of helpful votes. total_votes - Number of total votes the review received. vine - Review was written as part of the Vine program. verified_purchase - The review is on a verified purchase. review_headline - The title of the review. review_body - The review text. review_date - The date the review was written.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('amazon_us_reviews', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

  11. o

    Amazon Products

    • opendatabay.com
    .undefined
    Updated Jun 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2025). Amazon Products [Dataset]. https://www.opendatabay.com/data/premium/2f7668e7-009e-4c7d-9822-78955a22a20a
    Explore at:
    .undefinedAvailable download formats
    Dataset updated
    Jun 19, 2025
    Dataset authored and provided by
    Bright Data
    Area covered
    Retail & Consumer Behavior
    Description

    Amazon Products dataset to explore detailed product listings, pricing, reviews, and sales data. Popular use cases include competitive analysis, market trend forecasting, and e-commerce strategy optimization.

    Use our Amazon Products dataset to explore detailed information on products across various categories, including pricing, reviews, ratings, and sales data. This dataset is ideal for e-commerce professionals, market analysts, and product managers looking to analyze market trends, optimize product listings, and refine competitive strategies.

    Leverage this dataset to track pricing trends, assess customer feedback, and uncover popular product categories. Whether you're conducting competitive analysis, performing market research, or optimizing product strategies, the Amazon Products dataset provides key insights to stay ahead in the e-commerce landscape.

    Dataset Features

    • Title: The name or title of the product.
    • seller_name: The name of the seller offering the product.
    • Brand: The brand associated with the product.
    • Description: A detailed description of the product, including key features.
    • initial_price: The original price of the product before any discounts.
    • final_price: The current price of the product after discounts.
    • Currency: The currency in which the product is priced (e.g., GBP, USD).
    • Availability: The stock status (e.g., in stock, out of stock).
    • reviews_count: The total number of customer reviews.
    • Categories: The specific category the product belongs to.
    • asin: Amazon Standard Identification Number.
    • buybox_seller: The seller currently winning the Amazon Buy Box.
    • number_of_sellers: The number of sellers offering this product.
    • root_bs_rank: The overall ranking of the product in the Amazon best-sellers list.
    • answered_questions: The number of questions answered in the product Q&A section.
    • domain: The website domain where the product is being sold.
    • images_count: The number of images available for the product.
    • URL: The link to the product page on Amazon.
    • video_count: The number of videos available for the product.
    • image_url: The URL of the primary image associated with the product.
    • item_weight: The weight of the product.
    • Rating: The average rating of the product based on customer reviews.
    • product_dimensions: The dimensions of the product (e.g., length, width, height) and weight.
    • seller_id: The unique identifier for the seller.
    • date_first_available: The date when the product was first made available on Amazon.
    • discount: Any discount applied to the product.
    • model_number: The model number of the product.
    • manufacturer: The company that manufactures the product.
    • department: The department under which the product is categorized (e.g., Health & Household).
    • plus_content: A flag indicating if the product has Amazon’s “Plus Content” (additional marketing content).
    • upc: The Universal Product Code (UPC) associated with the product.
    • video: URL(s) of any video content associated with the product.
    • top_review: A summary or excerpt from the top customer review.
    • variations: Different product variations (e.g., different sizes or flavors).
    • delivery: Information on the delivery options (e.g., free delivery or Prime delivery).
    • features: Key features or highlights of the product.
    • format: The format of the product (e.g., powder, liquid).
    • buybox_prices: Pricing details for the product, including the base and tiered prices.
    • parent_asin: The ASIN of the parent product (if the product is part of a larger group of similar products).
    • input_asin: The ASIN of the product as input for Amazon searches.
    • ingredients: List of ingredients in the product (if applicable).
    • origin_url: The source URL for product-related information or ingredients.
    • bought_past_month: A flag indicating if the product was bought in the past month.
    • is_available: Availability status of the product (True/False).
    • root_bs_category: The broad product category (e.g., Health & Household).
    • bs_category: The specific subcategory the product belongs to.
    • bs_rank: The rank of the product in its specific subcategory.
    • badge: Any badge or label the product has earned (e.g., Amazon's Choice).
    • subcategory_rank: The rank of the product within its subcategory.
    • amazon_choice: A flag indicating if the product has been selected as Amazon’s Choice.
    • images: A list of URLs for additional product images.
    • product_details: Detailed product specifications and features.
    • prices_breakdown: A breakdown of the price, including any discounts or promotions.
    • country_of_origin: The country where the product is made.
    • from_the_brand: Information from the brand or manufact
  12. Amazon Kindle Book Review for Sentiment Analysis

    • kaggle.com
    Updated Sep 3, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Meet Nagadia (2021). Amazon Kindle Book Review for Sentiment Analysis [Dataset]. https://www.kaggle.com/datasets/meetnagadia/amazon-kindle-book-review-for-sentiment-analysis/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 3, 2021
    Dataset provided by
    Kaggle
    Authors
    Meet Nagadia
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Description

    Context

    This is a small subset of dataset of Book reviews from Amazon Kindle Store category.

    Content

    5-core dataset of product reviews from Amazon Kindle Store category from May 1996 - July 2014. Contains total of 982619 entries. Each reviewer has at least 5 reviews and each product has at least 5 reviews in this dataset. Columns - asin - ID of the product, like B000FA64PK -helpful - helpfulness rating of the review - example: 2/3. -overall - rating of the product. -reviewText - text of the review (heading). -reviewTime - time of the review (raw). -reviewerID - ID of the reviewer, like A3SPTOKDG7WBLN -reviewerName - name of the reviewer. -summary - summary of the review (description). -unixReviewTime - unix timestamp.

    Which file to use?

    There are two files one is preprocessed ready for sentiment analysis and other is unprocessed to you basically have to process the dataset and then perform sentiment analysis

    Acknowledgements

    This dataset is taken from Amazon product data, Julian McAuley, UCSD website. http://jmcauley.ucsd.edu/data/amazon/

    License to the data files belong to them.

    Inspiration

    -Sentiment analysis on reviews. -Understanding how people rate usefulness of a review/ What factors influence helpfulness of a review. -Fake reviews/ outliers. -Best rated product IDs, or similarity between products based on reviews alone (not the best idea ikr). -Any other interesting analysis

  13. h

    amazon-food-reviews-dataset

    • huggingface.co
    Updated Dec 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    misschestnut (2023). amazon-food-reviews-dataset [Dataset]. https://huggingface.co/datasets/jhan21/amazon-food-reviews-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 12, 2023
    Authors
    misschestnut
    License

    https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/

    Description

    Dataset Card for "Amazon Food Reviews"

      Dataset Summary
    

    This dataset consists of reviews of fine foods from amazon. The data span a period of more than 10 years, including all ~500,000 reviews up to October 2012. Reviews include product and user information, ratings, and a plain text review. It also includes reviews from all other Amazon categories.

      Supported Tasks and Leaderboards
    

    This dataset can be used for numerous tasks like sentiment analysis, text… See the full description on the dataset page: https://huggingface.co/datasets/jhan21/amazon-food-reviews-dataset.

  14. o

    Amazon Food Product Reviews & Ratings

    • opendatabay.com
    .csv
    Updated Jun 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vdt. Data (2025). Amazon Food Product Reviews & Ratings [Dataset]. https://www.opendatabay.com/data/consumer/fd13df3c-b1af-410c-8596-7e11961381ed
    Explore at:
    .csvAvailable download formats
    Dataset updated
    Jun 18, 2025
    Dataset authored and provided by
    Vdt. Data
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    E-commerce & Online Transactions
    Description

    The Amazon Food Products Dataset is a large-scale collection of product listings, reviews, and metadata sourced from Amazon. This dataset is valuable for understanding consumer behaviour, analyzing product trends, and training machine learning models for recommendation systems and sentiment analysis. It includes various categories, providing insights into customer preferences, product ratings, and review sentiments.

    Dataset Features

    Each record in the dataset contains the following key fields:

    • ProductId: Unique identifier for each product.
    • UserId: Unique identifier for the reviewer.
    • ProfileName: Display the name of the reviewer.
    • HelpfulnessNumerator: Number of users who found the review helpful.
    • HelpfulnessDenominator: Total number of users who rated the review’s helpfulness.
    • Score: Product rating (1 to 5 stars).
    • Time: Unix timestamp of the review.
    • Summary: Short summary of the review.
    • Text: Full text of the review.

    Distribution

    • Data Volume: 568454 rows and 9 columns.
    • Format: CSV.
    • Structure: Tabular format with numerical, categorical, and text data.

    Usage

    This dataset is ideal for a variety of applications:

    • Sentiment Analysis: Training NLP models to predict sentiment based on reviews.
    • Product Recommendation Systems: Building collaborative filtering models.
    • Trend Analysis: Identifying popular products and customer preferences.
    • Fake Review Detection: Detecting anomalous patterns in review behaviours.

    Coverage

    • Geographic Coverage: Global.
    • Time Range: Multi-year dataset (over 10 years of reviews).
    • Demographics: General Amazon shoppers; includes various age groups and customer segments.

    License

    CC0

    Who Can Use It

    • Data Scientists: For building machine learning models.
    • Researchers: For academic analysis of customer behaviour.
    • Businesses: For market insights and customer sentiment analysis.
  15. d

    Amazon Data & Amazon Reviews Data | eBay Data | Alibaba & AliExpress Data |...

    • datarade.ai
    .json, .csv, .xls
    Updated Sep 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Altosight (2024). Amazon Data & Amazon Reviews Data | eBay Data | Alibaba & AliExpress Data | Global Product Data | Unlimited Free Data Points | GDPR Compliant [Dataset]. https://datarade.ai/data-products/amazon-data-amazon-reviews-data-ebay-data-alibaba-ali-altosight
    Explore at:
    .json, .csv, .xlsAvailable download formats
    Dataset updated
    Sep 7, 2024
    Dataset authored and provided by
    Altosight
    Area covered
    Curaçao, Singapore, Antigua and Barbuda, Liberia, Ascension and Tristan da Cunha, South Georgia and the South Sandwich Islands, Christmas Island, Tonga, Malawi, Pakistan
    Description

    Altosight | AI-Powered Amazon Data, eBay Data & More | Global Marketplace Insights

    ✦ Altosight offers robust, AI-powered Amazon Data services that provide deep insights into product listings, reviews, prices, and sales trends.

    ✦ Amazon Reviews Data, eBay Data, Alibaba Data, and AliExpress Data are also covered, giving businesses the tools they need to make data-driven decisions across the world’s largest marketplaces.

    Our Amazon Data encompasses a broad range of publicly available information from Amazon’s marketplace, which can be used to improve customer experience, personalize recommendations, optimize operations, and drive business success.

    With unlimited free data points, fast delivery, and no setup costs, Altosight provides unparalleled flexibility and efficiency.

    ➤ We offer multiple data delivery options including API, CSV, JSON, and FTP, ensuring seamless integration into your business processes at no additional charge.

    ― Key Use Cases ―

    ➤ Marketplace Expansion & Product Assortment Optimization

    🔹 Identify gaps in your product offerings by comparing competitor inventories with Alibaba Data, Amazon Data, and eBay Data.

    🔹 Expand your product catalog by analyzing trends in best-sellers, emerging products, and market demand.

    🔹 Use Digital Shelf Data to track product placements, best-seller rankings, and availability across major marketplaces to optimize your digital shelf space.

    ➤ Customer Sentiment & Product Review Analysis

    🔹 Leverage Amazon Reviews Data to understand customer feedback, identify common complaints, and highlight product strengths.

    🔹 Analyze AliExpress Data to track seller ratings and customer reviews, providing insights into consumer sentiment across different marketplaces.

    🔹 Use these insights to refine product offerings, improve customer satisfaction, and enhance your brand’s reputation.

    ➤ Competitive Price Monitoring & Dynamic Repricing

    🔹 Track product prices across Amazon, eBay, Alibaba, and AliExpress to ensure you remain competitive in the marketplace.

    🔹 Use Amazon Data and eBay Data for real-time insights into competitor pricing and discounts.

    🔹 Implement dynamic repricing strategies to react to price changes in real-time, ensuring your products always stay competitively priced.

    ➤ Product Sourcing & Wholesaler Opportunities

    🔹 Use Alibaba Data and AliExpress Data to uncover new product opportunities and identify potential wholesalers.

    🔹 Discover trending products to source for your business and form partnerships with reliable suppliers, streamlining your supply chain and business growth.

    ➤ Market Trend Identification & Forecasting

    🔹 Use Alibaba Data and AliExpress Data to identify emerging trends in consumer behavior, product categories, and price fluctuations.

    🔹 Conduct comprehensive market research to forecast product demand and industry trends based on historical data from Amazon and other marketplaces.

    🔹 Stay ahead of market changes by leveraging real-time data for strategic decision-making, product launches, and marketing initiatives.

    ➤ Retailer & Brand Performance Tracking

    🔹 Track the performance of specific retailers or brands across Amazon, eBay, Alibaba, and AliExpress using detailed sales and review data.

    🔹 Monitor how frequently products move up or down in rankings, providing valuable insights for brand positioning and marketing effectiveness.

    🔹 Analyze which retailers sell particular brands and products, helping businesses identify new partnerships or distribution opportunities.

    ― Data Collection & Quality ―

    ✔ Publicly Sourced Data: Altosight collects Amazon Data, Amazon Reviews Data, eBay Data, Alibaba Data, and AliExpress Data from publicly available sources. This includes product information, transaction data, reviews, and other valuable data points that are essential for making informed business decisions.

    ✔ AI-Powered Scraping: Our AI-driven technology handles CAPTCHAs, dynamic content, and JavaScript-heavy websites to ensure continuous and accurate data collection. We extract and structure Amazon Reviews Data, Digital Shelf Data, and other relevant marketplace data for easy integration into your existing systems.

    ✔ High-Quality Data: Altosight ensures all data is cleaned, structured, and ready for use, with high accuracy and reliability. Our solutions are ideal for market research, competitor analysis, and operational optimization.

    ― Why Choose Altosight? ―

    ✔ Unlimited Data Points: Altosight offers unlimited free data points, allowing you to extract as many product attributes or sales data as needed without additional charges. This ensures cost-effectiveness while maintaining access to all the insights you require.

    ✔ Proprietary Anti-Blocking Technology: Our proprietary scraping technology ensures continuous access to Amazon Data, eBay Data, Alibaba Data, and AliExpress Data by bypassing CAPTCHAs, Cloudflare, and other blocking mechanisms.

    ✔ Custom & R...

  16. o

    Preprocessed Dataset Sentiment Analysis

    • opendatabay.com
    • kaggle.com
    .undefined
    Updated Jun 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Datasimple (2025). Preprocessed Dataset Sentiment Analysis [Dataset]. https://www.opendatabay.com/data/ai-ml/d9407519-5dc6-4a8d-a031-917714147912
    Explore at:
    .undefinedAvailable download formats
    Dataset updated
    Jun 16, 2025
    Dataset authored and provided by
    Datasimple
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Entertainment & Media Consumption
    Description

    Preprocessed amazon product review data of Gen3EcoDot scrapped entirely from amazon.in Stemmed and Lemmatized using nltk sentiment labels are generated using TextBlob polarity scores

    License

    CC0

    Original Data Source: Preprocessed Dataset Sentiment Analysis

  17. Amazon Fine Food Reviews

    • kaggle.com
    zip
    Updated May 1, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford Network Analysis Project (2017). Amazon Fine Food Reviews [Dataset]. https://www.kaggle.com/snap/amazon-fine-food-reviews
    Explore at:
    zip(253873708 bytes)Available download formats
    Dataset updated
    May 1, 2017
    Dataset authored and provided by
    Stanford Network Analysis Project
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    This dataset consists of reviews of fine foods from amazon. The data span a period of more than 10 years, including all ~500,000 reviews up to October 2012. Reviews include product and user information, ratings, and a plain text review. It also includes reviews from all other Amazon categories.

    Contents

    • Reviews.csv: Pulled from the corresponding SQLite table named Reviews in database.sqlite
    • database.sqlite: Contains the table 'Reviews'

    Data includes:
    - Reviews from Oct 1999 - Oct 2012
    - 568,454 reviews
    - 256,059 users
    - 74,258 products
    - 260 users with > 50 reviews

    wordcloud

    Acknowledgements

    See this SQLite query for a quick sample of the dataset.

    If you publish articles based on this dataset, please cite the following paper:

  18. Amazon Data, Product Data, Product Reviews, Best Sellers & Deals + More |...

    • datarade.ai
    .json, .csv
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenWeb Ninja, Amazon Data, Product Data, Product Reviews, Best Sellers & Deals + More | E-Commerce Data | Real-Time API [Dataset]. https://datarade.ai/data-products/openweb-ninja-amazon-data-product-data-product-reviews-d-openweb-ninja
    Explore at:
    .json, .csvAvailable download formats
    Dataset authored and provided by
    OpenWeb Ninja
    Area covered
    Japan, South Africa, China, Turkey, Netherlands, United States of America, Sweden, Canada, Germany, France
    Description

    OpenWeb Ninja's Amazon Data API provides fast, reliable, and real-time access to Amazon Data and on all 22 Amazon domains / countries.

    The OpenWeb Ninja's Amazon Data API covers over 300 million Product Listings Data (products, books, media, wine, and services) and provides over 40 data points per product.

    OpenWeb Ninja's Amazon Data common use cases: - Price Optimization & Price Comparison - Market Research - Competitive Analysis - Product Research & Trend Analysis - Customer Reviews Analysis

    OpenWeb Ninja's Amazon Data Stats & Capabilities: - 40+ data points per job listing - 300M+ Product Listings - 22 Amazon countries/domains supported - Real-time Amazon product data, including offers, and deals (Today's Deals) - Detailed product reviews and ratings data - Amazon Best Sellers across multiple Amazon categories - Conversion from Amazon ASIN to GTIN/EAN/ISBN

  19. o

    Amazon Review Dataset LLM

    • opendatabay.com
    .undefined
    Updated Jun 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Datasimple (2025). Amazon Review Dataset LLM [Dataset]. https://www.opendatabay.com/data/consumer/3769a0a1-dc8b-44e7-9bcf-1c8f2d3fdddc
    Explore at:
    .undefinedAvailable download formats
    Dataset updated
    Jun 14, 2025
    Dataset authored and provided by
    Datasimple
    Area covered
    Reviews & Ratings
    Description

    The Amazon Reviews Dataset is a comprehensive collection of customer reviews obtained from the popular e-commerce website, Amazon.com. This dataset encompasses reviews written in 5 different languages, making it a valuable resource for conducting multilingual sentiment analysis and opinion mining.

    💴 For Commercial Usage: To discuss your requirements, learn about the price and buy the dataset, leave a request on TrainingData to buy the dataset The dataset's multilingual nature makes it useful for natural language processing tasks, sentiment analysis algorithms, and other machine learning applications that require diverse language data for training and evaluation.

    The dataset can be highly valuable in training and fine-tuning machine learning models to automatically classify sentiments, predict customer satisfaction, or extract key information from customer reviews.

    Languages in the dataset: Italian German Spainish French English 💴 Buy the Dataset: This is just an example of the data. Leave a request on https://trainingdata.pro/datasets to discuss your requirements, learn about the price and buy the dataset Content For each item, we extracted:

    user_name: name of the reviewer stars: number of stars given to the review country: country of the author date: date of the review title: title of the review text: text of the review helpful: number of people who think that the review is helpful Amazon Reviews might be collected in accordance with your requirements. TrainingData provides high-quality data annotation tailored to your needs keywords: reviews dataset, text dataset, product reviews, ratings, user review data, consumer review data, sentiment analysis, product recommendation, llm dataset, language modeling, large language models, text classification, text mining dataset, natural language texts, nlp, nlp open-source dataset, text data

    Original Data Source: Amazon Review Dataset LLM

  20. Sentiment Analysis using Amazon Reviews

    • kaggle.com
    Updated Jul 6, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amena Najeeb (2021). Sentiment Analysis using Amazon Reviews [Dataset]. https://www.kaggle.com/amenanajeeb/sentiment-analysis-using-amazon-reviews/metadata
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 6, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Amena Najeeb
    License

    Attribution-ShareAlike 3.0 (CC BY-SA 3.0)https://creativecommons.org/licenses/by-sa/3.0/
    License information was derived automatically

    Description

    Context

    The data contains the reviews extracted from a product on Amazon.

    Content

    The data set has a column containing only the reviews (text) and rating from 1 to 5 out of 5 stars [5 (excellent), 4(V.good), 3(Satisfied), 2(Not Satisfied), 1(Bad)].

    Acknowledgements

    https://www.amazon.in/Samsung-MicroSDXC-Memory-Adapter-MB-MC128GA/dp/B06Y63ZKLS/ref=sr_1_6?dchild=1&keywords=samsung+laptops&qid=1595828089&sr=8-6#customerReviews

    Inspiration

    The data can be used to train a model. The expected output is that when we give a statement as input, the model should predict the sentiment.

    For reference, I am attaching the link of the expected model which was deployed on Heroku by me. https://sentiment-analysis-an.herokuapp.com/

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Bright Data (2023). Amazon reviews Dataset [Dataset]. https://brightdata.com/products/datasets/amazon/reviews

Amazon reviews Dataset

Explore at:
.json, .csv, .xlsxAvailable download formats
Dataset updated
Mar 21, 2023
Dataset authored and provided by
Bright Data
License

https://brightdata.com/licensehttps://brightdata.com/license

Area covered
Worldwide
Description

Utilize our Amazon reviews dataset for diverse applications to enrich business strategies and market insights. Analyzing this dataset can aid in understanding customer behavior, product performance, and market trends, empowering organizations to refine their product and marketing strategies. Access the entire dataset or tailor a subset to fit your requirements. Popular use cases include: Product Performance Analysis: Analyze Amazon reviews to assess product performance, uncovering customer satisfaction levels, common issues, and highly praised features to inform product improvements and marketing messages. Customer Behavior Insights: Gain insights into customer behavior, purchasing patterns, and preferences, enabling more personalized marketing and product recommendations. Demand Forecasting: Leverage Amazon reviews to predict future product demand by analyzing historical review data and identifying trends, helping to optimize inventory management and sales strategies. Accessing and analyzing the Amazon reviews dataset supports market strategy optimization by leveraging insights to analyze key market trends and customer preferences, enhancing overall business decision-making.

Search
Clear search
Close search
Google apps
Main menu