100+ datasets found
  1. Amazon Dataset

    • brightdata.com
    .json, .csv, .xlsx
    Updated Jul 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2025). Amazon Dataset [Dataset]. https://brightdata.com/products/datasets/amazon
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    Jul 11, 2025
    Dataset authored and provided by
    Bright Datahttps://brightdata.com/
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    Buy Amazon datasets and get access to over 300 million records from any Amazon domain. Get insights on Amazon products, sellers, and reviews.

  2. AmazonQAC

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amazon, AmazonQAC [Dataset]. https://huggingface.co/datasets/amazon/AmazonQAC
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset provided by
    Amazon.comhttp://amazon.com/
    Authors
    Amazon
    License

    https://choosealicense.com/licenses/cdla-permissive-2.0/https://choosealicense.com/licenses/cdla-permissive-2.0/

    Description

    AmazonQAC: A Large-Scale, Naturalistic Query Autocomplete Dataset

    Train Dataset Size: 395 million samplesTest Dataset Size: 20k samplesSource: Amazon Search LogsFile Format: ParquetCompression: Snappy If you use this dataset, please cite our EMNLP 2024 paper: @inproceedings{everaert-etal-2024-amazonqac, title = "{A}mazon{QAC}: A Large-Scale, Naturalistic Query Autocomplete Dataset", author = "Everaert, Dante and Patki, Rohit and Zheng, Tianqi and Potts… See the full description on the dataset page: https://huggingface.co/datasets/amazon/AmazonQAC.

  3. h

    Amazon-Reviews-2023

    • huggingface.co
    Updated Sep 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    McAuley-Lab (2023). Amazon-Reviews-2023 [Dataset]. https://huggingface.co/datasets/McAuley-Lab/Amazon-Reviews-2023
    Explore at:
    Dataset updated
    Sep 15, 2023
    Dataset authored and provided by
    McAuley-Lab
    Description

    Amazon Review 2023 is an updated version of the Amazon Review 2018 dataset. This dataset mainly includes reviews (ratings, text) and item metadata (desc- riptions, category information, price, brand, and images). Compared to the pre- vious versions, the 2023 version features larger size, newer reviews (up to Sep 2023), richer and cleaner meta data, and finer-grained timestamps (from day to milli-second).

  4. AmazonSalesReport

    • kaggle.com
    Updated Aug 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arpit Mishra (2024). AmazonSalesReport [Dataset]. https://www.kaggle.com/datasets/arpit2712/amazonsalesreport
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 7, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Arpit Mishra
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Amazon Sales Report

    Overview:

    This dataset provides detailed sales data from Amazon, offering a comprehensive look at various product categories and their performance over time. It includes information on sales figures, order details, product categories, and customer demographics.

    Features:

    1. Order ID

    Description: A unique identifier for each order placed on Amazon. This field helps to track individual orders and link related records.

    2. Dates

    Description: The date when the order was placed. This field is crucial for analyzing sales trends over time and identifying seasonal patterns.

    3. Status

    Description: The current status of the order (e.g., Shipped, Delivered, Pending). This field provides insight into the order fulfillment process and helps monitor order processing efficiency.

    4. Fulfillment

    Description: Indicates the method used to fulfill the order (e.g., Fulfilled by Amazon, Fulfilled by Seller). This feature helps in analyzing the performance of different fulfillment methods and their impact on customer satisfaction.

    5. Sales Channel

    Description: The channel through which the sale was made (e.g., Amazon Website, Mobile App). This field is useful for evaluating the effectiveness of different sales channels and understanding customer preferences.

    6. Category

    Description: The product category to which the purchased item belongs (e.g., Electronics, Clothing, Home Goods). This feature aids in analyzing sales performance across various product categories.

    7. Ship Service Level

    Description: The shipping service level selected for the order (e.g., Standard Shipping, Two-Day Shipping). This field helps to assess the impact of shipping options on delivery times and customer satisfaction.

    8. Size

    Description: The size of the product ordered (e.g., Small, Medium, Large). This feature is relevant for analyzing sales performance based on product size and understanding inventory requirements.

    9. Carrier Status

    Description: The status of the shipment with the carrier (e.g., In Transit, Delivered, Returned). This field provides insights into the shipping process and helps in monitoring delivery performance and handling returns.

    Use Cases:

    Sales Analysis:

    Examine trends in sales over time, identify peak periods, and analyze performance by product category.

    Customer Insights:

    Explore customer demographics to understand purchasing behavior and preferences.

    Inventory Management:

    Assess which products are performing well and which are not, aiding in inventory and supply chain management.

    Marketing Strategies:

    Develop targeted marketing campaigns based on sales trends and customer profiles.

    Data Source:

    This dataset is a simulated collection of Amazon sales data and is intended for educational and analytical purposes.

    Acknowledgments:

    This dataset was created to facilitate data analysis and machine learning projects. It is ideal for practicing data manipulation, statistical analysis, and predictive modeling.

  5. T

    amazon_us_reviews

    • tensorflow.org
    • huggingface.co
    Updated Dec 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). amazon_us_reviews [Dataset]. https://www.tensorflow.org/datasets/catalog/amazon_us_reviews
    Explore at:
    Dataset updated
    Dec 6, 2022
    Description

    Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

    Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

    Each Dataset contains the following columns : marketplace - 2 letter country code of the marketplace where the review was written. customer_id - Random identifier that can be used to aggregate reviews written by a single author. review_id - The unique ID of the review. product_id - The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id. product_parent - Random identifier that can be used to aggregate reviews for the same product. product_title - Title of the product. product_category - Broad product category that can be used to group reviews (also used to group the dataset into coherent parts). star_rating - The 1-5 star rating of the review. helpful_votes - Number of helpful votes. total_votes - Number of total votes the review received. vine - Review was written as part of the Vine program. verified_purchase - The review is on a verified purchase. review_headline - The title of the review. review_body - The review text. review_date - The date the review was written.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('amazon_us_reviews', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

  6. c

    Amazon products dataset oct 2022

    • crawlfeeds.com
    csv, zip
    Updated Apr 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Crawl Feeds (2024). Amazon products dataset oct 2022 [Dataset]. https://crawlfeeds.com/datasets/amazon-products-dataset-oct-2022
    Explore at:
    csv, zipAvailable download formats
    Dataset updated
    Apr 6, 2024
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy

    Description

    Amazon random products data and its last extracted on 20 sept 2022.

    Downlod similar products data for months aug and sept

    1. https://crawlfeeds.com/datasets/amazon-products-dataset-sept-2022

    2. https://crawlfeeds.com/datasets/amazon-products-dataset-aug-2022

  7. Amazon Product Reviews for NLP

    • kaggle.com
    Updated Apr 13, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yeshan Santhush (2022). Amazon Product Reviews for NLP [Dataset]. https://www.kaggle.com/datasets/yeshmesh/inconsistent-and-consistent-amazon-reviews
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 13, 2022
    Dataset provided by
    Kaggle
    Authors
    Yeshan Santhush
    Description

    The dataset contains reviews which were web scraped with the Python library BeautifulSoup, where the reviews were webscraped from Amazon products.

    The columns of the dataset:

    1. reviewId
    2. reviewDate
    3. mainDepartment
    4. subDepartment
    5. productName
    6. reviewTitle
    7. reviewStar
    8. reviewText
    9. inconsistentStatus

    How did I label my dataset, or rather how did I label the reviews as inconsistent (1) or consistent (0) ?

    To begin, the VADER Sentiment tool was utilized to extract the compound sentiment value for each text review. Subsequently, the polarity of the review's text was assigned by labeling it as 'Positive' if the review's compound value exceeded 0.05, 'Negative' if the compound value was below -0.05, and 'Neutral' otherwise. Once the text polarity had been extracted for all reviews, the star polarity for each review was determined based on the number of stars assigned. Specifically, reviews that contained a star rating of 1 or 2 were labeled as 'Negative', reviews with a rating of 3 were labeled as 'Neutral', and those with 4 or 5 stars were labeled as 'Positive'.

    In order to identify inconsistencies or mismatches within a review, a comparison was made between the review's text polarity and star polarity. Reviews that had matching polarities were labeled as 'Consistent' (represented by 0 in binary). Conversely, if there was a mismatch between the two polarities, the review was labeled as 'Inconsistent' (represented by 1 in binary). This binary value was then recorded in the 'inconsistentStatus' column.

    FYI : You could delete off the column 'inconsistentStatus' and use your own logic for labelling the rows as consistent or inconsistent.

  8. o

    Amazon Food Product Reviews & Ratings

    • opendatabay.com
    .undefined
    Updated Jun 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vdt. Data (2025). Amazon Food Product Reviews & Ratings [Dataset]. https://www.opendatabay.com/data/consumer/fd13df3c-b1af-410c-8596-7e11961381ed
    Explore at:
    .undefinedAvailable download formats
    Dataset updated
    Jun 18, 2025
    Dataset authored and provided by
    Vdt. Data
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    E-commerce & Online Transactions
    Description

    The Amazon Food Products Dataset is a large-scale collection of product listings, reviews, and metadata sourced from Amazon. This dataset is valuable for understanding consumer behaviour, analyzing product trends, and training machine learning models for recommendation systems and sentiment analysis. It includes various categories, providing insights into customer preferences, product ratings, and review sentiments.

    Dataset Features

    Each record in the dataset contains the following key fields:

    • ProductId: Unique identifier for each product.
    • UserId: Unique identifier for the reviewer.
    • ProfileName: Display the name of the reviewer.
    • HelpfulnessNumerator: Number of users who found the review helpful.
    • HelpfulnessDenominator: Total number of users who rated the review’s helpfulness.
    • Score: Product rating (1 to 5 stars).
    • Time: Unix timestamp of the review.
    • Summary: Short summary of the review.
    • Text: Full text of the review.

    Distribution

    • Data Volume: 568454 rows and 9 columns.
    • Format: CSV.
    • Structure: Tabular format with numerical, categorical, and text data.

    Usage

    This dataset is ideal for a variety of applications:

    • Sentiment Analysis: Training NLP models to predict sentiment based on reviews.
    • Product Recommendation Systems: Building collaborative filtering models.
    • Trend Analysis: Identifying popular products and customer preferences.
    • Fake Review Detection: Detecting anomalous patterns in review behaviours.

    Coverage

    • Geographic Coverage: Global.
    • Time Range: Multi-year dataset (over 10 years of reviews).
    • Demographics: General Amazon shoppers; includes various age groups and customer segments.

    License

    CC0

    Who Can Use It

    • Data Scientists: For building machine learning models.
    • Researchers: For academic analysis of customer behaviour.
    • Businesses: For market insights and customer sentiment analysis.
  9. Amazon Bin Image Dataset

    • kaggle.com
    Updated Jan 30, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dhruvil Dave (2021). Amazon Bin Image Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/1887853
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 30, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Dhruvil Dave
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    The Amazon Bin Image Dataset contains 50,000 images and metadata from bins of a pod in an operating Amazon Fulfillment Center. The bin images in this dataset are captured as robot units carry pods as part of normal Amazon Fulfillment Center operations. This dataset can be used for research in variety of areas like computer vision, counting genetic items and learning from weakly-tagged data.

    For each image, there is a corresponding entry of its metadata in JSON format stored in metadata.sqlite i.e. for image 01290.jpg, there is a corresponding json object in the data field of the metadata file which can be retrieved with query SELECT data FROM metadata WHERE img_id = 01290;

    Refer the Starter Notebook to see how to work with the dataset.

    Amazon uses a random storage scheme where items are placed into accessible bins with available space, so the contents of each bin are random, rather than organized by specific product types. Thus, each bin image may show only one type of product or a diverse range of products. Occasionally, items are misplaced while being handled, so the contents of some bin images may not match the recorded inventory of that bin.

    These are some typical images in the dataset. A bin contains multiple object categories and various number of instances. The corresponding metadata exist for each bin image and it includes the object category identification (ASIN - Amazon Standard Identification Number), quantity and dimensions of objects. The size of bins are various depending on the size of objects in it. The tapes in front of the bins are for preventing the items from falling out of the bins and sometimes it might make the objects unclear. Objects are sometimes heavily occluded by other objects or limited viewpoint of the images.

    Image Credits: Unsplash - helloimnik

  10. c

    Amazon Products Sales 2023 Dataset

    • cubig.ai
    Updated May 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Amazon Products Sales 2023 Dataset [Dataset]. https://cubig.ai/store/products/369/amazon-products-sales-2023-dataset
    Explore at:
    Dataset updated
    May 28, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
    Description

    1) Data Introduction • The Amazon Products Sales Dataset 2023 is a large e-commerce dataset that summarizes various product information in a tabular format, including product name, price, rating, discount information, images, and links by 142 major categories collected from Amazon's website.

    2) Data Utilization (1) Amazon Products Sales Dataset 2023 has characteristics that: • Each row contains 10 key attributes, including product name, main/subcategory, image, Amazon link, rating, number of ratings, discount price, and actual price. • The data encompasses a wide range of products and is structured to enable multi-faceted analysis such as price policy, customer evaluation, and trend by category. (2) Amazon Products Sales Dataset 2023 can be used to: • Product Recommendation and Marketing Strategy: Use rating, price, and category data to develop a customized recommendation system, analyze popular products, and establish a category-specific marketing strategy. • Price and Discount Policy Analysis—Based on discounted prices and actual prices, ratings, reviews, etc., it can be applied to effective pricing policies, promotion strategies, market competitiveness analyses, and more.

  11. YouTube 8 Million - Data Lakehouse Ready

    • registry.opendata.aws
    Updated Feb 17, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amazon Web Services (2022). YouTube 8 Million - Data Lakehouse Ready [Dataset]. https://registry.opendata.aws/yt8m/
    Explore at:
    Dataset updated
    Feb 17, 2022
    Dataset provided by
    Amazon Web Serviceshttp://aws.amazon.com/
    Area covered
    YouTube
    Description

    This both the original .tfrecords and a Parquet representation of the YouTube 8 Million dataset. YouTube-8M is a large-scale labeled video dataset that consists of millions of YouTube video IDs, with high-quality machine-generated annotations from a diverse vocabulary of 3,800+ visual entities. It comes with precomputed audio-visual features from billions of frames and audio segments, designed to fit on a single hard disk. This dataset also includes the YouTube-8M Segments data from June 2019. This dataset is 'Lakehouse Ready'. Meaning, you can query this data in-place straight out of the Registry of Open Data S3 bucket. Deploy this dataset's corresponding CloudFormation template to create the AWS Glue Catalog entries into your account in about 30 seconds. That one step will enable you to interact with the data with AWS Athena, AWS SageMaker, AWS EMR, or join into your AWS Redshift clusters. More detail in (the documentation)[https://github.com/aws-samples/data-lake-as-code/blob/roda-ml/README.md.

  12. w

    Dataset of business metrics of companies called Amazon

    • workwithdata.com
    Updated May 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Dataset of business metrics of companies called Amazon [Dataset]. https://www.workwithdata.com/datasets/companies?col=ceo%2Cceo_approval%2Cceo_gender%2Ccity%2Cemployee_type&f=1&fcol0=company&fop0=%3D&fval0=Amazon
    Explore at:
    Dataset updated
    May 6, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about companies. It has 19 rows and is filtered where the company is Amazon. It features 5 columns: employee type, CEO, CEO gender, and CEO approval.

  13. w

    Dataset of news about Amazon

    • workwithdata.com
    Updated May 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Dataset of news about Amazon [Dataset]. https://www.workwithdata.com/datasets/news?f=1&fcol0=page_name&fop0=%3D&fval0=Amazon
    Explore at:
    Dataset updated
    May 16, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about news. It has 6,974 rows and is filtered where the keywords includes Amazon. It features 10 columns including source, publication date, section, and news link.

  14. w

    Dataset of business metrics of companies called Amazon

    • workwithdata.com
    Updated May 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Dataset of business metrics of companies called Amazon [Dataset]. https://www.workwithdata.com/datasets/companies?col=ceo%2Cceo_approval%2Cceo_gender%2Ccity%2Cemployees&f=1&fcol0=company&fop0=%3D&fval0=Amazon
    Explore at:
    Dataset updated
    May 6, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about companies. It has 19 rows and is filtered where the company is Amazon. It features 5 columns: employees, CEO, CEO gender, and CEO approval.

  15. Amazon Customer Review Data

    • zenodo.org
    pdf
    Updated Jul 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Akash Shashikant Vaykar; Abhishek Kaushik; Abhishek Kaushik; Akash Shashikant Vaykar (2024). Amazon Customer Review Data [Dataset]. http://doi.org/10.5281/zenodo.3549704
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jul 22, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Akash Shashikant Vaykar; Abhishek Kaushik; Abhishek Kaushik; Akash Shashikant Vaykar
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset: Amazon Customer Review Data for sentiment analysis

    Size: 60889 appox.

    Format: .CSV

    Period: 2013 to 2019

    Categories: 5…… (Mobiles, Smart TV, Books, Mobile Accessories, Refrigerator)

    Unique_ID: Customized (Primary Key)

    Review_Header: user’s comment in few words

    Review_Text: User’s comment in details (3-4 lines)

    Rating: (1- Very Low, 2 🡪 Low, 3🡪 Avg, 4 🡪 Good, 5 - Excellent)

    Posting Period: 2013 to 2019

    Own_Rating: for 1-2 🡪 Negative, 3🡪 Neutral, 4-5 🡪 Positive

  16. Language Generation Dataset: 200M Samples

    • kaggle.com
    zip
    Updated Sep 7, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abhishek Chatterjee (2019). Language Generation Dataset: 200M Samples [Dataset]. https://www.kaggle.com/datasets/imdeepmind/language-generation-dataset-200m-samples
    Explore at:
    zip(3416608411 bytes)Available download formats
    Dataset updated
    Sep 7, 2019
    Authors
    Abhishek Chatterjee
    Description

    Context

    Amazon Customer Reviews Dataset is a dataset of user-generated product reviews on the shopping website Amazon. It contains over 130 million product reviews.

    This dataset contains a tiny fraction of that dataset processed and prepared specifically for language generation.

    To know how the dataset is prepared, then please check the GitHub repository for this dataset. https://github.com/imdeepmind/AmazonReview-LanguageGenerationDataset

    Content

    The dataset is stored in an SQLite database. The database contains one table called reviews. This table contains two columns sequence and next.

    The sequence column contains sequences of characters. In this dataset, each sequence of 40 characters long.

    The next column contains the next character after the sequence.

    There are about 200 million samples are in the dataset.

    Acknowledgements

    Thanks to Amazon for making this awesome dataset. Here is the link for the dataset: https://s3.amazonaws.com/amazon-reviews-pds/readme.html

    Inspiration

    This dataset can be used for Language Generation. As it contains 200 million samples, complex Deep Learning models can be trained on this data.

  17. c

    Apple mobile phones reviews

    • crawlfeeds.com
    json, zip
    Updated Apr 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Crawl Feeds (2025). Apple mobile phones reviews [Dataset]. https://crawlfeeds.com/datasets/apple-mobile-phones-reviews
    Explore at:
    zip, jsonAvailable download formats
    Dataset updated
    Apr 29, 2025
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy

    Description

    App mobile phones reviews structured dataset. This small dataset is ideal for NLP and to test machine learning algorithms.

    Get large dataset from our resources.

    Extracted from amazon.

    Data included only for apple mobile phones.

    Reach out to us for large datasets

  18. Importance of selected big data access methods worldwide 2019

    • statista.com
    Updated Jul 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Importance of selected big data access methods worldwide 2019 [Dataset]. https://www.statista.com/statistics/919491/worldwide-big-data-access-methods/
    Explore at:
    Dataset updated
    Jul 9, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2019
    Area covered
    Worldwide
    Description

    This statistic shows the importance of big data access methods worldwide as of 2019. Amazon S3 was seen as the most important big data access method, with around ** percent of respondents stating that it was critical or very important to their organization.

  19. u

    Pinterest Fashion Compatibility

    • cseweb.ucsd.edu
    • beta.data.urbandatacentre.ca
    json
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UCSD CSE Research Project, Pinterest Fashion Compatibility [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets.html
    Explore at:
    jsonAvailable download formats
    Dataset authored and provided by
    UCSD CSE Research Project
    Description

    This dataset contains images (scenes) containing fashion products, which are labeled with bounding boxes and links to the corresponding products.

    Metadata includes

    • product IDs

    • bounding boxes

    Basic Statistics:

    • Scenes: 47,739

    • Products: 38,111

    • Scene-Product Pairs: 93,274

  20. u

    Product Exchange/Bartering Data

    • cseweb.ucsd.edu
    json
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UCSD CSE Research Project, Product Exchange/Bartering Data [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets.html
    Explore at:
    jsonAvailable download formats
    Dataset authored and provided by
    UCSD CSE Research Project
    Description

    These datasets contain peer-to-peer trades from various recommendation platforms.

    Metadata includes

    • peer-to-peer trades

    • have and want lists

    • image data (tradesy)

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Bright Data (2025). Amazon Dataset [Dataset]. https://brightdata.com/products/datasets/amazon
Organization logo

Amazon Dataset

Explore at:
.json, .csv, .xlsxAvailable download formats
Dataset updated
Jul 11, 2025
Dataset authored and provided by
Bright Datahttps://brightdata.com/
License

https://brightdata.com/licensehttps://brightdata.com/license

Area covered
Worldwide
Description

Buy Amazon datasets and get access to over 300 million records from any Amazon domain. Get insights on Amazon products, sellers, and reviews.

Search
Clear search
Close search
Google apps
Main menu