Facebook
TwitterThis data is from E-Commerce. I used postgreSQL for data cleaning. I transformed NULL values to 'Not defined' and orginal data have only category name column(which was 'category_code') and that was 'DOT' seperated value which show us the products class from wide to specific. So I split them with delimeter('.').
| column name | description |
|---|---|
| time | Time when event happened at (in UTC). |
| event_name | 4 kinds of value: purchase, cart, view, remove_from_cart |
| product_id | ID of a product |
| category_id | Product's category ID |
| category_name | Product's category taxonomy (code name) if it was possible to make it. Usually present for meaningful categories and skipped for different kinds of accessories. |
| brand | Downcased string of brand name. |
| price | Float price of a product. |
| user_id | Permanent user ID. |
| session | Temporary user's session ID. Same for each user's session. Is changed every time user come back to online store from a long pause. |
| category_1 | Largest class of product included |
| category_2 | Bigger class of product included |
| category_3 | Smallest class of product included |
Many thanks Thanks to REES46 Marketing Platform for this dataset and Michael Kechinov
You can use this dataset for free. Just mention the source of it: link to this page and link to REES46 Marketing Platform and Origin data provider
Your data will be in front of the world's largest data science community. What questions do you want to see answered?
Facebook
Twitterhttps://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
Unlock the power of Flipkart's extensive product catalog with our meticulously curated e-commerce dataset. This dataset provides detailed information on a wide range of products available on Flipkart, including product names, descriptions, prices, customer reviews, ratings, and images. Whether you're working on data analysis, machine learning models, or conducting in-depth market research, this dataset is an invaluable resource.
With our Flipkart e-commerce dataset, you can easily analyze trends, compare products, and gain insights into consumer behavior. The dataset is structured and high-quality, ensuring that you have the best foundation for your projects.
Flipkart is largest E-commerce website based out india. Pre crawled dataset having more than 5.7 million records.
Where to use dataset
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
📦 Ecommerce Dataset (Products & Sizes Included)
🛍️ Essential Data for Building an Ecommerce Website & Analyzing Online Shopping Trends 📌 Overview This dataset contains 1,000+ ecommerce products, including detailed information on pricing, ratings, product specifications, seller details, and more. It is designed to help data scientists, developers, and analysts build product recommendation systems, price prediction models, and sentiment analysis tools.
🔹 Dataset Features
Column Name Description product_id Unique identifier for the product title Product name/title product_description Detailed product description rating Average customer rating (0-5) ratings_count Number of ratings received initial_price Original product price discount Discount percentage (%) final_price Discounted price currency Currency of the price (e.g., USD, INR) images URL(s) of product images delivery_options Available delivery methods (e.g., standard, express) product_details Additional product attributes breadcrumbs Category path (e.g., Electronics > Smartphones) product_specifications Technical specifications of the product amount_of_stars Distribution of star ratings (1-5 stars) what_customers_said Customer reviews (sentiments) seller_name Name of the product seller sizes Available sizes (for clothing, shoes, etc.) videos Product video links (if available) seller_information Seller details, such as location and rating variations Different variants of the product (e.g., color, size) best_offer Best available deal for the product more_offers Other available deals/offers category Product category
📊 Potential Use Cases
📌 Build an Ecommerce Website: Use this dataset to design a functional online store with product listings, filtering, and sorting. 🔍 Price Prediction Models: Predict product prices based on features like ratings, category, and discount. 🎯 Recommendation Systems: Suggest products based on user preferences, rating trends, and customer feedback. 🗣 Sentiment Analysis: Analyze what_customers_said to understand customer satisfaction and product popularity. 📈 Market & Competitor Analysis: Track pricing trends, popular categories, and seller performance. 🔍 Why Use This Dataset? ✅ Rich Feature Set: Includes all necessary ecommerce attributes. ✅ Realistic Pricing & Rating Data: Useful for price analysis and recommendations. ✅ Multi-Purpose: Suitable for machine learning, web development, and data visualization. ✅ Structured Format: Easy-to-use CSV format for quick integration.
📂 Dataset Format
CSV file (ecommerce_dataset.csv)
1000+ samples
Multi-category coverage
🔗 How to Use?
Download the dataset from Kaggle.
Load it in Python using Pandas:
python
Copy
Edit
import pandas as pd
df = pd.read_csv("ecommerce_dataset.csv")
df.head()
Explore trends & patterns using visualization tools (Seaborn, Matplotlib).
Build models & applications based on the dataset!
Facebook
TwitterIntroducing E-Commerce Product Datasets!
Unlock the full potential of your product strategy with E-Commerce Product Datasets. Gain invaluable insights to optimize your product offerings and pricing, analyze top-selling strategies, and assess customer sentiment.
Our E-Commerce Datasets Source:
Amazon: Access accurate product data from Amazon, including categories, pricing, reviews, and more.
Walmart: Receive comprehensive product information from Walmart, covering pricing, sellers, ratings, availability, and more.
E-Commerce Product Datasets provide structured and actionable data, empowering you to understand customer needs and enhance product strategies. We deliver fresh and precise public e-commerce data, including product names, brands, prices, number of sellers, review counts, ratings, and availability.
You have the flexibility to tailor data delivery to your specific needs:
Why Choose Oxylabs E-Commerce Datasets:
Fresh and accurate data: Access clean and structured public e-commerce data collected by our leading web scraping professionals.
Time and resource savings: Let our experts handle data extraction at an affordable cost, allowing you to focus on your core business objectives.
Customizable solutions: Share your unique business needs, and our team will craft customized dataset solutions tailored to your requirements.
Legal compliance: Partner with a trusted leader in ethical data collection, endorsed by Fortune 500 companies and fully compliant with GDPR and CCPA regulations.
Pricing Options:
Custom Datasets: Tailor datasets from any public web domain to your unique business needs. Contact our sales team for custom pricing.
Experience a seamless journey with Oxylabs:
Unlock the potential of your e-commerce strategy with E-Commerce Product Datasets!
Facebook
Twitterhttps://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
1) Data Introduction • The E-Commerce Data Dataset contains actual transaction records from an online retail company based in the UK. It includes various transaction-related attributes such as customer ID, product information, transaction date, quantity, and country.
2) Data Utilization (1) Characteristics of the E-Commerce Data Dataset: • This dataset is structured as time-series consumer behavior data at the transaction level. It includes attributes such as product category, quantity, unit price, and country, making it suitable for analyzing country-specific consumption patterns and developing region-based classification models.
(2) Applications of the E-Commerce Data Dataset: • Developing country-specific marketing strategies: By analyzing purchasing trends, frequently bought product categories, and transaction frequency by country, the dataset can be used to design regionally tailored marketing strategies.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
International e-Commerce Benchmarking
Source agency: Office for National Statistics
Designation: Supporting material
Language: English
Alternative title: e-Commerce
Facebook
Twitterhttps://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
Explore the booming Big Data in E-commerce market, driven by personalized experiences and AI. Discover market size, CAGR, key drivers, trends, and regional growth forecasts for 2025-2033.
Facebook
Twitterhttps://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
Unlock the potential of data with our extensive Furniture Images and Product Schema dataset from Target.
This meticulously curated dataset provides high-quality images and detailed product information, making it an invaluable resource for e-commerce platforms, market analysis, and data-driven insights.
What’s Included:
Benefits:
Use Cases:
Total records count 40K+ and total images 212K+.
Local image file path mentioned in JSON file.
Facebook
TwitterOpen Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
License information was derived automatically
E-commerce Product Dataset - Clean and Enhance Your Data Analysis Skills or Check Out The Cleaned File Below!
This dataset offers a comprehensive collection of product information from an e-commerce store, spread across 20+ CSV files and encompassing over 80,000+ products. It presents a valuable opportunity to test and refine your data cleaning and wrangling skills.
What's Included:
A variety of product categories, including:
Each product record contains details such as:
Challenges and Opportunities:
Data Cleaning: The dataset is "dirty," containing missing values, inconsistencies in formatting, and potential errors. This provides a chance to practice your data-cleaning techniques such as:
Feature Engineering: After cleaning, you can explore opportunities to create new features from the existing data, such as: - Extracting keywords from product titles and descriptions - Deriving price categories - Calculating average discounts
Who can benefit from this dataset?
Facebook
TwitterOpenWeb Ninja's Amazon Data API provides fast and reliable access to real-time Amazon data across all 22 Amazon domains. With over 600 million product listings and more than 40 data points per product, the API makes it simple to search products, query by category, and extract structured ecommerce product data at scale.
Key capabilities: - Product Search & Categories: search Amazon by keyword or retrieve products directly from categories. - Product Data: titles, descriptions, images, pricing, availability, attributes. - Amazon Reviews Data: full review content, ratings, timestamps, helpful counts. - Offers & Sellers Data: all current offers, with sellers data, and more. - Amazon Sellers Data: Amazon sellers profile, sold products, and seller reviews. - Best Sellers & Deals: Amazon Best Sellers by category, Today’s Deals, and promotions. - ASIN to GTIN: convert ASIN to GTIN/EAN/ISBN for external integrations.
Coverage & Scale: - 600M+ products across all major categories and industries. - 22 Amazon countries/domains supported. - 40+ structured data points per product. - Real-time updates, delivered via a fast and reliable REST API.
Use cases: - Pricing and product comparison tools. - Ecommerce and market research. - Seller and competitor monitoring. - Product discovery and trend analysis. - Sentiment analysis with customer product reviews data.
With OpenWeb Ninja's Amazon Data API, you get the most complete Amazon data - from product details and reviews to best sellers and deals - always delivered in real time through a fast and reliable REST API.
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
The E-commerce Order Dataset provides comprehensive information related to orders, items within orders, customers, payments, and products for an e-commerce platform. This dataset is structured with multiple tables, each containing specific information about various aspects of the e-commerce operations.
Facebook
Twitterhttps://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy
| BASE YEAR | 2024 |
| HISTORICAL DATA | 2019 - 2023 |
| REGIONS COVERED | North America, Europe, APAC, South America, MEA |
| REPORT COVERAGE | Revenue Forecast, Competitive Landscape, Growth Factors, and Trends |
| MARKET SIZE 2024 | 2.87(USD Billion) |
| MARKET SIZE 2025 | 3.15(USD Billion) |
| MARKET SIZE 2035 | 8.0(USD Billion) |
| SEGMENTS COVERED | Application, Deployment Type, End User, Data Type, Regional |
| COUNTRIES COVERED | US, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA |
| KEY MARKET DYNAMICS | Data privacy regulations, Increasing data-driven decision making, Demand for competitive intelligence, Rising automation in analytics, Growth of e-commerce platforms |
| MARKET FORECAST UNITS | USD Billion |
| KEY COMPANIES PROFILED | Octoparse, WebHarvy, DataMiner, WebRobot, Zyte, DataSift, Scrapy, Import.io, Diffbot, Content Grabber, Mozenda, Fivetran, Beautiful Soup, Apify, ParseHub, Bright Data |
| MARKET FORECAST PERIOD | 2025 - 2035 |
| KEY MARKET OPPORTUNITIES | Increased demand for real-time data, Growth of e-commerce analytics, Rising need for competitive intelligence, Expansion of AI and ML integration, Enhanced regulatory compliance requirements |
| COMPOUND ANNUAL GROWTH RATE (CAGR) | 9.8% (2025 - 2035) |
Facebook
Twitter📦 DataSignals E-Commerce Signals (v1)
High-frequency web signals dataset generated automatically from E-commerce pages. This dataset contains structured signals extracted from HTML snapshots of E-commerce websites.The goal is to provide high-quality, machine-readable signals useful for:
🧠 AI training
📈 Market intelligence
🔎 Change detection
🛒 E-commerce research
🔮 Price prediction models
⚡ Real-time data pipelines
📊 Dataset Contents
Each row… See the full description on the dataset page: https://huggingface.co/datasets/datasignals-lab/datasignals-ecommerce-1.
Facebook
Twitterhttps://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/
Data Lakes Market size was valued at USD 17.21 Billion in 2024 and is projected to reach USD 79.09 Billion by 2031, growing at a CAGR of 21.00% during the forecasted period 2024 to 2031.
The data lakes market is driven by the growing need for organizations to manage and analyze vast amounts of unstructured and structured data for better decision-making and insights. As businesses increasingly rely on big data analytics, machine learning, and artificial intelligence to gain competitive advantages, data lakes provide a scalable and cost-effective solution to store raw data from diverse sources. The rising adoption of cloud-based solutions further fuels the market, as cloud data lakes offer flexibility, agility, and seamless integration with analytics tools. Additionally, the growing emphasis on digital transformation, real-time data processing, and enhanced data governance are key factors pushing the demand for data lakes across industries such as finance, healthcare, retail, and manufacturing.
Facebook
Twitterhttps://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
The size of the Slovakia E-Commerce Industry market was valued at USD XX Million in 2023 and is projected to reach USD XXX Million by 2032, with an expected CAGR of 14.22% during the forecast period. Recent developments include: December 2021 - The Slovak Republic's Ministry of Finance announced the implementation of a central national electronic invoicing system which allows businesses to send e-invoices in a structured data format to the tax authority. Such developments are expected to aid the growth of the studied market in the country.. Key drivers for this market are: High Emphasis on Fashion. Potential restraints include: , Lack of Comprehensiveness of Chinese Digital Design Tools. Notable trends are: Significant Growth in E-Commerce is Expected due to Digital Transformation.
Facebook
Twitter
According to our latest research, the global retail data monetization platform market size stood at USD 4.8 billion in 2024, with an observed compound annual growth rate (CAGR) of 19.2% from 2025 to 2033. The market is projected to reach USD 20.3 billion by 2033, driven primarily by the increasing adoption of advanced analytics, the proliferation of e-commerce, and the growing demand for actionable insights to enhance customer experiences in the retail sector. As per our latest research, the robust growth trajectory is underpinned by retailers’ urgent need to leverage data assets for competitive advantage, optimize operations, and unlock new revenue streams through data-driven strategies.
The primary growth factor for the retail data monetization platform market is the exponential increase in data generation across retail touchpoints, including point-of-sale systems, online transactions, customer loyalty programs, and social media interactions. Retailers are increasingly realizing the untapped value of their data assets, which can be transformed into actionable intelligence for internal optimization and external monetization. The integration of artificial intelligence (AI) and machine learning (ML) technologies within these platforms further enhances their predictive capabilities, enabling retailers to anticipate consumer trends, personalize marketing efforts, and drive operational efficiency. This trend is particularly pronounced among large enterprises, which possess the infrastructure and resources to invest in sophisticated data monetization solutions, thereby fueling overall market growth.
Another significant driver is the rapid digital transformation of the global retail landscape. The shift to omnichannel retailing and the rise of e-commerce have created vast reservoirs of structured and unstructured data. Retailers and e-commerce companies are increasingly seeking platforms that can aggregate, cleanse, analyze, and monetize this data, both internally and through partnerships with third parties such as consumer goods manufacturers, financial institutions, and advertising agencies. The increasing regulatory emphasis on data privacy and security, especially in regions like North America and Europe, has also prompted the adoption of platforms that offer robust compliance features, further boosting market demand. As retailers strive to deliver hyper-personalized experiences and optimize their supply chains, the adoption of retail data monetization platforms is set to accelerate.
The proliferation of cloud computing and the availability of scalable Software-as-a-Service (SaaS) solutions have democratized access to advanced data monetization tools, making them accessible to small and medium enterprises (SMEs) as well. Cloud-based deployment models offer cost-effectiveness, flexibility, and ease of integration with existing retail IT ecosystems, enabling even resource-constrained businesses to participate in the data economy. This democratization is expected to broaden the addressable market and drive higher adoption rates across diverse retail segments. Furthermore, the emergence of data marketplaces and collaborative analytics ecosystems is enabling retailers to monetize their data assets externally, creating new revenue streams and fostering innovation.
From a regional perspective, North America currently dominates the retail data monetization platform market, accounting for the largest market share in 2024, followed closely by Europe and Asia Pacific. The high concentration of technologically advanced retailers, mature e-commerce ecosystems, and a strong focus on data-driven decision-making are key factors supporting market leadership in these regions. Asia Pacific is anticipated to exhibit the fastest growth over the forecast period, fueled by rapid digitalization, expanding middle-class consumer base, and increasing investments in retail technology infrastructure. Meanwhile, Latin America and the Middle East & Africa are emerging as promising markets, driven by growing retail modernization initiatives and rising awareness of the benefits of data monetization.
Facebook
Twitterhttps://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
Looking for a free Walmart product dataset? The Walmart Products Free Dataset delivers a ready-to-use ecommerce product data CSV containing ~2,100 verified product records from Walmart.com. It includes vital details like product titles, prices, categories, brand info, availability, and descriptions — perfect for data analysis, price comparison, market research, or building machine-learning models.
Complete Product Metadata: Each entry includes URL, title, brand, SKU, price, currency, description, availability, delivery method, average rating, total ratings, image links, unique ID, and timestamp.
CSV Format, Ready to Use: Download instantly - no need for scraping, cleaning or formatting.
Good for E-commerce Research & ML: Ideal for product cataloging, price tracking, demand forecasting, recommendation systems, or data-driven projects.
Free & Easy Access: Priced at USD $0.0, making it a great starting point for developers, data analysts or students.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset provides a detailed, structured catalog of e-commerce products, including SKUs, descriptions, pricing, categories, images, inventory status, and store associations. It is ideal for powering online storefronts, supporting inventory management, and enabling advanced analytics for merchandising and sales optimization.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains detailed information on 100 e-commerce products, including product titles, descriptions, pricing, categories, and ratings. It is ideal for learning data cleaning, text processing (NLP), or building recommendation systems.
The data has been sourced via the DummyJSON API and structured to support tasks like:
Exploratory Data Analysis (EDA)
Building product recommendation models
Sentiment or review-based analysis
Data visualization and dashboarding
Columns include fields like title, description, brand, category, price, rating, and more.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains historical sales data for an e-commerce platform, including customer behavior, product preferences, and transaction details. The data is structured to support analyses aimed at understanding customer behavior, predicting product preferences, and improving overall revenue through strategic marketing and sales efforts. Full data: Brazilian E-commerce Public Dataset
The dataset is intended for: - Analyzing customer behavior to improve marketing strategies. - Predicting product preferences to enhance cross-selling and up-selling. - Automating reporting and creating real-time dashboards. - Implementing and testing machine learning models for sales prediction and customer retention strategies.
The dataset is provided in CSV format, with each file corresponding to a different aspect of the e-commerce data (e.g., customers, products, transactions, reviews). Each file includes relevant columns for the type of data it contains.
Facebook
TwitterThis data is from E-Commerce. I used postgreSQL for data cleaning. I transformed NULL values to 'Not defined' and orginal data have only category name column(which was 'category_code') and that was 'DOT' seperated value which show us the products class from wide to specific. So I split them with delimeter('.').
| column name | description |
|---|---|
| time | Time when event happened at (in UTC). |
| event_name | 4 kinds of value: purchase, cart, view, remove_from_cart |
| product_id | ID of a product |
| category_id | Product's category ID |
| category_name | Product's category taxonomy (code name) if it was possible to make it. Usually present for meaningful categories and skipped for different kinds of accessories. |
| brand | Downcased string of brand name. |
| price | Float price of a product. |
| user_id | Permanent user ID. |
| session | Temporary user's session ID. Same for each user's session. Is changed every time user come back to online store from a long pause. |
| category_1 | Largest class of product included |
| category_2 | Bigger class of product included |
| category_3 | Smallest class of product included |
Many thanks Thanks to REES46 Marketing Platform for this dataset and Michael Kechinov
You can use this dataset for free. Just mention the source of it: link to this page and link to REES46 Marketing Platform and Origin data provider
Your data will be in front of the world's largest data science community. What questions do you want to see answered?