28 datasets found
  1. E-Commerce Sales Dataset

    • kaggle.com
    Updated Dec 3, 2022
    Cite
    The Devastator (2022). E-Commerce Sales Dataset [Dataset]. https://www.kaggle.com/datasets/thedevastator/unlock-profits-with-e-commerce-sales-data/code
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Dec 3, 2022
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    The Devastator
    Description

    E-Commerce Sales Dataset

    Analyzing and Maximizing Online Business Performance

    By ANil [source]

    About this dataset

    This dataset provides an in-depth look at the profitability of e-commerce sales. It contains data on a variety of sales channels, including Shiprocket and INCREFF, as well as financial information on related expenses and profits. The columns contain data such as SKU codes, design numbers, stock levels, product categories, sizes, and colors. The dataset also includes the MRPs across multiple stores (Ajio MRP, Amazon MRP, Amazon FBA MRP, Flipkart MRP, Limeroad MRP, Myntra MRP, and Paytm MRP), along with other key parameters such as the amount paid by the customer for the purchase and the rate per piece for each transaction. Transactional parameters are included as well: Date of sale, Months, Category, Fulfilledby, B2B, Status, Qty, Currency, and Gross amt. This is a must-have dataset for anyone trying to uncover the profitability of e-commerce sales in today's marketplace.

    How to use the dataset

    This dataset provides a comprehensive overview of e-commerce sales data from different channels covering a variety of products. Using this dataset, retailers and digital marketers can measure the performance of their campaigns more accurately and efficiently.

    The following steps help users make the most of this dataset:
    - Analyze general sales trends by examining fields such as month, category, currency, stock level, and customer for each sale. This gives an idea of how your e-commerce business is performing in each channel.
    - Review the Shiprocket and INCREFF data to compare profitability across fulfilment methods. This comparison can inform decisions that maximize profit while minimizing the referral fees and fulfilment costs associated with each method.
    - Compare prices between channels such as Amazon FBA MRP, Myntra MRP, and Ajio MRP using the corresponding column for each store. Analyzing these price points together with the cost-per-piece columns (TP1/TP2) shows which stores offer more profitable margins.
    - Look at customer-specific data, such as Gross Amount or Rate (price per piece) for each TP 1/TP 2 combination, or the total gross amount generated by an SKU across multiple customers. Use the associated dates to track an item's performance relative to others in its category over a filtered time period. Keep an eye on items commonly sold under offers or promotional discounts to inform inventory optimization and up-selling strategies.
    - Finally, use the overall Stock details together with the P&L data, including the yearly Expenses_IIGF records, to identify cost-cutting measures, such as choosing carefully between the Shiprocket and INCREFF delivery options or reducing manual inspection and support-personnel costs.
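
    The channel price comparison in the steps above can be sketched roughly as follows. This is a minimal illustration, not the dataset author's method: the rows are made up, the column names ("Sku", "TP 1", "Amazon MRP", and so on) are assumed from the description and may not match the real CSV headers, and "markup" is defined here as (MRP - cost) / cost.

```python
from statistics import mean

# Hypothetical rows; column names are assumed from the dataset description
# and may differ from the actual CSV headers.
rows = [
    {"Sku": "A-1", "TP 1": 500.0, "Amazon MRP": 1200.0, "Myntra MRP": 1300.0, "Ajio MRP": 1250.0},
    {"Sku": "B-2", "TP 1": 800.0, "Amazon MRP": 2000.0, "Myntra MRP": 1900.0, "Ajio MRP": 2100.0},
]

CHANNELS = ["Amazon MRP", "Myntra MRP", "Ajio MRP"]


def mean_markup_by_channel(rows, channels, cost_col="TP 1"):
    """Return {channel: mean of (MRP - cost) / cost} across all rows."""
    return {
        channel: mean((row[channel] - row[cost_col]) / row[cost_col] for row in rows)
        for channel in channels
    }


markups = mean_markup_by_channel(rows, CHANNELS)
# Channel with the highest average markup over cost in this toy sample.
best_channel = max(markups, key=markups.get)
```

    On real data, rows with a missing or zero cost would need to be filtered out before dividing.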

    By understanding how each channel performs, and by auditing costs against shipment tracking sheets and current supply-chain efficiency, one may lower operational costs, cut otherwise unseen losses, and scale profits, then introduce new marketing campaigns tailored to the product range and the available transportation options.

    Research Ideas

    • Analysing the difference in profitability between sales made through Shiprocket and INCREFF. This data can be used to see where the biggest profit margins lie, and strategize accordingly.
    • Examining the Complete Cost structure of a product with all its components and their contribution towards revenue or profitability, i.e., TP 1 & 2, MRP Old & Final MRP Old together with Platform based MRP - Amazon, Myntra and Paytm etc., Currency based Profit Margin etc.
    • Building a predictive model using Machine Learning by leveraging historical data to predict future sales volume and profits for e-commerce products across multiple categories/devices/platforms such as Amazon, Flipkart, Myntra etc as well providing m...
  2. Walmart products free dataset

    • crawlfeeds.com
    csv, zip
    Updated Apr 27, 2025
    Cite
    Crawl Feeds (2025). Walmart products free dataset [Dataset]. https://crawlfeeds.com/datasets/walmart-products-free-dataset
    Explore at:
    Available download formats: zip, csv
    Dataset updated
    Apr 27, 2025
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policy

    Description

    Discover the Walmart Products Free Dataset, featuring 2,000 records in CSV format. This dataset includes detailed information about various Walmart products, such as names, prices, categories, and descriptions.

    It’s perfect for data analysis, e-commerce research, and machine learning projects. Download now and kickstart your insights with accurate, real-world data.

  3. Linear Regression E-commerce Dataset

    • kaggle.com
    zip
    Updated Sep 16, 2019
    Cite
    Saurabh Kolawale (2019). Linear Regression E-commerce Dataset [Dataset]. https://www.kaggle.com/datasets/kolawale/focusing-on-mobile-app-or-website
    Explore at:
    Available download formats: zip (44169 bytes)
    Dataset updated
    Sep 16, 2019
    Authors
    Saurabh Kolawale
    Description

    This dataset contains data on customers who buy clothes online. The store offers in-store style and clothing advice sessions. Customers come in to the store, have sessions with a personal stylist, and can then go home and order the clothes they want through either a mobile app or a website.

    The company is trying to decide whether to focus their efforts on their mobile app experience or their website.
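
    One way to approach the app-versus-website question is an ordinary least-squares fit of spending on time spent in each channel. The sketch below is illustrative only: the variable names (time_on_app, time_on_website, yearly_spend) and the numbers are hypothetical, not taken from the dataset, and the fit is written out by hand for two predictors rather than using a statistics library.

```python
def fit_two_feature_ols(x1, x2, y):
    """Least-squares fit of y = b0 + b1*x1 + b2*x2; returns (b0, b1, b2)."""
    n = len(y)
    m1, m2, my = sum(x1) / n, sum(x2) / n, sum(y) / n
    # Center every variable so the intercept drops out of the normal equations.
    d1 = [a - m1 for a in x1]
    d2 = [b - m2 for b in x2]
    dy = [c - my for c in y]
    s11 = sum(a * a for a in d1)
    s22 = sum(b * b for b in d2)
    s12 = sum(a * b for a, b in zip(d1, d2))
    s1y = sum(a * c for a, c in zip(d1, dy))
    s2y = sum(b * c for b, c in zip(d2, dy))
    det = s11 * s22 - s12 * s12
    if det == 0:
        raise ValueError("predictors are collinear; coefficients are not identifiable")
    # Solve the 2x2 normal equations with Cramer's rule.
    b1 = (s1y * s22 - s2y * s12) / det
    b2 = (s2y * s11 - s1y * s12) / det
    b0 = my - b1 * m1 - b2 * m2
    return b0, b1, b2


# Hypothetical data (minutes per session and yearly spend), constructed so that
# yearly_spend = 100 + 40 * time_on_app + 2 * time_on_website exactly.
time_on_app = [10.0, 12.0, 11.0, 13.0, 9.0]
time_on_website = [36.0, 37.0, 35.0, 38.0, 39.0]
yearly_spend = [100 + 40 * a + 2 * w for a, w in zip(time_on_app, time_on_website)]

intercept, app_coef, web_coef = fit_two_feature_ols(time_on_app, time_on_website, yearly_spend)
```

    Because both predictors are measured in the same unit here, the coefficients can be compared directly; a much larger coefficient for one channel suggests that extra time there is associated with more spending, though it does not by itself show causation.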

  4. Looker Ecommerce BigQuery Dataset

    • kaggle.com
    Updated Jan 18, 2024
    Cite
    Mustafa Keser (2024). Looker Ecommerce BigQuery Dataset [Dataset]. https://www.kaggle.com/datasets/mustafakeser4/looker-ecommerce-bigquery-dataset
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Jan 18, 2024
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Mustafa Keser
    Description

    Looker Ecommerce Dataset Description

    CSV version of Looker Ecommerce Dataset.

    Overview

    Dataset in BigQuery: TheLook is a fictitious eCommerce clothing site developed by the Looker team. The dataset contains information about customers, products, orders, logistics, web events, and digital marketing campaigns. The contents of this dataset are synthetic, and are provided to industry practitioners for the purpose of product discovery, testing, and evaluation. This public dataset is hosted in Google BigQuery and is included in BigQuery's 1TB/mo of free tier processing. This means that each user receives 1TB of free BigQuery processing every month, which can be used to run queries on this public dataset. Watch this short video to learn how to get started quickly using BigQuery to access public datasets.

    1. distribution_centers.csv

    • Columns:
      • id: Unique identifier for each distribution center.
      • name: Name of the distribution center.
      • latitude: Latitude coordinate of the distribution center.
      • longitude: Longitude coordinate of the distribution center.

    2. events.csv

    • Columns:
      • id: Unique identifier for each event.
      • user_id: Identifier for the user associated with the event.
      • sequence_number: Sequence number of the event.
      • session_id: Identifier for the session during which the event occurred.
      • created_at: Timestamp indicating when the event took place.
      • ip_address: IP address from which the event originated.
      • city: City where the event occurred.
      • state: State where the event occurred.
      • postal_code: Postal code of the event location.
      • browser: Web browser used during the event.
      • traffic_source: Source of the traffic leading to the event.
      • uri: Uniform Resource Identifier associated with the event.
      • event_type: Type of event recorded.

    3. inventory_items.csv

    • Columns:
      • id: Unique identifier for each inventory item.
      • product_id: Identifier for the associated product.
      • created_at: Timestamp indicating when the inventory item was created.
      • sold_at: Timestamp indicating when the item was sold.
      • cost: Cost of the inventory item.
      • product_category: Category of the associated product.
      • product_name: Name of the associated product.
      • product_brand: Brand of the associated product.
      • product_retail_price: Retail price of the associated product.
      • product_department: Department to which the product belongs.
      • product_sku: Stock Keeping Unit (SKU) of the product.
      • product_distribution_center_id: Identifier for the distribution center associated with the product.

    4. order_items.csv

    • Columns:
      • id: Unique identifier for each order item.
      • order_id: Identifier for the associated order.
      • user_id: Identifier for the user who placed the order.
      • product_id: Identifier for the associated product.
      • inventory_item_id: Identifier for the associated inventory item.
      • status: Status of the order item.
      • created_at: Timestamp indicating when the order item was created.
      • shipped_at: Timestamp indicating when the order item was shipped.
      • delivered_at: Timestamp indicating when the order item was delivered.
      • returned_at: Timestamp indicating when the order item was returned.

    5. orders.csv

    • Columns:
      • order_id: Unique identifier for each order.
      • user_id: Identifier for the user who placed the order.
      • status: Status of the order.
      • gender: Gender information of the user.
      • created_at: Timestamp indicating when the order was created.
      • returned_at: Timestamp indicating when the order was returned.
      • shipped_at: Timestamp indicating when the order was shipped.
      • delivered_at: Timestamp indicating when the order was delivered.
      • num_of_item: Number of items in the order.

    6. products.csv

    • Columns:
      • id: Unique identifier for each product.
      • cost: Cost of the product.
      • category: Category to which the product belongs.
      • name: Name of the product.
      • brand: Brand of the product.
      • retail_price: Retail price of the product.
      • department: Department to which the product belongs.
      • sku: Stock Keeping Unit (SKU) of the product.
      • distribution_center_id: Identifier for the distribution center associated with the product.

    7. users.csv

    • Columns:
      • id: Unique identifier for each user.
      • first_name: First name of the user.
      • last_name: Last name of the user.
      • email: Email address of the user.
      • age: Age of the user.
      • gender: Gender of the user.
      • state: State where t...
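
    As a rough illustration of how the tables listed above relate, the sketch below joins order_items to products on product_id and sums the listed retail_price minus cost per category. The sample rows are invented, and the status values ("Complete", "Returned", "Cancelled") are assumptions, since the listing does not enumerate them.

```python
from collections import defaultdict

# Hypothetical sample rows that use only columns named in the listing above.
products = [
    {"id": 1, "category": "Jeans", "cost": 20.0, "retail_price": 50.0},
    {"id": 2, "category": "Tops", "cost": 5.0, "retail_price": 15.0},
]
order_items = [
    {"id": 10, "order_id": 100, "product_id": 1, "status": "Complete"},
    {"id": 11, "order_id": 100, "product_id": 2, "status": "Complete"},
    {"id": 12, "order_id": 101, "product_id": 2, "status": "Returned"},
]


def margin_by_category(order_items, products, excluded=("Returned", "Cancelled")):
    """Sum (retail_price - cost) per product category over non-excluded order items."""
    products_by_id = {p["id"]: p for p in products}
    totals = defaultdict(float)
    for item in order_items:
        if item["status"] in excluded:  # assumed status labels
            continue
        product = products_by_id[item["product_id"]]
        totals[product["category"]] += product["retail_price"] - product["cost"]
    return dict(totals)


category_margin = margin_by_category(order_items, products)
```

    With the real CSV files, the same join could be done by reading products.csv and order_items.csv with csv.DictReader and converting the numeric fields first.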
  5. ECommerce Data Analysis

    • kaggle.com
    Updated Jan 1, 2024
    Cite
    M Mohaiminul Islam (2024). ECommerce Data Analysis [Dataset]. https://www.kaggle.com/datasets/mmohaiminulislam/ecommerce-data-analysis
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Jan 1, 2024
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    M Mohaiminul Islam
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    Objectives:

    • I leveraged advanced data visualization techniques to extract valuable insights from a comprehensive dataset. By visualizing sales patterns, customer behavior, and product trends, I identified key growth opportunities and provided actionable recommendations to optimize business strategies and enhance overall performance. You can find the GitHub repo here: Link to GitHub Repository.

    Data Description:

    There are exactly 6 tables: 1 is a fact table and the rest are dimension tables.

    Fact Table:

    payment_key:
      Description: An identifier representing the payment transaction associated with the fact.
      Use Case: This key links to a payment dimension table, providing details about the payment method and related information.
    
    customer_key:
      Description: An identifier representing the customer associated with the fact.
      Use Case: This key links to a customer dimension table, providing details about the customer, such as name, address, and other customer-specific information.
    
    time_key:
      Description: An identifier representing the time dimension associated with the fact.
      Use Case: This key links to a time dimension table, providing details about the time of the transaction, such as date, day of the week, and month.
    
    item_key:
      Description: An identifier representing the item or product associated with the fact.
      Use Case: This key links to an item dimension table, providing details about the product, such as category, sub-category, and product name.
    
    store_key:
      Description: An identifier representing the store or location associated with the fact.
      Use Case: This key links to a store dimension table, providing details about the store, such as location, store name, and other store-specific information.
    
    quantity:
      Description: The quantity of items sold or involved in the transaction.
      Use Case: Represents the amount or number of items associated with the transaction.
    
    unit:
      Description: The unit or measurement associated with the quantity (e.g., pieces, kilograms).
      Use Case: Specifies the unit of measurement for the quantity.
    
    unit_price:
      Description: The price per unit of the item.
      Use Case: Represents the cost or price associated with each unit of the item.
    
    total_price:
      Description: The total price of the transaction, calculated as the product of quantity and unit price.
      Use Case: Represents the overall cost or revenue generated by the transaction.
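
    The fact-table definition above (total_price as quantity times unit_price, with keys pointing at dimension tables) can be checked and used along these lines. This is a sketch on invented rows: the key values and item names are hypothetical, and the item dimension is reduced to the item_name column described for the Item Table in this entry.

```python
import math

# Hypothetical fact rows; the third row is deliberately inconsistent.
fact_rows = [
    {"item_key": "I1", "store_key": "S1", "quantity": 3, "unit_price": 4.0, "total_price": 12.0},
    {"item_key": "I2", "store_key": "S1", "quantity": 2, "unit_price": 5.5, "total_price": 11.0},
    {"item_key": "I1", "store_key": "S2", "quantity": 1, "unit_price": 4.0, "total_price": 5.0},
]
# Hypothetical item dimension keyed by item_key.
item_dim = {"I1": {"item_name": "Tea"}, "I2": {"item_name": "Coffee"}}


def inconsistent_rows(rows, tol=1e-6):
    """Return rows where total_price differs from quantity * unit_price."""
    return [
        r for r in rows
        if not math.isclose(r["quantity"] * r["unit_price"], r["total_price"], abs_tol=tol)
    ]


def revenue_by_item(rows, items):
    """Join fact rows to the item dimension and sum total_price per item_name."""
    totals = {}
    for r in rows:
        name = items[r["item_key"]]["item_name"]
        totals[name] = totals.get(name, 0.0) + r["total_price"]
    return totals


bad_rows = inconsistent_rows(fact_rows)
revenue = revenue_by_item(fact_rows, item_dim)
```

    Running the consistency check before aggregating helps catch rows where the stored total does not match its components.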
    

    Customer Table:

    customer_key:

    Description: An identifier representing a unique customer.
    Use Case: Serves as the primary key to link with the fact table, allowing for easy and efficient retrieval of customer-specific information.
    

    name:

    Description: The name of the customer.
    Use Case: Captures the personal or business name of the customer for identification and reference purposes.
    

    contact_no:

    Description: The contact number associated with the customer.
    Use Case: Stores the phone number or contact details for communication or outreach purposes.
    

    nid:

    Description: The National ID (NID) or a unique identification number for the customer.
    

    Item Table:

    item_key:

    Description: An identifier representing a unique item or product.
    Use Case: Serves as the primary key to link with the fact table, enabling retrieval of detailed information about specific items in transactions.
    

    item_name:

    Description: The name or title of the item.
    Use Case: Captures the descriptive name of the item, providing a recognizable label for the product.
    

    desc:

    Description: A description of the item.
    Use Case: Contains additional details about the item, such as features, specifications, or any relevant information.
    

    unit_price:

    Description: The price per unit of the item.
    Use Case: Represents the cost or price associated with each unit of the item.
    

    man_country:

    Description: The country where the item is manufactured.
    Use Case: Captures the origin or manufacturing location of the item.
    

    supplier:

    Description: The supplier or vendor providing the item.
    Use Case: Stores the name or identifier of the supplier, facilitating tracking of item sources.
    

    unit:

    Description: The unit of measurement associated with the item (e.g., pieces, kilograms).
    

    Store Table:

    store_key:

    Description: An identifier representing a unique store or location.
    Use Case: Serves as the primary key to link with the fact table, allowing for easy retrieval of information about transactions associated with specific stores.
    

    division:

    Description: The administrative division or region where the store is located.
    Use Case: Captures the broader geographical area in which...
    
  6. Bitext-retail-ecommerce-llm-chatbot-training-dataset

    • huggingface.co
    Updated Aug 6, 2024
    Cite
    Bitext (2024). Bitext-retail-ecommerce-llm-chatbot-training-dataset [Dataset]. https://huggingface.co/datasets/bitext/Bitext-retail-ecommerce-llm-chatbot-training-dataset
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Aug 6, 2024
    Dataset authored and provided by
    Bitext
    License

    https://choosealicense.com/licenses/cdla-sharing-1.0/

    Description

    Bitext - Retail (eCommerce) Tagged Training Dataset for LLM-based Virtual Assistants

      Overview
    

    This hybrid synthetic dataset is designed to be used to fine-tune Large Language Models such as GPT, Mistral and OpenELM, and has been generated using our NLP/NLG technology and our automated Data Labeling (DAL) tools. The goal is to demonstrate how Verticalization/Domain Adaptation for the [Retail (eCommerce)] sector can be easily achieved using our two-step approach to LLM… See the full description on the dataset page: https://huggingface.co/datasets/bitext/Bitext-retail-ecommerce-llm-chatbot-training-dataset.

  7. E-Commerce Retail Sales as a Percent of Total Sales

    • fred.stlouisfed.org
    json
    Updated Aug 19, 2025
    Cite
    (2025). E-Commerce Retail Sales as a Percent of Total Sales [Dataset]. https://fred.stlouisfed.org/series/ECOMPCTSA
    Explore at:
    Available download formats: json
    Dataset updated
    Aug 19, 2025
    License

    https://fred.stlouisfed.org/legal/#copyright-public-domain

    Description

    Graph and download economic data for E-Commerce Retail Sales as a Percent of Total Sales (ECOMPCTSA) from Q4 1999 to Q2 2025 about e-commerce, retail trade, percent, sales, retail, and USA.

  8. Data from: Shopee Dataset

    • brightdata.com
    .json, .csv, .xlsx
    Updated Apr 16, 2024
    Cite
    Bright Data (2024). Shopee Dataset [Dataset]. https://brightdata.com/products/datasets/shopee
    Explore at:
    Available download formats: .json, .csv, .xlsx
    Dataset updated
    Apr 16, 2024
    Dataset authored and provided by
    Bright Data (https://brightdata.com/)
    License

    https://brightdata.com/license

    Area covered
    Worldwide
    Description

    The Shopee Products Dataset is a comprehensive resource that empowers businesses, researchers, and analysts to gain a holistic view of the Shopee e-commerce ecosystem. Whether your goal is to conduct market analysis, optimize pricing strategies, understand customer behavior, or evaluate competitors, this dataset offers the essential information you need to make informed decisions and succeed in the dynamic world of Shopee. At its core, this dataset provides key attributes such as product ID, title, ratings, reviews, pricing details, and seller information, among others. These fundamental data elements offer insights into product performance, customer sentiment, and seller credibility.

  9. Ecommerce-FAQ-Chatbot-Dataset

    • kaggle.com
    Updated May 19, 2023
    Cite
    Muhammad Saad Makhdoom (2023). Ecommerce-FAQ-Chatbot-Dataset [Dataset]. https://www.kaggle.com/datasets/saadmakhdoom/ecommerce-faq-chatbot-dataset
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    May 19, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Muhammad Saad Makhdoom
    Description

    Dataset

    This dataset was created by Muhammad Saad Makhdoom

    Contents

  10. Walmart basic product details dataset

    • crawlfeeds.com
    csv, zip
    Updated Jul 28, 2024
    Cite
    Crawl Feeds (2024). Walmart basic product details dataset [Dataset]. https://crawlfeeds.com/datasets/walmart-basic-product-details-dataset
    Explore at:
    Available download formats: csv, zip
    Dataset updated
    Jul 28, 2024
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policy

    Description

    Get access to the Walmart Basic Product Details Dataset, which includes essential information on a wide range of products available at Walmart.

    This comprehensive dataset features product names, categories, descriptions, prices, and more. Ideal for market analysis, competitive research, and e-commerce applications.

    Download now to enhance your data-driven strategies and insights with detailed Walmart product information.

    The dataset contains basic product details such as title, id, image, price, and description.

    Records count: 2.5 million+

  11. Ecommerce Merchant Data | Retail Executives Worldwide | Verified Emails &...

    • datarade.ai
    Updated Oct 27, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Success.ai (2021). Ecommerce Merchant Data | Retail Executives Worldwide | Verified Emails & Phone Numbers for Global Leaders | Best Price Guaranteed [Dataset]. https://datarade.ai/data-products/ecommerce-merchant-data-retail-executives-worldwide-verif-success-ai
    Explore at:
    Available download formats: .bin, .json, .xml, .csv, .xls, .sql, .txt
    Dataset updated
    Oct 27, 2021
    Dataset provided by
    Area covered
    Denmark, Sint Eustatius and Saba, Papua New Guinea, Saudi Arabia, Nigeria, Aruba, India, Faroe Islands, Benin, Hungary
    Description

    Success.ai’s B2B Contact Data and Ecommerce Merchant Data for Retail Executives Worldwide provides a powerful solution for businesses looking to connect with decision-makers in the retail industry. With access to over 170 million verified professional profiles, this dataset includes the contact information you need to build relationships with retail executives globally. Whether you're targeting C-level leaders, operations managers, or marketing heads, Success.ai’s data ensures precise and impactful outreach.

    Why Choose Success.ai’s Retail Executives Data?

    1. Comprehensive Contact Information:
      • Gain access to verified work emails, direct phone numbers, and LinkedIn profiles of retail executives worldwide.
      • Data is AI-validated to ensure 99% accuracy for all your outreach efforts.

    2. Global Reach Across Retail Sectors:
      • Includes executives from sectors like e-commerce, fashion, grocery, electronics, and luxury goods.
      • Covers regions such as North America, Europe, Asia-Pacific, South America, and the Middle East.

    3. Continuously Updated Datasets:
      • Real-time updates ensure accurate and current information about retail professionals in leadership roles.

    4. Compliance You Can Trust:
      • Fully adheres to GDPR, CCPA, and other global privacy regulations, ensuring ethical data use.

    Data Highlights:
    • 170M+ Verified Professional Profiles: Drawn from diverse industries, including retail.
    • 50M Work Emails: AI-validated for high accuracy and reliability.
    • 30M Company Profiles: Detailed insights to support targeted campaigns.
    • 700M Global Professional Profiles: Enriched datasets to meet broad business objectives.

    Key Features of the Dataset:
    • Retail Decision-Maker Profiles: Includes profiles of CEOs, CFOs, CMOs, buyers, and merchandising directors.
    • Advanced Filters for Targeting: Refine your search by location, role, revenue, or retail category for optimal results.
    • AI-Driven Insights: Enriches profiles with valuable data to personalize and enhance your outreach.

    Strategic Use Cases:

    1. Sales Outreach and Business Growth:
      • Directly engage retail leaders with tailored pitches to introduce your products or services.
      • Build relationships with executives who influence major purchasing decisions.

    2. Recruitment for Retail Talent:
      • Identify top retail professionals to fill critical leadership roles.
      • Connect with candidates using updated and accurate contact information.

    3. Targeted Marketing Campaigns:
      • Craft highly personalized campaigns aimed at retail decision-makers.
      • Leverage detailed contact data for better conversion rates.

    4. Retail Technology Solutions:
      • Present technology solutions like POS systems, inventory tools, or e-commerce platforms to relevant retail executives.
      • Build connections with leaders looking to innovate their businesses.

    Why Choose Success.ai?

    1. Best Price Guarantee: Access premium-quality contact data at competitive prices.
    2. Seamless Integration: Download the dataset or integrate it via API for easy access.
    3. AI-Validated Accuracy: Confidence in data quality with AI-driven validation for maximum reliability.
    4. Customizable and Scalable Datasets: Filter datasets based on your specific industry, region, or executive role requirements.

    APIs for Enhanced Functionality

    1. Data Enrichment API: Add verified retail executive profiles to your CRM for better targeting.
    2. Lead Generation API: Automate lead generation for retail-focused campaigns.

    Unlock opportunities with B2B Contact Data for Retail Executives Worldwide from Success.ai. This dataset includes verified emails, phone numbers, and decision-maker profiles for leaders in the retail industry.

    With continuously updated data and a Best Price Guarantee, Success.ai ensures you have everything you need to connect with global retail executives effectively. Contact us now to elevate your business with precise and reliable data!

    No one beats us on price. Period.

  12. Google Restaurants dataset

    • cseweb.ucsd.edu
    csv
    Cite
    UCSD CSE Research Project, Google Restaurants dataset [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets.html
    Explore at:
    Available download formats: csv
    Dataset authored and provided by
    UCSD CSE Research Project
    Description

    This is a multi-modal dataset for restaurants from Google Local (Google Maps). Data includes images and reviews posted by users, as well as metadata for each restaurant.

  13. PDMX

    • cseweb.ucsd.edu
    json
    Cite
    UCSD CSE Research Project, PDMX [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets.html
    Explore at:
    Available download formats: json
    Dataset authored and provided by
    UCSD CSE Research Project
    Description

    We introduce PDMX: a Public Domain MusicXML dataset for symbolic music processing, including over 250k musical scores in MusicXML format. PDMX is the largest publicly available, copyright-free MusicXML dataset in existence. PDMX includes genre, tag, description, and popularity metadata for every file.

  14. Steam Video Game and Bundle Data

    • cseweb.ucsd.edu
    json
    Cite
    UCSD CSE Research Project, Steam Video Game and Bundle Data [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets.html
    Explore at:
    Available download formats: json
    Dataset authored and provided by
    UCSD CSE Research Project
    Description

    These datasets contain reviews from the Steam video game platform, and information about which games were bundled together.

    Metadata includes

    • reviews

    • purchases, plays, recommends (likes)

    • product bundles

    • pricing information

    Basic Statistics:

    • Reviews: 7,793,069

    • Users: 2,567,538

    • Items: 15,474

    • Bundles: 615

  15. Virtual Try-On Dataset

    • gts.ai
    jpg, json, png
    Updated Jul 15, 2024
    Cite
    GTS (2024). Virtual Try-On Dataset [Dataset]. https://gts.ai/dataset-download/virtual-try-on-dataset/
    Explore at:
    Available download formats: json, png, jpg
    Dataset updated
    Jul 15, 2024
    Dataset provided by
    GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
    Authors
    GTS
    License

    CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
    License information was derived automatically

    Variables measured
    Individuals wearing garments, Masked images for segmentation, Garments (variety of categories), OpenPose keypoints for pose estimation
    Description

    The Virtual Try-On Dataset is a large-scale benchmark designed for computer vision research in fashion, e-commerce, and augmented reality. It includes 13,679 high-quality images featuring garments and individuals wearing them. Additional data resources include masked images and OpenPose annotations for pose estimation. This dataset is widely used in state-of-the-art research for developing and evaluating virtual try-on models, offering diversity across garments, poses, and individuals.

  16. Behance Community Art Data

    • cseweb.ucsd.edu
    json
    Cite
    UCSD CSE Research Project, Behance Community Art Data [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets.html
    Explore at:
    Available download formats: json
    Dataset authored and provided by
    UCSD CSE Research Project
    Description

    Likes and image data from the community art website Behance. This is a small, anonymized version of a larger proprietary dataset.

    Metadata includes

    • appreciates (likes)

    • timestamps

    • extracted image features

    Basic Statistics:

    • Users: 63,497

    • Items: 178,788

    • Appreciates (likes): 1,000,000

  17. US Consumer Demographic Data - 269M+ Consumer Records - Programmatic Ads and...

    • datarade.ai
    Updated Jun 27, 2025
    Cite
    Giant Partners (2025). US Consumer Demographic Data - 269M+ Consumer Records - Programmatic Ads and Email Marketing Automation [Dataset]. https://datarade.ai/data-products/us-consumer-demographic-data-269m-consumer-records-progr-giant-partners
    Explore at:
    Dataset updated
    Jun 27, 2025
    Dataset authored and provided by
    Giant Partners
    Area covered
    United States
    Description

    Premium B2C Consumer Database - 269+ Million US Records

    Supercharge your B2C marketing campaigns with a comprehensive consumer database featuring over 269 million verified US consumer records. Our 20+ years of data expertise deliver higher quality and more extensive coverage than competitors.

    Core Database Statistics

    Consumer Records: Over 269 million

    Email Addresses: Over 160 million (verified and deliverable)

    Phone Numbers: Over 76 million (mobile and landline)

    Mailing Addresses: Over 116,000,000 (NCOA processed)

    Geographic Coverage: Complete US (all 50 states)

    Compliance Status: CCPA compliant with consent management

    Targeting Categories Available

    Demographics: Age ranges, education levels, occupation types, household composition, marital status, presence of children, income brackets, and gender (where legally permitted)

    Geographic: Nationwide, state-level, MSA (Metropolitan Statistical Area), zip code radius, city, county, and SCF range targeting options

    Property & Dwelling: Home ownership status, estimated home value, years in residence, property type (single-family, condo, apartment), and dwelling characteristics

    Financial Indicators: Income levels, investment activity, mortgage information, credit indicators, and wealth markers for premium audience targeting

    Lifestyle & Interests: Purchase history, donation patterns, political preferences, health interests, recreational activities, and hobby-based targeting

    Behavioral Data: Shopping preferences, brand affinities, online activity patterns, and purchase timing behaviors

    Multi-Channel Campaign Applications

    Deploy across all major marketing channels:

    Email marketing and automation

    Social media advertising

    Search and display advertising (Google, YouTube)

    Direct mail and print campaigns

    Telemarketing and SMS campaigns

    Programmatic advertising platforms

    Data Quality & Sources

    Our consumer data aggregates from multiple verified sources:

    Public records and government databases

    Opt-in subscription services and registrations

    Purchase transaction data from retail partners

    Survey participation and research studies

    Online behavioral data (privacy compliant)

    Technical Delivery Options

    File Formats: CSV, Excel, JSON, XML formats available

    Delivery Methods: Secure FTP, API integration, direct download

    Processing: Real-time NCOA, email validation, phone verification

    Custom Selections: 1,000+ selectable demographic and behavioral attributes

    Minimum Orders: Flexible based on targeting complexity

    Unique Value Propositions

    Dual Spouse Targeting: Reach both household decision-makers for maximum impact

    Cross-Platform Integration: Seamless deployment to major ad platforms

    Real-Time Updates: Monthly data refreshes ensure maximum accuracy

    Advanced Segmentation: Combine multiple targeting criteria for precision campaigns

    Compliance Management: Built-in opt-out and suppression list management

    Ideal Customer Profiles

    E-commerce retailers seeking customer acquisition

    Financial services companies targeting specific demographics

    Healthcare organizations with compliant marketing needs

    Automotive dealers and service providers

    Home improvement and real estate professionals

    Insurance companies and agents

    Subscription services and SaaS providers

    Performance Optimization Features

    Lookalike Modeling: Create audiences similar to your best customers

    Predictive Scoring: Identify high-value prospects using AI algorithms

    Campaign Attribution: Track performance across multiple touchpoints

    A/B Testing Support: Split audiences for campaign optimization

    Suppression Management: Automatic opt-out and DNC compliance

    Pricing & Volume Options

    Flexible pricing structures accommodate businesses of all sizes:

    Pay-per-record for small campaigns

    Volume discounts for large deployments

    Subscription models for ongoing campaigns

    Custom enterprise pricing for high-volume users

    Data Compliance & Privacy

    VIA.tools maintains industry-leading compliance standards:

    CCPA (California Consumer Privacy Act) compliant

    CAN-SPAM Act adherence for email marketing

    TCPA compliance for phone and SMS campaigns

    Regular privacy audits and data governance reviews

    Transparent opt-out and data deletion processes

    Getting Started

    Our data specialists work with you to:

    1. Define your target audience criteria

    2. Recommend optimal data selections

    3. Provide sample data for testing

    4. Configure delivery methods and formats

    5. Implement ongoing campaign optimization

    Why We Lead the Industry

    With over two decades of data industry experience, we combine extensive database coverage with advanced targeting capabilities. Our commitment to data quality, compliance, and customer success has made us the preferred choice for businesses seeking superior B2C marketing performance.

    Contact our team to discuss your specific ta...

  18. Amazon Question and Answer Data

    • cseweb.ucsd.edu
    json
    Cite
    UCSD CSE Research Project, Amazon Question and Answer Data [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets.html
    Explore at:
    Available download formats: json
    Dataset authored and provided by
    UCSD CSE Research Project
    Description

    These datasets contain 1.48 million question and answer pairs about products from Amazon.

    Metadata includes

    • question and answer text

    • is the question binary (yes/no), and if so does it have a yes/no answer?

    • timestamps

    • product ID (to reference the review dataset)

    Basic Statistics:

    • Questions: 1.48 million

    • Answers: 4,019,744

    • Labeled yes/no questions: 309,419

    • Number of unique products with questions: 191,185

  19. Data from: Laptops Dataset

    • kaggle.com
    Updated Feb 21, 2023
    Cite
    Sushmita (2023). Laptops Dataset [Dataset]. https://www.kaggle.com/datasets/sushmita36/laptops-dataset
    Explore at:
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 21, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Sushmita
    Description

    Description: This dataset contains information about various laptops available in the market. The data was collected by scraping information from an e-commerce website. The dataset includes features such as the model name, brand, processor name, RAM, SSD, hard disk size, operating system, graphics card, screen size, resolution, number of cores, number of threads, and a spec score. The price of each laptop is also included in the dataset.

    This dataset contains the following columns:

    • model_name: The name of the laptop model

    • brand: The brand of the laptop

    • processor_name: The name of the processor used in the laptop

    • ram(GB): The amount of RAM (in GB) in the laptop

    • ssd(GB): The size of the Solid State Drive (in GB) in the laptop, with 0 indicating no SSD

    • Hard Disk(GB): The size of the Hard Disk Drive (in GB) in the laptop, with 0 indicating no hard disk

    • Operating System: The operating system installed on the laptop

    • graphics: The name of the graphics card used in the laptop

    • screen_size(inches): The size of the screen (in inches) of the laptop

    • resolution (pixels): The screen resolution (in pixels) of the laptop

    • no_of_cores: The number of processor cores in the laptop, with 0 indicating missing information

    • no_of_threads: The number of threads in the processor of the laptop, with 0 indicating missing information

    • spec_score: A score based on the specifications of the laptop, where a higher score indicates better performance, with 0 indicating missing information

    • price: The price of the laptop

    Use cases: This dataset can be used to analyze the laptop market and understand which features are most important to consumers. It can also be used to predict the price of a laptop based on its specifications or to recommend laptops to users based on their preferences.
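
Because several columns use 0 to mean "missing information", averaging them directly would bias the result. The sketch below shows one way to handle that with the Python standard library. The sample rows are invented for illustration and are not taken from the dataset; only the column names come from the description above, and the helper names `load_rows` and `mean_by_brand` are hypothetical.

```python
import csv
import io
from statistics import mean

# Illustrative rows only (not from the dataset); column names follow the description.
SAMPLE_CSV = """model_name,brand,no_of_cores,no_of_threads,spec_score,price
Alpha 14,BrandA,4,8,60,50000
Beta 15,BrandA,0,0,0,40000
Gamma 13,BrandB,8,16,75,90000
"""

# Columns where the description says 0 means "missing information".
ZERO_MEANS_MISSING = ("no_of_cores", "no_of_threads", "spec_score")


def load_rows(text):
    """Parse CSV text, converting numeric fields and mapping 0 to None where 0 means missing."""
    rows = []
    for raw in csv.DictReader(io.StringIO(text)):
        row = dict(raw)
        row["price"] = float(row["price"])
        for col in ZERO_MEANS_MISSING:
            value = int(row[col])
            row[col] = value if value != 0 else None
        rows.append(row)
    return rows


def mean_by_brand(rows, column):
    """Average `column` per brand, skipping rows where the value is missing."""
    groups = {}
    for row in rows:
        if row[column] is not None:
            groups.setdefault(row["brand"], []).append(row[column])
    return {brand: mean(values) for brand, values in groups.items()}


rows = load_rows(SAMPLE_CSV)
spec_by_brand = mean_by_brand(rows, "spec_score")
price_by_brand = mean_by_brand(rows, "price")
```

With the real file, the same functions could be applied to its text, provided the header matches the column names listed above; that match is an assumption, not something the description confirms.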

  20. Marketing Bias data

    • cseweb.ucsd.edu
    json
    Cite
    UCSD CSE Research Project, Marketing Bias data [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets.html
    Explore at:
    Available download formats: json
    Dataset authored and provided by
    UCSD CSE Research Project
    Description

    These datasets contain attributes about products sold on ModCloth and Amazon which may be sources of bias in recommendations (in particular, attributes about how the products are marketed). Data also includes user/item interactions for recommendation.

    Metadata includes

    • ratings

    • product images

    • user identities

    • item sizes, user genders

Cite
The Devastator (2022). E-Commerce Sales Dataset [Dataset]. https://www.kaggle.com/datasets/thedevastator/unlock-profits-with-e-commerce-sales-data/code

E-Commerce Sales Dataset

Analyzing and Maximizing Online Business Performance

Explore at:
Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 3, 2022
Dataset provided by
Kaggle (http://kaggle.com/)
Authors
The Devastator
Description

By ANil [source]

About this dataset

This dataset provides an in-depth look at the profitability of e-commerce sales. It contains data on a variety of sales channels, including Shiprocket and INCREFF, as well as financial information on related expenses and profits. The columns contain data such as SKU codes, design numbers, stock levels, product categories, sizes, and colors. It also includes the MRPs across multiple stores (Ajio MRP, Amazon MRP, Amazon FBA MRP, Flipkart MRP, Limeroad MRP, Myntra MRP, and Paytm MRP), along with other key parameters such as the amount paid by the customer for the purchase and the rate per piece for every individual transaction. Transactional parameters are included as well: date of sale, months, category, fulfilledby, B2B, Status, Qty, Currency, and Gross amt. The dataset is intended for anyone trying to uncover the profitability of e-commerce sales in today's marketplace.

How to use the dataset

This dataset provides a comprehensive overview of e-commerce sales data from different channels covering a variety of products. Using this dataset, retailers and digital marketers can measure the performance of their campaigns more accurately and efficiently.

The following steps help users make the most out of this dataset:
- Analyze the general sales trends by examining fields such as month, category, currency, stock level, and customer for each sale. This gives an idea of how an e-commerce business is performing in each channel.
- Review the Shiprocket and INCREFF data to compare profitability across fulfilment methods. This comparison can inform decisions that maximize profit while minimizing the costs associated with each method's referral fees and fulfillment rates.
- Compare prices between channels such as Amazon FBA MRP, Myntra MRP, and Ajio MRP using the corresponding column for each store. Analyzing these price points together with other sales information (TP1/TP2, cost per piece) indicates which stores offer more profitable margins.
- Look at customer-specific data, such as Gross Amount or Rate by TP 1/TP 2 combination (price per piece, or total gross amount generated by an SKU across multiple customers), together with the associated dates, to track the performance of an individual item relative to others in its category over suitably filtered time periods. Keep an eye on items commonly sold under offers or promotional discounts when crafting strategies for inventory optimization and up-selling.
- Finally, use the overall 'Stock' details together with the P & L data, including the Yearly Expenses_IIGF record, to identify essential cost-cutting measures, such as choosing carefully between the Shiprocket and INCREFF delivery options or reducing reliance on manual inspections and outsourced support personnel.
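
The price-comparison step above can be sketched as follows. This is a minimal illustration, assuming that "TP 1" is a per-piece cost and that each store's MRP sits in its own column. The `Sku` key, the sample values, and the helper names `channel_margins` and `best_channel` are hypothetical and not taken from the dataset. The "margin" here is simply MRP minus cost and ignores referral fees and fulfilment charges.

```python
# Hypothetical rows; column names mirror those mentioned in the description, values are made up.
rows = [
    {"Sku": "SKU-001", "TP 1": 500.0, "Amazon MRP": 1200.0, "Myntra MRP": 1150.0, "Ajio MRP": 1250.0},
    {"Sku": "SKU-002", "TP 1": 800.0, "Amazon MRP": 1500.0, "Myntra MRP": 1600.0, "Ajio MRP": 1550.0},
]

# Store-specific price columns to compare (an assumed subset of the available channels).
CHANNEL_COLUMNS = ("Amazon MRP", "Myntra MRP", "Ajio MRP")


def channel_margins(row, cost_column="TP 1"):
    """Return {channel column: MRP minus cost per piece} for one SKU."""
    return {col: row[col] - row[cost_column] for col in CHANNEL_COLUMNS}


def best_channel(row):
    """Return (channel column, margin) for the channel with the largest margin."""
    margins = channel_margins(row)
    channel = max(margins, key=margins.get)
    return channel, margins[channel]


# Map each SKU to its highest-margin channel under the assumptions above.
best = {row["Sku"]: best_channel(row) for row in rows}
```

Because every channel shares the same cost per piece in this sketch, the ranking reduces to the highest MRP; a fuller comparison would subtract channel-specific fees, which the description mentions but does not specify.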

With a thorough understanding of how each channel performs, supported by audits where needed, it may be possible to lower operational costs (for example, relative to what is currently incurred in tracking entire pallet shipments), cut previously unseen losses, and then scale profits by introducing new marketing campaigns tailored to different product ranges within suitable transportation constraints.

Research Ideas

  • Analysing the difference in profitability between sales made through Shiprocket and INCREFF. This data can be used to see where the biggest profit margins lie, and strategize accordingly.
  • Examining the complete cost structure of a product, with all its components and their contribution towards revenue or profitability, i.e., TP 1 & 2, MRP Old and Final MRP Old, together with platform-based MRPs (Amazon, Myntra, Paytm, etc.), currency-based profit margin, etc.
  • Building a predictive model using Machine Learning by leveraging historical data to predict future sales volume and profits for e-commerce products across multiple categories/devices/platforms such as Amazon, Flipkart, Myntra etc as well providing m...