MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
• I leveraged advanced data visualization techniques to extract valuable insights from a comprehensive dataset. By visualizing sales patterns, customer behavior, and product trends, I identified key growth opportunities and provided actionable recommendations to optimize business strategies and enhance overall performance. you can find the GitHub repo here Link to GitHub Repository.
there are exactly 6 table and 1 is a fact table and the rest of them are dimension tables: Fact Table:
payment_key:
Description: An identifier representing the payment transaction associated with the fact.
Use Case: This key links to a payment dimension table, providing details about the payment method and related information.
customer_key:
Description: An identifier representing the customer associated with the fact.
Use Case: This key links to a customer dimension table, providing details about the customer, such as name, address, and other customer-specific information.
time_key:
Description: An identifier representing the time dimension associated with the fact.
Use Case: This key links to a time dimension table, providing details about the time of the transaction, such as date, day of the week, and month.
item_key:
Description: An identifier representing the item or product associated with the fact.
Use Case: This key links to an item dimension table, providing details about the product, such as category, sub-category, and product name.
store_key:
Description: An identifier representing the store or location associated with the fact.
Use Case: This key links to a store dimension table, providing details about the store, such as location, store name, and other store-specific information.
quantity:
Description: The quantity of items sold or involved in the transaction.
Use Case: Represents the amount or number of items associated with the transaction.
unit:
Description: The unit or measurement associated with the quantity (e.g., pieces, kilograms).
Use Case: Specifies the unit of measurement for the quantity.
unit_price:
Description: The price per unit of the item.
Use Case: Represents the cost or price associated with each unit of the item.
total_price:
Description: The total price of the transaction, calculated as the product of quantity and unit price.
Use Case: Represents the overall cost or revenue generated by the transaction.
Customer Table: customer_key:
Description: An identifier representing a unique customer.
Use Case: Serves as the primary key to link with the fact table, allowing for easy and efficient retrieval of customer-specific information.
name:
Description: The name of the customer.
Use Case: Captures the personal or business name of the customer for identification and reference purposes.
contact_no:
Description: The contact number associated with the customer.
Use Case: Stores the phone number or contact details for communication or outreach purposes.
nid:
Description: The National ID (NID) or a unique identification number for the customer.
Item Table: item_key:
Description: An identifier representing a unique item or product.
Use Case: Serves as the primary key to link with the fact table, enabling retrieval of detailed information about specific items in transactions.
item_name:
Description: The name or title of the item.
Use Case: Captures the descriptive name of the item, providing a recognizable label for the product.
desc:
Description: A description of the item.
Use Case: Contains additional details about the item, such as features, specifications, or any relevant information.
unit_price:
Description: The price per unit of the item.
Use Case: Represents the cost or price associated with each unit of the item.
man_country:
Description: The country where the item is manufactured.
Use Case: Captures the origin or manufacturing location of the item.
supplier:
Description: The supplier or vendor providing the item.
Use Case: Stores the name or identifier of the supplier, facilitating tracking of item sources.
unit:
Description: The unit of measurement associated with the item (e.g., pieces, kilograms).
Store Table: store_key:
Description: An identifier representing a unique store or location.
Use Case: Serves as the primary key to link with the fact table, allowing for easy retrieval of information about transactions associated with specific stores.
division:
Description: The administrative division or region where the store is located.
Use Case: Captures the broader geographical area in which...
By ANil [source]
This dataset provides an in-depth look at the profitability of e-commerce sales. It contains data on a variety of sales channels, including Shiprocket and INCREFF, as well as financial information on related expenses and profits. The columns contain data such as SKU codes, design numbers, stock levels, product categories, sizes and colors. In addition to this we have included the MRPs across multiple stores like Ajio MRP , Amazon MRP , Amazon FBA MRP , Flipkart MRP , Limeroad MRP Myntra MRP and PaytmMRP along with other key parameters like amount paid by customer for the purchase , rate per piece for every individual transaction Also we have added transactional parameters like Date of sale months category fulfilledby B2b Status Qty Currency Gross amt . This is a must-have dataset for anyone trying to uncover the profitability of e-commerce sales in today's marketplace
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset provides a comprehensive overview of e-commerce sales data from different channels covering a variety of products. Using this dataset, retailers and digital marketers can measure the performance of their campaigns more accurately and efficiently.
The following steps help users make the most out of this dataset: - Analyze the general sales trends by examining info such as month, category, currency, stock level, and customer for each sale. This will give you an idea about how your e-commerce business is performing in each channel.
- Review the Shiprocket and INCREF data to compare and analyze profitability via different fulfilment methods. This comparison would enable you to make better decisions towards maximizing profit while minimizing costs associated with each method’s referral fees and fulfillment rates.
- Compare prices between various channels such as Amazon FBA MRP, Myntra MRP, Ajio MRP etc using the corresponding columns for each store (Amazon MRP etc). You can judge which stores are offering more profitable margins without compromising on quality by analyzing these pricing points in combination with other information related to product sales (TP1/TP2 - cost per piece).
- Look at customer specific data such as TP 1/TP 2 combination wise Gross Amount or Rate info in terms price per piece or total gross amount generated by any SKU dispersed over multiple customers with relevant dates associated to track individual item performance relative to others within its category over time periods shortlisted/filtered appropriately.. Have an eye on items commonly utilized against offers or promotional discounts offered hence crafting strategies towards inventory optimization leading up-selling operations.?
- Finally Use Overall ‘Stock’ details along all the P & L Data including Yearly Expenses_IIGF information record for takeaways which might be aimed towards essential cost cutting measures like switching amongst delivery options carefully chosen out of Shiprocket & INCREFF leadings away from manual inspections catering savings under support personnel outsourcing structures.?By employing a comprehensive understanding on how our internal subsidiaries perform globally unless attached respective audits may provide us remarkably lower operational costs servicing confidence; costing far lesser than being incurred taking into account entire pallet shipments tracking sheets representing current level supply chains efficiencies achieved internally., then one may finally scale profits exponentially increases cut down unseen losses followed up introducing newer marketing campaigns necessarily tailored according playing around multiple goods based spectrums due powerful backing suitable transportation boundaries set carefully
- Analysing the difference in profitability between sales made through Shiprocket and INCREFF. This data can be used to see where the biggest profit margins lie, and strategize accordingly.
- Examining the Complete Cost structure of a product with all its components and their contribution towards revenue or profitability, i.e., TP 1 & 2, MRP Old & Final MRP Old together with Platform based MRP - Amazon, Myntra and Paytm etc., Currency based Profit Margin etc.
- Building a predictive model using Machine Learning by leveraging historical data to predict future sales volume and profits for e-commerce products across multiple categories/devices/platforms such as Amazon, Flipkart, Myntra etc as well providing m...
Success.ai’s Ecommerce Market Data for South-east Asia E-commerce Contacts provides a robust and accurate dataset tailored for businesses and organizations looking to connect with professionals in the fast-growing e-commerce industry across South-east Asia. Covering roles such as e-commerce managers, digital strategists, logistics experts, and online marketplace leaders, this dataset offers verified contact details, professional insights, and actionable market data.
With access to over 170 million verified profiles globally, Success.ai ensures your outreach, marketing, and research strategies are powered by accurate, continuously updated, and AI-validated data. Backed by our Best Price Guarantee, this solution empowers you to excel in one of the world’s most dynamic e-commerce regions.
Why Choose Success.ai’s Ecommerce Market Data?
Verified Contact Data for Precision Outreach
Comprehensive Coverage of South-east Asia’s E-commerce Market
Continuously Updated Datasets
Ethical and Compliant
Data Highlights:
Key Features of the Dataset:
Comprehensive Professional Profiles in E-commerce
Advanced Filters for Precision Campaigns
Regional and Market-specific Insights
AI-Driven Enrichment
Strategic Use Cases:
Marketing Campaigns and Digital Outreach
Market Research and Competitive Analysis
Partnership Development and Vendor Collaboration
Recruitment and Talent Acquisition
Why Choose Success.ai?
Best Price Guarantee
Seamless Integration
CSV version of Looker Ecommerce Dataset.
Overview Dataset in BigQuery TheLook is a fictitious eCommerce clothing site developed by the Looker team. The dataset contains information >about customers, products, orders, logistics, web events and digital marketing campaigns. The contents of this >dataset are synthetic, and are provided to industry practitioners for the purpose of product discovery, testing, and >evaluation. This public dataset is hosted in Google BigQuery and is included in BigQuery's 1TB/mo of free tier processing. This >means that each user receives 1TB of free BigQuery processing every month, which can be used to run queries on >this public dataset. Watch this short video to learn how to get started quickly using BigQuery to access public >datasets.
distribution_centers.csv
id
: Unique identifier for each distribution center.name
: Name of the distribution center.latitude
: Latitude coordinate of the distribution center.longitude
: Longitude coordinate of the distribution center.events.csv
id
: Unique identifier for each event.user_id
: Identifier for the user associated with the event.sequence_number
: Sequence number of the event.session_id
: Identifier for the session during which the event occurred.created_at
: Timestamp indicating when the event took place.ip_address
: IP address from which the event originated.city
: City where the event occurred.state
: State where the event occurred.postal_code
: Postal code of the event location.browser
: Web browser used during the event.traffic_source
: Source of the traffic leading to the event.uri
: Uniform Resource Identifier associated with the event.event_type
: Type of event recorded.inventory_items.csv
id
: Unique identifier for each inventory item.product_id
: Identifier for the associated product.created_at
: Timestamp indicating when the inventory item was created.sold_at
: Timestamp indicating when the item was sold.cost
: Cost of the inventory item.product_category
: Category of the associated product.product_name
: Name of the associated product.product_brand
: Brand of the associated product.product_retail_price
: Retail price of the associated product.product_department
: Department to which the product belongs.product_sku
: Stock Keeping Unit (SKU) of the product.product_distribution_center_id
: Identifier for the distribution center associated with the product.order_items.csv
id
: Unique identifier for each order item.order_id
: Identifier for the associated order.user_id
: Identifier for the user who placed the order.product_id
: Identifier for the associated product.inventory_item_id
: Identifier for the associated inventory item.status
: Status of the order item.created_at
: Timestamp indicating when the order item was created.shipped_at
: Timestamp indicating when the order item was shipped.delivered_at
: Timestamp indicating when the order item was delivered.returned_at
: Timestamp indicating when the order item was returned.orders.csv
order_id
: Unique identifier for each order.user_id
: Identifier for the user who placed the order.status
: Status of the order.gender
: Gender information of the user.created_at
: Timestamp indicating when the order was created.returned_at
: Timestamp indicating when the order was returned.shipped_at
: Timestamp indicating when the order was shipped.delivered_at
: Timestamp indicating when the order was delivered.num_of_item
: Number of items in the order.products.csv
id
: Unique identifier for each product.cost
: Cost of the product.category
: Category to which the product belongs.name
: Name of the product.brand
: Brand of the product.retail_price
: Retail price of the product.department
: Department to which the product belongs.sku
: Stock Keeping Unit (SKU) of the product.distribution_center_id
: Identifier for the distribution center associated with the product.users.csv
id
: Unique identifier for each user.first_name
: First name of the user.last_name
: Last name of the user.email
: Email address of the user.age
: Age of the user.gender
: Gender of the user.state
: State where t...Success.ai’s Ecommerce Store Data for the APAC E-commerce Sector provides a reliable and accurate dataset tailored for businesses aiming to connect with e-commerce professionals and organizations across the Asia-Pacific region. Covering roles and businesses involved in online retail, marketplace management, logistics, and digital commerce, this dataset includes verified business profiles, decision-maker contact details, and actionable insights.
With access to continuously updated, AI-validated data and over 700 million global profiles, Success.ai ensures your outreach, market analysis, and partnership strategies are effective and data-driven. Backed by our Best Price Guarantee, this solution helps you excel in one of the world’s fastest-growing e-commerce markets.
Why Choose Success.ai’s Ecommerce Store Data?
Verified Profiles for Precision Engagement
Comprehensive Coverage of the APAC E-commerce Sector
Continuously Updated Datasets
Ethical and Compliant
Data Highlights:
Key Features of the Dataset:
Comprehensive E-commerce Business Profiles
Advanced Filters for Precision Campaigns
Regional and Sector-specific Insights
AI-Driven Enrichment
Strategic Use Cases:
Marketing Campaigns and Outreach
Partnership Development and Vendor Collaboration
Market Research and Competitive Analysis
Recruitment and Talent Acquisition
Why Choose Success.ai?
Best Price Guarantee
Seamless Integration
https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
This furniture e-commerce dataset includes 140,000+ structured product records collected from online retail sources. Each entry provides detailed product information, categories, and breadcrumb hierarchies, making it ideal for AI, machine learning, and analytics applications.
Key Features:
📊 140K+ furniture product records in structured format
🏷 Includes categories, subcategories, and breadcrumbs for taxonomy mapping
📂 Delivered as a clean CSV file for easy integration
🔎 Perfect dataset for AI, NLP, and machine learning model training
Best Use Cases:
✔ LLM training & fine-tuning with domain-specific data
✔ Product classification datasets for AI models
✔ Recommendation engines & personalization in e-commerce
✔ Market research & furniture retail analytics
✔ Search optimization & taxonomy enrichment
Why this dataset?
Large volume (140K+ furniture records) for robust training
Real-world e-commerce product data
Ready-to-use CSV, saving preprocessing time
Affordable licensing with bulk discounts for enterprise buyers
Note:
Each record in this dataset includes both a url
(main product page) and a buy_url
(the actual purchase page).
The dataset is structured so that records are based on the buy_url
, ensuring you get unique, actionable product-level data instead of just generic landing pages.
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
1) Data introduction • Womens-ecommerce-clothing-reviews dataset is a dataset containing 23,000 customer reviews and ratings.
2) Data utilization (1) Womens-ecommerce-clothing-reviews data has characteristics that: • We aim for high-quality NLP and multivariate analysis with a dataset consisting of 10 functional variables such as clothing, age, and review title and 23,486 rows. (2) Womens-ecommerce-clothing-reviews data can be used to: • Rating prediction: Develop machine learning models to predict the ratings customers might give based on review text and support automated review analysis. • Trend analysis: Companies can analyze data to identify trends and patterns in customer preferences and support inventory management and marketing strategies.
https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
Discover the Walmart Products Free Dataset, featuring 2,000 records in CSV format. This dataset includes detailed information about various Walmart products, such as names, prices, categories, and descriptions.
It’s perfect for data analysis, e-commerce research, and machine learning projects. Download now and kickstart your insights with accurate, real-world data.
This dataset is having data of customers who buys clothes online. The store offers in-store style and clothing advice sessions. Customers come in to the store, have sessions/meetings with a personal stylist, then they can go home and order either on a mobile app or website for the clothes they want.
The company is trying to decide whether to focus their efforts on their mobile app experience or their website.
https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
Unlock fashion retail intelligence with our comprehensive Zara UK products dataset. This premium collection contains 16,000 products from Zara's UK online store, providing detailed insights into one of the world's leading fast-fashion retailers. Perfect for fashion trend analysis, pricing strategies, competitive research, and machine learning applications.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
E-commerce Customer Order Behavior Dataset
A synthetic e-commerce dataset containing 10,000 orders with realistic customer behavior patterns, suitable for e-commerce analytics and machine learning tasks.
Dataset Card for E-commerce Orders
Dataset Summary
This dataset simulates customer order behavior in an e-commerce platform, containing detailed information about orders, customers, products, and delivery patterns. The data is synthetically generated with… See the full description on the dataset page: https://huggingface.co/datasets/millat/e-commerce-orders.
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
1) Data Introduction • The E-Commerce Shipping Data is collected to analyze customer behavior and build machine learning models with a variety of information, including purchase, delivery, and customer inquiries for 10,999 customers of international e-commerce companies.
2) Data Utilization (1) E-Commerce Shipping Data has characteristics that: • The dataset consists of 12 variables: customer ID, warehouse area, delivery method, customer center call count, customer rating (1 to 5), product price, number of previous purchases, product importance (up/medium/low), gender, discount rate, product weight, delivery time compliance (arrival delay, 0/1). • With delivery delays as a target variable, various factors such as customer behavior, product characteristics, and delivery process can be analyzed together. (2) E-Commerce Shipping Data can be used to: • Development of delivery delay prediction model: It can be used to build machine learning models that predict delivery delays by utilizing customer characteristics, product information, and delivery methods. • Analysis of customer satisfaction and service improvement: By analyzing the relationship between customer rating, number of inquiries, discount rate, and delivery performance, it can be applied to enhance customer satisfaction and establish service improvement strategies.
Success.ai delivers unparalleled access to Retail Store Data for Asia’s retail and e-commerce sectors, encompassing subcategories such as ecommerce data, ecommerce merchant data, ecommerce market data, and company data. Whether you’re targeting emerging markets or established players, our solutions provide the tools to connect with decision-makers, analyze market trends, and drive strategic growth. With continuously updated datasets and AI-validated accuracy, Success.ai ensures your data is always relevant and reliable.
Key Features of Success.ai's Retail Store Data for Retail & E-commerce in Asia:
Extensive Business Profiles: Access detailed profiles for 70M+ companies across Asia’s retail and e-commerce sectors. Profiles include firmographic data, revenue insights, employee counts, and operational scope.
Ecommerce Data: Gain insights into online marketplaces, customer demographics, and digital transaction patterns to refine your strategies.
Ecommerce Merchant Data: Understand vendor performance, supply chain metrics, and operational details to optimize partnerships.
Ecommerce Market Data: Analyze purchasing trends, regional preferences, and market demands to identify growth opportunities.
Contact Data for Decision-Makers: Reach key stakeholders, such as CEOs, marketing executives, and procurement managers. Verified contact details include work emails, phone numbers, and business addresses.
Real-Time Accuracy: AI-powered validation ensures a 99% accuracy rate, keeping your outreach efforts efficient and impactful.
Compliance and Ethics: All data is ethically sourced and fully compliant with GDPR and other regional data protection regulations.
Why Choose Success.ai for Retail Store Data?
Best Price Guarantee: We deliver industry-leading value with the most competitive pricing for comprehensive retail store data.
Customizable Solutions: Tailor your data to meet specific needs, such as targeting particular regions, industries, or company sizes.
Scalable Access: Our data solutions are built to grow with your business, supporting small startups to large-scale enterprises.
Seamless Integration: Effortlessly incorporate our data into your existing CRM, marketing, or analytics platforms.
Comprehensive Use Cases for Retail Store Data:
Identify potential partners, distributors, and clients to expand your footprint in Asia’s dynamic retail and e-commerce markets. Use detailed profiles to assess market opportunities and risks.
Leverage ecommerce data and consumer insights to craft highly targeted campaigns. Connect directly with decision-makers for precise and effective communication.
Analyze competitors’ operations, market positioning, and consumer strategies to refine your business plans and gain a competitive edge.
Evaluate potential suppliers or vendors using ecommerce merchant data, including financial health, operational details, and contact data.
Enhance customer loyalty programs and retention strategies by leveraging ecommerce market data and purchasing trends.
APIs to Amplify Your Results:
Enrichment API: Keep your CRM and analytics platforms up-to-date with real-time data enrichment, ensuring accurate and actionable company profiles.
Lead Generation API: Maximize your outreach with verified contact data for retail and e-commerce decision-makers. Ideal for driving targeted marketing and sales efforts.
Tailored Solutions for Industry Professionals:
Retailers: Expand your supply chain, identify new markets, and connect with key partners in the e-commerce ecosystem.
E-commerce Platforms: Optimize your vendor and partner selection with verified profiles and operational insights.
Marketing Agencies: Deliver highly personalized campaigns by leveraging detailed consumer data and decision-maker contacts.
Consultants: Provide data-driven recommendations to clients with access to comprehensive company data and market trends.
What Sets Success.ai Apart?
70M+ Business Profiles: Access an extensive and detailed database of companies across Asia’s retail and e-commerce sectors.
Global Compliance: All data is sourced ethically and adheres to international data privacy standards, including GDPR.
Real-Time Updates: Ensure your data remains accurate and relevant with our continuously updated datasets.
Dedicated Support: Our team of experts is available to help you maximize the value of our data solutions.
Empower Your Business with Success.ai:
Success.ai’s Retail Store Data for the retail and e-commerce sectors in Asia provides the insights and connections needed to thrive in this competitive market. Whether you’re entering a new region, launching a targeted campaign, or analyzing market trends, our data solutions ensure measurable success.
...
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset is synthetically generated fake data designed to simulate a realistic e-commerce environment.
To provide large-scale relational datasets for practicing database operations, analytics, and testing tools like DuckDB, Pandas, and SQL engines. Ideal for benchmarking, educational projects, and data engineering experiments.
int
): Unique identifier for each customer string
): Customer full name string
): Customer email address string
): Customer gender ('Male', 'Female', 'Other') date
): Date customer signed up string
): Customer country of residence int
): Unique identifier for each product string
): Name of the product string
): Product category (e.g., Electronics, Books) float
): Price per unit int
): Available stock count string
): Product brand name int
): Unique identifier for each order int
): ID of the customer who placed the order (foreign key to Customers) date
): Date when order was placed float
): Total amount for the order string
): Payment method used (Credit Card, PayPal, etc.) string
): Country where the order is shipped int
): Unique identifier for each order item int
): ID of the order this item belongs to (foreign key to Orders) int
): ID of the product ordered (foreign key to Products) int
): Number of units ordered float
): Price per unit at order time int
): Unique identifier for each review int
): ID of the reviewed product (foreign key to Products) int
): ID of the customer who wrote the review (foreign key to Customers) int
): Rating score (1 to 5) string
): Text content of the review date
): Date the review was written https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F9179978%2F7681afe8fc52a116ff56a2a4e179ad19%2FEDR.png?generation=1754741998037680&alt=media" alt="">
The script saves two folders inside the specified output path:
csv/ # CSV files
parquet/ # Parquet files
MIT License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘ E-Commerce Shipping Data ’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/prachi13/customer-analytics on 30 September 2021.
--- Dataset description provided by original source is as follows ---
An international e-commerce company based wants to discover key insights from their customer database. They want to use some of the most advanced machine learning techniques to study their customers. The company sells electronic products.
The dataset used for model building contained 10999 observations of 12 variables. The data contains the following information:
I would like to specify that I am only making available on Github in Data collected data about product shipment to Kagglers. I made this as my project on Customer Analytics stored in GitHub repository.
This data of Product Shipment Tracking, answer instantly to your questions: - What was Customer Rating? And was the product delivered on time? - Is Customer query is being answered? - If Product importance is high. having higest rating or being delivered on time?
--- Original source retains full ownership of the source dataset ---
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Asos
Using web scraping, we collected information on over 30,845 clothing items from the Asos website. The dataset can be applied in E-commerce analytics in the fashion industry.
💴 For Commercial Usage: To discuss your requirements, learn about the price and buy the dataset, leave a request on our website to buy the dataset
Dataset Info
For each item, we extracted:
url - link to the item on the website name - item's name size - sizes available on the… See the full description on the dataset page: https://huggingface.co/datasets/UniqueData/asos-e-commerce-dataset.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Analyze e-commerce trends using a dummy dataset of 1,000 products across categories like electronics, clothing, home & kitchen, books, and toys. Includes attributes such as price, rating, reviews, stock quantity, discounts, sales, and inventory dates. Ideal for exploratory data analysis, machine learning model training, and testing algorithms for product recommendation, sales prediction, and customer segmentation.
This is the largest retail e-commerce orders dataset from Pakistan. It contains half a million transaction records from March 2016 to August 2018. The data was collected from various e-commerce merchants as part of a research study. I am releasing this dataset as a capstone project for my data science course at Alnafi (alnafi.com/zusmani).
There is a dire need for such dataset to learn about Pakistan’s emerging e-commerce potential and I hope this will help many startups in many ways.
Geography: Pakistan
Time period: 03/2016 – 08/2018
Unit of analysis: E-Commerce Orders
Dataset: The dataset contains detailed information of half a million e-commerce orders in Pakistan from March 2016 to August 2018. It contains item details, shipping method, payment method like credit card, Easy-Paisa, Jazz-Cash, cash-on-delivery, product categories like fashion, mobile, electronics, appliance etc., date of order, SKU, price, quantity, total and customer ID. This is the most detailed dataset about e-commerce in Pakistan that you can find in the Public domain.
Variables: The dataset contains Item ID, Order Status (Completed, Cancelled, Refund), Date of Order, SKU, Price, Quantity, Grand Total, Category, Payment Method and Customer ID.
Size: 101 MB
File Type: CSV
I like to thank all the startups who are trying to make their mark in Pakistan despite the unavailability of research data.
I’d like to call the attention of my fellow Kagglers to use Machine Learning and Data Sciences to help me explore these ideas:
• What is the best-selling category? • Visualize payment method and order status frequency • Find a correlation between payment method and order status • Find a correlation between order date and item category • Find any hidden patterns that are counter-intuitive for a layman • Can we predict number of orders, or item category or number of customers/amount in advance?
Success.ai’s Ecommerce Leads Data for Retail, E-commerce & Consumer Goods Executives Worldwide delivers a robust and comprehensive dataset designed to help businesses connect with decision-makers and professionals in the global retail and e-commerce sectors. Covering industry leaders, marketing strategists, product managers, and logistics executives, this dataset offers verified contact details, business locations, and decision-maker insights.
With access to over 700 million verified global profiles and actionable data from retail and consumer goods companies, Success.ai ensures your outreach, market research, and business development initiatives are powered by accurate, continuously updated, and AI-validated data. Supported by our Best Price Guarantee, this solution equips you to succeed in the competitive e-commerce landscape.
Why Choose Success.ai’s Ecommerce Leads Data?
Verified Contact Data for Precision Outreach
Comprehensive Global Coverage
Continuously Updated Datasets
Ethical and Compliant
Data Highlights:
Key Features of the Dataset:
Professional Profiles in E-commerce and Retail
Advanced Filters for Precision Campaigns
Industry and Regional Insights
AI-Driven Enrichment
Strategic Use Cases:
Marketing Campaigns and Lead Generation
Product Development and Innovation
Partnership Development and Collaboration
Market Research and Competitive Analysis
Why Choose Success.ai?
Best Price Guarantee
Seamless Integration
https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy
The B2C E-commerce Market size was valued at USD 6.23 trillion in 2023 and is projected to reach USD 21.18 trillion by 2032, exhibiting a CAGR of 19.1 % during the forecasts period. The B2C e-commerce can be defined as the sale of commercial products or services through the internet between buyers and sellers. This market pertains to several industries that fall under its fold that includes the area of retail, travelling, electronics and digital products. Some of the most common implementations are in the ecommerce sites, mobile applications, and membership services. Some aspects of the B2C e-commerce market include increased popularity of omnichannel retailing that combines online and offline environments and the shift to the concept of individualization due to the digitalization and data processing using artificial intelligence and machine learning. Also, growth is noted in mobile commerce (m-commerce) as a result of the increase in the number of mobile devices and more effective mobile payments. To this list one should also include the concepts of social commerce and sustainability which also became significant in today’s society due to increasing importance of ethical and convenient shopping. Recent developments include: In March 2024, Blink, an Amazon company, launched the Blink Mini 2 camera. The new compact plug-in camera offers enhanced features such as person detection, a broader field of view, a built-in LED spotlight for night view in color, and improved image quality. The Blink Mini 2 is designed to work indoors and outdoors, with the option to purchase the Blink Weather Resistant Power Adapter for outdoor use. , In October 2023, Flipkart.com introduced the 'Flipkart Commerce Cloud,' a customized suite of AI-driven retail technology solutions for global retailers and e-commerce businesses. This extensive offering includes marketplace technology, retail media solutions, pricing, and inventory management features rigorously assessed by Flipkart.com. The company aims to equip international sellers with reliable and secure tools to enhance business expansion and efficiency within the competitive global market. , In August 2023, Shopify and Amazon.com, Inc. announced a strategic partnership that will allow Shopify merchants to seamlessly implement Amazon's "Buy with Prime" option on their sites. As a result of the agreement, Amazon.com, Inc. Prime customers will enjoy a more efficient checkout process on various platforms. This collaboration allows Amazon Prime members to utilize their existing Amazon payment options, while Shopify will handle the transaction processing through its system, showcasing a partnership between the two leading companies. , In February 2023, eBay acquired 3PM Shield, a developer of AI-powered online retail solutions. 3PM Shield uses machine learning and artificial intelligence to analyze extensive data sets, enhancing marketplace compliance and user experience. This acquisition aligns with eBay's goal to offer a "safe and reliable" platform by boosting its ability to block the sale of counterfeit and prohibited items. By incorporating 3PM Shield's sophisticated monitoring technologies, eBay seeks to enhance its capability to address problematic seller behavior and spot problematic listings, fostering a safer e-commerce space for its worldwide community of sellers and buyers. .
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
• I leveraged advanced data visualization techniques to extract valuable insights from a comprehensive dataset. By visualizing sales patterns, customer behavior, and product trends, I identified key growth opportunities and provided actionable recommendations to optimize business strategies and enhance overall performance. you can find the GitHub repo here Link to GitHub Repository.
there are exactly 6 table and 1 is a fact table and the rest of them are dimension tables: Fact Table:
payment_key:
Description: An identifier representing the payment transaction associated with the fact.
Use Case: This key links to a payment dimension table, providing details about the payment method and related information.
customer_key:
Description: An identifier representing the customer associated with the fact.
Use Case: This key links to a customer dimension table, providing details about the customer, such as name, address, and other customer-specific information.
time_key:
Description: An identifier representing the time dimension associated with the fact.
Use Case: This key links to a time dimension table, providing details about the time of the transaction, such as date, day of the week, and month.
item_key:
Description: An identifier representing the item or product associated with the fact.
Use Case: This key links to an item dimension table, providing details about the product, such as category, sub-category, and product name.
store_key:
Description: An identifier representing the store or location associated with the fact.
Use Case: This key links to a store dimension table, providing details about the store, such as location, store name, and other store-specific information.
quantity:
Description: The quantity of items sold or involved in the transaction.
Use Case: Represents the amount or number of items associated with the transaction.
unit:
Description: The unit or measurement associated with the quantity (e.g., pieces, kilograms).
Use Case: Specifies the unit of measurement for the quantity.
unit_price:
Description: The price per unit of the item.
Use Case: Represents the cost or price associated with each unit of the item.
total_price:
Description: The total price of the transaction, calculated as the product of quantity and unit price.
Use Case: Represents the overall cost or revenue generated by the transaction.
Customer Table: customer_key:
Description: An identifier representing a unique customer.
Use Case: Serves as the primary key to link with the fact table, allowing for easy and efficient retrieval of customer-specific information.
name:
Description: The name of the customer.
Use Case: Captures the personal or business name of the customer for identification and reference purposes.
contact_no:
Description: The contact number associated with the customer.
Use Case: Stores the phone number or contact details for communication or outreach purposes.
nid:
Description: The National ID (NID) or a unique identification number for the customer.
Item Table: item_key:
Description: An identifier representing a unique item or product.
Use Case: Serves as the primary key to link with the fact table, enabling retrieval of detailed information about specific items in transactions.
item_name:
Description: The name or title of the item.
Use Case: Captures the descriptive name of the item, providing a recognizable label for the product.
desc:
Description: A description of the item.
Use Case: Contains additional details about the item, such as features, specifications, or any relevant information.
unit_price:
Description: The price per unit of the item.
Use Case: Represents the cost or price associated with each unit of the item.
man_country:
Description: The country where the item is manufactured.
Use Case: Captures the origin or manufacturing location of the item.
supplier:
Description: The supplier or vendor providing the item.
Use Case: Stores the name or identifier of the supplier, facilitating tracking of item sources.
unit:
Description: The unit of measurement associated with the item (e.g., pieces, kilograms).
Store Table: store_key:
Description: An identifier representing a unique store or location.
Use Case: Serves as the primary key to link with the fact table, allowing for easy retrieval of information about transactions associated with specific stores.
division:
Description: The administrative division or region where the store is located.
Use Case: Captures the broader geographical area in which...