MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
This dataset is a cleaned and merged version of the original UCI Online Retail and Online Retail II datasets. It contains transaction data from a UK-based online retailer, covering a period from December 2009 to December 2011.
The original UCI Online Retail II dataset contains two separate sheets:
- Year 2009–2010
- Year 2010–2011
These have been merged with the original UCI Online Retail dataset to create a unified and continuous dataset.
- quantity
- price
- customer_id
- total_price column (quantity × price)
- is_cancelled column based on invoice format or return flag
- invoicedate formatting

Column | Description |
---|---|
invoice | Invoice number (returns start with 'C') |
stockcode | Product code |
description | Description of product |
quantity | Number of items purchased |
invoicedate | Date and time of invoice |
price | Unit price in GBP |
customer_id | Unique identifier for each customer |
country | Customer’s country |
is_cancelled | Boolean flag for cancelled transactions |
total_price | Computed total (quantity × price) for each line item |
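
The two derived columns in the table above can be illustrated with a minimal sketch. It assumes a cancellation is any invoice number that starts with 'C', as the invoice description states; the function name `add_derived_columns` and the sample rows are hypothetical and are not taken from the author's cleaning code.

```python
from decimal import Decimal

def add_derived_columns(row):
    """Return a copy of one transaction row with is_cancelled and total_price added."""
    out = dict(row)
    # Assumption: returns/cancellations are marked by a leading 'C' in the invoice number.
    out["is_cancelled"] = str(row["invoice"]).startswith("C")
    # Decimal keeps currency arithmetic free of binary float rounding noise.
    out["total_price"] = Decimal(str(row["quantity"])) * Decimal(str(row["price"]))
    return out

# Hypothetical sample rows, for illustration only.
rows = [
    {"invoice": "489434", "quantity": 12, "price": "6.95"},
    {"invoice": "C489449", "quantity": -1, "price": "2.95"},
]
derived = [add_derived_columns(r) for r in rows]
```

Cancelled lines carry a negative quantity in this sketch, so their total_price is negative and nets out against the original sale when totals are summed.
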
File | Type | Description |
---|---|---|
online_retail_cleaned.csv | Data | Cleaned and merged retail transactions from 2009–2011 |
rfm_final_score.csv | Output | Final RFM scores for each customer with segment labels |
Retail_Data_Analysis_Dashboard.xlsx | Excel | Interactive Excel dashboard with KPIs, CLV, monthly trends |
Retail_Data_Analysis_Dashboard.png | Image | Visual preview of the Excel dashboard |
RFM_Segmentation.sql | SQL | SQL logic to calculate RFM scores and assign segments |
Cohort_Analysis_on_Customer.sql | SQL | Cohort analysis based on acquisition month |
Cohort_Analysis_on_Revenue.sql | SQL | Cohort revenue tracking over time |
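
The contents of RFM_Segmentation.sql are not shown here, so the following is only a sketch of how raw recency, frequency, and monetary values could be computed from the cleaned columns. It assumes cancelled lines are excluded, frequency counts distinct invoices, and recency is measured in days from a chosen snapshot date; the function `rfm_table`, the snapshot date, and the sample rows are hypothetical. Scoring and segment labels are left out because the card does not describe them.

```python
from collections import defaultdict
from datetime import datetime

def rfm_table(rows, snapshot):
    """Compute raw recency (days), frequency (invoices), and monetary (sum) per customer."""
    last_seen = {}
    invoices = defaultdict(set)
    spend = defaultdict(float)
    for r in rows:
        if r["is_cancelled"]:
            continue  # assumption: cancelled lines do not count toward RFM
        cid = r["customer_id"]
        when = r["invoicedate"]
        if cid not in last_seen or when > last_seen[cid]:
            last_seen[cid] = when
        invoices[cid].add(r["invoice"])
        spend[cid] += r["total_price"]
    return {
        cid: {
            "recency_days": (snapshot - last_seen[cid]).days,
            "frequency": len(invoices[cid]),
            "monetary": round(spend[cid], 2),
        }
        for cid in last_seen
    }

# Hypothetical sample rows, for illustration only.
sample = [
    {"customer_id": 1, "invoice": "A1", "invoicedate": datetime(2011, 12, 1), "total_price": 10.0, "is_cancelled": False},
    {"customer_id": 1, "invoice": "A1", "invoicedate": datetime(2011, 12, 1), "total_price": 5.5, "is_cancelled": False},
    {"customer_id": 1, "invoice": "A2", "invoicedate": datetime(2011, 12, 5), "total_price": 20.0, "is_cancelled": False},
    {"customer_id": 2, "invoice": "B1", "invoicedate": datetime(2011, 11, 1), "total_price": 30.0, "is_cancelled": False},
    {"customer_id": 2, "invoice": "CB2", "invoicedate": datetime(2011, 11, 2), "total_price": -30.0, "is_cancelled": True},
]
rfm = rfm_table(sample, snapshot=datetime(2011, 12, 10))
```
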
In addition to the cleaned dataset, this dataset includes complete analysis artifacts, listed in the file table above.
These files are provided in .xlsx and .sql formats and can be used for further business analysis or modeling.
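
As one example of such further analysis, a cohort table by acquisition month could be built along the following lines. This is a sketch under stated assumptions, not the logic of Cohort_Analysis_on_Customer.sql: a customer's acquisition month is taken to be the month of their earliest invoice, and the helper names `month_index` and `cohort_counts` and the sample rows are hypothetical.

```python
from collections import defaultdict
from datetime import datetime

def month_index(d):
    """Number months consecutively so that month differences are simple subtractions."""
    return d.year * 12 + d.month

def cohort_counts(rows):
    """Map (acquisition month 'YYYY-MM', months since acquisition) to distinct active customers."""
    first = {}
    for r in rows:
        cid, m = r["customer_id"], month_index(r["invoicedate"])
        first[cid] = min(first.get(cid, m), m)

    active = defaultdict(set)
    for r in rows:
        cid = r["customer_id"]
        offset = month_index(r["invoicedate"]) - first[cid]
        active[(first[cid], offset)].add(cid)

    def label(idx):
        year, month = divmod(idx - 1, 12)
        return f"{year:04d}-{month + 1:02d}"

    return {(label(c), off): len(ids) for (c, off), ids in active.items()}

# Hypothetical sample rows, for illustration only.
sample = [
    {"customer_id": 1, "invoicedate": datetime(2009, 12, 1)},
    {"customer_id": 1, "invoicedate": datetime(2010, 1, 15)},
    {"customer_id": 2, "invoicedate": datetime(2009, 12, 20)},
    {"customer_id": 3, "invoicedate": datetime(2010, 1, 3)},
    {"customer_id": 3, "invoicedate": datetime(2010, 3, 10)},
]
cohorts = cohort_counts(sample)
```

Dividing each count by the cohort's size at offset 0 would give retention rates; summing total_price instead of counting customers would give the revenue variant.
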
Original datasets:
- UCI Online Retail II: https://archive.ics.uci.edu/ml/datasets/Online+Retail+II
This version was cleaned and merged by: Md Shah Nawaj
retail, ecommerce, customer segmentation, transactions, time series, data cleaning, rfm, python, pandas, online retail