Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Project Overview: Customer Segmentation Using K-Means Clustering
Introduction In this project, I analysed customer data from a retail store to identify distinct customer segments. The dataset includes key attributes such as age, city, and total sales of the customers. By leveraging K-Means clustering, an unsupervised machine learning technique, I aim to group customers based on their age and sales metrics. These insights will enable the creation of targeted marketing campaigns tailored to the specific needs and behaviours of each customer segment.
Objectives - Cluster Customers: Use K-Means clustering to group customers based on age and total sales. - Analyse Segments: Examine the characteristics of each customer segment. - Targeted Marketing: Develop strategies for personalized marketing campaigns targeting each identified customer group.
Data Description The dataset comprises:
Methodology - Data Preprocessing: Clean and preprocess the data to handle any missing or inconsistent entries. - Feature Selection: Focus on age and total sales as primary features for clustering. - K-Means Clustering: Apply the K-Means algorithm to identify distinct customer segments. - Cluster Analysis: Analyse the resulting clusters to understand the demographic and sales characteristics of each group. - Marketing Strategy Development: Create targeted marketing strategies for each customer segment to enhance engagement and sales.
Expected Outcomes - Customer Segments: Clear identification of customer groups based on age and purchasing behaviour. - Insights for Marketing: Detailed understanding of each segment to inform targeted marketing efforts. - Business Impact: Enhanced ability to tailor marketing campaigns, potentially leading to increased customer satisfaction and sales.
By clustering customers based on age and total sales, this project aims to provide actionable insights for personalized marketing, ultimately driving better customer engagement and higher sales for the retail store.
Facebook
Twitterhttps://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/
This dataset was sourced from KPMG AU's Data Analytics virtual internship course on Forage
Sprocket Pvt Ltd is a client of KPMG AU. Sprocket is a bike and bike accessories retail business. They need to find the right customer segment to target for marketing to boost revenue. The following dataset is of their customer demographics for the past 3 years.
The original dataset of 3 separate sheets of Customer demographic, Transactions, and Customer Addresses was fully cleaned and merged using a power query. Data types of columns were changed, and values of certain columns which had illegal values were corrected using a standard approach. This final master dataset can be used for customer segmentation projects using clustering methods.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This simulated customer dataset provides a practical foundation for performing segmentation analysis and identifying distinct customer groups. The dataset encompasses a blend of demographic and behavioral information, equipping users with the necessary data to develop targeted marketing strategies, personalize customer experiences, and ultimately drive sales growth.
This dataset is structured to provide a comprehensive view of each customer, combining demographic information with detailed purchasing behavior. The columns included are:
The insights derived from this dataset can be applied to several key business areas:
Facebook
TwitterPremium B2C Consumer Database - 269+ Million US Records
Supercharge your B2C marketing campaigns with comprehensive consumer database, featuring over 269 million verified US consumer records. Our 20+ year data expertise delivers higher quality and more extensive coverage than competitors.
Core Database Statistics
Consumer Records: Over 269 million
Email Addresses: Over 160 million (verified and deliverable)
Phone Numbers: Over 76 million (mobile and landline)
Mailing Addresses: Over 116,000,000 (NCOA processed)
Geographic Coverage: Complete US (all 50 states)
Compliance Status: CCPA compliant with consent management
Targeting Categories Available
Demographics: Age ranges, education levels, occupation types, household composition, marital status, presence of children, income brackets, and gender (where legally permitted)
Geographic: Nationwide, state-level, MSA (Metropolitan Service Area), zip code radius, city, county, and SCF range targeting options
Property & Dwelling: Home ownership status, estimated home value, years in residence, property type (single-family, condo, apartment), and dwelling characteristics
Financial Indicators: Income levels, investment activity, mortgage information, credit indicators, and wealth markers for premium audience targeting
Lifestyle & Interests: Purchase history, donation patterns, political preferences, health interests, recreational activities, and hobby-based targeting
Behavioral Data: Shopping preferences, brand affinities, online activity patterns, and purchase timing behaviors
Multi-Channel Campaign Applications
Deploy across all major marketing channels:
Email marketing and automation
Social media advertising
Search and display advertising (Google, YouTube)
Direct mail and print campaigns
Telemarketing and SMS campaigns
Programmatic advertising platforms
Data Quality & Sources
Our consumer data aggregates from multiple verified sources:
Public records and government databases
Opt-in subscription services and registrations
Purchase transaction data from retail partners
Survey participation and research studies
Online behavioral data (privacy compliant)
Technical Delivery Options
File Formats: CSV, Excel, JSON, XML formats available
Delivery Methods: Secure FTP, API integration, direct download
Processing: Real-time NCOA, email validation, phone verification
Custom Selections: 1,000+ selectable demographic and behavioral attributes
Minimum Orders: Flexible based on targeting complexity
Unique Value Propositions
Dual Spouse Targeting: Reach both household decision-makers for maximum impact
Cross-Platform Integration: Seamless deployment to major ad platforms
Real-Time Updates: Monthly data refreshes ensure maximum accuracy
Advanced Segmentation: Combine multiple targeting criteria for precision campaigns
Compliance Management: Built-in opt-out and suppression list management
Ideal Customer Profiles
E-commerce retailers seeking customer acquisition
Financial services companies targeting specific demographics
Healthcare organizations with compliant marketing needs
Automotive dealers and service providers
Home improvement and real estate professionals
Insurance companies and agents
Subscription services and SaaS providers
Performance Optimization Features
Lookalike Modeling: Create audiences similar to your best customers
Predictive Scoring: Identify high-value prospects using AI algorithms
Campaign Attribution: Track performance across multiple touchpoints
A/B Testing Support: Split audiences for campaign optimization
Suppression Management: Automatic opt-out and DNC compliance
Pricing & Volume Options
Flexible pricing structures accommodate businesses of all sizes:
Pay-per-record for small campaigns
Volume discounts for large deployments
Subscription models for ongoing campaigns
Custom enterprise pricing for high-volume users
Data Compliance & Privacy
VIA.tools maintains industry-leading compliance standards:
CCPA (California Consumer Privacy Act) compliant
CAN-SPAM Act adherence for email marketing
TCPA compliance for phone and SMS campaigns
Regular privacy audits and data governance reviews
Transparent opt-out and data deletion processes
Getting Started
Our data specialists work with you to:
Define your target audience criteria
Recommend optimal data selections
Provide sample data for testing
Configure delivery methods and formats
Implement ongoing campaign optimization
Why We Lead the Industry
With over two decades of data industry experience, we combine extensive database coverage with advanced targeting capabilities. Our commitment to data quality, compliance, and customer success has made us the preferred choice for businesses seeking superior B2C marketing performance.
Contact our team to discuss your specific ta...
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Hotel customer dataset with 31 variables describing a total of 83,590 instances (customers). It comprehends three full years of customer behavioral data. In addition to personal and behavioral information, the dataset also contains demographic and geographical information. This dataset contributes to reducing the lack of real-world business data that can be used for educational and research purposes. The dataset can be used in data mining, machine learning, and other analytical field problems in the scope of data science. Due to its unit of analysis, it is a dataset especially suitable for building customer segmentation models, including clustering and RFM (Recency, Frequency, and Monetary value) models, but also be used in classification and regression problems.
Facebook
TwitterSuccess.ai’s Consumer Marketing Data API empowers your marketing, analytics, and product teams with on-demand access to a vast and continuously updated dataset of consumer insights. Covering detailed demographics, behavioral patterns, and purchasing histories, this API enables you to go beyond generic outreach and craft tailored campaigns that truly resonate with your target audiences.
With AI-validated accuracy and support for precise filtering, the Consumer Marketing Data API ensures you’re always equipped with the most relevant data. Backed by our Best Price Guarantee, this solution is essential for refining your strategies, improving conversion rates, and driving sustainable growth in today’s competitive consumer landscape.
Why Choose Success.ai’s Consumer Marketing Data API?
Tailored Consumer Insights for Precision Targeting
Comprehensive Global Reach
Continuously Updated and Real-Time Data
Ethical and Compliant
Data Highlights:
Key Features of the Consumer Marketing Data API:
Granular Targeting and Segmentation
Flexible and Seamless Integration
Continuous Data Enrichment
AI-Driven Validation
Strategic Use Cases:
Highly Personalized Marketing Campaigns
Market Expansion and Product Launches
Competitive Analysis and Trend Forecasting
Customer Retention and Loyalty Programs
Why Choose Success.ai?
Best Price Guarantee
Seamless Integration
Data Accuracy with AI Validation
Customizable and Scalable Solutions
Facebook
TwitterThe User Profile Data is a structured, anonymized dataset designed to help organizations understand who their users are, what devices they use, and where they are located. Each record provides privacy-compliant linkages between user IDs, demographic profiles, device intelligence, and geolocation data, offering deep context for analytics, segmentation, and personalization.
Built for privacy-safe analytics, the dataset uses hashed identifiers like phone number and email and standardized formats, making it easy to integrate into big-data platforms, AI pipelines, and machine learning models for advanced analytics.
Demographic insights include gender, age, and age group, essential for audience profiling, marketing optimization, and consumer intelligence. All gender data is user-declared and AI-verified through image-based avatar validation, ensuring data accuracy and authenticity.
The dataset’s Device Intelligence Layer includes rich technical attributes such as device brand, model, OS version, user agent, RAM, language, and timezone, enabling technical segmentation, performance analytics, and targeted ad delivery across diverse device ecosystems.
On the location and POI front, the dataset combines GPS-based and IP-based coordinates—including country, region, city, latitude, longitude —to provide high-precision geospatial insights. This enables mobility pattern analysis, market expansion planning, and POI clustering for advanced location intelligence.
Each user record contains onboarding and lifecycle fields like unique IDs, and profile update timestamps, allowing accurate tracking of user acquisition trends, data freshness, and activity duration.
🔍 Key Features • 1st-party, consent-based demographic & device data • AI-verified gender insights via avatar recognition • OS-level app data with 120+ daily sessions per user • Global coverage across APAC and emerging markets • GPS + IP-based geolocation & POI intelligence • Privacy-compliant, hashed identifiers for safe integration
🚀 Use Cases • Audience segmentation & lookalike modeling • Ad-tech and mar-tech optimization • Geospatial & POI analytics • Fraud detection & risk scoring • Personalization & recommendation engines • App performance & device compatibility insights
🏢 Industries Served Ad-Tech • Mar-Tech • FinTech • Telecom • Retail Analytics • Consumer Intelligence • AI & ML Platforms
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Demographic Analysis of Shopping Behavior: Insights and Recommendations
Dataset Information: The Shopping Mall Customer Segmentation Dataset comprises 15,079 unique entries, featuring Customer ID, age, gender, annual income, and spending score. This dataset assists in understanding customer behavior for strategic marketing planning.
Cleaned Data Details: Data cleaned and standardized, 15,079 unique entries with attributes including - Customer ID, age, gender, annual income, and spending score. Can be used by marketing analysts to produce a better strategy for mall specific marketing.
Challenges Faced: 1. Data Cleaning: Overcoming inconsistencies and missing values required meticulous attention. 2. Statistical Analysis: Interpreting demographic data accurately demanded collaborative effort. 3. Visualization: Crafting informative visuals to convey insights effectively posed design challenges.
Research Topics: 1. Consumer Behavior Analysis: Exploring psychological factors driving purchasing decisions. 2. Market Segmentation Strategies: Investigating effective targeting based on demographic characteristics.
Suggestions for Project Expansion: 1. Incorporate External Data: Integrate social media analytics or geographic data to enrich customer insights. 2. Advanced Analytics Techniques: Explore advanced statistical methods and machine learning algorithms for deeper analysis. 3. Real-Time Monitoring: Develop tools for agile decision-making through continuous customer behavior tracking. This summary outlines the demographic analysis of shopping behavior, highlighting key insights, dataset characteristics, team contributions, challenges, research topics, and suggestions for project expansion. Leveraging these insights can enhance marketing strategies and drive business growth in the retail sector.
References OpenAI. (2022). ChatGPT [Computer software]. Retrieved from https://openai.com/chatgpt. Mustafa, Z. (2022). Shopping Mall Customer Segmentation Data [Data set]. Kaggle. Retrieved from https://www.kaggle.com/datasets/zubairmustafa/shopping-mall-customer-segmentation-data Donkeys. (n.d.). Kaggle Python API [Jupyter Notebook]. Kaggle. Retrieved from https://www.kaggle.com/code/donkeys/kaggle-python-api/notebook Pandas-Datareader. (n.d.). Retrieved from https://pypi.org/project/pandas-datareader/
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dataset was collected from Kaggle. It includes various features related to customer demographics, purchasing behavior, and other relevant metrics.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains the full responses from a structured survey conducted in Colombia (2024) aimed at analyzing the relationships between perceived brand ethics, trust, service quality, customer experience, perceived value, brand engagement, and loyalty. The study includes socio-demographic segmentation by educational level and focuses on the consumer perception of Alpina®, a leading brand in the Latin American food industry.
Facebook
TwitterGapMaps GIS data for USA and Canada sourced from Applied Geographic Solutions (AGS) includes an extensive range of the highest quality demographic and lifestyle segmentation products. All databases are derived from superior source data and the most sophisticated, refined, and proven methodologies.
GIS Data attributes include:
Latest Estimates and Projections The estimates and projections database includes a wide range of core demographic data variables for the current year and 5- year projections, covering five broad topic areas: population, households, income, labor force, and dwellings.
Crime Risk Crime Risk is the result of an extensive analysis of a rolling seven years of FBI crime statistics. Based on detailed modeling of the relationships between crime and demographics, Crime Risk provides an accurate view of the relative risk of specific crime types (personal, property and total) at the block and block group level.
Panorama Segmentation AGS has created a segmentation system for the United States called Panorama. Panorama has been coded with the MRI Survey data to bring you Consumer Behavior profiles associated with this segmentation system.
Business Counts Business Counts is a geographic summary database of business establishments, employment, occupation and retail sales.
Non-Resident Population The AGS non-resident population estimates utilize a wide range of data sources to model the factors which drive tourists to particular locations, and to match that demand with the supply of available accommodations.
Consumer Expenditures AGS provides current year and 5-year projected expenditures for over 390 individual categories that collectively cover almost 95% of household spending.
Retail Potential This tabulation utilizes the Census of Retail Trade tables which cross-tabulate store type by merchandise line.
Environmental Risk The environmental suite of data consists of several separate database components including: -Weather Risks -Seismological Risks -Wildfire Risk -Climate -Air Quality -Elevation and terrain
Primary Use Cases for GapMaps GIS Data:
Integrate AGS demographic data with your existing GIS or BI platform to generate powerful visualizations.
Finance / Insurance (eg. Hedge Funds, Investment Advisors, Investment Research, REITs, Private Equity, VC)
Network Planning
Customer (Risk) Profiling for insurance/loan approvals
Target Marketing
Competitive Analysis
Market Optimization
Commercial Real-Estate (Brokers, Developers, Investors, Single & Multi-tenant O/O)
Tenant Recruitment
Target Marketing
Market Potential / Gap Analysis
Marketing / Advertising (Billboards/OOH, Marketing Agencies, Indoor Screens)
Customer Profiling
Target Marketing
Market Share Analysis
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dataset comprises 788 entries with 37 columns, providing demographic, behavioral, attitudinal, and environmental data likely centered around consumer behaviors related to organic products. Demographic variables include age, sex, and education level, capturing essential background information on each respondent. Behavioral beliefs are represented across ten items (BB1 to BB10), suggesting specific beliefs or behaviors related to the topic. Additionally, variables such as frequency (Freq), volume of purchases (Vol), and average purchase amount (AvePurch) detail purchasing behaviors. The dataset also includes five belief items (Belief1 to Belief5) along with an aggregated Belief score, and similarly, five attitude items (Att1 to Att5) with an overall Attitude score. Environmental concerns are captured through five items (Env1 to Env5), with a combined Environ score that may represent an overall environmental attitude. Notably, the last two columns (Environ and Unnamed: 36) have numerous missing values, which may need addressing for analysis.
The survey was conducted from August 1 to September 30, 2024. Respondents are from different parts of Metro Manila and the province of Cavite.
Facebook
Twitterhttps://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
1) Data Introduction • The Consumer Behavior and Shopping Habits Dataset is a tabular collection of customer demographics, purchase history, product preferences, shopping frequency, and online and offline purchasing behavior.
2) Data Utilization (1) Consumer Behavior and Shopping Habits Dataset has characteristics that: • Each row contains detailed consumer and transaction information such as customer ID, age, gender, purchased goods and categories, purchase amount, region, product attributes (size, color, season), review rating, subscription status, delivery method, discount/promotion usage, payment method, purchase frequency, etc. • Data is organized to cover a variety of variables and purchasing patterns to help segment customers, establish marketing strategies, analyze product preferences, and more. (2) Consumer Behavior and Shopping Habits Dataset can be used to: • Customer Segmentation and Target Marketing: You can analyze demographics and purchasing patterns to define different customer groups and use them to develop customized marketing strategies. • Product and service improvement: Based on purchase history, review ratings, discount/promotional responses, etc., it can be applied to product and service improvements such as identifying popular products, managing inventory, and analyzing promotion effects.
Facebook
TwitterTapestry segment descriptions can be found here..http://www.esri.com/library/brochures/pdfs/tapestry-segmentation.pdf For more than 30 years, companies, agencies, and organizations have used segmentation to divide and group their consumer markets to more precisely target their best customers and prospects. This targeting method is superior to using “scattershot” methods that might attract these preferred groups. Segmentation explains customer diversity, simplifies marketing campaigns, describes lifestyle and lifestage, and incorporates a wide range of data. Segmentation systems operate on the theory that people with similar tastes, lifestyles, and behaviors seek others with the same tastes—“like seeks like.” These behaviors can be measured, predicted, and targeted. Esri’s Tapestry Segmentation system combines the “who” of lifestyle demography with the “where” of local neighborhood geography to create a model of various lifestyle classifications or segments of actual neighborhoods with addresses—distinct behavioral market segments. The tapestry segmentation is almost comical in the sense that it trys to describe such small details of individuals daily lives just by analyzing the data provided on your CENSUS form. These segements are not only ideal for marketing and targeting lifestyles within a geographic location, but they are fun to read. Take the time to find out which segment you live in!
Facebook
Twitterhttps://brightdata.com/licensehttps://brightdata.com/license
We'll customize a Tmall dataset to align with your unique requirements, incorporating data on product titles, seller names, categories, prices, reviews, ratings, demographic insights, and other relevant metrics.
Leverage our Tmall datasets for various applications to strengthen strategic planning and market analysis. Examining these datasets enables organizations to understand consumer preferences and shopping trends, facilitating refined product offerings and optimized marketing strategies. Tailor your access to the complete dataset or specific subsets according to your business needs.
Popular use cases include optimizing product selections based on consumer insights, refining marketing strategies through targeted customer segmentation, and identifying and predicting trends to maintain a competitive edge in the online shopping market.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides an in-depth look at customer interactions and campaign performance within the digital marketing realm. It includes key metrics and demographic information that are crucial for analyzing marketing effectiveness and customer engagement. The dataset comprises the following columns:
Unique identifier for each customer, facilitating individual tracking and analysis.
Customer's age, offering insights into demographic segmentation and targeting strategies.
Customer's gender, useful for understanding gender-based preferences and behavior.
Customer's income level, providing context on purchasing power and conversion potential.
The medium through which the marketing campaign was delivered (e.g., email, social media).
The nature of the marketing campaign (e.g., promotional offer, product launch), helping to assess campaign effectiveness.
Amount spent on advertisements, crucial for evaluating cost-efficiency and ROI.
Ratio of clicks to impressions, indicating ad engagement and effectiveness.
Percentage of users who complete a desired action after interacting with an ad, reflecting the success of the campaign in driving actual sales or goals.
Number of visits to the website by the customer, measuring engagement and interest.
This dataset is ideal for exploring customer behavior, optimizing marketing strategies, and evaluating the success of various campaigns. Perfect for data scientists and marketers looking to derive actionable insights from digital marketing data.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset contains synthetic customer profiles with detailed demographic, transactional, and interaction data, enriched by AI-driven segmentation and predictive metrics. It enables robust machine learning for identifying high-value segments, personalizing marketing, and optimizing customer retention strategies in a scalable, analytics-friendly format.
Facebook
TwitterArchetype Data’s B2C Consumer File is one of the most comprehensive and data-rich consumer datasets in the United States, encompassing over 260 million verified individuals and households. Designed for precision marketing, analytics, and customer intelligence, this dataset delivers unparalleled depth across lifestyle, demographic, financial, and behavioral dimensions enabling businesses to understand, segment, and engage consumers with accuracy and confidence.
Each consumer record includes fundamental demographic elements such as name, age, gender, location, household composition, and contact information. Building upon that, Archetype Data enriches every profile with 400+ lifestyle, financial, and behavioral variables that capture consumer intent, spending capacity, purchasing habits, media preferences, and digital engagement patterns. This multidimensional view empowers marketers, insurers, and data-driven enterprises to identify not just who a consumer is—but how they live, shop, and connect.
What truly differentiates Archetype Data’s B2C file is its integration with our Linq360™ B2B2C dataset, which links consumers to the businesses they own or operate. This linkage provides a powerful bridge between professional and personal identity, offering unparalleled insight into small business owners, entrepreneurs, and professionals as both business decision-makers and consumers.
Whether activating audiences across CTV, programmatic display, social, or direct mail, our data seamlessly maps into today’s leading marketing and advertising ecosystems, including LiveRamp, The Trade Desk, and other major platforms.
The B2C Consumer File supports a wide range of applications; audience segmentation, modeling, CRM enrichment, lookalike development, and attribution measurement—across industries such as retail, finance, insurance, media, and healthcare. Whether you’re building a custom audience for a digital campaign, enriching customer records, or analyzing lifestyle trends within a region, Archetype Data’s file provides the scale and precision needed to deliver meaningful results.
Facebook
TwitterKnowing who your consumers are is essential for businesses, marketers, and researchers. This detailed demographic file offers an in-depth look at American consumers, packed with insights about personal details, household information, financial status, and lifestyle choices. Let's take a closer look at the data:
Personal Identifiers and Basic Demographics At the heart of this dataset are the key details that make up a consumer profile:
Unique IDs (PID, HHID) for individuals and households Full names (First, Middle, Last) and suffixes Gender and age Date of birth Complete location details (address, city, state, ZIP) These identifiers are critical for accurate marketing and form the base for deeper analysis.
Geospatial Intelligence This file goes beyond just listing addresses by including rich geospatial data like:
Latitude and longitude Census tract and block details Codes for Metropolitan Statistical Areas (MSA) and Core-Based Statistical Areas (CBSA) County size codes Geocoding accuracy This allows for precise geographic segmentation and localized marketing.
Housing and Property Data The dataset covers a lot of ground when it comes to housing, providing valuable insights for real estate professionals, lenders, and home service providers:
Homeownership status Dwelling type (single-family, multi-family, etc.) Property values (market, assessed, and appraised) Year built and square footage Room count, amenities like fireplaces or pools, and building quality This data is crucial for targeting homeowners with products and services like refinancing or home improvement offers.
Wealth and Financial Data For a deeper dive into consumer wealth, the file includes:
Estimated household income Wealth scores Credit card usage Mortgage info (loan amounts, rates, terms) Home equity estimates and investment property ownership These indicators are invaluable for financial services, luxury brands, and fundraising organizations looking to reach affluent individuals.
Lifestyle and Interests One of the most useful features of the dataset is its extensive lifestyle segmentation:
Hobbies and interests (e.g., gardening, travel, sports) Book preferences, magazine subscriptions Outdoor activities (camping, fishing, hunting) Pet ownership, tech usage, political views, and religious affiliations This data is perfect for crafting personalized marketing campaigns and developing products that align with specific consumer preferences.
Consumer Behavior and Purchase Habits The file also sheds light on how consumers behave and shop:
Online and catalog shopping preferences Gift-giving tendencies, presence of children, vehicle ownership Media consumption (TV, radio, internet) Retailers and e-commerce businesses will find this behavioral data especially useful for tailoring their outreach.
Demographic Clusters and Segmentation Pre-built segments like:
Household, neighborhood, family, and digital clusters Generational and lifestage groups make it easier to quickly target specific demographics, streamlining the process for market analysis and campaign planning.
Ethnicity and Language Preferences In today's multicultural market, knowing your audience's cultural background is key. The file includes:
Ethnicity codes and language preferences Flags for Hispanic/Spanish-speaking households This helps ensure culturally relevant and sensitive communication.
Education and Occupation Data The dataset also tracks education and career info:
Education level and occupation codes Home-based business indicators This data is essential for B2B marketers, recruitment agencies, and education-focused campaigns.
Digital and Social Media Habits With everyone online, digital behavior insights are a must:
Internet, TV, radio, and magazine usage Social media platform engagement (Facebook, Instagram, LinkedIn) Streaming subscriptions (Netflix, Hulu) This data helps marketers, app developers, and social media managers connect with their audience in the digital space.
Political and Charitable Tendencies For political campaigns or non-profits, this dataset offers:
Political affiliations and outlook Charitable donation history Volunteer activities These insights are perfect for cause-related marketing and targeted political outreach.
Neighborhood Characteristics By incorporating census data, the file provides a bigger picture of the consumer's environment:
Population density, racial composition, and age distribution Housing occupancy and ownership rates This offers important context for understanding the demographic landscape.
Predictive Consumer Indexes The dataset includes forward-looking indicators in categories like:
Fashion, automotive, and beauty products Health, home decor, pet products, sports, and travel These predictive insights help businesses anticipate consumer trends and needs.
Contact Information Finally, the file includes key communication details:
Multiple phone numbers (landline, mobile) and email addresses Do Not Call (DNC) flags...
Facebook
TwitterThis dataset is developed as part of a business analysis project aimed at exploring sales performance and customer demographics. It is inspired by real-world scenarios where companies strive to enhance their marketing strategies by understanding consumer behavior. The project focuses on the year 2023 and provides insights into how targeted marketing impacts sales while emphasizing demographic characteristics such as age and gender.
The dataset is synthetically generated, designed to simulate real-world sales scenarios for 20 products. It includes data points that mirror industry practices, ensuring a realistic and comprehensive foundation for analysis. The structure and data content are informed by common business intelligence practices and hypothetical yet plausible marketing scenarios.
This dataset is inspired by the challenges businesses face in balancing targeted and broad marketing strategies. Companies frequently debate whether niche marketing for specific demographics or campaigns targeting a wider audience yields better outcomes. The dataset serves as a sandbox for exploring these questions, combining data analytics, visualization, and storytelling to drive actionable business insights.
Sales Data: Includes monthly sales records for 20 products, categorized by revenue, units sold, and discounts applied.
Demographic Information: Covers customer age, gender, and location to enable segmentation and trend analysis.
Business Insights: Explore product popularity trends across different demographic groups. Revenue Analysis: Understand revenue patterns throughout 2023 and their correlation with customer age and gender.
Marketing Strategy Optimization: Evaluate the effectiveness of targeted vs. broad campaigns, particularly those targeting specific gender or age groups.
Visualization and Storytelling: Build dashboards and presentations to communicate insights effectively. This dataset is ideal for analysts and students seeking hands-on experience in SQL, exploratory data analysis, and visualization tools like Power BI. It bridges the gap between data science and practical business decision-making.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Project Overview: Customer Segmentation Using K-Means Clustering
Introduction In this project, I analysed customer data from a retail store to identify distinct customer segments. The dataset includes key attributes such as age, city, and total sales of the customers. By leveraging K-Means clustering, an unsupervised machine learning technique, I aim to group customers based on their age and sales metrics. These insights will enable the creation of targeted marketing campaigns tailored to the specific needs and behaviours of each customer segment.
Objectives - Cluster Customers: Use K-Means clustering to group customers based on age and total sales. - Analyse Segments: Examine the characteristics of each customer segment. - Targeted Marketing: Develop strategies for personalized marketing campaigns targeting each identified customer group.
Data Description The dataset comprises:
Methodology - Data Preprocessing: Clean and preprocess the data to handle any missing or inconsistent entries. - Feature Selection: Focus on age and total sales as primary features for clustering. - K-Means Clustering: Apply the K-Means algorithm to identify distinct customer segments. - Cluster Analysis: Analyse the resulting clusters to understand the demographic and sales characteristics of each group. - Marketing Strategy Development: Create targeted marketing strategies for each customer segment to enhance engagement and sales.
Expected Outcomes - Customer Segments: Clear identification of customer groups based on age and purchasing behaviour. - Insights for Marketing: Detailed understanding of each segment to inform targeted marketing efforts. - Business Impact: Enhanced ability to tailor marketing campaigns, potentially leading to increased customer satisfaction and sales.
By clustering customers based on age and total sales, this project aims to provide actionable insights for personalized marketing, ultimately driving better customer engagement and higher sales for the retail store.