Facebook
Twitterhttps://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/
This dataset was sourced from KPMG AU's Data Analytics virtual internship course on Forage
Sprocket Pvt Ltd is a client of KPMG AU. Sprocket is a bike and bike accessories retail business. They need to find the right customer segment to target for marketing to boost revenue. The following dataset is of their customer demographics for the past 3 years.
The original dataset of 3 separate sheets of Customer demographic, Transactions, and Customer Addresses was fully cleaned and merged using a power query. Data types of columns were changed, and values of certain columns which had illegal values were corrected using a standard approach. This final master dataset can be used for customer segmentation projects using clustering methods.
Facebook
TwitterThis profile is designed to accompany the Joint Strategic Needs Assessment (JSNA) chapter on Demographics, which looks at segmenting the borough’s population by their most significant health and social care need. This supplement looks at adults (aged 18 and over) instead of the overall population, because the health and social care need segments covered in this section are more common in adults.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Description: This dataset contains 1,000 anonymized records of individuals, capturing a mix of demographic, financial, health, and behavioral attributes. The data is structured to support analysis in areas such as market research, risk assessment, public health studies, and customer segmentation.
Key Attributes: Personal Information
Name: Full name of the individual (synthetic).
Age: Age in years (range: 18–80).
Gender: Binary classification (Male/Female).
Financial Metrics
Annual Income: Yearly earnings in USD (range: 20K–250K).
Credit Score: FICO-like score (range: 300–850).
Transaction Frequency: Monthly transactions (count).
Health Indicators
BMI: Body Mass Index (range: 15–45).
Blood Pressure (Systolic): mmHg (range: 90–180).
Geospatial Data
Latitude: Approximate location (32.0°–42.0° N).
Longitude: Approximate location (-120.0°–-75.0° W).
Behavioral Data
Monthly Data Usage: Internet consumption in GB.
Potential Use Cases: Market Research: Segment customers by income, location, or spending habits.
Health Analytics: Study correlations between age, BMI, and blood pressure.
Financial Modeling: Assess credit risk based on income and transaction behavior.
Geospatial Analysis: Map demographic trends across regions.
Data Quality Notes: Contains 3% missing values (randomly distributed).
Numeric values are rounded for readability (e.g., BMI to 1 decimal place).
Facebook
TwitterThe User Profile Data is a structured, anonymized dataset designed to help organizations understand who their users are, what devices they use, and where they are located. Each record provides privacy-compliant linkages between user IDs, demographic profiles, device intelligence, and geolocation data, offering deep context for analytics, segmentation, and personalization.
Built for privacy-safe analytics, the dataset uses hashed identifiers like phone number and email and standardized formats, making it easy to integrate into big-data platforms, AI pipelines, and machine learning models for advanced analytics.
Demographic insights include gender, age, and age group, essential for audience profiling, marketing optimization, and consumer intelligence. All gender data is user-declared and AI-verified through image-based avatar validation, ensuring data accuracy and authenticity.
The dataset’s Device Intelligence Layer includes rich technical attributes such as device brand, model, OS version, user agent, RAM, language, and timezone, enabling technical segmentation, performance analytics, and targeted ad delivery across diverse device ecosystems.
On the location and POI front, the dataset combines GPS-based and IP-based coordinates—including country, region, city, latitude, longitude —to provide high-precision geospatial insights. This enables mobility pattern analysis, market expansion planning, and POI clustering for advanced location intelligence.
Each user record contains onboarding and lifecycle fields like unique IDs, and profile update timestamps, allowing accurate tracking of user acquisition trends, data freshness, and activity duration.
🔍 Key Features • 1st-party, consent-based demographic & device data • AI-verified gender insights via avatar recognition • OS-level app data with 120+ daily sessions per user • Global coverage across APAC and emerging markets • GPS + IP-based geolocation & POI intelligence • Privacy-compliant, hashed identifiers for safe integration
🚀 Use Cases • Audience segmentation & lookalike modeling • Ad-tech and mar-tech optimization • Geospatial & POI analytics • Fraud detection & risk scoring • Personalization & recommendation engines • App performance & device compatibility insights
🏢 Industries Served Ad-Tech • Mar-Tech • FinTech • Telecom • Retail Analytics • Consumer Intelligence • AI & ML Platforms
Facebook
TwitterThis factsheet breaks down Camden’s population by looking at health conditions, and then by their age, sex, ethnicity, and deprivation. Understanding the size and characteristics of each segment helps us plan healthcare resources and service delivery effectively for each group, as well as the population in general.
Facebook
TwitterODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
This filtered view contains the population estimates for San Francisco demographic groups from the U.S. Census Bureau’s American Community Survey that are used by Controller's Office - City Performance Unit for reporting on Police Stops
San Francisco Population and Demographic Census data dataset filtered on: "reporting_segment" = 'Police Reporting Demographic Categories'
A. SUMMARY This dataset contains population and demographic estimates and associated margins of error obtained and derived from the US Census. The data is presented over multiple years and geographies. The data is sourced primarily from the American Community Survey.
B. HOW THE DATASET IS CREATED The raw data is obtained from the census API. Some estimates as published as-is and some are derived.
C. UPDATE PROCESS New estimates and years of data are appended to this dataset. To request additional census data for San Francisco, email support@datasf.org
D. HOW TO USE THIS DATASET The dataset is long and contains multiple estimates, years and geographies. To use this dataset, you can filter by the overall segment which contains information about the source, years, geography, demographic category and reporting segment. For census data used in specific reports, you can filter to the reporting segment. To use a subset of the data, you can create a filtered view. More information of how to filter data and create a view can be found here
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This simulated customer dataset provides a practical foundation for performing segmentation analysis and identifying distinct customer groups. The dataset encompasses a blend of demographic and behavioral information, equipping users with the necessary data to develop targeted marketing strategies, personalize customer experiences, and ultimately drive sales growth.
This dataset is structured to provide a comprehensive view of each customer, combining demographic information with detailed purchasing behavior. The columns included are:
The insights derived from this dataset can be applied to several key business areas:
Facebook
TwitterSuccess.ai’s Consumer Marketing Data API empowers your marketing, analytics, and product teams with on-demand access to a vast and continuously updated dataset of consumer insights. Covering detailed demographics, behavioral patterns, and purchasing histories, this API enables you to go beyond generic outreach and craft tailored campaigns that truly resonate with your target audiences.
With AI-validated accuracy and support for precise filtering, the Consumer Marketing Data API ensures you’re always equipped with the most relevant data. Backed by our Best Price Guarantee, this solution is essential for refining your strategies, improving conversion rates, and driving sustainable growth in today’s competitive consumer landscape.
Why Choose Success.ai’s Consumer Marketing Data API?
Tailored Consumer Insights for Precision Targeting
Comprehensive Global Reach
Continuously Updated and Real-Time Data
Ethical and Compliant
Data Highlights:
Key Features of the Consumer Marketing Data API:
Granular Targeting and Segmentation
Flexible and Seamless Integration
Continuous Data Enrichment
AI-Driven Validation
Strategic Use Cases:
Highly Personalized Marketing Campaigns
Market Expansion and Product Launches
Competitive Analysis and Trend Forecasting
Customer Retention and Loyalty Programs
Why Choose Success.ai?
Best Price Guarantee
Seamless Integration
Data Accuracy with AI Validation
Customizable and Scalable Solutions
Facebook
TwitterTapestry segment descriptions can be found here..http://www.esri.com/library/brochures/pdfs/tapestry-segmentation.pdf For more than 30 years, companies, agencies, and organizations have used segmentation to divide and group their consumer markets to more precisely target their best customers and prospects. This targeting method is superior to using “scattershot” methods that might attract these preferred groups. Segmentation explains customer diversity, simplifies marketing campaigns, describes lifestyle and lifestage, and incorporates a wide range of data. Segmentation systems operate on the theory that people with similar tastes, lifestyles, and behaviors seek others with the same tastes—“like seeks like.” These behaviors can be measured, predicted, and targeted. Esri’s Tapestry Segmentation system combines the “who” of lifestyle demography with the “where” of local neighborhood geography to create a model of various lifestyle classifications or segments of actual neighborhoods with addresses—distinct behavioral market segments. The tapestry segmentation is almost comical in the sense that it trys to describe such small details of individuals daily lives just by analyzing the data provided on your CENSUS form. These segements are not only ideal for marketing and targeting lifestyles within a geographic location, but they are fun to read. Take the time to find out which segment you live in!
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset provides detailed records of survey respondents, including demographic information, completion rates, segmentation labels, and response quality metrics. It enables in-depth analysis of participant behavior, demographic trends, and survey effectiveness, making it ideal for market research, academic studies, and customer insights.
Facebook
TwitterGapMaps GIS data for USA and Canada sourced from Applied Geographic Solutions (AGS) includes an extensive range of the highest quality demographic and lifestyle segmentation products. All databases are derived from superior source data and the most sophisticated, refined, and proven methodologies.
GIS Data attributes include:
Latest Estimates and Projections The estimates and projections database includes a wide range of core demographic data variables for the current year and 5- year projections, covering five broad topic areas: population, households, income, labor force, and dwellings.
Crime Risk Crime Risk is the result of an extensive analysis of a rolling seven years of FBI crime statistics. Based on detailed modeling of the relationships between crime and demographics, Crime Risk provides an accurate view of the relative risk of specific crime types (personal, property and total) at the block and block group level.
Panorama Segmentation AGS has created a segmentation system for the United States called Panorama. Panorama has been coded with the MRI Survey data to bring you Consumer Behavior profiles associated with this segmentation system.
Business Counts Business Counts is a geographic summary database of business establishments, employment, occupation and retail sales.
Non-Resident Population The AGS non-resident population estimates utilize a wide range of data sources to model the factors which drive tourists to particular locations, and to match that demand with the supply of available accommodations.
Consumer Expenditures AGS provides current year and 5-year projected expenditures for over 390 individual categories that collectively cover almost 95% of household spending.
Retail Potential This tabulation utilizes the Census of Retail Trade tables which cross-tabulate store type by merchandise line.
Environmental Risk The environmental suite of data consists of several separate database components including: -Weather Risks -Seismological Risks -Wildfire Risk -Climate -Air Quality -Elevation and terrain
Primary Use Cases for GapMaps GIS Data:
Integrate AGS demographic data with your existing GIS or BI platform to generate powerful visualizations.
Finance / Insurance (eg. Hedge Funds, Investment Advisors, Investment Research, REITs, Private Equity, VC)
Network Planning
Customer (Risk) Profiling for insurance/loan approvals
Target Marketing
Competitive Analysis
Market Optimization
Commercial Real-Estate (Brokers, Developers, Investors, Single & Multi-tenant O/O)
Tenant Recruitment
Target Marketing
Market Potential / Gap Analysis
Marketing / Advertising (Billboards/OOH, Marketing Agencies, Indoor Screens)
Customer Profiling
Target Marketing
Market Share Analysis
Facebook
TwitterA global database of population segmentation data that provides an understanding of population distribution at administrative and zip code levels over 55 years, past, present, and future.
Leverage up-to-date audience targeting data trends for market research, audience targeting, and sales territory mapping.
Self-hosted consumer data curated based on trusted sources such as the United Nations or the European Commission, with a 99% match accuracy. The Consumer Data is standardized, unified, and ready to use.
Use cases for the Global Population Database (Consumer Data Data/Segmentation data)
Ad targeting
B2B Market Intelligence
Customer analytics
Marketing campaign analysis
Demand forecasting
Sales territory mapping
Retail site selection
Reporting
Audience targeting
Segmentation data export methodology
Our location data packages are offered in CSV format. All geospatial data are optimized for seamless integration with popular systems like Esri ArcGIS, Snowflake, QGIS, and more.
Product Features
Historical population data (55 years)
Changes in population density
Urbanization Patterns
Accurate at zip code and administrative level
Optimized for easy integration
Easy customization
Global coverage
Updated yearly
Standardized and reliable
Self-hosted delivery
Fully aggregated (ready to use)
Rich attributes
Why do companies choose our Population Databases
Standardized and unified demographic data structure
Seamless integration in your system
Dedicated location data expert
Note: Custom population data packages are available. Please submit a request via the above contact button for more details.
Facebook
TwitterODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
A. SUMMARY This dataset contains population and demographic estimates and associated margins of error obtained and derived from the US Census. The data is presented over multiple years and geographies. The data is sourced primarily from the American Community Survey. B. HOW THE DATASET IS CREATED The raw data is obtained from the census API. Some estimates as published as-is and some are derived. C. UPDATE PROCESS New estimates and years of data are appended to this dataset. To request additional census data for San Francisco, email support@datasf.org D. HOW TO USE THIS DATASET The dataset is long and contains multiple estimates, years and geographies. To use this dataset, you can filter by the overall segment which contains information about the source, years, geography, demographic category and reporting segment. For census data used in specific reports, you can filter to the reporting segment. To use a subset of the data, you can create a filtered view. More information of how to filter data and create a view can be found here
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Hotel customer dataset with 31 variables describing a total of 83,590 instances (customers). It comprehends three full years of customer behavioral data. In addition to personal and behavioral information, the dataset also contains demographic and geographical information. This dataset contributes to reducing the lack of real-world business data that can be used for educational and research purposes. The dataset can be used in data mining, machine learning, and other analytical field problems in the scope of data science. Due to its unit of analysis, it is a dataset especially suitable for building customer segmentation models, including clustering and RFM (Recency, Frequency, and Monetary value) models, but also be used in classification and regression problems.
Facebook
TwitterGapMaps premium demographic data for USA and Canada sourced from Applied Geographic Solutions (AGS) includes an extensive range of the highest quality demographic and lifestyle segmentation products. All databases are derived from superior source data and the most sophisticated, refined, and proven methodologies.
Demographic Data attributes include:
Latest Estimates and Projections The estimates and projections database includes a wide range of core demographic data variables for the current year and 5- year projections, covering five broad topic areas: population, households, income, labor force, and dwellings.
Crime Risk Crime Risk is the result of an extensive analysis of a rolling seven years of FBI crime statistics. Based on detailed modeling of the relationships between crime and demographics, Crime Risk provides an accurate view of the relative risk of specific crime types (personal, property and total) at the block and block group level.
Panorama Segmentation AGS has created a segmentation system for the United States called Panorama. Panorama has been coded with the MRI Survey data to bring you Consumer Behavior profiles associated with this segmentation system.
Business Counts Business Counts is a geographic summary database of business establishments, employment, occupation and retail sales.
Non-Resident Population The AGS non-resident population estimates utilize a wide range of data sources to model the factors which drive tourists to particular locations, and to match that demand with the supply of available accommodations.
Consumer Expenditures AGS provides current year and 5-year projected expenditures for over 390 individual categories that collectively cover almost 95% of household spending.
Retail Potential This tabulation utilizes the Census of Retail Trade tables which cross-tabulate store type by merchandise line.
Environmental Risk The environmental suite of data consists of several separate database components including: -Weather Risks -Seismological Risks -Wildfire Risk -Climate -Air Quality -Elevation and terrain
Primary Use Cases for AGS Demographic Data:
Integrate AGS demographic data with your existing GIS or BI platform to generate powerful visualizations.
Finance / Insurance (eg. Hedge Funds, Investment Advisors, Investment Research, REITs, Private Equity, VC)
Network Planning
Customer (Risk) Profiling for insurance/loan approvals
Target Marketing
Competitive Analysis
Market Optimization
Commercial Real-Estate (Brokers, Developers, Investors, Single & Multi-tenant O/O)
Tenant Recruitment
Target Marketing
Market Potential / Gap Analysis
Marketing / Advertising (Billboards/OOH, Marketing Agencies, Indoor Screens)
Customer Profiling
Target Marketing
Market Share Analysis
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The datasets contain RGB photos of Scots pine seedlings of three populations from two different ecotypes originating in the Czech Republic:Plasy - lowland ecotype,Trebon - lowland ecotype,Decin - upland ecotype.These photos were taken in three different periods (September 10th 2021, October 23rd 2021, January 22nd 2022).File dataset_for_YOLOv7_training.zip contains image data with annotations for training YOLOv7 segmentation model (training and validation sets)The dataset also contains a table with information on individual Scots pine seedlings:affiliation to parent tree (mum)affiliation to population (site)row and column in which the seedling was grown (row, col)affiliation to the planter in which the seedling was grown (box)mean RGB values of pine seedling in three different periods (B_september, G_september, R_september B_october, G_october, R_october, B_january, G_january, R_january)mean HSV values of pine seedling in three different periods (H_september, S_september, V_september, H_october, S_october, V_october, H_january, S_january, V_january)
Facebook
TwitterUncover lifestyle patterns with geo-precision: 401M verified profiles across 7 Asian countries for segmentation and KYC. Our demographic datasets include rich geo-spatial attributes that power hyper-local segmentation, regional risk scoring, and location-driven behavioral insights.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
E-Commerce Customer Segmentation Dataset This synthetic dataset contains information about 20 customers of an e-commerce platform, designed for customer segmentation and classification tasks.
Dataset Overview Each record represents a unique customer with demographic and behavioral features that help classify them into different customer segments.
Features: customer_id: Unique identifier for each customer
age: Age of the customer (years)
annual_income_k$: Annual income in thousands of dollars
spending_score: A score between 0 and 100 indicating customer spending habits (higher means more spending)
membership_years: Length of membership in years
segment: Customer segment label; possible values are:
Low (low-value customers)
Medium (medium-value customers)
High (high-value customers)
Potential Use Cases Customer segmentation
Targeted marketing campaigns
Customer lifetime value prediction
Behavioral analytics and profiling
Clustering and classification algorithm testing
Dataset Size 20 samples
6 columns
License This dataset is provided under the Apache 2.0 License.
Facebook
TwitterData-driven segmentation methods for population segmentation based on healthcare utilization
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
WanFall is a large-scale synthetic activity recognition dataset designed for fall detection and activities of daily living research. The dataset features computer-generated videos of human actors performing various activities in controlled virtual environments.
Key Features: - 12,000 video clips with dense temporal annotations - 16 activity classes including falls, posture transitions, and static states - 19,228 temporal segments with frame-level precision - 5.0625 seconds per video clip (81 frames @ 16 fps) - Rich demographic metadata (soft labels): age, gender, ethnicity, body type, height, skin tone - Scene attributes: environment, camera angle, frame rate - Multiple evaluation splits: random (80/10/10) and cross-demographic (age, ethnicity, BMI)
Use Cases: - Fall detection research - Activity recognition with temporal segmentation - Bias and fairness analysis across demographics - Cross-demographic generalization studies
Facebook
Twitterhttps://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/
This dataset was sourced from KPMG AU's Data Analytics virtual internship course on Forage
Sprocket Pvt Ltd is a client of KPMG AU. Sprocket is a bike and bike accessories retail business. They need to find the right customer segment to target for marketing to boost revenue. The following dataset is of their customer demographics for the past 3 years.
The original dataset of 3 separate sheets of Customer demographic, Transactions, and Customer Addresses was fully cleaned and merged using a power query. Data types of columns were changed, and values of certain columns which had illegal values were corrected using a standard approach. This final master dataset can be used for customer segmentation projects using clustering methods.