https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset provides comprehensive customer data suitable for segmentation analysis. It includes anonymized demographic, transactional, and behavioral attributes, allowing for detailed exploration of customer segments. Leveraging this dataset, marketers, data scientists, and business analysts can uncover valuable insights to optimize targeted marketing strategies and enhance customer engagement. Whether you're looking to understand customer behavior or improve campaign effectiveness, this dataset offers a rich resource for actionable insights and informed decision-making.
Anonymized demographic, transactional, and behavioral data. Suitable for customer segmentation analysis. Opportunities to optimize targeted marketing strategies. Valuable insights for improving campaign effectiveness. Ideal for marketers, data scientists, and business analysts.
Segmenting customers based on demographic attributes. Analyzing purchase behavior to identify high-value customer segments. Optimizing marketing campaigns for targeted engagement. Understanding customer preferences and tailoring product offerings accordingly. Evaluating the effectiveness of marketing strategies and iterating for improvement. Explore this dataset to unlock actionable insights and drive success in your marketing initiatives!
Premium B2C Consumer Database - 269+ Million US Records
Supercharge your B2C marketing campaigns with comprehensive consumer database, featuring over 269 million verified US consumer records. Our 20+ year data expertise delivers higher quality and more extensive coverage than competitors.
Core Database Statistics
Consumer Records: Over 269 million
Email Addresses: Over 160 million (verified and deliverable)
Phone Numbers: Over 76 million (mobile and landline)
Mailing Addresses: Over 116,000,000 (NCOA processed)
Geographic Coverage: Complete US (all 50 states)
Compliance Status: CCPA compliant with consent management
Targeting Categories Available
Demographics: Age ranges, education levels, occupation types, household composition, marital status, presence of children, income brackets, and gender (where legally permitted)
Geographic: Nationwide, state-level, MSA (Metropolitan Service Area), zip code radius, city, county, and SCF range targeting options
Property & Dwelling: Home ownership status, estimated home value, years in residence, property type (single-family, condo, apartment), and dwelling characteristics
Financial Indicators: Income levels, investment activity, mortgage information, credit indicators, and wealth markers for premium audience targeting
Lifestyle & Interests: Purchase history, donation patterns, political preferences, health interests, recreational activities, and hobby-based targeting
Behavioral Data: Shopping preferences, brand affinities, online activity patterns, and purchase timing behaviors
Multi-Channel Campaign Applications
Deploy across all major marketing channels:
Email marketing and automation
Social media advertising
Search and display advertising (Google, YouTube)
Direct mail and print campaigns
Telemarketing and SMS campaigns
Programmatic advertising platforms
Data Quality & Sources
Our consumer data aggregates from multiple verified sources:
Public records and government databases
Opt-in subscription services and registrations
Purchase transaction data from retail partners
Survey participation and research studies
Online behavioral data (privacy compliant)
Technical Delivery Options
File Formats: CSV, Excel, JSON, XML formats available
Delivery Methods: Secure FTP, API integration, direct download
Processing: Real-time NCOA, email validation, phone verification
Custom Selections: 1,000+ selectable demographic and behavioral attributes
Minimum Orders: Flexible based on targeting complexity
Unique Value Propositions
Dual Spouse Targeting: Reach both household decision-makers for maximum impact
Cross-Platform Integration: Seamless deployment to major ad platforms
Real-Time Updates: Monthly data refreshes ensure maximum accuracy
Advanced Segmentation: Combine multiple targeting criteria for precision campaigns
Compliance Management: Built-in opt-out and suppression list management
Ideal Customer Profiles
E-commerce retailers seeking customer acquisition
Financial services companies targeting specific demographics
Healthcare organizations with compliant marketing needs
Automotive dealers and service providers
Home improvement and real estate professionals
Insurance companies and agents
Subscription services and SaaS providers
Performance Optimization Features
Lookalike Modeling: Create audiences similar to your best customers
Predictive Scoring: Identify high-value prospects using AI algorithms
Campaign Attribution: Track performance across multiple touchpoints
A/B Testing Support: Split audiences for campaign optimization
Suppression Management: Automatic opt-out and DNC compliance
Pricing & Volume Options
Flexible pricing structures accommodate businesses of all sizes:
Pay-per-record for small campaigns
Volume discounts for large deployments
Subscription models for ongoing campaigns
Custom enterprise pricing for high-volume users
Data Compliance & Privacy
VIA.tools maintains industry-leading compliance standards:
CCPA (California Consumer Privacy Act) compliant
CAN-SPAM Act adherence for email marketing
TCPA compliance for phone and SMS campaigns
Regular privacy audits and data governance reviews
Transparent opt-out and data deletion processes
Getting Started
Our data specialists work with you to:
Define your target audience criteria
Recommend optimal data selections
Provide sample data for testing
Configure delivery methods and formats
Implement ongoing campaign optimization
Why We Lead the Industry
With over two decades of data industry experience, we combine extensive database coverage with advanced targeting capabilities. Our commitment to data quality, compliance, and customer success has made us the preferred choice for businesses seeking superior B2C marketing performance.
Contact our team to discuss your specific targeting requirements and receive custom pricing for your marketing objectives.
Our consumer data is gathered and aggregated via surveys, digital services, and public data sources. We use powerful profiling algorithms to collect and ingest only fresh and reliable data points.
Our comprehensive data enrichment solution includes a variety of data sets that can help you address gaps in your customer data, gain a deeper understanding of your customers, and power superior client experiences.
Consumer Graph Schema & Reach: Our data reach represents the total number of counts available within various categories and comprises attributes such as country location, MAU, DAU & Monthly Location Pings:
Data Export Methodology: Since we collect data dynamically, we provide the most updated data and insights via a best-suited method on a suitable interval (daily/weekly/monthly).
Consumer Graph Use Cases:
360-Degree Customer View:Get a comprehensive image of customers by the means of internal and external data aggregation.
Data Enrichment:Leverage Online to offline consumer profiles to build holistic audience segments to improve campaign targeting using user data enrichment
Fraud Detection: Use multiple digital (web and mobile) identities to verify real users and detect anomalies or fraudulent activity.
Advertising & Marketing:Understand audience demographics, interests, lifestyle, hobbies, and behaviors to build targeted marketing campaigns.
Using Factori Consumer Data graph you can solve use cases like:
Acquisition Marketing Expand your reach to new users and customers using lookalike modeling with your first party audiences to extend to other potential consumers with similar traits and attributes.
Lookalike Modeling
Build lookalike audience segments using your first party audiences as a seed to extend your reach for running marketing campaigns to acquire new users or customers
And also, CRM Data Enrichment, Consumer Data Enrichment B2B Data Enrichment B2C Data Enrichment Customer Acquisition Audience Segmentation 360-Degree Customer View Consumer Profiling Consumer Behaviour Data
Here's the schema of Consumer Data:
person_id
first_name
last_name
age
gender
linkedin_url
twitter_url
facebook_url
city
state
address
zip
zip4
country
delivery_point_bar_code
carrier_route
walk_seuqence_code
fips_state_code
fips_country_code
country_name
latitude
longtiude
address_type
metropolitan_statistical_area
core_based+statistical_area
census_tract
census_block_group
census_block
primary_address
pre_address
streer
post_address
address_suffix
address_secondline
address_abrev
census_median_home_value
home_market_value
property_build+year
property_with_ac
property_with_pool
property_with_water
property_with_sewer
general_home_value
property_fuel_type
year
month
household_id
Census_median_household_income
household_size
marital_status
length+of_residence
number_of_kids
pre_school_kids
single_parents
working_women_in_house_hold
homeowner
children
adults
generations
net_worth
education_level
occupation
education_history
credit_lines
credit_card_user
newly_issued_credit_card_user
credit_range_new
credit_cards
loan_to_value
mortgage_loan2_amount
mortgage_loan_type
mortgage_loan2_type
mortgage_lender_code
mortgage_loan2_render_code
mortgage_lender
mortgage_loan2_lender
mortgage_loan2_ratetype
mortgage_rate
mortgage_loan2_rate
donor
investor
interest
buyer
hobby
personal_email
work_email
devices
phone
employee_title
employee_department
employee_job_function
skills
recent_job_change
company_id
company_name
company_description
technologies_used
office_address
office_city
office_country
office_state
office_zip5
office_zip4
office_carrier_route
office_latitude
office_longitude
office_cbsa_code
office_census_block_group
office_census_tract
office_county_code
company_phone
company_credit_score
company_csa_code
company_dpbc
company_franchiseflag
company_facebookurl
company_linkedinurl
company_twitterurl
company_website
company_fortune_rank
company_government_type
company_headquarters_branch
company_home_business
company_industry
company_num_pcs_used
company_num_employees
company_firm_individual
company_msa
company_msa_name
company_naics_code
company_naics_description
company_naics_code2
company_naics_description2
company_sic_code2
company_sic_code2_desc...
Most banks have a large customer base - with different characteristics in terms of age, income, values, lifestyle, and more. Customer segmentation is the process of dividing a customer dataset into specific groups based on shared traits.
According to a report from Ernst & Young, “A more granular understanding of consumers is no longer a nice-to-have item, but a strategic and competitive imperative for banking providers. Customer understanding should be a living, breathing part of everyday business, with insights underpinning the full range of banking operations.
This dataset consists of 1 Million+ transaction by over 800K customers for a bank in India. The data contains information such as - customer age (DOB), location, gender, account balance at the time of the transaction, transaction details, transaction amount, etc.
The dataset can be used for different analysis, example -
Consumer Insurance Experience & Demographic Profile
This dataset provides a detailed view of how individuals engage with insurance products, paired with demographic and lifestyle attributes to enable powerful segmentation, behavioral analysis, and customer journey mapping. By combining real-world insurance experiences with contextual information about each respondent’s background and preferences, this dataset supports a wide range of data-driven decision-making for insurance providers, policy designers, marketing teams, and product strategists.
Value of the Dataset Understanding how consumers perceive and interact with insurance offerings is critical to building products that resonate and services that retain. This dataset offers that visibility across multiple dimensions—capturing not only what type of insurance consumers hold and how they purchased it, but also what drives their satisfaction, loyalty, and likelihood to switch. Paired with demographic details like income, education, family status, and lifestyle, this information becomes a foundation for more personalized outreach, better-designed offerings, and improved customer experiences.
Because the data reflects lived experiences across diverse markets, it is particularly valuable for benchmarking consumer sentiment in emerging economies, identifying service delivery gaps, or evaluating potential uptake of new policy formats such as digital or personalized insurance.
Example Use Cases 1. Targeted Product Design A health insurer looking to launch short-term, digital-first plans could filter this dataset for consumers with low policy tenure, high digital communication preference, and dissatisfaction with current providers. This segment would inform feature design and positioning.
Competitive Analysis A provider evaluating churn risk can identify patterns among users who have filed claims but report dissatisfaction—indicating operational areas that may be driving customer loss and where improvements could increase retention.
Communication Channel Optimization By analyzing preferred communication methods across different demographic segments, insurers can tailor outreach strategies (e.g., SMS vs. in-app chat) to improve engagement and reduce support costs.
Market Expansion & Localization International insurers can explore regional variations in satisfaction drivers, awareness levels, and price sensitivity to refine go-to-market strategies in countries like Senegal, Tanzania, or the UAE.
Personalized Policy Offer Design Using data on interest in personalized policies and lifestyle indicators, providers can build customizable offerings for consumers more likely to value flexibility, such as frequent travelers or those with irregular incomes.
Insurance-Specific Fields & Descriptions Current Insurance Type Captures the kind of insurance the individual currently holds, with a focus on health insurance in this dataset.
Purchase Method Indicates how the insurance was obtained—through an agent, online, employer, etc.—to understand acquisition channels.
Policy Length Duration of the current policy, categorized (e.g., less than 1 year, 1–3 years, more than 5 years) to analyze tenure-based behaviors.
Satisfaction Self-reported satisfaction with the current insurance provider, useful for benchmarking sentiment.
Top Factor in Choosing Provider Highlights what influenced the purchase decision most—such as coverage options, customer service, pricing, or brand reputation.
Policy Review Frequency Shows how often individuals revisit their policy details or compare with alternatives, revealing levels of engagement or passive behavior.
Filed Claim A yes/no indicator showing whether the consumer has ever filed a claim, useful for analyzing downstream service experiences.
Claim Satisfaction Measures satisfaction with how past claims were handled, providing insight into operational effectiveness.
Primary Value Sought Captures what consumers value most from their insurance—e.g., peace of mind, financial protection, access to quality care.
Likelihood to Recommend Acts as a proxy for Net Promoter Score (NPS), indicating brand advocacy and potential referral behavior.
Biggest Areas for Improvement Open-ended or multi-select responses identifying where insurers can do better—lower premiums, faster claims, more digital tools, etc.
Preferred Method of Communication Indicates how consumers want to be contacted—via online chat, phone, email, SMS—supporting channel strategy optimization.
Preferred Services Details the types of updates or services consumers want—such as claims status, policy changes, or coverage recommendations.
Insurance Awareness Score Self-reported awareness of how insurance works, including policy options, rights, and terms.
Interest in Personalized Policies Captures whether the individual is open to customized insurance plans, an important indicator for usage-ba...
The Consumer Demographic database is comprised of over 80 sources and includes over 400 different data points for each individual in a household with complete PII. The fields provided include demographics, psychographic, lifestyle criteria, buying behavior, and real property identification.
Each record is ranked by confidence and only the highest quality data is used. The database is multi-sourced and contains both compiled and originated U.S. data. Additionally, the data goes through intensive cleansing including deceased processing and NCOA.
BIGDBM Privacy Policy: https://bigdbm.com/privacy.html
The 1998 Ghana Demographic and Health Survey (GDHS) is the latest in a series of national-level population and health surveys conducted in Ghana and it is part of the worldwide MEASURE DHS+ Project, designed to collect data on fertility, family planning, and maternal and child health.
The primary objective of the 1998 GDHS is to provide current and reliable data on fertility and family planning behaviour, child mortality, children’s nutritional status, and the utilisation of maternal and child health services in Ghana. Additional data on knowledge of HIV/AIDS are also provided. This information is essential for informed policy decisions, planning and monitoring and evaluation of programmes at both the national and local government levels.
The long-term objectives of the survey include strengthening the technical capacity of the Ghana Statistical Service (GSS) to plan, conduct, process, and analyse the results of complex national sample surveys. Moreover, the 1998 GDHS provides comparable data for long-term trend analyses within Ghana, since it is the third in a series of demographic and health surveys implemented by the same organisation, using similar data collection procedures. The GDHS also contributes to the ever-growing international database on demographic and health-related variables.
National
Sample survey data
The major focus of the 1998 GDHS was to provide updated estimates of important population and health indicators including fertility and mortality rates for the country as a whole and for urban and rural areas separately. In addition, the sample was designed to provide estimates of key variables for the ten regions in the country.
The list of Enumeration Areas (EAs) with population and household information from the 1984 Population Census was used as the sampling frame for the survey. The 1998 GDHS is based on a two-stage stratified nationally representative sample of households. At the first stage of sampling, 400 EAs were selected using systematic sampling with probability proportional to size (PPS-Method). The selected EAs comprised 138 in the urban areas and 262 in the rural areas. A complete household listing operation was then carried out in all the selected EAs to provide a sampling frame for the second stage selection of households. At the second stage of sampling, a systematic sample of 15 households per EA was selected in all regions, except in the Northern, Upper West and Upper East Regions. In order to obtain adequate numbers of households to provide reliable estimates of key demographic and health variables in these three regions, the number of households in each selected EA in the Northern, Upper West and Upper East regions was increased to 20. The sample was weighted to adjust for over sampling in the three northern regions (Northern, Upper East and Upper West), in relation to the other regions. Sample weights were used to compensate for the unequal probability of selection between geographically defined strata.
The survey was designed to obtain completed interviews of 4,500 women age 15-49. In addition, all males age 15-59 in every third selected household were interviewed, to obtain a target of 1,500 men. In order to take cognisance of non-response, a total of 6,375 households nation-wide were selected.
Note: See detailed description of sample design in APPENDIX A of the survey report.
Face-to-face
Three types of questionnaires were used in the GDHS: the Household Questionnaire, the Women’s Questionnaire, and the Men’s Questionnaire. These questionnaires were based on model survey instruments developed for the international MEASURE DHS+ programme and were designed to provide information needed by health and family planning programme managers and policy makers. The questionnaires were adapted to the situation in Ghana and a number of questions pertaining to on-going health and family planning programmes were added. These questionnaires were developed in English and translated into five major local languages (Akan, Ga, Ewe, Hausa, and Dagbani).
The Household Questionnaire was used to enumerate all usual members and visitors in a selected household and to collect information on the socio-economic status of the household. The first part of the Household Questionnaire collected information on the relationship to the household head, residence, sex, age, marital status, and education of each usual resident or visitor. This information was used to identify women and men who were eligible for the individual interview. For this purpose, all women age 15-49, and all men age 15-59 in every third household, whether usual residents of a selected household or visitors who slept in a selected household the night before the interview, were deemed eligible and interviewed. The Household Questionnaire also provides basic demographic data for Ghanaian households. The second part of the Household Questionnaire contained questions on the dwelling unit, such as the number of rooms, the flooring material, the source of water and the type of toilet facilities, and on the ownership of a variety of consumer goods.
The Women’s Questionnaire was used to collect information on the following topics: respondent’s background characteristics, reproductive history, contraceptive knowledge and use, antenatal, delivery and postnatal care, infant feeding practices, child immunisation and health, marriage, fertility preferences and attitudes about family planning, husband’s background characteristics, women’s work, knowledge of HIV/AIDS and STDs, as well as anthropometric measurements of children and mothers.
The Men’s Questionnaire collected information on respondent’s background characteristics, reproduction, contraceptive knowledge and use, marriage, fertility preferences and attitudes about family planning, as well as knowledge of HIV/AIDS and STDs.
A total of 6,375 households were selected for the GDHS sample. Of these, 6,055 were occupied. Interviews were completed for 6,003 households, which represent 99 percent of the occupied households. A total of 4,970 eligible women from these households and 1,596 eligible men from every third household were identified for the individual interviews. Interviews were successfully completed for 4,843 women or 97 percent and 1,546 men or 97 percent. The principal reason for nonresponse among individual women and men was the failure of interviewers to find them at home despite repeated callbacks.
Note: See summarized response rates by place of residence in Table 1.1 of the survey report.
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors, and (2) sampling errors. Nonsampling errors are the results of shortfalls made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 1998 GDHS to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 1998 GDHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 1998 GDHS sample is the result of a two-stage stratified design, and, consequently, it was necessary to use more complex formulae. The computer software used to calculate sampling errors for the 1998 GDHS is the ISSA Sampling Error Module. This module uses the Taylor linearization method of variance estimation for survey estimates that are means or proportions. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.
Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Age distribution of eligible and interviewed men - Completeness of reporting - Births by calendar years - Reporting of age at death in days - Reporting of age at death in months
Note: See detailed tables in APPENDIX C of the survey report.
The 110th Congressional District Summary File (Sample) (110CDSAMPLE) contains the sample data, which is the information compiled from the questions asked of a sample of all people and housing units. Population items include basic population totals; urban and rural; households and families; marital status; grandparents as caregivers; language and ability to speak English; ancestry; place of birth, citizenship status, and year of entry; migration; place of work; journey to work (commuting); school enrollment and educational attainment; veteran status; disability; employment status; industry, occupation, and class of worker; income; and poverty status. Housing items include basic housing totals; urban and rural; number of rooms; number of bedrooms; year moved into unit; household size and occupants per room; units in structure; year structure built; heating fuel; telephone service; plumbing and kitchen facilities; vehicles available; value of home; monthly rent; and shelter costs. The file contains subject content identical to that shown in Summary File 3 (SF 3).
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This comprehensive dataset provides detailed educational attainment and demographic analysis across all 50 US states from 2021-2023, specifically designed for tech companies planning strategic market entry and product launch decisions.
Column Name | Data Type | Description | Example Value |
---|---|---|---|
NAME | String | Full US state name | "Massachusetts" |
total_population_25plus | Integer | Total population aged 25 and above | 4,975,152 |
bachelors_degree | Integer | Number of individuals with bachelor's degrees | 1,261,847 |
masters_degree | Integer | Number of individuals with master's degrees | 788,243 |
professional_degree | Integer | Number of individuals with professional degrees (JD, MD, etc.) | 157,762 |
doctoral_degree | Integer | Number of individuals with doctoral degrees (PhD, EdD, etc.) | 169,357 |
median_household_income | Integer | Median household income in USD | $99,858 |
total_households | Float | Total number of households (in millions) | 2.41 |
state | Integer | Numeric state identifier (1-50) | 25 |
year | Integer | Data collection year | 2023 |
college_graduates | Integer | Total college graduates (bachelor's + advanced degrees) | 2,377,209 |
college_graduate_percentage | Float | Percentage of population with college degrees | 47.78% |
graduate_degree_holders | Integer | Total with master's, professional, or doctoral degrees | 1,115,362 |
graduate_degree_percentage | Float | Percentage with graduate-level degrees | 22.42% |
advanced_degree_percentage | Float | Percentage with professional or doctoral degrees | 3.40% |
education_score | Float | Composite education ranking score | 28.76 |
education_rank | Integer | State ranking based on education score (1-50, 1=highest) | 1 |
The dataset reveals that Massachusetts consistently ranks #1 in education metrics with: - 47.78% college graduation rate (2023) - 22.42% graduate degree holders - $99,858 median household income - Education score of 28.76
Perfect for identifying premium tech markets and highly-educated consumer bases for sophisticated technology products.
This dataset is ideal for data scientists, market researchers, business analysts, and tech companies looking to make data-driven decisions about market entry, customer targeting, and regional strategy.
https://www.icpsr.umich.edu/web/ICPSR/studies/8353/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/8353/terms
This is an extract of the decennial Public Use Microdata Sample (PUMS) released by the Bureau of the Census. Because the complete PUMS files contain several hundred thousand records, ICPSR has constructed this subset to allow for easier and less costly analysis. The collection of data at ten year increments allows the user to follow various age cohorts through the life-cycle. Data include information on the household and its occupants such as size and value of dwelling, utility costs, number of people in the household, and their relationship to the respondent. More detailed information was collected on the respondent, the head of household, and the spouse, if present. Variables include education, marital status, occupation and income.
BackgroundMental health professionals play a crucial role in promoting the physical well-being of people with mental illness. Awareness of HIV status can enable professionals in mental health services to provide more comprehensive care. However, it remains uncertain whether mental health professionals consistently document HIV status in mental health records.AimsTo investigate the extent to which mental health professionals document previously established HIV diagnoses of people with mental illness in mental health records, and to identify the clinical and demographic factors associated with documentation or lack thereof.MethodsA retrospective cohort study was conducted using an established data linkage between routinely collected clinical data from secondary mental health services in South London, UK, and national HIV surveillance data from the UK Health Security Agency. Individuals with an HIV diagnosis prior to their last mental health service contact were included. Documented HIV diagnosis in mental health records was assessed.ResultsAmong the 4,032 individuals identified as living with HIV, 1,281 (31.8%) did not have their diagnosis recorded in their mental health records. Factors associated with the absence of an HIV diagnosis included being of Asian ethnicity, having certain primary mental health diagnoses including schizophrenia, being older, being with a mental health service for longer, having more clinical mental health appointments, and living in a less deprived area.ConclusionsA significant number of individuals living with HIV who are receiving mental healthcare in secondary mental health services did not have their HIV diagnosis documented in their mental health records. Addressing this gap could allow mental healthcare providers to support those living with HIV and severe mental illness to manage the complexity of comorbidities and psychosocial impacts of HIV. Mental health services should explore strategies to increase dialogue around HIV in mental health settings.
A global database of Census Data that provides an understanding of population distribution at administrative and zip code levels over 55 years, past, present, and future.
Leverage up-to-date census data with population trends for real estate, market research, audience targeting, and sales territory mapping.
Self-hosted commercial demographic dataset curated based on trusted sources such as the United Nations or the European Commission, with a 99% match accuracy. The global Census Data is standardized, unified, and ready to use.
Use cases for the Global Census Database (Consumer Demographic Data)
Ad targeting
B2B Market Intelligence
Customer analytics
Real Estate Data Estimations
Marketing campaign analysis
Demand forecasting
Sales territory mapping
Retail site selection
Reporting
Audience targeting
Census data export methodology
Our consumer demographic data packages are offered in CSV format. All Demographic data are optimized for seamless integration with popular systems like Esri ArcGIS, Snowflake, QGIS, and more.
Product Features
Historical population data (55 years)
Changes in population density
Urbanization Patterns
Accurate at zip code and administrative level
Optimized for easy integration
Easy customization
Global coverage
Updated yearly
Standardized and reliable
Self-hosted delivery
Fully aggregated (ready to use)
Rich attributes
Why do companies choose our demographic databases
Standardized and unified demographic data structure
Seamless integration in your system
Dedicated location data expert
Note: Custom population data packages are available. Please submit a request via the above contact button for more details.
Demographic characteristics of sample.
The State Legislative District Summary File (Sample) (SLDSAMPLE) contains the sample data, which is the information compiled from the questions asked of a sample of all people and housing units. Population items include basic population totals; urban and rural; households and families; marital status; grandparents as caregivers; language and ability to speak English; ancestry; place of birth, citizenship status, and year of entry; migration; place of work; journey to work (commuting); school enrollment and educational attainment; veteran status; disability; employment status; industry, occupation, and class of worker; income; and poverty status. Housing items include basic housing totals; urban and rural; number of rooms; number of bedrooms; year moved into unit; household size and occupants per room; units in structure; year structure built; heating fuel; telephone service; plumbing and kitchen facilities; vehicles available; value of home; monthly rent; and shelter costs. The file contains subject content identical to that shown in Summary File 3 (SF 3).
Table legend: SD = standard deviation; FSIQ = Full Scale Intelligence Quotient.
The 2013 Turkey Demographic and Health Survey (TDHS-2013) is a nationally representative sample survey. The primary objective of the TDHS-2013 is to provide data on socioeconomic characteristics of households and women between ages 15-49, fertility, childhood mortality, marriage patterns, family planning, maternal and child health, nutritional status of women and children, and reproductive health. The survey obtained detailed information on these issues from a sample of women of reproductive age (15-49). The TDHS-2013 was designed to produce information in the field of demography and health that to a large extent cannot be obtained from other sources.
Specifically, the objectives of the TDHS-2013 included: - Collecting data at the national level that allows the calculation of some demographic and health indicators, particularly fertility rates and childhood mortality rates, - Obtaining information on direct and indirect factors that determine levels and trends in fertility and childhood mortality, - Measuring the level of contraceptive knowledge and practice by contraceptive method and some background characteristics, i.e., region and urban-rural residence, - Collecting data relative to maternal and child health, including immunizations, antenatal care, and postnatal care, assistance at delivery, and breastfeeding, - Measuring the nutritional status of children under five and women in the reproductive ages, - Collecting data on reproductive-age women about marriage, employment status, and social status
The TDHS-2013 information is intended to provide data to assist policy makers and administrators to evaluate existing programs and to design new strategies for improving demographic, social and health policies in Turkey. Another important purpose of the TDHS-2013 is to sustain the flow of information for the interested organizations in Turkey and abroad on the Turkish population structure in the absence of a reliable and sufficient vital registration system. Additionally, like the TDHS-2008, TDHS-2013 is accepted as a part of the Official Statistic Program.
National coverage
The survey covered all de jure household members (usual residents), children age 0-5 years and women age 15-49 years resident in the household.
Sample survey data [ssd]
The sample design and sample size for the TDHS-2013 makes it possible to perform analyses for Turkey as a whole, for urban and rural areas, and for the five demographic regions of the country (West, South, Central, North, and East). The TDHS-2013 sample is of sufficient size to allow for analysis on some of the survey topics at the level of the 12 geographical regions (NUTS 1) which were adopted at the second half of the year 2002 within the context of Turkey’s move to join the European Union.
In the selection of the TDHS-2013 sample, a weighted, multi-stage, stratified cluster sampling approach was used. Sample selection for the TDHS-2013 was undertaken in two stages. The first stage of selection included the selection of blocks as primary sampling units from each strata and this task was requested from the TURKSTAT. The frame for the block selection was prepared using information on the population sizes of settlements obtained from the 2012 Address Based Population Registration System. Settlements with a population of 10,000 and more were defined as “urban”, while settlements with populations less than 10,000 were considered “rural” for purposes of the TDHS-2013 sample design. Systematic selection was used for selecting the blocks; thus settlements were given selection probabilities proportional to their sizes. Therefore more blocks were sampled from larger settlements.
The second stage of sample selection involved the systematic selection of a fixed number of households from each block, after block lists were obtained from TURKSTAT and were updated through a field operation; namely the listing and mapping fieldwork. Twentyfive households were selected as a cluster from urban blocks, and 18 were selected as a cluster from rural blocks. The total number of households selected in TDHS-2013 is 14,490.
The total number of clusters in the TDHS-2013 was set at 642. Block level household lists, each including approximately 100 households, were provided by TURKSTAT, using the National Address Database prepared for municipalities. The block lists provided by TURKSTAT were updated during the listing and mapping activities.
All women at ages 15-49 who usually live in the selected households and/or were present in the household the night before the interview were regarded as eligible for the Women’s Questionnaire and were interviewed. All analysis in this report is based on de facto women.
Note: A more technical and detailed description of the TDHS-2013 sample design, selection and implementation is presented in Appendix B of the final report of the survey.
Face-to-face [f2f]
Two main types of questionnaires were used to collect the TDHS-2013 data: the Household Questionnaire and the Individual Questionnaire for all women of reproductive age. The contents of these questionnaires were based on the DHS core questionnaire. Additions, deletions and modifications were made to the DHS model questionnaire in order to collect information particularly relevant to Turkey. Attention also was paid to ensuring the comparability of the TDHS-2013 findings with previous demographic surveys carried out by the Hacettepe Institute of Population Studies. In the process of designing the TDHS-2013 questionnaires, national and international population and health agencies were consulted for their comments.
The questionnaires were developed in Turkish and translated into English.
TDHS-2013 questionnaires were returned to the Hacettepe University Institute of Population Studies by the fieldwork teams for data processing as soon as interviews were completed in a province. The office editing staff checked that the questionnaires for all selected households and eligible respondents were returned from the field. A total of 29 data entry staff were trained for data entry activities of the TDHS-2013. The data entry of the TDHS-2013 began in late September 2013 and was completed at the end of January 2014.
The data were entered and edited on microcomputers using the Census and Survey Processing System (CSPro) software. CSPro is designed to fulfill the census and survey data processing needs of data-producing organizations worldwide. CSPro is developed by MEASURE partners, the U.S. Bureau of the Census, ICF International’s DHS Program, and SerPro S.A. CSPro allows range, skip, and consistency errors to be detected and corrected at the data entry stage. During the data entry process, 100% verification was performed by entering each questionnaire twice using different data entry operators and comparing the entered data.
In all, 14,490 households were selected for the TDHS-2013. At the time of the listing phase of the survey, 12,640 households were considered occupied and, thus, eligible for interview. Of the eligible households, 93 percent (11,794) households were successfully interviewed. The main reasons the field teams were unable to interview some households were because some dwelling units that had been listed were found to be vacant at the time of the interview or the household was away for an extended period.
In the interviewed 11,794 households, 10,840 women were identified as eligible for the individual interview, aged 15-49 and were present in the household on the night before the interview. Interviews were successfully completed with 9,746 of these women (90 percent). Among the eligible women not interviewed in the survey, the principal reason for nonresponse was the failure to find the women at home after repeated visits to the household.
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors, and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the TDHS-2013 to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the TDHS-2013 is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall
This study explores the nuances of information sharing in the context of infodemics, with a concentrated examination of the effects of opinion leaders and information attention on users’ disposition towards sharing information during public health emergencies. The research adopts a quantitative methodology, employing Structural Equation Modeling (SEM) to empirically test the proposed hypotheses. By employing a rigorous analytical framework, the research also scrutinizes the mediating role of risk perception in shaping users’ intentions to disseminate information related to public health emergencies. Additionally, it investigates the moderating effect of perceived usefulness, shedding light on how it influences the strength of the relationship between information attention and risk perception. The findings underscore the significance for public health communication strategies, emphasizing targeted messaging utilizing trusted opinion leaders and emphasizing information utility to foster responsible sharing. This research contributes to the academic conversation on infodemic management, providing empirical insights to guide policies and practices in mitigating misinformation during public health emergencies.
https://www.icpsr.umich.edu/web/ICPSR/studies/36998/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/36998/terms
The American Community Survey (ACS) is an ongoing statistical survey that samples a small percentage of the population every year -- giving communities the information they need to plan investments and services. The 5-year public use microdata sample (PUMS) for 2012-2016 is a subset of the 2012-2012 ACS sample. It contains the same sample as the combined PUMS 1-year files for 2012, 2013, 2014, 2015 and 2016. This data collection provides a person-level subset of 133,781 respondents whose occupations were coded as arts-related in the 2011-2015 ACS PUMS. The 2012-2016 PUMS is the seventh 5-year file published by the ACS. This data collection contains five years of data for the population from households and the group quarters (GQ) population. The GQ population and population from households are all weighted to agree with the ACS counts which are an average over the five year period (2012-2016). The ACS sample was selected from all counties across the nation. The ACS provides social, housing, and economic characteristics for demographic groups covering a broad spectrum of geographic areas in the United States. For a more detailed list of variables of what these categories include please see the decriptions of variables section.
The Consumer Expenditure Survey (CE) program provides a continuous and comprehensive flow of data on the buying habits of American consumers, including data on their expenditures, income, and consumer unit (families and single consumers) characteristics. These data are used widely in economic research and analysis, and in support of revisions of the Consumer Price Index.
The CE program is comprised of two separate components (each with its own survey questionnaire and independent sample), the Diary Survey and the quarterly Interview Survey (ICPSR 36237). This data collection contains the Diary Survey component, which was designed to obtain data on frequently purchased smaller items, including food, housing, apparel and services, transportation, entertainment, and out-of-pocket health care costs. Each consumer unit (CU) recorded its expenditures in a diary for two consecutive 1-week periods. Although the diary was designed to collect information on expenditures that could not be easily recalled over time, respondents were asked to report all expenses (except overnight travel) that the CU incurred during the survey week.
The 2013 Diary Survey release contains five sets of data files (FMLD, MEMD, EXPD, DTBD, DTID), and one processing file (DSTUB). The FMLD, MEMD, EXPD, DTBD, and DTID files are organized by the quarter of the calendar year in which the data were collected. There are four quarterly datasets for each of these files.
The FMLD files contain CU characteristics, income, and summary level expenditures; the MEMD files contain member characteristics and income data; the EXPD files contain detailed weekly expenditures at the Universal Classification Code (UCC) level; the DTBD files contain the CU's reported annual income values or the mean of the five imputed income values in the multiple imputation method; and the DTID files contain the five imputed income values. Please note that the summary level expenditure and income information on the FMLD files permit the data user to link consumer spending, by general expenditure category, and household characteristics and demographics on one set of files.
The DSTUB file provides the aggregation scheme used in the published consumer expenditure tables. The DSTUB file is further explained in Section III.F.6. 'Processing Files' of the Diary Survey Users' Guide. A second documentation guide, the 'Users' Guide to Income Imputation,' includes information on how to appropriately use the imputed income data.
Demographic and family characteristics data include age, sex, race, marital status, and CU relationships for each CU member. Income information was also collected, such as wage, salary, unemployment compensation, child support, and alimony, as well as information on the employment of each CU member age 14 and over.
The unpublished integrated CE data tables produced by the BLS are available to download through NADAC (click on 'Other' in the Dataset(s) section). The tables show average and percentile expenditures for detailed items, as well as the standard error and coefficient of variation (CV) for each spending estimate. The BLS unpublished integrated CE data tables are provided as an easy-to-use tool for obtaining spending estimates. However, users are cautioned to read the BLS explanatory letter accompanying the tables. The letter explains that estimates of average expenditures on detailed spending items (such as leisure and art-related categories) may be unreliable due to so few reports of expenditures for those items.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Data associated with the paper: Who Tweets with Their Location? Understanding the Relationship Between Demographic Characteristics and the Use of Geoservices and Geotagging on Twitter Luke Sloan & Jeffrey Morgan
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset provides comprehensive customer data suitable for segmentation analysis. It includes anonymized demographic, transactional, and behavioral attributes, allowing for detailed exploration of customer segments. Leveraging this dataset, marketers, data scientists, and business analysts can uncover valuable insights to optimize targeted marketing strategies and enhance customer engagement. Whether you're looking to understand customer behavior or improve campaign effectiveness, this dataset offers a rich resource for actionable insights and informed decision-making.
Anonymized demographic, transactional, and behavioral data. Suitable for customer segmentation analysis. Opportunities to optimize targeted marketing strategies. Valuable insights for improving campaign effectiveness. Ideal for marketers, data scientists, and business analysts.
Segmenting customers based on demographic attributes. Analyzing purchase behavior to identify high-value customer segments. Optimizing marketing campaigns for targeted engagement. Understanding customer preferences and tailoring product offerings accordingly. Evaluating the effectiveness of marketing strategies and iterating for improvement. Explore this dataset to unlock actionable insights and drive success in your marketing initiatives!