100+ datasets found

d
Statistics review 2: Samples and populations
catalog.data.gov
data.virginia.gov
Updated Sep 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Institutes of Health (2025). Statistics review 2: Samples and populations [Dataset]. https://catalog.data.gov/dataset/statistics-review-2-samples-and-populations
Explore at:
Dataset updated
Sep 6, 2025
Dataset provided by
National Institutes of Health
Description
The previous review in this series introduced the notion of data description and outlined some of the more common summary measures used to describe a dataset. However, a dataset is typically only of interest for the information it provides regarding the population from which it was drawn. The present review focuses on estimation of population values from a sample.
n
Census Microdata Samples Project
neuinfo.org
dknet.org
+2more
Updated Jan 29, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). Census Microdata Samples Project [Dataset]. http://identifiers.org/RRID:SCR_008902
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_008902
Dataset updated
Jan 29, 2022
Description
A data set of cross-nationally comparable microdata samples for 15 Economic Commission for Europe (ECE) countries (Bulgaria, Canada, Czech Republic, Estonia, Finland, Hungary, Italy, Latvia, Lithuania, Romania, Russia, Switzerland, Turkey, UK, USA) based on the 1990 national population and housing censuses in countries of Europe and North America to study the social and economic conditions of older persons. These samples have been designed to allow research on a wide range of issues related to aging, as well as on other social phenomena. A common set of nomenclatures and classifications, derived on the basis of a study of census data comparability in Europe and North America, was adopted as a standard for recoding. This series was formerly called Dynamics of Population Aging in ECE Countries. The recommendations regarding the design and size of the samples drawn from the 1990 round of censuses envisaged: (1) drawing individual-based samples of about one million persons; (2) progressive oversampling with age in order to ensure sufficient representation of various categories of older people; and (3) retaining information on all persons co-residing in the sampled individual''''s dwelling unit. Estonia, Latvia and Lithuania provided the entire population over age 50, while Finland sampled it with progressive over-sampling. Canada, Italy, Russia, Turkey, UK, and the US provided samples that had not been drawn specially for this project, and cover the entire population without over-sampling. Given its wide user base, the US 1990 PUMS was not recoded. Instead, PAU offers mapping modules, which recode the PUMS variables into the project''''s classifications, nomenclatures, and coding schemes. Because of the high sampling density, these data cover various small groups of older people; contain as much geographic detail as possible under each country''''s confidentiality requirements; include more extensive information on housing conditions than many other data sources; and provide information for a number of countries whose data were not accessible until recently. Data Availability: Eight of the fifteen participating countries have signed the standard data release agreement making their data available through NACDA/ICPSR (see links below). Hungary and Switzerland require a clearance to be obtained from their national statistical offices for the use of microdata, however the documents signed between the PAU and these countries include clauses stipulating that, in general, all scholars interested in social research will be granted access. Russia requested that certain provisions for archiving the microdata samples be removed from its data release arrangement. The PAU has an agreement with several British scholars to facilitate access to the 1991 UK data through collaborative arrangements. Statistics Canada and the Italian Institute of statistics (ISTAT) provide access to data from Canada and Italy, respectively. * Dates of Study: 1989-1992 * Study Features: International, Minority Oversamples * Sample Size: Approx. 1 million/country Links: * Bulgaria (1992), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/02200 * Czech Republic (1991), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06857 * Estonia (1989), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06780 * Finland (1990), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06797 * Romania (1992), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06900 * Latvia (1989), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/02572 * Lithuania (1989), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/03952 * Turkey (1990), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/03292 * U.S. (1990), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06219
d
Sample Size and Population Estimate Tables (Standard Errors and P Values) -...
catalog.data.gov
odgavaprod.ogopendata.com
Updated Sep 7, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Substance Abuse and Mental Health Services Administration (2025). Sample Size and Population Estimate Tables (Standard Errors and P Values) - 3.1 to 3.8 [Dataset]. https://catalog.data.gov/dataset/sample-size-and-population-estimate-tables-standard-errors-and-p-values-3-1-to-3-8
Explore at:
Dataset updated
Sep 7, 2025
Dataset provided by
Substance Abuse and Mental Health Services Administration
Description
These detailed tables show standard errors for sample sizes and population estimates from the 2012 National Survey on Drug Use and Health (NSDUH) Mental Health Detailed Tables. Samples sizes and population estimates are provided age group, gender, race/ethnicity, education level, employment status, county type, poverty level, insurance status, overal health, and geographic area.
w
Estimating the Size of Populations through a Household Survey 2011 - Rwanda
microdata.worldbank.org
datacatalog.ihsn.org
+1more
Updated Aug 15, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rwanda Biomedical Center/ Institute of HIV/AIDS, Disease Prevention and Control Department (RBC/IHDPC) (2017). Estimating the Size of Populations through a Household Survey 2011 - Rwanda [Dataset]. https://microdata.worldbank.org/index.php/catalog/2883
Explore at:
Dataset updated
Aug 15, 2017
Dataset authored and provided by
Rwanda Biomedical Center/ Institute of HIV/AIDS, Disease Prevention and Control Department (RBC/IHDPC)
Time period covered
2011
Area covered
Rwanda
Description
Abstract

The Estimating the Size of Populations through a Household Survey (EPSHS), sought to assess the feasibility of the network scale-up and proxy respondent methods for estimating the sizes of key populations at higher risk of HIV infection and to compare the results to other estimates of the population sizes. The study was undertaken based on the assumption that if these methods proved to be feasible with a reasonable amount of data collection for making adjustments, countries would be able to add this module to their standard household survey to produce size estimates for their key populations at higher risk of HIV infection. This would facilitate better programmatic responses for prevention and caring for people living with HIV and would improve the understanding of how HIV is being transmitted in the country.

The specific objectives of the ESPHS were: 1. To assess the feasibility of the network scale-up method for estimating the sizes of key populations at higher risk of HIV infection in a Sub-Saharan African context; 2. To assess the feasibility of the proxy respondent method for estimating the sizes of key populations at higher risk of HIV infection in a Sub-Saharan African context; 3. To estimate the population size of MSM, FSW, IDU, and clients of sex workers in Rwanda at a national level; 4. To compare the estimates of the sizes of key populations at higher risk for HIV produced by the network scale-up and proxy respondent methods with estimates produced using other methods; and 5. To collect data to be used in scientific publications comparing the use of the network scale-up method in different national and cultural environments.

Geographic coverage

National

Analysis unit

Household

Individual

Sampling procedure

The Estimating the Size of Populations through a Household Survey (ESPHS) used a two-stage sample design, implemented in a representative sample of 2,125 households selected nationwide in which all women and men age 15 years and above where eligible for an individual interview. The sampling frame used was the preparatory frame for the Rwanda Population and Housing Census (RPHC), which was conducted in 2012; it was provided by the National Institute of Statistics of Rwanda (NISR).

The sampling frame was a complete list of natural villages covering the whole country (14,837 villages). Two strata were defined: the city of Kigali and the rest of the country. One hundred and thirty Primary Sampling Units (PSU) were selected from the sampling frame (35 in Kigali and 95 in the other stratum). To reduce clustering effect, only 20 households were selected per cluster in Kigali and 15 in the other clusters. As a result, 33 percent of the households in the sample were located in Kigali.

The list of households in each cluster was updated upon arrival of the survey team in the cluster. Once the listing had been updated, a number was assigned to each existing household in the cluster. The supervisor then identified the households to be interviewed in the survey by using a table in which the households were randomly pre-selected. This table also provided the list of households pre-selected for each of the two different definitions of what it means "to know" someone.

For further details on sample design and implementation, see Appendix A of the final report.

Mode of data collection

Face-to-face [f2f]

Research instrument

The Estimating the Size of Populations through a Household Survey (ESPHS) used two types of questionnaires: a household questionnaire and an individual questionnaire. The same individual questionnaire was used to interview both women and men. In addition, two versions of the individual questionnaire were developed, using two different definitions of what it means “to know” someone. Each version of the individual questionnaire was used in half of the selected households.

Cleaning operations

The processing of the ESPHS data began shortly after the fieldwork commenced. Completed questionnaires were returned periodically from the field to the SPH office in Kigali, where they were entered and checked for consistency by data processing personnel who were specially trained for this task. Data were entered using CSPro, a programme specially developed for use in DHS surveys. All data were entered twice (100 percent verification). The concurrent processing of the data was a distinct advantage for data quality, because the School of Public Health had the opportunity to advise field teams of problems detected during data entry. The data entry and editing phase of the survey was completed in late August 2011.

Response rate

A total of 2,125 households were selected in the sample, of which 2,120 were actually occupied at the time of the interview. The number of occupied households successfully interviewed was 2,102, yielding a household response rate of 99 percent.

From the households interviewed, 2,629 women were found to be eligible and 2,567 were interviewed, giving a response rate of 98 percent. Interviews with men covered 2,102 of the eligible 2,149 men, yielding a response rate of 98 percent. The response rates do not significantly vary by type of questionnaire or residence.

Sampling error estimates

The estimates from a sample survey are affected by two types of errors: (1) non-sampling errors, and (2) sampling errors. Non-sampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made to minimize this type of error during the implementation of the Rwanda ESPHS 2011, non-sampling errors are impossible to avoid and difficult to evaluate statistically.

Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the ESPHS 2011 is only one of many samples that could have been selected from the same population, using the same design and identical size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.

A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.

If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the ESPHS 2011 sample is the result of a multi-stage stratified design, and, consequently, it was necessary to use more complex formulae. The computer software used to calculate sampling errors for the ESPHS 2011 is a SAS program. This program uses the Taylor linearization method for variance estimation for survey estimates that are means or proportions.

A more detailed description of estimates of sampling errors are presented in Appendix B of the survey report.
N
United States Population Dataset: Yearly Figures, Population Change, and...
neilsberg.com
csv, json
Updated Sep 18, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2023). United States Population Dataset: Yearly Figures, Population Change, and Percent Change Analysis [Dataset]. https://www.neilsberg.com/research/datasets/6f93a357-3d85-11ee-9abe-0aa64bf2eeb2/
Explore at:
csv, jsonAvailable download formats
Dataset updated
Sep 18, 2023
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United States
Variables measured
Annual Population Growth Rate, Population Between 2000 and 2022, Annual Population Growth Rate Percent
Measurement technique
The data presented in this dataset is derived from the 20 years data of U.S. Census Bureau Population Estimates Program (PEP) 2000 - 2022. To measure the variables, namely (a) population and (b) population change in ( absolute and as a percentage ), we initially analyzed and tabulated the data for each of the years between 2000 and 2022. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the United States population over the last 20 plus years. It lists the population for each year, along with the year on year change in population, as well as the change in percentage terms for each year. The dataset can be utilized to understand the population change of United States across the last two decades. For example, using this dataset, we can identify if the population is declining or increasing. If there is a change, when the population peaked, or if it is still growing and has not reached its peak. We can also compare the trend with the overall trend of United States population over the same period of time.

Key observations

In 2022, the population of United States was 333,287,557, a 0.38% increase year-by-year from 2021. Previously, in 2021, United States population was 332,031,554, an increase of 0.16% compared to a population of 331,511,512 in 2020. Over the last 20 plus years, between 2000 and 2022, population of United States increased by 51,125,146. In this period, the peak population was 333,287,557 in the year 2022. The numbers suggest that the population has not reached its peak yet and is showing a trend of further growth. Source: U.S. Census Bureau Population Estimates Program (PEP).

Content

When available, the data consists of estimates from the U.S. Census Bureau Population Estimates Program (PEP).

Data Coverage:

From 2000 to 2022

Variables / Data Columns

Year: This column displays the data year (Measured annually and for years 2000 to 2022)

Population: The population for the specific year for the United States is shown in this column.

Year on Year Change: This column displays the change in United States population for each year compared to the previous year.

Change in Percent: This column displays the year on year change as a percentage. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for United States Population by Year. You can refer the same here
General Population Census of 1999 - IPUMS Subset - France
microdata.worldbank.org
catalog.ihsn.org
+1more
Updated Aug 1, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
IPUMS (2025). General Population Census of 1999 - IPUMS Subset - France [Dataset]. https://microdata.worldbank.org/index.php/catalog/2147
Explore at:
Dataset updated
Aug 1, 2025
Dataset provided by
Institut National De La Statistique Et Des Etudes Economiqueshttp://insee.fr/
IPUMS
Time period covered
1999
Area covered
France
Description
Analysis unit

Persons, households, and dwellings

UNITS IDENTIFIED: - Dwellings: yes - Vacant Units: No - Households: yes - Individuals: yes - Group quarters: yes

UNIT DESCRIPTIONS: - Dwellings: no - Households: Yes - Group quarters: A collective household is a group of persons that does not live in an ordinary household, but lives in a collective establishment, sharing meal times.

Universe

Residents of France, of any nationality. Does not include French citizens living in other countries, foreign tourists, or people passing through. Reintegrated persons: Persons living in group quarters or without a fixed address but having a usual home elsewhere (i.e., enumerated away from their usual residence). During data processing, most of these people are reintegrated into their usual households. Legal population refers to the population without duplicate counts (population sans double compte) and the institutional population (population comptee a part).

Kind of data

Population and Housing Census [hh/popcen]

Sampling procedure

MICRODATA SOURCE: INSEE (Institut National de la Statisque et des Etudes Economiques)

SAMPLE SIZE (person records): 2934758.

SAMPLE DESIGN: 1/20 sample: A 1/5 systematic sample selected from 1/4 sample. 1/4 sample: a systematic sample of every 4th dwelling (or individual from institutional households). Dwellings, either for households/quasi-households or vacant dwellings, are sorted by locality and household size (if for households/quasi-households), before sampling. Individuals from communities/quasi-communities are sorted by locality, type of community and date of birth before sampling. All individuals within households constitute the 1/4 sample. Reintegrated persons: Persons living in group quarters or without a fixed address but having a usual home elsewhere (i.e., enumerated away from their usual residence). During data processing, most of these people are reintegrated into their usual households. Legal population refers to the population without duplicate counts (population sans double compte) and the institutional population (population comptee a part).

Mode of data collection

Face-to-face [f2f]

Research instrument

Form 1A for dwelling consists of (1) dwelling characteristics, (2) List A. permanent occupants of the dwelling, (3) List B. household members who do not live in the dwelling of enumeration, and (4) building characteristics; Form 2B. Individual form.
g
Sample Size and Population Estimates Tables (Prevalence Estimates) - 8.1 to...
gimi9.com
Updated Aug 1, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Sample Size and Population Estimates Tables (Prevalence Estimates) - 8.1 to 8.13 | gimi9.com [Dataset]. https://gimi9.com/dataset/data-gov_sample-size-and-population-estimates-tables-prevalence-estimates-8-1-to-8-13/
Explore at:
Dataset updated
Aug 1, 2025
Description
These detailed tables show sample sizes and population estimates from the 2010 National Survey on Drug Use and Health (NSDUH). Samples sizes and population estimates are provided by age group, gender, race/ethnicity, education level, employment status, geographic area, pregnancy status, college enrollment status, and probation/parole status.
undefined undefined: undefined | undefined (undefined)
data.census.gov
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
United States Census Bureau, undefined undefined: undefined | undefined (undefined) [Dataset]. https://data.census.gov/table/ACSDT1Y2022.B07004C?q=Race%20and%20Ethnicity&t=American%20Indian%20and%20Alaska%20Native&g=040XX00US26
Explore at:
Dataset provided by
United States Census Bureauhttp://census.gov/
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, the decennial census is the official source of population totals for April 1st of each decennial year. In between censuses, the Census Bureau's Population Estimates Program produces and disseminates the official estimates of the population for the nation, states, counties, cities, and towns and estimates of housing units for states and counties..This table provides geographical mobility for persons relative to their residence at the time they were surveyed. The characteristics crossed by geographical mobility reflect the current survey year..Information about the American Community Survey (ACS) can be found on the ACS website. Supporting documentation including code lists, subject definitions, data accuracy, and statistical testing, and a full list of ACS tables and table shells (without estimates) can be found on the Technical Documentation section of the ACS website.Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the Methodology section..Source: U.S. Census Bureau, 2022 American Community Survey 1-Year Estimates.Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see ACS Technical Documentation). The effect of nonsampling error is not represented in these tables..The Hispanic origin and race codes were updated in 2020. For more information on the Hispanic origin and race code changes, please visit the American Community Survey Technical Documentation website..The 2022 American Community Survey (ACS) data generally reflect the March 2020 Office of Management and Budget (OMB) delineations of metropolitan and micropolitan statistical areas. In certain instances the names, codes, and boundaries of the principal cities shown in ACS tables may differ from the OMB delineations due to differences in the effective dates of the geographic entities..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on 2020 Census data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:- The estimate could not be computed because there were an insufficient number of sample observations. For a ratio of medians estimate, one or both of the median estimates falls in the lowest interval or highest interval of an open-ended distribution. For a 5-year median estimate, the margin of error associated with a median was larger than the median itself.N The estimate or margin of error cannot be displayed because there were an insufficient number of sample cases in the selected geographic area. (X) The estimate or margin of error is not applicable or not available.median- The median falls in the lowest interval of an open-ended distribution (for example "2,500-")median+ The median falls in the highest interval of an open-ended distribution (for example "250,000+").** The margin of error could not be computed because there were an insufficient number of sample observations.*** The margin of error could not be computed because the median falls in the lowest interval or highest interval of an open-ended distribution.***** A margin of error is not appropriate because the corresponding estimate is controlled to an independent population or housing estimate. Effectively, the corresponding estimate has no sampling error and the margin of error may be treated as zero.
V
Sample Size and Population Estimates Tables (Standard Errors and P Values) -...
data.virginia.gov
catalog.data.gov
html
Updated Sep 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Substance Abuse and Mental Health Services Administration (2025). Sample Size and Population Estimates Tables (Standard Errors and P Values) - 8.1 to 8.13 [Dataset]. https://data.virginia.gov/dataset/sample-size-and-population-estimates-tables-standard-errors-and-p-values-8-1-to-8-132
Explore at:
htmlAvailable download formats
Dataset updated
Sep 6, 2025
Dataset provided by
Substance Abuse and Mental Health Services Administration
Description
These detailed tables show standard errors for sample sizes and population estimates from the 2011 National Survey on Drug Use and Health (NSDUH). Standard errors for samples sizes and population estimates are provided by age group, gender, race/ethnicity, education level, employment status, geographic area, pregnancy status, college enrollment status, and probation/parole status.
Population Census 2000 - IPUMS Subset - Indonesia
microdata.worldbank.org
catalog.ihsn.org
Updated Aug 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Central Bureau of Statistics (2025). Population Census 2000 - IPUMS Subset - Indonesia [Dataset]. https://microdata.worldbank.org/index.php/catalog/1053
Explore at:
Dataset updated
Aug 1, 2025
Dataset provided by
Central Bureau of Statisticshttp://cbs.gov.np/
IPUMS
Time period covered
2000
Area covered
Indonesia
Description
Analysis unit

Persons, households, and dwellings

UNITS IDENTIFIED: - Dwellings: yes - Vacant Units: no - Households: yes - Individuals: yes - Group quarters: yes

UNIT DESCRIPTIONS: - Dwellings: Not available - Households: An individual or group of people who inhabit part or all of the physical or census building, usually live together, who eat from one kitchen or organize daily needs together as one unit. - Group quarters: A special household includes people living in dormitories, barracks, or institutions in which daily needs are under the responsibility of a foundation or other organization. Also includes groups of people in lodging houses or buildings, where the total number of lodgers is ten or more.

Universe

All population residing in the geographic area of Indonesia regardless of residence status. Diplomats and their families residing in Indonesia were excluded.

Kind of data

Population and Housing Census [hh/popcen]

Sampling procedure

MICRODATA SOURCE: Central Bureau of Statistics

SAMPLE SIZE (person records): 20112539.

SAMPLE DESIGN: Geographically stratified systematic sample (drawn by IPUMS).

Mode of data collection

Face-to-face [f2f]

Research instrument

L1 questionnaire for buildings and households; L2 questionnaire for permanent residents; and L3 questionnaire for non-permanent residents (boat people, homeless persons, etc).
w
Demographic and Health Survey 2002 - Viet Nam
microdata.worldbank.org
catalog.ihsn.org
Updated Oct 26, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
General Statistical Office (GSO) (2023). Demographic and Health Survey 2002 - Viet Nam [Dataset]. https://microdata.worldbank.org/index.php/catalog/1518
Explore at:
Dataset updated
Oct 26, 2023
Dataset authored and provided by
General Statistical Office (GSO)
Time period covered
2002
Area covered
Vietnam
Description
Abstract

The 2002 Vietnam Demographic and Health Survey (VNDHS 2002) is a nationally representative sample survey of 5,665 ever-married women age 15-49 selected from 205 sample points (clusters) throughout Vietnam. It provides information on levels of fertility, family planning knowledge and use, infant and child mortality, and indicators of maternal and child health. The survey included a Community/ Health Facility Questionnaire that was implemented in each of the sample clusters.

The survey was designed to measure change in reproductive health indicators over the five years since the VNDHS 1997, especially in the 18 provinces that were targeted in the Population and Family Health Project of the Committee for Population, Family and Children. Consequently, all provinces were separated into “project” and “nonproject” groups to permit separate estimates for each. Data collection for the survey took place from 1 October to 21 December 2002.

The Vietnam Demographic and Health Survey 2002 (VNDHS 2002) was the third DHS in Vietnam, with prior surveys implemented in 1988 and 1997. The VNDHS 2002 was carried out in the framework of the activities of the Population and Family Health Project of the Committee for Population, Family and Children (previously the National Committee for Population and Family Planning).

The main objectives of the VNDHS 2002 were to collect up-to-date information on family planning, childhood mortality, and health issues such as breastfeeding practices, pregnancy care, vaccination of children, treatment of common childhood illnesses, and HIV/AIDS, as well as utilization of health and family planning services. The primary objectives of the survey were to estimate changes in family planning use in comparison with the results of the VNDHS 1997, especially on issues in the scope of the project of the Committee for Population, Family and Children.

VNDHS 2002 data confirm the pattern of rapidly declining fertility that was observed in the VNDHS 1997. It also shows a sharp decline in child mortality, as well as a modest increase in contraceptive use. Differences between project and non-project provinces are generally small.

Geographic coverage

The 2002 Vietnam Demographic and Health Survey (VNDHS 2002) is a nationally representative sample survey. The VNDHS 1997 was designed to provide separate estimates for the whole country, urban and rural areas, for 18 project provinces and the remaining nonproject provinces as well. Project provinces refer to 18 focus provinces targeted for the strengthening of their primary health care systems by the Government's Population and Family Health Project to be implemented over a period of seven years, from 1996 to 2002 (At the outset of this project there were 15 focus provinces, which became 18 by the creation of 3 new provinces from the initial set of 15). These provinces were selected according to criteria based on relatively low health and family planning status, no substantial family planning donor presence, and regional spread. These criteria resulted in the selection of the country's poorer provinces. Nine of these provinces have significant proportions of ethnic minorities among their population.

Analysis unit

Household

Women age 15-49

Universe

The population covered by the 2002 VNDHS is defined as the universe of all women age 15-49 in Vietnam.

Kind of data

Sample survey data

Sampling procedure

The sample for the VNDHS 2002 was based on that used in the VNDHS 1997, which in turn was a subsample of the 1996 Multi-Round Demographic Survey (MRS), a semi-annual survey of about 243,000 households undertaken regularly by GSO. The MRS sample consisted of 1,590 sample areas known as enumeration areas (EAs) spread throughout the 53 provinces/cities of Vietnam, with 30 EAs in each province. On average, an EA comprises about 150 households. For the VNDHS 1997, a subsample of 205 EAs was selected, with 26 households in each urban EA and 39 households for each rural EA. A total of 7,150 households was selected for the survey. The VNDHS 1997 was designed to provide separate estimates for the whole country, urban and rural areas, for 18 project provinces and the remaining nonproject provinces as well. Because the main objective of the VNDHS 2002 was to measure change in reproductive health indicators over the five years since the VNDHS 1997, the sample design for the VNDHS 2002 was as similar as possible to that of the VNDHS 1997.

Although it would have been ideal to have returned to the same households or at least the same sample points as were selected for the VNDHS 1997, several factors made this undesirable. Revisiting the same households would have held the sample artificially rigid over time and would not allow for newly formed households. This would have conflicted with the other major survey objective, which was to provide up-to-date, representative data for the whole of Vietnam. Revisiting the same sample points that were covered in 1997 was complicated by the fact that the country had conducted a population census in 1999, which allowed for a more representative sample frame.

In order to balance the two main objectives of measuring change and providing representative data, it was decided to select enumeration areas from the 1999 Population Census, but to cover the same communes that were sampled in the VNDHS 1997 and attempt to obtain a sample point as close as possible to that selected in 1997. Consequently, the VNDHS 2002 sample also consisted of 205 sample points and reflects the oversampling in the 20 provinces that fall in the World Bank-supported Population and Family Health Project. The sample was designed to produce about 7,000 completed household interviews and 5,600 completed interviews with ever-married women age 15-49.

Mode of data collection

Face-to-face

Research instrument

As in the VNDHS 1997, three types of questionnaires were used in the 2002 survey: the Household Questionnaire, the Individual Woman's Questionnaire, and the Community/Health Facility Questionnaire. The first two questionnaires were based on the DHS Model A Questionnaire, with additions and modifications made during an ORC Macro staff visit in July 2002. The questionnaires were pretested in two clusters in Hanoi (one in a rural area and another in an urban area). After the pretest and consultation with ORC Macro, the drafts were revised for use in the main survey.

a) The Household Questionnaire was used to enumerate all usual members and visitors in selected households and to collect information on age, sex, education, marital status, and relationship to the head of household. The main purpose of the Household Questionnaire was to identify persons who were eligible for individual interview (i.e. ever-married women age 15-49). In addition, the Household Questionnaire collected information on characteristics of the household such as water source, type of toilet facilities, material used for the floor and roof, and ownership of various durable goods.

b) The Individual Questionnaire was used to collect information on ever-married women aged 15-49 in surveyed households. These women were interviewed on the following topics:
- Respondent's background characteristics (education, residential history, etc.); - Reproductive history; - Contraceptive knowledge and use;
- Antenatal and delivery care; - Infant feeding practices; - Child immunization; - Fertility preferences and attitudes about family planning; - Husband's background characteristics; - Women's work information; and - Knowledge of AIDS.

c) The Community/Health Facility Questionnaire was used to collect information on all communes in which the interviewed women lived and on services offered at the nearest health stations. The Community/Health Facility Questionnaire consisted of four sections. The first two sections collected information from community informants on some characteristics such as the major economic activities of residents, distance from people's residence to civic services and the location of the nearest sources of health care. The last two sections involved visiting the nearest commune health centers and intercommune health centers, if these centers were located within 30 kilometers from the surveyed cluster. For each visited health center, information was collected on the type of health services offered and the number of days services were offered per week; the number of assigned staff and their training; medical equipment and medicines available at the time of the visit.

Cleaning operations

The first stage of data editing was implemented by the field editors soon after each interview. Field editors and team leaders checked the completeness and consistency of all items in the questionnaires. The completed questionnaires were sent to the GSO headquarters in Hanoi by post for data processing. The editing staff of the GSO first checked the questionnaires for completeness. The data were then entered into microcomputers and edited using a software program specially developed for the DHS program, the Census and Survey Processing System, or CSPro. Data were verified on a 100 percent basis, i.e., the data were entered separately twice and the two results were compared and corrected. The data processing and editing staff of the GSO were trained and supervised for two weeks by a data processing specialist from ORC Macro. Office editing and processing activities were initiated immediately after the beginning of the fieldwork and were completed in late December 2002.

Response rate

The results of the household and individual
Data from: Census of Population, 1860 [United States]: Urban Household...
search.datacite.org
icpsr.umich.edu
Updated 1989
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jon Moen (1989). Census of Population, 1860 [United States]: Urban Household Sample [Dataset]. http://doi.org/10.3886/icpsr08930.v2
Explore at:
Unique identifier
https://doi.org/10.3886/icpsr08930.v2
Dataset updated
1989
Dataset provided by
Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
DataCitehttps://www.datacite.org/
Authors
Jon Moen
Dataset funded by
University of Chicago. Booth School of Business. Center for Population Economics
Description
The Urban Household Sample of the 1860 United States Census was designed to supplement the Bateman-Foust rural sample with observations from urban areas. The sample covers both northern and southern towns and cities and permits examination of female occupations and labor force participation rates. Information on individuals includes occupation, city of residence, age, sex, race, dollar value of real and personal property owned, whether American or foreign born, and literacy. The second release of this collection adds nine constructed variables, including several weight variables, collapsed occupation, ICPSR state code, region, and unique internal family and household identifier numbers.
V
Calculating Sample Size for the NYTD Follow-up Population
data.virginia.gov
catalog.data.gov
html
Updated Sep 6, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Administration for Children and Families (2025). Calculating Sample Size for the NYTD Follow-up Population [Dataset]. https://data.virginia.gov/dataset/calculating-sample-size-for-the-nytd-follow-up-population
Explore at:
htmlAvailable download formats
Dataset updated
Sep 6, 2025
Dataset provided by
Administration for Children and Families
Description
This brief provides more information about a how a State may, for planning purposes, calculate a sample size for the NYTD follow-up population.

Metadata-only record linking to the original dataset. Open original dataset below.
Census of Population and Housing, 1980: Public Use Microdata Sample B, .1%...
archive.ciser.cornell.edu
Updated Feb 9, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bureau of the Census (2020). Census of Population and Housing, 1980: Public Use Microdata Sample B, .1% Extract [Dataset]. http://doi.org/10.6077/j5/set2fg
Explore at:
Unique identifier
https://doi.org/10.6077/j5/set2fg
Dataset updated
Feb 9, 2020
Dataset provided by
United States Census Bureauhttp://census.gov/
Authors
Bureau of the Census
Variables measured
Individual, HousingUnit
Description
The Public Use Microdata Samples (PUMS) from the 1980 Census contain person- and household-level information from the "long-form" questionnaires distributed to a sample of the population enumerated in the 1980 Census. The B Sample contains information for each state, and for households and persons residing in metropolitan areas that are too small to be separately identified and/or that cross state boundaries. Standard Metropolitan Statistical Areas (SMSAs) and county groups are defined differently here than in the A Sample [CENSUS OF POPULATION AND HOUSING, 1980 [UNITED STATES]: PUBLIC USE MICRODATA SAMPLE (A SAMPLE): 5-PERCENT SAMPLE (ICPSR 8101)]. Most states cannot be identified in their entirety. As a percentage of the l-Percent Public Use Microdata Sample (B Sample) [CENSUS OF POPULATION AND HOUSING, 1980 [UNITED STATES]: PUBLIC USE MICRODATA SAMPLE (B SAMPLE): 1-PERCENT SAMPLE (ICPSR 8170)], this file constitutes a 1-in-1000 sample, and contains all household- and person-level variables from the original B Sample. Household-level variables include housing tenure, year structure was built, number and types of rooms in dwelling, plumbing facilities, heating equipment, taxes and mortgage costs, number of children, and household and family income. Person-level variables include sex, age, marital status, race, Spanish origin, income, occupation, transportation to work, and education. (Source: retrieved from ICPSR 06/15/2011)
undefined undefined: undefined | undefined (undefined)
data.census.gov
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
United States Census Bureau, undefined undefined: undefined | undefined (undefined) [Dataset]. https://data.census.gov/cedsci/table?q=B21100&g=9700000US4808310&table=B21100&tid=ACSDT5Y2021.B21100
Explore at:
Dataset provided by
United States Census Bureauhttp://census.gov/
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, it is the Census Bureau's Population Estimates Program that produces and disseminates the official estimates of the population for the nation, states, counties, cities, and towns and estimates of housing units for states and counties..Supporting documentation on code lists, subject definitions, data accuracy, and statistical testing can be found on the American Community Survey website in the Technical Documentation section.Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the Methodology section..Source: U.S. Census Bureau, 2017-2021 American Community Survey 5-Year Estimates.Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see ACS Technical Documentation). The effect of nonsampling error is not represented in these tables..For more information about service-connected disability status and ratings, see the Veterans Statistics webpage..The 2017-2021 American Community Survey (ACS) data generally reflect the March 2020 Office of Management and Budget (OMB) delineations of metropolitan and micropolitan statistical areas. In certain instances, the names, codes, and boundaries of the principal cities shown in ACS tables may differ from the OMB delineation lists due to differences in the effective dates of the geographic entities..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on Census 2010 data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:- The estimate could not be computed because there were an insufficient number of sample observations. For a ratio of medians estimate, one or both of the median estimates falls in the lowest interval or highest interval of an open-ended distribution. For a 5-year median estimate, the margin of error associated with a median was larger than the median itself.N The estimate or margin of error cannot be displayed because there were an insufficient number of sample cases in the selected geographic area. (X) The estimate or margin of error is not applicable or not available.median- The median falls in the lowest interval of an open-ended distribution (for example "2,500-")median+ The median falls in the highest interval of an open-ended distribution (for example "250,000+").** The margin of error could not be computed because there were an insufficient number of sample observations.*** The margin of error could not be computed because the median falls in the lowest interval or highest interval of an open-ended distribution.***** A margin of error is not appropriate because the corresponding estimate is controlled to an independent population or housing estimate. Effectively, the corresponding estimate has no sampling error and the margin of error may be treated as zero.
World Health Survey 2003 - Senegal
microdata.worldbank.org
anads.ansd.sn
+2more
Updated Oct 17, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
World Health Organization (WHO) (2013). World Health Survey 2003 - Senegal [Dataset]. https://microdata.worldbank.org/index.php/catalog/1747
Explore at:
Dataset updated
Oct 17, 2013
Dataset provided by
World Health Organizationhttps://who.int/
Authors
World Health Organization (WHO)
Time period covered
2003
Area covered
Senegal
Description
Abstract

Different countries have different health outcomes that are in part due to the way respective health systems perform. Regardless of the type of health system, individuals will have health and non-health expectations in terms of how the institution responds to their needs. In many countries, however, health systems do not perform effectively and this is in part due to lack of information on health system performance, and on the different service providers.

The aim of the WHO World Health Survey is to provide empirical data to the national health information systems so that there is a better monitoring of health of the people, responsiveness of health systems and measurement of health-related parameters.

The overall aims of the survey is to examine the way populations report their health, understand how people value health states, measure the performance of health systems in relation to responsiveness and gather information on modes and extents of payment for health encounters through a nationally representative population based community survey. In addition, it addresses various areas such as health care expenditures, adult mortality, birth history, various risk factors, assessment of main chronic health conditions and the coverage of health interventions, in specific additional modules.

The objectives of the survey programme are to: 1. develop a means of providing valid, reliable and comparable information, at low cost, to supplement the information provided by routine health information systems. 2. build the evidence base necessary for policy-makers to monitor if health systems are achieving the desired goals, and to assess if additional investment in health is achieving the desired outcomes. 3. provide policy-makers with the evidence they need to adjust their policies, strategies and programmes as necessary.

Geographic coverage

The survey sampling frame must cover 100% of the country's eligible population, meaning that the entire national territory must be included. This does not mean that every province or territory need be represented in the survey sample but, rather, that all must have a chance (known probability) of being included in the survey sample.

There may be exceptional circumstances that preclude 100% national coverage. Certain areas in certain countries may be impossible to include due to reasons such as accessibility or conflict. All such exceptions must be discussed with WHO sampling experts. If any region must be excluded, it must constitute a coherent area, such as a particular province or region. For example if ¾ of region D in country X is not accessible due to war, the entire region D will be excluded from analysis.

Analysis unit

Households and individuals

Universe

The WHS will include all male and female adults (18 years of age and older) who are not out of the country during the survey period. It should be noted that this includes the population who may be institutionalized for health reasons at the time of the survey: all persons who would have fit the definition of household member at the time of their institutionalisation are included in the eligible population.

If the randomly selected individual is institutionalized short-term (e.g. a 3-day stay at a hospital) the interviewer must return to the household when the individual will have come back to interview him/her. If the randomly selected individual is institutionalized long term (e.g. has been in a nursing home the last 8 years), the interviewer must travel to that institution to interview him/her.

The target population includes any adult, male or female age 18 or over living in private households. Populations in group quarters, on military reservations, or in other non-household living arrangements will not be eligible for the study. People who are in an institution due to a health condition (such as a hospital, hospice, nursing home, home for the aged, etc.) at the time of the visit to the household are interviewed either in the institution or upon their return to their household if this is within a period of two weeks from the first visit to the household.

Kind of data

Sample survey data [ssd]

Sampling procedure

SAMPLING GUIDELINES FOR WHS

Surveys in the WHS program must employ a probability sampling design. This means that every single individual in the sampling frame has a known and non-zero chance of being selected into the survey sample. While a Single Stage Random Sample is ideal if feasible, it is recognized that most sites will carry out Multi-stage Cluster Sampling.

The WHS sampling frame should cover 100% of the eligible population in the surveyed country. This means that every eligible person in the country has a chance of being included in the survey sample. It also means that particular ethnic groups or geographical areas may not be excluded from the sampling frame.

The sample size of the WHS in each country is 5000 persons (exceptions considered on a by-country basis). An adequate number of persons must be drawn from the sampling frame to account for an estimated amount of non-response (refusal to participate, empty houses etc.). The highest estimate of potential non-response and empty households should be used to ensure that the desired sample size is reached at the end of the survey period. This is very important because if, at the end of data collection, the required sample size of 5000 has not been reached additional persons must be selected randomly into the survey sample from the sampling frame. This is both costly and technically complicated (if this situation is to occur, consult WHO sampling experts for assistance), and best avoided by proper planning before data collection begins.

All steps of sampling, including justification for stratification, cluster sizes, probabilities of selection, weights at each stage of selection, and the computer program used for randomization must be communicated to WHO

STRATIFICATION

Stratification is the process by which the population is divided into subgroups. Sampling will then be conducted separately in each subgroup. Strata or subgroups are chosen because evidence is available that they are related to the outcome (e.g. health, responsiveness, mortality, coverage etc.). The strata chosen will vary by country and reflect local conditions. Some examples of factors that can be stratified on are geography (e.g. North, Central, South), level of urbanization (e.g. urban, rural), socio-economic zones, provinces (especially if health administration is primarily under the jurisdiction of provincial authorities), or presence of health facility in area. Strata to be used must be identified by each country and the reasons for selection explicitly justified.

Stratification is strongly recommended at the first stage of sampling. Once the strata have been chosen and justified, all stages of selection will be conducted separately in each stratum. We recommend stratifying on 3-5 factors. It is optimum to have half as many strata (note the difference between stratifying variables, which may be such variables as gender, socio-economic status, province/region etc. and strata, which are the combination of variable categories, for example Male, High socio-economic status, Xingtao Province would be a stratum).

Strata should be as homogenous as possible within and as heterogeneous as possible between. This means that strata should be formulated in such a way that individuals belonging to a stratum should be as similar to each other with respect to key variables as possible and as different as possible from individuals belonging to a different stratum. This maximises the efficiency of stratification in reducing sampling variance.

MULTI-STAGE CLUSTER SELECTION

A cluster is a naturally occurring unit or grouping within the population (e.g. enumeration areas, cities, universities, provinces, hospitals etc.); it is a unit for which the administrative level has clear, nonoverlapping boundaries. Cluster sampling is useful because it avoids having to compile exhaustive lists of every single person in the population. Clusters should be as heterogeneous as possible within and as homogenous as possible between (note that this is the opposite criterion as that for strata). Clusters should be as small as possible (i.e. large administrative units such as Provinces or States are not good clusters) but not so small as to be homogenous.

In cluster sampling, a number of clusters are randomly selected from a list of clusters. Then, either all members of the chosen cluster or a random selection from among them are included in the sample. Multistage sampling is an extension of cluster sampling where a hierarchy of clusters are chosen going from larger to smaller.

In order to carry out multi-stage sampling, one needs to know only the population sizes of the sampling units. For the smallest sampling unit above the elementary unit however, a complete list of all elementary units (households) is needed; in order to be able to randomly select among all households in the TSU, a list of all those households is required. This information may be available from the most recent population census. If the last census was >3 years ago or the information furnished by it was of poor quality or unreliable, the survey staff will have the task of enumerating all households in the smallest randomly selected sampling unit. It is very important to budget for this step if it is necessary and ensure that all households are properly enumerated in order that a representative sample is obtained.

It is always best to have as many clusters in the PSU as possible. The reason for this is that the fewer the number of respondents in each PSU, the lower will be the clustering effect which
H
Current Population Survey
data.niaid.nih.gov
dataverse.harvard.edu
Updated May 31, 2011
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2011). Current Population Survey [Dataset]. http://doi.org/10.7910/DVN/35IUVQ
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/35IUVQ
Dataset updated
May 31, 2011
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Users can download data or view data tables on topics related to the labor force of the United States. Background Current Population Survey is a joint effort between the Bureau of Labor Statistics and the Census Bureau. It provides information and data on the labor force of the United States, such as: employment, unemployment, earnings, hours of work, school enrollment, health, employee benefits and income. The CPS is conducted monthly and has a sample of approximately 50,000 households. It is representative of the non-institutionalized US population. The sample provides estimates for the nation as a whole and serves as part of model-based estimates for individual states and other geographic areas. User Functionality Users can download data sets or view data tables on their topic of interest. Data can be organized by a variety of demographic variables, including: sex, age, race, marital status and educational attainment. Data is available on a national or state level. Data Notes The CPS is conducted monthly and has a sample of approximately 50,000 households. It is representative of the non-institutionalized US population. The sample provides estimates for th e nation as a whole and serves as part of model-based estimates for individual states and other geographic areas.
f
Overview of the population and sample population.
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Jan 12, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Armond, Anna Catharina; Santos, Rita; Wall, P. J.; Santos, Júlio Borlido; Lund, Thomas Bøker; Clavien, Christine; Schöpfer, Céline; Goddiksen, Mads Paludan; Olsson, I. Anna S.; Merit, Marcus Tang; Sandøe, Peter; Kovács, Nóra; Hogan, Linda; Quinn, Una; Johansen, Mikkel Willum; Varga, Orsolya (2023). Overview of the population and sample population. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001076681
Explore at:
Dataset updated
Jan 12, 2023
Authors
Armond, Anna Catharina; Santos, Rita; Wall, P. J.; Santos, Júlio Borlido; Lund, Thomas Bøker; Clavien, Christine; Schöpfer, Céline; Goddiksen, Mads Paludan; Olsson, I. Anna S.; Merit, Marcus Tang; Sandøe, Peter; Kovács, Nóra; Hogan, Linda; Quinn, Una; Johansen, Mikkel Willum; Varga, Orsolya
Description
Overview of the population and sample population.
World Health Survey 2003 - Eswatini
microdata.worldbank.org
apps.who.int
+2more
Updated Jul 18, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
World Health Organization (WHO) (2018). World Health Survey 2003 - Eswatini [Dataset]. https://microdata.worldbank.org/index.php/catalog/1751
Explore at:
Dataset updated
Jul 18, 2018
Dataset provided by
World Health Organizationhttps://who.int/
Authors
World Health Organization (WHO)
Time period covered
2003
Area covered
Eswatini
Description
Abstract

Different countries have different health outcomes that are in part due to the way respective health systems perform. Regardless of the type of health system, individuals will have health and non-health expectations in terms of how the institution responds to their needs. In many countries, however, health systems do not perform effectively and this is in part due to lack of information on health system performance, and on the different service providers.

The aim of the WHO World Health Survey is to provide empirical data to the national health information systems so that there is a better monitoring of health of the people, responsiveness of health systems and measurement of health-related parameters.

The overall aims of the survey is to examine the way populations report their health, understand how people value health states, measure the performance of health systems in relation to responsiveness and gather information on modes and extents of payment for health encounters through a nationally representative population based community survey. In addition, it addresses various areas such as health care expenditures, adult mortality, birth history, various risk factors, assessment of main chronic health conditions and the coverage of health interventions, in specific additional modules.

The objectives of the survey programme are to: 1. develop a means of providing valid, reliable and comparable information, at low cost, to supplement the information provided by routine health information systems. 2. build the evidence base necessary for policy-makers to monitor if health systems are achieving the desired goals, and to assess if additional investment in health is achieving the desired outcomes. 3. provide policy-makers with the evidence they need to adjust their policies, strategies and programmes as necessary.

Geographic coverage

The survey sampling frame must cover 100% of the country's eligible population, meaning that the entire national territory must be included. This does not mean that every province or territory need be represented in the survey sample but, rather, that all must have a chance (known probability) of being included in the survey sample.

There may be exceptional circumstances that preclude 100% national coverage. Certain areas in certain countries may be impossible to include due to reasons such as accessibility or conflict. All such exceptions must be discussed with WHO sampling experts. If any region must be excluded, it must constitute a coherent area, such as a particular province or region. For example if ¾ of region D in country X is not accessible due to war, the entire region D will be excluded from analysis.

Analysis unit

Households and individuals

Universe

The WHS will include all male and female adults (18 years of age and older) who are not out of the country during the survey period. It should be noted that this includes the population who may be institutionalized for health reasons at the time of the survey: all persons who would have fit the definition of household member at the time of their institutionalisation are included in the eligible population.

If the randomly selected individual is institutionalized short-term (e.g. a 3-day stay at a hospital) the interviewer must return to the household when the individual will have come back to interview him/her. If the randomly selected individual is institutionalized long term (e.g. has been in a nursing home the last 8 years), the interviewer must travel to that institution to interview him/her.

The target population includes any adult, male or female age 18 or over living in private households. Populations in group quarters, on military reservations, or in other non-household living arrangements will not be eligible for the study. People who are in an institution due to a health condition (such as a hospital, hospice, nursing home, home for the aged, etc.) at the time of the visit to the household are interviewed either in the institution or upon their return to their household if this is within a period of two weeks from the first visit to the household.

Kind of data

Sample survey data [ssd]

Sampling procedure

SAMPLING GUIDELINES FOR WHS

Surveys in the WHS program must employ a probability sampling design. This means that every single individual in the sampling frame has a known and non-zero chance of being selected into the survey sample. While a Single Stage Random Sample is ideal if feasible, it is recognized that most sites will carry out Multi-stage Cluster Sampling.

The WHS sampling frame should cover 100% of the eligible population in the surveyed country. This means that every eligible person in the country has a chance of being included in the survey sample. It also means that particular ethnic groups or geographical areas may not be excluded from the sampling frame.

The sample size of the WHS in each country is 5000 persons (exceptions considered on a by-country basis). An adequate number of persons must be drawn from the sampling frame to account for an estimated amount of non-response (refusal to participate, empty houses etc.). The highest estimate of potential non-response and empty households should be used to ensure that the desired sample size is reached at the end of the survey period. This is very important because if, at the end of data collection, the required sample size of 5000 has not been reached additional persons must be selected randomly into the survey sample from the sampling frame. This is both costly and technically complicated (if this situation is to occur, consult WHO sampling experts for assistance), and best avoided by proper planning before data collection begins.

All steps of sampling, including justification for stratification, cluster sizes, probabilities of selection, weights at each stage of selection, and the computer program used for randomization must be communicated to WHO

STRATIFICATION

Stratification is the process by which the population is divided into subgroups. Sampling will then be conducted separately in each subgroup. Strata or subgroups are chosen because evidence is available that they are related to the outcome (e.g. health, responsiveness, mortality, coverage etc.). The strata chosen will vary by country and reflect local conditions. Some examples of factors that can be stratified on are geography (e.g. North, Central, South), level of urbanization (e.g. urban, rural), socio-economic zones, provinces (especially if health administration is primarily under the jurisdiction of provincial authorities), or presence of health facility in area. Strata to be used must be identified by each country and the reasons for selection explicitly justified.

Stratification is strongly recommended at the first stage of sampling. Once the strata have been chosen and justified, all stages of selection will be conducted separately in each stratum. We recommend stratifying on 3-5 factors. It is optimum to have half as many strata (note the difference between stratifying variables, which may be such variables as gender, socio-economic status, province/region etc. and strata, which are the combination of variable categories, for example Male, High socio-economic status, Xingtao Province would be a stratum).

Strata should be as homogenous as possible within and as heterogeneous as possible between. This means that strata should be formulated in such a way that individuals belonging to a stratum should be as similar to each other with respect to key variables as possible and as different as possible from individuals belonging to a different stratum. This maximises the efficiency of stratification in reducing sampling variance.

MULTI-STAGE CLUSTER SELECTION

A cluster is a naturally occurring unit or grouping within the population (e.g. enumeration areas, cities, universities, provinces, hospitals etc.); it is a unit for which the administrative level has clear, nonoverlapping boundaries. Cluster sampling is useful because it avoids having to compile exhaustive lists of every single person in the population. Clusters should be as heterogeneous as possible within and as homogenous as possible between (note that this is the opposite criterion as that for strata). Clusters should be as small as possible (i.e. large administrative units such as Provinces or States are not good clusters) but not so small as to be homogenous.

In cluster sampling, a number of clusters are randomly selected from a list of clusters. Then, either all members of the chosen cluster or a random selection from among them are included in the sample. Multistage sampling is an extension of cluster sampling where a hierarchy of clusters are chosen going from larger to smaller.

In order to carry out multi-stage sampling, one needs to know only the population sizes of the sampling units. For the smallest sampling unit above the elementary unit however, a complete list of all elementary units (households) is needed; in order to be able to randomly select among all households in the TSU, a list of all those households is required. This information may be available from the most recent population census. If the last census was >3 years ago or the information furnished by it was of poor quality or unreliable, the survey staff will have the task of enumerating all households in the smallest randomly selected sampling unit. It is very important to budget for this step if it is necessary and ensure that all households are properly enumerated in order that a representative sample is obtained.

It is always best to have as many clusters in the PSU as possible. The reason for this is that the fewer the number of respondents in each PSU, the lower will be the clustering effect which
World Health Survey 2003, Wave 0 - China
microdata.worldbank.org
catalog.ihsn.org
+2more
Updated Oct 17, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
World Health Organization (WHO) (2013). World Health Survey 2003, Wave 0 - China [Dataset]. https://microdata.worldbank.org/index.php/catalog/1699
Explore at:
Dataset updated
Oct 17, 2013
Dataset provided by
World Health Organizationhttps://who.int/
Authors
World Health Organization (WHO)
Time period covered
2003
Area covered
China
Description
Abstract

Different countries have different health outcomes that are in part due to the way respective health systems perform. Regardless of the type of health system, individuals will have health and non-health expectations in terms of how the institution responds to their needs. In many countries, however, health systems do not perform effectively and this is in part due to lack of information on health system performance, and on the different service providers.

The aim of the WHO World Health Survey is to provide empirical data to the national health information systems so that there is a better monitoring of health of the people, responsiveness of health systems and measurement of health-related parameters.

The overall aims of the survey is to examine the way populations report their health, understand how people value health states, measure the performance of health systems in relation to responsiveness and gather information on modes and extents of payment for health encounters through a nationally representative population based community survey. In addition, it addresses various areas such as health care expenditures, adult mortality, birth history, various risk factors, assessment of main chronic health conditions and the coverage of health interventions, in specific additional modules.

The objectives of the survey programme are to: 1. develop a means of providing valid, reliable and comparable information, at low cost, to supplement the information provided by routine health information systems. 2. build the evidence base necessary for policy-makers to monitor if health systems are achieving the desired goals, and to assess if additional investment in health is achieving the desired outcomes. 3. provide policy-makers with the evidence they need to adjust their policies, strategies and programmes as necessary.

Geographic coverage

The survey sampling frame must cover 100% of the country's eligible population, meaning that the entire national territory must be included. This does not mean that every province or territory need be represented in the survey sample but, rather, that all must have a chance (known probability) of being included in the survey sample.

There may be exceptional circumstances that preclude 100% national coverage. Certain areas in certain countries may be impossible to include due to reasons such as accessibility or conflict. All such exceptions must be discussed with WHO sampling experts. If any region must be excluded, it must constitute a coherent area, such as a particular province or region. For example if ¾ of region D in country X is not accessible due to war, the entire region D will be excluded from analysis.

Analysis unit

Households and individuals

Universe

The WHS will include all male and female adults (18 years of age and older) who are not out of the country during the survey period. It should be noted that this includes the population who may be institutionalized for health reasons at the time of the survey: all persons who would have fit the definition of household member at the time of their institutionalisation are included in the eligible population.

If the randomly selected individual is institutionalized short-term (e.g. a 3-day stay at a hospital) the interviewer must return to the household when the individual will have come back to interview him/her. If the randomly selected individual is institutionalized long term (e.g. has been in a nursing home the last 8 years), the interviewer must travel to that institution to interview him/her.

The target population includes any adult, male or female age 18 or over living in private households. Populations in group quarters, on military reservations, or in other non-household living arrangements will not be eligible for the study. People who are in an institution due to a health condition (such as a hospital, hospice, nursing home, home for the aged, etc.) at the time of the visit to the household are interviewed either in the institution or upon their return to their household if this is within a period of two weeks from the first visit to the household.

Kind of data

Sample survey data [ssd]

Sampling procedure

SAMPLING GUIDELINES FOR WHS

Surveys in the WHS program must employ a probability sampling design. This means that every single individual in the sampling frame has a known and non-zero chance of being selected into the survey sample. While a Single Stage Random Sample is ideal if feasible, it is recognized that most sites will carry out Multi-stage Cluster Sampling.

The WHS sampling frame should cover 100% of the eligible population in the surveyed country. This means that every eligible person in the country has a chance of being included in the survey sample. It also means that particular ethnic groups or geographical areas may not be excluded from the sampling frame.

The sample size of the WHS in each country is 5000 persons (exceptions considered on a by-country basis). An adequate number of persons must be drawn from the sampling frame to account for an estimated amount of non-response (refusal to participate, empty houses etc.). The highest estimate of potential non-response and empty households should be used to ensure that the desired sample size is reached at the end of the survey period. This is very important because if, at the end of data collection, the required sample size of 5000 has not been reached additional persons must be selected randomly into the survey sample from the sampling frame. This is both costly and technically complicated (if this situation is to occur, consult WHO sampling experts for assistance), and best avoided by proper planning before data collection begins.

All steps of sampling, including justification for stratification, cluster sizes, probabilities of selection, weights at each stage of selection, and the computer program used for randomization must be communicated to WHO

STRATIFICATION

Stratification is the process by which the population is divided into subgroups. Sampling will then be conducted separately in each subgroup. Strata or subgroups are chosen because evidence is available that they are related to the outcome (e.g. health, responsiveness, mortality, coverage etc.). The strata chosen will vary by country and reflect local conditions. Some examples of factors that can be stratified on are geography (e.g. North, Central, South), level of urbanization (e.g. urban, rural), socio-economic zones, provinces (especially if health administration is primarily under the jurisdiction of provincial authorities), or presence of health facility in area. Strata to be used must be identified by each country and the reasons for selection explicitly justified.

Stratification is strongly recommended at the first stage of sampling. Once the strata have been chosen and justified, all stages of selection will be conducted separately in each stratum. We recommend stratifying on 3-5 factors. It is optimum to have half as many strata (note the difference between stratifying variables, which may be such variables as gender, socio-economic status, province/region etc. and strata, which are the combination of variable categories, for example Male, High socio-economic status, Xingtao Province would be a stratum).

Strata should be as homogenous as possible within and as heterogeneous as possible between. This means that strata should be formulated in such a way that individuals belonging to a stratum should be as similar to each other with respect to key variables as possible and as different as possible from individuals belonging to a different stratum. This maximises the efficiency of stratification in reducing sampling variance.

MULTI-STAGE CLUSTER SELECTION

A cluster is a naturally occurring unit or grouping within the population (e.g. enumeration areas, cities, universities, provinces, hospitals etc.); it is a unit for which the administrative level has clear, nonoverlapping boundaries. Cluster sampling is useful because it avoids having to compile exhaustive lists of every single person in the population. Clusters should be as heterogeneous as possible within and as homogenous as possible between (note that this is the opposite criterion as that for strata). Clusters should be as small as possible (i.e. large administrative units such as Provinces or States are not good clusters) but not so small as to be homogenous.

In cluster sampling, a number of clusters are randomly selected from a list of clusters. Then, either all members of the chosen cluster or a random selection from among them are included in the sample. Multistage sampling is an extension of cluster sampling where a hierarchy of clusters are chosen going from larger to smaller.

In order to carry out multi-stage sampling, one needs to know only the population sizes of the sampling units. For the smallest sampling unit above the elementary unit however, a complete list of all elementary units (households) is needed; in order to be able to randomly select among all households in the TSU, a list of all those households is required. This information may be available from the most recent population census. If the last census was >3 years ago or the information furnished by it was of poor quality or unreliable, the survey staff will have the task of enumerating all households in the smallest randomly selected sampling unit. It is very important to budget for this step if it is necessary and ensure that all households are properly enumerated in order that a representative sample is obtained.

It is always best to have as many clusters in the PSU as possible. The reason for this is that the fewer the number of respondents in each PSU, the lower will be the clustering effect which

Facebook

Twitter

Click to copy link

Link copied

Cite

National Institutes of Health (2025). Statistics review 2: Samples and populations [Dataset]. https://catalog.data.gov/dataset/statistics-review-2-samples-and-populations

Statistics review 2: Samples and populations

Explore at:

Dataset updated

Sep 6, 2025

Dataset provided by

National Institutes of Health

Description

The previous review in this series introduced the notion of data description and outlined some of the more common summary measures used to describe a dataset. However, a dataset is typically only of interest for the information it provides regarding the population from which it was drawn. The present review focuses on estimation of population values from a sample.

Clear search

Close search

Google apps

Main menu

Statistics review 2: Samples and populations

Census Microdata Samples Project

Sample Size and Population Estimate Tables (Standard Errors and P Values) -...

Estimating the Size of Populations through a Household Survey 2011 - Rwanda

Abstract

Geographic coverage

Analysis unit

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

Sampling error estimates

United States Population Dataset: Yearly Figures, Population Change, and...

About this dataset

Content

Inspiration

Recommended for further research

General Population Census of 1999 - IPUMS Subset - France

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Sample Size and Population Estimates Tables (Prevalence Estimates) - 8.1 to...

undefined undefined: undefined | undefined (undefined)

Sample Size and Population Estimates Tables (Standard Errors and P Values) -...

Population Census 2000 - IPUMS Subset - Indonesia

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Demographic and Health Survey 2002 - Viet Nam

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

Data from: Census of Population, 1860 [United States]: Urban Household...

Calculating Sample Size for the NYTD Follow-up Population

Census of Population and Housing, 1980: Public Use Microdata Sample B, .1%...

undefined undefined: undefined | undefined (undefined)

World Health Survey 2003 - Senegal

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Current Population Survey

Overview of the population and sample population.

World Health Survey 2003 - Eswatini

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

World Health Survey 2003, Wave 0 - China

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Statistics review 2: Samples and populations