100+ datasets found

d
Statistics review 2: Samples and populations
catalog.data.gov
data.virginia.gov
Updated Sep 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Institutes of Health (2025). Statistics review 2: Samples and populations [Dataset]. https://catalog.data.gov/dataset/statistics-review-2-samples-and-populations
Explore at:
Dataset updated
Sep 6, 2025
Dataset provided by
National Institutes of Health
Description
The previous review in this series introduced the notion of data description and outlined some of the more common summary measures used to describe a dataset. However, a dataset is typically only of interest for the information it provides regarding the population from which it was drawn. The present review focuses on estimation of population values from a sample.
World Health Survey 2003 - Belgium
microdata.worldbank.org
catalog.ihsn.org
+2more
Updated Oct 17, 2013
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
World Health Organization (WHO) (2013). World Health Survey 2003 - Belgium [Dataset]. https://microdata.worldbank.org/index.php/catalog/1694
Explore at:
Dataset updated
Oct 17, 2013
Dataset provided by
World Health Organizationhttps://who.int/
Authors
World Health Organization (WHO)
Time period covered
2003
Area covered
Belgium
Description
Abstract

Different countries have different health outcomes that are in part due to the way respective health systems perform. Regardless of the type of health system, individuals will have health and non-health expectations in terms of how the institution responds to their needs. In many countries, however, health systems do not perform effectively and this is in part due to lack of information on health system performance, and on the different service providers.

The aim of the WHO World Health Survey is to provide empirical data to the national health information systems so that there is a better monitoring of health of the people, responsiveness of health systems and measurement of health-related parameters.

The overall aims of the survey is to examine the way populations report their health, understand how people value health states, measure the performance of health systems in relation to responsiveness and gather information on modes and extents of payment for health encounters through a nationally representative population based community survey. In addition, it addresses various areas such as health care expenditures, adult mortality, birth history, various risk factors, assessment of main chronic health conditions and the coverage of health interventions, in specific additional modules.

The objectives of the survey programme are to: 1. develop a means of providing valid, reliable and comparable information, at low cost, to supplement the information provided by routine health information systems. 2. build the evidence base necessary for policy-makers to monitor if health systems are achieving the desired goals, and to assess if additional investment in health is achieving the desired outcomes. 3. provide policy-makers with the evidence they need to adjust their policies, strategies and programmes as necessary.

Geographic coverage

The survey sampling frame must cover 100% of the country's eligible population, meaning that the entire national territory must be included. This does not mean that every province or territory need be represented in the survey sample but, rather, that all must have a chance (known probability) of being included in the survey sample.

There may be exceptional circumstances that preclude 100% national coverage. Certain areas in certain countries may be impossible to include due to reasons such as accessibility or conflict. All such exceptions must be discussed with WHO sampling experts. If any region must be excluded, it must constitute a coherent area, such as a particular province or region. For example if ¾ of region D in country X is not accessible due to war, the entire region D will be excluded from analysis.

Analysis unit

Households and individuals

Universe

The WHS will include all male and female adults (18 years of age and older) who are not out of the country during the survey period. It should be noted that this includes the population who may be institutionalized for health reasons at the time of the survey: all persons who would have fit the definition of household member at the time of their institutionalisation are included in the eligible population.

If the randomly selected individual is institutionalized short-term (e.g. a 3-day stay at a hospital) the interviewer must return to the household when the individual will have come back to interview him/her. If the randomly selected individual is institutionalized long term (e.g. has been in a nursing home the last 8 years), the interviewer must travel to that institution to interview him/her.

The target population includes any adult, male or female age 18 or over living in private households. Populations in group quarters, on military reservations, or in other non-household living arrangements will not be eligible for the study. People who are in an institution due to a health condition (such as a hospital, hospice, nursing home, home for the aged, etc.) at the time of the visit to the household are interviewed either in the institution or upon their return to their household if this is within a period of two weeks from the first visit to the household.

Kind of data

Sample survey data [ssd]

Sampling procedure

SAMPLING GUIDELINES FOR WHS

Surveys in the WHS program must employ a probability sampling design. This means that every single individual in the sampling frame has a known and non-zero chance of being selected into the survey sample. While a Single Stage Random Sample is ideal if feasible, it is recognized that most sites will carry out Multi-stage Cluster Sampling.

The WHS sampling frame should cover 100% of the eligible population in the surveyed country. This means that every eligible person in the country has a chance of being included in the survey sample. It also means that particular ethnic groups or geographical areas may not be excluded from the sampling frame.

The sample size of the WHS in each country is 5000 persons (exceptions considered on a by-country basis). An adequate number of persons must be drawn from the sampling frame to account for an estimated amount of non-response (refusal to participate, empty houses etc.). The highest estimate of potential non-response and empty households should be used to ensure that the desired sample size is reached at the end of the survey period. This is very important because if, at the end of data collection, the required sample size of 5000 has not been reached additional persons must be selected randomly into the survey sample from the sampling frame. This is both costly and technically complicated (if this situation is to occur, consult WHO sampling experts for assistance), and best avoided by proper planning before data collection begins.

All steps of sampling, including justification for stratification, cluster sizes, probabilities of selection, weights at each stage of selection, and the computer program used for randomization must be communicated to WHO

STRATIFICATION

Stratification is the process by which the population is divided into subgroups. Sampling will then be conducted separately in each subgroup. Strata or subgroups are chosen because evidence is available that they are related to the outcome (e.g. health, responsiveness, mortality, coverage etc.). The strata chosen will vary by country and reflect local conditions. Some examples of factors that can be stratified on are geography (e.g. North, Central, South), level of urbanization (e.g. urban, rural), socio-economic zones, provinces (especially if health administration is primarily under the jurisdiction of provincial authorities), or presence of health facility in area. Strata to be used must be identified by each country and the reasons for selection explicitly justified.

Stratification is strongly recommended at the first stage of sampling. Once the strata have been chosen and justified, all stages of selection will be conducted separately in each stratum. We recommend stratifying on 3-5 factors. It is optimum to have half as many strata (note the difference between stratifying variables, which may be such variables as gender, socio-economic status, province/region etc. and strata, which are the combination of variable categories, for example Male, High socio-economic status, Xingtao Province would be a stratum).

Strata should be as homogenous as possible within and as heterogeneous as possible between. This means that strata should be formulated in such a way that individuals belonging to a stratum should be as similar to each other with respect to key variables as possible and as different as possible from individuals belonging to a different stratum. This maximises the efficiency of stratification in reducing sampling variance.

MULTI-STAGE CLUSTER SELECTION

A cluster is a naturally occurring unit or grouping within the population (e.g. enumeration areas, cities, universities, provinces, hospitals etc.); it is a unit for which the administrative level has clear, nonoverlapping boundaries. Cluster sampling is useful because it avoids having to compile exhaustive lists of every single person in the population. Clusters should be as heterogeneous as possible within and as homogenous as possible between (note that this is the opposite criterion as that for strata). Clusters should be as small as possible (i.e. large administrative units such as Provinces or States are not good clusters) but not so small as to be homogenous.

In cluster sampling, a number of clusters are randomly selected from a list of clusters. Then, either all members of the chosen cluster or a random selection from among them are included in the sample. Multistage sampling is an extension of cluster sampling where a hierarchy of clusters are chosen going from larger to smaller.

In order to carry out multi-stage sampling, one needs to know only the population sizes of the sampling units. For the smallest sampling unit above the elementary unit however, a complete list of all elementary units (households) is needed; in order to be able to randomly select among all households in the TSU, a list of all those households is required. This information may be available from the most recent population census. If the last census was >3 years ago or the information furnished by it was of poor quality or unreliable, the survey staff will have the task of enumerating all households in the smallest randomly selected sampling unit. It is very important to budget for this step if it is necessary and ensure that all households are properly enumerated in order that a representative sample is obtained.

It is always best to have as many clusters in the PSU as possible. The reason for this is that the fewer the number of respondents in each PSU, the lower will be the clustering effect which
i
Population and Family Health Survey 1997 - Jordan
catalog.ihsn.org
datacatalog.ihsn.org
+1more
Updated Mar 29, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Statistics (DOS) (2019). Population and Family Health Survey 1997 - Jordan [Dataset]. http://catalog.ihsn.org/catalog/182
Explore at:
Dataset updated
Mar 29, 2019
Dataset authored and provided by
Department of Statistics (DOS)
Time period covered
1997
Area covered
Jordan
Description
Abstract

The 1997 Jordan Population and Family Health Survey (JPFHS) is a national sample survey carried out by the Department of Statistics (DOS) as part of its National Household Surveys Program (NHSP). The JPFHS was specifically aimed at providing information on fertility, family planning, and infant and child mortality. Information was also gathered on breastfeeding, on maternal and child health care and nutritional status, and on the characteristics of households and household members. The survey will provide policymakers and planners with important information for use in formulating informed programs and policies on reproductive behavior and health.

Geographic coverage

National

Analysis unit

Household

Children under five years

Women age 15-49

Men

Kind of data

Sample survey data

Sampling procedure

SAMPLE DESIGN AND IMPLEMENTATION

The 1997 JPFHS sample was designed to produce reliable estimates of major survey variables for the country as a whole, for urban and rural areas, for the three regions (each composed of a group of governorates), and for the three major governorates, Amman, Irbid, and Zarqa.

The 1997 JPFHS sample is a subsample of the master sample that was designed using the frame obtained from the 1994 Population and Housing Census. A two-stage sampling procedure was employed. First, primary sampling units (PSUs) were selected with probability proportional to the number of housing units in the PSU. A total of 300 PSUs were selected at this stage. In the second stage, in each selected PSU, occupied housing units were selected with probability inversely proportional to the number of housing units in the PSU. This design maintains a self-weighted sampling fraction within each governorate.

UPDATING OF SAMPLING FRAME

Prior to the main fieldwork, mapping operations were carried out and the sample units/blocks were selected and then identified and located in the field. The selected blocks were delineated and the outer boundaries were demarcated with special signs. During this process, the numbers on buildings and housing units were updated, listed and documented, along with the name of the owner/tenant of the unit or household and the name of the household head. These activities took place between January 7 and February 28, 1997.

Note: See detailed description of sample design in APPENDIX A of the survey report.

Mode of data collection

Face-to-face

Research instrument

The 1997 JPFHS used two questionnaires, one for the household interview and the other for eligible women. Both questionnaires were developed in English and then translated into Arabic. The household questionnaire was used to list all members of the sampled households, including usual residents as well as visitors. For each member of the household, basic demographic and social characteristics were recorded and women eligible for the individual interview were identified. The individual questionnaire was developed utilizing the experience gained from previous surveys, in particular the 1983 and 1990 Jordan Fertility and Family Health Surveys (JFFHS).

The 1997 JPFHS individual questionnaire consists of 10 sections: - Respondent’s background - Marriage - Reproduction (birth history) - Contraception - Pregnancy, breastfeeding, health and immunization - Fertility preferences - Husband’s background, woman’s work and residence - Knowledge of AIDS - Maternal mortality - Height and weight of children and mothers.

Cleaning operations

Fieldwork and data processing activities overlapped. After a week of data collection, and after field editing of questionnaires for completeness and consistency, the questionnaires for each cluster were packaged together and sent to the central office in Amman where they were registered and stored. Special teams were formed to carry out office editing and coding.

Data entry started after a week of office data processing. The process of data entry, editing, and cleaning was done by means of the ISSA (Integrated System for Survey Analysis) program DHS has developed especially for such surveys. The ISSA program allows data to be edited while being entered. Data entry was completed on November 14, 1997. A data processing specialist from Macro made a trip to Jordan in November and December 1997 to identify problems in data entry, editing, and cleaning, and to work on tabulations for both the preliminary and final report.

Response rate

A total of 7,924 occupied housing units were selected for the survey; from among those, 7,592 households were found. Of the occupied households, 7,335 (97 percent) were successfully interviewed. In those households, 5,765 eligible women were identified, and complete interviews were obtained with 5,548 of them (96 percent of all eligible women). Thus, the overall response rate of the 1997 JPFHS was 93 percent. The principal reason for nonresponse among the women was the failure of interviewers to find them at home despite repeated callbacks.

Note: See summarized response rates by place of residence in Table 1.1 of the survey report.

Sampling error estimates

The estimates from a sample survey are subject to two types of errors: nonsampling errors and sampling errors. Nonsampling errors are the result of mistakes made in implementing data collection and data processing (such as failure to locate and interview the correct household, misunderstanding questions either by the interviewer or the respondent, and data entry errors). Although during the implementation of the 1997 JPFHS numerous efforts were made to minimize this type of error, nonsampling errors are not only impossible to avoid but also difficult to evaluate statistically.

Sampling errors, on the other hand, can be evaluated statistically. The respondents selected in the 1997 JPFHS constitute only one of many samples that could have been selected from the same population, given the same design and expected size. Each of those samples would have yielded results differing somewhat from the results of the sample actually selected. Sampling errors are a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.

A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.

If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, since the 1997 JDHS-II sample resulted from a multistage stratified design, formulae of higher complexity had to be used. The computer software used to calculate sampling errors for the 1997 JDHS-II was the ISSA Sampling Error Module, which uses the Taylor linearization method of variance estimation for survey estimates that are means or proportions. The Jackknife repeated replication method is used for variance estimation of more complex statistics, such as fertility and mortality rates.

Note: See detailed estimate of sampling error calculation in APPENDIX B of the survey report.

Data appraisal

Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Completeness of reporting - Births by calendar years - Reporting of age at death in days - Reporting of age at death in months

Note: See detailed tables in APPENDIX C of the survey report.
n
Census Microdata Samples Project
neuinfo.org
dknet.org
+2more
Updated Jan 29, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). Census Microdata Samples Project [Dataset]. http://identifiers.org/RRID:SCR_008902
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_008902
Dataset updated
Jan 29, 2022
Description
A data set of cross-nationally comparable microdata samples for 15 Economic Commission for Europe (ECE) countries (Bulgaria, Canada, Czech Republic, Estonia, Finland, Hungary, Italy, Latvia, Lithuania, Romania, Russia, Switzerland, Turkey, UK, USA) based on the 1990 national population and housing censuses in countries of Europe and North America to study the social and economic conditions of older persons. These samples have been designed to allow research on a wide range of issues related to aging, as well as on other social phenomena. A common set of nomenclatures and classifications, derived on the basis of a study of census data comparability in Europe and North America, was adopted as a standard for recoding. This series was formerly called Dynamics of Population Aging in ECE Countries. The recommendations regarding the design and size of the samples drawn from the 1990 round of censuses envisaged: (1) drawing individual-based samples of about one million persons; (2) progressive oversampling with age in order to ensure sufficient representation of various categories of older people; and (3) retaining information on all persons co-residing in the sampled individual''''s dwelling unit. Estonia, Latvia and Lithuania provided the entire population over age 50, while Finland sampled it with progressive over-sampling. Canada, Italy, Russia, Turkey, UK, and the US provided samples that had not been drawn specially for this project, and cover the entire population without over-sampling. Given its wide user base, the US 1990 PUMS was not recoded. Instead, PAU offers mapping modules, which recode the PUMS variables into the project''''s classifications, nomenclatures, and coding schemes. Because of the high sampling density, these data cover various small groups of older people; contain as much geographic detail as possible under each country''''s confidentiality requirements; include more extensive information on housing conditions than many other data sources; and provide information for a number of countries whose data were not accessible until recently. Data Availability: Eight of the fifteen participating countries have signed the standard data release agreement making their data available through NACDA/ICPSR (see links below). Hungary and Switzerland require a clearance to be obtained from their national statistical offices for the use of microdata, however the documents signed between the PAU and these countries include clauses stipulating that, in general, all scholars interested in social research will be granted access. Russia requested that certain provisions for archiving the microdata samples be removed from its data release arrangement. The PAU has an agreement with several British scholars to facilitate access to the 1991 UK data through collaborative arrangements. Statistics Canada and the Italian Institute of statistics (ISTAT) provide access to data from Canada and Italy, respectively. * Dates of Study: 1989-1992 * Study Features: International, Minority Oversamples * Sample Size: Approx. 1 million/country Links: * Bulgaria (1992), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/02200 * Czech Republic (1991), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06857 * Estonia (1989), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06780 * Finland (1990), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06797 * Romania (1992), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06900 * Latvia (1989), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/02572 * Lithuania (1989), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/03952 * Turkey (1990), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/03292 * U.S. (1990), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06219
Data from: RESEARCH METHODOLOGY FOR NOVELTY TECHNOLOGY
scielo.figshare.com
search.datacite.org
jpeg
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
P.C. Lai (2023). RESEARCH METHODOLOGY FOR NOVELTY TECHNOLOGY [Dataset]. http://doi.org/10.6084/m9.figshare.7482734.v1
Explore at:
jpegAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7482734.v1
Dataset updated
May 31, 2023
Dataset provided by
SciELOhttp://www.scielo.org/
Authors
P.C. Lai
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Abstract This paper contributes to the existing literature by reviewing the research methodology and the literature review with the focus on potential applications for the novelty technology of the single platform E-payment. These included, but were not restricted to the subjects, population, sample size requirement, data collection method and measurement of variables, pilot study and statistical techniques for data analysis. The reviews will shed some light and potential applications for future researchers, students and others to conceptualize, operationalize and analyze the underlying research methodology to assist in the development of their research methodology.
C
China Population Statistics: Sample Survey: Sampling Fraction
ceicdata.com
Updated Oct 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com (2025). China Population Statistics: Sample Survey: Sampling Fraction [Dataset]. https://www.ceicdata.com/en/china/population-sample-survey-level-of-education/population-statistics-sample-survey-sampling-fraction
Explore at:
Dataset updated
Oct 15, 2025
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Dec 1, 2012 - Dec 1, 2023
Area covered
China
Variables measured
Population
Description
China Population Statistics: Sample Survey: Sampling Fraction data was reported at 0.105 % in 2023. This records an increase from the previous number of 0.102 % for 2022. China Population Statistics: Sample Survey: Sampling Fraction data is updated yearly, averaging 0.100 % from Dec 1982 (Median) to 2023, with 37 observations. The data reached an all-time high of 100.000 % in 2020 and a record low of 0.063 % in 1994. China Population Statistics: Sample Survey: Sampling Fraction data remains active status in CEIC and is reported by National Bureau of Statistics. The data is categorized under China Premium Database’s Socio-Demographic – Table CN.GA: Population: Sample Survey: Level of Education.
d
Current Population Survey (CPS)
search.dataone.org
dataverse.harvard.edu
Updated Nov 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Damico, Anthony (2023). Current Population Survey (CPS) [Dataset]. http://doi.org/10.7910/DVN/AK4FDD
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/AK4FDD
Dataset updated
Nov 21, 2023
Dataset provided by
Harvard Dataverse
Authors
Damico, Anthony
Description
analyze the current population survey (cps) annual social and economic supplement (asec) with r the annual march cps-asec has been supplying the statistics for the census bureau's report on income, poverty, and health insurance coverage since 1948. wow. the us census bureau and the bureau of labor statistics ( bls) tag-team on this one. until the american community survey (acs) hit the scene in the early aughts (2000s), the current population survey had the largest sample size of all the annual general demographic data sets outside of the decennial census - about two hundred thousand respondents. this provides enough sample to conduct state- and a few large metro area-level analyses. your sample size will vanish if you start investigating subgroups b y state - consider pooling multiple years. county-level is a no-no. despite the american community survey's larger size, the cps-asec contains many more variables related to employment, sources of income, and insurance - and can be trended back to harry truman's presidency. aside from questions specifically asked about an annual experience (like income), many of the questions in this march data set should be t reated as point-in-time statistics. cps-asec generalizes to the united states non-institutional, non-active duty military population. the national bureau of economic research (nber) provides sas, spss, and stata importation scripts to create a rectangular file (rectangular data means only person-level records; household- and family-level information gets attached to each person). to import these files into r, the parse.SAScii function uses nber's sas code to determine how to import the fixed-width file, then RSQLite to put everything into a schnazzy database. you can try reading through the nber march 2012 sas importation code yourself, but it's a bit of a proc freak show. this new github repository contains three scripts: 2005-2012 asec - download all microdata.R down load the fixed-width file containing household, family, and person records import by separating this file into three tables, then merge 'em together at the person-level download the fixed-width file containing the person-level replicate weights merge the rectangular person-level file with the replicate weights, then store it in a sql database create a new variable - one - in the data table 2012 asec - analysis examples.R connect to the sql database created by the 'download all microdata' progr am create the complex sample survey object, using the replicate weights perform a boatload of analysis examples replicate census estimates - 2011.R connect to the sql database created by the 'download all microdata' program create the complex sample survey object, using the replicate weights match the sas output shown in the png file below 2011 asec replicate weight sas output.png statistic and standard error generated from the replicate-weighted example sas script contained in this census-provided person replicate weights usage instructions document. click here to view these three scripts for more detail about the current population survey - annual social and economic supplement (cps-asec), visit: the census bureau's current population survey page the bureau of labor statistics' current population survey page the current population survey's wikipedia article notes: interviews are conducted in march about experiences during the previous year. the file labeled 2012 includes information (income, work experience, health insurance) pertaining to 2011. when you use the current populat ion survey to talk about america, subract a year from the data file name. as of the 2010 file (the interview focusing on america during 2009), the cps-asec contains exciting new medical out-of-pocket spending variables most useful for supplemental (medical spending-adjusted) poverty research. confidential to sas, spss, stata, sudaan users: why are you still rubbing two sticks together after we've invented the butane lighter? time to transition to r. :D
N
Combined Locks, WI Annual Population and Growth Analysis Dataset: A...
neilsberg.com
csv, json
Updated Jul 30, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2024). Combined Locks, WI Annual Population and Growth Analysis Dataset: A Comprehensive Overview of Population Changes and Yearly Growth Rates in Combined Locks from 2000 to 2023 // 2024 Edition [Dataset]. https://www.neilsberg.com/insights/combined-locks-wi-population-by-year/
Explore at:
json, csvAvailable download formats
Dataset updated
Jul 30, 2024
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Combined Locks, Wisconsin
Variables measured
Annual Population Growth Rate, Population Between 2000 and 2023, Annual Population Growth Rate Percent
Measurement technique
The data presented in this dataset is derived from the 20 years data of U.S. Census Bureau Population Estimates Program (PEP) 2000 - 2023. To measure the variables, namely (a) population and (b) population change in ( absolute and as a percentage ), we initially analyzed and tabulated the data for each of the years between 2000 and 2023. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the Combined Locks population over the last 20 plus years. It lists the population for each year, along with the year on year change in population, as well as the change in percentage terms for each year. The dataset can be utilized to understand the population change of Combined Locks across the last two decades. For example, using this dataset, we can identify if the population is declining or increasing. If there is a change, when the population peaked, or if it is still growing and has not reached its peak. We can also compare the trend with the overall trend of United States population over the same period of time.

Key observations

In 2023, the population of Combined Locks was 3,654, a 0.11% decrease year-by-year from 2022. Previously, in 2022, Combined Locks population was 3,658, an increase of 0.83% compared to a population of 3,628 in 2021. Over the last 20 plus years, between 2000 and 2023, population of Combined Locks increased by 1,198. In this period, the peak population was 3,658 in the year 2022. The numbers suggest that the population has already reached its peak and is showing a trend of decline. Source: U.S. Census Bureau Population Estimates Program (PEP).

Content

When available, the data consists of estimates from the U.S. Census Bureau Population Estimates Program (PEP).

Data Coverage:

From 2000 to 2023

Variables / Data Columns

Year: This column displays the data year (Measured annually and for years 2000 to 2023)

Population: The population for the specific year for the Combined Locks is shown in this column.

Year on Year Change: This column displays the change in Combined Locks population for each year compared to the previous year.

Change in Percent: This column displays the year on year change as a percentage. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Combined Locks Population by Year. You can refer the same here
'Dataset2' - Who Tweets with Their Location? Understanding the Relationship...
figshare.com
datasetcatalog.nlm.nih.gov
zip
Updated Jan 20, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Luke Sloan (2016). 'Dataset2' - Who Tweets with Their Location? Understanding the Relationship Between Demographic Characteristics and the Use of Geoservices and Geotagging on Twitter [Dataset]. http://doi.org/10.6084/m9.figshare.1572292.v3
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.1572292.v3
Dataset updated
Jan 20, 2016
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Luke Sloan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
'Dataset2' associated with: Who Tweets with Their Location? Understanding the Relationship Between Demographic Characteristics and the Use of Geoservices and Geotagging on Twitter

Luke Sloan and Jeffrey Morgan.
U.S. population data for human identification markers
catalog.data.gov
Updated Jun 7, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Institute of Standards and Technology (2023). U.S. population data for human identification markers [Dataset]. https://catalog.data.gov/dataset/u-s-population-data-for-human-identification-markers
Explore at:
Dataset updated
Jun 7, 2023
Dataset provided by
National Institute of Standards and Technologyhttp://www.nist.gov/
Area covered
United States
Description
The primary data consist of allele or haplotype frequencies for N=1036 anonymized U.S. population samples. Additional files are supplements to the associated publications. Any changes to spreadsheets are listed in the "Change Log" tab within each spreadsheet. DOI numbers for associated publications are listed below, under "References".
f
Descriptive statistics for the healthy population sample (N = 40).
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Dec 12, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Borkelmans, Karel W. H.; Verhagen, Simone J. W.; Bartels, Sara Laureen; Delespaul, Philippe A. E. G.; Daniëls, Naomi E. M.; Tans, Sulina; de Vugt, Marjolein E. (2019). Descriptive statistics for the healthy population sample (N = 40). [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000105410
Explore at:
Dataset updated
Dec 12, 2019
Authors
Borkelmans, Karel W. H.; Verhagen, Simone J. W.; Bartels, Sara Laureen; Delespaul, Philippe A. E. G.; Daniëls, Naomi E. M.; Tans, Sulina; de Vugt, Marjolein E.
Description
Descriptive statistics for the healthy population sample (N = 40).
N
Lebanon, KS Annual Population and Growth Analysis Dataset: A Comprehensive...
neilsberg.com
csv, json
Updated Jul 30, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2024). Lebanon, KS Annual Population and Growth Analysis Dataset: A Comprehensive Overview of Population Changes and Yearly Growth Rates in Lebanon from 2000 to 2023 // 2024 Edition [Dataset]. https://www.neilsberg.com/insights/lebanon-ks-population-by-year/
Explore at:
csv, jsonAvailable download formats
Dataset updated
Jul 30, 2024
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Kansas, Lebanon
Variables measured
Annual Population Growth Rate, Population Between 2000 and 2023, Annual Population Growth Rate Percent
Measurement technique
The data presented in this dataset is derived from the 20 years data of U.S. Census Bureau Population Estimates Program (PEP) 2000 - 2023. To measure the variables, namely (a) population and (b) population change in ( absolute and as a percentage ), we initially analyzed and tabulated the data for each of the years between 2000 and 2023. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the Lebanon population over the last 20 plus years. It lists the population for each year, along with the year on year change in population, as well as the change in percentage terms for each year. The dataset can be utilized to understand the population change of Lebanon across the last two decades. For example, using this dataset, we can identify if the population is declining or increasing. If there is a change, when the population peaked, or if it is still growing and has not reached its peak. We can also compare the trend with the overall trend of United States population over the same period of time.

Key observations

In 2023, the population of Lebanon was 182, a 0.55% increase year-by-year from 2022. Previously, in 2022, Lebanon population was 181, a decline of 0% compared to a population of 181 in 2021. Over the last 20 plus years, between 2000 and 2023, population of Lebanon decreased by 120. In this period, the peak population was 302 in the year 2000. The numbers suggest that the population has already reached its peak and is showing a trend of decline. Source: U.S. Census Bureau Population Estimates Program (PEP).

Content

When available, the data consists of estimates from the U.S. Census Bureau Population Estimates Program (PEP).

Data Coverage:

From 2000 to 2023

Variables / Data Columns

Year: This column displays the data year (Measured annually and for years 2000 to 2023)

Population: The population for the specific year for the Lebanon is shown in this column.

Year on Year Change: This column displays the change in Lebanon population for each year compared to the previous year.

Change in Percent: This column displays the year on year change as a percentage. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Lebanon Population by Year. You can refer the same here
C
China Population: Resided more than Half Year: Floating
ceicdata.com
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com, China Population: Resided more than Half Year: Floating [Dataset]. https://www.ceicdata.com/en/china/population-sample-survey/population-resided-more-than-half-year-floating
Explore at:
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Dec 1, 2010 - Dec 1, 2021
Area covered
China
Variables measured
Population
Description
China Population: Resided more than Half Year: Floating data was reported at 384,670.000 Person th in 2021. This records an increase from the previous number of 375,816.759 Person th for 2020. China Population: Resided more than Half Year: Floating data is updated yearly, averaging 245,000.000 Person th from Dec 1982 (Median) to 2021, with 16 observations. The data reached an all-time high of 384,670.000 Person th in 2021 and a record low of 6,709.164 Person th in 1982. China Population: Resided more than Half Year: Floating data remains active status in CEIC and is reported by National Bureau of Statistics. The data is categorized under China Premium Database’s Socio-Demographic – Table CN.GA: Population.
Namibia Population and Housing Census 2011 - Namibia
microdata.nsanamibia.com
Updated Sep 30, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Namibia Statistics Agency (2024). Namibia Population and Housing Census 2011 - Namibia [Dataset]. https://microdata.nsanamibia.com/index.php/catalog/9
Explore at:
Dataset updated
Sep 30, 2024
Dataset authored and provided by
Namibia Statistics Agencyhttps://nsa.org.na/
Time period covered
2011
Area covered
Namibia
Description
Abstract

The 2011 Population and Housing Census is the third national Census to be conducted in Namibia after independence. The first was conducted 1991 followed by the 2001 Census. Namibia is therefore one of the countries in sub-Saharan Africa that has participated in the 2010 Round of Censuses and followed the international best practice of conducting decennial Censuses, each of which attempts to count and enumerate every person and household in a country every ten years. Surveys, by contrast, collect data from samples of people and/or households.

Censuses provide reliable and critical data on the socio-economic and demographic status of any country. In Namibia, Census data has provided crucial information for development planning and programme implementation. Specifically, the information has assisted in setting benchmarks, formulating policy and the evaluation and monitoring of national development programmes including NDP4, Vision 2030 and several sector programmes. The information has also been used to update the national sampling frame which is used to select samples for household-based surveys, including labour force surveys, demographic and health surveys, household income and expenditure surveys. In addition, Census information will be used to guide the demarcation of Namibia's administrative boundaries where necessary.

At the international level, Census information has been used extensively in monitoring progress towards Namibia's achievement of international targets, particularly the Millennium Development Goals (MDGs).

The latest and most comprehensive Census was conducted in August 2011. Preparations for the Census started in the 2007/2008 financial year under the auspices of the then Central Bureau of Statistics (CBS) which was later transformed into the Namibia Statistics Agency (NSA). The NSA was established under the Statistics Act No. 9 of 2011, with the legal mandate and authority to conduct population Censuses every 10 years. The Census was implemented in three broad phases; pre-enumeration, enumeration and post enumeration.

During the first pre-enumeration phase, activities accomplished including the preparation of a project document, establishing Census management and technical committees, and establishing the Census cartography unit which demarcated the Enumeration Areas (EAs). Other activities included the development of Census instruments and tools, such as the questionnaires, manuals and field control forms.

Field staff were recruited, trained and deployed during the initial stages of the enumeration phase. The actual enumeration exercise was undertaken over a period of about three weeks from 28 August to 15 September 2011, while 28 August 2011 was marked as the reference period or 'Census Day'.

Great efforts were made to check and ensure that the Census data was of high quality to enhance its credibility and increase its usage. Various quality controls were implemented to ensure relevance, timeliness, accuracy, coherence and proper data interpretation. Other activities undertaken to enhance quality included the demarcation of the country into small enumeration areas to ensure comprehensive coverage; the development of structured Census questionnaires after consultat.The post-enumeration phase started with the sending of completed questionnaires to Head Office and the preparation of summaries for the preliminary report, which was published in April 2012. Processing of the Census data began with manual editing and coding, which focused on the household identification section and un-coded parts of the questionnaire. This was followed by the capturing of data through scanning. Finally, the data were verified and errors corrected where necessary. This took longer than planned due to inadequate technical skills.

Geographic coverage

National coverage

Analysis unit

Households and persons

Universe

The sampling universe is defined as all households (private and institutions) from 2011 Census dataset.

Kind of data

Census/enumeration data [cen]

Sampling procedure

Sample Design

The stratified random sample was applied on the constituency and urban/rural variables of households list from Namibia 2011 Population and Housing Census for the Public Use Microdata Sample (PUMS) file. The sampling universe is defined as all households (private and institutions) from 2011 Census dataset. Since urban and rural are very important factor in the Namibia situation, it was then decided to take the stratum at the constituency and urban/rural levels. Some constituencies have very lower households in the urban or rural, the office therefore decided for a threshold (low boundary) for sampling within stratum. Based on data analysis, the threshold for stratum of PUMS file is 250 households. Thus, constituency and urban/rural areas with less than 250 households in total were included in the PUMS file. Otherwise, a simple random sampling (SRS) at a 20% sample rate was applied for each stratum. The sampled households include 93,674 housing units and 418,362 people.

Sample Selection

The PUMS sample is selected from households. The PUMS sample of persons in households is selected by keeping all persons in PUMS households. Sample selection process is performed using Census and Survey Processing System (CSPro).

The sample selection program first identifies the 7 census strata with less than 250 households and the households (private and institutions) with more than 50 people. The households in these areas and with this large size are all included in the sample. For the other households, the program randomly generates a number n from 0 to 4. Out of every 5 households, the program selects the nth household to export to the PUMS data file, creating a 20 percent sample of households. Private households and institutions are equally sampled in the PUMS data file.

Note: The 7 census strata with less than 250 households are: Arandis Constituency Rural, Rehoboth East Urban Constituency Rural, Walvis Bay Rural Constituency Rural, Mpungu Constituency Urban, Etayi Constituency Urban, Kalahari Constituency Urban, and Ondobe Constituency Urban.

Mode of data collection

Face-to-face [f2f]

Research instrument

The following questionnaire instruments were used for the Namibia 2011 Population and and Housing Census:

Form A (Long Form): For conventional households and residential institutions

Form B1 (Short Form): For special population groups such as persons in transit (travellers), police cells, homeless and off-shore populations

Form B2 (Short Form): For hotels/guesthouses

Form B3 (Short Form): For foreign missions/diplomatic corps

Cleaning operations

Data editing took place at a number of stages throughout the processing, including: a) During data collection in the field b) Manual editing and coding in the office c) During data entry (Primary validation/editing) Structure checking and completeness using Structured Query Language (SQL) program d) Secondary editing: i. Imputations of variables ii. Structural checking in Census and Survey Processing System (CSPro) program

Sampling error estimates

Sampling Error The standard errors of survey estimates are needed to evaluate the precision of the survey estimation. The statistical software package such as SPSS or SAS can accurately estimate the mean and variance of estimates from the survey. SPSS or SAS software package makes use of the Taylor series approach in computing the variance.

Data appraisal

Data quality Great efforts were made to check and ensure that the Census data was of high quality to enhance its credibility and increase its usage. Various quality controls were implemented to ensure relevance, timeliness, accuracy, coherence and proper data interpretation. Other activities undertaken to enhance quality included the demarcation of the country into small enumeration areas to ensure comprehensive coverage; the development of structured Census questionnaires after consultation with government ministries, university expertise and international partners; the preparation of detailed supervisors' and enumerators' instruction manuals to guide field staff during enumeration; the undertaking of comprehensive publicity and advocacy programmes to ensure full Government support and cooperation from the general public; the testing of questionnaires and other procedures; the provision of adequate training and undertaking of intensive supervision using four supervisory layers; the editing of questionnaires at field level; establishing proper mechanisms which ensured that all completed questionnaires were properly accounted for; ensuring intensive verification, validating all information and error corrections; and developing capacity in data processing with support from the international community.
i
Demographic and Health Survey 1993 - Kenya
datacatalog.ihsn.org
catalog.ihsn.org
+1more
Updated Jul 6, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Central Bureau of Statistics (CBS) (2017). Demographic and Health Survey 1993 - Kenya [Dataset]. https://datacatalog.ihsn.org/catalog/2434
Explore at:
Dataset updated
Jul 6, 2017
Dataset provided by
Central Bureau of Statistics (CBS)
National Council for Population Development (NCPD)
Time period covered
1993
Area covered
Kenya
Description
Abstract

The 1993 Kenya Demographic and Health Survey (KDHS) was a nationally representative survey of 7,540 women age 15-49 and 2,336 men age 20-54. The KDHS was designed to provide information on levels and trends of fertility, infant and child mortality, family planning knowledge and use, maternal and child health, and knowledge of AIDS. In addition, the male survey obtained data on men's knowledge and attitudes towards family planning and awareness of AIDS. The data are intended for use by programme managers and policymakers to evaluate and improve family planning and matemal and child health programmes. Fieldwork for the KDHS took place from mid-February until mid-August 1993. All areas of Kenya were covered by the survey, except for seven northem districts which together contain less than four percent of the country's population.

The KDHS was conducted by the National Council for Population and Development (NCPD) and the Central Bureau of Statistics of the Government of Kenya. Macro International Inc. provided financial and technical assistance to the project through the intemational Demographic and Health Surveys (DHS) contract with the U.S. Agency for International Development.

OBJECTIVES

The KDHS is intended to serve as a source of population and health data for policymakers and the research community. It was designed as a follow-on to the 1989 KDHS, a national-level survey of similar size that was implemented by the same organisations. In general, the objectives of KDHS are to: - assess the overall demographic situation in Kenya, - assist in the evaluation of the population and health programmes in Kenya, - advance survey methodology, and - assist the NCPD to strengthen and improve its technical skills to conduct demographic and health surveys.

The KDHS was specifically designed to: - provide data on the family planning and fertility behaviour of the Kenyan population to enable the NCPD to evaluate and enhance the National Family Planning Programme, - measure changes in fertility and contraceptive prevalence and at the same time study the factors which affect these changes, such as marriage patterns, urban/rural residence, availability of contraception, breastfeeding habits and other socioeconomic factors, and - examine the basic indicators of maternal and child health in Kenya.

KEY FINDINGS

The 1993 KDHS reinforces evidence of a major decline in fertility which was first revealed by the findings of the 1989 KDHS. Fertility continues to decline and family planning use has increased. However, the disparity between knowledge and use of family planning remains quite wide. There are indications that infant and under five child mortality rates are increasing, which in part might be attributed to the increase in AIDS prevalence.

Geographic coverage

The 1993 KDHS sample is national in scope, with the exclusion of all three districts in North Eastern Province and four other northern districts (Samburu and Turkana in Rift Valley Province and Isiolo and 4 Marsabit in Eastern Province). Together the excluded areas account for less than 4 percent of Kenya's population.

Analysis unit

Household

Women age 15-49

Men age 20-54

Children under five

Universe

The population covered by the 1993 KDHS is defined as the universe of all women age 15-49 in Kenya and all husband age 20-54 living in the household.

Kind of data

Sample survey data

Sampling procedure

The sample for the 1993 KDHS was national in scope, with the exclusion of all three districts in Northeastern Province and four other northern districts (Isiolo and Marsabit from Eastern Province and Samburu and Turkana from Rift Valley Province). Together the excluded areas account for less than four percent of Kenya's population. The KDHS sample points were selected from a national master sample maintained by the Central Bureau of Statistics, the third National Sample Survey and Evaluation Programme (NASSEP-3), which is an improved version of NASSEP2 used in the 1989 survey. This master sample follows a two-stage design, stratified by urban-rural residence, and within the rural stratum, by individual district. In the first stage, 1989 census enumeration areas (EAs) were selected with probability proportional to size. The selected EAs were segmented into the expected number of standard-sized clusters to form NASSEP clusters. The entire master sample consists of 1,048 rural and 325 urban ~ sample points ("clusters"). A total of 536 clusters---92 urban and 444 rural--were selected for coverage in the KDHS. Of these, 520 were successfully covered. Sixteen clusters were inaccessible for various reasons.

As in the 1989 KDHS, selected districts were oversampled in the 1993 survey in order to produce more reliable estimates for certain variables at the district level. Fifteen districts were thus targetted in the 1993 KDHS: Bungoma, Kakamega, Kericho, Kilifi, Kisii, Machakos, Meru, Murang'a, Nakuru, Nandi, Nyeri, Siaya, South Nyanza, Taita-Taveta, and Uasin Gishu; in addition, Nairobi and Mombasa were also targetted. Although six of these districts were subdivided shortly before the sample design was finalised) the previous boundaries of these districts were used for the KDHS in order to maintain comparability with the 1989 survey. About 400 rural households were selected in each of these 15 districts, just over 1000 rural households in other districts, and about 18130 households in urban areas, for a total of almost 9,000 households. Due to this oversampling, the KDHS sample is not self-weighting at the national level.

After the selection of the KDHS sample points, fieldstaff from the Central Bureau of Statistics conducted a household listing operation in January and early February 1993, immediately prior to the launching of the fieldwork. A systematic sample of households was then selected from these lists, with an average "take" of 20 households in the urban clusters and 16 households in rural clusters, for a total of 8,864 households selected. Every other household was identified as selected for the male survey, meaning that, in addition to interviewing all women age 15-49, interviewers were to also interview all men age 20-54. It was expected that the sample would yield interviews with approximately 8,000 women age 15-49 and 2,500 men age 20-54.

Mode of data collection

Face-to-face

Research instrument

Four types of questionnaires were used for the KDHS: a Household Questionnaire, a Woman's Questionnaire, a Man's Questionnaire and a Services Availability Questionnaire. The contents of these questionnaires were based on the DHS Model B Questionnaire, which is designed for use in countries with low levels of contraceptive use. Additions and modifications to the model questionnaires were made during a series of meetings organised around specific topics or sections of the questionnaires (e.g., fertility, family planning). The NCPD invited staff from a variety of organisations to attend these meetings, including the Population Studies Research Institute and other departments of the University of Nairobi, the Woman's Bureau, and various units of the Ministry of Health. The questionnaires were developed in English and then translated into and printed in Kiswahili and eight of the most widely spoken local languages in Kenya (Kalenjin, Kamba, Kikuyu, Kisii, Luhya, Luo, Meru, and Mijikenda).

a) The Household Questionnaire was used to list all the usual members and visitors of selected households. Some basic information was collected on the characteristics of each person listed, including his/her age, sex, education, and relationship to the head of the household. The main purpose of the Household Questionnaire was to identify women and men who were eligible for individual interview. In addition, information was collected about the dwelling itself, such as the source of water, type of toilet facilities, materials used to construct the house, and ownership of various consumer goods.

b) The Woman's Questionnaire was used to collect information from women aged 15-49. These women were asked questions on the following topics: Background characteristics (age, education, religion, etc.), Reproductive history, Knowledge and use of family planning methods, Antenatal and delivery care, Breastfeeding and weaning practices, Vaccinations and health of children under age five, Marriage, Fertility preferences, Husband's background and respondent's work, Awareness of AIDS. In addition, interviewing teams measured the height and weight of children under age five (identified through the birth histories) and their mothers.

c) Information from a subsample of men aged 20-54 was collected using a Man's Questionnaire. Men were asked about their background characteristics, knowledge and use of family planning methods, marriage, fertility preferences, and awareness of AIDS.

d) The Services Availability Questionnaire was used to collect information on the health and family planning services obtained within the cluster areas. One service availability questionnaire was to be completed in each cluster.

Cleaning operations

All questionnaires for the KDHS were returned to the NCPD headquarters for data processing. The processing operation consisted of office editing, coding of open-ended questions, data entry, and editing errors found by the computer programs. One NCPD officer, one data processing supervisor, one questionnaire administrator, two office editors, and initially four data entry operators were responsible for the data processing operation. Due to attrition and the need to speed up data processing, another four data entry operators were later hired
g
GESIS Panel.pop Population Sample – Extended Edition
search.gesis.org
datacatalogue.cessda.eu
Updated Oct 31, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GESIS Panel Team (2025). GESIS Panel.pop Population Sample – Extended Edition [Dataset]. http://doi.org/10.4232/1.14586
Explore at:
(96470)Available download formats
Unique identifier
https://doi.org/10.4232/1.14586
Dataset updated
Oct 31, 2025
Dataset provided by
GESIS
GESIS search
Authors
GESIS Panel Team
License
https://www.gesis.org/en/institute/data-usage-termshttps://www.gesis.org/en/institute/data-usage-terms
Time period covered
May 2, 2013 - Jul 15, 2025
Description
The GESIS Panel provides a probability-based mixed-mode access panel infrastructure located at GESIS Leibniz Institute for the Social Sciences in Mannheim, Germany. The project offers the social science community an opportunity to collect survey data from a representative sample of the German population. Submitted study proposals are evaluated based on a scientific review process.

Panel members were initially recruited in 2013 in face-to-face interviews followed by a self-administered profile survey. The mode was chosen by the participants. All participants of the profile survey are considered as members of the panel and invited to the bi-monthly regular waves. The starting cohort encompassed 4900 panelists at the beginning of 2014.

In order to compensate for panel attrition, a refreshment sample was drawn in 2016, using the General Social Survey (ALLBUS) interview as vehicle. The initial cohort encompasses German speaking respondents aged between 18 and 70 years (at the time of recruitment) and permanently residing in Germany, whereas the second cohort includes respondents from the age of 18 without upper restriction.

In 2018 a third recruitment sample was drawn, which was integrated with the wave ge. The third cohort also includes respondents aged 18 and over without an upper limit. Retroactively, cases up to and including wave fc (third wave from 2018) were added to the data. The Data Manual (ZA5664-65_sd_data-manual) has been reissued and there is a corresponding recruitment report (ZA5664-65_mb_recruitment2018).

The ALLBUS Sample is based on a disproportional sampling of respondents from the western and eastern part of Germany. A design weight that enables integration of the two recruitment cohorts is included into the dataset. For more details, please see the methods reports of the recruitment processes and die GESIS Panel reference paper (Bosnjak et al., 2017).

In March 2020, a special GESIS panel survey was conducted on the SARS-CoV-2 resp. COVID-19 coronavirus outbreak in Germany.

In 2021, the fourth recruitment sample was drawn using the German International Social Survey Programme (ISSP), which was integrated with wave ja. The fourth cohort also includes respondents aged 18 and older with no upper limit. For more information, see the corresponding recruitment report (ZA5664-65_r_i12.pdf).

In 2023, the fifth recruitment sample was drawn using the German European Social Survey (ESS Round 11), which was integrated with the wave la. The fifth cohort includes respondents aged 18 and over with no upper limit. For more information, see the corresponding recruitment report (ZA5664-65_r_k12.pdf).

GESIS Panel Demographic Dataset Starting with version 43-0-0 the longitudinal demographic dataset became part of the dissemination package. The dataset is a longitudinal dataset (long format), with harmonized measurements on demographic variables: Respondent ID; timepoint of survey; corresponding wave; survey year; recruitment cohort; sex of respondent; year of birth; month of birth; highest level of education; personal net income; household net income; marital status; AAPOR disposition code; mode of invitation; mode of participation.
w
Synthetic Data for an Imaginary Country, Full Population, 2023 - World
microdata.worldbank.org
Updated Jul 3, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Development Data Group, Data Analytics Unit (2023). Synthetic Data for an Imaginary Country, Full Population, 2023 - World [Dataset]. https://microdata.worldbank.org/index.php/catalog/5908
Explore at:
Dataset updated
Jul 3, 2023
Dataset authored and provided by
Development Data Group, Data Analytics Unit
Time period covered
2023
Area covered
World
Description
Abstract

The dataset is a relational dataset of 10,003,891 individuals (2,501,755 households), representing the entire population of an imaginary middle-income country. The dataset contains two data files: one with variables at the household level, the other one with variables at the individual level. It includes variables that are typically collected in population censuses (demography, education, occupation, dwelling characteristics, fertility, mortality, and migration) and in household surveys (household expenditure, anthropometric data for children, assets ownership). The data only includes ordinary households (no community households). The dataset was created using REaLTabFormer, a model that leverages deep learning methods. The dataset was created for the purpose of training and simulation and is not intended to be representative of any specific country.

A sample dataset of 8000 households was created out of this full-population dataset, and is also distributed as open data.

Geographic coverage

The dataset is a synthetic dataset for an imaginary country. It was created to represent the full national population of this country, by province and district (equivalent to admin1 and admin2 levels) and by urban/rural areas of residence.

Analysis unit

household, Individual

Universe

The dataset is a fully-synthetic dataset representative of the resident population of ordinary households for an imaginary middle-income country.

Kind of data

cen

Mode of data collection

other

Research instrument

The dataset is a synthetic dataset. Although the variables it contains are variables typically collected from sample surveys or population censuses, no questionnaire is available for this dataset. A "fake" questionnaire was however created for the sample dataset extracted from this dataset, to be used as training material.

Cleaning operations

The synthetic data generation process included a set of "validators" (consistency checks, based on which synthetic observation were assessed and rejected/replaced when needed). Also, some post-processing was applied to the data to result in the distributed data files.
V
NYTD Technical Bulletin #5: Cohort Management and Sampling
data.virginia.gov
gimi9.com
+1more
html
Updated Sep 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Administration for Children and Families (2025). NYTD Technical Bulletin #5: Cohort Management and Sampling [Dataset]. https://data.virginia.gov/dataset/nytd-technical-bulletin-5-cohort-management-and-sampling
Explore at:
htmlAvailable download formats
Dataset updated
Sep 6, 2025
Dataset provided by
Administration for Children and Families
Description
This TB describes how ACF will identify and finalize each cohort of youth in the NYTD follow-up population (or follow-up population sample for those States that opt to sample) for the purposes of assessing States' compliance with NYTD data collection and reporting requirements. The TB also specifies how States may opt to sample the baseline population for the purposes of collecting information on the follow-up population.

Metadata-only record linking to the original dataset. Open original dataset below.
European Union Statistics on Income and Living Conditions 2005 -...
catalog.ihsn.org
datacatalog.ihsn.org
Updated Mar 29, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eurostat (2019). European Union Statistics on Income and Living Conditions 2005 - Cross-Sectional User Database - Netherlands [Dataset]. https://catalog.ihsn.org/index.php/catalog/5733
Explore at:
Dataset updated
Mar 29, 2019
Dataset authored and provided by
Eurostathttps://ec.europa.eu/eurostat
Time period covered
2005
Area covered
Netherlands
Description
Abstract

In 2005, the EU-SILC instrument covered all EU Member States plus Iceland, Turkey, Norway, Switzerland and Croatia. EU-SILC has become the EU reference source for comparative statistics on income distribution and social exclusion at European level, particularly in the context of the "Program of Community action to encourage cooperation between Member States to combat social exclusion" and for producing structural indicators on social cohesion for the annual spring report to the European Council. The first priority is to be given to the delivery of comparable, timely and high quality cross-sectional data.

There are two types of datasets: 1) Cross-sectional data pertaining to fixed time periods, with variables on income, poverty, social exclusion and living conditions. 2) Longitudinal data pertaining to individual-level changes over time, observed periodically - usually over four years.

Social exclusion and housing-condition information is collected at household level. Income at a detailed component level is collected at personal level, with some components included in the "Household" section. Labor, education and health observations only apply to persons aged 16 and over. EU-SILC was established to provide data on structural indicators of social cohesion (at-risk-of-poverty rate, S80/S20 and gender pay gap) and to provide relevant data for the two 'open methods of coordination' in the field of social inclusion and pensions in Europe.

The fifth revision of the 2005 Cross-Sectional User Database is documented here.

Geographic coverage

National

Analysis unit

Households;

Individuals 16 years and older.

Universe

The survey covered all household members over 16 years old. Persons living in collective households and in institutions are generally excluded from the target population.

Kind of data

Sample survey data [ssd]

Sampling procedure

On the basis of various statistical and practical considerations and the precision requirements for the most critical variables, the minimum effective sample sizes to be achieved were defined. Sample size for the longitudinal component refers, for any pair of consecutive years, to the number of households successfully interviewed in the first year in which all or at least a majority of the household members aged 16 or over are successfully interviewed in both the years.

For the cross-sectional component, the plans are to achieve the minimum effective sample size of around 131.000 households in the EU as a whole (137.000 including Iceland and Norway). The allocation of the EU sample among countries represents a compromise between two objectives: the production of results at the level of individual countries, and production for the EU as a whole. Requirements for the longitudinal data will be less important. For this component, an effective sample size of around 98.000 households (103.000 including Iceland and Norway) is planned.

Member States using registers for income and other data may use a sample of persons (selected respondents) rather than a sample of complete households in the interview survey. The minimum effective sample size in terms of the number of persons aged 16 or over to be interviewed in detail is in this case taken as 75 % of the figures shown in columns 3 and 4 of the table I, for the cross-sectional and longitudinal components respectively.

The reference is to the effective sample size, which is the size required if the survey were based on simple random sampling (design effect in relation to the 'risk of poverty rate' variable = 1.0). The actual sample sizes will have to be larger to the extent that the design effects exceed 1.0 and to compensate for all kinds of non-response. Furthermore, the sample size refers to the number of valid households which are households for which, and for all members of which, all or nearly all the required information has been obtained. For countries with a sample of persons design, information on income and other data shall be collected for the household of each selected respondent and for all its members.

At the beginning, a cross-sectional representative sample of households is selected. It is divided into say 4 sub-samples, each by itself representative of the whole population and similar in structure to the whole sample. One sub-sample is purely cross-sectional and is not followed up after the first round. Respondents in the second sub-sample are requested to participate in the panel for 2 years, in the third sub-sample for 3 years, and in the fourth for 4 years. From year 2 onwards, one new panel is introduced each year, with request for participation for 4 years. In any one year, the sample consists of 4 sub-samples, which together constitute the cross-sectional sample. In year 1 they are all new samples; in all subsequent years, only one is new sample. In year 2, three are panels in the second year; in year 3, one is a panel in the second year and two in the third year; in subsequent years, one is a panel for the second year, one for the third year, and one for the fourth (final) year.

According to the Commission Regulation on sampling and tracing rules, the selection of the sample will be drawn according to the following requirements:

For all components of EU-SILC (whether survey or register based), the crosssectional and longitudinal (initial sample) data shall be based on a nationally representative probability sample of the population residing in private households within the country, irrespective of language, nationality or legal residence status. All private households and all persons aged 16 and over within the household are eligible for the operation.

Representative probability samples shall be achieved both for households, which form the basic units of sampling, data collection and data analysis, and for individual persons in the target population.

The sampling frame and methods of sample selection shall ensure that every individual and household in the target population is assigned a known and non-zero probability of selection.

By way of exception, paragraphs 1 to 3 shall apply in Germany exclusively to the part of the sample based on probability sampling according to Article 8 of the Regulation of the European Parliament and of the Council (EC) No 1177/2003 concerning

Community Statistics on Income and Living Conditions. Article 8 of the EU-SILC Regulation of the European Parliament and of the Council mentions: 1. The cross-sectional and longitudinal data shall be based on nationally representative probability samples. 2. By way of exception to paragraph 1, Germany shall supply cross-sectional data based on a nationally representative probability sample for the first time for the year 2008. For the year 2005, Germany shall supply data for one fourth based on probability sampling and for three fourths based on quota samples, the latter to be progressively replaced by random selection so as to achieve fully representative probability sampling by 2008. For the longitudinal component, Germany shall supply for the year 2006 one third of longitudinal data (data for year 2005 and 2006) based on probability sampling and two thirds based on quota samples. For the year 2007, half of the longitudinal data relating to years 2005, 2006 and 2007 shall be based on probability sampling and half on quota sample. After 2007 all of the longitudinal data shall be based on probability sampling.

Detailed information about sampling is available in Quality Reports in Documentation.

Mode of data collection

Mixed
n
Public Use Microdata Sample for the Older Population
neuinfo.org
dknet.org
+2more
Updated Feb 1, 2001
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2001). Public Use Microdata Sample for the Older Population [Dataset]. http://identifiers.org/RRID:SCR_010487
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_010487 https://identifiers.org/RRID:SCR_010487/resolver?q=&i=rrid
Dataset updated
Feb 1, 2001
Description
A public-use microdata sample focusing on the older population created from the 1990 census. This sample consists of 3 percent of households with at least one member aged 60 or older. Although, the highest age presented is age 90, this allows analysis of data on the very old for most states with a reasonable degree of reliability. Since data for all members in households containing a person 60 years and over will be on the file, users will be able to analyze patterns such as living arrangements and sources of household income from which older members may benefit. Additionally, users will be able to augment the PUMS-O sample with a PUMS file. The Census Bureau has issued two regular PUMS files for the entire population. One PUMS file will contain 1 percent of all households; the other PUMS file will contain 5 percent of all households. Both files have most sample data items, and differ only in geographical composition. The 1-percent file contains geographic areas that reflect metropolitan vs. non-metropolitan areas. The 5-percent file shows counties or groups of counties as well as large sub-county areas such as places of 100,000 or more. The geography on the 5-percent PUMS file matches that of the PUMS-O file. Since data for different households are present on the two files, users can merge the PUMS-O file with the 5-percent PUMS to construct an 8-percent sample. However, weighted averages must be constructed for any estimates created because each sample yields state-level estimates. Thus, it is possible to analyze substate areas even for the very old. In states where the geographic areas identified on the PUMS-O and the 5-percent PUMS are coterminous with State Planning and Service Areas (used by service providers in relation to the Older Americans Act), the Planning and Service Areas are identified. * Dates of Study: 1990-2000 Links: 1980: http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/08101 2000: http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/04204

Facebook

Twitter

Click to copy link

Link copied

Cite

National Institutes of Health (2025). Statistics review 2: Samples and populations [Dataset]. https://catalog.data.gov/dataset/statistics-review-2-samples-and-populations

Statistics review 2: Samples and populations

Explore at:

Dataset updated

Sep 6, 2025

Dataset provided by

National Institutes of Health

Description

The previous review in this series introduced the notion of data description and outlined some of the more common summary measures used to describe a dataset. However, a dataset is typically only of interest for the information it provides regarding the population from which it was drawn. The present review focuses on estimation of population values from a sample.

Clear search

Close search

Google apps

Main menu

Statistics review 2: Samples and populations

World Health Survey 2003 - Belgium

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Population and Family Health Survey 1997 - Jordan

Abstract

Geographic coverage

Analysis unit

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

Sampling error estimates

Data appraisal

Census Microdata Samples Project

Data from: RESEARCH METHODOLOGY FOR NOVELTY TECHNOLOGY

China Population Statistics: Sample Survey: Sampling Fraction

Current Population Survey (CPS)

Combined Locks, WI Annual Population and Growth Analysis Dataset: A...

About this dataset

Content

Inspiration

Recommended for further research

'Dataset2' - Who Tweets with Their Location? Understanding the Relationship...

U.S. population data for human identification markers

Descriptive statistics for the healthy population sample (N = 40).

Lebanon, KS Annual Population and Growth Analysis Dataset: A Comprehensive...

About this dataset

Content

Inspiration

Recommended for further research

China Population: Resided more than Half Year: Floating

Namibia Population and Housing Census 2011 - Namibia

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Sampling error estimates

Data appraisal

Demographic and Health Survey 1993 - Kenya

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

GESIS Panel.pop Population Sample – Extended Edition

Synthetic Data for an Imaginary Country, Full Population, 2023 - World

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Mode of data collection

Research instrument

Cleaning operations

NYTD Technical Bulletin #5: Cohort Management and Sampling

European Union Statistics on Income and Living Conditions 2005 -...

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Public Use Microdata Sample for the Older Population