100+ datasets found

2012-2016 American Community Survey: 5-Year Estimates - Public Use Microdata...
catalog.data.gov
s.cnmilf.com
Updated Jul 19, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Census Bureau (2023). 2012-2016 American Community Survey: 5-Year Estimates - Public Use Microdata Sample [Dataset]. https://catalog.data.gov/dataset/2012-2016-american-community-survey-5-year-estimates-public-use-microdata-sample
Explore at:
Dataset updated
Jul 19, 2023
Dataset provided by
United States Census Bureauhttp://census.gov/
Description
The American Community Survey (ACS) Public Use Microdata Sample (PUMS) contains a sample of responses to the ACS. The ACS PUMS dataset includes variables for nearly every question on the survey, as well as many new variables that were derived after the fact from multiple survey responses (such as poverty status).Each record in the file represents a single person, or, in the household-level dataset, a single housing unit. In the person-level file, individuals are organized into households, making possible the study of people within the contexts of their families and other household members. Individuals living in Group Quarters, such as nursing facilities or college facilities, are also included on the person file. ACS PUMS data are available at the nation, state, and Public Use Microdata Area (PUMA) levels. PUMAs are special non-overlapping areas that partition each state into contiguous geographic units containing roughly 100,000 people each. ACS PUMS files for an individual year, such as 2019, contain data on approximately one percent of the United States population.
H
Survey of Income and Program Participation (SIPP)
dataverse.harvard.edu
Updated May 30, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anthony Damico (2013). Survey of Income and Program Participation (SIPP) [Dataset]. http://doi.org/10.7910/DVN/I0FFJV
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/I0FFJV
Dataset updated
May 30, 2013
Dataset provided by
Harvard Dataverse
Authors
Anthony Damico
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
analyze the survey of income and program participation (sipp) with r if the census bureau's budget was gutted and only one complex sample survey survived, pray it's the survey of income and program participation (sipp). it's giant. it's rich with variables. it's monthly. it follows households over three, four, now five year panels. the congressional budget office uses it for their health insurance simulation . analysts read that sipp has person-month files, get scurred, and retreat to inferior options. the american community survey may be the mount everest of survey data, but sipp is most certainly the amazon. questions swing wild and free through the jungle canopy i mean core data dictionary. legend has it that there are still species of topical module variables that scientists like you have yet to analyze. ponce de león would've loved it here. ponce. what a name. what a guy. the sipp 2008 panel data started from a sample of 105,663 individuals in 42,030 households. once the sample gets drawn, the census bureau surveys one-fourth of the respondents every four months, over f our or five years (panel durations vary). you absolutely must read and understand pdf pages 3, 4, and 5 of this document before starting any analysis (start at the header 'waves and rotation groups'). if you don't comprehend what's going on, try their survey design tutorial. since sipp collects information from respondents regarding every month over the duration of the panel, you'll need to be hyper-aware of whether you want your results to be point-in-time, annualized, or specific to some other period. the analysis scripts below provide examples of each. at every four-month interview point, every respondent answers every core question for the previous four months. after that, wave-specific addenda (called topical modules) get asked, but generally only regarding a single prior month. to repeat: core wave files contain four records per person, topical modules contain one. if you stacked every core wave, you would have one record per person per month for the duration o f the panel. mmmassive. ~100,000 respondents x 12 months x ~4 years. have an analysis plan before you start writing code so you extract exactly what you need, nothing more. better yet, modify something of mine. cool? this new github repository contains eight, you read me, eight scripts: 1996 panel - download and create database.R 2001 panel - download and create database.R 2004 panel - download and create database.R 2008 panel - download and create database.R since some variables are character strings in one file and integers in anoth er, initiate an r function to harmonize variable class inconsistencies in the sas importation scripts properly handle the parentheses seen in a few of the sas importation scripts, because the SAScii package currently does not create an rsqlite database, initiate a variant of the read.SAScii function that imports ascii data directly into a sql database (.db) download each microdata file - weights, topical modules, everything - then read 'em into sql 2008 panel - full year analysis examples.R< br /> define which waves and specific variables to pull into ram, based on the year chosen loop through each of twelve months, constructing a single-year temporary table inside the database read that twelve-month file into working memory, then save it for faster loading later if you like read the main and replicate weights columns into working memory too, merge everything construct a few annualized and demographic columns using all twelve months' worth of information construct a replicate-weighted complex sample design with a fay's adjustment factor of one-half, again save it for faster loading later, only if you're so inclined reproduce census-publish ed statistics, not precisely (due to topcoding described here on pdf page 19) 2008 panel - point-in-time analysis examples.R define which wave(s) and specific variables to pull into ram, based on the calendar month chosen read that interview point (srefmon)- or calendar month (rhcalmn)-based file into working memory read the topical module and replicate weights files into working memory too, merge it like you mean it construct a few new, exciting variables using both core and topical module questions construct a replicate-weighted complex sample design with a fay's adjustment factor of one-half reproduce census-published statistics, not exactly cuz the authors of this brief used the generalized variance formula (gvf) to calculate the margin of error - see pdf page 4 for more detail - the friendly statisticians at census recommend using the replicate weights whenever possible. oh hayy, now it is. 2008 panel - median value of household assets.R define which wave(s) and spe cific variables to pull into ram, based on the topical module chosen read the topical module and replicate weights files into working memory too, merge once again construct a replicate-weighted complex sample design with a...
D
Census Tract Top 50 American Community Survey Data
data.seattle.gov
hub.arcgis.com
+1more
csv, xlsx, xml
Updated Feb 3, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Census Tract Top 50 American Community Survey Data [Dataset]. https://data.seattle.gov/dataset/Census-Tract-Top-50-American-Community-Survey-Data/jya9-y5bv/data
Explore at:
csv, xlsx, xmlAvailable download formats
Dataset updated
Feb 3, 2025
Description
Data from: American Community Survey, 5-year Series

King County, Washington census tracts with nonoverlapping vintages of the 5-year American Community Survey (ACS) estimates starting in 2010 of over 50 attributes of the most requested data derived from the U.S. Census Bureau's demographic profiles (DP02-DP05). Also includes the most recent release annually with the vintage identified in the "ACS Vintage" field.

The census tract boundaries match the vintage of the ACS data (currently 2010 and 2020) so please note the geographic changes between the decades.

Tracts have been coded as being within the City of Seattle as well as assigned to neighborhood groups called "Community Reporting Areas". These areas were created after the 2000 census to provide geographically consistent neighborhoods through time for reporting U.S. Census Bureau data. This is not an attempt to identify neighborhood boundaries as defined by neighborhoods themselves.

Vintages: 2010, 2015, 2020, 2021, 2022, 2023
ACS Table(s): DP02, DP03, DP04, DP05
Data downloaded from: Census Bureau's Explore Census Data

The United States Census Bureau's American Community Survey (ACS):
About the Survey
Geography & ACS
Technical Documentation
News & Updates
This ready-to-use layer can be used within ArcGIS Pro, ArcGIS Online, its configurable apps, dashboards, Story Maps, custom apps, and mobile apps. Data can also be exported for offline workflows. Please cite the Census and ACS when using this data.

Data Note from the Census:
Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see Accuracy of the Data). The effect of nonsampling error is not represented in these tables.

Data Processing Notes:
Boundaries come from the US Census TIGER geodatabases, specifically, the National Sub-State Geography Database (named tlgdb(year)a_us_substategeo.gdb). Boundaries are updated at the same time as the data updates (annually), and the boundary vintage appropriately matches the data vintage as specified by the Census. These are Census boundaries with water and/or coastlines erased for cartographic and mapping purposes. For census tracts, the water cutouts are derived from a subset of the <a href='https://www2.census.gov/geo/tiger/TGRGDB20/tlgdb_2020_a_us_areawater.gdb.zip' style='color:rgb(0,
d
ACS 5-Year Social Characteristics DC Census Tract
opdatahub.dc.gov
adoptablock.dc.gov
+4more
Updated Feb 28, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Washington, DC (2025). ACS 5-Year Social Characteristics DC Census Tract [Dataset]. https://opdatahub.dc.gov/datasets/acs-5-year-social-characteristics-dc-census-tract
Explore at:
Dataset updated
Feb 28, 2025
Dataset authored and provided by
City of Washington, DC
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered

Description
Household type, Education, Disability, Language, Computer/Internet Use, and more. This service is updated annually with American Community Survey (ACS) 5-year data. Contact: District of Columbia, Office of Planning. Email: planning@dc.gov. Geography: Census Tracts. Current Vintage: 2019-2023. ACS Table(s): DP02. Data downloaded from: Census Bureau's API for American Community Survey. Date of API call: January 2, 2025. National Figures: data.census.gov. Please cite the Census and ACS when using this data. Data Note from the Census: Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see Accuracy of the Data). The effect of nonsampling error is not represented in these tables. Data Processing Notes: This layer is updated automatically when the most current vintage of ACS data is released each year, usually in December. The layer always contains the latest available ACS 5-year estimates. It is updated annually within days of the Census Bureau's release schedule. Boundaries come from the US Census TIGER geodatabases. Boundaries are updated at the same time as the data updates (annually), and the boundary vintage appropriately matches the data vintage as specified by the Census. These are Census boundaries with water and/or coastlines clipped for cartographic purposes. For census tracts, the water cutouts are derived from a subset of the 2020 AWATER (Area Water) boundaries offered by TIGER. For state and county boundaries, the water and coastlines are derived from the coastlines of the 500k TIGER Cartographic Boundary Shapefiles. The original AWATER and ALAND fields are still available as attributes within the data table (units are square meters). Field alias names were created based on the Table Shells file available from the American Community Survey Summary File Documentation page. Data processed using R statistical package and ArcGIS Desktop. Margin of Error was not included in this layer but is available from the Census Bureau. Contact the Office of Planning for more information about obtaining Margin of Error values.
a
Maryland American Community Survey - ACS Census Tracts
arc-gis-hub-home-arcgishub.hub.arcgis.com
data.imap.maryland.gov
+3more
Updated Feb 9, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ArcGIS Online for Maryland (2016). Maryland American Community Survey - ACS Census Tracts [Dataset]. https://arc-gis-hub-home-arcgishub.hub.arcgis.com/datasets/maryland::maryland-american-community-survey-acs-census-tracts/about
Explore at:
Dataset updated
Feb 9, 2016
Dataset authored and provided by
ArcGIS Online for Maryland
Area covered

Description
The American Community Survey (ACS) is a nationwide, continuous survey designed to provide communities with reliable and timely demographic, housing, social and economic data. The ACS replaces the decennial census long form in 2010 and every year thereafter. The annual ACS sample is smaller than that of previous long form surveys resulting in a larger sampling error. Coefficients of Variation (CVs), which are statistical measures that show the relative amount of sampling error associated with an estimate, are presented here as a measure of reliability and usability of the data. The unit of geography used for the 2010 - 2014 data is the census tract - a small statistical area within a county, which is delineated every 10 years prior to the decennial census.Last Updated: UnknownThis is a MD iMAP hosted service. Find more information at https://imap.maryland.gov.Feature Service Link:https://mdgeodata.md.gov/imap/rest/services/Demographics/MD_AmericanCommunitySurvey/FeatureServer/0
Demographic and Health Survey 2017 - Indonesia
microdata.worldbank.org
catalog.ihsn.org
+1more
Updated Jul 12, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ministry of Health (Kemenkes) (2019). Demographic and Health Survey 2017 - Indonesia [Dataset]. https://microdata.worldbank.org/index.php/catalog/3477
Explore at:
Dataset updated
Jul 12, 2019
Dataset provided by
Statistics Indonesiahttp://www.bps.go.id/
Ministry of Health (Kemenkes)
National Population and Family Planning Board (BKKBN)
Time period covered
2017
Area covered
Indonesia
Description
Abstract

The primary objective of the 2017 Indonesia Dmographic and Health Survey (IDHS) is to provide up-to-date estimates of basic demographic and health indicators. The IDHS provides a comprehensive overview of population and maternal and child health issues in Indonesia. More specifically, the IDHS was designed to: - provide data on fertility, family planning, maternal and child health, and awareness of HIV/AIDS and sexually transmitted infections (STIs) to help program managers, policy makers, and researchers to evaluate and improve existing programs; - measure trends in fertility and contraceptive prevalence rates, and analyze factors that affect such changes, such as residence, education, breastfeeding practices, and knowledge, use, and availability of contraceptive methods; - evaluate the achievement of goals previously set by national health programs, with special focus on maternal and child health; - assess married men’s knowledge of utilization of health services for their family’s health and participation in the health care of their families; - participate in creating an international database to allow cross-country comparisons in the areas of fertility, family planning, and health.

Geographic coverage

National coverage

Analysis unit

Household

Individual

Children age 0-5

Woman age 15-49

Man age 15-54

Universe

The survey covered all de jure household members (usual residents), all women age 15-49 years resident in the household, and all men age 15-54 years resident in the household.

Kind of data

Sample survey data [ssd]

Sampling procedure

The 2017 IDHS sample covered 1,970 census blocks in urban and rural areas and was expected to obtain responses from 49,250 households. The sampled households were expected to identify about 59,100 women age 15-49 and 24,625 never-married men age 15-24 eligible for individual interview. Eight households were selected in each selected census block to yield 14,193 married men age 15-54 to be interviewed with the Married Man's Questionnaire. The sample frame of the 2017 IDHS is the Master Sample of Census Blocks from the 2010 Population Census. The frame for the household sample selection is the updated list of ordinary households in the selected census blocks. This list does not include institutional households, such as orphanages, police/military barracks, and prisons, or special households (boarding houses with a minimum of 10 people).

The sampling design of the 2017 IDHS used two-stage stratified sampling: Stage 1: Several census blocks were selected with systematic sampling proportional to size, where size is the number of households listed in the 2010 Population Census. In the implicit stratification, the census blocks were stratified by urban and rural areas and ordered by wealth index category.

Stage 2: In each selected census block, 25 ordinary households were selected with systematic sampling from the updated household listing. Eight households were selected systematically to obtain a sample of married men.

For further details on sample design, see Appendix B of the final report.

Mode of data collection

Face-to-face [f2f]

Research instrument

The 2017 IDHS used four questionnaires: the Household Questionnaire, Woman’s Questionnaire, Married Man’s Questionnaire, and Never Married Man’s Questionnaire. Because of the change in survey coverage from ever-married women age 15-49 in the 2007 IDHS to all women age 15-49, the Woman’s Questionnaire had questions added for never married women age 15-24. These questions were part of the 2007 Indonesia Young Adult Reproductive Survey Questionnaire. The Household Questionnaire and the Woman’s Questionnaire are largely based on standard DHS phase 7 questionnaires (2015 version). The model questionnaires were adapted for use in Indonesia. Not all questions in the DHS model were included in the IDHS. Response categories were modified to reflect the local situation.

Cleaning operations

All completed questionnaires, along with the control forms, were returned to the BPS central office in Jakarta for data processing. The questionnaires were logged and edited, and all open-ended questions were coded. Responses were entered in the computer twice for verification, and they were corrected for computer-identified errors. Data processing activities were carried out by a team of 34 editors, 112 data entry operators, 33 compare officers, 19 secondary data editors, and 2 data entry supervisors. The questionnaires were entered twice and the entries were compared to detect and correct keying errors. A computer package program called Census and Survey Processing System (CSPro), which was specifically designed to process DHS-type survey data, was used in the processing of the 2017 IDHS.

Response rate

Of the 49,261 eligible households, 48,216 households were found by the interviewer teams. Among these households, 47,963 households were successfully interviewed, a response rate of almost 100%.

In the interviewed households, 50,730 women were identified as eligible for individual interview and, from these, completed interviews were conducted with 49,627 women, yielding a response rate of 98%. From the selected household sample of married men, 10,440 married men were identified as eligible for interview, of which 10,009 were successfully interviewed, yielding a response rate of 96%. The lower response rate for men was due to the more frequent and longer absence of men from the household. In general, response rates in rural areas were higher than those in urban areas.

Sampling error estimates

The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors and (2) sampling errors. Nonsampling errors result from mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2017 Indonesia Demographic and Health Survey (2017 IDHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.

Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2017 IDHS is only one of many samples that could have been selected from the same population, using the same design and identical size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling error is a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.

A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.

If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2017 IDHS sample is the result of a multi-stage stratified design, and, consequently, it was necessary to use more complex formulas. The computer software used to calculate sampling errors for the 2017 IDHS is a STATA program. This program used the Taylor linearization method for variance estimation for survey estimates that are means or proportions. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.

A more detailed description of estimates of sampling errors are presented in Appendix C of the survey final report.

Data appraisal

Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Age distribution of eligible and interviewed men - Completeness of reporting - Births by calendar year - Reporting of age at death in days - Reporting of age at death in months

See details of the data quality tables in Appendix D of the survey final report.
2018 American Community Survey: 1-Year Estimates - Public Use Microdata...
catalog.data.gov
Updated Jul 19, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Census Bureau (2023). 2018 American Community Survey: 1-Year Estimates - Public Use Microdata Sample [Dataset]. https://catalog.data.gov/dataset/2018-american-community-survey-1-year-estimates-public-use-microdata-sample
Explore at:
Dataset updated
Jul 19, 2023
Dataset provided by
United States Census Bureauhttp://census.gov/
Description
The American Community Survey (ACS) Public Use Microdata Sample (PUMS) contains a sample of responses to the ACS. The ACS PUMS dataset includes variables for nearly every question on the survey, as well as many new variables that were derived after the fact from multiple survey responses (such as poverty status).Each record in the file represents a single person, or, in the household-level dataset, a single housing unit. In the person-level file, individuals are organized into households, making possible the study of people within the contexts of their families and other household members. Individuals living in Group Quarters, such as nursing facilities or college facilities, are also included on the person file. ACS PUMS data are available at the nation, state, and Public Use Microdata Area (PUMA) levels. PUMAs are special non-overlapping areas that partition each state into contiguous geographic units containing roughly 100,000 people each. ACS PUMS files for an individual year, such as 2020, contain data on approximately one percent of the United States population
n
Agriculture Sample Census Survey 2007/08 - Tanzania
microdata.nbs.go.tz
Updated May 26, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Bureau of Statistics (2022). Agriculture Sample Census Survey 2007/08 - Tanzania [Dataset]. https://microdata.nbs.go.tz/index.php/catalog/5
Explore at:
Dataset updated
May 26, 2022
Dataset provided by
Office of Chief Government Statistician-Zanzibar
National Bureau of Statistics
Time period covered
2009
Area covered
Tanzania
Description
Abstract

The 2007/08 Agricultural Sample Census was designed to meet the data needs of a wide range of users down to district level including policy makers at local, regional and national levels, rural development agencies, funding institutions, researchers, NGOs, farmers' organizations, and others. The dataset is both more numerous in its sample and detailed in its scope and coverage so as to meet the user demand.

The census was carried out in order to:

Identify any structural changes,in the size of farm household holdings, crop and livestock production, farm input and implement use. It also seeks to determine if there are any improvements in the rural infrastructure and the level of agricultural households living conditions;

-Provide benchmark data on productivity, production and agricultural practices in relation to policies and interventions promoted by the Ministry of Agriculture and Food Security and other stakeholders; and

Obtain data that will be used to address specific issues such as: food security, rural poverty, gender, agro-processing, marketing and service delivery.

Geographic coverage

Tanzania Mainland and Zanzibar

Analysis unit

Community, Household, Individual

Universe

Small scale farmers, Large Scale Farmers, Community

Kind of data

Sample survey data [ssd]

Sampling procedure

The Mainland sample consisted of 3,192 villages. The total Mainland sample was 47,880 agricultural households while in Zanzibar, a total of 317 EAs were selected and 4,755 agricultural households were covered.

The villages were drawn from the National Master Sample (NMS) developed by the National Bureau of Statistics (NBS) to serve as a national framework for the conduct of household based surveys in the country. The National Master Sample was developed from the previous 2002 Population and Housing Census.

The numbers of villages/Enumeration Areas (EAs) were selected for the first stage with a probability proportional to the number of villages/EAs in each district. In the second stage, 15 households were selected from a list of agricultural households in each village/EA using systematic random sampling.

Mode of data collection

Face-to-face [f2f]

Research instrument

The census used three different questionnaires: - Small scale farm questionnaire - Community level questionnaire - Large scale farm questionnaire

The small scale farm questionnaire was the main census instrument and it included questions related to crop and livestock production and practices; population demographics; access to services, community resources and infrastructure; issues on poverty and gender. The main topics covered were:

Household demographics and activities of the household members

Land access, ownership, tenure and use

Crop and livestock production and productivity

Access to inputs and farming implements

Access and use of credit

Access to infrastructure (roads, district and regional headquarters, markets, advisory services, schools, hospitals).

Crop marketing, storage and agro processing

Tree farming, agro-forestry, and fish farming

Access and use of communal resources (grazing land, communal forests, water for humans and livestock, beekeeping)

Investment activities ( irrigation structures, water harvesting, erosion control, fencing)

Off farm income and non agricultural related activities

Households living conditions (housing, sanitary facilities )

Livelihood constraints

Poverty Indicators

The community level questionnaire was designed to collect village level data such as access and use of common resources, community tree plantation and seasonal farm gate prices.

The Large Scale Farm questionnaire was administered to large farms either privately or corporately managed.

Cleaning operations

Data editing took place at a number of stages throughout the processing, including: - Manual cleaning exercisePrior to scanning. (Questionnaires found dirty or damaged and generally unsuitable for scanning were put aside for manual data entry ) - CSPro was used for data entry of all Large Scale Farms and Community based questionnaires - Scanning and ICR data capture technology for the smallholder questionnaire - There was an Interactive validation during the ICR extraction process. - The use of a batch validation program developed in CSPro. This was used in order to identify inconsistencies within a questionnaire. - Statistical Package for Social Sciences (SPSS) was used to produce the Census tabulations - Microsoft Excel was used to organize the tables, charts and compute additional indicators -Arc GIS (Geographical Information System) was used in producing the maps. - Microsoft Word was used in compiling and writing up the reports
e
HSRC Master Sample II - Dataset - B2FIND
b2find.eudat.eu
Updated Aug 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). HSRC Master Sample II - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/96d7c5e3-e8c8-5eb6-a25b-c22ad9f86fba
Explore at:
Dataset updated
Aug 12, 2025
Description
Description: The 2005 HSRC Master Sample was used for SABSSM 2008 and 2012, the SANHANES study in 2012 and SASAS 2007-2010 (adjacent EAs) to obtain an understanding of geographical spread of HIV/AIDS, perceptions and attitudes of people and other health related studies over time. Abstract: A sample can be defined as a subset containing the characteristics of a larger population. Samples are used in statistical testing when population sizes are too large for the test to include all possible members or observations. A sample should represent the whole population and not reflect bias toward a specific attribute.[1] One of the most crucial aspects of sample design in household surveys is its frame. The sampling frame has significant implications on the cost and the quality of any survey, household or otherwise.[2] The sampling frame .... in a household survey must cover the entire target population. When that frame is used for multiple surveys or multiple rounds of the same survey it is known as a master sample frame or .... master sample.[3] A master sample is a sample drawn from a population for use on a number of future occasions, so as to avoid ad hoc sampling on each occasion. Sometimes the master sample is large and subsequent inquiries are based on a sub-sample from it.[4] The HSRC compiles master samples in order to construct samples for various HSRC research studies. The 2005 HSRC Master Sample was used for SABSSM 2008 and 2012, SASAS 2007-2010 and the SANHANES study in 2012 to obtain an understanding of geographical spread of HIV/AIDS, perceptions and attitudes of people and other health related studies over time. The 2005 HSRC Master Sample was created in the following way: South Africa was delineated into EAs according to municipality and province. Municipal boundaries were obtained from the Municipal Demarcation Board. An Enumeration area (EA) is the smallest geographical unit (piece of land) into which the country is divided for census or survey enumeration.[5] The concepts and definitions of terms used for Census 2001 comply in most instances with United Nations standards for censuses. A total of 1,000 census enumeration areas (EAs) from the 2001 population census were randomly selected using probability proportional to size and stratified by province, locality type and race in urban areas from a database of 80 787 EAs that were mapped using aerial photography to develop an HSRC master sample for selecting households. The ideal frame would be complete with respect to the target population if all of its members (the universe) are covered by the frame. Ideal characteristics of a master sample: The master frame should be as complete, accurate and current as practicable. A master sample frame for household surveys is typically developed from the most recent census, just as a regular sample frame is. Because the master frame may be used during an entire intercensal (between census) period, however, it will usually require periodic and regular updating such as every 2-3 years. This is in contrast to a regular frame which is more likely to be up-dated on an ad hoc basis and only when a particular survey is being planned[6] [1] http://www.investopedia.com/terms/s/sample.asp [2] http://unstats.un.org/unsd/demographic/meetings/egm/sampling_1203/docs/no_3.pdf [3] http://unstats.un.org/unsd/demographic/meetings/egm/sampling_1203/docs/no_3.pdf [4] A Dictionary of Statistical Terms, 5th edition, prepared for the International Statistical Institute by F.H.C. Marriott. Published for the International Statistical Institute by Longman Scientific and Technical. http://stats.oecd.org/glossary/detail.asp?ID=3708 [5] http://africageodownloads.info/128_mokgokolo.pdf [6] http://unstats.un.org/unsd/demographic/meetings/egm/sampling_1203/docs/no_3.pdf All enumeration areas (80 787 EAs) within the South African borders during the 2001 Census. The whole country was delimited into EAs according to municipality and province. Municipal boundaries were obtained from the Municipal Demarcation Board. A total of 1,000 census enumeration areas (EAs) from the 2001 population census were randomly selected using probability proportional to size and stratified by province, locality type and race in urban areas from a database of 80 787 EAs that were mapped in all surveys using aerial photography to develop all HSRC master sample for selecting households. The first digit represents the province The second and third digits represent the municipality
N
Sweet Home, OR Age Group Population Dataset: A complete breakdown of Sweet...
neilsberg.com
csv, json
Updated Sep 16, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2023). Sweet Home, OR Age Group Population Dataset: A complete breakdown of Sweet Home age demographics from 0 to 85 years, distributed across 18 age groups [Dataset]. https://www.neilsberg.com/research/datasets/5fc3378d-3d85-11ee-9abe-0aa64bf2eeb2/
Explore at:
json, csvAvailable download formats
Dataset updated
Sep 16, 2023
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Sweet Home
Variables measured
Population Under 5 Years, Population over 85 years, Population Between 5 and 9 years, Population Between 10 and 14 years, Population Between 15 and 19 years, Population Between 20 and 24 years, Population Between 25 and 29 years, Population Between 30 and 34 years, Population Between 35 and 39 years, Population Between 40 and 44 years, and 9 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the age groups. For age groups we divided it into roughly a 5 year bucket for ages between 0 and 85. For over 85, we aggregated data into a single group for all ages. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the Sweet Home population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Sweet Home. The dataset can be utilized to understand the population distribution of Sweet Home by age. For example, using this dataset, we can identify the largest age group in Sweet Home.

Key observations

The largest age group in Sweet Home, OR was for the group of age 60-64 years with a population of 833 (8.57%), according to the 2021 American Community Survey. At the same time, the smallest age group in Sweet Home, OR was the 80-84 years with a population of 210 (2.16%). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

Age groups:

Under 5 years

5 to 9 years

10 to 14 years

15 to 19 years

20 to 24 years

25 to 29 years

30 to 34 years

35 to 39 years

40 to 44 years

45 to 49 years

50 to 54 years

55 to 59 years

60 to 64 years

65 to 69 years

70 to 74 years

75 to 79 years

80 to 84 years

85 years and over

Variables / Data Columns

Age Group: This column displays the age group in consideration

Population: The population for the specific age group in the Sweet Home is shown in this column.

% of Total Population: This column displays the population of each age group as a proportion of Sweet Home total population. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Sweet Home Population by Age. You can refer the same here
u
American Community Survey
gstore.unm.edu
csv, geojson, gml +5
Updated Mar 6, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Earth Data Analysis Center (2020). American Community Survey [Dataset]. https://gstore.unm.edu/apps/rgis/datasets/f82f2633-574f-4e5f-99dc-603db9849887/metadata/FGDC-STD-001-1998.html
Explore at:
gml(5), json(5), kml(5), zip(1), shp(5), xls(5), geojson(5), csv(5)Available download formats
Dataset updated
Mar 6, 2020
Dataset provided by
Earth Data Analysis Center
Time period covered
2017
Area covered
New Mexico, West Bounding Coordinate -109.05017 East Bounding Coordinate -103.00196 North Bounding Coordinate 37.000293 South Bounding Coordinate 31.33217
Description
A broad and generalized selection of 2013-2017 US Census Bureau 2017 5-year American Community Survey income and earnings data estimates, obtained via Census API and joined to the appropriate geometry (in this case, New Mexico counties). The selection, while not comprehensive, provides a first-level characterization of the household income, median household income by race and by age group, Social Security income, the GINI Index, per capita income, median family income, and median household earnings by age, and by education level, in New Mexico. The determination of which estimates to include was based upon level of interest and providing a manageable dataset for users. The U.S. Census Bureau's American Community Survey (ACS) is a nationwide, continuous survey designed to provide communities with reliable and timely demographic, housing, social, and economic data every year. The ACS collects long-form-type information throughout the decade rather than only once every 10 years. As in the decennial census, strict confidentiality laws protect all information that could be used to identify individuals or households.The ACS combines population or other data from multiple years to produce reliable numbers for small counties, neighborhoods, and other local areas. To provide information for communities each year, the ACS provides 1-, 3-, and 5-year estimates. ACS 5-year estimates (multiyear estimates) are “period” estimates that represent data collected over a 60-month period of time (as opposed to “point-in-time” estimates, such as the decennial census, that approximate the characteristics of an area on a specific date). ACS data are released in the year immediately following the year in which they are collected. ACS estimates based on data collected from 2009–2014 should not be called “2009” or “2014” estimates. Multiyear estimates should be labeled to indicate clearly the full period of time. The primary advantage of using multiyear estimates is the increased statistical reliability of the data for less populated areas and small population subgroups. Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. While each full Data Profile contains margin of error (MOE) information, this dataset does not. Those individuals requiring more complete data are directed to download the more detailed datasets from the ACS American FactFinder website. This dataset is organized by New Mexico county boundaries, based on TIGER/Line Files: shapefiles and related database files (.dbf) that are an extract of selected geographic and cartographic information from the U.S. Census Bureau's Master Address File / Topologically Integrated Geographic Encoding and Referencing (MAF/TIGER) Database. NOTE: A '-666666666' entry indicates that either no sample observations or too few sample observations were available to compute an estimate, or a ratio of medians cannot be calculated because one or both of the median estimates falls in the lowest interval or upper interval of an open-ended distribution.
Pew Survey on Israel's Religiously Divided Society Data Set
thearda.com
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pew Forum on Religion and Public Life, Pew Survey on Israel's Religiously Divided Society Data Set [Dataset]. http://doi.org/10.17605/OSF.IO/GSQVJ
Explore at:
Unique identifier
https://doi.org/10.17605/OSF.IO/GSQVJ
Dataset provided by
Association of Religion Data Archives
Authors
Pew Forum on Religion and Public Life
Dataset funded by
The Pew Charitable Trusts
The Neubauer Family Foundation
Pew Research Centerhttp://pewresearch.org/
Description
Between Oct. 14, 2014, and May 21, 2015, Pew Research Center, with generous funding from The Pew Charitable Trusts and the Neubauer Family Foundation, completed 5,601 face-to-face interviews with non-institutionalized adults ages 18 and older living in Israel.

The survey sampling plan was based on six districts defined in the 2008 Israeli census. In addition, Jewish residents of West Bank (Judea and Samaria) were included.

The sample includes interviews with 3,789 respondents defined as Jews, 871 Muslims, 468 Christians and 439 Druze. An additional 34 respondents belong to other religions or are religiously unaffiliated. Five groups were oversampled as part of the survey design: Jews living in the West Bank, Haredim, Christian Arabs, Arabs living in East Jerusalem and Druze.

Interviews were conducted under the direction of Public Opinion and Marketing Research of Israel (PORI). Surveys were administered through face-to-face, paper and pencil interviews conducted at the respondent's place of residence. Sampling was conducted through a multi-stage stratified area probability sampling design based on national population data available through the Israel's Central Bureau of Statistics' 2008 census.

The questionnaire was designed by Pew Research Center staff in consultation with subject matter experts and advisers to the project. The questionnaire was translated into Hebrew, Russian and Arabic, independently verified by professional linguists conversant in regional dialects and pretested prior to fieldwork.

The questionnaire was divided into four sections. All respondents who took the survey in Russian or Hebrew were branched into the Jewish questionnaire (Questionnaire A). Arabic-speaking respondents were branched into the Muslim (Questionnaire B), Christian (Questionnaire C) or Druze questionnaire (D) based on their response to the religious identification question. For the full question wording and exact order of questions, please see the questionnaire.

Note that not all respondents who took the questionnaire in Hebrew or Russian are classified as Jews in this study. For further details on how respondents were classified as Jews, Muslims, Christians and Druze in the study, please see sidebar in the report titled "http://www.pewforum.org/2016/03/08/israels-religiously-divided-society/" Target="_blank">"How Religious are Defined".

Following fieldwork, survey performance was assessed by comparing the results for key demographic variables with population statistics available through the census. Data were weighted to account for different probabilities of selection among respondents. Where appropriate, data also were weighted through an iterative procedure to more closely align the samples with official population figures for gender, age and education. The reported margins of sampling error and the statistical tests of significance used in the analysis take into account the design effects due to weighting and sample design.

In addition to sampling error and other practical difficulties, one should bear in mind that question wording also can have an impact on the findings of opinion polls.
w
Synthetic Data for an Imaginary Country, Sample, 2023 - World
microdata.worldbank.org
nada-demo.ihsn.org
Updated Jul 7, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Development Data Group, Data Analytics Unit (2023). Synthetic Data for an Imaginary Country, Sample, 2023 - World [Dataset]. https://microdata.worldbank.org/index.php/catalog/5906
Explore at:
Dataset updated
Jul 7, 2023
Dataset authored and provided by
Development Data Group, Data Analytics Unit
Time period covered
2023
Area covered
World, World
Description
Abstract

The dataset is a relational dataset of 8,000 households households, representing a sample of the population of an imaginary middle-income country. The dataset contains two data files: one with variables at the household level, the other one with variables at the individual level. It includes variables that are typically collected in population censuses (demography, education, occupation, dwelling characteristics, fertility, mortality, and migration) and in household surveys (household expenditure, anthropometric data for children, assets ownership). The data only includes ordinary households (no community households). The dataset was created using REaLTabFormer, a model that leverages deep learning methods. The dataset was created for the purpose of training and simulation and is not intended to be representative of any specific country.

The full-population dataset (with about 10 million individuals) is also distributed as open data.

Geographic coverage

The dataset is a synthetic dataset for an imaginary country. It was created to represent the population of this country by province (equivalent to admin1) and by urban/rural areas of residence.

Analysis unit

Household, Individual

Universe

The dataset is a fully-synthetic dataset representative of the resident population of ordinary households for an imaginary middle-income country.

Kind of data

ssd

Sampling procedure

The sample size was set to 8,000 households. The fixed number of households to be selected from each enumeration area was set to 25. In a first stage, the number of enumeration areas to be selected in each stratum was calculated, proportional to the size of each stratum (stratification by geo_1 and urban/rural). Then 25 households were randomly selected within each enumeration area. The R script used to draw the sample is provided as an external resource.

Mode of data collection

other

Research instrument

The dataset is a synthetic dataset. Although the variables it contains are variables typically collected from sample surveys or population censuses, no questionnaire is available for this dataset. A "fake" questionnaire was however created for the sample dataset extracted from this dataset, to be used as training material.

Cleaning operations

The synthetic data generation process included a set of "validators" (consistency checks, based on which synthetic observation were assessed and rejected/replaced when needed). Also, some post-processing was applied to the data to result in the distributed data files.

Response rate

This is a synthetic dataset; the "response rate" is 100%.
2024 Public Sector: GS00SS14 | Percentage Distribution of Revenue of Public...
data.census.gov
test.data.census.gov
Updated Mar 28, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ECN (2025). 2024 Public Sector: GS00SS14 | Percentage Distribution of Revenue of Public Elementary-Secondary School Systems in the United States: Fiscal Year 2012- 2023 (PUB Public Sector Annual Surveys and Census of Governments) [Dataset]. https://data.census.gov/table/GOVSTIMESERIES.GS00SS14?q=GS00SS14
Explore at:
Dataset updated
Mar 28, 2025
Dataset provided by
United States Census Bureauhttp://census.gov/
Authors
ECN
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Time period covered
2024
Area covered
United States
Description
Key Table Information.Table Title.Percentage Distribution of Revenue of Public Elementary-Secondary School Systems in the United States: Fiscal Year 2012- 2023.Table ID.GOVSTIMESERIES.GS00SS14.Survey/Program.Public Sector.Year.2024.Dataset.PUB Public Sector Annual Surveys and Census of Governments.Source.U.S. Census Bureau, Public Sector.Release Date.2025-05-01.Release Schedule.The Annual Survey of School System Finances occurs every year. Data are typically released in early May. There are approximately two years between the reference period and data release..Dataset Universe.Census of Governments - Organization (CG):The universe of this file is all federal, state, and local government units in the United States. In addition to the federal government and the 50 state governments, the Census Bureau recognizes five basic types of local governments. The government types are: County, Municipal, Township, Special District, and School District. Of these five types, three are categorized as General Purpose governments: County, municipal, and township governments are readily recognized and generally present no serious problem of classification. However, legislative provisions for school district and special district governments are diverse. These two types are categorized as Special Purpose governments. Numerous single-function and multiple-function districts, authorities, commissions, boards, and other entities, which have varying degrees of autonomy, exist in the United States. The basic pattern of these entities varies widely from state to state. Moreover, various classes of local governments within a particular state also differ in their characteristics. Refer to the Individual State Descriptions report for an overview of all government entities authorized by state.The Public Use File provides a listing of all independent government units, and dependent school districts active as of fiscal year ending June 30, 2024. The Annual Surveys of Public Employment & Payroll (EP) and State and Local Government Finances (LF):The target population consists of all 50 state governments, the District of Columbia, and a sample of local governmental units (counties, cities, townships, special districts, school districts). In years ending in '2' and '7' the entire universe is canvassed. In intervening years, a sample of the target population is surveyed. Additional details on sampling are available in the survey methodology descriptions for those years.The Annual Survey of Public Pensions (PP):The target population consists of state- and locally-administered defined benefit funds and systems of all 50 state governments, the District of Columbia, and a sample of local governmental units (counties, cities, townships, special districts, school districts). In years ending in '2' and '7' the entire universe is canvassed. In intervening years, a sample of the target population is surveyed. Additional details on sampling are available in the survey methodology descriptions for those years.The Annual Surveys of State Government Finance (SG) and State Government Tax Collections (TC):The target population consists of all 50 state governments. No local governments are included. For the purpose of Census Bureau statistics, the term "state government" refers not only to the executive, legislative, and judicial branches of a given state, but it also includes agencies, institutions, commissions, and public authorities that operate separately or somewhat autonomously from the central state government but where the state government maintains administrative or fiscal control over their activities as defined by the Census Bureau. Additional details are available in the survey methodology description.The Annual Survey of School System Finances (SS):The Annual Survey of School System Finances targets all public school systems providing elementary and/or secondary education in all 50 states and the District of Columbia..Methodology.Data Items and Other Identifying Records.Fall enrollmentTotal percentage distribution of revenuePercentage distribution of revenue - Revenue from federal sources - TotalPercentage distribution of revenue - Revenue from federal sources - Title IPercentage distribution of revenue - Revenue from state sources - TotalPercentage distribution of revenue - Revenue from state sources - General formula assistancePercentage distribution of revenue - Revenue from local sources - TotalPercentage distribution of revenue - Revenue from local sources - Taxes and parent government contributionsPercentage distribution of revenue - Revenue from local sources - Other local governmentsPercentage distribution of revenue - Revenue from local sources - Current chargesDefinitions can be found by clicking on the column header in the table or by accessing the Glossary.For detailed information, see Government Finance and Employment Classification Manual..Unit(s) of Observation.The basic reporting unit is the governmental unit, defined as an org...
N
Adams, OR Age Group Population Dataset: A Complete Breakdown of Adams Age...
neilsberg.com
csv, json
Updated Feb 22, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2025). Adams, OR Age Group Population Dataset: A Complete Breakdown of Adams Age Demographics from 0 to 85 Years and Over, Distributed Across 18 Age Groups // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/adams-or-population-by-age/
Explore at:
json, csvAvailable download formats
Dataset updated
Feb 22, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Adams, OR
Variables measured
Population Under 5 Years, Population over 85 years, Population Between 5 and 9 years, Population Between 10 and 14 years, Population Between 15 and 19 years, Population Between 20 and 24 years, Population Between 25 and 29 years, Population Between 30 and 34 years, Population Between 35 and 39 years, Population Between 40 and 44 years, and 9 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the age groups. For age groups we divided it into roughly a 5 year bucket for ages between 0 and 85. For over 85, we aggregated data into a single group for all ages. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the Adams population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Adams. The dataset can be utilized to understand the population distribution of Adams by age. For example, using this dataset, we can identify the largest age group in Adams.

Key observations

The largest age group in Adams, OR was for the group of age 45 to 49 years years with a population of 117 (32.50%), according to the ACS 2019-2023 5-Year Estimates. At the same time, the smallest age group in Adams, OR was the Under 5 years years with a population of 0 (0%). Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates

Age groups:

Under 5 years

5 to 9 years

10 to 14 years

15 to 19 years

20 to 24 years

25 to 29 years

30 to 34 years

35 to 39 years

40 to 44 years

45 to 49 years

50 to 54 years

55 to 59 years

60 to 64 years

65 to 69 years

70 to 74 years

75 to 79 years

80 to 84 years

85 years and over

Variables / Data Columns

Age Group: This column displays the age group in consideration

Population: The population for the specific age group in the Adams is shown in this column.

% of Total Population: This column displays the population of each age group as a proportion of Adams total population. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Adams Population by Age. You can refer the same here
N
Salem, OR Age Group Population Dataset: A complete breakdown of Salem age...
neilsberg.com
csv, json
Updated Sep 16, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2023). Salem, OR Age Group Population Dataset: A complete breakdown of Salem age demographics from 0 to 85 years, distributed across 18 age groups [Dataset]. https://www.neilsberg.com/research/datasets/712d8f4d-3d85-11ee-9abe-0aa64bf2eeb2/
Explore at:
csv, jsonAvailable download formats
Dataset updated
Sep 16, 2023
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Salem, Oregon
Variables measured
Population Under 5 Years, Population over 85 years, Population Between 5 and 9 years, Population Between 10 and 14 years, Population Between 15 and 19 years, Population Between 20 and 24 years, Population Between 25 and 29 years, Population Between 30 and 34 years, Population Between 35 and 39 years, Population Between 40 and 44 years, and 9 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the age groups. For age groups we divided it into roughly a 5 year bucket for ages between 0 and 85. For over 85, we aggregated data into a single group for all ages. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the Salem population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Salem. The dataset can be utilized to understand the population distribution of Salem by age. For example, using this dataset, we can identify the largest age group in Salem.

Key observations

The largest age group in Salem, OR was for the group of age 30-34 years with a population of 13,522 (7.76%), according to the 2021 American Community Survey. At the same time, the smallest age group in Salem, OR was the 85+ years with a population of 2,671 (1.53%). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

Age groups:

Under 5 years

5 to 9 years

10 to 14 years

15 to 19 years

20 to 24 years

25 to 29 years

30 to 34 years

35 to 39 years

40 to 44 years

45 to 49 years

50 to 54 years

55 to 59 years

60 to 64 years

65 to 69 years

70 to 74 years

75 to 79 years

80 to 84 years

85 years and over

Variables / Data Columns

Age Group: This column displays the age group in consideration

Population: The population for the specific age group in the Salem is shown in this column.

% of Total Population: This column displays the population of each age group as a proportion of Salem total population. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Salem Population by Age. You can refer the same here
n
Agriculture Sample Census Survey 2002-2003 - Tanzania
microdata.nbs.go.tz
Updated Mar 10, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Bureau of Statistics (2022). Agriculture Sample Census Survey 2002-2003 - Tanzania [Dataset]. https://microdata.nbs.go.tz/index.php/catalog/4
Explore at:
Dataset updated
Mar 10, 2022
Dataset provided by
Office of Chief Government Statistician-Zanzibar
National Bureau of Statistics
Time period covered
2004
Area covered
Tanzania
Description
Abstract

The 2003 Agriculture Sample Census was designed to meet the data needs of a wide range of users down to district level including policy makers at local, regional and national levels, rural development agencies, funding institutions, researchers, NGOs, farmer organisations, etc. As a result the dataset is both more numerous in its sample and detailed in its scope compared to previous censuses and surveys. To date this is the most detailed Agricultural Census carried out in Africa. The census was carried out in order to:

· Identify structural changes if any, in the size of farm household holdings, crop and livestock production, farm input and implement use. It also seeks to determine if there are any improvements in rural infrastructure and in the level of agriculture household living conditions; · Provide benchmark data on productivity, production and agricultural practices in relation to policies and interventions promoted by the Ministry of Agriculture and Food Security and other stake holders. · Establish baseline data for the measurement of the impact of high level objectives of the Agriculture Sector Development Programme (ASDP), National Strategy for Growth and Reduction of Poverty (NSGRP) and other rural development programs and projects. · Obtain benchmark data that will be used to address specific issues such as: food security, rural poverty, gender, agro-processing, marketing, service delivery, etc.

Geographic coverage

Tanzania Mainland and Zanzibar

Analysis unit

Household, Individual

Universe

Small scale farmers, Large Scale Farmers, Community Level

Kind of data

Sample survey data [ssd]

Sampling procedure

The Mainland sample consisted of 3,221 villages. These villages were drawn from the National Master Sample (NMS) developed by the National Bureau of Statistics (NBS) to serve as a national framework for the conduct of household based surveys in the country. The National Master Sample was developed from the 2002 Population and Housing Census. The total Mainland sample was 48,315 agricultural households. In Zanzibar a total of 317 EAs were selected and 4,755 agriculture households were covered. Nationwide, all regions and districts were sampled except three urban district (two from Mainland and one from Zanzibar). In both Mainland and Zanzibar, a stratified two stage sample was used. The number of villages/Enumeration Areas (EAs) were selected for the first stage with a probability proportional to the number of villages in each district. In the second stage, 15 households were selected from a list of farming households in each Village/EA using systematic random sampling. Table 1.1 gives the sample size of households, villages and districts for Tanzania Mainland and Zanzibar.

1.3.4 Questionnaire Design and Other Census Instruments The questionnaires were designed following user meetings to ensure that the questions asked were in line with users data needs. Several features were incorporated into the design of the questionnaires to increase the accuracy of the data: · Where feasible all variables were extensively coded to reduce post enumeration coding error. · The definitions for each section were printed on the opposite page so that the enumerator could easily refer to the instructions whilst interviewing the farmer. · The responses to all questions were placed in boxes printed on the questionnaire, with one box per character. This feature made it possible to use scanning and Intelligent Character Recognition (ICR) technologies for data entry. · Skip patterns were used to reduce unnecessary and incorrect coding of sections which do not apply to the respondent. · Each section was clearly numbered, which facilitated the use of skip patterns and provided a reference for data type coding for the programming of CSPro, SPSS and the dissemination applications.

Three other instruments were used: · Village Listing Forms were used for listing households in the village and from this list a systematic sample of 15 agricultural households were selected. · A Training Manual which was used by the trainers for the cascade/pyramid training of supervisors and enumerators · Enumerator Instruction Manual which was used as reference material.

1.3.5 Field Pre-testing of the Census Instruments The Questionnaire was pre-tested in five locations (Arusha, Dodoma, Tanga, Unguja and Pemba). This was done to test the wording, flow and relevance of the questions and to finalise crop lists, questionnaire coding and manuals. In addition to this, several data collection methodologies had to be finalised, namely, livestock numbers in pastoralist communities, cut flower production, mixed cropping, use of percentages in the questionnaire and finalising skip patterns and documenting consistency checks.

Mode of data collection

Face-to-face [f2f]

Research instrument

The census covered agriculture in detail as well as many other aspects of rural development and was conducted using three different questionnaires: · Small scale farm questionnaire · Community level questionnaire · Large scale farm questionnaire The small scale farm questionnaire was the main census instrument and includes questions related to crop and livestock production and practices; population demographics; access to services, resources and infrastructure; and issues on poverty, gender and subsistence versus profit making production units. The main topics covered were: · Household demographics and activities of the household members · Land access/ownership/tenure and use · Crop and livestock production and productivity · Access to inputs and farming implements · Access and use of credit · Access to infrastructure (roads, district and regional headquarters, markets, advisory services, schools, hospitals, veterinary clinics, etc...) · Crop marketing, storage and agro processing · Tree farming, agro-forestry and fish farming · Access and use of communal resources (grazing, communal forest, water for humans and livestock, beekeeping etc.) · Investment activities: Irrigation structures, water harvesting, erosion control, fencing, etc. · Off farm income and non agriculture related activities · Households living conditions (housing, sanitary facilities, etc.) · Labour use, livelihood constraints and subsistence versus non subsistence activities · Gender issues. The community level questionnaire was designed to collect village level data such as access and use of common resources, community tree plantations and seasonal farm gate prices.

The large scale farm questionnaire was administered to large farms which were either privately or corporately managed. Some data from the large scale farm questionnaire are incorporated in this report, however an in depth analysis of large scale farms is presented in a separate report.

Cleaning operations

Data processing consisted of the following processes: · Data entry · Data structure formatting · Batch validation · Tabulation

Data Entry Scanning and ICR data capture technology for the small holder questionnaire were used on the Mainland. This not only increased the speed of data entry, it also increased the accuracy due to the reduction of keystroke errors. Interactive validation routines were incorporated into the ICR software to track errors during the verification process. The scanning operation was so successful that it is highly recommended for adoption in future censuses/surveys. In Zanzibar all data was entered manually using CSPro.

Prior to scanning, all questionnaires underwent a manual cleaning exercise. This involved checking that the questionnaire had a full set of pages, correct identification and good handwriting. A score was given to each questionnaire based on the legibility and the completeness of enumeration. This score will be used to assess the quality of enumeration and supervision in order to select the best field staff for future censuses/surveys.

CSPro was used for data entry of all Large Scale Farm and community based questionnaires due to the relatively small number of questionnaires. It was also used to enter data from the 2,880 small holder questionnaires that were rejected by the ICR extraction application.

Data Structure Formatting A program was developed in visual basic to automatically alter the structure of the output from the scanning/extraction process in order to harmonise it with the manually entered data. The program automatically checked and changed the number of digits for each variable, the record type code, the number of questionnaires in the village, the consistency of the Village ID Code and saved the data of one village in a file named after the village code.

Batch Validation A batch validation program was developed in order to identify inconsistencies within a questionnaire. This is in addition to the interactive validation during the ICR extraction process. The procedures varied from simple range checking within each variable to the more complex checking between variables. It took six months to screen, edit and validate the data from the smallholder questionnaires. After the long process of data cleaning, tabulations were prepared based on a pre-designed tabulation plan.

Tabulations Statistical Package for Social Sciences (SPSS) was used to produce the Census tabulations and Microsoft Excel was used to organize the tables and compute additional indicators. Excel was also used to produce charts while ArcView and Freehand were used for the maps.

Analysis and Report Preparation The analysis in this report focuses on regional comparisons, time series and national production estimates. Microsoft Excel was
w
Living Standards Survey V 2005-2006 - World Bank SHIP Harmonized Dataset -...
microdata.worldbank.org
catalog.ihsn.org
Updated Dec 9, 2014
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ghana Statistical Service (GSS) (2014). Living Standards Survey V 2005-2006 - World Bank SHIP Harmonized Dataset - Ghana [Dataset]. https://microdata.worldbank.org/index.php/catalog/1064
Explore at:
Dataset updated
Dec 9, 2014
Dataset authored and provided by
Ghana Statistical Service (GSS)
Time period covered
2005 - 2006
Area covered
Ghana
Description
Abstract

Survey based Harmonized Indicators (SHIP) files are harmonized data files from household surveys that are conducted by countries in Africa. To ensure the quality and transparency of the data, it is critical to document the procedures of compiling consumption aggregation and other indicators so that the results can be duplicated with ease. This process enables consistency and continuity that make temporal and cross-country comparisons consistent and more reliable.

Four harmonized data files are prepared for each survey to generate a set of harmonized variables that have the same variable names. Invariably, in each survey, questions are asked in a slightly different way, which poses challenges on consistent definition of harmonized variables. The harmonized household survey data present the best available variables with harmonized definitions, but not identical variables. The four harmonized data files are

a) Individual level file (Labor force indicators in a separate file): This file has information on basic characteristics of individuals such as age and sex, literacy, education, health, anthropometry and child survival. b) Labor force file: This file has information on labor force including employment/unemployment, earnings, sectors of employment, etc. c) Household level file: This file has information on household expenditure, household head characteristics (age and sex, level of education, employment), housing amenities, assets, and access to infrastructure and services. d) Household Expenditure file: This file has consumption/expenditure aggregates by consumption groups according to Purpose (COICOP) of Household Consumption of the UN.

Geographic coverage

National

Analysis unit

Individual level for datasets with suffix _I and _L

Household level for datasets with suffix _H and _E

Universe

The survey covered all de jure household members (usual residents).

Kind of data

Sample survey data [ssd]

Sampling procedure

Sampling Frame and Units As in all probability sample surveys, it is important that each sampling unit in the surveyed population has a known, non-zero probability of selection. To achieve this, there has to be an appropriate list, or sampling frame of the primary sampling units (PSUs).The universe defined for the GLSS 5 is the population living within private households in Ghana. The institutional population (such as schools, hospitals etc), which represents a very small percentage in the 2000 Population and Housing Census (PHC), is excluded from the frame for the GLSS 5.

The Ghana Statistical Service (GSS) maintains a complete list of census EAs, together with their respective population and number of households as well as maps, with well defined boundaries, of the EAs. . This information was used as the sampling frame for the GLSS 5. Specifically, the EAs were defined as the primary sampling units (PSUs), while the households within each EA constituted the secondary sampling units (SSUs).

Stratification In order to take advantage of possible gains in precision and reliability of the survey estimates from stratification, the EAs were first stratified into the ten administrative regions. Within each region, the EAs were further sub-divided according to their rural and urban areas of location. The EAs were also classified according to ecological zones and inclusion of Accra (GAMA) so that the survey results could be presented according to the three ecological zones, namely 1) Coastal, 2) Forest, and 3) Northern Savannah, and for Accra.

Sample size and allocation The number and allocation of sample EAs for the GLSS 5 depend on the type of estimates to be obtained from the survey and the corresponding precision required. It was decided to select a total sample of around 8000 households nationwide.

To ensure adequate numbers of complete interviews that will allow for reliable estimates at the various domains of interest, the GLSS 5 sample was designed to ensure that at least 400 households were selected from each region.

A two-stage stratified random sampling design was adopted. Initially, a total sample of 550 EAs was considered at the first stage of sampling, followed by a fixed take of 15 households per EA. The distribution of the selected EAs into the ten regions or strata was based on proportionate allocation using the population.

For example, the number of selected EAs allocated to the Western Region was obtained as: 1924577/18912079*550 = 56

Under this sampling scheme, it was observed that the 400 households minimum requirement per region could be achieved in all the regions but not the Upper West Region. The proportionate allocation formula assigned only 17 EAs out of the 550 EAs nationwide and selecting 15 households per EA would have yielded only 255 households for the region. In order to surmount this problem, two options were considered: retaining the 17 EAs in the Upper West Region and increasing the number of selected households per EA from 15 to about 25, or increasing the number of selected EAs in the region from 17 to 27 and retaining the second stage sample of 15 households per EA.

The second option was adopted in view of the fact that it was more likely to provide smaller sampling errors for the separate domains of analysis. Based on this, the number of EAs in Upper East and the Upper West were adjusted from 27 and 17 to 40 and 34 respectively, bringing the total number of EAs to 580 and the number of households to 8,700.

A complete household listing exercise was carried out between May and June 2005 in all the selected EAs to provide the sampling frame for the second stage selection of households. At the second stage of sampling, a fixed number of 15 households per EA was selected in all the regions. In addition, five households per EA were selected as replacement samples.The overall sample size therefore came to 8,700 households nationwide.

Mode of data collection

Face-to-face [f2f]
2024 Public Sector: GS00EP01 | State and Local Government Employment and...
data.census.gov
Updated Mar 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ECN (2025). 2024 Public Sector: GS00EP01 | State and Local Government Employment and Payroll Data: U.S. and States: 2017 - 2024 (PUB Public Sector Annual Surveys and Census of Governments) [Dataset]. https://data.census.gov/table/GOVSTIMESERIES.GS00EP01
Explore at:
Dataset updated
Mar 27, 2025
Dataset provided by
United States Census Bureauhttp://census.gov/
Authors
ECN
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Time period covered
2024
Area covered
United States
Description
Key Table Information.Table Title.State and Local Government Employment and Payroll Data: U.S. and States: 2017 - 2024.Table ID.GOVSTIMESERIES.GS00EP01.Survey/Program.Public Sector.Year.2024.Dataset.PUB Public Sector Annual Surveys and Census of Governments.Source.U.S. Census Bureau, Public Sector.Release Date.2025-03-27.Release Schedule.The Annual Survey of Public Employment & Payroll occurs every year, except in Census years. Data are typically released yearly in the first quarter. There is approximately one year between the reference period and data release. Revisions to published data occur annually for the next two years. Census of Governments years, those ending in '2' and '7' may have slightly later releases due to extended processing time..Dataset Universe.Census of Governments - Organization (CG):The universe of this file is all federal, state, and local government units in the United States. In addition to the federal government and the 50 state governments, the Census Bureau recognizes five basic types of local governments. The government types are: County, Municipal, Township, Special District, and School District. Of these five types, three are categorized as General Purpose governments: County, municipal, and township governments are readily recognized and generally present no serious problem of classification. However, legislative provisions for school district and special district governments are diverse. These two types are categorized as Special Purpose governments. Numerous single-function and multiple-function districts, authorities, commissions, boards, and other entities, which have varying degrees of autonomy, exist in the United States. The basic pattern of these entities varies widely from state to state. Moreover, various classes of local governments within a particular state also differ in their characteristics. Refer to the Individual State Descriptions report for an overview of all government entities authorized by state.The Public Use File provides a listing of all independent government units, and dependent school districts active as of fiscal year ending June 30, 2024. The Annual Surveys of Public Employment & Payroll (EP) and State and Local Government Finances (LF):The target population consists of all 50 state governments, the District of Columbia, and a sample of local governmental units (counties, cities, townships, special districts, school districts). In years ending in '2' and '7' the entire universe is canvassed. In intervening years, a sample of the target population is surveyed. Additional details on sampling are available in the survey methodology descriptions for those years.The Annual Survey of Public Pensions (PP):The target population consists of state- and locally-administered defined benefit funds and systems of all 50 state governments, the District of Columbia, and a sample of local governmental units (counties, cities, townships, special districts, school districts). In years ending in '2' and '7' the entire universe is canvassed. In intervening years, a sample of the target population is surveyed. Additional details on sampling are available in the survey methodology descriptions for those years.The Annual Surveys of State Government Finance (SG) and State Government Tax Collections (TC):The target population consists of all 50 state governments. No local governments are included. For the purpose of Census Bureau statistics, the term "state government" refers not only to the executive, legislative, and judicial branches of a given state, but it also includes agencies, institutions, commissions, and public authorities that operate separately or somewhat autonomously from the central state government but where the state government maintains administrative or fiscal control over their activities as defined by the Census Bureau. Additional details are available in the survey methodology description.The Annual Survey of School System Finances (SS):The Annual Survey of School System Finances targets all public school systems providing elementary and/or secondary education in all 50 states and the District of Columbia..Methodology.Data Items and Other Identifying Records.Full-time and part-time employmentFull-time and part-time payrollPart-time hours worked (prior to 2019)Full-time equivalent employmentTotal full-time and part-time employmentTotal full-time and part-time payrollDefinitions can be found by clicking on the column header in the table or by accessing the Glossary.For detailed information, see Government Finance and Employment Classification Manual..Unit(s) of Observation.The basic reporting unit is the governmental unit, defined as an organized entity which in addition to having governmental character, has sufficient discretion in the management of its own affairs to distinguish it as separate from the administrative structure of any other governmental unit.The reporting units for the Annual Survey of School System Finances are public school sy...
Namibia Population and Housing Census 2011 - Namibia
microdata.nsanamibia.com
Updated Sep 30, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Namibia Statistics Agency (2024). Namibia Population and Housing Census 2011 - Namibia [Dataset]. https://microdata.nsanamibia.com/index.php/catalog/9
Explore at:
Dataset updated
Sep 30, 2024
Dataset authored and provided by
Namibia Statistics Agencyhttps://nsa.org.na/
Time period covered
2011
Area covered
Namibia
Description
Abstract

The 2011 Population and Housing Census is the third national Census to be conducted in Namibia after independence. The first was conducted 1991 followed by the 2001 Census. Namibia is therefore one of the countries in sub-Saharan Africa that has participated in the 2010 Round of Censuses and followed the international best practice of conducting decennial Censuses, each of which attempts to count and enumerate every person and household in a country every ten years. Surveys, by contrast, collect data from samples of people and/or households.

Censuses provide reliable and critical data on the socio-economic and demographic status of any country. In Namibia, Census data has provided crucial information for development planning and programme implementation. Specifically, the information has assisted in setting benchmarks, formulating policy and the evaluation and monitoring of national development programmes including NDP4, Vision 2030 and several sector programmes. The information has also been used to update the national sampling frame which is used to select samples for household-based surveys, including labour force surveys, demographic and health surveys, household income and expenditure surveys. In addition, Census information will be used to guide the demarcation of Namibia's administrative boundaries where necessary.

At the international level, Census information has been used extensively in monitoring progress towards Namibia's achievement of international targets, particularly the Millennium Development Goals (MDGs).

The latest and most comprehensive Census was conducted in August 2011. Preparations for the Census started in the 2007/2008 financial year under the auspices of the then Central Bureau of Statistics (CBS) which was later transformed into the Namibia Statistics Agency (NSA). The NSA was established under the Statistics Act No. 9 of 2011, with the legal mandate and authority to conduct population Censuses every 10 years. The Census was implemented in three broad phases; pre-enumeration, enumeration and post enumeration.

During the first pre-enumeration phase, activities accomplished including the preparation of a project document, establishing Census management and technical committees, and establishing the Census cartography unit which demarcated the Enumeration Areas (EAs). Other activities included the development of Census instruments and tools, such as the questionnaires, manuals and field control forms.

Field staff were recruited, trained and deployed during the initial stages of the enumeration phase. The actual enumeration exercise was undertaken over a period of about three weeks from 28 August to 15 September 2011, while 28 August 2011 was marked as the reference period or 'Census Day'.

Great efforts were made to check and ensure that the Census data was of high quality to enhance its credibility and increase its usage. Various quality controls were implemented to ensure relevance, timeliness, accuracy, coherence and proper data interpretation. Other activities undertaken to enhance quality included the demarcation of the country into small enumeration areas to ensure comprehensive coverage; the development of structured Census questionnaires after consultat.The post-enumeration phase started with the sending of completed questionnaires to Head Office and the preparation of summaries for the preliminary report, which was published in April 2012. Processing of the Census data began with manual editing and coding, which focused on the household identification section and un-coded parts of the questionnaire. This was followed by the capturing of data through scanning. Finally, the data were verified and errors corrected where necessary. This took longer than planned due to inadequate technical skills.

Geographic coverage

National coverage

Analysis unit

Households and persons

Universe

The sampling universe is defined as all households (private and institutions) from 2011 Census dataset.

Kind of data

Census/enumeration data [cen]

Sampling procedure

Sample Design

The stratified random sample was applied on the constituency and urban/rural variables of households list from Namibia 2011 Population and Housing Census for the Public Use Microdata Sample (PUMS) file. The sampling universe is defined as all households (private and institutions) from 2011 Census dataset. Since urban and rural are very important factor in the Namibia situation, it was then decided to take the stratum at the constituency and urban/rural levels. Some constituencies have very lower households in the urban or rural, the office therefore decided for a threshold (low boundary) for sampling within stratum. Based on data analysis, the threshold for stratum of PUMS file is 250 households. Thus, constituency and urban/rural areas with less than 250 households in total were included in the PUMS file. Otherwise, a simple random sampling (SRS) at a 20% sample rate was applied for each stratum. The sampled households include 93,674 housing units and 418,362 people.

Sample Selection

The PUMS sample is selected from households. The PUMS sample of persons in households is selected by keeping all persons in PUMS households. Sample selection process is performed using Census and Survey Processing System (CSPro).

The sample selection program first identifies the 7 census strata with less than 250 households and the households (private and institutions) with more than 50 people. The households in these areas and with this large size are all included in the sample. For the other households, the program randomly generates a number n from 0 to 4. Out of every 5 households, the program selects the nth household to export to the PUMS data file, creating a 20 percent sample of households. Private households and institutions are equally sampled in the PUMS data file.

Note: The 7 census strata with less than 250 households are: Arandis Constituency Rural, Rehoboth East Urban Constituency Rural, Walvis Bay Rural Constituency Rural, Mpungu Constituency Urban, Etayi Constituency Urban, Kalahari Constituency Urban, and Ondobe Constituency Urban.

Mode of data collection

Face-to-face [f2f]

Research instrument

The following questionnaire instruments were used for the Namibia 2011 Population and and Housing Census:

Form A (Long Form): For conventional households and residential institutions

Form B1 (Short Form): For special population groups such as persons in transit (travellers), police cells, homeless and off-shore populations

Form B2 (Short Form): For hotels/guesthouses

Form B3 (Short Form): For foreign missions/diplomatic corps

Cleaning operations

Data editing took place at a number of stages throughout the processing, including: a) During data collection in the field b) Manual editing and coding in the office c) During data entry (Primary validation/editing) Structure checking and completeness using Structured Query Language (SQL) program d) Secondary editing: i. Imputations of variables ii. Structural checking in Census and Survey Processing System (CSPro) program

Sampling error estimates

Sampling Error The standard errors of survey estimates are needed to evaluate the precision of the survey estimation. The statistical software package such as SPSS or SAS can accurately estimate the mean and variance of estimates from the survey. SPSS or SAS software package makes use of the Taylor series approach in computing the variance.

Data appraisal

Data quality Great efforts were made to check and ensure that the Census data was of high quality to enhance its credibility and increase its usage. Various quality controls were implemented to ensure relevance, timeliness, accuracy, coherence and proper data interpretation. Other activities undertaken to enhance quality included the demarcation of the country into small enumeration areas to ensure comprehensive coverage; the development of structured Census questionnaires after consultation with government ministries, university expertise and international partners; the preparation of detailed supervisors' and enumerators' instruction manuals to guide field staff during enumeration; the undertaking of comprehensive publicity and advocacy programmes to ensure full Government support and cooperation from the general public; the testing of questionnaires and other procedures; the provision of adequate training and undertaking of intensive supervision using four supervisory layers; the editing of questionnaires at field level; establishing proper mechanisms which ensured that all completed questionnaires were properly accounted for; ensuring intensive verification, validating all information and error corrections; and developing capacity in data processing with support from the international community.

Facebook

Twitter

Click to copy link

Link copied

Cite

U.S. Census Bureau (2023). 2012-2016 American Community Survey: 5-Year Estimates - Public Use Microdata Sample [Dataset]. https://catalog.data.gov/dataset/2012-2016-american-community-survey-5-year-estimates-public-use-microdata-sample

2012-2016 American Community Survey: 5-Year Estimates - Public Use Microdata Sample

Explore at:

Dataset updated

Jul 19, 2023

Dataset provided by

United States Census Bureauhttp://census.gov/

Description

The American Community Survey (ACS) Public Use Microdata Sample (PUMS) contains a sample of responses to the ACS. The ACS PUMS dataset includes variables for nearly every question on the survey, as well as many new variables that were derived after the fact from multiple survey responses (such as poverty status).Each record in the file represents a single person, or, in the household-level dataset, a single housing unit. In the person-level file, individuals are organized into households, making possible the study of people within the contexts of their families and other household members. Individuals living in Group Quarters, such as nursing facilities or college facilities, are also included on the person file. ACS PUMS data are available at the nation, state, and Public Use Microdata Area (PUMA) levels. PUMAs are special non-overlapping areas that partition each state into contiguous geographic units containing roughly 100,000 people each. ACS PUMS files for an individual year, such as 2019, contain data on approximately one percent of the United States population.

Clear search

Close search

Google apps

Main menu

2012-2016 American Community Survey: 5-Year Estimates - Public Use Microdata...

Survey of Income and Program Participation (SIPP)

Census Tract Top 50 American Community Survey Data

ACS 5-Year Social Characteristics DC Census Tract

Maryland American Community Survey - ACS Census Tracts

Demographic and Health Survey 2017 - Indonesia

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

Sampling error estimates

Data appraisal

2018 American Community Survey: 1-Year Estimates - Public Use Microdata...

Agriculture Sample Census Survey 2007/08 - Tanzania

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

HSRC Master Sample II - Dataset - B2FIND

Sweet Home, OR Age Group Population Dataset: A complete breakdown of Sweet...

About this dataset

Content

Inspiration

Recommended for further research

American Community Survey

Pew Survey on Israel's Religiously Divided Society Data Set

Synthetic Data for an Imaginary Country, Sample, 2023 - World

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

2024 Public Sector: GS00SS14 | Percentage Distribution of Revenue of Public...

Adams, OR Age Group Population Dataset: A Complete Breakdown of Adams Age...

About this dataset

Content

Inspiration

Recommended for further research

Salem, OR Age Group Population Dataset: A complete breakdown of Salem age...

About this dataset

Content

Inspiration

Recommended for further research

Agriculture Sample Census Survey 2002-2003 - Tanzania

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Living Standards Survey V 2005-2006 - World Bank SHIP Harmonized Dataset -...

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

2024 Public Sector: GS00EP01 | State and Local Government Employment and...

Namibia Population and Housing Census 2011 - Namibia

Abstract

2012-2016 American Community Survey: 5-Year Estimates - Public Use Microdata Sample