Pursuant to Local Laws 126, 127, and 128 of 2016, certain demographic data is collected voluntarily and anonymously from persons voluntarily seeking social services. This data can be used by agencies and the public to better understand the demographic makeup of client populations and to better serve residents of all backgrounds and identities. The data presented here has been collected through either electronic forms or paper surveys offered at the point of application for services. These surveys are anonymous. Each record represents an anonymized demographic profile of an individual applicant for social services, disaggregated by response option, agency, and program. Response options include information regarding ancestry, race, primary and secondary languages, English proficiency, gender identity, and sexual orientation.
Idiosyncrasies or Limitations: Note that while the dataset contains the total number of individuals who have identified their ancestry or languages spoken, because such data is collected anonymously, there may be instances of a single individual completing multiple voluntary surveys. Additionally, the survey being both voluntary and anonymous has advantages as well as disadvantages: it increases the likelihood of full and honest answers, but since it is not connected to the individual case, it does not directly inform delivery of services to the applicant. The paper and online versions of the survey ask the same questions, but free-form text is handled differently. Free-form text fields are expected to be entered in English although the form is available in several languages. Surveys are presented in 11 languages.
Paper Surveys
1. Are optional
2. Survey taker is expected to specify the agency that provides service
3. Survey taker can skip or elect not to answer questions
4. Invalid/unreadable data may be entered for survey date, or date may be skipped
5. OCRing of free-form text fields may fail
6. Analytical value of free-form text answers is unclear
Online Survey
1. Is optional
2. Agency is defaulted based on the URL
3. Some questions must be answered
4. Date of survey is automated
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Open Science in (Higher) Education – data of the February 2017 survey
This data set contains:
Full raw (anonymised) data set (completed responses) of Open Science in (Higher) Education February 2017 survey. Data are in xlsx and sav format.
Survey questionnaires with variables and settings (German original and English translation) in pdf. The English questionnaire was not used in the February 2017 survey, but only serves as translation.
Readme file (txt)
Survey structure
The survey includes 24 questions, and its structure can be separated into five major themes: material used in courses (5), OER awareness, usage and development (6), collaborative tools used in courses (2), assessment and participation options (5), demographics (4). The last two questions are an open text question about general issues on the topics and singular open education experiences, and a request for the respondent's e-mail address for follow-up questions. The online survey was created with Limesurvey[1]. Several questions include filters, i.e. these questions were only shown if a participant chose a specific answer beforehand ([n/a] in Excel file, [.] in SPSS).
Demographic questions
Demographic questions asked about the current position, the discipline, birth year and gender. The classification of research disciplines was adapted to general disciplines at German higher education institutions. As we wanted to have a broad classification, we summarised several disciplines and came up with the following list, including the option "other" for respondents who do not feel confident with the proposed classification:
Natural Sciences
Arts and Humanities or Social Sciences
Economics
Law
Medicine
Computer Sciences, Engineering, Technics
Other
The current job position classification was also chosen according to common positions in Germany, including positions with a teaching responsibility at higher education institutions. Here, we also included the option "other" for respondents who do not feel confident with the proposed classification:
Professor
Special education teacher
Academic/scientific assistant or research fellow (research and teaching)
Academic staff (teaching)
Student assistant
Other
We chose a free-text (numerical) field for the respondent's year of birth because we did not want to pre-classify respondents into age intervals. This leaves us the option of running different analyses on answers and possible correlations with the respondents' age. Asking about the country was left out as the survey was designed for academics in Germany.
Remark on OER question
Data from earlier surveys revealed that academics are confused about the proper definition of OER[2]. Some seem to understand OER as free resources, or only refer to open source software (Allen & Seaman, 2016, p. 11). Allen and Seaman (2016) decided to give a broad explanation of OER, avoiding details to not tempt the participant to claim "aware". Thus, there is a danger of introducing a bias when giving an explanation. We decided not to give an explanation, but to keep this question simple. We assume that either someone knows about OER or not. If they had not heard of the term before, they probably do not use OER (at least not consciously) or create them.
Data collection
The target group of the survey was academics at German institutions of higher education, mainly universities and universities of applied sciences. To reach them we sent the survey to diverse internal and external institutional mailing lists and via personal contacts. Included lists were discipline-based lists, lists deriving from higher education and higher education didactic communities as well as lists from open science and OER communities. Additionally, personal e-mails were sent to presidents and contact persons from those communities, and Twitter was used to spread the survey.
The survey was online from February 6th to March 3rd, 2017; e-mails were mainly sent at the beginning and around mid-term.
Data clearance
We got 360 responses, of which Limesurvey counted 208 as complete and 152 as incomplete. Two responses were marked as incomplete but, after checking, turned out to be complete, and we added them to the complete responses dataset. Thus, this data set includes 210 complete responses. Of the remaining 150 incomplete responses, 58 respondents did not answer the 1st question and 40 respondents discontinued after the 1st question. The data show a steady decline in responses over the course of the survey; we did not detect any single survey question with a strikingly high dropout rate. We deleted incomplete responses and they are not in this data set.
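The counts in the data clearance paragraph can be cross-checked with a few lines of arithmetic. This is only a sketch: the variable names are illustrative, and the figure for later dropouts assumes that the 58 and 40 respondents mentioned above are disjoint groups.

```python
# Counts reported in the data clearance paragraph.
total_responses = 360
limesurvey_complete = 208
limesurvey_incomplete = 152
reclassified = 2  # marked incomplete, but found to be complete on inspection

complete = limesurvey_complete + reclassified
incomplete = limesurvey_incomplete - reclassified

# Reclassifying responses must not change the overall total.
assert complete + incomplete == total_responses

# Assumption: these two groups do not overlap.
no_first_answer = 58
stopped_after_first = 40
dropped_later = incomplete - no_first_answer - stopped_after_first

print(complete, incomplete, dropped_later)
```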
For data privacy reasons, we deleted seven variables automatically assigned by Limesurvey: submitdate, lastpage, startlanguage, startdate, datestamp, ipaddr, refurl. We also deleted answers to question No 24 (email address).
References
Allen, E., & Seaman, J. (2016). Opening the Textbook: Educational Resources in U.S. Higher Education, 2015-16.
First results of the survey are presented in the poster:
Heck, Tamara, Blümel, Ina, Heller, Lambert, Mazarakis, Athanasios, Peters, Isabella, Scherp, Ansgar, & Weisel, Luzian. (2017). Survey: Open Science in Higher Education. Zenodo. http://doi.org/10.5281/zenodo.400561
Contact:
Open Science in (Higher) Education working group, see http://www.leibniz-science20.de/forschung/projekte/laufende-projekte/open-science-in-higher-education/.
[1] https://www.limesurvey.org
[2] The survey question about the awareness of OER gave a broad explanation, avoiding details to not tempt the participant to claim "aware".
https://www.icpsr.umich.edu/web/ICPSR/studies/29646/terms
This data collection is comprised of responses from the March and April installments of the 2008 Current Population Survey (CPS). Both the March and April surveys used two sets of questions, the basic CPS and a separate supplement for each month.
The CPS, administered monthly, is a labor force survey providing current estimates of the economic status and activities of the population of the United States. Specifically, the CPS provides estimates of total employment (both farm and nonfarm), nonfarm self-employed persons, domestics, and unpaid helpers in nonfarm family enterprises, wage and salaried employees, and estimates of total unemployment.
In addition to the basic CPS questions, respondents were asked questions from the March supplement, known as the Annual Social and Economic (ASEC) supplement. The ASEC provides supplemental data on work experience, income, noncash benefits, and migration. Comprehensive work experience information was given on the employment status, occupation, and industry of persons 15 years old and older. Additional data for persons 15 years old and older are available concerning weeks worked and hours per week worked, reason not working full time, total income and income components, and place of residence on March 1, 2007. The March supplement also contains data covering nine noncash income sources: food stamps, school lunch program, employer-provided group health insurance plan, employer-provided pension plan, personal health insurance, Medicaid, Medicare, CHAMPUS or military health care, and energy assistance. Questions covering training and assistance received under welfare reform programs, such as job readiness training, child care services, or job skill training were also asked in the March supplement.
The April supplement, sponsored by the Department of Health and Human Services, queried respondents on the economic situation of persons and families for the previous year. Moreover, all household members 15 years of age and older that are a biological parent of children in the household that have an absent parent were asked detailed questions about child support and alimony. Information regarding child support was collected to determine the size and distribution of the population with children affected by divorce or separation, or other relationship status change. Moreover, the data were collected to better understand the characteristics of persons requiring child support, and to help develop and maintain programs designed to assist in obtaining child support. These data highlight alimony and child support arrangements made at the time of separation or divorce, amount of payments actually received, and value and type of any property settlement.
The April supplement data were matched to March supplement data for households that were in the sample in both March and April 2008. In March 2008, there were 4,522 household members eligible, of which 1,431 required imputation of child support data. When matching the March 2008 and April 2008 data sets, there were 170 eligible people on the March file that did not match to people on the April file. Child support data for these 170 people were imputed. The remaining 1,261 imputed cases were due to nonresponse to the child support questions. Demographic variables include age, sex, race, Hispanic origin, marital status, veteran status, educational attainment, occupation, and income. Data on employment and income refer to the preceding year, although other demographic data refer to the time at which the survey was administered.
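The two routes to imputation described above (no April match, or nonresponse to the child support questions) can be illustrated with a small sketch. The record layout, key fields, and field names below are hypothetical and are not the actual CPS variable names; only the counts in the final check come from the description.

```python
# Hypothetical toy records; keys and field names are illustrative only.
march = [
    {"household_id": "H1", "person_line": 1},
    {"household_id": "H1", "person_line": 2},
    {"household_id": "H2", "person_line": 1},
]
april = {
    ("H1", 1): {"child_support_received": 2400},
    ("H2", 1): {"child_support_received": None},  # question left unanswered
}

def impute_reason(person):
    """Return why child support data would need imputation, or None."""
    key = (person["household_id"], person["person_line"])
    record = april.get(key)
    if record is None:
        return "no April match"
    if record["child_support_received"] is None:
        return "item nonresponse"
    return None

reasons = [impute_reason(p) for p in march]
print(reasons)

# Counts reported in the description: unmatched + nonresponse = all imputed.
unmatched, nonresponse, imputed_total = 170, 1261, 1431
assert unmatched + nonresponse == imputed_total
```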
analyze the current population survey (cps) annual social and economic supplement (asec) with r

the annual march cps-asec has been supplying the statistics for the census bureau's report on income, poverty, and health insurance coverage since 1948. wow. the us census bureau and the bureau of labor statistics (bls) tag-team on this one. until the american community survey (acs) hit the scene in the early aughts (2000s), the current population survey had the largest sample size of all the annual general demographic data sets outside of the decennial census - about two hundred thousand respondents. this provides enough sample to conduct state- and a few large metro area-level analyses. your sample size will vanish if you start investigating subgroups by state - consider pooling multiple years. county-level is a no-no.

despite the american community survey's larger size, the cps-asec contains many more variables related to employment, sources of income, and insurance - and can be trended back to harry truman's presidency. aside from questions specifically asked about an annual experience (like income), many of the questions in this march data set should be treated as point-in-time statistics. cps-asec generalizes to the united states non-institutional, non-active duty military population.

the national bureau of economic research (nber) provides sas, spss, and stata importation scripts to create a rectangular file (rectangular data means only person-level records; household- and family-level information gets attached to each person). to import these files into r, the parse.SAScii function uses nber's sas code to determine how to import the fixed-width file, then RSQLite to put everything into a schnazzy database. you can try reading through the nber march 2012 sas importation code yourself, but it's a bit of a proc freak show.

this new github repository contains three scripts:

2005-2012 asec - download all microdata.R
- download the fixed-width file containing household, family, and person records
- import by separating this file into three tables, then merge 'em together at the person-level
- download the fixed-width file containing the person-level replicate weights
- merge the rectangular person-level file with the replicate weights, then store it in a sql database
- create a new variable - one - in the data table

2012 asec - analysis examples.R
- connect to the sql database created by the 'download all microdata' program
- create the complex sample survey object, using the replicate weights
- perform a boatload of analysis examples

replicate census estimates - 2011.R
- connect to the sql database created by the 'download all microdata' program
- create the complex sample survey object, using the replicate weights
- match the sas output shown in the png file below

2011 asec replicate weight sas output.png
- statistic and standard error generated from the replicate-weighted example sas script contained in this census-provided person replicate weights usage instructions document.

for more detail about the current population survey - annual social and economic supplement (cps-asec), visit:
- the census bureau's current population survey page
- the bureau of labor statistics' current population survey page
- the current population survey's wikipedia article

notes:

interviews are conducted in march about experiences during the previous year. the file labeled 2012 includes information (income, work experience, health insurance) pertaining to 2011. when you use the current population survey to talk about america, subtract a year from the data file name.

as of the 2010 file (the interview focusing on america during 2009), the cps-asec contains exciting new medical out-of-pocket spending variables most useful for supplemental (medical spending-adjusted) poverty research.
confidential to sas, spss, stata, sudaan users: why are you still rubbing two sticks together after we've invented the butane lighter? time to transition to r. :D
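The scripts described above build a complex sample survey object from replicate weights in R. As a rough illustration of the idea, in Python rather than R, the sketch below computes a weighted mean and a replicate-weight standard error on toy data. The scaling factor is an assumption: a factor of 4 divided by the number of replicates is what I understand the census-provided usage instructions to prescribe for the ASEC replicate weights, so check that document before relying on it. Function names and data are hypothetical.

```python
import math

def weighted_mean(values, weights):
    """Weighted mean of values."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)

def replicate_se(values, full_weights, replicate_weights, scale):
    """Return (estimate, standard error) from replicate weights.

    variance = scale * sum over replicates of (theta_r - theta_0) ** 2
    `scale` depends on the replication method and is an assumption here.
    """
    theta0 = weighted_mean(values, full_weights)
    squared_devs = sum(
        (weighted_mean(values, rw) - theta0) ** 2 for rw in replicate_weights
    )
    return theta0, math.sqrt(scale * squared_devs)

# Toy data: three people and four sets of replicate weights.
values = [10, 20, 30]
full_weights = [1, 1, 1]
replicate_weights = [[2, 1, 1], [1, 2, 1], [1, 1, 2], [1, 1, 1]]

# Assumed scaling: 4 / number of replicates.
scale = 4 / len(replicate_weights)
estimate, se = replicate_se(values, full_weights, replicate_weights, scale)
print(estimate, se)
```

In R, the survey package handles this bookkeeping once the replicate-weight design is declared, which is why the scripts only need to create the survey object.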
The Gallup Poll Social Series (GPSS) is a set of public opinion surveys designed to monitor U.S. adults' views on numerous social, economic, and political topics. The topics are arranged thematically across 12 surveys. Gallup administers these surveys during the same month every year and includes the survey's core trend questions in the same order each administration. Using this consistent standard allows for unprecedented analysis of changes in trend data that are not susceptible to question order bias and seasonal effects.
Introduced in 2001, the GPSS is the primary method Gallup uses to update several hundred long-term Gallup trend questions, some dating back to the 1930s. The series also includes many newer questions added to address contemporary issues as they emerge.
The dataset currently includes responses through 2025.
Gallup conducts one GPSS survey per month, with each devoted to a different topic, as follows:
January: Mood of the Nation
February: World Affairs
March: Environment
April: Economy and Finance
May: Values and Beliefs
June: Minority Rights and Relations (discontinued after 2016)
July: Consumption Habits
August: Work and Education
September: Governance
October: Crime
November: Health
December: Lifestyle (conducted 2001-2008)
The core questions of the surveys differ each month, but several questions assessing the state of the nation are standard on all 12: presidential job approval, congressional job approval, satisfaction with the direction of the U.S., assessment of the U.S. job market, and an open-ended measurement of the nation's "most important problem." Additionally, Gallup includes extensive demographic questions on each survey, allowing for in-depth analysis of trends.
Interviews are conducted with U.S. adults aged 18 and older living in all 50 states and the District of Columbia using a dual-frame design, which includes both landline and cellphone numbers. Gallup samples landline and cellphone numbers using random-digit-dial methods. Gallup purchases samples for this study from Survey Sampling International (SSI). Gallup chooses landline respondents at random within each household based on which member had the next birthday. Each sample of national adults includes a minimum quota of 70% cellphone respondents and 30% landline respondents, with additional minimum quotas by time zone within region. Gallup conducts interviews in Spanish for respondents who are primarily Spanish-speaking.
Gallup interviews a minimum of 1,000 U.S. adults aged 18 and older for each GPSS survey. Samples for the June Minority Rights and Relations survey are significantly larger because Gallup includes oversamples of Blacks and Hispanics to allow for reliable estimates among these key subgroups.
Gallup weights samples to correct for unequal selection probability, nonresponse, and double coverage of landline and cellphone users in the two sampling frames. Gallup also weights its final samples to match the U.S. population according to gender, age, race, Hispanic ethnicity, education, region, population density, and phone status (cellphone only, landline only, both, and cellphone mostly).
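To show what matching a sample to population targets means, here is a minimal single-variable post-stratification sketch. It is a simplified illustration and not Gallup's procedure, which adjusts for several variables at once; the categories, target shares, and function name are hypothetical.

```python
from collections import Counter

def poststratify(categories, target_shares):
    """Return one weight per respondent so that the weighted category
    shares equal target_shares (single variable only)."""
    counts = Counter(categories)
    n = len(categories)
    # weight = population share / sample share for the respondent's category
    return [target_shares[c] / (counts[c] / n) for c in categories]

# Toy sample in which group "F" is overrepresented.
sample = ["F", "F", "F", "M"]
targets = {"F": 0.5, "M": 0.5}  # hypothetical population shares

weights = poststratify(sample, targets)

# Weighted share of "F" after adjustment.
weighted_f_share = sum(w for c, w in zip(sample, weights) if c == "F") / sum(weights)
print(weights, weighted_f_share)
```

Respondents in overrepresented groups receive weights below 1 and those in underrepresented groups receive weights above 1, while the weights still sum to the sample size.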
Demographic weighting targets are based on the most recent Current Population Survey figures for the U.S. population aged 18 and older. Phone status targets are based on the most recent National Health Interview Survey. Population density targets are based on the most recent U.S. Census.
The year appended to each table name represents when the data was last updated. For example, January: Mood of the Nation - 2025 has survey data collected up to and including 2025.
For more information about what survey questions were asked over time, see the Supporting Files.
A random sample of households was invited to participate in this survey. In the dataset, each row holds one respondent's data and each column holds one question. The numbers represent a scale option from the survey, such as 1=Excellent, 2=Good, 3=Fair, 4=Poor. The question stem, response option, and scale information for each field can be found in the "variable labels" and "value labels" sheets. VERY IMPORTANT NOTE: The scientific survey data were weighted, meaning that the demographic profile of respondents was compared to the demographic profile of adults in Bloomington from US Census data. Statistical adjustments were made to bring the respondent profile into balance with the population profile. This means that some records were given more "weight" and some records were given less weight. The weights that were applied are found in the field "wt". If you do not apply these weights, you will not obtain the same results as can be found in the report delivered to Bloomington. The easiest way to replicate these results is likely to create pivot tables, and use the sum of the "wt" field rather than a count of responses.
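The pivot-table advice above amounts to summing "wt" within each response option instead of counting rows. A minimal sketch, assuming a hypothetical question column named "q1" and made-up rows (only the "wt" field name comes from the description):

```python
from collections import defaultdict

# Made-up respondent rows; "q1" is a hypothetical question column.
rows = [
    {"q1": 1, "wt": 0.5},
    {"q1": 2, "wt": 1.5},
    {"q1": 2, "wt": 1.0},
    {"q1": 4, "wt": 1.0},
]

def weighted_shares(rows, question, weight_field="wt"):
    """Share of each response option, using the sum of weights."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[question]] += row[weight_field]
    grand_total = sum(totals.values())
    return {option: total / grand_total for option, total in totals.items()}

def unweighted_shares(rows, question):
    """Share of each response option, using a plain count of rows."""
    counts = defaultdict(int)
    for row in rows:
        counts[row[question]] += 1
    return {option: count / len(rows) for option, count in counts.items()}

weighted = weighted_shares(rows, "q1")
unweighted = unweighted_shares(rows, "q1")
print(weighted)
print(unweighted)
```

In this toy data the share answering 1 (Excellent) is 25% by count but 12.5% once weights are applied, which is the kind of gap that appears when weights are ignored.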
The National Health and Nutrition Examination Surveys (NHANES) is a program of studies designed to assess the health and nutritional status of adults and children in the United States. The NHANES combines personal interviews and physical examinations, which focus on different population groups or health topics. These surveys were conducted by the National Center for Health Statistics (NCHS) on a periodic basis from 1971 to 1994. In 1999, the NHANES became a continuous program with a changing focus on a variety of health and nutrition measurements designed to meet current and emerging concerns. The sample for the survey is selected to represent the U.S. population of all ages. Many of the NHANES 2007-2008 questions also were asked in NHANES II 1976-1980, Hispanic NHANES 1982-1984, NHANES III 1988-1994, and NHANES 1999-2006. New questions were added to the survey based on recommendations from survey collaborators, NCHS staff, and other interagency work groups. Estimates for previously undiagnosed conditions, as well as those known to and reported by survey respondents, are produced through the survey. In the 2003-2004 wave, the NHANES includes more than 100 datasets. Most have been combined into three datasets for convenience. Each starts with the Demographic dataset and includes datasets of a specific type.
1. National Health and Nutrition Examination Survey (NHANES), Demographic & Examination Data, 2003-2004 (the base of the Demographic dataset + all data from medical examinations).
2. National Health and Nutrition Examination Survey (NHANES), Demographic & Laboratory Data, 2003-2004 (the base of the Demographic dataset + all data from medical laboratories).
3. National Health and Nutrition Examination Survey (NHANES), Demographic & Questionnaire Data, 2003-2004 (the base of the Demographic dataset + all data from questionnaires).
Variable SEQN is included for merging files within the waves. All data files should be sorted by SEQN.
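Merging on SEQN can be sketched as follows. Only the SEQN key comes from the description; the other column names and all values are made up, and the function is an illustrative left join that keeps every demographic record.

```python
# Made-up records; only the SEQN key name comes from the description.
demographic = [
    {"SEQN": 21007, "age": 34},
    {"SEQN": 21005, "age": 19},
]
examination = [
    {"SEQN": 21005, "height_cm": 170.2},
]

def merge_on_seqn(base, other):
    """Left-join `other` onto `base` by SEQN, returning rows sorted by SEQN."""
    lookup = {row["SEQN"]: row for row in other}
    merged = []
    for row in sorted(base, key=lambda r: r["SEQN"]):
        combined = dict(row)
        # Respondents without a matching record keep only their base fields.
        combined.update(lookup.get(row["SEQN"], {}))
        merged.append(combined)
    return merged

merged = merge_on_seqn(demographic, examination)
print(merged)
```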
Additional details of the design and content of each survey are available at the NHANES website.
Abstract copyright UK Data Service and data collection copyright owner.
The Online Time Use Survey (OTUS) was developed by the Office for National Statistics to help improve the measurement of unpaid household production and caring activities that are not captured within traditional economic measures, and to understand better time use from a well-being and quality of life perspective. The survey collects information from adults aged 18 years and over who are randomly sampled from the NatCen Opinion Panel, which is representative of the UK population. Data collected between March 2020 and March 2021 covers Great Britain and data collected from March 2022 onwards covers the United Kingdom. Participants were issued with two pre-allocated diary days (one on a weekday and one on a weekend day). They were asked to record their main activities (in 10-minute intervals) and up to five secondary activities (in five-minute intervals) in every 24 hours within an online diary tool. Respondents were able to select activities from a pre-defined list. They were also asked to rate how much they enjoyed different activities. In addition, respondents were asked to complete a demographic questionnaire which records personal and household characteristics.
Latest edition information
For the third edition (August 2024), data and documentation for Wave 8 (9 to 17 March 2024) were added to the study.
Main Topics: The annual data files include the following variables:
- main activities (in 10-minute periods)
- up to five secondary activities (in five-minute periods)
- count of all 5-minute primary activities
- total in minutes of all primary activities
- count of all 5-minute secondary activities
- total in minutes of all secondary activities
- enjoyment level (scale 1-7) for all primary activities (in 10-minute periods)
- enjoyment level (scale 1-7) for all secondary activities (in five-minute periods)
- basic demographics, including personal well-being rating variables.
The Indonesia Demographic and Health Survey (IDHS) is part of the worldwide Demographic and Health Surveys program, which is designed to collect data on fertility, family planning, and maternal and child health. The 2002-2003 IDHS follows a sequence of several previous surveys: the 1987 National Indonesia Contraceptive Prevalence Survey (NICPS), the 1991 IDHS, the 1994 IDHS, and the 1997 IDHS. The 2002-2003 IDHS expands on the 1997 IDHS by collecting information on the participation of currently married men and their wives and children in health care.
The main objective of the 2002-2003 IDHS is to provide policymakers and program managers in population and health with detailed information on population, family planning, and health. In particular, the 2002-2003 IDHS collected information on the female respondents’ socioeconomic background, fertility levels, marriage and sexual activity, fertility preferences, knowledge and use of family planning methods, breastfeeding practices, childhood and adult mortality including maternal mortality, maternal and child health, and awareness and behavior regarding AIDS and other sexually transmitted infections in Indonesia.
The 2002-2003 IDHS was specifically designed to meet the following objectives:
- Provide data concerning fertility, family planning, maternal and child health, maternal mortality, and awareness of AIDS/STIs to program managers, policymakers, and researchers to help them evaluate and improve existing programs
- Measure trends in fertility and contraceptive prevalence rates, and analyze factors that affect such changes, such as marital status and patterns, residence, education, breastfeeding habits, and knowledge, use, and availability of contraception
- Evaluate achievement of goals previously set by the national health programs, with special focus on maternal and child health
- Assess the participation in and utilization of health services by men and their families
- Assist in creating an international database that allows cross-country comparisons that can be used by the program managers, policymakers, and researchers in the area of family planning, fertility, and health in general.
National
Sample survey data
SAMPLE DESIGN AND IMPLEMENTATION
Administratively, Indonesia is divided into 30 provinces. Each province is subdivided into districts (regencies in mostly rural areas and municipalities in urban areas). Districts are subdivided into subdistricts, and each subdistrict is divided into villages. Each village as a whole is classified as urban or rural.
The primary objective of the 2002-2003 IDHS is to provide estimates with acceptable precision for the following domains:
· Indonesia as a whole;
· Each of 26 provinces covered in the survey. The four provinces excluded due to political instability are Nanggroe Aceh Darussalam, Maluku, North Maluku and Papua. These provinces cover 4 percent of the total population.
· Urban and rural areas of Indonesia;
· Each of the five districts in Central Java and the five districts in East Java covered in the Safe Motherhood Project (SMP), to provide information for the monitoring and evaluation of the project. These districts are:
- in Central Java: Cilacap, Rembang, Jepara, Pemalang, and Brebes.
- in East Java: Trenggalek, Jombang, Ngawi, Sampang and Pamekasan.
The census blocks (CBs) are the primary sampling unit for the 2002-2003 IDHS. CBs were formed during the preparation of the 2000 Population Census. Each CB includes approximately 80 households. In the master sample frame, the CBs are grouped by province, by regency/municipality within a province, and by subdistricts within a regency/municipality. In rural areas, the CBs in each district are listed by their geographical location. In urban areas, the CBs are distinguished by the urban classification (large, medium and small cities) in each subdistrict.
Note: See detailed description of sample design in APPENDIX B of the survey report.
Face-to-face
The 2002-2003 IDHS used three questionnaires: the Household Questionnaire, the Women’s Questionnaire for ever-married women 15-49 years old, and the Men’s Questionnaire for currently married men 15-54 years old. The Household Questionnaire and the Women’s Questionnaire were based on the DHS Model “A” Questionnaire, which is designed for use in countries with high contraceptive prevalence. In consultation with the NFPCB and MOH, BPS modified these questionnaires to reflect relevant issues in family planning and health in Indonesia. Inputs were also solicited from potential data users to optimize the IDHS in meeting the country’s needs for population and health data. The questionnaires were translated from English into the national language, Bahasa Indonesia.
The Household Questionnaire was used to list all the usual members and visitors in the selected households. Basic information collected for each person listed includes the following: age, sex, education, and relationship to the head of the household. The main purpose of the Household Questionnaire was to identify women and men who were eligible for the individual interview. In addition, the Household Questionnaire also identifies unmarried women and men age 15-24 who are eligible for the individual interview in the Indonesia Young Adult Reproductive Health Survey (IYARHS). Information on characteristics of the household’s dwelling unit, such as the source of water, type of toilet facilities, construction materials used for the floor and outer walls of the house, and ownership of various durable goods were also recorded in the Household Questionnaire. These items reflect the household’s socioeconomic status.
The Women’s Questionnaire was used to collect information from all ever-married women age 15-49. These women were asked questions on the following topics:
• Background characteristics, such as age, marital status, education, and media exposure
• Knowledge and use of family planning methods
• Fertility preferences
• Antenatal, delivery, and postnatal care
• Breastfeeding and infant feeding practices
• Vaccinations and childhood illnesses
• Marriage and sexual activity
• Woman’s work and husband’s background characteristics
• Childhood mortality
• Awareness and behavior regarding AIDS and other sexually transmitted infections (STIs)
• Sibling mortality, including maternal mortality.
The Men’s Questionnaire was administered to all currently married men age 15-54 in every third household in the IDHS sample. The Men’s Questionnaire collected much of the same information included in the Women’s Questionnaire, but was shorter because it did not contain questions on reproductive history, maternal and child health, nutrition, and maternal mortality. Instead, men were asked about their knowledge and participation in the health-seeking practices for their children.
All completed questionnaires for IDHS, accompanied by their control forms, were returned to the BPS central office in Jakarta for data processing. This process consisted of office editing, coding of open-ended questions, data entry, verification, and editing computer-identified errors. A team of about 40 data entry clerks, data editors, and two data entry supervisors processed the data. Data entry and editing started on November 4, 2002 using a computer package program called CSPro, which was specifically designed to process DHS-type survey data. To prepare the data entry programs, two BPS staff spent three weeks in ORC Macro offices in Calverton, Maryland in April 2002.
A total of 34,738 households were selected for the survey, of which 33,419 were found. Of the encountered households, 33,088 (99 percent) were successfully interviewed. In these households, 29,996 ever-married women 15-49 were identified, and complete interviews were obtained from 29,483 of them (98 percent). From the households selected for interviews with men, 8,740 currently married men 15-54 were identified, and complete interviews were obtained from 8,310 men, or 95 percent of all eligible men. The generally high response rates for both household and individual interviews (for eligible women and men) were due mainly to the strict enforcement of the rule to revisit the originally selected household if no one was at home initially. No substitution for the originally selected households was allowed. Interviewers were instructed to make at least three visits in an effort to contact the household, eligible women, and eligible men.
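The response rates quoted above follow directly from the reported counts. The sketch below recomputes them; the dictionary layout and labels are illustrative, and the counts are taken from the paragraph above.

```python
# (completed interviews, eligible units) as reported above.
counts = {
    "households": (33088, 33419),  # interviewed / found
    "women": (29483, 29996),       # completed / identified
    "men": (8310, 8740),           # completed / identified
}

# Response rate in percent, rounded to two decimals.
rates = {
    group: round(100 * completed / eligible, 2)
    for group, (completed, eligible) in counts.items()
}
print(rates)
```

Rounded to whole percentages, these give the 99, 98, and 95 percent figures quoted in the text.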
Note: See summarized response rates by place of residence in Table 1.2 of the survey report.
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors, and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2002-2003 Indonesia Demographic and Health Survey (IDHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents
The Afrobarometer is a comparative series of public attitude surveys that assess African citizens' attitudes to democracy and governance, markets, and civil society, among other topics. The surveys have been undertaken at periodic intervals since 1999. The Afrobarometer's coverage has increased over time. Round 1 (1999-2001) initially covered 7 countries and was later extended to 12 countries. Round 2 (2002-2004) surveyed citizens in 16 countries. Subsequent rounds covered 18 countries in Round 3 (2005-2006), 20 in Round 4 (2008), 34 in Round 5 (2011-2013), 36 in Round 6 (2014-2015), and 34 in Round 7 (2016-2018), followed by Round 8 (2019-2021). The survey covered 39 countries in Round 9 (2021-2023).
National coverage
Individual
Citizens of Gabon who are 18 years and older
Sample survey data [ssd]
Afrobarometer uses national probability samples designed to meet the following criteria. Each sample is designed to be a representative cross-section of all citizens of voting age in a given country. The goal is to give every adult citizen an equal and known chance of being selected for an interview. This is achieved by:
• using random selection methods at every stage of sampling;
• sampling at all stages with probability proportionate to population size wherever possible, to ensure that larger (i.e., more populated) geographic units have a proportionally greater probability of being chosen into the sample.
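Selection with probability proportionate to population size can be sketched as follows. This is a generic systematic PPS procedure, not necessarily the exact routine Afrobarometer uses; the function name `pps_systematic` and the example population sizes are assumptions made for illustration.

```python
import random
from bisect import bisect_right
from itertools import accumulate


def pps_systematic(sizes, n, rng):
    """Pick n unit indices with probability proportional to size.

    Units are laid end to end on a line whose length is the total
    population; a random start and a fixed interval then mark n points,
    and the unit under each point is selected. A unit larger than the
    interval can be selected more than once.
    """
    if n <= 0 or not sizes or any(s <= 0 for s in sizes):
        raise ValueError("need n > 0 and strictly positive sizes")
    cumulative = list(accumulate(sizes))
    interval = cumulative[-1] / n
    start = rng.random() * interval
    picks = []
    for k in range(n):
        point = start + k * interval
        index = bisect_right(cumulative, point)
        picks.append(min(index, len(sizes) - 1))  # guard against float edge cases
    return picks


# Hypothetical population counts for six geographic units.
example_sizes = [1200, 300, 4500, 800, 2500, 700]
selected = pps_systematic(example_sizes, n=3, rng=random.Random(42))
print(selected)
```

Larger units occupy more of the line, so they are proportionally more likely to contain a selection point.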
The sampling universe normally includes all citizens age 18 and older. As a standard practice, we exclude people living in institutionalized settings, such as students in dormitories, patients in hospitals, and persons in prisons or nursing homes. Occasionally, we must also exclude people living in areas determined to be inaccessible due to conflict or insecurity. Any such exclusion is noted in the technical information report (TIR) that accompanies each data set.
Sample size and design
Samples usually include either 1,200 or 2,400 cases. A randomly selected sample of n=1200 cases allows inferences to national adult populations with a margin of sampling error of no more than +/-2.8% with a confidence level of 95 percent. With a sample size of n=2400, the margin of error decreases to +/-2.0% at 95 percent confidence level.
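The quoted margins of error are consistent with the standard formula for a proportion under simple random sampling. The sketch below assumes the worst-case proportion p = 0.5 and a normal critical value of 1.96 for 95 percent confidence; it ignores any design effect from clustering and stratification, and the function name is hypothetical.

```python
import math


def margin_of_error(n: int, p: float = 0.5, z: float = 1.96) -> float:
    """Half-width of the confidence interval for a proportion under
    simple random sampling (no design effect applied)."""
    if n <= 0:
        raise ValueError("n must be positive")
    return z * math.sqrt(p * (1.0 - p) / n)


for n in (1200, 2400):
    print(f"n={n}: +/-{100 * margin_of_error(n):.1f} percentage points")
```

Doubling the sample size shrinks the margin by a factor of the square root of two, not by half, which is why 2,400 cases give roughly 2.0 rather than 1.4 percentage points.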
The sample design is a clustered, stratified, multi-stage, area probability sample. Specifically, we first stratify the sample according to the main sub-national unit of government (state, province, region, etc.) and by urban or rural location.
Area stratification reduces the likelihood that distinctive ethnic or language groups are left out of the sample. Afrobarometer occasionally purposely oversamples certain populations that are politically significant within a country to ensure that the size of the sub-sample is large enough to be analysed. Any oversample is noted in the TIR.
Sample stages
Samples are drawn in either four or five stages:
Stage 1: In rural areas only, the first stage is to draw secondary sampling units (SSUs). SSUs are not used in urban areas, and in some countries they are not used in rural areas. See the TIR that accompanies each data set for specific details on the sample in any given country.
Stage 2: We randomly select primary sampling units (PSU).
Stage 3: We then randomly select sampling start points.
Stage 4: Interviewers then randomly select households.
Stage 5: Within the household, the interviewer randomly selects an individual respondent. Each interviewer alternates in each household between interviewing a man and interviewing a woman to ensure gender balance in the sample.
Gabon
- Sample size: 1,200
- Sample design: Nationally representative, random, clustered, stratified, multi-stage area probability sample
- Stratification: Region and urban-rural location
- Stages: PSUs (from strata), start points, households, respondents
- PSU selection: Probability Proportionate to Population Size (PPPS)
- Cluster size: 8 households per PSU
- Household selection: Randomly selected start points, followed by walk pattern using 5/10 interval
- Respondent selection: Gender quota filled by alternating interviews between men and women; respondents of appropriate gender listed, after which computer randomly selects individual
- Weighting: Weighted to account for individual selection probabilities
- Sampling frame: Recensement Général de la Population et des Logements (RGPL) de 2013 réalisée par la Direction Générale de la Statistique et des Etudes Economiques
Face-to-face [f2f]
The Round 9 questionnaire has been developed by the Questionnaire Committee after reviewing the findings and feedback obtained in previous Rounds, and securing input on preferred new topics from a host of donors, analysts, and users of the data.
The questionnaire consists of three parts:
1. Part 1 captures the steps for selecting households and respondents, and includes the introduction to the respondent (pp. 1-4). This section should be filled in by the Fieldworker.
2. Part 2 covers the core attitudinal and demographic questions that are asked by the Fieldworker and answered by the Respondent (Q1 – Q100).
3. Part 3 includes contextual questions about the setting and atmosphere of the interview, and collects information on the Fieldworker. This section is completed by the Fieldworker (Q101 – Q123).
Response rate was 99%.
The sample size yields country-level results with a margin of error of +/-3 percentage points at a 95% confidence level.
The 2022 Ghana Demographic and Health Survey (2022 GDHS) is the seventh in the series of DHS surveys conducted by the Ghana Statistical Service (GSS) in collaboration with the Ministry of Health/Ghana Health Service (MoH/GHS) and other stakeholders, with funding from the United States Agency for International Development (USAID) and other partners.
The primary objective of the 2022 GDHS is to provide up-to-date estimates of basic demographic and health indicators. Specifically, the GDHS collected information on:
- Fertility levels and preferences, contraceptive use, antenatal and delivery care, maternal and child health, childhood mortality, childhood immunisation, breastfeeding and young child feeding practices, women’s dietary diversity, violence against women, gender, nutritional status of adults and children, awareness regarding HIV/AIDS and other sexually transmitted infections, tobacco use, and other indicators relevant for the Sustainable Development Goals
- Haemoglobin levels of women and children
- Prevalence of malaria parasitaemia (rapid diagnostic testing and thick slides for malaria parasitaemia in the field and microscopy in the lab) among children age 6–59 months
- Use of treated mosquito nets
- Use of antimalarial drugs for treatment of fever among children under age 5
The information collected through the 2022 GDHS is intended to assist policymakers and programme managers in designing and evaluating programmes and strategies for improving the health of the country’s population.
National coverage
The survey covered all de jure household members (usual residents), all women aged 15-49, men aged 15-59, and all children aged 0-4 resident in the household.
Sample survey data [ssd]
To achieve the objectives of the 2022 GDHS, a stratified representative sample of 18,540 households (618 clusters of 30 households each) was selected, which resulted in 15,014 interviewed women age 15–49 and 7,044 interviewed men age 15–59 (in one of every two households selected).
The sampling frame used for the 2022 GDHS is the updated frame prepared by the GSS based on the 2021 Population and Housing Census. The sampling procedure used in the 2022 GDHS was stratified two-stage cluster sampling, designed to yield representative results at the national level, for urban and rural areas, and for each of the country’s 16 regions for most DHS indicators. In the first stage, 618 target clusters were selected from the sampling frame using a probability proportional to size strategy for urban and rural areas in each region. Then the number of targeted clusters was selected with equal probability systematic random sampling of the clusters selected in the first phase for urban and rural areas. In the second stage, after selection of the clusters, a household listing and map updating operation was carried out in all of the selected clusters to develop a list of households for each cluster. This list served as a sampling frame for selection of the household sample. The GSS organized a 5-day training course on listing procedures for listers and mappers with support from ICF. The listers and mappers were organized into 25 teams consisting of one lister and one mapper per team. The teams spent 2 months completing the listing operation. In addition to listing the households, the listers collected the geographical coordinates of each household using GPS dongles provided by ICF and in accordance with the instructions in the DHS listing manual. The household listing was carried out using tablet computers, with software provided by The DHS Program. A fixed number of 30 households in each cluster were randomly selected from the list for interviews.
For further details on sample design, see APPENDIX A of the final report.
Face-to-face computer-assisted interviews [capi]
Four questionnaires were used in the 2022 GDHS: the Household Questionnaire, the Woman’s Questionnaire, the Man’s Questionnaire, and the Biomarker Questionnaire. The questionnaires, based on The DHS Program’s model questionnaires, were adapted to reflect the population and health issues relevant to Ghana. In addition, a self-administered Fieldworker Questionnaire collected information about the survey’s fieldworkers.
The GSS organized a questionnaire design workshop with support from ICF and obtained input from government and development partners expected to use the resulting data. The DHS Program optional modules on domestic violence, malaria, and social and behavior change communication were incorporated into the Woman’s Questionnaire. ICF provided technical assistance in adapting the modules to the questionnaires.
DHS staff installed all central office programmes, data structure checks, secondary editing, and field check tables from 17–20 October 2022. Central office training was implemented using the practice data to test the central office system and field check tables. Seven GSS staff members (four male and three female) were trained on the functionality of the central office menu, including accepting clusters from the field, data editing procedures, and producing reports to monitor fieldwork.
From 27 February to 17 March, DHS staff visited the Ghana Statistical Service office in Accra to work with the GSS central office staff on finishing the secondary editing and to clean and finalize all data received from the 618 clusters.
A total of 18,540 households were selected for the GDHS sample, of which 18,065 were found to be occupied. Of the occupied households, 17,933 were successfully interviewed, yielding a response rate of 99%. In the interviewed households, 15,317 women age 15–49 were identified as eligible for individual interviews. Interviews were completed with 15,014 women, yielding a response rate of 98%. In the subsample of households selected for the male survey, 7,263 men age 15–59 were identified as eligible for individual interviews and 7,044 were successfully interviewed.
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2022 Ghana Demographic and Health Survey (2022 GDHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2022 GDHS is only one of many samples that could have been selected from the same population, using the same design and identical size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results. A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95% of all possible samples of identical size and design.
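The plus-or-minus two standard errors rule described above translates directly into an interval calculation. This is a minimal sketch; the estimate and standard error used in the example are hypothetical values chosen for illustration and are not GDHS results.

```python
def confidence_interval(estimate: float, standard_error: float,
                        multiplier: float = 2.0) -> tuple[float, float]:
    """Return (lower, upper) bounds as estimate +/- multiplier * SE."""
    if standard_error < 0:
        raise ValueError("standard_error must be non-negative")
    half_width = multiplier * standard_error
    return estimate - half_width, estimate + half_width


# Hypothetical example: a proportion of 0.40 with a standard error of 0.012.
lower, upper = confidence_interval(0.40, 0.012)
print(f"approximate 95% interval: {lower:.3f} to {upper:.3f}")
```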
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2022 GDHS sample was the result of a multistage stratified design, and, consequently, it was necessary to use more complex formulas. The computer software used to calculate sampling errors for the GDHS 2022 is an SAS program. This program used the Taylor linearization method to estimate variances for survey estimates that are means, proportions, or ratios. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.
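To illustrate the replication idea, the sketch below applies a delete-one-cluster jackknife to a simple ratio estimate. It is a simplified, unweighted, unstratified illustration written in Python rather than the SAS program mentioned above; the function name and the cluster totals are assumptions made for the example.

```python
import math


def jackknife_ratio_se(clusters):
    """Estimate a ratio r = sum(y) / sum(x) and its jackknife standard error.

    `clusters` is a list of (y_total, x_total) pairs, one per cluster.
    Each replicate drops one cluster, recomputes the ratio, and is turned
    into a pseudo-value; the variance is taken over the pseudo-values.
    """
    k = len(clusters)
    if k < 2:
        raise ValueError("at least two clusters are required")
    total_y = sum(y for y, _ in clusters)
    total_x = sum(x for _, x in clusters)
    r = total_y / total_x
    pseudo_values = []
    for y, x in clusters:
        r_deleted = (total_y - y) / (total_x - x)      # ratio without this cluster
        pseudo_values.append(k * r - (k - 1) * r_deleted)
    variance = sum((p - r) ** 2 for p in pseudo_values) / (k * (k - 1))
    return r, math.sqrt(variance)


# Hypothetical cluster totals: (events, exposure) for five clusters.
example_clusters = [(12, 100), (9, 110), (15, 95), (11, 105), (8, 90)]
ratio, se = jackknife_ratio_se(example_clusters)
print(f"ratio={ratio:.4f}, jackknife SE={se:.4f}")
```

When every cluster has the same ratio, all replicates agree and the estimated standard error is zero, which is a quick sanity check on the implementation.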
A more detailed description of estimates of sampling errors is presented in APPENDIX B of the survey report.
Data Quality Tables
Abstract copyright UK Data Service and data collection copyright owner. The Opinions and Lifestyle Survey (formerly known as the ONS Opinions Survey or Omnibus) is an omnibus survey that began in 1990, collecting data on a range of subjects commissioned by both the ONS internally and external clients (limited to other government departments, charities, non-profit organisations and academia). Data are collected from one individual aged 16 or over, selected from each sampled private household. Personal data include data on the individual, their family, address, household, income and education, plus responses and opinions on a variety of subjects within commissioned modules. The questionnaire collects timely data for research and policy analysis evaluation on the social impacts of recent topics of national importance, such as the coronavirus (COVID-19) pandemic and the cost of living, on individuals and households in Great Britain. From April 2018 to November 2019, the design of the OPN changed from face-to-face to a mixed-mode design (online first with telephone interviewing where necessary). Mixed-mode collection allows respondents to complete the survey more flexibly and provides a more cost-effective service for customers. In March 2020, the OPN was adapted to become a weekly survey used to collect data on the social impacts of the coronavirus (COVID-19) pandemic on the lives of people of Great Britain. These data are held in the Secure Access study, SN 8635, ONS Opinions and Lifestyle Survey, Covid-19 Module, 2020-2022: Secure Access. From August 2021, as coronavirus (COVID-19) restrictions were lifting across Great Britain, the OPN moved to fortnightly data collection, sampling around 5,000 households in each survey wave to ensure the survey remains sustainable. The OPN has since expanded to include questions on other topics of national importance, such as health and the cost of living.
For more information about the survey and its methodology, see the ONS OPN Quality and Methodology Information webpage.
Secure Access Opinions and Lifestyle Survey data
Other Secure Access OPN data cover modules run at various points from 1997-2019, on Census religion (SN 8078), cervical cancer screening (SN 8080), contact after separation (SN 8089), contraception (SN 8095), disability (SNs 8680 and 8096), general lifestyle (SN 8092), illness and activity (SN 8094), and non-resident parental contact (SN 8093). See Opinions and Lifestyle Survey: Secure Access for details.
Main Topics:
Each month's questionnaire consists of two elements: core questions, covering demographic information, are asked each month together with non-core questions that vary from month to month. The non-core questions for this month were:
Company Cars (Module 1a): questions about the number of petrol-fuelled and diesel-fuelled company cars as well as total mileage and total business mileage.
Memory (Module 46): questions to test respondents' memory about events in the past.
Mortgage Arrears (Module 2): source of mortgage, if any, and whether behind in payments. Also 2 questions on whether bought from a Right to Buy scheme.
Stepchildren (Module 5): existence of stepchildren of informant/partner in household, and of dependent children of informant/partner outside the household.
Heights and Weights (Module 42): respondents are asked to estimate their heights and weights and to say how certain they are of the estimates.
Professional Fees (Module 47): methodological experiment to see if HOH or spouse are able to estimate amount spent by all household members on professional fees.
Investment Income (Module 7a): ownership of shares and income from shares and bank accounts.
Multi-stage stratified random sample
Face-to-face interview
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
analyze the american community survey (acs) with r and monetdb experimental. think of the american community survey (acs) as the united states' census for off-years - the ones that don't end in zero. every year, one percent of all americans respond, making it the largest complex sample administered by the u.s. government (the decennial census has a much broader reach, but since it attempts to contact 100% of the population, it's not a survey). the acs asks how people live and although the questionnaire only includes about three hundred questions on demography, income, insurance, it's often accurate at sub-state geographies and - depending how many years pooled - down to small counties. households are the sampling unit, and once a household gets selected for inclusion, all of its residents respond to the survey. this allows household-level data (like home ownership) to be collected more efficiently and lets researchers examine family structure. the census bureau runs and finances this behemoth, of course. the downloadable american community survey ships as two distinct household-level and person-level comma-separated value (.csv) files. merging the two just rectangulates the data, since each person in the person-file has exactly one matching record in the household-file. for analyses of small, smaller, and microscopic geographic areas, choose one-, three-, or five-year pooled files. use as few pooled years as you can, unless you like sentences that start with, "over the period of 2006 - 2010, the average american ... [insert yer findings here]." rather than processing the acs public use microdata sample line-by-line, the r language brazenly reads everything into memory by default. to prevent overloading your computer, dr. thomas lumley wrote the sqlsurvey package principally to deal with this ram-gobbling monster.
if you're already familiar with syntax used for the survey package, be patient and read the sqlsurvey examples carefully when something doesn't behave as you expect it to - some sqlsurvey commands require a different structure (i.e. svyby gets called through svymean) and others might not exist anytime soon (like svyolr). gimme some good news: sqlsurvey uses ultra-fast monetdb (click here for speed tests), so follow the monetdb installation instructions before running this acs code. monetdb imports, writes, recodes data slowly, but reads it hyper-fast. a magnificent trade-off: data exploration typically requires you to think, send an analysis command, think some more, send another query, repeat. importation scripts (especially the ones i've already written for you) can be left running overnight sans hand-holding. the acs weights generalize to the whole united states population including individuals living in group quarters, but non-residential respondents get an abridged questionnaire, so most (not all) analysts exclude records with a relp variable of 16 or 17 right off the bat.
this new github repository contains four scripts:
2005-2011 - download all microdata.R
- create the batch (.bat) file needed to initiate the monet database in the future
- download, unzip, and import each file for every year and size specified by the user
- create and save household- and merged/person-level replicate weight complex sample designs
- create a well-documented block of code to re-initiate the monet db server in the future
fair warning: this full script takes a loooong time. run it friday afternoon, commune with nature for the weekend, and if you've got a fast processor and speedy internet connection, monday morning it should be ready for action. otherwise, either download only the years and sizes you need or - if you gotta have 'em all - run it, minimize it, and then don't disturb it for a week.
2011 single-year - analysis examples.R
- run the well-documented block of code to re-initiate the monetdb server
- load the r data file (.rda) containing the replicate weight designs for the single-year 2011 file
- perform the standard repertoire of analysis examples, only this time using sqlsurvey functions
2011 single-year - variable recode example.R
- run the well-documented block of code to re-initiate the monetdb server
- copy the single-year 2011 table to maintain the pristine original
- add a new age category variable by hand
- add a new age category variable systematically
- re-create then save the sqlsurvey replicate weight complex sample design on this new table
- close everything, then load everything back up in a fresh instance of r
- replicate a few of the census statistics. no muss, no fuss
replicate census estimates - 2011.R
- run the well-documented block of code to re-initiate the monetdb server
- load the r data file (.rda) containing the replicate weight designs for the single-year 2011 file
- match every nationwide statistic on the census bureau's estimates page, using sqlsurvey functions
click here to view these four scripts
for more detail about the american community survey (acs), visit: the us census...
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This study engaged 409 participants over a period spanning from July 10 to August 8, 2023, ensuring representation across various demographic factors: 221 females, 186 males, 2 non-binary, year of birth between 1951 and 2005, with varied annual incomes and from 15 Spanish regions. The MobileWell400+ dataset, openly accessible, encompasses a wide array of data collected via the participants' mobile phone, including demographic, emotional, social, behavioral, and well-being data. Methodologically, the project presents a promising avenue for uncovering new social, behavioral, and emotional indicators, supplementing existing literature. Notably, artificial intelligence is considered to be instrumental in analysing these data, discerning patterns, and forecasting trends, thereby advancing our comprehension of individual and population well-being. Ethical standards were upheld, with participants providing informed consent.
The following is a non-exhaustive list of collected data:
For a more detailed description of the study please refer to MobileWell400+StudyDescription.pdf.
For a more detailed description of the collected data, variables and data files please refer to MobileWell400+FilesDescription.pdf.
The primary objective of the 2018 ZDHS was to provide up-to-date estimates of basic demographic and health indicators. Specifically, the ZDHS collected information on:
- Fertility levels and preferences; contraceptive use; maternal and child health; infant, child, and neonatal mortality levels; maternal mortality; and gender, nutrition, and awareness regarding HIV/AIDS and other health issues relevant to the achievement of the Sustainable Development Goals (SDGs)
- Ownership and use of mosquito nets as part of the national malaria eradication programmes
- Health-related matters such as breastfeeding, maternal and childcare (antenatal, delivery, and postnatal), children’s immunisations, and childhood diseases
- Anaemia prevalence among women age 15-49 and children age 6-59 months
- Nutritional status of children under age 5 (via weight and height measurements)
- HIV prevalence among men age 15-59 and women age 15-49 and behavioural risk factors related to HIV
- Assessment of situation regarding violence against women
National coverage
The survey covered all de jure household members (usual residents), all women age 15-49, all men age 15-59, and all children age 0-5 years who are usual members of the selected households or who spent the night before the survey in the selected households.
Sample survey data [ssd]
The sampling frame used for the 2018 ZDHS is the Census of Population and Housing (CPH) of the Republic of Zambia, conducted in 2010 by ZamStats. Zambia is divided into 10 provinces. Each province is subdivided into districts, each district into constituencies, and each constituency into wards. In addition to these administrative units, during the 2010 CPH each ward was divided into convenient areas called census supervisory areas (CSAs), and in turn each CSA was divided into enumeration areas (EAs). An enumeration area is a geographical area assigned to an enumerator for the purpose of conducting a census count; according to the Zambian census frame, each EA consists of an average of 110 households.
The current version of the EA frame for the 2010 CPH was updated to accommodate some changes in districts and constituencies that occurred between 2010 and 2017. The list of EAs incorporates census information on households and population counts. Each EA has a cartographic map delineating its boundaries, with identification information and a measure of size, which is the number of residential households enumerated in the 2010 CPH. This list of EAs was used as the sampling frame for the 2018 ZDHS.
The 2018 ZDHS followed a stratified two-stage sample design. The first stage involved selecting sample points (clusters) consisting of EAs. EAs were selected with a probability proportional to their size within each sampling stratum. A total of 545 clusters were selected.
The second stage involved systematic sampling of households. A household listing operation was undertaken in all of the selected clusters. During the listing, an average of 133 households were found in each cluster, from which a fixed number of 25 households were selected through an equal probability systematic selection process, to obtain a total sample size of 13,625 households. Results from this sample are representative at the national, urban and rural, and provincial levels.
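The second-stage selection can be sketched as an equal probability systematic draw from each cluster's household listing. The function below is an illustrative assumption rather than the procedure used by the survey software; the listing of 133 households mirrors the average cluster size quoted above.

```python
import random


def systematic_sample(items, n, rng):
    """Equal probability systematic selection of n items from a list.

    A random start is drawn within the first sampling interval, then
    every interval-th position is taken (a fractional interval is used
    so that any list length works).
    """
    if not 0 < n <= len(items):
        raise ValueError("n must be between 1 and len(items)")
    interval = len(items) / n
    start = rng.random() * interval
    last = len(items) - 1
    return [items[min(int(start + k * interval), last)] for k in range(n)]


# Hypothetical listing: household IDs 1..133 in one cluster.
listing = list(range(1, 134))
chosen = systematic_sample(listing, 25, random.Random(2018))
print(chosen)

# Total sample implied by the design: 545 clusters x 25 households.
total_households = 545 * 25
print(total_households)
```

Because the interval is at least one position wide, no household is selected twice within a cluster.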
For further details on sample selection, see Appendix A of the final report.
Face-to-face [f2f]
Four questionnaires were used in the 2018 ZDHS: the Household Questionnaire, the Woman’s Questionnaire, the Man’s Questionnaire, and the Biomarker Questionnaire. The questionnaires, based on The DHS Program’s Model Questionnaires, were adapted to reflect the population and health issues relevant to Zambia. Input on questionnaire content was solicited from various stakeholders representing government ministries and agencies, nongovernmental organisations, and international cooperating partners. After all questionnaires were finalised in English, they were translated into seven local languages: Bemba, Kaonde, Lozi, Lunda, Luvale, Nyanja, and Tonga. In addition, information about the fieldworkers for the survey was collected through a self-administered Fieldworker Questionnaire.
All electronic data files were transferred via a secure internet file streaming system to the ZamStats central office in Lusaka, where they were stored on a password-protected computer. The data processing operation included secondary editing, which required resolution of computer-identified inconsistencies and coding of open-ended questions. The data were processed by two IT specialists and one secondary editor who took part in the main fieldwork training; they were supervised remotely by staff from The DHS Program. Data editing was accomplished using CSPro software. During the fieldwork, field-check tables were generated to check various data quality parameters, and specific feedback was given to the teams to improve performance. Secondary editing and data processing were initiated in July 2018 and completed in March 2019.
Of the 13,595 households in the sample, 12,943 were occupied. Of these occupied households, 12,831 were successfully interviewed, yielding a response rate of 99%.
In the interviewed households, 14,189 women age 15-49 were identified as eligible for individual interviews; 13,683 women were interviewed, yielding a response rate of 96% (the same rate achieved in the 2013-14 survey). A total of 13,251 men were eligible for individual interviews; 12,132 of these men were interviewed, producing a response rate of 92% (a 1 percentage point increase from the previous survey).
Of the households successfully interviewed, 12,505 were interviewed in 2018 and 326 in 2019. As the large majority of households were interviewed in 2018, the year for reference indicators is 2018.
The estimates from a sample survey are affected by two types of errors: nonsampling errors and sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2018 Zambia Demographic and Health Survey (ZDHS) to minimise this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2018 ZDHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
Sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95% of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2018 ZDHS sample is the result of a multi-stage stratified design, and, consequently, it was necessary to use more complex formulas. Sampling errors are computed in SAS, using programs developed by ICF. These programs use the Taylor linearisation method to estimate variances for survey estimates that are means, proportions, or ratios. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.
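The jackknife idea can be sketched in Python as follows. This is a simplified, unstratified delete-one-cluster version for a ratio, using the standard pseudo-value form; it is an assumption-laden illustration, not the SAS programs developed by ICF, which account for the full stratified design. All names and input figures are hypothetical.

```python
import math


def jackknife_ratio_se(cluster_numerators, cluster_denominators):
    """Delete-one-cluster jackknife for the ratio r = sum(y) / sum(x).

    Returns (r, standard_error). Ignores stratification (simplifying assumption).
    """
    k = len(cluster_numerators)
    total_y = sum(cluster_numerators)
    total_x = sum(cluster_denominators)
    r = total_y / total_x

    pseudo_values = []
    for y_i, x_i in zip(cluster_numerators, cluster_denominators):
        # Estimate recomputed with cluster i removed.
        r_without_i = (total_y - y_i) / (total_x - x_i)
        pseudo_values.append(k * r - (k - 1) * r_without_i)

    variance = sum((p - r) ** 2 for p in pseudo_values) / (k * (k - 1))
    return r, math.sqrt(variance)


# Hypothetical data: three clusters of 10 respondents with 2, 4, and 6 positive cases.
ratio, se = jackknife_ratio_se([2, 4, 6], [10, 10, 10])
print(ratio, se)
```

Each replicate drops one cluster, so the spread of the replicate estimates reflects between-cluster variability, which is what drives sampling error in a clustered design.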
Note: A more detailed description of estimates of sampling errors is presented in APPENDIX B of the survey report.
Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Age distribution of eligible and interviewed men - Completeness of reporting - Births by calendar years - Reporting of age at death in days - Reporting of age at death in months - Completeness of information on siblings - Sibship size and sex ratio of siblings - Height and weight data completeness and quality for children - Number of enumeration areas completed by month, according to province, Zambia DHS 2018
Note: Data quality tables are presented in APPENDIX C of the report.
The 2022 Nepal Demographic and Health Survey (NDHS) is the sixth survey of its kind implemented in the country as part of the worldwide Demographic and Health Surveys (DHS) Program. It was implemented by New ERA under the aegis of the Ministry of Health and Population (MoHP) of the Government of Nepal with the objective of providing reliable, accurate, and up-to-date data for the country.
The primary objective of the 2022 NDHS is to provide up-to-date estimates of basic demographic and health indicators. Specifically, the 2022 NDHS collected information on fertility, marriage, family planning, breastfeeding practices, nutrition, food insecurity, maternal and child health, childhood mortality, awareness and behavior regarding HIV/AIDS and other sexually transmitted infections (STIs), women’s empowerment, domestic violence, fistula, mental health, accident and injury, disability, and other health-related issues such as smoking, knowledge of tuberculosis, and prevalence of hypertension.
The information collected through the 2022 NDHS is intended to assist policymakers and program managers in evaluating and designing programs and strategies for improving the health of Nepal’s population. The survey also provides indicators relevant to the Sustainable Development Goals (SDGs) for Nepal.
National coverage
The survey covered all de jure household members (usual residents), all women aged 15-49, men aged 15-49, and all children aged 0-4 resident in the household.
Sample survey data [ssd]
The sampling frame used for the 2022 NDHS is an updated version of the frame from the 2011 Nepal Population and Housing Census (NPHC) provided by the National Statistical Office. The 2022 NDHS considered wards from the 2011 census as sub-wards, the smallest administrative unit for the survey. The census frame includes a complete list of Nepal’s 36,020 sub-wards. Each sub-ward has a residence type (urban or rural), and the measure of size is the number of households.
In September 2015, Nepal’s Constituent Assembly declared changes in the administrative units and reclassified urban and rural areas in the country. Nepal is divided into seven provinces: Koshi Province, Madhesh Province, Bagmati Province, Gandaki Province, Lumbini Province, Karnali Province, and Sudurpashchim Province. Provinces are divided into districts, districts into municipalities, and municipalities into wards. Nepal has 77 districts comprising a total of 753 (local-level) municipalities. Of the municipalities, 293 are urban and 460 are rural.
Originally, the 2011 NPHC included 58 urban municipalities. This number increased to 217 as of 2015. On March 10, 2017, structural changes were made in the classification system for urban (Nagarpalika) and rural (Gaonpalika) locations. Nepal currently has 293 Nagarpalika, with 65% of the population living in these urban areas. The 2022 NDHS used this updated urban-rural classification system. The survey sample is a stratified sample selected in two stages. Stratification was achieved by dividing each of the seven provinces into urban and rural areas that together formed the sampling stratum for that province. A total of 14 sampling strata were created in this way. Implicit stratification with proportional allocation was achieved at each of the lower administrative levels by sorting the sampling frame within each sampling stratum before sample selection, according to administrative units at the different levels, and by using a probability-proportional-to-size selection at the first stage of sampling. In the first stage of sampling, 476 primary sampling units (PSUs) were selected with probability proportional to PSU size and with independent selection in each sampling stratum within the sample allocation. Among the 476 PSUs, 248 were from urban areas and 228 from rural areas. A household listing operation was carried out in all of the selected PSUs before the main survey. The resulting list of households served as the sampling frame for the selection of sample households in the second stage. Thirty households were selected from each cluster, for a total sample size of 14,280 households. Of these households, 7,440 were in urban areas and 6,840 were in rural areas. Some of the selected sub-wards were found to be overly large during the household listing operation. Selected sub-wards with an estimated number of households greater than 300 were segmented. Only one segment was selected for the survey with probability proportional to segment size.
For further details on sample design, see APPENDIX A of the final report.
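The sample-size figures quoted in the design description can be cross-checked with simple arithmetic. The Python sketch below only uses numbers stated in the text; the variable names are illustrative.

```python
PROVINCES = 7
RESIDENCE_TYPES = 2          # urban and rural
HOUSEHOLDS_PER_CLUSTER = 30
urban_psus, rural_psus = 248, 228

strata = PROVINCES * RESIDENCE_TYPES
total_psus = urban_psus + rural_psus
urban_households = urban_psus * HOUSEHOLDS_PER_CLUSTER
rural_households = rural_psus * HOUSEHOLDS_PER_CLUSTER
total_households = total_psus * HOUSEHOLDS_PER_CLUSTER

print(strata, total_psus, urban_households, rural_households, total_households)
```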
Computer Assisted Personal Interview [capi]
Four questionnaires were used in the 2022 NDHS: the Household Questionnaire, the Woman’s Questionnaire, the Man’s Questionnaire, and the Biomarker Questionnaire. The questionnaires, based on The DHS Program’s model questionnaires, were adapted to reflect the population and health issues relevant to Nepal. In addition, a self-administered Fieldworker Questionnaire collected information about the survey’s fieldworkers.
Input was solicited from various stakeholders representing government ministries and agencies, nongovernmental organizations, and international donors. After all questionnaires were finalized in English, they were translated into Nepali, Maithili, and Bhojpuri. The Household, Woman’s, and Man’s Questionnaires were programmed into tablet computers to facilitate computer-assisted personal interviewing (CAPI) for data collection purposes, with the capability to choose any of the three languages for each questionnaire. The Biomarker Questionnaire was completed on paper during data collection and then entered in the CAPI system.
Data capture for the 2022 NDHS was carried out with Microsoft Surface Go 2 tablets running Windows 10.1. Software was prepared for the survey using CSPro. The processing of the 2022 NDHS data began shortly after the fieldwork started. When data collection was completed in each cluster, the electronic data files were transferred via the Internet File Streaming System (IFSS) to the New ERA central office in Kathmandu. The data files were registered and checked for inconsistencies, incompleteness, and outliers. Errors and inconsistencies were immediately communicated to the field teams for review so that problems would be mitigated going forward. Secondary editing, carried out in the central office at New ERA, involved resolving inconsistencies and coding the open-ended questions. The New ERA senior data processor coordinated the exercise at the central office. The NDHS core team members assisted with the secondary editing. The paper Biomarker Questionnaires were compared with the electronic data file to check for any inconsistencies in data entry. The pictures of vaccination cards that were captured during data collection were verified with the data entered. Data processing and editing were carried out using the CSPro software package. The concurrent data collection and processing offered a distinct advantage because it maximized the likelihood of the data being error-free and accurate. Timely generation of field check tables allowed for effective monitoring. The secondary editing of the data was completed by July 2022, and the final cleaning of the data set was completed by the end of August.
A total of 14,243 households were selected for the sample, of which 13,833 were found to be occupied. Of the occupied households, 13,786 were successfully interviewed, yielding a response rate of more than 99%. In the interviewed households, 15,238 women age 15-49 were identified as eligible for individual interviews. Interviews were completed with 14,845 women, yielding a response rate of 97%. In the subsample of households selected for the men’s survey, 5,185 men age 15-49 were identified as eligible for individual interviews and 4,913 were successfully interviewed, yielding a response rate of 95%.
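The response rates above can be reproduced from the stated counts. The Python sketch below treats each rate as the simple ratio of completed to eligible (or occupied) units, which is an assumption; the survey report may apply a more detailed definition.

```python
def response_rate(completed, eligible):
    """Completed interviews as a percentage of eligible units."""
    return 100.0 * completed / eligible


rates = {
    "households": response_rate(13786, 13833),
    "women": response_rate(14845, 15238),
    "men": response_rate(4913, 5185),
}

for group, rate in rates.items():
    print(f"{group}: {rate:.1f}%")
```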
The estimates from a sample survey are affected by two types of errors: nonsampling errors and sampling errors. Nonsampling errors result from mistakes made in implementing data collection and in data processing, such as failing to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and entering the data incorrectly. Although numerous efforts were made during the implementation of the 2022 Nepal Demographic and Health Survey (2022 NDHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2022 NDHS is only one of many samples that could have been selected from the same population, using the same design and expected sample size. Each of these samples would yield results that differ somewhat from the results of the selected sample. Sampling errors are a measure of the variability among all possible samples. Although the exact degree of variability is unknown, it can be estimated from the survey results.
Sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, and so on), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall.
The Afrobarometer is a comparative series of public attitude surveys that assess African citizens' attitudes to democracy and governance, markets, and civil society, among other topics. The surveys have been undertaken at periodic intervals since 1999. The Afrobarometer's coverage has increased over time. Round 1 (1999-2001) initially covered 7 countries and was later extended to 12 countries. Round 2 (2002-2004) surveyed citizens in 16 countries. Round 3 (2005-2006) covered 18 countries, Round 4 (2008) 20 countries, Round 5 (2011-2013) 34 countries, Round 6 (2014-2015) 36 countries, and Round 7 (2016-2018) 34 countries. The survey covered 34 countries in Round 8 (2019-2021).
National coverage
Individual
The sample universe for Afrobarometer surveys includes all citizens of voting age within the country. In other words, we exclude anyone who is not a citizen and anyone who has not attained this age (usually 18 years) on the day of the survey. Also excluded are areas determined to be either inaccessible or not relevant to the study, such as those experiencing armed conflict or natural disasters, as well as national parks and game reserves. As a matter of practice, we have also excluded people living in institutionalized settings, such as students in dormitories and persons in prisons or nursing homes.
Sample survey data [ssd]
Afrobarometer Sampling Procedure
Afrobarometer uses national probability samples designed to meet the following criteria. Each sample is designed to be a representative cross-section of all citizens of voting age in a given country. The goal is to give every adult citizen an equal and known chance of being selected for an interview. This is achieved by:
• using random selection methods at every stage of sampling; • sampling at all stages with probability proportionate to population size wherever possible to ensure that larger (i.e., more populated) geographic units have a proportionally greater probability of being chosen into the sample.
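Selection with probability proportionate to population size can be sketched in Python as below. The sketch uses systematic PPS sampling, which is one common technique; the source does not state which PPS algorithm Afrobarometer applies, so the function and its inputs are hypothetical.

```python
import random


def pps_systematic(sizes, n, rng=None):
    """Pick n unit indices with probability proportional to size.

    Uses a random start and a fixed interval along the cumulative sizes.
    A unit larger than the interval can be picked more than once.
    """
    rng = rng or random.Random()
    interval = sum(sizes) / n
    start = rng.random() * interval          # random start in [0, interval)
    targets = [start + i * interval for i in range(n)]

    selected, cumulative, t = [], 0.0, 0
    for index, size in enumerate(sizes):
        cumulative += size
        while t < n and targets[t] < cumulative:
            selected.append(index)
            t += 1
    return selected


# Hypothetical populations of five geographic units; draw two of them.
picked = pps_systematic([5000, 1200, 800, 2500, 500], n=2, rng=random.Random(42))
print(picked)
```

Larger units occupy a longer stretch of the cumulative total, so a selection point is more likely to land inside them.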
The sampling universe normally includes all citizens age 18 and older. As a standard practice, we exclude people living in institutionalized settings, such as students in dormitories, patients in hospitals, and persons in prisons or nursing homes. Occasionally, we must also exclude people living in areas determined to be inaccessible due to conflict or insecurity. Any such exclusion is noted in the technical information report (TIR) that accompanies each data set.
Sample size and design
Samples usually include either 1,200 or 2,400 cases. A randomly selected sample of n=1200 cases allows inferences to national adult populations with a margin of sampling error of no more than +/-2.8% with a confidence level of 95 percent. With a sample size of n=2400, the margin of error decreases to +/-2.0% at 95 percent confidence level.
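The quoted margins of error are consistent with the usual formula for a proportion under simple random sampling. The Python sketch below assumes the worst case p = 0.5 and z = 1.96, and ignores any design effect from clustering; these are assumptions, not details given in the source.

```python
import math


def margin_of_error(n, p=0.5, z=1.96):
    """Half-width of the confidence interval for a proportion (simple random sample)."""
    return z * math.sqrt(p * (1 - p) / n)


for n in (1200, 2400):
    print(n, f"+/-{100 * margin_of_error(n):.1f}%")
```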
The sample design is a clustered, stratified, multi-stage, area probability sample. Specifically, we first stratify the sample according to the main sub-national unit of government (state, province, region, etc.) and by urban or rural location.
Area stratification reduces the likelihood that distinctive ethnic or language groups are left out of the sample. Afrobarometer occasionally purposely oversamples certain populations that are politically significant within a country to ensure that the size of the sub-sample is large enough to be analysed. Any oversample is noted in the TIR.
Sample stages
Samples are drawn in either four or five stages:
Stage 1: In rural areas only, the first stage is to draw secondary sampling units (SSUs). SSUs are not used in urban areas, and in some countries they are not used in rural areas. See the TIR that accompanies each data set for specific details on the sample in any given country.
Stage 2: We randomly select primary sampling units (PSUs).
Stage 3: We then randomly select sampling start points.
Stage 4: Interviewers then randomly select households.
Stage 5: Within the household, the interviewer randomly selects an individual respondent. Each interviewer alternates in each household between interviewing a man and interviewing a woman to ensure gender balance in the sample.
To keep the costs and logistics of fieldwork within manageable limits, eight interviews are clustered within each selected PSU.
Benin - Sample size: 1,200 - Sampling frame: 2013 sampling frame updated from the General Population and Housing Census (RGPH 4) - Sample design: Nationally representative, random, clustered, stratified, multistage area, probability sampling - Stratification: Region, urban-rural distribution - Stages: PSUs (from strata), start points, households, respondents - PSU selection: Probability proportional to population size (PPPS) - Cluster size: 8 households per PSU - Household selection: Random choice of the starting point, followed by the sampling interval using an interval of 5/10 households - Respondent selection: Gender quota to be achieved by alternating interviews between men and women; potential respondents (i.e. household members) of the appropriate gender are listed, then the computer randomly selects the individual
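The respondent-selection rule (alternate the target gender between households, list eligible members of that gender, then pick one at random) can be sketched in Python as follows. The function names and the household data are hypothetical; the actual fieldwork software is not described in the source.

```python
import random


def select_respondent(members, target_gender, rng):
    """Randomly pick one listed member of the target gender, or None if there is none.

    members: list of (name, gender) tuples for eligible adult citizens.
    """
    candidates = [name for name, gender in members if gender == target_gender]
    return rng.choice(candidates) if candidates else None


def alternate(gender):
    """Target gender for the next household."""
    return "woman" if gender == "man" else "man"


rng = random.Random(0)
household = [("A", "woman"), ("B", "man"), ("C", "man")]
chosen = select_respondent(household, "man", rng)
next_target = alternate("man")
print(chosen, next_target)
```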
Face-to-face [f2f]
The Round 8 questionnaire has been developed by the Questionnaire Committee after reviewing the findings and feedback obtained in previous Rounds, and securing input on preferred new topics from a host of donors, analysts, and users of the data.
The questionnaire consists of three parts: 1. Part 1 captures the steps for selecting households and respondents, and includes the introduction to the respondent (pp. 1-4). This section should be filled in by the Fieldworker. 2. Part 2 covers the core attitudinal and demographic questions that are asked by the Fieldworker and answered by the Respondent (Q1 – Q100). 3. Part 3 includes contextual questions about the setting and atmosphere of the interview, and collects information on the Fieldworker. This section is completed by the Fieldworker (Q101 – Q123).
Response rate was 80%.
The Afrobarometer is a comparative series of public attitude surveys that assess African citizens' attitudes to democracy and governance, markets, and civil society, among other topics. The surveys have been undertaken at periodic intervals since 1999. The Afrobarometer's coverage has increased over time. Round 1 (1999-2001) initially covered 7 countries and was later extended to 12 countries. Round 2 (2002-2004) surveyed citizens in 16 countries. Round 3 (2005-2006) covered 18 countries, Round 4 (2008) 20 countries, Round 5 (2011-2013) 34 countries, Round 6 (2014-2015) 36 countries, and Round 7 (2016-2018) 34 countries. The survey covered 34 countries in Round 8 (2019-2021).
National coverage
Individual
Citizens of Tunisia who are 18 years and older
Sample survey data [ssd]
Afrobarometer uses national probability samples designed to meet the following criteria. Each sample is designed to be a representative cross-section of all citizens of voting age in a given country. The goal is to give every adult citizen an equal and known chance of being selected for an interview. This is achieved by:
• using random selection methods at every stage of sampling; • sampling at all stages with probability proportionate to population size wherever possible to ensure that larger (i.e., more populated) geographic units have a proportionally greater probability of being chosen into the sample.
The sampling universe normally includes all citizens age 18 and older. As a standard practice, we exclude people living in institutionalized settings, such as students in dormitories, patients in hospitals, and persons in prisons or nursing homes. Occasionally, we must also exclude people living in areas determined to be inaccessible due to conflict or insecurity. Any such exclusion is noted in the technical information report (TIR) that accompanies each data set.
Sample size and design
Samples usually include either 1,200 or 2,400 cases. A randomly selected sample of n=1200 cases allows inferences to national adult populations with a margin of sampling error of no more than +/-2.8% with a confidence level of 95 percent. With a sample size of n=2400, the margin of error decreases to +/-2.0% at 95 percent confidence level.
The sample design is a clustered, stratified, multi-stage, area probability sample. Specifically, we first stratify the sample according to the main sub-national unit of government (state, province, region, etc.) and by urban or rural location.
Area stratification reduces the likelihood that distinctive ethnic or language groups are left out of the sample. Afrobarometer occasionally purposely oversamples certain populations that are politically significant within a country to ensure that the size of the sub-sample is large enough to be analysed. Any oversample is noted in the TIR.
Sample stages
Samples are drawn in either four or five stages:
Stage 1: In rural areas only, the first stage is to draw secondary sampling units (SSUs). SSUs are not used in urban areas, and in some countries they are not used in rural areas. See the TIR that accompanies each data set for specific details on the sample in any given country.
Stage 2: We randomly select primary sampling units (PSUs).
Stage 3: We then randomly select sampling start points.
Stage 4: Interviewers then randomly select households.
Stage 5: Within the household, the interviewer randomly selects an individual respondent. Each interviewer alternates in each household between interviewing a man and interviewing a woman to ensure gender balance in the sample.
To keep the costs and logistics of fieldwork within manageable limits, eight interviews are clustered within each selected PSU.
Tunisia - Sample size: 1,200 - Sampling Frame: The sampling frame was created based on the final results of the last census done in Tunisia in 2014 by the National Institute of Statistics. - Sample design: Nationally representative, random, clustered, stratified, multi-stage area probability sample - Stratification: Region and rural-urban location - Stages: PSUs (from strata), start points, households, respondents - PSU selection: Probability Proportionate to Population Size (PPPS) - Cluster size: 8 households per PSU - Household selection: Randomly selected start points, followed by walk pattern using 5/10 interval - Respondent selection: Gender quota filled by alternating interviews between men and women; respondents of appropriate gender listed, after which computer randomly selects individual
Face-to-face [f2f]
The Round 8 questionnaire has been developed by the Questionnaire Committee after reviewing the findings and feedback obtained in previous Rounds, and securing input on preferred new topics from a host of donors, analysts, and users of the data.
The questionnaire consists of three parts: 1. Part 1 captures the steps for selecting households and respondents, and includes the introduction to the respondent (pp. 1-4). This section should be filled in by the Fieldworker. 2. Part 2 covers the core attitudinal and demographic questions that are asked by the Fieldworker and answered by the Respondent (Q1 – Q100). 3. Part 3 includes contextual questions about the setting and atmosphere of the interview, and collects information on the Fieldworker. This section is completed by the Fieldworker (Q101 – Q123).
Outcome rates: - Contact rate: 52% - Cooperation rate: 47% - Refusal rate: 20% - Response rate: 24%
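The reported outcome rates appear to be related arithmetically. The Python sketch below assumes, as a working hypothesis, that the response rate equals the contact rate multiplied by the cooperation rate (which holds when cooperation is measured among contacted eligible cases). The source does not state the definitions used, so this is only a consistency check.

```python
contact_rate = 0.52
cooperation_rate = 0.47

# Assumed relationship: response = contact x cooperation.
implied_response_rate = contact_rate * cooperation_rate
print(f"{implied_response_rate:.0%}")
```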
The sample size yields country-level results with a margin of error of +/-3 percentage points at a 95% confidence level.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the population of Norfolk by gender across 18 age groups. It lists the male and female population in each age group along with the gender ratio for Norfolk. The dataset can be utilized to understand the population distribution of Norfolk by gender and age. For example, using this dataset, we can identify the largest age group for both men and women in Norfolk. Additionally, it can be used to see how the gender ratio changes from birth to the most senior age group and to compare the male-to-female ratio across age groups for Norfolk.
Key observations
Largest age group (population): Male # 20-24 years (15,849) | Female # 20-24 years (11,008). Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.
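As a small worked example, the gender ratio for the largest age group can be computed from the two counts quoted above. The Python sketch assumes the ratio is expressed as males per 100 females; the dataset's exact definition is not given in this description.

```python
def gender_ratio(male, female):
    """Males per 100 females (assumed definition)."""
    return 100.0 * male / female


# Counts for the 20-24 age group as quoted in the key observations.
ratio_20_24 = gender_ratio(15849, 11008)
print(f"{ratio_20_24:.1f} males per 100 females")
```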
Age groups:
Scope of gender:
Please note that the American Community Survey asks a question about the respondent's current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are expected to answer either Male or Female. Our research and this dataset mirror the data reported as Male and Female for gender distribution analysis.
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presenting these estimates in your research.
Custom data
If you need custom data for any of your research projects, reports, or presentations, you can contact our research staff at research@neilsberg.com to assess the feasibility of a custom tabulation on a fee-for-service basis.
The Neilsberg Research Team curates, analyzes, and publishes demographic and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights are made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Norfolk Population by Gender.
The Afrobarometer is a comparative series of public attitude surveys that assess African citizens' attitudes to democracy and governance, markets, and civil society, among other topics. The surveys have been undertaken at periodic intervals since 1999. The Afrobarometer's coverage has increased over time. Round 1 (1999-2001) initially covered 7 countries and was later extended to 12 countries. Round 2 (2002-2004) surveyed citizens in 16 countries. Round 3 (2005-2006) covered 18 countries, Round 4 (2008) 20 countries, Round 5 (2011-2013) 34 countries, Round 6 (2014-2015) 36 countries, and Round 7 (2016-2018) 34 countries. The survey covered 34 countries in Round 8 (2019-2021).
National coverage
Individual
Citizens of Mozambique who are 18 years and older
Sample survey data [ssd]
Afrobarometer uses national probability samples designed to meet the following criteria. Each sample is designed to be a representative cross-section of all citizens of voting age in a given country. The goal is to give every adult citizen an equal and known chance of being selected for an interview. This is achieved by:
• using random selection methods at every stage of sampling; • sampling at all stages with probability proportionate to population size wherever possible to ensure that larger (i.e., more populated) geographic units have a proportionally greater probability of being chosen into the sample.
The sampling universe normally includes all citizens age 18 and older. As a standard practice, we exclude people living in institutionalized settings, such as students in dormitories, patients in hospitals, and persons in prisons or nursing homes. Occasionally, we must also exclude people living in areas determined to be inaccessible due to conflict or insecurity. Any such exclusion is noted in the technical information report (TIR) that accompanies each data set.
Sample size and design
Samples usually include either 1,200 or 2,400 cases. A randomly selected sample of n=1200 cases allows inferences to national adult populations with a margin of sampling error of no more than +/-2.8% with a confidence level of 95 percent. With a sample size of n=2400, the margin of error decreases to +/-2.0% at 95 percent confidence level.
The sample design is a clustered, stratified, multi-stage, area probability sample. Specifically, we first stratify the sample according to the main sub-national unit of government (state, province, region, etc.) and by urban or rural location.
Area stratification reduces the likelihood that distinctive ethnic or language groups are left out of the sample. Afrobarometer occasionally purposely oversamples certain populations that are politically significant within a country to ensure that the size of the sub-sample is large enough to be analysed. Any oversample is noted in the TIR.
Sample stages
Samples are drawn in either four or five stages:
Stage 1: In rural areas only, the first stage is to draw secondary sampling units (SSUs). SSUs are not used in urban areas, and in some countries they are not used in rural areas. See the TIR that accompanies each data set for specific details on the sample in any given country.
Stage 2: We randomly select primary sampling units (PSUs).
Stage 3: We then randomly select sampling start points.
Stage 4: Interviewers then randomly select households.
Stage 5: Within the household, the interviewer randomly selects an individual respondent. Each interviewer alternates in each household between interviewing a man and interviewing a woman to ensure gender balance in the sample.
To keep the costs and logistics of fieldwork within manageable limits, eight interviews are clustered within each selected PSU.
Face-to-face [f2f]
The Round 8 questionnaire has been developed by the Questionnaire Committee after reviewing the findings and feedback obtained in previous Rounds, and securing input on preferred new topics from a host of donors, analysts, and users of the data.
The questionnaire consists of three parts: 1. Part 1 captures the steps for selecting households and respondents, and includes the introduction to the respondent (pp. 1-4). This section should be filled in by the Fieldworker. 2. Part 2 covers the core attitudinal and demographic questions that are asked by the Fieldworker and answered by the Respondent (Q1 – Q100). 3. Part 3 includes contextual questions about the setting and atmosphere of the interview, and collects information on the Fieldworker. This section is completed by the Fieldworker (Q101 – Q123).
Pursuant to Local Laws 126, 127, and 128 of 2016, certain demographic data is collected voluntarily and anonymously from persons seeking social services. This data can be used by agencies and the public to better understand the demographic makeup of client populations and to better understand and serve residents of all backgrounds and identities. The data presented here has been collected through either electronic form or paper surveys offered at the point of application for services. These surveys are anonymous. Each record represents an anonymized demographic profile of an individual applicant for social services, disaggregated by response option, agency, and program. Response options include information regarding ancestry, race, primary and secondary languages, English proficiency, gender identity, and sexual orientation. Idiosyncrasies or Limitations: Note that while the dataset contains the total number of individuals who have identified their ancestry or languages spoken, because such data is collected anonymously, there may be instances of a single individual completing multiple voluntary surveys. Additionally, the survey being both voluntary and anonymous has advantages as well as disadvantages: it increases the likelihood of full and honest answers, but since it is not connected to the individual case, it does not directly inform delivery of services to the applicant. The paper and online versions of the survey ask the same questions but free-form text is handled differently. Free-form text fields are expected to be entered in English although the form is available in several languages. Surveys are presented in 11 languages.
Paper Surveys 1. Are optional 2. Survey taker is expected to specify agency that provides service 3. Survey taker can skip or elect not to answer questions 4. Invalid/unreadable data may be entered for survey date or date may be skipped 5. OCRing of free-form text fields may fail 6. Analytical value of free-form text answers is unclear
Online Survey 1. Are optional 2. Agency is defaulted based on the URL 3. Some questions must be answered 4. Date of survey is automated