https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Professional organizations in STEM (science, technology, engineering, and mathematics) can use demographic data to quantify recruitment and retention (R&R) of underrepresented groups within their memberships. However, variation in the types of demographic data collected can influence the targeting and perceived impacts of R&R efforts - e.g., giving false signals of R&R for some groups. We obtained demographic surveys from 73 U.S.-affiliated STEM organizations, collectively representing 712,000 members and conference-attendees. We found large differences in the demographic categories surveyed (e.g., disability status, sexual orientation) and the available response options. These discrepancies indicate a lack of consensus regarding the demographic groups that should be recognized and, for groups that are omitted from surveys, an inability of organizations to prioritize and evaluate R&R initiatives. Aligning inclusive demographic surveys across organizations will provide baseline data that can be used to target and evaluate R&R initiatives to better serve underrepresented groups throughout STEM. Methods We surveyed 164 STEM organizations (73 responses, rate = 44.5%) between December 2020 and July 2021 with the goal of understanding what demographic data each organization collects from its constituents (i.e., members and conference-attendees) and how the data are used. Organizations were sourced from a list of professional societies affiliated with the American Association for the Advancement of Science, AAAS, (n = 156) or from social media (n = 8). The survey was sent to the elected leadership and management firms for each organization, and follow-up reminders were sent after one month. The responding organizations represented a wide range of fields: 31 life science organizations (157,000 constituents), 5 mathematics organizations (93,000 constituents), 16 physical science organizations (207,000 constituents), 7 technology organizations (124,000 constituents), and 14 multi-disciplinary organizations spanning multiple branches of STEM (131,000 constituents). A list of the responding organizations is available in the Supplementary Materials. Based on the AAAS-affiliated recruitment of the organizations and the similar distribution of constituencies across STEM fields, we conclude that the responding organizations are a representative cross-section of the most prominent STEM organizations in the U.S. Each organization was asked about the demographic information they collect from their constituents, the response rates to their surveys, and how the data were used. Survey description The following questions are written as presented to the participating organizations. Question 1: What is the name of your STEM organization? Question 2: Does your organization collect demographic data from your membership and/or meeting attendees? Question 3: When was your organization’s most recent demographic survey (approximate year)? Question 4: We would like to know the categories of demographic information collected by your organization. You may answer this question by either uploading a blank copy of your organization’s survey (linked provided in online version of this survey) OR by completing a short series of questions. Question 5: On the most recent demographic survey or questionnaire, what categories of information were collected? (Please select all that apply)
Disability status Gender identity (e.g., male, female, non-binary) Marital/Family status Racial and ethnic group Religion Sex Sexual orientation Veteran status Other (please provide)
Question 6: For each of the categories selected in Question 5, what options were provided for survey participants to select? Question 7: Did the most recent demographic survey provide a statement about data privacy and confidentiality? If yes, please provide the statement. Question 8: Did the most recent demographic survey provide a statement about intended data use? If yes, please provide the statement. Question 9: Who maintains the demographic data collected by your organization? (e.g., contracted third party, organization executives) Question 10: How has your organization used members’ demographic data in the last five years? Examples: monitoring temporal changes in demographic diversity, publishing diversity data products, planning conferences, contributing to third-party researchers. Question 11: What is the size of your organization (number of members or number of attendees at recent meetings)? Question 12: What was the response rate (%) for your organization’s most recent demographic survey? *Organizations were also able to upload a copy of their demographics survey instead of responding to Questions 5-8. If so, the uploaded survey was used (by the study authors) to evaluate Questions 5-8.
The Turkey Demographic and Health Survey (DHS) 2008 has been conducted by the Haccettepe University Institute of Population Studies in collaboration with the Ministry of health General Directorate of Mother and Child Health and Family Planning and Undersecretary of State Planning Organization. The Turkey Demographic and Health Survey 2008 has been financed the scientific and Technological research Council of Turkey (TUBITAK) under the support program for Research Projects of Public Institutions.
The primary objective of the Turkey DHS 2008 is to provide data on fertility, contraceptive methods, maternal and child health. Detailed information on these issues is obtained through questionnaires, filled by face-to face interviews with ever-married women in reproductive ages (15-49).
Another important objective of the survey, with aims to contribute to the knowledge on population and health as well, is to maintain the flow of information for the related organizations in Turkey on the Turkish demographic structure and change in the absence of reliable vital registration system and ascertain the continuity of data on demographic and health necessary for sustainable development in the absence of a reliable vital registration system. In terms of survey methodology and content, the Turkey DHS 2008 is comparable with the previous demographic surveys in Turkey (MEASURE DHS+).
National
Sample survey data
Face-to-face
Two main types of questionnaires were used to collect the TDHS-2008 data: a) The Household Questionnaire; b) The Individual Questionnaire for Ever-Married Women of Reproductive Ages.
The contents of these questionnaires were based on the DHS Model "A" Questionnaire, which was designed for the DHS program for use in countries with high contraceptive prevalence. Additions, deletions and modifications were made to the DHS model questionnaire in order to collect information particularly relevant to Turkey. Attention also was paid to ensuring the comparability of the DHS-2008 findings with previous demographic surveys carried out by the Hacettepe Institute of Population Studies. In the process of designing the TDHS-2003 questionnaires, national and international population and health agencies were consulted for their comments.
a) The Household Questionnaire was used to enumerate all usual members of and visitors to the selected households and to collect information relating to the socioeconomic position of the households. In the first part of the Household Questionnaire, basic information was collected on the age, sex, educational attainment, recent migration and residential mobility, employment, marital status, and relationship to the head of household of each person listed as a household member or visitor. The objective of the first part of the Household Questionnaire was to obtain the information needed to identify women who were eligible for the individual interview as well as to provide basic demographic data for Turkish households. The second part of the Household Questionnaire included questions on never married women age 15-49, with the objective of collecting information on basic background characteristics of women in this age group. The third section was used to collect information on the welfare of the elderly people. The final section of the Household Questionnaire was used to collect information on housing characteristics, such as the number of rooms, the flooring material, the source of water, and the type of toilet facilities, and on the household's ownership of a variety of consumer goods. This section also incorporated a module that was only administered in Istanbul metropolitan households, on house ownership, use of municipal facilities and the like, as well as a module that was used to collect information, from one-half of households, on salt iodization. In households where salt was present, test kits were used to test whether the salt used in the household was fortified with potassium iodine or potassium iodate, i.e. whether salt was iodized.
b) The Individual Questionnaire for ever-married women obtained information on the following subjects:
- Background characteristics
- Reproduction
- Marriage
- Knowledge and use of family planning
- Maternal care and breastfeeding
- Immunization and health
- Fertility preferences
- Husband's background
- Women's work and status
- Sexually transmitted diseases and AIDS
- Maternal and child anthropometry.
The questionnaires were returned to the Hacettepe Institute of Population Studies by the fieldwork teams for data processing as soon as interviews were completed in a province. The office editing staff checked that the questionnaires for all the selected households and eligible respondents were returned from the field.
The 2002 Vietnam Demographic and Health Survey (VNDHS 2002) is a nationally representative sample survey of 5,665 ever-married women age 15-49 selected from 205 sample points (clusters) throughout Vietnam. It provides information on levels of fertility, family planning knowledge and use, infant and child mortality, and indicators of maternal and child health. The survey included a Community/ Health Facility Questionnaire that was implemented in each of the sample clusters.
The survey was designed to measure change in reproductive health indicators over the five years since the VNDHS 1997, especially in the 18 provinces that were targeted in the Population and Family Health Project of the Committee for Population, Family and Children. Consequently, all provinces were separated into “project” and “nonproject” groups to permit separate estimates for each. Data collection for the survey took place from 1 October to 21 December 2002.
The Vietnam Demographic and Health Survey 2002 (VNDHS 2002) was the third DHS in Vietnam, with prior surveys implemented in 1988 and 1997. The VNDHS 2002 was carried out in the framework of the activities of the Population and Family Health Project of the Committee for Population, Family and Children (previously the National Committee for Population and Family Planning).
The main objectives of the VNDHS 2002 were to collect up-to-date information on family planning, childhood mortality, and health issues such as breastfeeding practices, pregnancy care, vaccination of children, treatment of common childhood illnesses, and HIV/AIDS, as well as utilization of health and family planning services. The primary objectives of the survey were to estimate changes in family planning use in comparison with the results of the VNDHS 1997, especially on issues in the scope of the project of the Committee for Population, Family and Children.
VNDHS 2002 data confirm the pattern of rapidly declining fertility that was observed in the VNDHS 1997. It also shows a sharp decline in child mortality, as well as a modest increase in contraceptive use. Differences between project and non-project provinces are generally small.
The 2002 Vietnam Demographic and Health Survey (VNDHS 2002) is a nationally representative sample survey. The VNDHS 1997 was designed to provide separate estimates for the whole country, urban and rural areas, for 18 project provinces and the remaining nonproject provinces as well. Project provinces refer to 18 focus provinces targeted for the strengthening of their primary health care systems by the Government's Population and Family Health Project to be implemented over a period of seven years, from 1996 to 2002 (At the outset of this project there were 15 focus provinces, which became 18 by the creation of 3 new provinces from the initial set of 15). These provinces were selected according to criteria based on relatively low health and family planning status, no substantial family planning donor presence, and regional spread. These criteria resulted in the selection of the country's poorer provinces. Nine of these provinces have significant proportions of ethnic minorities among their population.
The population covered by the 2002 VNDHS is defined as the universe of all women age 15-49 in Vietnam.
Sample survey data
The sample for the VNDHS 2002 was based on that used in the VNDHS 1997, which in turn was a subsample of the 1996 Multi-Round Demographic Survey (MRS), a semi-annual survey of about 243,000 households undertaken regularly by GSO. The MRS sample consisted of 1,590 sample areas known as enumeration areas (EAs) spread throughout the 53 provinces/cities of Vietnam, with 30 EAs in each province. On average, an EA comprises about 150 households. For the VNDHS 1997, a subsample of 205 EAs was selected, with 26 households in each urban EA and 39 households for each rural EA. A total of 7,150 households was selected for the survey. The VNDHS 1997 was designed to provide separate estimates for the whole country, urban and rural areas, for 18 project provinces and the remaining nonproject provinces as well. Because the main objective of the VNDHS 2002 was to measure change in reproductive health indicators over the five years since the VNDHS 1997, the sample design for the VNDHS 2002 was as similar as possible to that of the VNDHS 1997.
Although it would have been ideal to have returned to the same households or at least the same sample points as were selected for the VNDHS 1997, several factors made this undesirable. Revisiting the same households would have held the sample artificially rigid over time and would not allow for newly formed households. This would have conflicted with the other major survey objective, which was to provide up-to-date, representative data for the whole of Vietnam. Revisiting the same sample points that were covered in 1997 was complicated by the fact that the country had conducted a population census in 1999, which allowed for a more representative sample frame.
In order to balance the two main objectives of measuring change and providing representative data, it was decided to select enumeration areas from the 1999 Population Census, but to cover the same communes that were sampled in the VNDHS 1997 and attempt to obtain a sample point as close as possible to that selected in 1997. Consequently, the VNDHS 2002 sample also consisted of 205 sample points and reflects the oversampling in the 20 provinces that fall in the World Bank-supported Population and Family Health Project. The sample was designed to produce about 7,000 completed household interviews and 5,600 completed interviews with ever-married women age 15-49.
Face-to-face
As in the VNDHS 1997, three types of questionnaires were used in the 2002 survey: the Household Questionnaire, the Individual Woman's Questionnaire, and the Community/Health Facility Questionnaire. The first two questionnaires were based on the DHS Model A Questionnaire, with additions and modifications made during an ORC Macro staff visit in July 2002. The questionnaires were pretested in two clusters in Hanoi (one in a rural area and another in an urban area). After the pretest and consultation with ORC Macro, the drafts were revised for use in the main survey.
a) The Household Questionnaire was used to enumerate all usual members and visitors in selected households and to collect information on age, sex, education, marital status, and relationship to the head of household. The main purpose of the Household Questionnaire was to identify persons who were eligible for individual interview (i.e. ever-married women age 15-49). In addition, the Household Questionnaire collected information on characteristics of the household such as water source, type of toilet facilities, material used for the floor and roof, and ownership of various durable goods.
b) The Individual Questionnaire was used to collect information on ever-married women aged 15-49 in surveyed households. These women were interviewed on the following topics:
- Respondent's background characteristics (education, residential history, etc.);
- Reproductive history;
- Contraceptive knowledge and use;
- Antenatal and delivery care;
- Infant feeding practices;
- Child immunization;
- Fertility preferences and attitudes about family planning;
- Husband's background characteristics;
- Women's work information; and
- Knowledge of AIDS.
c) The Community/Health Facility Questionnaire was used to collect information on all communes in which the interviewed women lived and on services offered at the nearest health stations. The Community/Health Facility Questionnaire consisted of four sections. The first two sections collected information from community informants on some characteristics such as the major economic activities of residents, distance from people's residence to civic services and the location of the nearest sources of health care. The last two sections involved visiting the nearest commune health centers and intercommune health centers, if these centers were located within 30 kilometers from the surveyed cluster. For each visited health center, information was collected on the type of health services offered and the number of days services were offered per week; the number of assigned staff and their training; medical equipment and medicines available at the time of the visit.
The first stage of data editing was implemented by the field editors soon after each interview. Field editors and team leaders checked the completeness and consistency of all items in the questionnaires. The completed questionnaires were sent to the GSO headquarters in Hanoi by post for data processing. The editing staff of the GSO first checked the questionnaires for completeness. The data were then entered into microcomputers and edited using a software program specially developed for the DHS program, the Census and Survey Processing System, or CSPro. Data were verified on a 100 percent basis, i.e., the data were entered separately twice and the two results were compared and corrected. The data processing and editing staff of the GSO were trained and supervised for two weeks by a data processing specialist from ORC Macro. Office editing and processing activities were initiated immediately after the beginning of the fieldwork and were completed in late December 2002.
The results of the household and individual
The Gallup Poll Social Series (GPSS) is a set of public opinion surveys designed to monitor U.S. adults' views on numerous social, economic, and political topics. The topics are arranged thematically across 12 surveys. Gallup administers these surveys during the same month every year and includes the survey's core trend questions in the same order each administration. Using this consistent standard allows for unprecedented analysis of changes in trend data that are not susceptible to question order bias and seasonal effects.
Introduced in 2001, the GPSS is the primary method Gallup uses to update several hundred long-term Gallup trend questions, some dating back to the 1930s. The series also includes many newer questions added to address contemporary issues as they emerge.
The dataset currently includes responses from up to and including 2025.
Gallup conducts one GPSS survey per month, with each devoted to a different topic, as follows:
January: Mood of the Nation
February: World Affairs
March: Environment
April: Economy and Finance
May: Values and Beliefs
June: Minority Rights and Relations (discontinued after 2016)
July: Consumption Habits
August: Work and Education
September: Governance
October: Crime
November: Health
December: Lifestyle (conducted 2001-2008)
The core questions of the surveys differ each month, but several questions assessing the state of the nation are standard on all 12: presidential job approval, congressional job approval, satisfaction with the direction of the U.S., assessment of the U.S. job market, and an open-ended measurement of the nation's "most important problem." Additionally, Gallup includes extensive demographic questions on each survey, allowing for in-depth analysis of trends.
Interviews are conducted with U.S. adults aged 18 and older living in all 50 states and the District of Columbia using a dual-frame design, which includes both landline and cellphone numbers. Gallup samples landline and cellphone numbers using random-digit-dial methods. Gallup purchases samples for this study from Survey Sampling International (SSI). Gallup chooses landline respondents at random within each household based on which member had the next birthday. Each sample of national adults includes a minimum quota of 70% cellphone respondents and 30% landline respondents, with additional minimum quotas by time zone within region. Gallup conducts interviews in Spanish for respondents who are primarily Spanish-speaking.
Gallup interviews a minimum of 1,000 U.S. adults aged 18 and older for each GPSS survey. Samples for the June Minority Rights and Relations survey are significantly larger because Gallup includes oversamples of Blacks and Hispanics to allow for reliable estimates among these key subgroups.
Gallup weights samples to correct for unequal selection probability, nonresponse, and double coverage of landline and cellphone users in the two sampling frames. Gallup also weights its final samples to match the U.S. population according to gender, age, race, Hispanic ethnicity, education, region, population density, and phone status (cellphone only, landline only, both, and cellphone mostly).
Demographic weighting targets are based on the most recent Current Population Survey figures for the aged 18 and older U.S. population. Phone status targets are based on the most recent National Health Interview Survey. Population density targets are based on the most recent U.S. Census.
The year appended to each table name represents when the data was last updated. For example, January: Mood of the Nation - 2025** **has survey data collected up to and including 2025.
For more information about what survey questions were asked over time, see the Supporting Files.
Data access is required to view this section.
The 1998 Ghana Demographic and Health Survey (GDHS) is the latest in a series of national-level population and health surveys conducted in Ghana and it is part of the worldwide MEASURE DHS+ Project, designed to collect data on fertility, family planning, and maternal and child health.
The primary objective of the 1998 GDHS is to provide current and reliable data on fertility and family planning behaviour, child mortality, children’s nutritional status, and the utilisation of maternal and child health services in Ghana. Additional data on knowledge of HIV/AIDS are also provided. This information is essential for informed policy decisions, planning and monitoring and evaluation of programmes at both the national and local government levels.
The long-term objectives of the survey include strengthening the technical capacity of the Ghana Statistical Service (GSS) to plan, conduct, process, and analyse the results of complex national sample surveys. Moreover, the 1998 GDHS provides comparable data for long-term trend analyses within Ghana, since it is the third in a series of demographic and health surveys implemented by the same organisation, using similar data collection procedures. The GDHS also contributes to the ever-growing international database on demographic and health-related variables.
National
Sample survey data
The major focus of the 1998 GDHS was to provide updated estimates of important population and health indicators including fertility and mortality rates for the country as a whole and for urban and rural areas separately. In addition, the sample was designed to provide estimates of key variables for the ten regions in the country.
The list of Enumeration Areas (EAs) with population and household information from the 1984 Population Census was used as the sampling frame for the survey. The 1998 GDHS is based on a two-stage stratified nationally representative sample of households. At the first stage of sampling, 400 EAs were selected using systematic sampling with probability proportional to size (PPS-Method). The selected EAs comprised 138 in the urban areas and 262 in the rural areas. A complete household listing operation was then carried out in all the selected EAs to provide a sampling frame for the second stage selection of households. At the second stage of sampling, a systematic sample of 15 households per EA was selected in all regions, except in the Northern, Upper West and Upper East Regions. In order to obtain adequate numbers of households to provide reliable estimates of key demographic and health variables in these three regions, the number of households in each selected EA in the Northern, Upper West and Upper East regions was increased to 20. The sample was weighted to adjust for over sampling in the three northern regions (Northern, Upper East and Upper West), in relation to the other regions. Sample weights were used to compensate for the unequal probability of selection between geographically defined strata.
The survey was designed to obtain completed interviews of 4,500 women age 15-49. In addition, all males age 15-59 in every third selected household were interviewed, to obtain a target of 1,500 men. In order to take cognisance of non-response, a total of 6,375 households nation-wide were selected.
Note: See detailed description of sample design in APPENDIX A of the survey report.
Face-to-face
Three types of questionnaires were used in the GDHS: the Household Questionnaire, the Women’s Questionnaire, and the Men’s Questionnaire. These questionnaires were based on model survey instruments developed for the international MEASURE DHS+ programme and were designed to provide information needed by health and family planning programme managers and policy makers. The questionnaires were adapted to the situation in Ghana and a number of questions pertaining to on-going health and family planning programmes were added. These questionnaires were developed in English and translated into five major local languages (Akan, Ga, Ewe, Hausa, and Dagbani).
The Household Questionnaire was used to enumerate all usual members and visitors in a selected household and to collect information on the socio-economic status of the household. The first part of the Household Questionnaire collected information on the relationship to the household head, residence, sex, age, marital status, and education of each usual resident or visitor. This information was used to identify women and men who were eligible for the individual interview. For this purpose, all women age 15-49, and all men age 15-59 in every third household, whether usual residents of a selected household or visitors who slept in a selected household the night before the interview, were deemed eligible and interviewed. The Household Questionnaire also provides basic demographic data for Ghanaian households. The second part of the Household Questionnaire contained questions on the dwelling unit, such as the number of rooms, the flooring material, the source of water and the type of toilet facilities, and on the ownership of a variety of consumer goods.
The Women’s Questionnaire was used to collect information on the following topics: respondent’s background characteristics, reproductive history, contraceptive knowledge and use, antenatal, delivery and postnatal care, infant feeding practices, child immunisation and health, marriage, fertility preferences and attitudes about family planning, husband’s background characteristics, women’s work, knowledge of HIV/AIDS and STDs, as well as anthropometric measurements of children and mothers.
The Men’s Questionnaire collected information on respondent’s background characteristics, reproduction, contraceptive knowledge and use, marriage, fertility preferences and attitudes about family planning, as well as knowledge of HIV/AIDS and STDs.
A total of 6,375 households were selected for the GDHS sample. Of these, 6,055 were occupied. Interviews were completed for 6,003 households, which represent 99 percent of the occupied households. A total of 4,970 eligible women from these households and 1,596 eligible men from every third household were identified for the individual interviews. Interviews were successfully completed for 4,843 women or 97 percent and 1,546 men or 97 percent. The principal reason for nonresponse among individual women and men was the failure of interviewers to find them at home despite repeated callbacks.
Note: See summarized response rates by place of residence in Table 1.1 of the survey report.
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors, and (2) sampling errors. Nonsampling errors are the results of shortfalls made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 1998 GDHS to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 1998 GDHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 1998 GDHS sample is the result of a two-stage stratified design, and, consequently, it was necessary to use more complex formulae. The computer software used to calculate sampling errors for the 1998 GDHS is the ISSA Sampling Error Module. This module uses the Taylor linearization method of variance estimation for survey estimates that are means or proportions. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.
Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Age distribution of eligible and interviewed men - Completeness of reporting - Births by calendar years - Reporting of age at death in days - Reporting of age at death in months
Note: See detailed tables in APPENDIX C of the survey report.
The 2017-18 Albania Demographic and Health Survey (2017-18 ADHS) is a nationwide survey with a nationally representative sample of approximately 17,160 households. All women age 15-49 who are usual residents of the selected households or who slept in the households the night before the survey were eligible for the survey. Women 50-59 years old were interviewed with an abbreviated questionnaire that only covered background characteristics and questions related to noncommunicable diseases.
The primary objective of the 2017-2018 ADHS was to provide estimates of basic sociodemographic and health indicators for the country as a whole and the twelve prefectures. Specifically, the survey collected information on basic characteristics of the respondents, fertility, family planning, nutrition, maternal and child health, knowledge of HIV behaviors, health-related lifestyle, and noncommunicable diseases (NCDs). The information collected in the ADHS will assist policymakers and program managers in evaluating and designing programs and in developing strategies for improving the health of the country’s population.
The sample for the 2017-18 ADHS was designed to produce representative results for the country as a whole, for urban and rural areas separately, and for each of the twelve prefectures known as Berat, Diber, Durres, Elbasan, Fier, Gjirokaster, Korce, Kukes, Lezhe, Shkoder, Tirana, and Vlore.
National coverage
The survey covered all de jure household members (usual residents), children age 0-4 years, women age 15-49 years and men age 15-59 years resident in the household.
Sample survey data [ssd]
The ADHS surveys were done on a nationally representative sample that was representative at the prefecture level as well by rural and urban areas. A total of 715 enumeration areas (EAs) were selected as sample clusters, with probability proportional to each prefecture's population size. The sample design called for 24 households to be randomly selected in every sampling cluster, regardless of its size, but some of the EAs contained fewer than 24 households. In these EAs, all households were included in the survey. The EAs are considered the sample's primary sampling unit (PSU). The team of interviewers updated and listed the households in the selected EAs. Upon arriving in the selected clusters, interviewers spent the first day of fieldwork carrying out an exhaustive enumeration of households, recording the name of each head of household and the location of the dwelling. The listing was done with tablet PCs, using a digital listing application. When interviewers completed their respective sections of the EA, they transferred their files into the supervisor's tablet PC, where the information was automatically compiled into a single file in which all households in the EA were entered. The software and field procedures were designed to ensure there were no duplications or omissions during the household listing process. The supervisor used the software in his tablet to randomly select 24 households for the survey from the complete list of households.
All women age 15-49 who were usual residents of the selected households or who slept in the households the night before the survey were eligible for individual interviews with the full Woman's Questionnaire. Women age 50-59 were also interviewed, but with an abbreviated questionnaire that left out all questions related to reproductive health and mother and child health. A 50% subsample was selected for the survey of men. Every man age 15-59 who was a usual resident of or had slept in the household the night before the survey was eligible for an individual interview in these households.
For further details on sample design, see Appendix A of the final report.
Face-to-face [f2f]
Four questionnaires were used in the ADHS, one for the household and others for women age 15-49, for women age 50-59, and for men age 15-59. In addition to these four questionnaires, a form was used to record the vaccination information for children born in the 5 years preceding the survey whose mothers had been successfully interviewed.
Supervisors sent the accumulated fieldwork data to INSTAT’s central office via internet every day, unless for some reason the teams did not have access to the internet at the time. The data received from the various teams were combined into a single file, which was used to produce quality control tables, known as field check tables. These tables reveal systematic errors in the data such as omission of potential respondents, age displacement, inaccurate recording of date of birth and age at death, inaccurate measurement of height and weight, and other key indicators of data quality. These tables were reviewed and evaluated by ADHS senior staff, which in turn provided feedback and advice to the teams in the field.
A total of 16,955 households were selected for the sample, of which 16,634 were occupied. Of the occupied households, 15,823 were successfully interviewed, which represents a response rate of 95%. In the interviewed households, 11,680 women age 15-49 were identified for individual interviews. Interviews were completed for 10,860 of these women, yielding a response rate of 93%. In the same households, 4,289 women age 50-59 were identified, of which 4,140 were successfully interviewed, yielding a 97% response rate. In the 50% subsample of households selected for the male survey, 7,103 eligible men age 15-59 were identified, of which 6,142 were successfully interviewed, yielding a response rate of 87%.
Response rates were higher in rural than in urban areas, which is a pattern commonly found in household surveys because in urban areas more people work and carry out activities outside the home.
The estimates from a sample survey are affected by two types of errors: nonsampling errors and sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2017-18 Albania Demographic and Health Survey (ADHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2017-18 ADHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
Sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95% of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2017-18 ADHS sample is the result of a multi-stage stratified design, and, consequently, it was necessary to use more complex formulas. Sampling errors are computed in SAS, using programs developed by ICF. These programs use the Taylor linearization method to estimate variances for survey estimates that are means, proportions, or ratios. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.
A more detailed description of estimates of sampling errors are presented in Appendix B of the survey final report.
Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Age distribution of eligible and interviewed men - Completeness of reporting - Births by calendar years - Reporting of age at death in days - Reporting of age at death in months
See details of the data quality tables in Appendix C of the survey final report.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
The STAMINA study examined the nutritional risks of low-income peri-urban mothers, infants and young children (IYC), and households in Peru during the COVID-19 pandemic. The study was designed to capture information through three, repeated cross-sectional surveys at approximately 6 month intervals over an 18 month period, starting in December 2020. The surveys were carried out by telephone in November-December 2020, July-August 2021 and in February-April 2022. The third survey took place over a longer period to allow for a household visit after the telephone interview.The study areas were Manchay (Lima) and Huánuco district in the Andean highlands (~ 1900m above sea level).In each study area, we purposively selected the principal health centre and one subsidiary health centre. Peri-urban communities under the jurisdiction of these health centres were then selected to participate. Systematic random sampling was employed with quotas for IYC age (6-11, 12-17 and 18-23 months) to recruit a target sample size of 250 mother-infant pairs for each survey.Data collected included: household socio-demographic characteristics; infant and young child feeding practices (IYCF), child and maternal qualitative 24-hour dietary recalls/7 day food frequency questionnaires, household food insecurity experience measured using the validated Food Insecurity Experience Scale (FIES) survey module (Cafiero, Viviani, & Nord, 2018), and maternal mental health.In addition, questions that assessed the impact of COVID-19 on households including changes in employment status, adaptations to finance, sources of financial support, household food insecurity experience as well as access to, and uptake of, well-child clinics and vaccination health services were included.This folder includes the questionnaire for survey 3 in both English and Spanish languages.The corresponding dataset and dictionary of variables for survey 3 are available at 10.17028/rd.lboro.21741014
The 1993 Ghana Demographic and Health Survey (GDHS) is a nationally representative survey of 4,562 women age 15-49 and 1,302 men age 15-59. The survey is designed to furnish policymakers, planners and program managers with factual, reliable and up-to-date information on fertility, family planning and the status of maternal and child health care in the country. The survey, which was carried out by the Ghana Statistical Service (GSS), marks Ghana's second participation in the worldwide Demographic and Health Surveys (DHS) program.
The principal objective of the 1993 GDHS is to generate reliable and current information on fertility, mortality, contraception and maternal and child health indicators. Such data are necessary for effective policy formulation as well as program design, monitoring and evaluation. The 1993 GDHS is, in large measure, an update to the 1988 GDHS. Together, the two surveys provide comparable information for two points in time, thus allowing assessment of changes and trends in various demographic and health indicators over time.
Long-term objectives of the survey include (i) strengthening the capacity of the Ghana Statistical Service to plan, conduct, process and analyze data from a complex, large-scale survey such as the Demographic and Health Survey, and (ii) contributing to the ever-expanding international database on demographic and health-related variables.
National
Sample survey data
The 1993 GDHS is a stratified, self-weighting, nationally representative sample of households chosen from 400 Enumeration Areas (EAs). The 1984 Population Census EAs constituted the sampling frame. The frame was first stratified into three ecological zones, namely coastal, forest and savannah, and then into urban and rural EAs. The EAs were selected with probability proportional to the number of households. Households within selected EAs were subsequently listed and a systematic sample of households was selected for the survey. The survey was designed to yield a sample of 5,400 women age 15-49 and a sub-sample of males age 15-59 systematically selected from one-third of the 400 EAs.
Note: See detailed description of sample design in APPENDIX A of the survey report.
Face-to-face
Survey instruments used to elicit information for the 1993 GDHS are 1) Household Schedule 2) Women's Questionnaire and 3) Men's Questionnaire.
The questionnaires were structured based on the Demographic and Health Survey Model B Questionnaire designed for countries with low levels of contraceptive use. The final version of the questionnaires evolved out of a series of meetings with personnel of relevant ministries, institutions and organizations engaged in activities relating to fertility and family planning, health and nutrition and rehabilitation of persons with disabilities.
The questionnaires were first developed in English and later translated and printed in five major local languages, namely: Akan, Dagbani, Ewe, Ga, and Hausa. In the selected households, all usual members and visitors were listed in the household schedule. Background information, such as age, sex, relationship to head of household, marital status and level of education, was collected on each listed person. Questions on economic activity, occupation, industry, employment status, number of days worked in the past week and number of hours worked per day was asked of all persons age seven years and over. Those who did not work during the reference period were asked whether or not they actively looked for work.
Information on the health and disability status of all persons was also collected in the household schedule. Migration history was elicited from all persons age 15 years and over, as well as information on the survival status and residence of natural parents of all children less than 15 years in the household.
Data on source of water supply, type of toilet facility, number of sleeping rooms available to the household, material of floor and ownership of specified durable consumer goods were also elicited.
Finally, the household schedule was the instrument used to identify eligible women and men from whom detailed information was collected during the individual interview.
The women's questionnaire was used to collect information on eligible women identified in the household schedule. Eligible women were defined as those age 15-49 years who are usual members of the household and visitors who spent the night before the interview with the household. Questions asked in the questionnaire were on the following topics:
All female respondents with at least one live birth since January 1990 and their children born since 1st January 1990 had their height and weight taken.
The men's questionnaire was administered to men in sample households in a third of selected EAs. An eligible man was 15-59 years old who is either a usual household member or a visitor who spent the night preceding the day of interview with the household.
Topics enquired about in the men's questionnaire included the following: - Background Characteristics - Reproductive History - Contraceptive Knowledge and Use - Marriage - Fertility Preferences - Knowledge of AIDS and Other STDs.
Questionnaires from the field were sent to the secretariat at the Head Office for checking and office editing. The office editing, which was undertaken by two officers, involved correcting inconsistencies in the questionnaire responses and coding open-ended questions. The questionnaires were then forwarded to the data processing unit for data entry. Data capture and verification were undertaken by four data entry operators. Nearly 20 percent of the questionnaires were verified. This phase of the survey covered four and a half months - that is, from mid-October, 1993 to the end of February, 1994.
After the data entry, three professional staff members performed the secondary editing of questionnaires that were flagged either because entries were inconsistent or values of specific variables were out of range or missing. The secondary editing was completed on 17th March, 1994 and the tables for the preliminary report were generated on 18th March, 1994. The software package used for the data processing was the Integrated System for Survey Analysis (ISSA).
A sample of 6,161 households was selected, from which 5,919 households were contacted for interview. Interviews were successfully completed in 5,822 households, indicating a household response rate of 98 percent. About 3 percent of selected households were absent during the interviewing period, and are excluded from the calculations of the response rate.
Even though the sample was designed to yield interviews with nearly 5,400 women age 15-49 only 4,700 women were identified as eligible for the individual interview. Individual interviews were successfully completed for 4,562 eligible women, giving a response rate of 97 percent. Similarly, instead of the expected 1,700 eligible men being identified in the households only 1,354 eligible men were found and 1,302 of these were successfully interviewed, with a response rate of 96 percent.
The principal reason for non-response among eligible women and men was not finding them at home despite repeated visits to the households. However, refusal rates for both eligible women and men were low, 0.3 percent and 0.2 percent, respectively.
Note: See summarized response rates in Table 1.1 of the survey report.
The results from sample surveys are affected by two types of errors, non-sampling error and sampling error. Non-sampling error is due to mistakes made in carrying out field activities, such as failure to locate and interview the correct household, errors in the way the questions are asked, misunderstanding on the part of either the interviewer or the respondent, data entry errors, etc. Although efforts were made during the design and implementation of the 1993 GDHS to minimize this type of error, non-sampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be measured statistically. The sample of eligible women selected in the 1993 GDHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each one would have yielded results that differed somewhat from the actual sample selected. The sampling error is a measure of the variability between all possible samples; although it is not known exactly, it can be estimated from the survey results.
Sampling error is usually measured in terms of standard error of a particular statistic (mean, percentage, etc.), which is the square root of the variance of the statistic. The standard error can be used to calculate confidence intervals within which, apart from non-sampling errors, the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that same statistic as measured in 95 percent of all possible samples with the same design (and expected size) will fall within a range
The 2013 Turkey Demographic and Health Survey (TDHS-2013) is a nationally representative sample survey. The primary objective of the TDHS-2013 is to provide data on socioeconomic characteristics of households and women between ages 15-49, fertility, childhood mortality, marriage patterns, family planning, maternal and child health, nutritional status of women and children, and reproductive health. The survey obtained detailed information on these issues from a sample of women of reproductive age (15-49). The TDHS-2013 was designed to produce information in the field of demography and health that to a large extent cannot be obtained from other sources.
Specifically, the objectives of the TDHS-2013 included: - Collecting data at the national level that allows the calculation of some demographic and health indicators, particularly fertility rates and childhood mortality rates, - Obtaining information on direct and indirect factors that determine levels and trends in fertility and childhood mortality, - Measuring the level of contraceptive knowledge and practice by contraceptive method and some background characteristics, i.e., region and urban-rural residence, - Collecting data relative to maternal and child health, including immunizations, antenatal care, and postnatal care, assistance at delivery, and breastfeeding, - Measuring the nutritional status of children under five and women in the reproductive ages, - Collecting data on reproductive-age women about marriage, employment status, and social status
The TDHS-2013 information is intended to provide data to assist policy makers and administrators to evaluate existing programs and to design new strategies for improving demographic, social and health policies in Turkey. Another important purpose of the TDHS-2013 is to sustain the flow of information for the interested organizations in Turkey and abroad on the Turkish population structure in the absence of a reliable and sufficient vital registration system. Additionally, like the TDHS-2008, TDHS-2013 is accepted as a part of the Official Statistic Program.
National coverage
The survey covered all de jure household members (usual residents), children age 0-5 years and women age 15-49 years resident in the household.
Sample survey data [ssd]
The sample design and sample size for the TDHS-2013 makes it possible to perform analyses for Turkey as a whole, for urban and rural areas, and for the five demographic regions of the country (West, South, Central, North, and East). The TDHS-2013 sample is of sufficient size to allow for analysis on some of the survey topics at the level of the 12 geographical regions (NUTS 1) which were adopted at the second half of the year 2002 within the context of Turkey’s move to join the European Union.
In the selection of the TDHS-2013 sample, a weighted, multi-stage, stratified cluster sampling approach was used. Sample selection for the TDHS-2013 was undertaken in two stages. The first stage of selection included the selection of blocks as primary sampling units from each strata and this task was requested from the TURKSTAT. The frame for the block selection was prepared using information on the population sizes of settlements obtained from the 2012 Address Based Population Registration System. Settlements with a population of 10,000 and more were defined as “urban”, while settlements with populations less than 10,000 were considered “rural” for purposes of the TDHS-2013 sample design. Systematic selection was used for selecting the blocks; thus settlements were given selection probabilities proportional to their sizes. Therefore more blocks were sampled from larger settlements.
The second stage of sample selection involved the systematic selection of a fixed number of households from each block, after block lists were obtained from TURKSTAT and were updated through a field operation; namely the listing and mapping fieldwork. Twentyfive households were selected as a cluster from urban blocks, and 18 were selected as a cluster from rural blocks. The total number of households selected in TDHS-2013 is 14,490.
The total number of clusters in the TDHS-2013 was set at 642. Block level household lists, each including approximately 100 households, were provided by TURKSTAT, using the National Address Database prepared for municipalities. The block lists provided by TURKSTAT were updated during the listing and mapping activities.
All women at ages 15-49 who usually live in the selected households and/or were present in the household the night before the interview were regarded as eligible for the Women’s Questionnaire and were interviewed. All analysis in this report is based on de facto women.
Note: A more technical and detailed description of the TDHS-2013 sample design, selection and implementation is presented in Appendix B of the final report of the survey.
Face-to-face [f2f]
Two main types of questionnaires were used to collect the TDHS-2013 data: the Household Questionnaire and the Individual Questionnaire for all women of reproductive age. The contents of these questionnaires were based on the DHS core questionnaire. Additions, deletions and modifications were made to the DHS model questionnaire in order to collect information particularly relevant to Turkey. Attention also was paid to ensuring the comparability of the TDHS-2013 findings with previous demographic surveys carried out by the Hacettepe Institute of Population Studies. In the process of designing the TDHS-2013 questionnaires, national and international population and health agencies were consulted for their comments.
The questionnaires were developed in Turkish and translated into English.
TDHS-2013 questionnaires were returned to the Hacettepe University Institute of Population Studies by the fieldwork teams for data processing as soon as interviews were completed in a province. The office editing staff checked that the questionnaires for all selected households and eligible respondents were returned from the field. A total of 29 data entry staff were trained for data entry activities of the TDHS-2013. The data entry of the TDHS-2013 began in late September 2013 and was completed at the end of January 2014.
The data were entered and edited on microcomputers using the Census and Survey Processing System (CSPro) software. CSPro is designed to fulfill the census and survey data processing needs of data-producing organizations worldwide. CSPro is developed by MEASURE partners, the U.S. Bureau of the Census, ICF International’s DHS Program, and SerPro S.A. CSPro allows range, skip, and consistency errors to be detected and corrected at the data entry stage. During the data entry process, 100% verification was performed by entering each questionnaire twice using different data entry operators and comparing the entered data.
In all, 14,490 households were selected for the TDHS-2013. At the time of the listing phase of the survey, 12,640 households were considered occupied and, thus, eligible for interview. Of the eligible households, 93 percent (11,794) households were successfully interviewed. The main reasons the field teams were unable to interview some households were because some dwelling units that had been listed were found to be vacant at the time of the interview or the household was away for an extended period.
In the interviewed 11,794 households, 10,840 women were identified as eligible for the individual interview, aged 15-49 and were present in the household on the night before the interview. Interviews were successfully completed with 9,746 of these women (90 percent). Among the eligible women not interviewed in the survey, the principal reason for nonresponse was the failure to find the women at home after repeated visits to the household.
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors, and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the TDHS-2013 to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the TDHS-2013 is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall
The 2015-16 Armenia Demographic and Health Survey (2015-16 ADHS) is the fourth in a series of nationally representative sample surveys designed to provide information on population and health issues. It is conducted in Armenia under the worldwide Demographic and Health Surveys program. Specifically, the objective of the 2015-16 ADHS is to provide current and reliable information on fertility and abortion levels, marriage, sexual activity, fertility preferences, awareness and use of family planning methods, breastfeeding practices, nutritional status of young children, childhood mortality, maternal and child health, domestic violence against women, child discipline, awareness and behavior regarding AIDS and other sexually transmitted infections (STIs), and other health-related issues such as smoking, tuberculosis, and anemia. The survey obtained detailed information on these issues from women of reproductive age and, for certain topics, from men as well.
The 2015-16 ADHS results are intended to provide information needed to evaluate existing social programs and to design new strategies to improve the health of and health services for the people of Armenia. Data are presented by region (marz) wherever sample size permits. The information collected in the 2015-16 ADHS will provide updated estimates of basic demographic and health indicators covered in the 2000, 2005, and 2010 surveys.
The long-term objective of the survey includes strengthening the technical capacity of major government institutions, including the NSS. The 2015-16 ADHS also provides comparable data for longterm trend analysis because the 2000, 2005, 2010, and 2015-16 surveys were implemented by the same organization and used similar data collection procedures. It also adds to the international database of demographic and health–related information for research purposes.
National coverage
The survey covered all de jure household members (usual residents), children age 0-4 years, women age 15-49 years and men age 15-49 years resident in the household.
Sample survey data [ssd]
The sample was designed to produce representative estimates of key indicators at the national level, for Yerevan, and for total urban and total rural areas separately. Many indicators can also be estimated at the regional (marz) level.
The sampling frame used for the 2015-16 ADHS is the Armenia Population and Housing Census, which was conducted in Armenia in 2011 (APHC 2011). The sampling frame is a complete list of enumeration areas (EAs) covering the whole country, a total number of 11,571 EAs, provided by the National Statistical Service (NSS) of Armenia, the implementing agency for the 2015-16 ADHS. This EA frame was created from the census data base by summarizing the households down to EA level. A representative probability sample of 8,749 households was selected for the 2015-16 ADHS sample. The sample was selected in two stages. In the first stage, 313 clusters (192 in urban areas and 121 in rural areas) were selected from a list of EAs in the sampling frame. In the second stage, a complete listing of households was carried out in each selected cluster. Households were then systematically selected for participation in the survey. Appendix A provides additional information on the sample design of the 2015-16 Armenia DHS. Because of the approximately equal sample size in each marz, the sample is not self-weighting at the national level, and weighting factors have been calculated, added to the data file, and applied so that results are representative at the national level.
For further details on sample design, see Appendix A of the final report.
Face-to-face [f2f]
Five questionnaires were used for the 2015-16 ADHS: the Household Questionnaire, the Woman’s Questionnaire, the Man’s Questionnaire, the Biomarker Questionnaire, and the Fieldworker Questionnaire. These questionnaires, based on The DHS Program’s standard Demographic and Health Survey questionnaires, were adapted to reflect the population and health issues relevant to Armenia. Input was solicited from various stakeholders representing government ministries and agencies, nongovernmental organizations, and international donors. After all questionnaires were finalized in English, they were translated into Armenian. They were pretested in September-October 2015.
The processing of the 2015-16 ADHS data began shortly after fieldwork commenced. All completed questionnaires were edited immediately by field editors while still in the field and checked by the supervisors before being dispatched to the data processing center at the NSS central office in Yerevan. These completed questionnaires were edited and entered by 15 data processing personnel specially trained for this task. All data were entered twice for 100 percent verification. Data were entered using the CSPro computer package. The concurrent processing of the data was an advantage because the senior ADHS technical staff were able to advise field teams of problems detected during the data entry. In particular, tables were generated to check various data quality parameters. Moreover, the double entry of data enabled easy comparison and identification of errors and inconsistencies. As a result, specific feedback was given to the teams to improve performance. The data entry and editing phase of the survey was completed in June 2016.
A total of 8,749 households were selected in the sample, of which 8,205 were occupied at the time of the fieldwork. The main reason for the difference is that some of the dwelling units that were occupied during the household listing operation were either vacant or the household was away for an extended period at the time of interviewing. The number of occupied households successfully interviewed was 7,893, yielding a household response rate of 96 percent. The household response rate in urban areas (96 percent) was nearly the same as in rural areas (97 percent).
In these households, a total of 6,251 eligible women were identified; interviews were completed with 6,116 of these women, yielding a response rate of 98 percent. In one-half of the households, a total of 2,856 eligible men were identified, and interviews were completed with 2,755 of these men, yielding a response rate of 97 percent. Among men, response rates are slightly lower in urban areas (96 percent) than in rural areas (97 percent), whereas rates for women are the same in urban and in rural areas (98 percent).
The 2015-16 ADHS achieved a slightly higher response rate for households than the 2010 ADHS (NSS 2012). The increase is only notable for urban households (96 percent in 2015-16 compared with 94 percent in 2010). Response rates in all other categories are very close to what they were in 2010.
SAS computer software were used to calculate sampling errors for the 2015-16 ADHS. The programs used the Taylor linearization method of variance estimation for means or proportions and the Jackknife repeated replication method for variance estimation of more complex statistics such as fertility and mortality rates.
A more detailed description of estimates of sampling errors are presented in Appendix B of the survey final report.
Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Age distribution of eligible and interviewed men - Completeness of reporting - Births by calendar years - Reporting of age at death in days - Reporting of age at death in months - Nutritional status of children based on the NCHS/CDC/WHO International Reference Population - Vaccinations by background characteristics for children age 18-29 months
See details of the data quality tables in Appendix C of the survey final report.
The primary objective of the 2017 Indonesia Dmographic and Health Survey (IDHS) is to provide up-to-date estimates of basic demographic and health indicators. The IDHS provides a comprehensive overview of population and maternal and child health issues in Indonesia. More specifically, the IDHS was designed to: - provide data on fertility, family planning, maternal and child health, and awareness of HIV/AIDS and sexually transmitted infections (STIs) to help program managers, policy makers, and researchers to evaluate and improve existing programs; - measure trends in fertility and contraceptive prevalence rates, and analyze factors that affect such changes, such as residence, education, breastfeeding practices, and knowledge, use, and availability of contraceptive methods; - evaluate the achievement of goals previously set by national health programs, with special focus on maternal and child health; - assess married men’s knowledge of utilization of health services for their family’s health and participation in the health care of their families; - participate in creating an international database to allow cross-country comparisons in the areas of fertility, family planning, and health.
National coverage
The survey covered all de jure household members (usual residents), all women age 15-49 years resident in the household, and all men age 15-54 years resident in the household.
Sample survey data [ssd]
The 2017 IDHS sample covered 1,970 census blocks in urban and rural areas and was expected to obtain responses from 49,250 households. The sampled households were expected to identify about 59,100 women age 15-49 and 24,625 never-married men age 15-24 eligible for individual interview. Eight households were selected in each selected census block to yield 14,193 married men age 15-54 to be interviewed with the Married Man's Questionnaire. The sample frame of the 2017 IDHS is the Master Sample of Census Blocks from the 2010 Population Census. The frame for the household sample selection is the updated list of ordinary households in the selected census blocks. This list does not include institutional households, such as orphanages, police/military barracks, and prisons, or special households (boarding houses with a minimum of 10 people).
The sampling design of the 2017 IDHS used two-stage stratified sampling: Stage 1: Several census blocks were selected with systematic sampling proportional to size, where size is the number of households listed in the 2010 Population Census. In the implicit stratification, the census blocks were stratified by urban and rural areas and ordered by wealth index category.
Stage 2: In each selected census block, 25 ordinary households were selected with systematic sampling from the updated household listing. Eight households were selected systematically to obtain a sample of married men.
For further details on sample design, see Appendix B of the final report.
Face-to-face [f2f]
The 2017 IDHS used four questionnaires: the Household Questionnaire, Woman’s Questionnaire, Married Man’s Questionnaire, and Never Married Man’s Questionnaire. Because of the change in survey coverage from ever-married women age 15-49 in the 2007 IDHS to all women age 15-49, the Woman’s Questionnaire had questions added for never married women age 15-24. These questions were part of the 2007 Indonesia Young Adult Reproductive Survey Questionnaire. The Household Questionnaire and the Woman’s Questionnaire are largely based on standard DHS phase 7 questionnaires (2015 version). The model questionnaires were adapted for use in Indonesia. Not all questions in the DHS model were included in the IDHS. Response categories were modified to reflect the local situation.
All completed questionnaires, along with the control forms, were returned to the BPS central office in Jakarta for data processing. The questionnaires were logged and edited, and all open-ended questions were coded. Responses were entered in the computer twice for verification, and they were corrected for computer-identified errors. Data processing activities were carried out by a team of 34 editors, 112 data entry operators, 33 compare officers, 19 secondary data editors, and 2 data entry supervisors. The questionnaires were entered twice and the entries were compared to detect and correct keying errors. A computer package program called Census and Survey Processing System (CSPro), which was specifically designed to process DHS-type survey data, was used in the processing of the 2017 IDHS.
Of the 49,261 eligible households, 48,216 households were found by the interviewer teams. Among these households, 47,963 households were successfully interviewed, a response rate of almost 100%.
In the interviewed households, 50,730 women were identified as eligible for individual interview and, from these, completed interviews were conducted with 49,627 women, yielding a response rate of 98%. From the selected household sample of married men, 10,440 married men were identified as eligible for interview, of which 10,009 were successfully interviewed, yielding a response rate of 96%. The lower response rate for men was due to the more frequent and longer absence of men from the household. In general, response rates in rural areas were higher than those in urban areas.
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors and (2) sampling errors. Nonsampling errors result from mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2017 Indonesia Demographic and Health Survey (2017 IDHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2017 IDHS is only one of many samples that could have been selected from the same population, using the same design and identical size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling error is a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2017 IDHS sample is the result of a multi-stage stratified design, and, consequently, it was necessary to use more complex formulas. The computer software used to calculate sampling errors for the 2017 IDHS is a STATA program. This program used the Taylor linearization method for variance estimation for survey estimates that are means or proportions. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.
A more detailed description of estimates of sampling errors are presented in Appendix C of the survey final report.
Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Age distribution of eligible and interviewed men - Completeness of reporting - Births by calendar year - Reporting of age at death in days - Reporting of age at death in months
See details of the data quality tables in Appendix D of the survey final report.
The primary objective of the 2012 Indonesia Demographic and Health Survey (IDHS) is to provide policymakers and program managers with national- and provincial-level data on representative samples of all women age 15-49 and currently-married men age 15-54.
The 2012 IDHS was specifically designed to meet the following objectives: • Provide data on fertility, family planning, maternal and child health, adult mortality (including maternal mortality), and awareness of AIDS/STIs to program managers, policymakers, and researchers to help them evaluate and improve existing programs; • Measure trends in fertility and contraceptive prevalence rates, and analyze factors that affect such changes, such as marital status and patterns, residence, education, breastfeeding habits, and knowledge, use, and availability of contraception; • Evaluate the achievement of goals previously set by national health programs, with special focus on maternal and child health; • Assess married men’s knowledge of utilization of health services for their family’s health, as well as participation in the health care of their families; • Participate in creating an international database that allows cross-country comparisons that can be used by the program managers, policymakers, and researchers in the areas of family planning, fertility, and health in general
National coverage
Sample survey data [ssd]
Indonesia is divided into 33 provinces. Each province is subdivided into districts (regency in areas mostly rural and municipality in urban areas). Districts are subdivided into subdistricts, and each subdistrict is divided into villages. The entire village is classified as urban or rural.
The 2012 IDHS sample is aimed at providing reliable estimates of key characteristics for women age 15-49 and currently-married men age 15-54 in Indonesia as a whole, in urban and rural areas, and in each of the 33 provinces included in the survey. To achieve this objective, a total of 1,840 census blocks (CBs)-874 in urban areas and 966 in rural areas-were selected from the list of CBs in the selected primary sampling units formed during the 2010 population census.
Because the sample was designed to provide reliable indicators for each province, the number of CBs in each province was not allocated in proportion to the population of the province or its urban-rural classification. Therefore, a final weighing adjustment procedure was done to obtain estimates for all domains. A minimum of 43 CBs per province was imposed in the 2012 IDHS design.
Refer to Appendix B in the final report for details of sample design and implementation.
Face-to-face [f2f]
The 2012 IDHS used four questionnaires: the Household Questionnaire, the Woman’s Questionnaire, the Currently Married Man’s Questionnaire, and the Never-Married Man’s Questionnaire. Because of the change in survey coverage from ever-married women age 15-49 in the 2007 IDHS to all women age 15-49 in the 2012 IDHS, the Woman’s Questionnaire now has questions for never-married women age 15-24. These questions were part of the 2007 Indonesia Young Adult Reproductive Survey questionnaire.
The Household and Woman’s Questionnaires are largely based on standard DHS phase VI questionnaires (March 2011 version). The model questionnaires were adapted for use in Indonesia. Not all questions in the DHS model were adopted in the IDHS. In addition, the response categories were modified to reflect the local situation.
The Household Questionnaire was used to list all the usual members and visitors who spent the previous night in the selected households. Basic information collected on each person listed includes age, sex, education, marital status, education, and relationship to the head of the household. Information on characteristics of the housing unit, such as the source of drinking water, type of toilet facilities, construction materials used for the floor, roof, and outer walls of the house, and ownership of various durable goods were also recorded in the Household Questionnaire. These items reflect the household’s socioeconomic status and are used to calculate the household wealth index. The main purpose of the Household Questionnaire was to identify women and men who were eligible for an individual interview.
The Woman’s Questionnaire was used to collect information from all women age 15-49. These women were asked questions on the following topics: • Background characteristics (marital status, education, media exposure, etc.) • Reproductive history and fertility preferences • Knowledge and use of family planning methods • Antenatal, delivery, and postnatal care • Breastfeeding and infant and young children feeding practices • Childhood mortality • Vaccinations and childhood illnesses • Marriage and sexual activity • Fertility preferences • Woman’s work and husband’s background characteristics • Awareness and behavior regarding HIV-AIDS and other sexually transmitted infections (STIs) • Sibling mortality, including maternal mortality • Other health issues
Questions asked to never-married women age 15-24 addressed the following: • Additional background characteristics • Knowledge of the human reproduction system • Attitudes toward marriage and children • Role of family, school, the community, and exposure to mass media • Use of tobacco, alcohol, and drugs • Dating and sexual activity
The Man’s Questionnaire was administered to all currently married men age 15-54 living in every third household in the 2012 IDHS sample. This questionnaire includes much of the same information included in the Woman’s Questionnaire, but is shorter because it did not contain questions on reproductive history or maternal and child health. Instead, men were asked about their knowledge of and participation in health-careseeking practices for their children.
The questionnaire for never-married men age 15-24 includes the same questions asked to nevermarried women age 15-24.
All completed questionnaires, along with the control forms, were returned to the BPS central office in Jakarta for data processing. The questionnaires were logged and edited, and all open-ended questions were coded. Responses were entered in the computer twice for verification, and they were corrected for computeridentified errors. Data processing activities were carried out by a team of 58 data entry operators, 42 data editors, 14 secondary data editors, and 14 data entry supervisors. A computer package program called Census and Survey Processing System (CSPro), which was specifically designed to process DHS-type survey data, was used in the processing of the 2012 IDHS.
The response rates for both the household and individual interviews in the 2012 IDHS are high. A total of 46,024 households were selected in the sample, of which 44,302 were occupied. Of these households, 43,852 were successfully interviewed, yielding a household response rate of 99 percent.
Refer to Table 1.2 in the final report for more detailed summarized results of the of the 2012 IDHS fieldwork for both the household and individual interviews, by urban-rural residence.
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors, and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2012 Indonesia Demographic and Health Survey (2012 IDHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2012 IDHS is only one of many samples that could have been selected from the same population, using the same design and identical size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling error is a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2012 IDHS sample is the result of a multi-stage stratified design, and, consequently, it was necessary to use more complex formulae. The computer software used to calculate sampling errors for the 2012 IDHS is a SAS program. This program used the Taylor linearization method
The Bangladesh Demographic and Health Survey (BDHS) is part of the worldwide Demographic and Health Surveys program, which is designed to collect data on fertility, family planning, and maternal and child health.
The BDHS is intended to serve as a source of population and health data for policymakers and the research community. In general, the objectives of the BDHS are to: - assess the overall demographic situation in Bangladesh, - assist in the evaluation of the population and health programs in Bangladesh, and - advance survey methodology.
More specifically, the objective of the BDHS is to provide up-to-date information on fertility and childhood mortality levels; nuptiality; fertility preferences; awareness, approval, and use of family planning methods; breastfeeding practices; nutrition levels; and maternal and child health. This information is intended to assist policymakers and administrators in evaluating and designing programs and strategies for improving health and family planning services in the country.
National
Sample survey data
Bangladesh is divided into six administrative divisions, 64 districts (zillas), and 490 thanas. In rural areas, thanas are divided into unions and then mauzas, a land administrative unit. Urban areas are divided into wards and then mahallas. The 1996-97 BDHS employed a nationally-representative, two-stage sample that was selected from the Integrated Multi-Purpose Master Sample (IMPS) maintained by the Bangladesh Bureau of Statistics. Each division was stratified into three groups: 1 ) statistical metropolitan areas (SMAs), 2) municipalities (other urban areas), and 3) rural areas. 3 In the rural areas, the primary sampling unit was the mauza, while in urban areas, it was the mahalla. Because the primary sampling units in the IMPS were selected with probability proportional to size from the 1991 Census frame, the units for the BDHS were sub-selected from the IMPS with equal probability so as to retain the overall probability proportional to size. A total of 316 primary sampling units were utilized for the BDHS (30 in SMAs, 42 in municipalities, and 244 in rural areas). In order to highlight changes in survey indicators over time, the 1996-97 BDHS utilized the same sample points (though not necessarily the same households) that were selected for the 1993-94 BDHS, except for 12 additional sample points in the new division of Sylhet. Fieldwork in three sample points was not possible (one in Dhaka Cantonment and two in the Chittagong Hill Tracts), so a total of 313 points were covered.
Since one objective of the BDHS is to provide separate estimates for each division as well as for urban and rural areas separately, it was necessary to increase the sampling rate for Barisal and Sylhet Divisions and for municipalities relative to the other divisions, SMAs and rural areas. Thus, the BDHS sample is not self-weighting and weighting factors have been applied to the data in this report.
Mitra and Associates conducted a household listing operation in all the sample points from 15 September to 15 December 1996. A systematic sample of 9,099 households was then selected from these lists. Every second household was selected for the men's survey, meaning that, in addition to interviewing all ever-married women age 10-49, interviewers also interviewed all currently married men age 15-59. It was expected that the sample would yield interviews with approximately 10,000 ever-married women age 10-49 and 3,000 currently married men age 15-59.
Note: See detailed in APPENDIX A of the survey report.
Face-to-face
Four types of questionnaires were used for the BDHS: a Household Questionnaire, a Women's Questionnaire, a Men' s Questionnaire and a Community Questionnaire. The contents of these questionnaires were based on the DHS Model A Questionnaire, which is designed for use in countries with relatively high levels of contraceptive use. These model questionnaires were adapted for use in Bangladesh during a series of meetings with a small Technical Task Force that consisted of representatives from NIPORT, Mitra and Associates, USAID/Bangladesh, the International Centre for Diarrhoeal Disease Research, Bangladesh (ICDDR,B), Population Council/Dhaka, and Macro International Inc (see Appendix D for a list of members). Draft questionnaires were then circulated to other interested groups and were reviewed by the BDHS Technical Review Committee (see Appendix D for list of members). The questionnaires were developed in English and then translated into and printed in Bangla (see Appendix E for final version in English).
The Household Questionnaire was used to list all the usual members and visitors in the selected households. Some basic information was collected on the characteristics of each person listed, including his/her age, sex, education, and relationship to the head of the household. The main purpose of the Household Questionnaire was to identify women and men who were eligible for the individual interview. In addition, information was collected about the dwelling itself, such as the source of water, type of toilet facilities, materials used to construct the house, and ownership of various consumer goods.
The Women's Questionnaire was used to collect information from ever-married women age 10-49. These women were asked questions on the following topics: - Background characteristics (age, education, religion, etc.), - Reproductive history, - Knowledge and use of family planning methods, - Antenatal and delivery care, - Breastfeeding and weaning practices, - Vaccinations and health of children under age five, - Marriage, - Fertility preferences, - Husband's background and respondent's work, - Knowledge of AIDS, - Height and weight of children under age five and their mothers.
The Men's Questionnaire was used to interview currently married men age 15-59. It was similar to that for women except that it omitted the sections on reproductive history, antenatal and delivery care, breastfeeding, vaccinations, and height and weight. The Community Questionnaire was completed for each sample point and included questions about the existence in the community of income-generating activities and other development organizations and the availability of health and family planning services.
A total of 9,099 households were selected for the sample, of which 8,682 were successfully interviewed. The shortfall is primarily due to dwellings that were vacant or in which the inhabitants had left for an extended period at the time they were visited by the interviewing teams. Of the 8,762 households occupied, 99 percent were successfully interviewed. In these households, 9,335 women were identified as eligible for the individual interview (i.e., ever-married and age 10-49) and interviews were completed for 9,127 or 98 percent of them. In the half of the households that were selected for inclusion in the men's survey, 3,611 eligible ever-married men age 15-59 were identified, of whom 3,346 or 93 percent were interviewed.
The principal reason for non-response among eligible women and men was the failure to find them at home despite repeated visits to the household. The refusal rate was low.
Note: See summarized response rates by residence (urban/rural) in Table 1.1 of the survey report.
The estimates from a sample survey are affected by two types of errors: (1) non-sampling errors, and (2) sampling errors. Non-sampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the BDHS to minimize this type of error, non-sampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the BDHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the BDHS sample is the result of a two-stage stratified design, and, consequently, it was necessary to use more complex formulae. The computer software used to calculate sampling errors for the BDHS is the ISSA Sampling Error Module. This module used the Taylor
analyze the current population survey (cps) annual social and economic supplement (asec) with r the annual march cps-asec has been supplying the statistics for the census bureau's report on income, poverty, and health insurance coverage since 1948. wow. the us census bureau and the bureau of labor statistics ( bls) tag-team on this one. until the american community survey (acs) hit the scene in the early aughts (2000s), the current population survey had the largest sample size of all the annual general demographic data sets outside of the decennial census - about two hundred thousand respondents. this provides enough sample to conduct state- and a few large metro area-level analyses. your sample size will vanish if you start investigating subgroups b y state - consider pooling multiple years. county-level is a no-no. despite the american community survey's larger size, the cps-asec contains many more variables related to employment, sources of income, and insurance - and can be trended back to harry truman's presidency. aside from questions specifically asked about an annual experience (like income), many of the questions in this march data set should be t reated as point-in-time statistics. cps-asec generalizes to the united states non-institutional, non-active duty military population. the national bureau of economic research (nber) provides sas, spss, and stata importation scripts to create a rectangular file (rectangular data means only person-level records; household- and family-level information gets attached to each person). to import these files into r, the parse.SAScii function uses nber's sas code to determine how to import the fixed-width file, then RSQLite to put everything into a schnazzy database. you can try reading through the nber march 2012 sas importation code yourself, but it's a bit of a proc freak show. this new github repository contains three scripts: 2005-2012 asec - download all microdata.R down load the fixed-width file containing household, family, and person records import by separating this file into three tables, then merge 'em together at the person-level download the fixed-width file containing the person-level replicate weights merge the rectangular person-level file with the replicate weights, then store it in a sql database create a new variable - one - in the data table 2012 asec - analysis examples.R connect to the sql database created by the 'download all microdata' progr am create the complex sample survey object, using the replicate weights perform a boatload of analysis examples replicate census estimates - 2011.R connect to the sql database created by the 'download all microdata' program create the complex sample survey object, using the replicate weights match the sas output shown in the png file below 2011 asec replicate weight sas output.png statistic and standard error generated from the replicate-weighted example sas script contained in this census-provided person replicate weights usage instructions document. click here to view these three scripts for more detail about the current population survey - annual social and economic supplement (cps-asec), visit: the census bureau's current population survey page the bureau of labor statistics' current population survey page the current population survey's wikipedia article notes: interviews are conducted in march about experiences during the previous year. the file labeled 2012 includes information (income, work experience, health insurance) pertaining to 2011. when you use the current populat ion survey to talk about america, subract a year from the data file name. as of the 2010 file (the interview focusing on america during 2009), the cps-asec contains exciting new medical out-of-pocket spending variables most useful for supplemental (medical spending-adjusted) poverty research. confidential to sas, spss, stata, sudaan users: why are you still rubbing two sticks together after we've invented the butane lighter? time to transition to r. :D
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Open Science in (Higher) Education – data of the February 2017 survey
This data set contains:
Survey structure
The survey includes 24 questions and its structure can be separated in five major themes: material used in courses (5), OER awareness, usage and development (6), collaborative tools used in courses (2), assessment and participation options (5), demographics (4). The last two questions include an open text questions about general issues on the topics and singular open education experiences, and a request on forwarding the respondent’s e-mail address for further questionings. The online survey was created with Limesurvey[1]. Several questions include filters, i.e. these questions were only shown if a participants did choose a specific answer beforehand ([n/a] in Excel file, [.] In SPSS).
Demographic questions
Demographic questions asked about the current position, the discipline, birth year and gender. The classification of research disciplines was adapted to general disciplines at German higher education institutions. As we wanted to have a broad classification, we summarised several disciplines and came up with the following list, including the option “other” for respondents who do not feel confident with the proposed classification:
The current job position classification was also chosen according to common positions in Germany, including positions with a teaching responsibility at higher education institutions. Here, we also included the option “other” for respondents who do not feel confident with the proposed classification:
We chose to have a free text (numerical) for asking about a respondent’s year of birth because we did not want to pre-classify respondents’ age intervals. It leaves us options to have different analysis on answers and possible correlations to the respondents’ age. Asking about the country was left out as the survey was designed for academics in Germany.
Remark on OER question
Data from earlier surveys revealed that academics suffer confusion about the proper definition of OER[2]. Some seem to understand OER as free resources, or only refer to open source software (Allen & Seaman, 2016, p. 11). Allen and Seaman (2016) decided to give a broad explanation of OER, avoiding details to not tempt the participant to claim “aware”. Thus, there is a danger of having a bias when giving an explanation. We decided not to give an explanation, but keep this question simple. We assume that either someone knows about OER or not. If they had not heard of the term before, they do not probably use OER (at least not consciously) or create them.
Data collection
The target group of the survey was academics at German institutions of higher education, mainly universities and universities of applied sciences. To reach them we sent the survey to diverse institutional-intern and extern mailing lists and via personal contacts. Included lists were discipline-based lists, lists deriving from higher education and higher education didactic communities as well as lists from open science and OER communities. Additionally, personal e-mails were sent to presidents and contact persons from those communities, and Twitter was used to spread the survey.
The survey was online from Feb 6th to March 3rd 2017, e-mails were mainly sent at the beginning and around mid-term.
Data clearance
We got 360 responses, whereof Limesurvey counted 208 completes and 152 incompletes. Two responses were marked as incomplete, but after checking them turned out to be complete, and we added them to the complete responses dataset. Thus, this data set includes 210 complete responses. From those 150 incomplete responses, 58 respondents did not answer 1st question, 40 respondents discontinued after 1st question. Data shows a constant decline in response answers, we did not detect any striking survey question with a high dropout rate. We deleted incomplete responses and they are not in this data set.
Due to data privacy reasons, we deleted seven variables automatically assigned by Limesurvey: submitdate, lastpage, startlanguage, startdate, datestamp, ipaddr, refurl. We also deleted answers to question No 24 (email address).
References
Allen, E., & Seaman, J. (2016). Opening the Textbook: Educational Resources in U.S. Higher Education, 2015-16.
First results of the survey are presented in the poster:
Heck, Tamara, Blümel, Ina, Heller, Lambert, Mazarakis, Athanasios, Peters, Isabella, Scherp, Ansgar, & Weisel, Luzian. (2017). Survey: Open Science in Higher Education. Zenodo. http://doi.org/10.5281/zenodo.400561
Contact:
Open Science in (Higher) Education working group, see http://www.leibniz-science20.de/forschung/projekte/laufende-projekte/open-science-in-higher-education/.
[1] https://www.limesurvey.org
[2] The survey question about the awareness of OER gave a broad explanation, avoiding details to not tempt the participant to claim “aware”.
The 2022 Socio-Demographic and Economic Survey is a nationally representative household survey designed to provide information on population, migration, education, labour and employment, fertility, disability, household, and housing characteristics. The key objectives of the survey are:
-to generate essential key indicators as inputs in the preparation of national plans and programs for the well-being of the population -to monitor the progress of development programs as stipulated in the Sustainable Development Goals (SDGs), Medium Term Development Plans, Vision 2050 and other national policies/plans and priorities.
National coverage. 43 strata and 22 provinces were covered.
Household and Individual.
Sample survey data [ssd]
-Used a stratified, two-stage cluster sampling method, with a third stage in very large sample census units (CU, enumeration areas selected within the sample CUs).
-Produced 43 strata, 22 provinces by urban/rural (National Capital District has only urban areas).
-Allocation was done proportionately according to size (in terms of the number of households).
-Thus, 335 CUs / clusters were selected in the first- stage while a fixed number of 15 households per cluster were selected at the second stage resulting to a total sample size of 5,025 households.
Coverage: 95.8% (14 out of 335 clusters not accessed) due to security issues (tribal fights/lawlessness), and election related misconceptions.
Computer Assisted Personal Interview [capi]
The questionnaire was generated using the World Bank's software Survey Solutions. It contains a set of 47 questions covering several modules such as Employment, Fertility, Housing, Disability, Education. The questionnaire is provided in English in the External Resources section in this documentation.
-Checking of data submitted from field, identifying unique / valid households and removing invalid or duplicate households, coding of responses, consistency checks -Tabulations - generating tables for data analysis and generation of key indicators
The City of Norfolk is committed to using data to inform decisions and allocate resources. An important source of data is input from residents about their priorities and satisfaction with the services we provide. Norfolk last conducted a citywide survey of residents in 2022.
To provide up-to-date information regarding resident priorities and satisfaction, Norfolk contracted with ETC Institute to conduct a survey of residents. This survey was conducted in May and June 2024; surveys were sent via the U.S. Postal Service, and respondents were given the choice of responding by mail or online. This survey represents a random and statistically valid sample of residents from across the city, including each Ward. ETC Institute monitored responses and followed up to ensure all sections of the city were represented. Additionally, an opportunity was provided for residents not included in the random sample to take the survey and express their views. This dataset includes all random sample survey data including demographic information; it excludes free-form comments to protect privacy. It is grouped by Question Category, Question, Response, Demographic Question, and Demographic Question Response. This dataset will be updated every two years.
The 2005 Armenia Demographic and Health Survey (2005 ADHS) is the second in a series of nationally representative sample surveys designed to provide information on population and health issues in Armenia. As in the 2000 ADHS, the primary goal of the 2005 survey was to develop a single integrated set of demographic and health data pertaining to the population of the Republic of Armenia. In addition to integrating measures of reproductive, child, and adult health, another feature of the 2005 ADHS survey is that the majority of data are presented at the marz (region) level.
The 2005 ADHS was conducted by the National Statistical Service (NSS) and the MOH of the Republic of Armenia from September through December 2005. ORC Macro provided technical support for the survey through the MEASURE DHS project. MEASURE DHS is a worldwide project, sponsored by the United States Agency for International Development (USAID), with a mandate to assist countries in obtaining information on key population and health indicators. USAID/Armenia provided funding for the survey, while the United Nations Children’s Fund (UNICEF)/Armenia and the United Nations Population Fund (UNFPA)/Armenia supported the survey through in-kind contributions.
The 2005 ADHS collected national- and regional-level data on fertility and contraceptive use, maternal and child health, adult health, and HIV/AIDS and other sexually transmitted diseases. The survey obtained detailed information on these issues from women of reproductive age and, on certain topics, from men as well. Data are presented by marz wherever sample size permits.
The 2005 ADHS results are intended to provide the information needed to evaluate existing social programs and to design new strategies for improving the health of and health services for the people of Armenia. The 2005 ADHS also contributes to the growing international database on demographic and health-related variables.
National
Sample survey data
The sample was designed to permit detailed analysis-including the estimation of rates of fertility, infant/child mortality, and abortion-for the national level, for Yerevan, and for total urban and total rural areas separately. Many indicators can also be estimated at the regional (marz) level.
A representative probability sample of 7,565 households was selected for the 2005 ADHS sample. The sample was selected in two stages. In the first stage, 308 clusters were selected from a list of enumeration areas in a subsample from a master sample that was designed from the 2001 Population Census. In the second stage, a complete listing of households was carried out in each selected cluster. Households were then systematically selected for participation in the survey.
All women age 15-49 who were either permanent residents of the households in the 2005 ADHS sample or visitors present in the household on the night before the survey were eligible to be interviewed. Interviews were completed with 6,566 women. In addition, in a subsample of one-third of all the households selected for the survey, all men age 15-49 were eligible to be interviewed if they were either permanent residents or visitors present in the household on the night before the survey. Interviews were completed with 1,447 men.
Note: See detailed summarized sample implementation tables in APPENDIX A of the report which is presented in this documentation.
Face-to-face [f2f]
Three questionnaires were used in the 2005 ADHS: a Household Questionnaire, a Women’s Questionnaire, and a Men’s questionnaire. The Household and Individual Questionnaires were based on model survey instruments developed in the MEASURE DHS program and on questionnaires used in the 2000 ADHS. The model questionnaires were adapted for use by experts from the NSS and MOH. Input was also sought from a number of non-governmental organizations. The questionnaires were developed in English and translated into Armenian. The Household and Individual Questionnaires were pretested in June 2005.
The Household Questionnaire was used to list all usual members of and visitors to the selected households and to collect information on the socioeconomic status of the household. The first part of the Household Questionnaire collected information on the age, sex, educational attainment, and relationship to the household head of each household member or visitor. This information provides basic demographic data for Armenian households. It also was used to identify the women and men who were eligible for the individual interview (i.e., women and men age 15-49). In the second part of the Household Questionnaire, there were questions on housing characteristics (e.g., flooring material, source of water, type of toilet facilities), on ownership of a variety of consumer goods, and other questions relating to the socioeconomic status of the household. In addition, the Household Questionnaire was used to record height and weight measurements of women, men, and children under age five; hemoglobin measurement of women and children under age five; and blood pressure measurement of women and men.
The Women’s Questionnaire obtained data from women age 15-49 on the following topics: • Background characteristics • Pregnancy history • Antenatal, delivery, and postnatal care • Knowledge, attitudes, and use of contraception • Reproductive and adult health • Health care utilization • Vaccinations, birth registration, and health of children under age five • Episodes of diarrhea and respiratory illness of children under age five • Breastfeeding and weaning practices • Marriage and recent sexual activity • Fertility preferences • Knowledge of and attitude toward HIV/AIDS and other sexually transmitted infections
The Men’s Questionnaire, administered to men age 15-49, focused on the following topics: • Background characteristics • Health and health care utilization • Marriage and recent sexual activity • Attitudes toward and use of condoms • Knowledge of and attitude toward HIV/AIDS and other sexually transmitted infections • Attitudes toward women’s status
A total of 7,565 households were selected for the sample, of which 7,003 were occupied at the time of fieldwork. The main reason for the difference is that some of the dwelling units that were occupied during the household listing operation were either vacant or the household was away for an extended period at the time of interviewing. Of the occupied households, 96 percent were successfully interviewed.
In these households, 6,773 women were identified as eligible for the individual interview, and interviews were completed with 97 percent of them. Of the 1,630 eligible men identified, 89 percent were successfully interviewed. Response rates are almost identical in urban and rural areas.
Note: See summarized response rates by residence (urban/rural) in Table 1.1 of the report which is presented this documentation.
Estimates derived from a sample survey are affected by two types of errors: 1) non-sampling errors, and 2) sampling errors. Non-sampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2005 Armenia DHS (2005 ADHS) to minimize this type of error, non-sampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2005 ADHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2005 ADHS sample is the result of a multi-stage stratified design, and, consequently, it was necessary to use a more complex formula. The computer software used to calculate sampling errors for the 2005 ADHS is the sampling error module in ISSA (Integrated System for Survey Analysis). This module uses the Taylor linearization method of variance estimation for survey estimates that are means or proportions. Another approach, the Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.
Note: See detailed
The 2022 Kenya Demographic and Health Survey (2022 KDHS) is the seventh DHS survey implemented in Kenya. The Kenya National Bureau of Statistics (KNBS) in collaboration with the Ministry of Health (MoH) and other stakeholders implemented the survey. Survey planning began in late 2020 with data collection taking place from February 17 to July 19, 2022. ICF provided technical assistance through The DHS Program, which is funded by the United States Agency for International Development (USAID) and offers financial support and technical assistance for population and health surveys in countries worldwide. Other agencies and organizations that facilitated the successful implementation of the survey through technical or financial support were the Bill & Melinda Gates Foundation, the World Bank, the United Nations Children's Fund (UNICEF), the United Nations Population Fund (UNFPA), Nutrition International, the World Food Programme (WFP), the United Nations Entity for Gender Equality and the Empowerment of Women (UN Women), the World Health Organization (WHO), the Clinton Health Access Initiative, and the Joint United Nations Programme on HIV/AIDS (UNAIDS).
SURVEY OBJECTIVES The primary objective of the 2022 KDHS is to provide up-to-date estimates of demographic, health, and nutrition indicators to guide the planning, implementation, monitoring, and evaluation of population and health-related programs at the national and county levels. The specific objectives of the 2022 KDHS are to: Estimate fertility levels and contraceptive prevalence Estimate childhood mortality Provide basic indicators of maternal and child health Estimate the Early Childhood Development Index (ECDI) Collect anthropometric measures for children, women, and men Collect information on children's nutrition Collect information on women's dietary diversity Obtain information on knowledge and behavior related to transmission of HIV and other sexually transmitted infections (STIs) Obtain information on noncommunicable diseases and other health issues Ascertain the extent and patterns of domestic violence and female genital mutilation/cutting
National coverage
Household, individuals, county and national level
The survey covered sampled households
The sample for the 2022 KDHS was drawn from the Kenya Household Master Sample Frame (K-HMSF). This is the frame that KNBS currently operates to conduct household-based sample surveys in Kenya. In 2019, Kenya conducted a Population and Housing Census, and a total of 129,067 enumeration areas (EAs) were developed. Of these EAs, 10,000 were selected with probability proportional to size to create the K-HMSF. The 10,000 EAs were randomized into four equal subsamples. The survey sample was drawn from one of the four subsamples. The EAs were developed into clusters through a process of household listing and geo-referencing. To design the frame, each of the 47 counties in Kenya was stratified into rural and urban strata, resulting in 92 strata since Nairobi City and Mombasa counties are purely urban.
The 2022 KDHS was designed to provide estimates at the national level, for rural and urban areas, and, for some indicators, at the county level. Given this, the sample was designed to have 42,300 households, with 25 households selected per cluster, resulting into 1,692 clusters spread across the country with 1,026 clusters in rural areas and 666 in urban areas.
Computer Assisted Personal Interview [capi]
Eight questionnaires were used for the 2022 KDHS: 1. A full Household Questionnaire 2. A short Household Questionnaire 3. A full Woman's Questionnaire 4. A short Woman's Questionnaire 5. A Man's Questionnaire 6. A full Biomarker Questionnaire 7. A short Biomarker Questionnaire 8. A Fieldworker Questionnaire.
The Household Questionnaire collected information on: o Background characteristics of each person in the household (for example, name, sex, age, education, relationship to the household head, survival of parents among children under age 18) o Disability o Assets, land ownership, and housing characteristics o Sanitation, water, and other environmental health issues o Health expenditures o Accident and injury o COVID-19 (prevalence, vaccination, and related deaths) o Household food consumption
The Woman's Questionnaire was used to collect information from women age 15-49 on the following topics: o Socioeconomic and demographic characteristics o Reproduction o Family planning o Maternal health care and breastfeeding o Vaccination and health of children o Children's nutrition o Woman's dietary diversity o Early childhood development o Marriage and sexual activity o Fertility preferences o Husbands' background characteristics and women's employment activity o HIV/AIDS, other sexually transmitted infections (STIs), and tuberculosis (TB) o Other health issues o Early Childhood Development Index 2030 o Chronic diseases o Female genital mutilation/cutting o Domestic violence
The Man's Questionnaire was administered to men age 15-54 living in the households selected for long Household Questionnaires. The questionnaire collected information on: o Socioeconomic and demographic characteristics o Reproduction o Family planning o Marriage and sexual activity o Fertility preferences o Employment and gender roles o HIV/AIDS, other STIs, and TB o Other health issues o Chronic diseases o Female genital mutilation/cutting o Domestic violence
The Biomarker Questionnaire collected information on anthropometry (weight and height). The long Biomarker Questionnaire collected anthropometry measurements for children age 0-59 months, women age 15-49, and men age 15-54, while the short questionnaire collected weight and height measurements only for children age 0-59 months.
The Fieldworker Questionnaire was used to collect basic background information on the people who collected data in the field. This included team supervisors, interviewers, and biomarker technicians.
All questionnaires except the Fieldworker Questionnaire were translated into the Swahili language to make it easier for interviewers to ask questions in a language that respondents could understand.
Data were downloaded from the central servers and checked against the inventory of expected returns to account for all data collected in the field. SyncCloud was also used to generate field check tables to monitor progress and flag any errors, which were communicated back to the field teams for correction.
Secondary editing was done by members of the central office team, who resolved any errors that were not corrected by field teams during data collection. A CSPro batch editing tool was used for cleaning and tabulation during data analysis.
A total of 42,022 households were selected for the sample, of which 38,731 (92%) were found to be occupied. Among the occupied households, 37,911 were successfully interviewed, yielding a response rate of 98%. The response rates for urban and rural households were 96% and 99%, respectively. In the interviewed households, 33,879 women age 15-49 were identified as eligible for individual interviews. Interviews were completed with 32,156 women, yielding a response rate of 95%. The response rates among women selected for the full and short questionnaires were the similar (95%). In the households selected for the male survey, 16,552 men age 15-54 were identified as eligible for individual interviews and 14,453 were successfully interviewed, yielding a response rate of 87%.
Rwanda Interim Demographic and Health Survey (RIDHS) follows the Demographic and Health Surveys (RDHS) that were successfully conducted in 1992, 2000, and 2005, and is part of a broad, worldwide program of socio-demographic and health surveys conducted in developing countries since the mid-1980s. RIDHS collected the indicators on fertility, family planning and maternal and child health which the survey normally provides. In addition, RIDHS integrated a malaria module and tests for the prevalence of malaria and anemia among women and children, thus determining the prevalence of malaria and anemia for women and children at the national level.
The main objectives of the RIDHS were: • At the national level, gather data to determine demographic rates, particularly fertility and infant and child mortality rates, and analyze the direct and indirect factors that determine fertility and child mortality rates and trends. • Evaluate the level of knowledge and use of contraceptives among women and men. • Gather data concerning family health: vaccinations; prevalence and treatment of diarrhea, acute respiratory infections (ARI), and fever in children under the age of five; antenatal care visits; and assistance during childbirth. • Gather data concerning the prevention and treatment of malaria, particularly the possession and use of mosquito nets, and the prevention of malaria in pregnant women. • Gather data concerning child feeding practices, including breastfeeding. • Gather data concerning circumcision among men between the ages of 15 and 59. • Collect blood samples in all of the households surveyed for anemia testing of women age 15-49, pregnant women and children under age five. • Collect blood samples in all of the households surveyed for hemoglobin and malaria diagnostic testing of women age 15 to 49, pregnant women and children under age five.
National coverage
Household Individual Woman age 15-49 Man age 15-59
Sample survey data [ssd]
The sample for the RIDHS is a two-stage stratified area sample. Clusters are the primary sampling units and are constituted from enumeration areas (EA). The EA were defined in the 2002 General Population and Housing Census (RGPH) (SNR, 2005).
These enumeration areas provided the master frame for the drawing of 250 clusters (187 rural and 63 urban), selected with a representative probability proportional to their size. Only 249 of these clusters were surveyed, because one cluster located in a refugee camp had to be eliminated from the sample. A strictly proportional sample allocation would have resulted in a very low number of urban households in certain provinces. It was therefore necessary to slightly oversample urban areas in order to survey a sufficient number of households to produce reliable estimates for urban areas. The second stage involved selecting a sample of households in these enumeration areas. In order to adequately guarantee the accuracy of the indicators, the total number drawn was limited to 30 households per cluster. Because of the nonproportional distribution of the sample among the different strata and the fact that the number of households was set for each cluster, weighting was used to ensure the validity of the sample at both national and provincial levels.
All women age 15-49 years who were either usual residents of the selected household or visitors present in the household on the night before the survey were eligible to be interviewed (7,528 women). In addition, a sample of men age 15-59 who were either usual residents of the selected household or visitors present in the household on the night before the survey were eligible for the survey (7,168 men). Finally, all women age 15-49 and all children under the age of five were eligible for the anemia and malaria diagnostic tests.
The sample for the 2007-08 RIDHS covered the population residing in ordinary households across the country. A national sample of 7,469 households (1,863 in urban areas and 5,606 in rural areas) was selected. The sample was first stratified to provide adequate representation from urban and rural areas as well as all the four provinces and the city of Kigali, the nation’s capital.
One cluster located in a refugee camp had to be eliminated from the sample.
Face-to-face [f2f]
Three questionnaires were used in the 2007-08 RIDHS: the Household Questionnaire, the Women’s Questionnaire, and the Men’s Questionnaire. The content of these questionnaires was based on model questionnaires developed by the MEASURE DHS project.
Initial technical meetings that were held beginning in September 2007 allowed a wide range of government agencies as well as local and international organizations to contribute to the development of the questionnaires. Based on these discussions, the DHS model questionnaires were modified to reflect the needs of users and relevant issues in population, family planning, anemia, malaria and other health concerns in Rwanda. The questionnaires were then translated from French into Kinyarwanda. These questionnaires were finalized in December 2007 before the training of male and female interviewers.
The Household Questionnaire was used to list all of the usual members and visitors in the selected households. In addition, some basic information was collected on the characteristics of each person listed, including age, sex, education, and relationship to the head of the household. The main purpose of the Household Questionnaire was to identify women and men who were eligible for the individual interview. The Household Questionnaire also collected information on characteristics of the household’s dwelling unit such as the main source of drinking water, type of toilet facilities, materials used for the floor of the house, the main energy source used for cooking and ownership of various durable goods. Finally, the Household Questionnaire was also used to identify women and children eligible for the hemoglobin (anemia) and malaria diagnostic tests.
The Women’s Questionnaire was used to collect information on women of reproductive age (15-49 years) and covered questions on the following topics: • Background characteristics • Marital status • Birth history • Knowledge and use of family planning methods • Fertility preferences • Antenatal and delivery care • Breastfeeding practices • Vaccinations and childhood illnesses
The Men’s Questionnaire was administered to all men age 15-59 years living in the selected households. The Men’s Questionnaire collected information similar to that of the Women’s Questionnaire, with the only difference being that it did not include birth history or questions on maternal and child health or nutrition. In addition, the Men’s Questionnaire also collected information on circumcision.
Data entry began on January 7, 2008, three weeks after the beginning of data collection activities in the field. Data were entered by a team of five data processing personnel recruited and trained by staff from ICF Macro. The data entry team was reinforced during this work with an additional staffer. Completed questionnaires were periodically brought in from the field to the National Institute of Statistics in Kigali, where assigned staff checked them and coded the open-ended questions. Next, the questionnaires were sent to the data entry staff. Data were entered using CSPro, a program developed jointly by the United States Census Bureau, the ICF Macro MEASURE DHS program, and Serpro S.A. All questionnaires were entered twice to eliminate as many data entry errors as possible from the files. In addition, a quality control program was used to detect data collection errors for each team. This information was shared with field teams during supervisory visits to improve data quality. The data entry and internal consistency verification phase of the survey was completed on May 14, 2008.
The response rate was high for both men (95.4 percent) and women (97.5 percent).
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors, and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2007-08 RIDHS to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2007-08 RIDHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Professional organizations in STEM (science, technology, engineering, and mathematics) can use demographic data to quantify recruitment and retention (R&R) of underrepresented groups within their memberships. However, variation in the types of demographic data collected can influence the targeting and perceived impacts of R&R efforts - e.g., giving false signals of R&R for some groups. We obtained demographic surveys from 73 U.S.-affiliated STEM organizations, collectively representing 712,000 members and conference-attendees. We found large differences in the demographic categories surveyed (e.g., disability status, sexual orientation) and the available response options. These discrepancies indicate a lack of consensus regarding the demographic groups that should be recognized and, for groups that are omitted from surveys, an inability of organizations to prioritize and evaluate R&R initiatives. Aligning inclusive demographic surveys across organizations will provide baseline data that can be used to target and evaluate R&R initiatives to better serve underrepresented groups throughout STEM. Methods We surveyed 164 STEM organizations (73 responses, rate = 44.5%) between December 2020 and July 2021 with the goal of understanding what demographic data each organization collects from its constituents (i.e., members and conference-attendees) and how the data are used. Organizations were sourced from a list of professional societies affiliated with the American Association for the Advancement of Science, AAAS, (n = 156) or from social media (n = 8). The survey was sent to the elected leadership and management firms for each organization, and follow-up reminders were sent after one month. The responding organizations represented a wide range of fields: 31 life science organizations (157,000 constituents), 5 mathematics organizations (93,000 constituents), 16 physical science organizations (207,000 constituents), 7 technology organizations (124,000 constituents), and 14 multi-disciplinary organizations spanning multiple branches of STEM (131,000 constituents). A list of the responding organizations is available in the Supplementary Materials. Based on the AAAS-affiliated recruitment of the organizations and the similar distribution of constituencies across STEM fields, we conclude that the responding organizations are a representative cross-section of the most prominent STEM organizations in the U.S. Each organization was asked about the demographic information they collect from their constituents, the response rates to their surveys, and how the data were used. Survey description The following questions are written as presented to the participating organizations. Question 1: What is the name of your STEM organization? Question 2: Does your organization collect demographic data from your membership and/or meeting attendees? Question 3: When was your organization’s most recent demographic survey (approximate year)? Question 4: We would like to know the categories of demographic information collected by your organization. You may answer this question by either uploading a blank copy of your organization’s survey (linked provided in online version of this survey) OR by completing a short series of questions. Question 5: On the most recent demographic survey or questionnaire, what categories of information were collected? (Please select all that apply)
Disability status Gender identity (e.g., male, female, non-binary) Marital/Family status Racial and ethnic group Religion Sex Sexual orientation Veteran status Other (please provide)
Question 6: For each of the categories selected in Question 5, what options were provided for survey participants to select? Question 7: Did the most recent demographic survey provide a statement about data privacy and confidentiality? If yes, please provide the statement. Question 8: Did the most recent demographic survey provide a statement about intended data use? If yes, please provide the statement. Question 9: Who maintains the demographic data collected by your organization? (e.g., contracted third party, organization executives) Question 10: How has your organization used members’ demographic data in the last five years? Examples: monitoring temporal changes in demographic diversity, publishing diversity data products, planning conferences, contributing to third-party researchers. Question 11: What is the size of your organization (number of members or number of attendees at recent meetings)? Question 12: What was the response rate (%) for your organization’s most recent demographic survey? *Organizations were also able to upload a copy of their demographics survey instead of responding to Questions 5-8. If so, the uploaded survey was used (by the study authors) to evaluate Questions 5-8.