89 datasets found
  1. w

    Demographic and Health Survey 2002 - Viet Nam

    • microdata.worldbank.org
    • catalog.ihsn.org
    Updated Oct 26, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    General Statistical Office (GSO) (2023). Demographic and Health Survey 2002 - Viet Nam [Dataset]. https://microdata.worldbank.org/index.php/catalog/1518
    Explore at:
    Dataset updated
    Oct 26, 2023
    Dataset authored and provided by
    General Statistical Office (GSO)
    Time period covered
    2002
    Area covered
    Vietnam
    Description

    Abstract

    The 2002 Vietnam Demographic and Health Survey (VNDHS 2002) is a nationally representative sample survey of 5,665 ever-married women age 15-49 selected from 205 sample points (clusters) throughout Vietnam. It provides information on levels of fertility, family planning knowledge and use, infant and child mortality, and indicators of maternal and child health. The survey included a Community/ Health Facility Questionnaire that was implemented in each of the sample clusters.

    The survey was designed to measure change in reproductive health indicators over the five years since the VNDHS 1997, especially in the 18 provinces that were targeted in the Population and Family Health Project of the Committee for Population, Family and Children. Consequently, all provinces were separated into “project” and “nonproject” groups to permit separate estimates for each. Data collection for the survey took place from 1 October to 21 December 2002.

    The Vietnam Demographic and Health Survey 2002 (VNDHS 2002) was the third DHS in Vietnam, with prior surveys implemented in 1988 and 1997. The VNDHS 2002 was carried out in the framework of the activities of the Population and Family Health Project of the Committee for Population, Family and Children (previously the National Committee for Population and Family Planning).

    The main objectives of the VNDHS 2002 were to collect up-to-date information on family planning, childhood mortality, and health issues such as breastfeeding practices, pregnancy care, vaccination of children, treatment of common childhood illnesses, and HIV/AIDS, as well as utilization of health and family planning services. The primary objectives of the survey were to estimate changes in family planning use in comparison with the results of the VNDHS 1997, especially on issues in the scope of the project of the Committee for Population, Family and Children.

    VNDHS 2002 data confirm the pattern of rapidly declining fertility that was observed in the VNDHS 1997. It also shows a sharp decline in child mortality, as well as a modest increase in contraceptive use. Differences between project and non-project provinces are generally small.

    Geographic coverage

    The 2002 Vietnam Demographic and Health Survey (VNDHS 2002) is a nationally representative sample survey. The VNDHS 1997 was designed to provide separate estimates for the whole country, urban and rural areas, for 18 project provinces and the remaining nonproject provinces as well. Project provinces refer to 18 focus provinces targeted for the strengthening of their primary health care systems by the Government's Population and Family Health Project to be implemented over a period of seven years, from 1996 to 2002 (At the outset of this project there were 15 focus provinces, which became 18 by the creation of 3 new provinces from the initial set of 15). These provinces were selected according to criteria based on relatively low health and family planning status, no substantial family planning donor presence, and regional spread. These criteria resulted in the selection of the country's poorer provinces. Nine of these provinces have significant proportions of ethnic minorities among their population.

    Analysis unit

    • Household
    • Women age 15-49

    Universe

    The population covered by the 2002 VNDHS is defined as the universe of all women age 15-49 in Vietnam.

    Kind of data

    Sample survey data

    Sampling procedure

    The sample for the VNDHS 2002 was based on that used in the VNDHS 1997, which in turn was a subsample of the 1996 Multi-Round Demographic Survey (MRS), a semi-annual survey of about 243,000 households undertaken regularly by GSO. The MRS sample consisted of 1,590 sample areas known as enumeration areas (EAs) spread throughout the 53 provinces/cities of Vietnam, with 30 EAs in each province. On average, an EA comprises about 150 households. For the VNDHS 1997, a subsample of 205 EAs was selected, with 26 households in each urban EA and 39 households for each rural EA. A total of 7,150 households was selected for the survey. The VNDHS 1997 was designed to provide separate estimates for the whole country, urban and rural areas, for 18 project provinces and the remaining nonproject provinces as well. Because the main objective of the VNDHS 2002 was to measure change in reproductive health indicators over the five years since the VNDHS 1997, the sample design for the VNDHS 2002 was as similar as possible to that of the VNDHS 1997.

    Although it would have been ideal to have returned to the same households or at least the same sample points as were selected for the VNDHS 1997, several factors made this undesirable. Revisiting the same households would have held the sample artificially rigid over time and would not allow for newly formed households. This would have conflicted with the other major survey objective, which was to provide up-to-date, representative data for the whole of Vietnam. Revisiting the same sample points that were covered in 1997 was complicated by the fact that the country had conducted a population census in 1999, which allowed for a more representative sample frame.

    In order to balance the two main objectives of measuring change and providing representative data, it was decided to select enumeration areas from the 1999 Population Census, but to cover the same communes that were sampled in the VNDHS 1997 and attempt to obtain a sample point as close as possible to that selected in 1997. Consequently, the VNDHS 2002 sample also consisted of 205 sample points and reflects the oversampling in the 20 provinces that fall in the World Bank-supported Population and Family Health Project. The sample was designed to produce about 7,000 completed household interviews and 5,600 completed interviews with ever-married women age 15-49.

    Mode of data collection

    Face-to-face

    Research instrument

    As in the VNDHS 1997, three types of questionnaires were used in the 2002 survey: the Household Questionnaire, the Individual Woman's Questionnaire, and the Community/Health Facility Questionnaire. The first two questionnaires were based on the DHS Model A Questionnaire, with additions and modifications made during an ORC Macro staff visit in July 2002. The questionnaires were pretested in two clusters in Hanoi (one in a rural area and another in an urban area). After the pretest and consultation with ORC Macro, the drafts were revised for use in the main survey.

    a) The Household Questionnaire was used to enumerate all usual members and visitors in selected households and to collect information on age, sex, education, marital status, and relationship to the head of household. The main purpose of the Household Questionnaire was to identify persons who were eligible for individual interview (i.e. ever-married women age 15-49). In addition, the Household Questionnaire collected information on characteristics of the household such as water source, type of toilet facilities, material used for the floor and roof, and ownership of various durable goods.

    b) The Individual Questionnaire was used to collect information on ever-married women aged 15-49 in surveyed households. These women were interviewed on the following topics:
    - Respondent's background characteristics (education, residential history, etc.); - Reproductive history; - Contraceptive knowledge and use;
    - Antenatal and delivery care; - Infant feeding practices; - Child immunization; - Fertility preferences and attitudes about family planning; - Husband's background characteristics; - Women's work information; and - Knowledge of AIDS.

    c) The Community/Health Facility Questionnaire was used to collect information on all communes in which the interviewed women lived and on services offered at the nearest health stations. The Community/Health Facility Questionnaire consisted of four sections. The first two sections collected information from community informants on some characteristics such as the major economic activities of residents, distance from people's residence to civic services and the location of the nearest sources of health care. The last two sections involved visiting the nearest commune health centers and intercommune health centers, if these centers were located within 30 kilometers from the surveyed cluster. For each visited health center, information was collected on the type of health services offered and the number of days services were offered per week; the number of assigned staff and their training; medical equipment and medicines available at the time of the visit.

    Cleaning operations

    The first stage of data editing was implemented by the field editors soon after each interview. Field editors and team leaders checked the completeness and consistency of all items in the questionnaires. The completed questionnaires were sent to the GSO headquarters in Hanoi by post for data processing. The editing staff of the GSO first checked the questionnaires for completeness. The data were then entered into microcomputers and edited using a software program specially developed for the DHS program, the Census and Survey Processing System, or CSPro. Data were verified on a 100 percent basis, i.e., the data were entered separately twice and the two results were compared and corrected. The data processing and editing staff of the GSO were trained and supervised for two weeks by a data processing specialist from ORC Macro. Office editing and processing activities were initiated immediately after the beginning of the fieldwork and were completed in late December 2002.

    Response rate

    The results of the household and individual

  2. i

    Project for Statistics on Living Standards and Development 1993 - South...

    • catalog.ihsn.org
    • microdata.fao.org
    • +2more
    Updated Mar 29, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Southern Africa Labour and Development Research Unit (2019). Project for Statistics on Living Standards and Development 1993 - South Africa [Dataset]. https://catalog.ihsn.org/catalog/4628
    Explore at:
    Dataset updated
    Mar 29, 2019
    Dataset authored and provided by
    Southern Africa Labour and Development Research Unit
    Time period covered
    1993
    Area covered
    South Africa
    Description

    Abstract

    The Project for Statistics on Living standards and Development was a coutrywide World Bank Living Standards Measurement Survey. It covered approximately 9000 households, drawn from a representative sample of South African households. The fieldwork was undertaken during the nine months leading up to the country's first democratic elections at the end of April 1994. The purpose of the survey was to collect statistical information about the conditions under which South Africans live in order to provide policymakers with the data necessary for planning strategies. This data would aid the implementation of goals such as those outlined in the Government of National Unity's Reconstruction and Development Programme.

    Geographic coverage

    National coverage

    Analysis unit

    • Households
    • Individuals
    • Community

    Universe

    All Household members.

    Individuals in hospitals, old age homes, hotels and hostels of educational institutions were not included in the sample. Migrant labour hostels were included. In addition to those that turned up in the selected ESDs, a sample of three hostels was chosen from a national list provided by the Human Sciences Research Council and within each of these hostels a representative sample was drawn on a similar basis as described above for the households in ESDs.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    Sample size is 9,000 households

    The sample design adopted for the study was a two-stage self-weightingdesign in which the first stage units were Census Enumerator Subdistricts (ESDs, or their equivalent) and the second stage were households.

    The advantage of using such a design is that it provides a representative sample that need not be based on accurate census population distribution.in the case of South Africa, the sample will automatically include many poor people, without the need to go beyond this and oversample the poor. Proportionate sampling as in such a self-weighting sample design offers the simplest possible data files for further analysis, as weights do not have to be added. However, in the end this advantage could not be retained and weights had to be added.

    The sampling frame was drawn up on the basis of small, clearly demarcated area units, each with a population estimate. The nature of the self-weighting procedure adopted ensured that this population estimate was not important for determining the final sample, however. For most of the country, census ESDs were used. Where some ESDs comprised relatively large populations as for instance in some black townships such as Soweto, aerial photographs were used to divide the areas into blocks of approximately equal population size. In other instances, particularly in some of the former homelands, the area units were not ESDs but villages or village groups.

    In the sample design chosen, the area stage units (generally ESDs) were selected with probability proportional to size, based on the census population. Systematic sampling was used throughout that is, sampling at fixed interval in a list of ESDs, starting at a randomly selected starting point. Given that sampling was self-weighting, the impact of stratification was expected to be modest. The main objective was to ensure that the racial and geographic breakdown approximated the national population distribution. This was done by listing the area stage units (ESDs) by statistical region and then within the statistical region by urban or rural. Within these sub-statistical regions, the ESDs were then listed in order of percentage African. The sampling interval for the selection of the ESDs was obtained by dividing the 1991 census population of 38,120,853 by the 300 clusters to be selected. This yielded 105,800. Starting at a randomly selected point, every 105,800th person down the cluster list was selected. This ensured both geographic and racial diversity (ESDs were ordered by statistical sub-region and proportion of the population African). In three or four instances, the ESD chosen was judged inaccessible and replaced with a similar one.

    In the second sampling stage the unit of analysis was the household. In each selected ESD a listing or enumeration of households was carried out by means of a field operation. From the households listed in an ESD a sample of households was selected by systematic sampling. Even though the ultimate enumeration unit was the household, in most cases "stands" were used as enumeration units. However, when a stand was chosen as the enumeration unit all households on that stand had to be interviewed.

    Census population data, however, was available only for 1991. An assumption on population growth was thus made to obtain an approximation of the population size for 1993, the year of the survey. The sampling interval at the level of the household was determined in the following way: Based on the decision to have a take of 125 individuals on average per cluster (i.e. assuming 5 members per household to give an average cluster size of 25 households), the interval of households to be selected was determined as the census population divided by 118.1, i.e. allowing for population growth since the census. It was subsequently discovered that population growth was slightly over-estimated but this had little effect on the findings of the survey.

    Individuals in hospitals, old age homes, hotels and hostels of educational institutions were not included in the sample. Migrant labour hostels were included. In addition to those that turned up in the selected ESDs, a sample of three hostels was chosen from a national list provided by the Human Sciences Research Council and within each of these hostels a representative sample was drawn on a similar basis as described abovefor the households in ESDs.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The main instrument used in the survey was a comprehensive household questionnaire. This questionnaire covered a wide range of topics but was not intended to provide exhaustive coverage of any single subject. In other words, it was an integrated questionnaire aimed at capturing different aspects of living standards. The topics covered included demography, household services, household expenditure, educational status and expenditure, remittances and marital maintenance, land access and use, employment and income, health status and expenditure and anthropometry (children under the age of six were weighed and their heights measured). This questionnaire was available to households in two languages, namely English and Afrikaans. In addition, interviewers had in their possession a translation in the dominant African language/s of the region.

    In addition to the detailed household questionnaire referred to above, a community questionnaire was administered in each cluster of the sample. The purpose of this questionnaire was to elicit information on the facilities available to the community in each cluster. Questions related primarily to the provision of education, health and recreational facilities. Furthermore there was a detailed section for the prices of a range of commodities from two retail sources in or near the cluster: a formal source such as a supermarket and a less formal one such as the "corner cafe" or a "spaza". The purpose of this latter section was to obtain a measure of regional price variation both by region and by retail source. These prices were obtained by the interviewer. For the questions relating to the provision of facilities, respondents were "prominent" members of the community such as school principals, priests and chiefs.

    Cleaning operations

    All the questionnaires were checked when received. Where information was incomplete or appeared contradictory, the questionnaire was sent back to the relevant survey organization. As soon as the data was available, it was captured using local development platform ADE. This was completed in February 1994. Following this, a series of exploratory programs were written to highlight inconsistencies and outlier. For example, all person level files were linked together to ensure that the same person code reported in different sections of the questionnaire corresponded to the same person. The error reports from these programs were compared to the questionnaires and the necessary alterations made. This was a lengthy process, as several files were checked more than once, and completed at the beginning of August 1994. In some cases questionnaires would contain missing values, or comments that the respondent did not know, or refused to answer a question.

    These responses are coded in the data files with the following values: VALUE MEANING -1 : The data was not available on the questionnaire or form -2 : The field is not applicable -3 : Respondent refused to answer -4 : Respondent did not know answer to question

    Data appraisal

    The data collected in clusters 217 and 218 should be viewed as highly unreliable and therefore removed from the data set. The data currently available on the web site has been revised to remove the data from these clusters. Researchers who have downloaded the data in the past should revise their data sets. For information on the data in those clusters, contact SALDRU http://www.saldru.uct.ac.za/.

  3. a

    Demographic and Health Survey 2015-2016 - Armenia

    • microdata.armstat.am
    • catalog.ihsn.org
    • +2more
    Updated Oct 11, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ministry of Health (MOH) (2019). Demographic and Health Survey 2015-2016 - Armenia [Dataset]. https://microdata.armstat.am/index.php/catalog/8
    Explore at:
    Dataset updated
    Oct 11, 2019
    Dataset provided by
    Ministry of Health (MOH)
    National Statistical Service (NSSS)
    Time period covered
    2015 - 2016
    Area covered
    Armenia
    Description

    Abstract

    The 2015-16 Armenia Demographic and Health Survey (2015-16 ADHS) is the fourth in a series of nationally representative sample surveys designed to provide information on population and health issues. It is conducted in Armenia under the worldwide Demographic and Health Surveys program. Specifically, the objective of the 2015-16 ADHS is to provide current and reliable information on fertility and abortion levels, marriage, sexual activity, fertility preferences, awareness and use of family planning methods, breastfeeding practices, nutritional status of young children, childhood mortality, maternal and child health, domestic violence against women, child discipline, awareness and behavior regarding AIDS and other sexually transmitted infections (STIs), and other health-related issues such as smoking, tuberculosis, and anemia. The survey obtained detailed information on these issues from women of reproductive age and, for certain topics, from men as well.

    The 2015-16 ADHS results are intended to provide information needed to evaluate existing social programs and to design new strategies to improve the health of and health services for the people of Armenia. Data are presented by region (marz) wherever sample size permits. The information collected in the 2015-16 ADHS will provide updated estimates of basic demographic and health indicators covered in the 2000, 2005, and 2010 surveys.

    The long-term objective of the survey includes strengthening the technical capacity of major government institutions, including the NSS. The 2015-16 ADHS also provides comparable data for longterm trend analysis because the 2000, 2005, 2010, and 2015-16 surveys were implemented by the same organization and used similar data collection procedures. It also adds to the international database of demographic and health–related information for research purposes.

    Geographic coverage

    National coverage

    Analysis unit

    • Household
    • Individual
    • Children age 0-5
    • Woman age 15-49
    • Man age 15-49

    Universe

    The survey covered all de jure household members (usual residents), children age 0-4 years, women age 15-49 years and men age 15-49 years resident in the household.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The sample was designed to produce representative estimates of key indicators at the national level, for Yerevan, and for total urban and total rural areas separately. Many indicators can also be estimated at the regional (marz) level.

    The sampling frame used for the 2015-16 ADHS is the Armenia Population and Housing Census, which was conducted in Armenia in 2011 (APHC 2011). The sampling frame is a complete list of enumeration areas (EAs) covering the whole country, a total number of 11,571 EAs, provided by the National Statistical Service (NSS) of Armenia, the implementing agency for the 2015-16 ADHS. This EA frame was created from the census data base by summarizing the households down to EA level. A representative probability sample of 8,749 households was selected for the 2015-16 ADHS sample. The sample was selected in two stages. In the first stage, 313 clusters (192 in urban areas and 121 in rural areas) were selected from a list of EAs in the sampling frame. In the second stage, a complete listing of households was carried out in each selected cluster. Households were then systematically selected for participation in the survey. Appendix A provides additional information on the sample design of the 2015-16 Armenia DHS. Because of the approximately equal sample size in each marz, the sample is not self-weighting at the national level, and weighting factors have been calculated, added to the data file, and applied so that results are representative at the national level.

    For further details on sample design, see Appendix A of the final report.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    Five questionnaires were used for the 2015-16 ADHS: the Household Questionnaire, the Woman’s Questionnaire, the Man’s Questionnaire, the Biomarker Questionnaire, and the Fieldworker Questionnaire. These questionnaires, based on The DHS Program’s standard Demographic and Health Survey questionnaires, were adapted to reflect the population and health issues relevant to Armenia. Input was solicited from various stakeholders representing government ministries and agencies, nongovernmental organizations, and international donors. After all questionnaires were finalized in English, they were translated into Armenian. They were pretested in September-October 2015.

    Cleaning operations

    The processing of the 2015-16 ADHS data began shortly after fieldwork commenced. All completed questionnaires were edited immediately by field editors while still in the field and checked by the supervisors before being dispatched to the data processing center at the NSS central office in Yerevan. These completed questionnaires were edited and entered by 15 data processing personnel specially trained for this task. All data were entered twice for 100 percent verification. Data were entered using the CSPro computer package. The concurrent processing of the data was an advantage because the senior ADHS technical staff were able to advise field teams of problems detected during the data entry. In particular, tables were generated to check various data quality parameters. Moreover, the double entry of data enabled easy comparison and identification of errors and inconsistencies. As a result, specific feedback was given to the teams to improve performance. The data entry and editing phase of the survey was completed in June 2016.

    Response rate

    A total of 8,749 households were selected in the sample, of which 8,205 were occupied at the time of the fieldwork. The main reason for the difference is that some of the dwelling units that were occupied during the household listing operation were either vacant or the household was away for an extended period at the time of interviewing. The number of occupied households successfully interviewed was 7,893, yielding a household response rate of 96 percent. The household response rate in urban areas (96 percent) was nearly the same as in rural areas (97 percent).

    In these households, a total of 6,251 eligible women were identified; interviews were completed with 6,116 of these women, yielding a response rate of 98 percent. In one-half of the households, a total of 2,856 eligible men were identified, and interviews were completed with 2,755 of these men, yielding a response rate of 97 percent. Among men, response rates are slightly lower in urban areas (96 percent) than in rural areas (97 percent), whereas rates for women are the same in urban and in rural areas (98 percent).

    The 2015-16 ADHS achieved a slightly higher response rate for households than the 2010 ADHS (NSS 2012). The increase is only notable for urban households (96 percent in 2015-16 compared with 94 percent in 2010). Response rates in all other categories are very close to what they were in 2010.

    Sampling error estimates

    SAS computer software were used to calculate sampling errors for the 2015-16 ADHS. The programs used the Taylor linearization method of variance estimation for means or proportions and the Jackknife repeated replication method for variance estimation of more complex statistics such as fertility and mortality rates.

    A more detailed description of estimates of sampling errors are presented in Appendix B of the survey final report.

    Data appraisal

    Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Age distribution of eligible and interviewed men - Completeness of reporting - Births by calendar years - Reporting of age at death in days - Reporting of age at death in months - Nutritional status of children based on the NCHS/CDC/WHO International Reference Population - Vaccinations by background characteristics for children age 18-29 months

    See details of the data quality tables in Appendix C of the survey final report.

  4. B

    Alberta Survey, 2012B

    • borealisdata.ca
    Updated Mar 2, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Population Research Laboratory (2018). Alberta Survey, 2012B [Dataset]. http://doi.org/10.7939/DVN/10004
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 2, 2018
    Dataset provided by
    Borealis
    Authors
    Population Research Laboratory
    License

    https://borealisdata.ca/api/datasets/:persistentId/versions/7.1/customlicense?persistentId=doi:10.7939/DVN/10004https://borealisdata.ca/api/datasets/:persistentId/versions/7.1/customlicense?persistentId=doi:10.7939/DVN/10004

    Time period covered
    Jul 2012
    Area covered
    Alberta, Canada
    Description

    The Population Research Laboratory (PRL), a member of the Association of Academic Survey Research Organizations (AASRO), seeks to advance the research, education and service goals of the University of Alberta by helping academic researchers and policy makers design and implement applied social science research projects. The PRL specializes in the gathering, analysis, and presentation of data about demographic, social and public issues. The PRL research team provides expert consultation and implementation of quantitative and qualitative research methods, project design, sample design, web-based, paper-based and telephone surveys, field site testing, data analysis and report writing. The PRL follows scientifically rigorous and transparent methods in each phase of a research project. Research Coordinators are members of the American Association for Public Opinion Research (AAPOR) and use best practices when conducting all types of research. The PRL has particular expertise in conducting computer-assisted telephone interviews (referred to as CATI surveys). When conducting telephone surveys, all calls are displayed as being from the "U of A PRL", a procedure that assures recipients that the call is not from a telemarketer, and thus helps increase response rates. The PRL maintains a complement of highly skilled telephone interviewers and supervisors who are thoroughly trained in FOIPP requirements, respondent selection procedures, questionnaire instructions, and neutral probing. A subset of interviewers are specially trained to convince otherwise reluctant respondents to participate in the study, a practice that increases response rates and lowers selection bias. PRL staff monitors data collection on a daily basis to allow any necessary adjustments to the volume and timing of calls and respondent selection criteria. The Population Research Laboratory (PRL) administered the 2012 Alberta Survey B. This survey of households across the province of Alberta continues to enable academic researchers, government departments, and non-profit organizations to explore a wide range of topics in a structured research framework and environment. Sponsors' research questions are asked together with demographic questions in a telephone interview of Alberta households. This data consists of the information from 1207 Alberta residence, interviewed between June 5, 2012 and June 27, 2012. The amount of responses indicates that the response rate, as calculated percentages representing the number of people who participated in the survey divided by the number selected in the eligible sample, was 27.6% for survey B. The subject ares included in the 2012 Alberta Survey B includes socio-demographic and background variables such as: household composition, age, gender, marital status, highest level of education, household income, religion, ethnic background, place of birth, employment status, home ownership, political party support and perceptions of financial status. In addition, the topics of public health and injury control, tobacco reduction, activity limitations and personal directives, unions, politics and health.

  5. r

    Data Sharing for Demographics Research (DSDR)

    • rrid.site
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Data Sharing for Demographics Research (DSDR) [Dataset]. http://identifiers.org/RRID:SCR_016310/resolver?q=&i=rrid
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    DSDR disseminates, archives, and preserves data for population-based studies. By providing access to data on topics including mortality and health, fertility, family and household structure, and children and youth, DSDR aims to facilitate demographic research.

  6. Sales data based on demographics

    • kaggle.com
    zip
    Updated Jan 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). Sales data based on demographics [Dataset]. https://www.kaggle.com/datasets/thedevastator/demographical-shopping-purchases-data
    Explore at:
    zip(1541029 bytes)Available download formats
    Dataset updated
    Jan 12, 2023
    Authors
    The Devastator
    Description

    Demographical Shopping Purchases Data

    Analyzing customer purchasing patterns and preferences

    By Joseph Nowicki [source]

    About this dataset

    This dataset contains demographic information about customers who have made purchases in a store, including their name, IP address, region, age, items purchased, and total amount spent. Furthermore, this data can provide insights into customer shopping behaviour for the store in question - from their geographical information to the types of products they purchase. With detailed demographic data like this at hand it is possible to make strategic decisions regarding target customers as well as developing specific marketing campaigns or promotions tailored to meet their needs and interests. By gaining deeper understanding of customer habits through this dataset we unlock more possibilities for businesses seeking higher engagement levels with shoppers

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    This dataset includes information such as customer's names, IP address, age, items purchased and amount spent. This data can be used to uncover patterns in spending behavior of shoppers from different areas or regions across demographics like age group or gender.

    Research Ideas

    • Analyze customer shopping trends based on age and region to maximize targetted advertising.
    • Analyze the correlation between customer spending habits based on store versus online behavior.
    • Use IP addresses to track geographical trends in items purchased from a particular online store to identify new markets for targeted expansion

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    See the dataset description for more information.

    Columns

    File: Demographic_Data_Orig.csv | Column name | Description | |:---------------|:------------------------------------------------------------------------------------------------| | full.name | The full name of the customer. (String) | | ip.address | The IP address of the customer. (String) | | region | The region of residence of the customer. (String) | | in.store | A boolean value indicating whether the customer made the purchase in-store or online. (Boolean) | | age | The age of the customer. (Integer) | | items | The number of items purchased by the customer. (Integer) | | amount | The total amount spent by the customer. (Float) |

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. If you use this dataset in your research, please credit Joseph Nowicki.

  7. g

    Data from: Longitudinal Analysis of Historical Demographic Data

    • search.gesis.org
    • openicpsr.org
    • +1more
    Updated May 1, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GESIS search (2021). Longitudinal Analysis of Historical Demographic Data [Dataset]. http://doi.org/10.3886/E34554V1
    Explore at:
    Dataset updated
    May 1, 2021
    Dataset provided by
    ICPSR - Interuniversity Consortium for Political and Social Research
    GESIS search
    License

    https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de452467https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de452467

    Description

    Abstract (en): This study contains teaching materials developed over a period of years for a four-week workshop, Longitudinal Analysis of Historical Demographic Data (LAHDD), offered through the ICPSR Summer Program in 2006, 2007, 2009, 2011 and 2013, with one-day alumni workshops in 2010, 2012, and 2014. Instructors in the workshops are listed below. Funding was provided by The Eunice Kennedy Shriver National Institute of Child Health and Human Development, grants R25-HD040525 and R25-HD-049479, the ICPSR Summer Program and the ICPSR Director. The course was designed to teach students the theories, methods, and practices of historical demography and to give them first-hand experience working with historical data. This training is valuable not only to those interested in the analysis historical data. The techniques of historical demography rest on methodological insights that can be applied to many problems in population studies and other social sciences. While historical demography remains a flourishing research area with publications in key journals like Demography, Population Studies, and Population, practitioners were dispersed, and training was not available at any of the population research centers in the U.S. or elsewhere. One hundred and ten participants from around the globe took part in the workshops, and have gone on to establish courses of their own or teach in other workshops. We offer these materials here in the hopes that others will find them useful in developing courses on historical demography and/or longitudinal data analysis. The workshop was organized in three tracks: A brief tour of historical demography, event-history analysis, and data management for longitudinal data using Stata and Microsoft Access. The data management track includes 13 exercises designed for hands-on learning and reinforcement. Included in this project are the syllabii and reading lists for the three tracks, datasets used in the exercises, documents setting out each exercise, a file with the expected results, and for many of the exercises, an explanation. Video tutorials helpful with the Access exercises are accessible from ICPSR's YouTube channel https://www.youtube.com/playlist?list=PLqC9lrhW1Vvb9M1QpQH23z9UlPYxHbUMF. Users are encouraged to use these materials to develop their own courses and workshops in any of the topics covered. Please acknowledge NICHD R25-HD040525 and R25-HD-049479 whenever appropriate. Historical demography instructors: Myron P. Gutmann, University of Colorado Boulder Cameron Campbell, Hong Kong University of Science and Technology J. David Hacker, University of Minnesota Satomi Kurosu, Reitaku University Katherine A. Lynch, Carnegie Mellon University Event history instructors: Cameron Campbell, Hong Kong University of Science and Technology Glenn Deane, State University of New York at Albany Ken R. Smith, Huntsman Cancer Institute and University of Utah Database management instructors: George Alter, University of Michigan Susan Hautaniemi Leonard, University of Michigan Teaching Assistants: Mathew Creighton, University of Massachusetts Boston Emily Merchant, University of Michigan Luciana Quaranta, Lund University Kristine Witkowski, University of Michigan Project Manager: Susan Hautaniemi Leonard, University of Michigan Funding insitution(s): United States Department of Health and Human Services. National Institutes of Health. Eunice Kennedy Shriver National Institute of Child Health and Human Development (R25 HD040525).

  8. United Nations Population Division

    • kaggle.com
    zip
    Updated Sep 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bhanupratap Biswas (2023). United Nations Population Division [Dataset]. https://www.kaggle.com/datasets/bhanupratapbiswas/united-nations-population-division
    Explore at:
    zip(78437 bytes)Available download formats
    Dataset updated
    Sep 12, 2023
    Authors
    Bhanupratap Biswas
    License

    ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
    License information was derived automatically

    Area covered
    United Nations
    Description

    The United Nations Population Division is a part of the United Nations Department of Economic and Social Affairs (UNDESA). Its primary mission is to provide timely and accurate demographic information and analysis to assist countries in making informed policy decisions related to population and development. The division produces a wide range of demographic data, reports, and publications, and it serves as a key source of information on global population trends.

    Some of the main functions and activities of the United Nations Population Division include:

    1. Data Collection and Analysis: The division collects and compiles data on population, fertility, mortality, migration, and other demographic variables from member states and other international sources. It analyzes this data to track global demographic trends and provides population estimates and projections.

    2. World Population Prospects: The division publishes the "World Population Prospects," which is a comprehensive set of demographic data and projections for countries around the world. This report is regularly updated and is widely used by governments, researchers, and policymakers.

    3. Demographic Research: The division conducts research on a wide range of demographic issues, including aging populations, urbanization, family planning, and more. This research helps to inform policies and programs aimed at addressing demographic challenges.

    4. Technical Assistance: The division provides technical assistance to countries in areas related to population and development, including capacity building, data collection, and analysis.

    5. Reports and Publications: The division produces a variety of reports, publications, and working papers on demographic topics. These resources are made available to the public and serve as valuable references for researchers and policymakers.

    6. Population Conferences: The United Nations Population Division plays a role in organizing and supporting international conferences and events related to population and development issues. These conferences provide a platform for countries to discuss and coordinate actions to address demographic challenges.

    Overall, the United Nations Population Division plays a crucial role in monitoring and understanding global demographic trends and supporting countries in their efforts to develop policies and programs that promote sustainable development and address population-related challenges.

  9. World Bank Indicators (1960‑Present)

    • kaggle.com
    zip
    Updated May 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    George DiNicola (2025). World Bank Indicators (1960‑Present) [Dataset]. https://www.kaggle.com/datasets/georgejdinicola/world-bank-indicators
    Explore at:
    zip(52559856 bytes)Available download formats
    Dataset updated
    May 29, 2025
    Authors
    George DiNicola
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Overview

    This dataset provides a comprehensive collection of time series data sourced from the World Bank Open Data Platform, covering a wide range of global indicators from 1960 to the most recently published year. It includes economic, social, environmental, and demographic metrics, making it an ideal resource for researchers, data scientists, and policymakers interested in global development trends, economic forecasting, or socio-economic analysis.

    A tutorial on how to combined the dataset topics together into one large dataset can be found here

    Why this Dataset?

    My motivation for this project was to curate a high-quality collection of datasets for World Bank indicators organized by topics and structured in time-series, making them more accessible for data science projects. Since the World Bank’s Kaggle datasets have not been updated since 2019 https://www.kaggle.com/organizations/theworldbank, I saw an opportunity to provide more current data for the data analysis community.

    Dataset Collection Contents

    This collection brings together more than 800 World Bank indicators organized into 18 topic‑specific CSV files. Each file is structured as a country‑year panel: every row represents a unique combination of year (1960‑present) and ISO‑3 country code, while the columns hold the topic’s indicators.

    The collection includes datasets with a variety of indicators, such as: - Economic Metrics: GDP growth (%), GDP per capita, consumer price inflation, merchandise trade, gross capital formation, and more.
    - Social Metrics: School enrollment (primary, secondary, tertiary), infant mortality rate, maternal mortality rate, poverty headcount, and more.
    - Environmental Metrics: Forest area, renewable energy consumption, food production indices, and more.
    - Demographic Metrics: Urban population, life expectancy, net migration, and more.

    Usage

    This dataset is ideal for a variety of applications, including: - Economic forecasting and trend analysis (e.g., GDP growth, inflation).
    - Socio-economic studies (e.g., education, health, poverty).
    - Environmental impact analysis (e.g., renewable energy adoption).
    - Demographic research (e.g., population trends, migration).

    Topic datasets can be merged with each other using year and country code. This tutorial with notebook code can help you get started quickly.

    Collection Methodology

    The data is collected via a custom software application that discovers and groups high-quality indicators with rules-based logic & artificial intelligence, generates metadata, and performs ETL for the data from the World Bank API. The result is a clean, up‑to‑date collection of World Bank indicators in time-series format that is ready for analysis—no manual downloads or data wrangling required.

    Modifications

    The original World Bank data has been aggregated and transformed for ease of use. Missing values have been preserved as provided by the World Bank, and no significant transformations have been applied beyond formatting and aggregation into a single file.

    Source & Attribution

    The World Bank: World Development Indicators

    This dataset is publicly available and sourced from the World Bank Open Data Platform and is made available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. When using this data, please attribute the World Bank as follows: "Data sourced from the World Bank, licensed under CC BY 4.0." For more details on the World Bank’s terms of use, visit: https://www.worldbank.org/en/about/legal/terms-of-use-for-datasets.

    License

    This dataset is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.

    Feel free to use this data in Kaggle notebooks, academic research, or policy analysis. If you create a derived dataset or analysis, I encourage you to share it with the Kaggle community.

  10. n

    Data for: Citizen science as an ecosystem of engagement: Implications for...

    • data.niaid.nih.gov
    • search.dataone.org
    • +1more
    zip
    Updated May 2, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bradley Allf (2022). Data for: Citizen science as an ecosystem of engagement: Implications for learning and broadening participation [Dataset]. http://doi.org/10.5061/dryad.0gb5mkm3k
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 2, 2022
    Dataset provided by
    North Carolina State University
    Authors
    Bradley Allf
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    The bulk of research on citizen science participants is project-centric, based on an assumption that volunteers experience a single project. Contrary to this assumption, survey responses (n=3,894) and digital trace data (n=3,649) from volunteers, who collectively engaged in 1,126 unique projects, revealed that multi-project participation was the norm. Only 23% of volunteers were singletons (who participated in only one project), and multi-project participants split evenly between disciplines specialists (39%) and discipline spanners (38% joined projects with different disciplinary topics), and unevenly between mode specialists (67%) and mode spanners (33% participated in online and offline projects). Public engagement was narrow: multi-project participants were eight times more likely to be white, and five times more likely to hold advanced degrees, than the general population. We propose a volunteer-centric framework that explores how the dynamic accumulation of experiences in a project ecosystem can support broad learning objectives and inclusive citizen science. Methods The purpose of this project was to collect data about volunteers who do citizen science projects, particilarly the number and type of projects that these participants do, and what demographic communities these volunteers represent. There were four data sources: digital trace data from the website "SciStarter.org," a survey distributed to SciStarter volunteers, a survey distributed to volunteers with the project "The Christmas Bird Count" and volunteers with the project "Candid Critters." We used this data to create a list of citizen science projects, which we categorized according to disciplinary topic (ecology, astronomy, etc.) and participation mode (online or offline). We then categorized each volunteer in our data source according to how many projects they did, and whether the project(s) they did were from multiple disciplinary topics and modes. Finally, we used regression to assess what demographics and other factors predicted joining multiple projects, joining projects from multiple disciplines, and joining projects from multiple modes.

  11. a

    Demographic and Health Survey 2000 - Armenia

    • microdata.armstat.am
    • catalog.ihsn.org
    • +1more
    Updated Oct 10, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Statistical Service (2019). Demographic and Health Survey 2000 - Armenia [Dataset]. https://microdata.armstat.am/index.php/catalog/1
    Explore at:
    Dataset updated
    Oct 10, 2019
    Dataset provided by
    National Statistical Service
    Ministry of Health
    Time period covered
    2000
    Area covered
    Armenia
    Description

    Abstract

    The Armenia Demographic and Health Survey (ADHS) was a nationally representative sample survey designed to provide information on population and health issues in Armenia. The primary goal of the survey was to develop a single integrated set of demographic and health data, the first such data set pertaining to the population of the Republic of Armenia. In addition to integrating measures of reproductive, child, and adult health, another feature of the DHS survey is that the majority of data are presented at the marz level.

    The ADHS was conducted by the National Statistical Service and the Ministry of Health of the Republic of Armenia during October through December 2000. ORC Macro provided technical support for the survey through the MEASURE DHS+ project. MEASURE DHS+ is a worldwide project, sponsored by the USAID, with a mandate to assist countries in obtaining information on key population and health indicators. USAID/Armenia provided funding for the survey. The United Nations Children’s Fund (UNICEF)/Armenia provided support through the donation of equipment.

    The ADHS collected national- and regional-level data on fertility and contraceptive use, maternal and child health, adult health, and AIDS and other sexually transmitted diseases. The survey obtained detailed information on these issues from women of reproductive age and, on certain topics, from men as well. Data are presented by marz wherever sample size permits.

    The ADHS results are intended to provide the information needed to evaluate existing social programs and to design new strategies for improving the health of and health services for the people of Armenia. The ADHS also contributes to the growing international database on demographic and health-related variables.

    Geographic coverage

    National

    Analysis unit

    • Household
    • Children under five years
    • Women age 15-49
    • Men age 15-54

    Kind of data

    Sample survey data

    Sampling procedure

    The sample was designed to provide estimates of most survey indicators (including fertility, abortion, and contraceptive prevalence) for Yerevan and each of the other ten administrative regions (marzes). The design also called for estimates of infant and child mortality at the national level for Yerevan and other urban areas and rural areas.

    The target sample size of 6,500 completed interviews with women age 15-49 was allocated as follows: 1,500 to Yerevan and 500 to each of the ten marzes. Within each marz, the sample was allocated between urban and rural areas in proportion to the population size. This gave a target sample of approximately 2,300 completed interviews for urban areas exclusive of Yerevan and 2,700 completed interviews for the rural sector. Interviews were completed with 6,430 women. Men age 15-54 were interviewed in every third household; this yielded 1,719 completed interviews.

    A two-stage sample was used. In the first stage, 260 areas or primary sampling units (PSUs) were selected with probability proportional to population size (PPS) by systematic selection from a list of areas. The list of areas was the 1996 Data Base of Addresses and Households constructed by the National Statistical Service. Because most selected areas were too large to be directly listed, a separate segmentation operation was conducted prior to household listing. Large selected areas were divided into segments of which two segments were included in the sample. A complete listing of households was then carried out in selected segments as well as selected areas that were not segmented.

    The listing of households served as the sampling frame for the selection of households in the second stage of sampling. Within each area, households were selected systematically so as to yield an average of 25 completed interviews with eligible women per area. All women 15-49 who stayed in the sampled households on the night before the interview were eligible for the survey. In each segment, a subsample of one-third of all households was selected for the men's component of the survey. In these households, all men 15-54 who stayed in the household on the previous night were eligible for the survey.

    Note: See detailed description of sample design in APPENDIX A of the survey report.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    Three questionnaires were used in the ADHS: a Household Questionnaire, a Women’s Questionnaire, and a Men’s Questionnaire. The questionnaires were based on the model survey instruments developed for the MEASURE DHS+ program. The model questionnaires were adapted for use during a series of expert meetings hosted by the Center of Perinatology, Obstetrics, and Gynecology. The questionnaires were developed in English and translated into Armenian and Russian. The questionnaires were pretested in July 2000.

    The Household Questionnaire was used to list all usual members of and visitors to a household and to collect information on the physical characteristics of the dwelling unit. The first part of the household questionnaire collected information on the age, sex, residence, educational attainment, and relationship to the household head of each household member or visitor. This information provided basic demographic data for Armenian households. It also was used to identify the women and men who were eligible for the individual interview (i.e., women 15-49 and men 15-54). The second part of the Household Questionnaire consisted of questions on housing characteristics (e.g., the flooring material, the source of water, and the type of toilet facilities) and on ownership of a variety of consumer goods.

    The Women’s Questionnaire obtained information on the following topics: - Background characteristics - Pregnancy history - Antenatal, delivery, and postnatal care - Knowledge and use of contraception - Attitudes toward contraception and abortion - Reproductive and adult health - Vaccinations, birth registration, and health of children under age five - Episodes of diarrhea and respiratory illness of children under age five - Breastfeeding and weaning practices - Height and weight of women and children under age five - Hemoglobin measurement of women and children under age five - Marriage and recent sexual activity - Fertility preferences - Knowledge of and attitude toward AIDS and other sexually transmitted infections.

    The Men’s Questionnaire focused on the following topics: - Background characteristics - Health - Marriage and recent sexual activity - Attitudes toward and use of condoms - Knowledge of and attitude toward AIDS and other sexually transmitted infections.

    Cleaning operations

    After a team had completed interviewing in a cluster, questionnaires were returned promptly to the National Statistical Service in Yerevan for data processing. The office editing staff first checked that questionnaires for all selected households and eligible respondents had been received from the field staff. In addition, a few questions that had not been precoded (e.g., occupation) were coded at this time. Using the ISSA (Integrated System for Survey Analysis) software, a specially trained team of data processing staff entered the questionnaires and edited the resulting data set on microcomputers. The process of office editing and data processing was initiated soon after the beginning of fieldwork and was completed by the end of January 2001.

    Response rate

    A total of 6,524 households were selected for the sample, of which 6,150 were occupied at the time of fieldwork. The main reason for the difference is that some of the dwelling units that were occupied during the household listing operation were either vacant or the household was away for an extended period at the time of interviewing. Of the occupied households, 97 percent were successfully interviewed.

    In these households, 6,685 women were identified as eligible for the individual interview (i.e., age 15-49). Interviews were completed with 96 percent of them. Of the 1,913 eligible men identified, 90 percent were successfully interviewed. The principal reason for non-response among eligible women and men was the failure to find them at home despite repeated visits to the household. The refusal rate was low.

    The overall response rates, the product of the household and the individual response rates, were 94 percent for women and 87 percent for men.

    Note: See summarized response rates by residence (urban/rural) in Table 1.1 of the survey report.

    Sampling error estimates

    The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors, and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2000 Armenia Demographic and Health Survey (ADHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.

    Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the ADHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey

  12. f

    Table_1_Operational Challenges in the Use of Structured Secondary Data for...

    • frontiersin.figshare.com
    • datasetcatalog.nlm.nih.gov
    docx
    Updated May 30, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kelsy N. Areco; Tulio Konstantyner; Paulo Bandiera-Paiva; Rita C. X. Balda; Daniela T. Costa-Nobre; Adriana Sanudo; Carlos Roberto V. Kiffer; Mandira D. Kawakami; Milton H. Miyoshi; Ana Sílvia Scavacini Marinonio; Rosa M. V. Freitas; Liliam C. C. Morais; Monica L. P. Teixeira; Bernadette Waldvogel; Maria Fernanda B. Almeida; Ruth Guinsburg (2023). Table_1_Operational Challenges in the Use of Structured Secondary Data for Health Research.DOCX [Dataset]. http://doi.org/10.3389/fpubh.2021.642163.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Frontiers
    Authors
    Kelsy N. Areco; Tulio Konstantyner; Paulo Bandiera-Paiva; Rita C. X. Balda; Daniela T. Costa-Nobre; Adriana Sanudo; Carlos Roberto V. Kiffer; Mandira D. Kawakami; Milton H. Miyoshi; Ana Sílvia Scavacini Marinonio; Rosa M. V. Freitas; Liliam C. C. Morais; Monica L. P. Teixeira; Bernadette Waldvogel; Maria Fernanda B. Almeida; Ruth Guinsburg
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Background: In Brazil, secondary data for epidemiology are largely available. However, they are insufficiently prepared for use in research, even when it comes to structured data since they were often designed for other purposes. To date, few publications focus on the process of preparing secondary data. The present findings can help in orienting future research projects that are based on secondary data.Objective: Describe the steps in the process of ensuring the adequacy of a secondary data set for a specific use and to identify the challenges of this process.Methods: The present study is qualitative and reports methodological issues about secondary data use. The study material was comprised of 6,059,454 live births and 73,735 infant death records from 2004 to 2013 of children whose mothers resided in the State of São Paulo - Brazil. The challenges and description of the procedures to ensure data adequacy were undertaken in 6 steps: (1) problem understanding, (2) resource planning, (3) data understanding, (4) data preparation, (5) data validation and (6) data distribution. For each step, procedures, and challenges encountered, and the actions to cope with them and partial results were described. To identify the most labor-intensive tasks in this process, the steps were assessed by adding the number of procedures, challenges, and coping actions. The highest values were assumed to indicate the most critical steps.Results: In total, 22 procedures and 23 actions were needed to deal with the 27 challenges encountered along the process of ensuring the adequacy of the study material for the intended use. The final product was an organized database for a historical cohort study suitable for the intended use. Data understanding and data preparation were identified as the most critical steps, accounting for about 70% of the challenges observed for data using.Conclusion: Significant challenges were encountered in the process of ensuring the adequacy of secondary health data for research use, mainly in the data understanding and data preparation steps. The use of the described steps to approach structured secondary data and the knowledge of the potential challenges along the process may contribute to planning health research.

  13. Data from: Invisible Structures of Segregation in the Metropolitan Region of...

    • search.datacite.org
    • scielo.figshare.com
    Updated Mar 24, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ernesto Friedrich De Lima Amaral; Camilo Vladimir De Lima Amaral (2021). Invisible Structures of Segregation in the Metropolitan Region of Goiânia [Dataset]. http://doi.org/10.6084/m9.figshare.11313119
    Explore at:
    Dataset updated
    Mar 24, 2021
    Dataset provided by
    DataCitehttps://www.datacite.org/
    SciELO journals
    Authors
    Ernesto Friedrich De Lima Amaral; Camilo Vladimir De Lima Amaral
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract This article aims to research the invisible structure of segregation of the Metropolitan Region of Goiânia (RMG), through the interrelations among processes of urban space production, urban planning principles, and population dynamics. These interrelations are fundamental to understanding the structure of socioeconomic inequalities in metropolitan areas. We analyzed the process of formation of segregation structure in the region, addressing the various urban plans developed for Goiânia. We provide a brief analysis of population indicators based on Demographic Census data from 1950 to 2010 and local indicators of spatial association in 2010 to characterize different dynamics in the region. We developed a critical analysis of these aspects, in order to identify and illustrate the main characteristics of the formation of this region in a summary diagram. We discussed how this overall spatial structure contributes to the reproduction of segregated social relations. Main results indicate that RMG does not have a simple centrality or a multi-centrality. There are a series of concentric rings with different types of centralities, which function in an integrated – but not inclusive – segregation system. We also identify research topics of social, demographic, and economic dynamics that would improve our understanding of spatial formation and urban planning in this region.

  14. S

    Supplementary materials for "Sample Representativeness in Psychological and...

    • scidb.cn
    Updated Mar 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Liu Weibiao; Chen Zhiyi; Hu Chuan-Peng (2024). Supplementary materials for "Sample Representativeness in Psychological and Brain Science Research" [Dataset]. http://doi.org/10.57760/sciencedb.17369
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 26, 2024
    Dataset provided by
    Science Data Bank
    Authors
    Liu Weibiao; Chen Zhiyi; Hu Chuan-Peng
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Psychological and brain science explore human behavior and the human brain by studying volunteers who participate in these studies. Given that the mind and behavior of participants are influenced by their own biological and social factors, the generalizability of findings in these fields largely depends on the representativeness of samples. However, the representativeness of samples in psychological and brain science has long been criticized as “WEIRD” (Western, Educated, Industrialized, Rich, and Democratic). In recent years, several meta-researches have surveyed the representativeness of samples in published studies from different sub-fields, but an overall understanding of the representativeness of samples in psychological and brain science is lacking. In this review, we analyze these meta-researches to provide a comprehensive perspective on the current state of sample representativeness. Two common issues emerged across these meta-researches. Firstly, the demographics of participants were incomplete in most of the published studies. Most psychological and brain science studies reported participants' gender, age, and country, but participants' race/ethnicity, education level, and socioeconomic status were far less reported. Other important demographics, such as rural/urban division, were not reported at all. Additionally, the reporting of these demographics has increased only slightly in recent years compared to decades ago. Thus, the under-reporting of demographic information in literature was largely unchanged. Secondly, based on the reported demographics, we found that samples in the field are far from being representative of the world population: most participants are young, highly educated Caucasian females in Western countries; middle-aged and older, less educated, colored people in and outside Western countries are less likely to be studied. In terms of countries, Southeast Asian, African, Latin American, and Middle Eastern countries appear fewer in psychological and brain science research.These two issues may be due to the following reasons: convenience sampling dominates psychological and brain science; Western researchers dominate the field of psychology and brain science, with most of the editors-in-chief, editorial board members, and authors coming from Europe and America; psychology and brain science undervalued the effect of socioeconomic and cultural factors; and researchers mistakenly believe that findings from Western participants can be generalized to all human beings. Addressing the issue of sample representativeness in psychological and brain sciences requires a concerted effort by researchers, academic societies, journals, and funding agencies: Researchers should collect and report detailed demographic information about participants, state the limitations of generalizability, and use sampling methods that can increase representativeness whenever possible (e.g., probability sampling); academic societies should pay attention to the representativeness issues by organizing more academic symposium or workshops on this topic; journals should increase the representativeness of editorial board members and encourage more rigorous research with samples from underrepresented groups or studies that examine the generalizability of important findings; funding agencies can encourage researchers to pay more attention to study groups from underrepresented countries, and provide financial support for studying hard-to-research population. Improving sample representativeness will enhance the value of applying psychological and brain science knowledge in real-life settings and promote the building of a community with a shared future for mankind.

  15. p

    Demographic Health Survey 2007 - Nauru

    • microdata.pacificdata.org
    Updated Aug 18, 2013
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nauru Bureau of Statistics (2013). Demographic Health Survey 2007 - Nauru [Dataset]. https://microdata.pacificdata.org/index.php/catalog/25
    Explore at:
    Dataset updated
    Aug 18, 2013
    Dataset authored and provided by
    Nauru Bureau of Statistics
    Time period covered
    2007
    Area covered
    Nauru
    Description

    Abstract

    The main objective of a demographic household survey (DHS) is to provide estimates of a number of basic demographic and health variables. This is done through interviews with a scientifically selected probability sample that is chosen from a well-defined population.

    The 2007 Nauru Demographic and Health Survey (2007 NDHS) was one of four pilot demographic and health surveys conducted in the Pacific under an Asian Development Bank ADB/ Secretariat of the Pacific Community (SPC) Regional DHS Pilot Project. The primary objective of this survey was to provide up-to-date information for policy-makers, planners, researchers and programme managers, for use in planning, implementing, monitoring and evaluating population and health programmes within the country. The survey was intended to provide key estimates of Nauru's demographics and health situation. The findings of the 2007 NDHS are very important in measuring the achievements of family planning and other health programmes. To ensure better understanding and use of these data, the results of this survey should be widely disseminated at different planning levels. Different dissemination techniques will be used to reach different segments of society.

    The primary purpose of the 2007 NDHS was to furnish policy-makers and planners with detailed information on fertility, family planning, infant and child mortality, maternal and child health, nutrition, and knowledge of HIV and AIDS and other sexually transmitted infections.

    NOTE: The only dissemination used was wide distribution of the report. A planned data use workshop was not undertaken. Hence there is some misconceptions and lack of awareness on the results obtained from the survey. The report is provided on the NBOS website free for download.

    Geographic coverage

    National Coverage - Districts

    Analysis unit

    • Households
    • Children (0-14yrs)
    • Individual women of reproductive age (15-49 yrs)
    • Individual men of reproductive age (15yrs+)
    • Facilities providing reproductive and child health services

    Universe

    The survey covered all household members (usual residents), - All children (aged 0-14 years) resident in the household - All women of reproductive age (15-49 years) resident in all household - All males (15yrs and above) in every second household (approx. 50%) resident in selected household

    Results: The 2007 Nauru Demographic Health Survey (2007 NDHS) is a nationally representative survey of 655 eligible women (aged 15-49) and 392 eligible men (aged 15 and above).

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    IDG NOTES: Locate sampling documentation with SPC (Graeme Brown) and internal files. Add in this sections. Or second option dilute appendix A Sampling and extract key issues.

    ESTIMATES OF SAMPLING ERRORS - Refer to Appendix A of final NDHS2007 report or; - External Resources - 2007 DHS- Appendix A and B Sampling (to be created separatedly by IDG progress ongoing)

    Sampling deviation

    IDG NOTES: Locate sampling documentation with Macro and internal files. Add in this section. Or second option dilute appendix B Sampling and extract key issues.

    ESTIMATES OF SAMPLING ERRORS - Refer to Appendix B of final NDHS2007 report or;

    • External Resources
      • 2007 DHS- Appendix A and B Sampling (to be created separatedly by IDG progress ongoing)

    Extract:

    In the 2007 NDHS Report of the survey results, sampling errors for selected variables have been presented in a tabular format. The sampling error tables should include:

    .. Variable name

    R: Value of the estimate; SE: Sampling error of the estimate; N: Unweighted number of cases on which the estimate is based; WN: Weighted number of cases; DEFT: Design effect value that compensates for the loss of precision that results from using cluster rather than simple random sampling; SE/R: Relative standard error (i.e. ratio of the sampling error to the value estimate); R-2SE: Lower limit of the 95% confidence interval; R+2SE: Upper limit of the 95% confidence interval (never >1.000 for a proportion).

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    DHS questionnaire for women cover the following sections:

    • Background characteristics (age, education, religion, etc)
    • Reproductive history
    • Knowledge and use of contraception methods
    • Antenatal care, delivery care and postnatal care
    • Breastfeeding and infant feeding
    • Immunization, child health and nutrition
    • Marriage and recent sexual activity
    • Fertility preferences
    • Knowledge about HIV/AIDS and other sexually transmitted infections
    • Husbands background and women's work

    The men's questionnaire covers the same except for sections 4, 5, 6 which are not applicable to men.

    It was also recognized that some countries have a need for special information that is not contained in the core questionnaire. Separate questionnaire modules were developed on a series of topics. These topics are optional and include:

    • maternal mortality
    • pill-taking behaviour
    • sterilization experience
    • children's education
    • women's status
    • domestic violence
    • health expenditures
    • consanguinity

    The Papua New Guinea (PNG) questionnaire was proposed for Nauru to adapt as in comparison to the existing DHS model, this is not as lengthy and time-consuming. The PNG questionnaire also dealt with high incidence of alcohol and tobacco in Nauru. Questions on HIV/AIDS and STI knowledge were included in the men's questionnaire where it was not included in the PNG questionnaire.

    Response rate

    IDG NOTES: Locate response rate documentation with SPC (Graeme Brown) and internal files. Add in this sections.

  16. d

    FReDA – The German Family Demography Panel Study

    • da-ra.de
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Bujard; Tobias Gummer; Karsten Hank; Franz J. Neyer; Reinhard Pollak; Norbert F. Schneider; C. Katharina Spieß; Christof Wolf (2023). FReDA – The German Family Demography Panel Study [Dataset]. http://doi.org/10.4232/1.14065
    Explore at:
    Dataset updated
    May 31, 2023
    Dataset provided by
    GESIS
    da|ra
    Authors
    Martin Bujard; Tobias Gummer; Karsten Hank; Franz J. Neyer; Reinhard Pollak; Norbert F. Schneider; C. Katharina Spieß; Christof Wolf
    Time period covered
    Apr 7, 2021 - Jun 29, 2021
    Description

    Persons aged 18 to 49 living in private households at the timepoint of survey

  17. Facebook users worldwide 2017-2027

    • statista.com
    • de.statista.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stacy Jo Dixon, Facebook users worldwide 2017-2027 [Dataset]. https://www.statista.com/topics/1164/social-networks/
    Explore at:
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Stacy Jo Dixon
    Description

    The global number of Facebook users was forecast to continuously increase between 2023 and 2027 by in total 391 million users (+14.36 percent). After the fourth consecutive increasing year, the Facebook user base is estimated to reach 3.1 billion users and therefore a new peak in 2027. Notably, the number of Facebook users was continuously increasing over the past years. User figures, shown here regarding the platform Facebook, have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period and count multiple accounts by persons only once.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).

  18. w

    Azerbaijan - Demographic and Health Survey 2006 - Dataset - waterdata

    • wbwaterdata.org
    Updated Mar 16, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). Azerbaijan - Demographic and Health Survey 2006 - Dataset - waterdata [Dataset]. https://wbwaterdata.org/dataset/azerbaijan-demographic-and-health-survey-2006
    Explore at:
    Dataset updated
    Mar 16, 2020
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Azerbaijan
    Description

    The 2006 Azerbaijan Demographic and Health Survey (2006 AzDHS) is a nationally representative sample survey designed to provide information on population and health issues in Azerbaijan. The primary goal of the survey was to develop a single integrated set of demographic and health data pertaining to the population of the Republic of Azerbaijan. The 2006 AzDHS was conducted from July to November by the State Statistical Committee (SSC) of the Republic of Azerbaijan. Macro International Inc. provided technical support for the survey through the MEASURE DHS project. USAID Caucasus, Azerbaijan provided funding for the survey through the MEASURE DHS project. MEASURE DHS is sponsored by the United States Agency for International Development (USAID) to assist countries worldwide in obtaining information on key population and health indicators. The UNICEF/Azerbaijan country office was instrumental for political mobilization during the early stages of the 2006 AzDHS negotiation with the Government of Azerbaijan and also supported the survey through in-kind contributions. The 2006 AzDHS collected national- and regional-level data on fertility and contraceptive use, maternal and child health, adult health, tuberculosis, and HIV/AIDS and other sexually transmitted diseases. The survey obtained detailed information on these issues from women of reproductive age and, on certain topics, from men as well. The 2006 AzDHS results are intended to provide the information needed to evaluate existing social programs and to design new strategies for improving the health of Azerbaijanis and health services for the people of Azerbaijan. The 2006 AzDHS also contributes to the growing international database on demographic and health-related variables.

  19. i

    Demographic and Health Survey 1987 - Thailand

    • catalog.ihsn.org
    • datacatalog.ihsn.org
    • +1more
    Updated Mar 29, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Institute of Population Studies (IPS) (2019). Demographic and Health Survey 1987 - Thailand [Dataset]. https://catalog.ihsn.org/catalog/2489
    Explore at:
    Dataset updated
    Mar 29, 2019
    Dataset authored and provided by
    Institute of Population Studies (IPS)
    Time period covered
    1987
    Area covered
    Thailand
    Description

    Abstract

    The Thai Demographic and Health Survey (TDHS) was a nationally representative sample survey conducted from March through June 1988 to collect data on fertility, family planning, and child and maternal health. A total of 9,045 households and 6,775 ever-married women aged 15 to 49 were interviewed. Thai Demographic and Health Survey (TDHS) is carried out by the Institute of Population Studies (IPS) of Chulalongkorn University with the financial support from USAID through the Institute for Resource Development (IRD) at Westinghouse. The Institute of Population Studies was responsible for the overall implementation of the survey including sample design, preparation of field work, data collection and processing, and analysis of data. IPS has made available its personnel and office facilities to the project throughout the project duration. It serves as the headquarters for the survey.

    The Thai Demographic and Health Survey (TDHS) was undertaken for the main purpose of providing data concerning fertility, family planning and maternal and child health to program managers and policy makers to facilitate their evaluation and planning of programs, and to population and health researchers to assist in their efforts to document and analyze the demographic and health situation. It is intended to provide information both on topics for which comparable data is not available from previous nationally representative surveys as well as to update trends with respect to a number of indicators available from previous surveys, in particular the Longitudinal Study of Social Economic and Demographic Change in 1969-73, the Survey of Fertility in Thailand in 1975, the National Survey of Family Planning Practices, Fertility and Mortality in 1979, and the three Contraceptive Prevalence Surveys in 1978/79, 1981 and 1984.

    Geographic coverage

    National

    Analysis unit

    • Household
    • Women age 15-49

    Universe

    The population covered by the 1987 THADHS is defined as the universe of all women Ever-married women in the reproductive ages (i.e., women 15-49). This covered women in private households on the basis of a de facto coverage definition. Visitors and usual residents who were in the household the night before the first visit or before any subsequent visit during the few days the interviewing team was in the area were eligible. Excluded were the small number of married women aged under 15 and women not present in private households.

    Kind of data

    Sample survey data

    Sampling procedure

    SAMPLE SIZE AND ALLOCATION

    The objective of the survey was to provide reliable estimates for major domains of the country. This consisted of two overlapping sets of reporting domains: (a) Five regions of the country namely Bangkok, north, northeast, central region (excluding Bangkok), and south; (b) Bangkok versus all provincial urban and all rural areas of the country. These requirements could be met by defining six non-overlapping sampling domains (Bangkok, provincial urban, and rural areas of each of the remaining 4 regions), and allocating approximately equal sample sizes to them. On the basis of past experience, available budget and overall reporting requirement, the target sample size was fixed at 7,000 interviews of ever-married women aged 15-49, expected to be found in around 9,000 households. Table A.I shows the actual number of households as well as eligible women selected and interviewed, by sampling domain (see Table i.I for reporting domains).

    THE FRAME AND SAMPLE SELECTION

    The frame for selecting the sample for urban areas, was provided by the National Statistical Office of Thailand and by the Ministry of the Interior for rural areas. It consisted of information on population size of various levels of administrative and census units, down to blocks in urban areas and villages in rural areas. The frame also included adequate maps and descriptions to identify these units. The extent to which the data were up-to-date as well as the quality of the data varied somewhat in different parts of the frame. Basically, the multi-stage stratified sampling design involved the following procedure. A specified number of sample areas were selected systematically from geographically/administratively ordered lists with probabilities proportional to the best available measure of size (PPS). Within selected areas (blocks or villages) new lists of households were prepared and systematic samples of households were selected. In principle, the sampling interval for the selection of households from lists was determined so as to yield a self weighting sample of households within each domain. However, in the absence of good measures of population size for all areas, these sampling intervals often required adjustments in the interest of controlling the size of the resulting sample. Variations in selection probabilities introduced due to such adjustment, where required, were compensated for by appropriate weighting of sample cases at the tabulation stage.

    SAMPLE OUTCOME

    The final sample of households was selected from lists prepared in the sample areas. The time interval between household listing and enumeration was generally very short, except to some extent in Bangkok where the listing itself took more time. In principle, the units of listing were the same as the ultimate units of sampling, namely households. However in a small proportion of cases, the former differed from the latter in several respects, identified at the stage of final enumeration: a) Some units listed actually contained more than one household each b) Some units were "blanks", that is, were demolished or not found to contain any eligible households at the time of enumeration. c) Some units were doubtful cases in as much as the household was reported as "not found" by the interviewer, but may in fact have existed.

    Mode of data collection

    Face-to-face

    Research instrument

    The DHS core questionnaires (Household, Eligible Women Respondent, and Community) were translated into Thai. A number of modifications were made largely to adapt them for use with an ever- married woman sample and to add a number of questions in areas that are of special interest to the Thai investigators but which were not covered in the standard core. Examples of such modifications included adding marital status and educational attainment to the household schedule, elaboration on questions in the individual questionnaire on educational attainment to take account of changes in the educational system during recent years, elaboration on questions on postnuptial residence, and adaptation of the questionnaire to take into account that only ever-married women are being interviewed rather than all women. More generally, attention was given to the wording of questions in Thai to ensure that the intent of the original English-language version was preserved.

    a) Household questionnaire

    The household questionnaire was used to list every member of the household who usually lives in the household and as well as visitors who slept in the household the night before the interviewer's visit. Information contained in the household questionnaire are age, sex, marital status, and education for each member (the last two items were asked only to members aged 13 and over). The head of the household or the spouse of the head of the household was the preferred respondent for the household questionnaire. However, if neither was available for interview, any adult member of the household was accepted as the respondent. Information from the household questionnaire was used to identify eligible women for the individual interview. To be eligible, a respondent had to be an ever-married woman aged 15-49 years old who had slept in the household 'the previous night'.

    Prior evidence has indicated that when asked about current age, Thais are as likely to report age at next birthday as age at last birthday (the usual demographic definition of age). Since the birth date of each household number was not asked in the household questionnaire, it was not possible to calculate age at last birthday from the birthdate. Therefore a special procedure was followed to ensure that eligible women just under the higher boundary for eligible ages (i.e. 49 years old) were not mistakenly excluded from the eligible woman sample because of an overstated age. Ever-married women whose reported age was between 50-52 years old and who slept in the household the night before birthdate of the woman, it was discovered that these women (or any others being interviewed) were not actually within the eligible age range of 15-49, the interview was terminated and the case disqualified. This attempt recovered 69 eligible women who otherwise would have been missed because their reported age was over 50 years old or over.

    b) Individual questionnaire

    The questionnaire administered to eligible women was based on the DHS Model A Questionnaire for high contraceptive prevalence countries. The individual questionnaire has 8 sections: - Respondent's background - Reproduction - Contraception - Health and breastfeeding - Marriage - Fertility preference - Husband's background and woman's work - Heights and weights of children and mothers

    The questionnaire was modified to suit the Thai context. As noted above, several questions were added to the standard DHS core questionnaire not only to meet the interest of IPS researchers hut also because of their relevance to the current demographic situation in Thailand. The supplemental questions are marked with an asterisk in the individual questionnaire. Questions concerning the following items were added in the individual questionnaire: - Did the respondent ever

  20. Dataset - Understanding the software and data used in the social sciences

    • eprints.soton.ac.uk
    Updated Mar 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chue Hong, Neil; Aragon, Selina; Antonioletti, Mario; Walker, Johanna (2023). Dataset - Understanding the software and data used in the social sciences [Dataset]. http://doi.org/10.5281/zenodo.7785710
    Explore at:
    Dataset updated
    Mar 30, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Chue Hong, Neil; Aragon, Selina; Antonioletti, Mario; Walker, Johanna
    Description

    This is a repository for a UKRI Economic and Social Research Council (ESRC) funded project to understand the software used to analyse social sciences data. Any software produced has been made available under a BSD 2-Clause license and any data and other non-software derivative is made available under a CC-BY 4.0 International License. Note that the software that analysed the survey is provided for illustrative purposes - it will not work on the decoupled anonymised data set. Exceptions to this are: Data from the UKRI ESRC is mostly made available under a CC BY-NC-SA 4.0 Licence. Data from Gateway to Research is made available under an Open Government Licence (Version 3.0). Contents Survey data & analysis: esrc_data-survey-analysis-data.zip Other data: esrc_data-other-data.zip Transcripts: esrc_data-transcripts.zip Data Management Plan: esrc_data-dmp.zip Survey data & analysis The survey ran from 3rd February 2022 to 6th March 2023 during which 168 responses were received. Of these responses, three were removed because they were supplied by people from outside the UK without a clear indication of involvement with the UK or associated infrastructure. A fourth response was removed as both came from the same person which leaves us with 164 responses in the data. The survey responses, Question (Q) Q1-Q16, have been decoupled from the demographic data, Q17-Q23. Questions Q24-Q28 are for follow-up and have been removed from the data. The institutions (Q17) and funding sources (Q18) have been provided in a separate file as this could be used to identify respondents. Q17, Q18 and Q19-Q23 have all been independently shuffled. The data has been made available as Comma Separated Values (CSV) with the question number as the header of each column and the encoded responses in the column below. To see what the question and the responses correspond to you will have to consult the survey-results-key.csv which decodes the question and responses accordingly. A pdf copy of the survey questions is available on GitHub. The survey data has been decoupled into: survey-results-key.csv - maps a question number and the responses to the actual question values. q1-16-survey-results.csv- the non-demographic component of the survey responses (Q1-Q16). q19-23-demographics.csv - the demographic part of the survey (Q19-Q21, Q23). q17-institutions.csv - the institution/location of the respondent (Q17). q18-funding.csv - funding sources within the last 5 years (Q18). Please note the code that has been used to do the analysis will not run with the decoupled survey data. Other data files included CleanedLocations.csv - normalised version of the institutions that the survey respondents volunteered. DTPs.csv - information on the UKRI Doctoral Training Partnerships (DTPs) scaped from the UKRI DTP contacts web page in October 2021. projectsearch-1646403729132.csv.gz - data snapshot from the UKRI Gateway to Research released on the 24th February 2022 made available under an Open Government Licence. locations.csv - latitude and longitude for the institutions in the cleaned locations. subjects.csv - research classifications for the ESRC projects for the 24th February data snapshot. topics.csv - topic classification for the ESRC projects for the 24th February data snapshot. Interview transcripts The interview transcripts have been anonymised and converted to markdown so that it's easier to process in general. List of interview transcripts: 1269794877.md 1578450175.md 1792505583.md 2964377624.md 3270614512.md 40983347262.md 4288358080.md 4561769548.md 4938919540.md 5037840428.md 5766299900.md 5996360861.md 6422621713.md 6776362537.md 7183719943.md 7227322280.md 7336263536.md 75909371872.md 7869268779.md 8031500357.md 9253010492.md Data Management Plan The study's Data Management Plan is provided in PDF format and shows the different data sets used throughout the duration of the study and where they have been deposited, as well as how long the SSI will keep these records.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
General Statistical Office (GSO) (2023). Demographic and Health Survey 2002 - Viet Nam [Dataset]. https://microdata.worldbank.org/index.php/catalog/1518

Demographic and Health Survey 2002 - Viet Nam

Explore at:
Dataset updated
Oct 26, 2023
Dataset authored and provided by
General Statistical Office (GSO)
Time period covered
2002
Area covered
Vietnam
Description

Abstract

The 2002 Vietnam Demographic and Health Survey (VNDHS 2002) is a nationally representative sample survey of 5,665 ever-married women age 15-49 selected from 205 sample points (clusters) throughout Vietnam. It provides information on levels of fertility, family planning knowledge and use, infant and child mortality, and indicators of maternal and child health. The survey included a Community/ Health Facility Questionnaire that was implemented in each of the sample clusters.

The survey was designed to measure change in reproductive health indicators over the five years since the VNDHS 1997, especially in the 18 provinces that were targeted in the Population and Family Health Project of the Committee for Population, Family and Children. Consequently, all provinces were separated into “project” and “nonproject” groups to permit separate estimates for each. Data collection for the survey took place from 1 October to 21 December 2002.

The Vietnam Demographic and Health Survey 2002 (VNDHS 2002) was the third DHS in Vietnam, with prior surveys implemented in 1988 and 1997. The VNDHS 2002 was carried out in the framework of the activities of the Population and Family Health Project of the Committee for Population, Family and Children (previously the National Committee for Population and Family Planning).

The main objectives of the VNDHS 2002 were to collect up-to-date information on family planning, childhood mortality, and health issues such as breastfeeding practices, pregnancy care, vaccination of children, treatment of common childhood illnesses, and HIV/AIDS, as well as utilization of health and family planning services. The primary objectives of the survey were to estimate changes in family planning use in comparison with the results of the VNDHS 1997, especially on issues in the scope of the project of the Committee for Population, Family and Children.

VNDHS 2002 data confirm the pattern of rapidly declining fertility that was observed in the VNDHS 1997. It also shows a sharp decline in child mortality, as well as a modest increase in contraceptive use. Differences between project and non-project provinces are generally small.

Geographic coverage

The 2002 Vietnam Demographic and Health Survey (VNDHS 2002) is a nationally representative sample survey. The VNDHS 1997 was designed to provide separate estimates for the whole country, urban and rural areas, for 18 project provinces and the remaining nonproject provinces as well. Project provinces refer to 18 focus provinces targeted for the strengthening of their primary health care systems by the Government's Population and Family Health Project to be implemented over a period of seven years, from 1996 to 2002 (At the outset of this project there were 15 focus provinces, which became 18 by the creation of 3 new provinces from the initial set of 15). These provinces were selected according to criteria based on relatively low health and family planning status, no substantial family planning donor presence, and regional spread. These criteria resulted in the selection of the country's poorer provinces. Nine of these provinces have significant proportions of ethnic minorities among their population.

Analysis unit

  • Household
  • Women age 15-49

Universe

The population covered by the 2002 VNDHS is defined as the universe of all women age 15-49 in Vietnam.

Kind of data

Sample survey data

Sampling procedure

The sample for the VNDHS 2002 was based on that used in the VNDHS 1997, which in turn was a subsample of the 1996 Multi-Round Demographic Survey (MRS), a semi-annual survey of about 243,000 households undertaken regularly by GSO. The MRS sample consisted of 1,590 sample areas known as enumeration areas (EAs) spread throughout the 53 provinces/cities of Vietnam, with 30 EAs in each province. On average, an EA comprises about 150 households. For the VNDHS 1997, a subsample of 205 EAs was selected, with 26 households in each urban EA and 39 households for each rural EA. A total of 7,150 households was selected for the survey. The VNDHS 1997 was designed to provide separate estimates for the whole country, urban and rural areas, for 18 project provinces and the remaining nonproject provinces as well. Because the main objective of the VNDHS 2002 was to measure change in reproductive health indicators over the five years since the VNDHS 1997, the sample design for the VNDHS 2002 was as similar as possible to that of the VNDHS 1997.

Although it would have been ideal to have returned to the same households or at least the same sample points as were selected for the VNDHS 1997, several factors made this undesirable. Revisiting the same households would have held the sample artificially rigid over time and would not allow for newly formed households. This would have conflicted with the other major survey objective, which was to provide up-to-date, representative data for the whole of Vietnam. Revisiting the same sample points that were covered in 1997 was complicated by the fact that the country had conducted a population census in 1999, which allowed for a more representative sample frame.

In order to balance the two main objectives of measuring change and providing representative data, it was decided to select enumeration areas from the 1999 Population Census, but to cover the same communes that were sampled in the VNDHS 1997 and attempt to obtain a sample point as close as possible to that selected in 1997. Consequently, the VNDHS 2002 sample also consisted of 205 sample points and reflects the oversampling in the 20 provinces that fall in the World Bank-supported Population and Family Health Project. The sample was designed to produce about 7,000 completed household interviews and 5,600 completed interviews with ever-married women age 15-49.

Mode of data collection

Face-to-face

Research instrument

As in the VNDHS 1997, three types of questionnaires were used in the 2002 survey: the Household Questionnaire, the Individual Woman's Questionnaire, and the Community/Health Facility Questionnaire. The first two questionnaires were based on the DHS Model A Questionnaire, with additions and modifications made during an ORC Macro staff visit in July 2002. The questionnaires were pretested in two clusters in Hanoi (one in a rural area and another in an urban area). After the pretest and consultation with ORC Macro, the drafts were revised for use in the main survey.

a) The Household Questionnaire was used to enumerate all usual members and visitors in selected households and to collect information on age, sex, education, marital status, and relationship to the head of household. The main purpose of the Household Questionnaire was to identify persons who were eligible for individual interview (i.e. ever-married women age 15-49). In addition, the Household Questionnaire collected information on characteristics of the household such as water source, type of toilet facilities, material used for the floor and roof, and ownership of various durable goods.

b) The Individual Questionnaire was used to collect information on ever-married women aged 15-49 in surveyed households. These women were interviewed on the following topics:
- Respondent's background characteristics (education, residential history, etc.); - Reproductive history; - Contraceptive knowledge and use;
- Antenatal and delivery care; - Infant feeding practices; - Child immunization; - Fertility preferences and attitudes about family planning; - Husband's background characteristics; - Women's work information; and - Knowledge of AIDS.

c) The Community/Health Facility Questionnaire was used to collect information on all communes in which the interviewed women lived and on services offered at the nearest health stations. The Community/Health Facility Questionnaire consisted of four sections. The first two sections collected information from community informants on some characteristics such as the major economic activities of residents, distance from people's residence to civic services and the location of the nearest sources of health care. The last two sections involved visiting the nearest commune health centers and intercommune health centers, if these centers were located within 30 kilometers from the surveyed cluster. For each visited health center, information was collected on the type of health services offered and the number of days services were offered per week; the number of assigned staff and their training; medical equipment and medicines available at the time of the visit.

Cleaning operations

The first stage of data editing was implemented by the field editors soon after each interview. Field editors and team leaders checked the completeness and consistency of all items in the questionnaires. The completed questionnaires were sent to the GSO headquarters in Hanoi by post for data processing. The editing staff of the GSO first checked the questionnaires for completeness. The data were then entered into microcomputers and edited using a software program specially developed for the DHS program, the Census and Survey Processing System, or CSPro. Data were verified on a 100 percent basis, i.e., the data were entered separately twice and the two results were compared and corrected. The data processing and editing staff of the GSO were trained and supervised for two weeks by a data processing specialist from ORC Macro. Office editing and processing activities were initiated immediately after the beginning of the fieldwork and were completed in late December 2002.

Response rate

The results of the household and individual

Search
Clear search
Close search
Google apps
Main menu