The population and housing census (PHC) is the unique source of reliable and comprehensive data about the size of population and also on major socio-economic & socio-demographic characteristics of the country. It provides data on geographic and administrative distribution of population and household in addition to the demographic and socio-economic characteristics of all the people in the country. Generally, it provides for comparing and projecting demographic data, social and economic characteristics, as well as household and housing conditions at all levels of the country’s administrative units and dimensions: national, regional, districts and localities. The data from the census is classified, tabulated and disseminated so that researchers, administrators, policy makers and development partners can use the information in formulating and implementing various multi-sectorial development programs at the national and community levels. Data on all key variables namely area, household, population, economic activity, literacy and education, fertility and child survival, housing conditions and sanitation are collected and available in the census data. The 2021 PHC in Ghana had an overarching goal of generating updated demographic, social and economic data, housing characteristics and dwelling conditions to support national development planning activities.
National coverage, Region, District
All persons who spent census night (midnight of 27th June 2021) in Ghana
Census/enumeration data [cen]
This 10% sample of the 2021 PHC data is representative at the district/sub-district level and by urban/rural classification.
Computer Assisted Personal Interview [capi]
GSS developed two categories of instruments for the 2021 PHC: the listing form and the enumeration instruments. The listing form was only one, while the enumeration instruments comprised six questionnaires, designated as PHC 1A, PHC 1B, PHC 1C, PHC 1D, PHC 1E and PHC 1F. The PHC 1A was the most comprehensive with the others being its subsets.
Listing Form: The listing form was developed to collect data on type of structures, level of completion, whether occupied or vacant and use(s) of the structures. It was also used to collect information about the availability, number and types of toilet facilities in the structures. It was also used to capture the number of households in a structure, number of persons in households and the sex of the persons residing in the households if occupied. Finally, the listing form was used to capture data on non-household populations such as the population in institutions, floating population and sex of the non-household populations.
PHC 1A: The PHC 1A questionnaire was used to collect data from all households in the country. Primarily, it was used to capture household members and visitors who spent the Census Night in the dwelling of the household, and their relationship with the head of the household. It was also used to collect data on homeless households. Members of the households who were absent were enumerated at the place where they had spent the Census Night. The questionnaire was also used to collect the following household information: emigration; socio-demographic characteristics (sex, age, place of birth and enumeration, survival status of parents); literacy and education; economic activities; difficulty in performing activities; ownership and usage of information, technology and communication facilities; fertility; mortality; housing characteristics and conditions; and sanitation.
PHC 1B: The PHC 1B questionnaire was used to collect data from persons in stable institutions comprising boarding houses, hostels and prisons who were present on Census Night. Other information captured with this instrument included socio-demographic characteristics; literacy and education; economic activities; difficulty in performing activities; ownership and usage of information, technology and communication facilities; fertility; mortality; housing characteristics and conditions; and sanitation.
PHC 1C: The PHC 1C questionnaire was used to collect data from persons in “unstable” institutions such as hospitals and prayer camps who were present at these places on Census Night. The instrument was used to capture only the socio-demographic characteristics of individuals.
PHC 1D: The PHC 1D questionnaire was used to collect data from the floating population. This constitutes persons who were found at airports, seaports, lorry stations and similar locations waiting for or embarking on long-distance travel, as well as outdoor sleepers on Census Night. The instrument captured the socio-demographic information of individuals.
PHC 1E: All persons who spent the Census Night at hotels, motels and guest houses were enumerated using the PHC 1E. The content of the questionnaire was similar to that of the PHC 1D.
PHC 1F: The PHC 1F questionnaire was administered to diplomats in the country.
Census data editing was implemented at three levels:
1. Data editing by enumerators and supervisors during data collection.
2. Data editing at the regional level by the regional data quality monitors during data collection.
3. Final data editing at the national level using batch edits in CSPro and STATA.
Data editing and cleaning was mainly digital.
100 percent
A Post Enumeration Survey (PES) was conducted to assess the extent of coverage and content error.
https://borealisdata.ca/api/datasets/:persistentId/versions/11.2/customlicense?persistentId=doi:10.5683/SP3/8PUZQA
Note: The data release is complete as of August 14th, 2023.
1. (Added April 4th) Canada and Census Divisions = Early April 2023
2. (Added May 1st) Ontario, British Columbia, and Alberta Census Subdivisions (CSDs) = Late April 2023
3a. (Added June 8th) Manitoba and Saskatchewan CSDs
3b. (Added June 12th) Quebec CSDs = June 12th 2023
4. (Added June 30th) Newfoundland and Labrador, Prince Edward Island, New Brunswick, and Nova Scotia CSDs = Early July 2023
5. (Added August 14th) Yukon, Northwest Territories, and Nunavut CSDs = Early August 2023
For more information, please visit HART.ubc.ca.
Housing Assessment Resource Tools (HART)
This dataset contains 18 tables which draw upon data from the 2021 Census of Canada. The tables are a custom order and contain data pertaining to core housing need and characteristics of households. 17 of the tables each cover a different geography in Canada: one for Canada as a whole, one for all Canadian census divisions (CD), and 15 for all census subdivisions (CSD) across Canada. The last table contains the median income for all geographies. Statistics Canada used these median incomes as the "area median household income (AMHI)," from which they derived some of the data fields within the Shelter Costs/Household Income dimension.
Included alongside the data tables is a guide to HART's housing need assessment methodology. This guide is intended to support independent use of HART's custom data, both to allow transparent verification of our analysis and to support efforts to use the data for analysis beyond what HART did. There are many data fields in the data order that we did not use that may be of value for others.
The dataset is in Beyond 20/20 (.ivt) format. The Beyond 20/20 browser is required in order to open it. This software can be freely downloaded from the Statistics Canada website: https://www.statcan.gc.ca/eng/public/beyond20-20 (Windows only).
For information on how to use Beyond 20/20, please see:
http://odesi2.scholarsportal.info/documentation/Beyond2020/beyond20-quickstart.pdf
https://wiki.ubc.ca/Library:Beyond_20/20_Guide
Custom order from Statistics Canada includes the following dimensions and data fields:
Geography:
- Country of Canada, all CDs & Country as a whole
- All 10 Provinces (Newfoundland, Prince Edward Island (PEI), Nova Scotia, New Brunswick, Quebec, Ontario, Manitoba, Saskatchewan, Alberta, and British Columbia), all CSDs & each Province as a whole
- All 3 Territories (Nunavut, Northwest Territories, Yukon), all CSDs & each Territory as a whole
Data Quality and Suppression:
- The global non-response rate (GNR) is an important measure of census data quality. It combines total non-response (households) and partial non-response (questions). A lower GNR indicates a lower risk of non-response bias and, as a result, a lower risk of inaccuracy. The counts and estimates for geographic areas with a GNR equal to or greater than 50% are not published in the standard products. The counts and estimates for these areas have a high risk of non-response bias, and in most cases, should not be released.
- Area suppression is used to replace all income characteristic data with an 'x' for geographic areas with populations and/or number of households below a specific threshold. If a tabulation contains quantitative income data (e.g., total income, wages), qualitative data based on income concepts (e.g., low income before tax status) or derived data based on quantitative income variables (e.g., indexes) for individuals, families or households, then the following rule applies: income characteristic data are replaced with an 'x' for areas where the population is less than 250 or where the number of private households is less than 40. Source: Statistics Canada
- When showing count data, Statistics Canada employs random rounding in order to reduce the possibility of identifying individuals within the tabulations. Random rounding transforms all raw counts to random rounded counts. Reducing the possibility of identifying individuals within the tabulations becomes pertinent for very small (sub)populations. All counts greater than 10 are rounded to a base of 5, meaning they will end in either 0 or 5. The random rounding algorithm controls the results and rounds the unit value of the count according to a predetermined frequency. Counts ending in 0 or 5 are not changed. Counts of 10 or less are rounded to a base of 10, meaning they will be rounded to either 10 or zero.
Universe:
Full Universe: Private Households in Non-farm Non-band Off-reserve Occupied Private Dwellings with Income Greater than zero.
Households examined for Core Housing Need: Private, non-farm, non-reserve, owner- or renter-households with incomes greater than zero and shelter-cost-to-income ratios less than 100% are assessed for 'Core Housing Need.' Non-family Households with at least one household maintainer aged 15 to 29 attending school are considered not to be in Core Housing...
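The area suppression and random rounding rules described for this dataset can be illustrated with a short Python sketch. It is not Statistics Canada's implementation. The thresholds (population under 250, fewer than 40 private households) and the rounding bases (5 above a count of 10, 10 otherwise) come from the description; the rounding probabilities are an assumption, since the text only says rounding follows "a predetermined frequency". The function names are hypothetical.

```python
import random


def suppress_income(value, population, private_households):
    """Return 'x' when the area is below either threshold, else the value."""
    if population < 250 or private_households < 40:
        return "x"
    return value


def random_round(count, rng=random):
    """Randomly round a count to base 5 (count > 10) or base 10 (count <= 10).

    Assumption: the chance of rounding up equals the remainder divided by
    the base, which keeps the rounding unbiased on average.
    """
    base = 5 if count > 10 else 10
    remainder = count % base
    if remainder == 0:
        return count  # already a multiple of the base, left unchanged
    lower = count - remainder
    if rng.random() < remainder / base:
        return lower + base
    return lower
```

Because of the rounding, published counts in a table need not sum exactly to their published totals, which is worth keeping in mind when aggregating cells.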
VITAL SIGNS INDICATOR
Income (EC4)
FULL MEASURE NAME
Household income by place of residence
LAST UPDATED
January 2023
DESCRIPTION
Income reflects the median earnings of individuals and households from employment, as well as the income distribution by quintile. Income data highlight how employees are being compensated for their work on an inflation-adjusted basis.
DATA SOURCE
U.S. Census Bureau: Decennial Census - https://nhgis.org
Count 4Pb (1970)
Form STF3 (1980-1990)
Form SF3a (2000)
U.S. Census Bureau: American Community Survey - https://data.census.gov/
Form B19001 (2005-2021; household income by place of residence)
Form B19013 (2005-2021; median household income by place of residence)
Form B08521 (2005-2021; median worker earnings by place of employment)
Bureau of Labor Statistics: Consumer Price Index - https://www.bls.gov/data/
1970-2021
CONTACT INFORMATION
vitalsigns.info@bayareametro.gov
METHODOLOGY NOTES (across all datasets for this indicator)
Income derived from the decennial Census data reflects the income earned in the prior calendar year, whereas income derived from the American Community Survey (ACS) data reflects the prior 12 month period; note that this inconsistency has a minor effect on historical comparisons (see Income and Earnings Data section of the ACS General Handbook - https://www.census.gov/content/dam/Census/library/publications/2020/acs/acs_general_handbook_2020_ch09.pdf). ACS 1-year data is used for larger geographies – Bay counties and most metropolitan area counties – while smaller geographies rely upon 5-year rolling average data due to their smaller sample sizes. Note that 2020 data uses the 5-year estimates because the ACS did not collect 1-year data for 2020.
Quintile income for 1970-2000 is imputed from decennial Census data using methodology from the California Department of Finance. Bay Area income is the population weighted average of county-level income.
Income has been inflated using the Consumer Price Index (CPI) for 2021 specific to each metro area; however, some metro areas lack metro-specific CPI data back to 1970 and therefore adjusted data uses national CPI for 1970. Note that current MSA boundaries were used for historical comparison by identifying counties included in today’s metro areas.
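As an illustration of the two adjustments described in the methodology notes (CPI inflation to 2021 dollars and the population-weighted Bay Area average), here is a minimal Python sketch. The function names and all numeric inputs are hypothetical placeholders, not actual CPI or income figures, and the sketch assumes the adjustment is a simple ratio of index values.

```python
def to_2021_dollars(nominal_income, cpi_in_year, cpi_2021):
    """Scale a nominal income by the ratio of the 2021 CPI to that year's CPI."""
    return nominal_income * cpi_2021 / cpi_in_year


def population_weighted_average(incomes, populations):
    """Average county-level incomes, weighting each county by its population."""
    total_population = sum(populations)
    weighted_sum = sum(i * p for i, p in zip(incomes, populations))
    return weighted_sum / total_population


# Hypothetical inputs, for illustration only.
adjusted = to_2021_dollars(10_000, cpi_in_year=40.0, cpi_2021=280.0)
regional = population_weighted_average([100_000, 80_000], [3, 1])
```

For a metro area without metro-specific CPI back to 1970, the notes indicate the national CPI is substituted for the 1970 value, which in this sketch would simply mean passing the national index as `cpi_in_year`.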
Census 2020 blocks in King County with selected P.L. 94-171 redistricting data.
Important note: The Census Bureau advises analysts to aggregate blocks together to form larger geographic units before using the 2020 Census data.
Background: The Bureau used a new tool, called Differential Privacy, to inject statistical noise into the 2020 Census data in order to protect privacy. The resulting noise can cause substantial inaccuracy at the block level; combining data for blocks and other small geographies reduces the inaccuracy. For more information see Redistricting Data: What to Expect and When (census.gov), 2020 Census Data Products: Disclosure Avoidance Modernization.
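The advice to aggregate blocks before analysis can be followed by grouping on a prefix of the block identifier. The sketch below assumes each block carries the standard 15-character GEOID (2-digit state, 3-digit county, 6-digit tract, 4-digit block), so the first 11 characters identify the tract and the first 12 the block group. The function name and the counts are hypothetical.

```python
from collections import defaultdict


def aggregate_blocks(block_counts, prefix_length=11):
    """Sum block-level counts into larger units keyed by a GEOID prefix.

    prefix_length=11 aggregates to census tracts, 12 to block groups.
    """
    totals = defaultdict(int)
    for geoid, count in block_counts.items():
        totals[geoid[:prefix_length]] += count
    return dict(totals)


# Hypothetical block populations (53 = Washington, 033 = King County).
blocks = {
    "530330001001000": 12,
    "530330001001001": 0,
    "530330001002000": 30,
    "530330002001000": 7,
}
tract_totals = aggregate_blocks(blocks)
block_group_totals = aggregate_blocks(blocks, prefix_length=12)
```

Summing over more blocks tends to let the injected noise partly cancel, which is why the larger units are more reliable than individual blocks.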
The 2022 Ghana Demographic and Health Survey (2022 GDHS) is the seventh in the series of DHS surveys conducted by the Ghana Statistical Service (GSS) in collaboration with the Ministry of Health/Ghana Health Service (MoH/GHS) and other stakeholders, with funding from the United States Agency for International Development (USAID) and other partners.
The primary objective of the 2022 GDHS is to provide up-to-date estimates of basic demographic and health indicators. Specifically, the GDHS collected information on: - Fertility levels and preferences, contraceptive use, antenatal and delivery care, maternal and child health, childhood mortality, childhood immunisation, breastfeeding and young child feeding practices, women’s dietary diversity, violence against women, gender, nutritional status of adults and children, awareness regarding HIV/AIDS and other sexually transmitted infections, tobacco use, and other indicators relevant for the Sustainable Development Goals - Haemoglobin levels of women and children - Prevalence of malaria parasitaemia (rapid diagnostic testing and thick slides for malaria parasitaemia in the field and microscopy in the lab) among children age 6–59 months - Use of treated mosquito nets - Use of antimalarial drugs for treatment of fever among children under age 5
The information collected through the 2022 GDHS is intended to assist policymakers and programme managers in designing and evaluating programmes and strategies for improving the health of the country’s population.
National coverage
The survey covered all de jure household members (usual residents), all women aged 15-49, men aged 15-59, and all children aged 0-4 resident in the household.
Sample survey data [ssd]
To achieve the objectives of the 2022 GDHS, a stratified representative sample of 18,540 households was selected in 618 clusters, which resulted in 15,014 interviewed women age 15–49 and 7,044 interviewed men age 15–59 (in one of every two households selected).
The sampling frame used for the 2022 GDHS is the updated frame prepared by the GSS based on the 2021 Population and Housing Census. The sampling procedure used in the 2022 GDHS was stratified two-stage cluster sampling, designed to yield representative results at the national level, for urban and rural areas, and for each of the country’s 16 regions for most DHS indicators. In the first stage, 618 target clusters were selected from the sampling frame using a probability proportional to size strategy for urban and rural areas in each region. Then the targeted number of clusters was selected, with equal probability systematic random sampling, from the clusters selected in the first phase for urban and rural areas. In the second stage, after selection of the clusters, a household listing and map updating operation was carried out in all of the selected clusters to develop a list of households for each cluster. This list served as a sampling frame for selection of the household sample. The GSS organized a 5-day training course on listing procedures for listers and mappers with support from ICF. The listers and mappers were organized into 25 teams consisting of one lister and one mapper per team. The teams spent 2 months completing the listing operation. In addition to listing the households, the listers collected the geographical coordinates of each household using GPS dongles provided by ICF and in accordance with the instructions in the DHS listing manual. The household listing was carried out using tablet computers, with software provided by The DHS Program. A fixed number of 30 households in each cluster were randomly selected from the list for interviews.
For further details on sample design, see APPENDIX A of the final report.
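The two-stage logic described above can be sketched in Python. This is a simplified illustration, not the GSS/ICF procedure: it treats one stratum only, uses systematic selection as one common way to implement probability proportional to size, skips the intermediate equal-probability step, and draws the 30 households by simple random sampling (the text says only "randomly selected"). The function names and cluster sizes are hypothetical.

```python
import random
from bisect import bisect_right
from itertools import accumulate


def pps_systematic(sizes, n, rng):
    """Select n cluster indices with probability proportional to size."""
    cumulative = list(accumulate(sizes))
    interval = cumulative[-1] / n
    start = rng.random() * interval
    picks = []
    for k in range(n):
        point = start + k * interval
        # The cluster whose cumulative size range contains the point is picked.
        index = bisect_right(cumulative, point)
        picks.append(min(index, len(sizes) - 1))  # guard against float edge
    return picks


def select_households(listing, rng, per_cluster=30):
    """Draw a fixed number of households from a cluster's listing."""
    return rng.sample(listing, min(per_cluster, len(listing)))


rng = random.Random(2022)
cluster_sizes = [120, 80, 300, 50, 450, 200, 100, 150]  # hypothetical
chosen_clusters = pps_systematic(cluster_sizes, 3, rng)
households = select_households(list(range(120)), rng)
```

Note that a cluster larger than the sampling interval can be hit more than once by this method; production designs handle such cases explicitly.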
Face-to-face computer-assisted interviews [capi]
Four questionnaires were used in the 2022 GDHS: the Household Questionnaire, the Woman’s Questionnaire, the Man’s Questionnaire, and the Biomarker Questionnaire. The questionnaires, based on The DHS Program’s model questionnaires, were adapted to reflect the population and health issues relevant to Ghana. In addition, a self-administered Fieldworker Questionnaire collected information about the survey’s fieldworkers.
The GSS organized a questionnaire design workshop with support from ICF and obtained input from government and development partners expected to use the resulting data. The DHS Program optional modules on domestic violence, malaria, and social and behavior change communication were incorporated into the Woman’s Questionnaire. ICF provided technical assistance in adapting the modules to the questionnaires.
DHS staff installed all central office programmes, data structure checks, secondary editing, and field check tables from 17–20 October 2022. Central office training was implemented using the practice data to test the central office system and field check tables. Seven GSS staff members (four male and three female) were trained on the functionality of the central office menu, including accepting clusters from the field, data editing procedures, and producing reports to monitor fieldwork.
From 27 February to 17 March, DHS staff visited the Ghana Statistical Service office in Accra to work with the GSS central office staff on finishing the secondary editing and to clean and finalize all data received from the 618 clusters.
A total of 18,540 households were selected for the GDHS sample, of which 18,065 were found to be occupied. Of the occupied households, 17,933 were successfully interviewed, yielding a response rate of 99%. In the interviewed households, 15,317 women age 15–49 were identified as eligible for individual interviews. Interviews were completed with 15,014 women, yielding a response rate of 98%. In the subsample of households selected for the male survey, 7,263 men age 15–59 were identified as eligible for individual interviews and 7,044 were successfully interviewed.
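The response rates quoted above follow directly from the reported counts, as this small Python check shows. The helper name is hypothetical; the men's rate is simply computed from the counts given and is not quoted in the text.

```python
def response_rate(completed, eligible):
    """Completed interviews as a percentage of eligible units."""
    return 100 * completed / eligible


household_rate = response_rate(17_933, 18_065)  # occupied households
women_rate = response_rate(15_014, 15_317)      # eligible women age 15-49
men_rate = response_rate(7_044, 7_263)          # eligible men age 15-59
```

Rounded to whole percentages, the first two values match the 99% and 98% stated in the text.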
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2022 Ghana Demographic and Health Survey (2022 GDHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2022 GDHS is only one of many samples that could have been selected from the same population, using the same design and identical size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results. A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95% of all possible samples of identical size and design.
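The "plus or minus two standard errors" rule can be written as a one-line calculation. The sketch below uses hypothetical values for the estimate and its standard error; in practice both would be taken from the sampling error tables of the survey report.

```python
def confidence_interval(estimate, standard_error, multiplier=2.0):
    """Approximate 95% interval: estimate +/- multiplier * standard error."""
    margin = multiplier * standard_error
    return estimate - margin, estimate + margin


# Hypothetical proportion of 0.250 with a standard error of 0.010.
lower, upper = confidence_interval(0.250, 0.010)
```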
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2022 GDHS sample was the result of a multistage stratified design, and, consequently, it was necessary to use more complex formulas. The computer software used to calculate sampling errors for the 2022 GDHS was a SAS program. This program used the Taylor linearization method to estimate variances for survey estimates that are means, proportions, or ratios. The Jackknife repeated replication method was used for variance estimation of more complex statistics such as fertility and mortality rates.
A more detailed description of estimates of sampling errors is presented in APPENDIX B of the survey report.
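To give a feel for the jackknife repeated replication idea, the following Python sketch applies a delete-one-cluster jackknife to a simple ratio. It is an unstratified simplification for illustration, not the SAS program used for the survey, and it uses a plain ratio rather than a fertility or mortality rate. The pseudo-value form assumed here is r_i = k*r - (k-1)*r_(i), with variance sum((r_i - r)^2) / (k(k-1)), where r_(i) is the estimate with cluster i removed. The function name and inputs are hypothetical.

```python
from math import sqrt


def jackknife_se(cluster_numerators, cluster_denominators):
    """Delete-one-cluster jackknife standard error for the ratio sum(y)/sum(x)."""
    k = len(cluster_numerators)
    total_y = sum(cluster_numerators)
    total_x = sum(cluster_denominators)
    r = total_y / total_x
    pseudo_values = []
    for y_i, x_i in zip(cluster_numerators, cluster_denominators):
        # Estimate recomputed with cluster i left out.
        r_without_i = (total_y - y_i) / (total_x - x_i)
        pseudo_values.append(k * r - (k - 1) * r_without_i)
    variance = sum((r_i - r) ** 2 for r_i in pseudo_values) / (k * (k - 1))
    return r, sqrt(variance)


# Hypothetical cluster totals: events (y) and persons exposed (x).
ratio, se = jackknife_se([2, 4, 6, 8], [1, 1, 1, 1])
```

With equal denominators the pseudo-values reduce to the cluster values themselves, so the result equals the usual standard error of a mean, which is a convenient sanity check.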
Data Quality Tables
The Ethiopia Socioeconomic Panel Survey (ESPS) is a collaborative project between the Ethiopian Statistical Service (ESS) and the World Bank Living Standards Measurement Study-Integrated Surveys on Agriculture (LSMS-ISA) team. The objective of the LSMS-ISA is to collect multi-topic, household-level panel data with a special focus on improving agriculture statistics and generating a clearer understanding of the link between agriculture and other sectors of the economy. The project also aims to build capacity, share knowledge across countries, and improve survey methodologies and technology. ESPS is a long-term project to collect panel data. The project responds to the data needs of the country, given the dependence of a high percentage of households on agriculture activities in the country. The ESPS collects information on household agricultural activities along with other information on the households like human capital, other economic activities, and access to services and resources. The ability to follow the same households over time makes the ESPS a new and powerful tool for studying and understanding the role of agriculture in household welfare over time as it allows analyses of how households add to their human and physical capital, how education affects earnings, and the role of government policies and programs on poverty, inter alia. The ESPS is the first panel survey to be carried out by the Ethiopian Statistical Service that links a multi-topic household questionnaire with detailed data on agriculture.
National, Regional, Urban and Rural
The survey covered all de jure households excluding prisons, hospitals, military barracks, and school dormitories.
Sample survey data [ssd]
The sampling frame for the second phase ESPS panel survey is based on the updated 2018 pre-census cartographic database of enumeration areas by the Ethiopian Statistical Service (ESS). The sample is a two-stage stratified probability sample. The ESPS EAs in rural areas are a subsample of the AgSS EA sample. That means the first stage of sampling in the rural areas entailed selecting enumeration areas (i.e., the primary sampling units) using simple random sampling (SRS) from the sample of the 2018 AgSS enumeration areas (EAs). The first stage of sampling for urban areas is selecting EAs directly from the urban frame of EAs within each region using systematic PPS. This is designed to automatically result in a proportional allocation of the urban sample by zone within each region. Following the selection of sample EAs, they are allocated by urban and rural strata using power allocation, which happens to be close to proportional allocation.
The second stage of sampling is the selection of households to be surveyed in each sampled EA using systematic random sampling. From the rural EAs, 10 agricultural households are selected as a subsample of the households selected for the AgSS, and 2 non-agricultural households are selected from the non-agriculture households list in that specific EA. The non-agriculture household selection follows the same sampling method, i.e., systematic random sampling. One important issue to note in ESPS sampling is that the total number of agriculture households per EA remains at 10 even when fewer than 2, or no, non-agriculture households are listed and sampled in that EA. For urban areas, a total of 15 households are selected per EA regardless of the households’ economic activity. The households are selected using systematic random sampling from the total households listed in that specific EA.
The ESPS-5 kept all the ESPS-4 samples except for those in the Tigray region and a few other places. A more detailed description of the sample design is provided in Section 3 of the Basic Information Document provided under the Related Materials tab.
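The second-stage household selection can be sketched as equal-probability systematic random sampling. This Python example is a simplified illustration under stated assumptions: it draws the 10 agricultural households from a plain list rather than from the AgSS household sample, and the function names and household identifiers are hypothetical.

```python
import random


def systematic_sample(units, n, rng):
    """Pick n units at a fixed interval after a random start."""
    if n >= len(units):
        return list(units)  # fewer units listed than requested: take all
    interval = len(units) / n
    start = rng.random() * interval
    indices = [min(int(start + k * interval), len(units) - 1) for k in range(n)]
    return [units[i] for i in indices]


def sample_rural_ea(ag_households, non_ag_households, rng):
    """10 agricultural plus up to 2 non-agricultural households per rural EA."""
    return (
        systematic_sample(ag_households, 10, rng),
        systematic_sample(non_ag_households, 2, rng),
    )


def sample_urban_ea(households, rng):
    """15 households per urban EA, regardless of economic activity."""
    return systematic_sample(households, 15, rng)


rng = random.Random(5)
ag_list = [f"ag{i}" for i in range(40)]   # hypothetical listing
non_ag_list = ["nonag0"]                  # only one listed in this EA
ag_sample, non_ag_sample = sample_rural_ea(ag_list, non_ag_list, rng)
urban_sample = sample_urban_ea([f"hh{i}" for i in range(60)], rng)
```

The rural example reflects the rule in the text: the agricultural sample stays at 10 even when fewer than 2 non-agricultural households are available.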
Computer Assisted Personal Interview [capi]
The ESPS-5 survey consisted of four questionnaires (household, community, post-planting, and post-harvest questionnaires), similar to those used in previous waves but revised based on the results of those waves and on the need for new data they revealed. The following new topics are included in ESPS-5:
a. Dietary Quality: This module collected information on the household’s consumption of specified food items.
b. Food Insecurity Experience Scale (FIES): In this round the survey has implemented FIES. The scale is based on the eight food insecurity experience questions of the Food Insecurity Experience Scale (Voices of the Hungry, Food and Agriculture Organization of the United Nations, fao.org).
c. Basic Agriculture Information: This module is designed to collect minimal agriculture information from households. It is primarily for urban households. However, it was also used for a few rural households where the full agriculture module could not be implemented for security reasons. It asked whether they had undertaken any agricultural activity (such as crop farming and tending livestock) in the last 12 months. For crop farming, the questions were on land tenure, crop type, input use, and production. For livestock there were also questions on their size and type, livestock products, and income from sales of livestock or livestock products.
d. Climate Risk Perception: This module was intended to elicit both rural and urban households' perceptions, beliefs, and attitudes about different climate-related risks. It also asked where and how households were obtaining information on climate and weather-related events.
e. Agriculture Mechanization and Video-Based Agricultural Extension: The community questionnaire covered these topics in rural areas. On mechanization the questions related to the penetration, availability and accessibility of agricultural machinery. Communities were also asked if they had received video-based extension services.
Final data cleaning was carried out on all data files. Only errors that could be clearly and confidently fixed by the team were corrected; errors that had no clear fix were left in the datasets. Cleaning methods for these errors are left up to the data user.
ESPS-5 planned to interview 7,527 households from 565 enumeration areas (EAs) (Rural 316 EAs and Urban 249 EAs). However, due to the security situation in northern Ethiopia and to a lesser extent in the western part of the country, only a total of 4,999 households from 438 EAs were interviewed for both the agriculture and household modules. The security situation in northern parts of Ethiopia meant that, in Tigray, ESPS-5 did not cover any of the EAs and households previously sampled. In Afar, while 275 households in 44 EAs had been covered by both the ESPS-4 agriculture and household modules, in ESPS-5 only 252 households in 22 EAs were covered by both modules. During the fifth wave, security was also a problem in both the Amhara and Oromia regions, so there was a comparable reduction in the number of households and EAs covered there.
More detailed information is available in the BID.
The World Bank Enterprise Survey (WBES) is a firm-level survey of a representative sample of an economy's private sector. The surveys cover a broad range of topics related to the business environment including access to finance, corruption, infrastructure, competition, and performance.
National coverage
The primary sampling unit of the study is the establishment. An establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must make its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.
The universe of inference includes all formal (i.e., registered) private sector businesses with at least 1% private ownership and at least five employees. In terms of sectoral criteria, all manufacturing businesses (ISIC Rev. 4 codes 10-33) are eligible; for services businesses, those corresponding to ISIC Rev. 4 codes 41-43, 45-47, 49-53, 55-56, 58, 61-62, 69-75, 79, and 95 are included in the Enterprise Surveys. Cooperatives and collectives are excluded from the Enterprise Surveys. All eligible establishments must be registered with the registration agency. In the case of Viet Nam, the listing from the General Statistics Office of Vietnam, the 2021 Economic Census, was used. The registration agency is the Department of Planning and Investment.
Sample survey data [ssd]
The WBES use stratified random sampling, where the population of establishments is first separated into non-overlapping groups, called strata, and then respondents are selected through simple random sampling from each stratum. The detailed methodology is provided in the Sampling Note (https://www.enterprisesurveys.org/content/dam/enterprisesurveys/documents/methodology/Sampling_Note-Consolidated-2-16-22.pdf). Stratified random sampling has several advantages over simple random sampling. In particular, it:
The WBES typically use three levels of stratification: industry classification, establishment size, and subnational region (used in combination). Starting in 2022, the WBES bases the industry classification on ISIC Rev. 4 (with earlier surveys using ISIC Rev. 3.1). For regional coverage within a country, the WBES has national coverage.
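As an illustration of stratified random sampling with the three stratification variables named above, here is a minimal sketch. The frame, the stratum labels, the equal allocation of three units per stratum, and the function name are all assumptions made for the example; the actual WBES allocation and weighting are described in the Sampling Note, not here.

```python
import random
from collections import defaultdict

def stratified_sample(frame, strata_keys, n_per_stratum, seed=0):
    """Simple random sample without replacement inside each stratum.

    Equal allocation per stratum is an assumption of this sketch.
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for unit in frame:
        # A stratum is the combination of all stratification variables.
        strata[tuple(unit[k] for k in strata_keys)].append(unit)

    sample = []
    for key in sorted(strata):
        units = strata[key]
        n = min(n_per_stratum, len(units))
        for unit in rng.sample(units, n):
            # Base design weight: stratum population / stratum sample size.
            sample.append({**unit, "weight": len(units) / n})
    return sample

# Hypothetical frame: 2 industries x 3 size classes x 2 regions, 10 units each.
frame = []
for industry in ("manufacturing", "services"):
    for size in ("small", "medium", "large"):
        for region in ("north", "south"):
            for _ in range(10):
                frame.append({"id": len(frame), "industry": industry,
                              "size": size, "region": region})

sample = stratified_sample(frame, ("industry", "size", "region"), n_per_stratum=3)
print(len(sample), sum(u["weight"] for u in sample))
```

Because each sampled unit carries the inverse of its selection probability as a weight, the weights sum back to the size of the frame.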
Note: Refer to Sampling Structure section in "The Viet Nam 2023 World Bank Enterprise Survey Implementation Report" for detailed methodology on sampling.
Face-to-face [f2f]
The standard WBES questionnaire covers several topics regarding the business environment and business performance. These topics include general firm characteristics, infrastructure, sales and supplies, management practices, competition, innovation, capacity, land and permits, finance, business-government relations, exposure to bribery, labor, and performance. Information about the general structure of the questionnaire is available in the Enterprise Surveys Manual and Guide (https://www.enterprisesurveys.org/content/dam/enterprisesurveys/documents/methodology/Enterprise-Surveys-Manual-and-Guide.pdf).
The questionnaire implemented in the Viet Nam 2023 WBES included additional questions tailored for the Business Ready Report covering infrastructure, trade, government regulations, finance, labor, and other topics.
Overall survey response rate was 31.7%.
The Annual Agricultural Sample Survey (AASS) for the year 2022/23 aimed to enhance the understanding of agricultural activities across the United Republic of Tanzania by collecting comprehensive data on various aspects of the agricultural sector. This survey is crucial for policy formulation, development planning, and service delivery, providing reliable data to monitor and evaluate national and international development frameworks.
The 2022/23 survey is particularly significant as it informs the monitoring and evaluation of key agricultural development strategies and frameworks. The collected data will contribute to the Tanzania Development Vision 2025, Zanzibar Development Vision 2020, the Five-Year Development Plan 2021/22–2025/26, the National Strategy for Growth and Reduction of Poverty (NSGRP) known as MKUKUTA, and the Zanzibar Strategy for Growth and Reduction of Poverty (ZSGRP) known as MKUZA. The survey data also supports the evaluation of Sustainable Development Goals (SDGs) and Comprehensive Africa Agriculture Development Programme (CAADP). Key indicators for agricultural performance and poverty monitoring are directly measured from the survey data.
The 2022/23 AASS provides a detailed descriptive analysis and related tables on the main thematic areas. These areas include household members and holder identification, field roster, seasonal plot and crop rosters (Vuli, Masika, and Dry Season), permanent crop production, crop harvest use, seed and seedling acquisition, input use and acquisition (fertilizers and pesticides), livestock inventory and changes, livestock production costs, milk and eggs production, other livestock products, aquaculture production, and labor dynamics. The 2022/23 AASS offers an extensive dataset essential for understanding the current state of agriculture in Tanzania. The insights gained will support the development of policies and interventions aimed at enhancing agricultural productivity, sustainability, and the livelihoods of farming communities. This data is indispensable for stakeholders addressing challenges in the agricultural sector and promoting sustainable agricultural development.
Statistical Disclosure Control (SDC) methods have been applied to the microdata, to protect the confidentiality of the individual data collected. Users must be aware that these anonymization or SDC methods modify the data, including suppression of some data points. This affects the aggregated values derived from the anonymized microdata, and may have other unwanted consequences, such as sampling error and bias. Additional details about the SDC methods and data access conditions are provided in the data processing and data access conditions below.
National, Mainland Tanzania and Zanzibar, Regions
Households for Smallholder Farmers and Farm for Large Scale Farms
The survey covered agricultural households and large-scale farms.
Agricultural households are those that meet at least one of the following two conditions: a) have or operate at least 25 square meters of arable land; b) own or keep at least one head of cattle, or five goats/sheep/pigs, or fifty chickens/ducks/turkeys during the agricultural year.
Large-scale farms are those farms with at least 20 hectares of cultivated land, or 50 head of cattle, or 100 goats/sheep/pigs, or 1,000 chickens. In addition, they must fulfill all of the following four conditions: i) the greater part of the produce should go to the market, ii) operation of the farm should be continuous, iii) machinery or implements should be used on the farm, and iv) there should be at least one permanent employee.
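The two eligibility definitions above can be read as a pair of rules: any one size threshold qualifies an agricultural household, while a large-scale farm needs one size threshold plus all four operating conditions. The sketch below encodes that reading. The `Holding` record and its field names are hypothetical, and pooling goats, sheep and pigs (and chickens, ducks and turkeys) into single counts is an assumption, since the text does not say whether the thresholds apply per species or combined.

```python
from dataclasses import dataclass

@dataclass
class Holding:
    # Hypothetical record; field names are not taken from the AASS questionnaire.
    arable_land_m2: float = 0.0
    cultivated_land_ha: float = 0.0
    cattle: int = 0
    goats_sheep_pigs: int = 0      # assumed to be counted together
    poultry: int = 0               # assumed to pool chickens, ducks, turkeys
    mostly_marketed: bool = False
    continuous_operation: bool = False
    uses_machinery: bool = False
    permanent_employees: int = 0

def is_agricultural_household(h: Holding) -> bool:
    """At least one of the land or livestock thresholds is met."""
    return (h.arable_land_m2 >= 25 or h.cattle >= 1
            or h.goats_sheep_pigs >= 5 or h.poultry >= 50)

def is_large_scale_farm(h: Holding) -> bool:
    """One size threshold AND all four operating conditions."""
    size_ok = (h.cultivated_land_ha >= 20 or h.cattle >= 50
               or h.goats_sheep_pigs >= 100 or h.poultry >= 1000)
    conditions_ok = (h.mostly_marketed and h.continuous_operation
                     and h.uses_machinery and h.permanent_employees >= 1)
    return size_ok and conditions_ok

smallholder = Holding(arable_land_m2=400, poultry=10)
estate = Holding(cultivated_land_ha=35, mostly_marketed=True,
                 continuous_operation=True, uses_machinery=True,
                 permanent_employees=3)
big_but_informal = Holding(cultivated_land_ha=35)

print(is_agricultural_household(smallholder), is_large_scale_farm(smallholder))
print(is_large_scale_farm(estate), is_large_scale_farm(big_but_informal))
```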
Sample survey data [ssd]
The frame used to extract the sample for the Annual Agricultural Sample Survey (AASS-2022/23) in Tanzania was derived from the 2022 Population and Housing Census (PHC-2022) frame, which lists all the Enumeration Areas (EAs/Hamlets) of the country. The AASS 2022/23 used a stratified two-stage sampling design, which allows reliable estimates to be produced at regional level for both Mainland Tanzania and Zanzibar.
In the first stage, the EAs (primary sampling units) were stratified into 2-3 strata within each region and then selected by using a systematic sampling procedure with probability proportional to size (PPS), where the measure of size is the number of agricultural households in the EA. Before the selection, within each stratum and domain (region), the Enumeration Areas (EAs) were ordered according to the codes of District and Council which reflect the geographical proximity, and then ordered according to the codes of Constituency, Division, Wards, and Village. An implicit stratification was also performed, ordering by Urban/Rural type at Ward level.
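The first-stage procedure described above, systematic selection with probability proportional to size from an ordered list, can be sketched as follows. This is a generic textbook version of systematic PPS, not the survey's actual code; the EA identifiers and household counts are invented, and the list is assumed to be already sorted in the geographic order the text describes.

```python
import random

def systematic_pps(units, n, seed=0):
    """Systematic PPS selection from an already ordered list.

    units: list of (unit_id, size) pairs, where size is the measure of size
           (here, the number of agricultural households in the EA).
    Returns the selected unit ids in frame order. A unit whose size exceeds
    the sampling interval can be hit more than once.
    """
    rng = random.Random(seed)
    total = sum(size for _, size in units)
    interval = total / n
    start = rng.random() * interval          # random start in [0, interval)
    targets = iter(start + k * interval for k in range(n))

    selected = []
    target = next(targets, None)
    cumulative = 0.0
    for unit_id, size in units:
        cumulative += size
        # Select the unit whose cumulative size range contains the target.
        while target is not None and target < cumulative:
            selected.append(unit_id)
            target = next(targets, None)
    return selected

# Hypothetical stratum of 20 EAs, already in geographic order.
eas = [(f"EA-{i:02d}", 30 + (i * 7) % 40) for i in range(20)]
selected = systematic_pps(eas, n=4)
print(selected)
```

Sorting the frame before selection is what produces the implicit stratification mentioned in the text: the fixed interval spreads the selected EAs across the ordered list.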
In the second stage, households were selected by simple random sampling. In hamlets with more than 200 households, twelve (12) agricultural households were drawn from the PHC 2022 list with a simple random sampling without replacement procedure in each sampled hamlet. In hamlets with 200 households or less, a listing exercise was carried out in each sampled hamlet, and twelve (12) agricultural households were selected with a simple random sampling without replacement procedure. A total of 1,352 PSUs were selected from the 2022 Population and Housing Census frame, of which 1,234 PSUs were from Mainland Tanzania and 118 from Zanzibar. A total of 16,224 agricultural households were sampled (14,808 households from Mainland Tanzania and 1,416 from Zanzibar).
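The sample sizes quoted above follow directly from twelve households per selected PSU. The sketch below checks that arithmetic and shows a second-stage draw for one hamlet; the hamlet's household list is invented for the example and the function name is illustrative.

```python
import random

HOUSEHOLDS_PER_PSU = 12  # from the text: twelve agricultural households per hamlet

def select_households(agri_households, rng, n=HOUSEHOLDS_PER_PSU):
    """Simple random sample without replacement within one sampled hamlet."""
    return rng.sample(agri_households, min(n, len(agri_households)))

# PSU counts quoted in the text.
psus = {"Mainland Tanzania": 1234, "Zanzibar": 118}
households = {area: count * HOUSEHOLDS_PER_PSU for area, count in psus.items()}
total_psus = sum(psus.values())
total_households = sum(households.values())

print(total_psus, households, total_households)

# Second-stage draw for one hypothetical hamlet with 85 listed households.
rng = random.Random(2022)
hamlet_list = [f"HH-{i:03d}" for i in range(1, 86)]
chosen = select_households(hamlet_list, rng)
print(len(chosen))
```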
Computer Assisted Personal Interview [capi]
The 2022/23 Annual Agricultural Sample Survey used two main questionnaires consolidated into a single questionnaire within the CAPI system: the Smallholder Farmers Questionnaire and the Large-Scale Farms Questionnaire. The Smallholder Farmers Questionnaire captured information at household level, while the Large-Scale Farms Questionnaire captured information at establishment/holding level. These questionnaires were used for data collection that covered core agricultural activities (crops, livestock, and fish farming) in both the short and long rainy seasons. The 2022/23 AASS questionnaire covered 23 sections, which are:
COVER: The cover page included the title of the survey, the survey year (2022/23), and general instructions for both the interviewers and respondents. It set the context for the survey and indicated that the survey covers the United Republic of Tanzania.
SCREENING: Included preliminary questions designed to determine if the respondent or household is eligible to participate in the survey. It checks for core criteria such as involvement in agricultural activities.
START INTERVIEW: The introductory section where basic details about the interview are recorded, such as the date, location, and interviewer’s information. This helped in the identification and tracking of the interview process.
HOUSEHOLD MEMBERS AND HOLDER IDENTIFICATION: Collected information about all household members, including age, gender, relationship to the household head, and the identification of the main agricultural holder. This section helped in understanding the demographic composition of the agriculture household.
FIELD ROSTER: Provided the details of the various agricultural fields operated by the agriculture household. Information includes the size, location, and identification of each field. This section provided a comprehensive overview of the land resources available to the household.
VULI PLOT ROSTER: Focused on plots used during the Vuli season (short rainy season). It includes details on the crops planted, plot sizes, and any specific characteristics of these plots. This helps in assessing seasonal agricultural activities.
VULI CROP ROSTER: Provided detailed information on the types of crops grown during the Vuli season, including quantities produced and intended use (e.g., consumption, sale, storage). This section captures the output of short rainy season farming.
MASIKA PLOT ROSTER: Similar to Section 4 but focuses on the Masika season (long rainy season). It collects data on plot usage, crop types, and sizes. This helps in understanding the agricultural practices during the primary growing season.
MASIKA CROP ROSTER: Provided detailed information on crops grown during the Masika season, including production quantities and uses. This section captures the output from the main agricultural season.
PERMANENT CROP PRODUCTION: Focuses on perennial or permanent crops (e.g., fruit trees, tea, coffee). It includes data on the types of permanent crops, area under cultivation, production volumes, and uses. This section tracks long-term agricultural investments.
CROP HARVEST USE: Provided details on how harvested crops are utilized within the household. Categories included consumption, sale, storage, and other uses. This section helps in understanding food security and market engagement.
SEED AND SEEDLINGS ACQUISITION: Collected information on how the agriculture household acquires seeds and seedlings, including sources (e.g., purchased, saved, gifted) and types (local, improved, etc.). This section provided insights into input supply chains and the planting decisions made by the household or its head.
INPUT USE AND ACQUISITION (FERTILIZERS AND PESTICIDES): Provided details on the use and acquisition of agricultural inputs such as fertilizers and pesticides. It included information on quantities used, sources, and types of inputs. This section assessed input dependency and agricultural practices.
LIVESTOCK IN STOCK AND CHANGE IN STOCK: The
The population and housing census (PHC) is the unique source of reliable and comprehensive data about the size of population and also on major socio-economic & socio-demographic characteristics of the country. It provides data on geographic and administrative distribution of population and household in addition to the demographic and socio-economic characteristics of all the people in the country. Generally, it provides for comparing and projecting demographic data, social and economic characteristics, as well as household and housing conditions at all levels of the country’s administrative units and dimensions: national, regional, districts and localities. The data from the census is classified, tabulated and disseminated so that researchers, administrators, policy makers and development partners can use the information in formulating and implementing various multi-sectorial development programs at the national and community levels. Data on all key variables namely area, household, population, economic activity, literacy and education, fertility and child survival, housing conditions and sanitation are collected and available in the census data. The 2021 PHC in Ghana had an overarching goal of generating updated demographic, social and economic data, housing characteristics and dwelling conditions to support national development planning activities.
National Coverage , Region , District
All persons who spent census night (midnight of 27th June 2021) in Ghana
Census/enumeration data [cen]
This 10% sample data for the 2021 PHC is representative at the district/subdistrict level and also by the urban rural classification.
Computer Assisted Personal Interview [capi]
GSS developed two categories of instruments for the 2021 PHC: the listing form and the enumeration instruments. The listing form was only one, while the enumeration instruments comprised six questionnaires, designated as PHC 1A, PHC 1B, PHC 1C, PHC 1D, PHC 1E and PHC 1F. The PHC 1A was the most comprehensive with the others being its subsets.
Listing Form: The listing form was developed to collect data on type of structures, level of completion, whether occupied or vacant and use(s) of the structures. It was also used to collect information about the availability, number and types of toilet facilities in the structures. It was also used to capture the number of households in a structure, number of persons in households and the sex of the persons residing in the households if occupied. Finally, the listing form was used to capture data on non-household populations such as the population in institutions, floating population and sex of the non-household populations.
PHC 1A: The PHC 1A questionnaire was used to collect data from all households in the country. Primarily, it was used to capture household members and visitors who spent the Census Night in the dwelling of the household, and their relationship with the head of the household. It was also used to collect data on homeless households. Members of the households who were absent were enumerated at the place where they had spent the Census Night. The questionnaire was also used to collect the following household information: emigration; socio-demographic characteristics (sex, age, place of birth and enumeration, survival status of parents); literacy and education; economic activities; difficulty in performing activities; ownership and usage of information, technology and communication facilities; fertility; mortality; housing characteristics and conditions; and sanitation.
PHC 1B: The PHC 1B questionnaire was used to collect data from persons in stable institutions, comprising boarding houses, hostels and prisons, who were present on Census Night. Other information captured with this instrument included socio-demographic characteristics; literacy and education; economic activities; difficulty in performing activities; ownership and usage of information, technology and communication facilities; fertility; mortality; housing characteristics and conditions; and sanitation.
PHC 1C: The PHC 1C questionnaire was used to collect data from persons in “unstable” institutions such as hospitals and prayer camps who were present at these places on Census Night. The instrument was used to capture only the socio-demographic characteristics of individuals.
PHC 1D: The PHC 1D questionnaire was used to collect data from the floating population. This constitutes persons who were found at airports, seaports, lorry stations and similar locations waiting for or embarking on long-distance travel, as well as outdoor sleepers on Census Night. The instrument captured the socio-demographic information of individuals.
PHC 1E: All persons who spent the Census Night at hotels, motels and guest houses were enumerated using the PHC 1E. The content of the questionnaire was similar to that of the PHC 1D.
PHC 1F: The PHC 1F questionnaire was administered to diplomats in the country.
The Census data editing was implemented at three levels: (1) data editing by enumerators and supervisors during data collection; (2) data editing at the regional level by the regional data quality monitors during data collection; and (3) final data editing at the national level using batch edits in CSPro and STATA. Data editing and cleaning was mainly digital.
100 percent
A Post Enumeration Survey (PES) was conducted to assess the extent of coverage and content error.