Sukuma was the largest ethnic group in Tanzania as of 2021. Around **** percent of the surveyed population identified themselves as from the Bantu ethnic group. Nearly *** percent belonged to the Ha group, while *** percent were from the Gogo group. About *** percent of the respondents identified themselves as Tanzanians only or reported that they don't think of themselves in terms of ethnic communities, cultural groups, or tribes. Overall, around *** ethnic groups are estimated to live in Tanzania.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The population of the world, allocated to 1 arcsecond blocks. This refines CIESIN’s Gridded Population of the World project, using machine learning models on high-resolution worldwide Digital Globe satellite imagery. More information.
There is also a tiled version of this dataset that may be easier to use if you are interested in many countries.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Maasai of northern Tanzania, a semi-nomadic ethnic group predominantly reliant on pastoralism, face a number of challenges anticipated to have negative impacts on child health, including marginalisation, vulnerabilities to drought, substandard service provision and on-going land grabbing conflicts. Yet, stemming from a lack of appropriate national survey data, no large-scale comparative study of Maasai child health has been conducted. Savannas Forever Tanzania surveyed the health of over 3500 children from 56 villages in northern Tanzania between 2009 and 2011. The major ethnic groups sampled were the Maasai, Sukuma, Rangi, and the Meru. Using multilevel regression we compare each ethnic group on the basis of (i) measurements of child health, including anthropometric indicators of nutritional status and self-reported incidence of disease; and (ii) important proximate determinants of child health, including food insecurity, diet, breastfeeding behaviour and vaccination coverage. We then (iii) contrast households among the Maasai by the extent to which subsistence is reliant on livestock herding. Measures of both child nutritional status and disease confirm that the Maasai are substantially disadvantaged compared to neighbouring ethnic groups, Meru are relatively advantaged, and Rangi and Sukuma intermediate in most comparisons. However, Maasai children were less likely to report malaria and worm infections. Food insecurity was high throughout the study site, but particularly severe for the Maasai, and reflected in lower dietary intake of carbohydrate-rich staple foods, and fruits and vegetables. Breastfeeding was extended in the Maasai, despite higher reported consumption of cow's milk, a potential weaning food. Vaccination coverage was lowest in Maasai and Sukuma. Maasai who rely primarily on livestock herding showed signs of further disadvantage compared to Maasai relying primarily on agriculture. We discuss the potential ecological, socioeconomic, demographic and cultural factors responsible for these differences and the implications for population health research and policy.
The Afrobarometer is a comparative series of public attitude surveys that assess African citizen's attitudes to democracy and governance, markets, and civil society, among other topics. The surveys have been undertaken at periodic intervals since 1999. The Afrobarometer's coverage has increased over time. Round 1 (1999-2001) initially covered 7 countries and was later extended to 12 countries. Round 2 (2002-2004) surveyed citizens in 16 countries. Round 3 (2005-2006) 18 countries, Round 4 (2008) 20 countries, Round 5 (2011-2013) 34 countries, Round 6 (2014-2015) 36 countries, and Round 7 (2016-2018) 34 countries. The survey covered 34 countries in Round 8 (2019-2021).
National coverage
Individual
Citizens of Tanzania who are 18 years and older
Sample survey data [ssd]
Afrobarometer uses national probability samples designed to meet the following criteria. Samples are designed to generate a sample that is a representative cross-section of all citizens of voting age in a given country. The goal is to give every adult citizen an equal and known chance of being selected for an interview. They achieve this by:
• using random selection methods at every stage of sampling; • sampling at all stages with probability proportionate to population size wherever possible to ensure that larger (i.e., more populated) geographic units have a proportionally greater probability of being chosen into the sample.
The sampling universe normally includes all citizens age 18 and older. As a standard practice, we exclude people living in institutionalized settings, such as students in dormitories, patients in hospitals, and persons in prisons or nursing homes. Occasionally, we must also exclude people living in areas determined to be inaccessible due to conflict or insecurity. Any such exclusion is noted in the technical information report (TIR) that accompanies each data set.
Sample size and design Samples usually include either 1,200 or 2,400 cases. A randomly selected sample of n=1200 cases allows inferences to national adult populations with a margin of sampling error of no more than +/-2.8% with a confidence level of 95 percent. With a sample size of n=2400, the margin of error decreases to +/-2.0% at 95 percent confidence level.
The sample design is a clustered, stratified, multi-stage, area probability sample. Specifically, we first stratify the sample according to the main sub-national unit of government (state, province, region, etc.) and by urban or rural location.
Area stratification reduces the likelihood that distinctive ethnic or language groups are left out of the sample. Afrobarometer occasionally purposely oversamples certain populations that are politically significant within a country to ensure that the size of the sub-sample is large enough to be analysed. Any oversamples is noted in the TIR.
Sample stages Samples are drawn in either four or five stages:
Stage 1: In rural areas only, the first stage is to draw secondary sampling units (SSUs). SSUs are not used in urban areas, and in some countries they are not used in rural areas. See the TIR that accompanies each data set for specific details on the sample in any given country. Stage 2: We randomly select primary sampling units (PSU). Stage 3: We then randomly select sampling start points. Stage 4: Interviewers then randomly select households. Stage 5: Within the household, the interviewer randomly selects an individual respondent. Each interviewer alternates in each household between interviewing a man and interviewing a woman to ensure gender balance in the sample.
To keep the costs and logistics of fieldwork within manageable limits, eight interviews are clustered within each selected PSU.
Tanzania - Sample size: 2,398 - Sampling Frame: 2012 National Population and Housing Census produced by the Tanzania National Bureau of Statistics - Sample design: Nationally representative, random, clustered, stratified, multi-stage area probability sample - Stratification: Region and rural-urban location - Stages: PSUs (from strata), start points, households, respondents - PSU selection: Probability Proportionate to Population Size (PPPS) - Cluster size: 8 households per PSU - Household selection: Randomly selected start points, followed by walk pattern using 5/10 interval - Respondent selection: Gender quota filled by alternating interviews between men and women; respondents of appropriate gender listed, after which computer randomly selects individual
Face-to-face [f2f]
The Round 8 questionnaire has been developed by the Questionnaire Committee after reviewing the findings and feedback obtained in previous Rounds, and securing input on preferred new topics from a host of donors, analysts, and users of the data.
The questionnaire consists of three parts: 1. Part 1 captures the steps for selecting households and respondents, and includes the introduction to the respondent and (pp.1-4). This section should be filled in by the Fieldworker. 2. Part 2 covers the core attitudinal and demographic questions that are asked by the Fieldworker and answered by the Respondent (Q1 – Q100). 3. Part 3 includes contextual questions about the setting and atmosphere of the interview, and collects information on the Fieldworker. This section is completed by the Fieldworker (Q101 – Q123).
Outcome rates: - Contact rate: 97% - Cooperation rate: 88% - Refusal rate: 1% - Response rate: 85%
The sample size yields country-level results with a margin of error of +/-2 percentage points at a 95% confidence level.
Information about the ethnic affiliation(s) and characteristics of a human population. Includes, for example, information about: the ethnic groups located within a geographic region, their community social structures, their mutual associations and conflicts with other groups, their historic roles and influence, and the physical distribution of their members. Ethnic groups are human populations whose members identify with each other, usually on the basis of having a common cultural traditions and heritage (for example: as distinguished by customs, language, religious practices, or common history) or a presumed common genealogy or ancestry.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Tanzania: Demographic pressures, 0 (low) - 10 (high): The latest value from 2024 is 8.8 index points, an increase from 8.6 index points in 2023. In comparison, the world average is 5.80 index points, based on data from 176 countries. Historically, the average for Tanzania from 2007 to 2024 is 8.39 index points. The minimum value, 7.4 index points, was reached in 2007 while the maximum of 8.9 index points was recorded in 2016.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
aNumbers in square brackets are standard deviations.Descriptive Statistics for Households Contributing Child Health Data by Ethnicity.
The primary objective of the 2022 Tanzania Demographic and Health Survey and Malaria Indicator Survey (2022 TDHSMIS) is to provide current and reliable information on population and health issues. Specifically, the 2022 TDHS-MIS collected information on marriage and sexual activity, fertility and fertility preferences, family planning, infant and child mortality, maternal health care, disability among the household population, child health, nutrition of children and women, malaria prevalence, knowledge, and communication, women’s empowerment, women’s experience of domestic violence, adult maternal mortality via sisterhood method, awareness and behaviour regarding HIV/AIDS and other sexually transmitted infections (STIs), female genital cutting, and early childhood development. Other information collected on health-related issues included smoking, blood pressure, anaemia, malaria, and iodine testing, height and weight, and micronutrients.
The information collected through the 2022 TDHS-MIS is intended to assist policymakers and programme managers in evaluating and designing programmes and strategies for improving the health of Tanzania’s population. The 2022 TDHS-MIS also provides indicators to monitor and evaluate international, regional, and national programmes, such as the Global Agenda 2030 on Sustainable Development Goals (2030 SDGs), Tanzania Development Vision 2025, the Third National Five-Year Development Plan (FYDP III 2021/22–2025/26), East Africa Community Vision 2050 (EAC 2050), and Africa Development Agenda 2063 (ADA 2063).
National coverage
The survey covered all de jure household members (usual residents), all women aged 15-49, men aged 15-49, and all children aged 0-4 resident in the household.
Sample survey data [ssd]
The sample design for the 2022 TDHS-MIS was carried out in two stages and was intended to provide estimates for the entire country, for urban and rural areas in Tanzania Mainland, and for Zanzibar. For specific indicators such as contraceptive use, the sample design allows for estimation of indicators for each of the 31 regions—26 regions in Tanzania Mainland and 5 regions in Zanzibar.
The sampling frame excluded institutional populations, such as persons in hospitals, hotels, barracks, camps, hostels, and prisons. The 2022 TDHS-MIS followed a stratified two-stage sample design. The first stage involved selection of sampling points (clusters) consisting of enumeration areas (EAs) delineated for the 2012 Tanzania Population and Housing Census (2012 PHC). The EAs were selected with a probability proportional to their size within each sampling stratum. A total of 629 clusters were selected. Among the 629 EAs, 211 were from urban areas and 418 were from rural areas.
In the second stage, 26 households were selected systematically from each cluster, for a total anticipated sample size of 16,354 households for the 2022 TDHS-MIS. A household listing operation was carried out in all the selected EAs before the main survey. During the household listing operation, field staff visited each of the selected EAs to draw location maps and detailed sketch maps and to list all residential households found in each EA with addresses and the names of the heads of the households. The resulting list of households served as a sampling frame for the selection of households in the second stage. During the listing operation, field teams collected global positioning system (GPS) data—latitude, longitude, and altitude readings—to produce one GPS point per EA. To estimate geographic differentials for certain demographic indicators, Tanzania was divided into nine geographic zones. Although these zones are not official administrative areas, this classification system is also used by the Reproductive and Child Health Section of the Ministry of Health. Grouping of regions into zones allows for larger denominators and smaller sampling errors for indicators at the zonal level.
For further details on sample design, see APPENDIX A of the final report.
Computer Assisted Personal Interview [capi]
Five questionnaires were used for the 2022 TDHS-MIS: the Household Questionnaire, the Woman’s Questionnaire, the Man’s Questionnaire, the Biomarker Questionnaire, and the Micronutrient Questionnaire. The questionnaires, based on The DHS Program’s Model Questionnaires, were adapted to reflect the population and health issues relevant to Tanzania. In addition, a self-administered Fieldworker’s Questionnaire collected information about the survey’s fieldworkers.
In the 2022 TDHS-MIS survey, CAPI was used during data collection. The devices used for CAPI were Android-based computer tablets programmed using a mobile version of CSPro. Programming of questionnaires into the android application was done by ICF, while configuration of tablets was done by NBS and OCGS in collaboration with ICF. All fieldwork personnel were assigned usernames, and devices were password protected to ensure the integrity of the data collected. Selected households were assigned to CAPI supervisors, whereas households were assigned to interviewers’ tablets via Bluetooth. The data for all interviewed households were sent back to CAPI supervisors, who were responsible for initial data consistency and editing, before being sent to the central servers hosted at NBS Headquarters via Syncloud.
The data processing of the 2022 TDHS-MIS ran concurrently with the data collection exercise. The electronic data files from each completed cluster were transferred via Syncloud to the NBS central office server in Dodoma. The data files were registered and checked for inconsistencies, incompleteness, and outliers. Errors and inconsistencies were communicated to the field teams for review and correction. Secondary central data editing was done by NBS and OCGS survey staff at the central office. A CSPro batch editing tool was used for cleaning data and included coding of open-ended questions and resolving inconsistencies.
The Biomarker paper questionnaires were collected by field supervisors and compared with the electronic data files to check for any inconsistencies that may have occurred during data entry. The concurrent data collection and processing offered an advantage because it maximised the likelihood of having error-free data. Timely generation of field check tables allowed effective monitoring. The secondary data editing exercise was completed in October 2022.
A total of 16,312 households were selected for the 2022 TDHS-MIS sample. This number is slightly less than the targeted sample size of 16,354 because one EA could not be reached due to security reasons, while a few EAs had less than the targeted 26 households. Of the 16,312 households selected, 15,907 were found to be occupied. Of the occupied households, 15,705 were successfully interviewed, yielding a response rate of 99%. In the interviewed households, 15,699 women age 15–49 were identified as eligible for individual interviews. Interviews were completed with 15,254 women, yielding a response rate of 97%. In the subsample (50% of households) of households selected for the male questionnaire, 6,367 men age 15–49 were identified as eligible for individual interviews, and 5,763 were successfully interviewed, yielding a response rate of 91%.
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and in data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2022 Tanzania Demographic and Health Survey and Malaria Indicator Survey (2022 TDHS-MIS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2022 TDHS-MIS is only one of many samples that could have been selected from the same population, using the same design and identical size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95% of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2022 TDHS-MIS sample was the result of a multistage stratified design, and,
https://ega-archive.org/dacs/EGAC00001000169https://ega-archive.org/dacs/EGAC00001000169
We aim to provide a powerful reference set for genome-wide association studies (GWAS) in African populations. Our pilot study to sequence 100 individuals each from Fula, Jola, Mandinka and Wollof from the Gambia to low coverage has been completed - this first part of the main effort will make available low coverage WGS data for 400 individuals from multiple ethnic groups in Burkina Faso, Cameroon, Ghana and Tanzania. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Adjusted odds ratios are adjusted for hunger season and a random intercept for village. See File SI2 for details on the Household Food Insecurity Access Scale.***p
https://ega-archive.org/dacs/EGAC00001000168https://ega-archive.org/dacs/EGAC00001000168
We aim to provide a powerful reference set for genome-wide association studies (GWAS) in African populations. Our pilot study to sequence 100 individuals each from Fula, Jola, Mandinka and Wollof from the Gambia to low coverage has been completed - this first part of the main effort will make available low coverage WGS data for 400 individuals from multiple ethnic groups in Burkina Faso, Cameroon, Ghana and Tanzania. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Adjusted odds ratios are adjusted for child age, child sex, hunger season and a random intercept for village.+ p
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Adjusted B coefficients are adjusted for child age, child sex, hunger season and a random intercept for village.*p
https://ega-archive.org/dacs/EGAC00001000174https://ega-archive.org/dacs/EGAC00001000174
We aim to provide a powerful reference set for genome-wide association studies (GWAS) in African populations. Our pilot study to sequence 100 individuals each from Fula, Jola, Mandinka and Wollof from the Gambia to low coverage has been completed - this first part of the main effort will make available low coverage WGS data for 400 individuals from multiple ethnic groups in Burkina Faso, Cameroon, Ghana and Tanzania. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Adjusted odds ratios are adjusted for child age, child sex, hunger season and a random intercept for village.*p
https://ega-archive.org/dacs/EGAC00001000176https://ega-archive.org/dacs/EGAC00001000176
We aim to provide a powerful reference set for genome-wide association studies (GWAS) in African populations. Our pilot study to sequence 100 individuals each from Fula, Jola, Mandinka and Wollof from the Gambia to low coverage has been completed - this first part of the main effort will make available low coverage WGS data for 400 individuals from multiple ethnic groups in Burkina Faso, Cameroon, Ghana and Tanzania. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
https://ega-archive.org/dacs/EGAC00001000175https://ega-archive.org/dacs/EGAC00001000175
We aim to provide a powerful reference set for genome-wide association studies (GWAS) in African populations. Our pilot study to sequence 100 individuals each from Fula, Jola, Mandinka and Wollof from the Gambia to low coverage has been completed - this first part of the main effort will make available low coverage WGS data for 400 individuals from multiple ethnic groups in Burkina Faso, Cameroon, Ghana and Tanzania. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Frequent and unregulated use of antimicrobials (AM) in livestock requires public health attention as a likely selection pressure for resistant bacteria. Studies among small-holders, who own a large percentage of the world’s livestock, are vital for understanding how practices involving AM use might influence resistance. We present a cultural-ecological mixed-methods analysis to explore sectors of veterinary care, loosely regulated AM use, and human exposure to AMs through meat and milk consumption across three rural to peri-urban Tanzanian ethnic groups (N = 415 households). Reported use of self-administered AMs varied by ethnic group (Maasai: 74%, Arusha: 21%, Chagga: 1%) as did consultation with professional veterinarians (Maasai: 36%, Arusha: 45%, Chagga: 96%) and observation of withdrawal of meat and milk from consumption during and following AM treatment (Maasai: 7%, Arusha: 72%, Chagga: 96%). The antibiotic oxytetracycline was by far the most common AM in this sample. Within ethnic groups, herd composition differences, particularly size of small-stock and cattle herds, were most strongly associated with differences in lay AM use. Among the Arusha, proxies for urbanization, including owning transportation and reliance on “zero-grazing” herds had the strongest positive associations with veterinarian consultation, while distance to urban centers was negatively associated. For Maasai, consultation was negatively associated with use of traditional healers or veterinary drug-shops. Observation of withdrawal was most strongly associated with owning technology among Maasai while Arusha observance displayed seasonal differences. This “One-Health” analysis suggests that livelihood and cultural niche factors, through their association with practices in smallholder populations, provide insight into the selection pressures that may contribute to the evolution and dissemination of antimicrobial resistance.
The Afrobarometer project assesses attitudes and public opinion on democracy, markets, and civil society in several sub-Saharan African.This dataset was compiled from the studies in Round 3 of the Afrobarometer survey, conducted from 2005-2006 in 18 African countries (Benin, Botswana, Cape Verde, Ghana, Kenya, Lesotho, Madagascar, Malawi, Mali, Mozambique, Namibia, Nigeria, Senegal, South Africa, Tanzania, Uganda, Zambia, Zimbabwe).
The Afrobarometer surveys have national coverage
Botswana Lesotho Malawi Namibia South Africa Zambia Zimbabwe Ghana Mali Nigeria Tanzania Uganda Cape Verde Mozambique Senegal Kenya Benin Madagascar
Basic units of analysis that the study investigates include: individuals and groups
The sample universe for Afrobarometer surveys includes all citizens of voting age within the country. In other words, we exclude anyone who is not a citizen and anyone who has not attained this age (usually 18 years) on the day of the survey. Also excluded are areas determined to be either inaccessible or not relevant to the study, such as those experiencing armed conflict or natural disasters, as well as national parks and game reserves. As a matter of practice, we have also excluded people living in institutionalized settings, such as students in dormitories and persons in prisons or nursing homes.
What to do about areas experiencing political unrest? On the one hand we want to include them because they are politically important. On the other hand, we want to avoid stretching out the fieldwork over many months while we wait for the situation to settle down. It was agreed at the 2002 Cape Town Planning Workshop that it is difficult to come up with a general rule that will fit all imaginable circumstances. We will therefore make judgments on a case-by-case basis on whether or not to proceed with fieldwork or to exclude or substitute areas of conflict. National Partners are requested to consult Core Partners on any major delays, exclusions or substitutions of this sort.
Sample survey data [ssd]
A new sample has to be drawn for each round of Afrobarometer surveys. Whereas the standard sample size for Round 3 surveys will be 1200 cases, a larger sample size will be required in societies that are extremely heterogeneous (such as South Africa and Nigeria), where the sample size will be increased to 2400. Other adaptations may be necessary within some countries to account for the varying quality of the census data or the availability of census maps.
The sample is designed as a representative cross-section of all citizens of voting age in a given country. The goal is to give every adult citizen an equal and known chance of selection for interview. We strive to reach this objective by (a) strictly applying random selection methods at every stage of sampling and by (b) applying sampling with probability proportionate to population size wherever possible. A randomly selected sample of 1200 cases allows inferences to national adult populations with a margin of sampling error of no more than plus or minus 2.5 percent with a confidence level of 95 percent. If the sample size is increased to 2400, the confidence interval shrinks to plus or minus 2 percent.
Sample Universe
The sample universe for Afrobarometer surveys includes all citizens of voting age within the country. In other words, we exclude anyone who is not a citizen and anyone who has not attained this age (usually 18 years) on the day of the survey. Also excluded are areas determined to be either inaccessible or not relevant to the study, such as those experiencing armed conflict or natural disasters, as well as national parks and game reserves. As a matter of practice, we have also excluded people living in institutionalized settings, such as students in dormitories and persons in prisons or nursing homes.
What to do about areas experiencing political unrest? On the one hand we want to include them because they are politically important. On the other hand, we want to avoid stretching out the fieldwork over many months while we wait for the situation to settle down. It was agreed at the 2002 Cape Town Planning Workshop that it is difficult to come up with a general rule that will fit all imaginable circumstances. We will therefore make judgments on a case-by-case basis on whether or not to proceed with fieldwork or to exclude or substitute areas of conflict. National Partners are requested to consult Core Partners on any major delays, exclusions or substitutions of this sort.
Sample Design
The sample design is a clustered, stratified, multi-stage, area probability sample.
To repeat the main sampling principle, the objective of the design is to give every sample element (i.e. adult citizen) an equal and known chance of being chosen for inclusion in the sample. We strive to reach this objective by (a) strictly applying random selection methods at every stage of sampling and by (b) applying sampling with probability proportionate to population size wherever possible.
In a series of stages, geographically defined sampling units of decreasing size are selected. To ensure that the sample is representative, the probability of selection at various stages is adjusted as follows:
The sample is stratified by key social characteristics in the population such as sub-national area (e.g. region/province) and residential locality (urban or rural). The area stratification reduces the likelihood that distinctive ethnic or language groups are left out of the sample. And the urban/rural stratification is a means to make sure that these localities are represented in their correct proportions. Wherever possible, and always in the first stage of sampling, random sampling is conducted with probability proportionate to population size (PPPS). The purpose is to guarantee that larger (i.e., more populated) geographical units have a proportionally greater probability of being chosen into the sample. The sampling design has four stages
A first-stage to stratify and randomly select primary sampling units;
A second-stage to randomly select sampling start-points;
A third stage to randomly choose households;
A final-stage involving the random selection of individual respondents
We shall deal with each of these stages in turn.
STAGE ONE: Selection of Primary Sampling Units (PSUs)
The primary sampling units (PSU's) are the smallest, well-defined geographic units for which reliable population data are available. In most countries, these will be Census Enumeration Areas (or EAs). Most national census data and maps are broken down to the EA level. In the text that follows we will use the acronyms PSU and EA interchangeably because, when census data are employed, they refer to the same unit.
We strongly recommend that NIs use official national census data as the sampling frame for Afrobarometer surveys. Where recent or reliable census data are not available, NIs are asked to inform the relevant Core Partner before they substitute any other demographic data. Where the census is out of date, NIs should consult a demographer to obtain the best possible estimates of population growth rates. These should be applied to the outdated census data in order to make projections of population figures for the year of the survey. It is important to bear in mind that population growth rates vary by area (region) and (especially) between rural and urban localities. Therefore, any projected census data should include adjustments to take such variations into account.
Indeed, we urge NIs to establish collegial working relationships within professionals in the national census bureau, not only to obtain the most recent census data, projections, and maps, but to gain access to sampling expertise. NIs may even commission a census statistician to draw the sample to Afrobarometer specifications, provided that provision for this service has been made in the survey budget.
Regardless of who draws the sample, the NIs should thoroughly acquaint themselves with the strengths and weaknesses of the available census data and the availability and quality of EA maps. The country and methodology reports should cite the exact census data used, its known shortcomings, if any, and any projections made from the data. At minimum, the NI must know the size of the population and the urban/rural population divide in each region in order to specify how to distribute population and PSU's in the first stage of sampling. National investigators should obtain this written data before they attempt to stratify the sample.
Once this data is obtained, the sample population (either 1200 or 2400) should be stratified, first by area (region/province) and then by residential locality (urban or rural). In each case, the proportion of the sample in each locality in each region should be the same as its proportion in the national population as indicated by the updated census figures.
Having stratified the sample, it is then possible to determine how many PSU's should be selected for the country as a whole, for each region, and for each urban or rural locality.
The total number of PSU's to be selected for the whole country is determined by calculating the maximum degree of clustering of interviews one can accept in any PSU. Because PSUs (which are usually geographically small EAs) tend to be socially homogenous we do not want to select too many people in any one place. Thus, the Afrobarometer has established a standard of no more than 8 interviews per PSU. For a sample size of 1200, the sample must therefore contain 150 PSUs/EAs (1200 divided by 8). For a sample size of 2400, there must be 300 PSUs/EAs.
These PSUs should then be allocated
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
FGDs per village, district, division and ethnic groups.
Sukuma was the largest ethnic group in Tanzania as of 2021. Around **** percent of the surveyed population identified themselves as from the Bantu ethnic group. Nearly *** percent belonged to the Ha group, while *** percent were from the Gogo group. About *** percent of the respondents identified themselves as Tanzanians only or reported that they don't think of themselves in terms of ethnic communities, cultural groups, or tribes. Overall, around *** ethnic groups are estimated to live in Tanzania.