Facebook
TwitterThe GHS is an annual household survey which measures the living circumstances of South African households. The GHS collects data on education, health, and social development, housing, access to services and facilities, food security, and agriculture.
National coverage
Households and individuals
The survey covers all de jure household members (usual residents) of households in the nine provinces of South Africa, and residents in workers' hostels. The survey does not cover collective living quarters such as student hostels, old age homes, hospitals, prisons, and military barracks.
Sample survey data [ssd]
From 2015 the General Household Survey (GHS) uses a Master Sample (MS) frame developed in 2013 as a general-purpose sampling frame to be used for all Stats SA household-based surveys. This MS has design requirements that are reasonably compatible with the GHS. The 2013 Master Sample is based on information collected during the 2011 Census conducted by Stats SA. In preparation for Census 2011, the country was divided into 103576 enumeration areas (EAs). The census EAs, together with the auxiliary information for the EAs, were used as the frame units or building blocks for the formation of primary sampling units (PSUs) for the Master Sample, since they covered the entire country, and had other information that is crucial for stratification and creation of PSUs. There are 3324 primary sampling units (PSUs) in the Master Sample, with an expected sample of approximately 33000 dwelling units (DUs). The number of PSUs in the current Master Sample (3324) reflect an 8,0% increase in the size of the Master Sample compared to the previous (2008) Master Sample (which had 3080 PSUs). The larger Master Sample of PSUs was selected to improve the precision (smaller coefficients of variation, known as CVs) of the GHS estimates. The Master Sample is designed to be representative at provincial level and within provinces at metro/non-metro levels. Within the metros, the sample is further distributed by geographical type. The three geography types are Urban, Tribal and Farms. This implies, for example, that within a metropolitan area, the sample is representative of the different geography types that may exist within that metro.
The sample for the GHS is based on a stratified two-stage design with probability proportional to size (PPS) sampling of PSUs in the first stage, and sampling of dwelling units (DUs) with systematic sampling in the second stage. After allocating the sample to the provinces, the sample was further stratified by geography (primary stratification), and by population attributes using Census 2011 data (secondary stratification).
Computer Assisted Telephone Interview [cati]
Data was collected with a household questionnaire and a questionnaire administered to a household member to elicit information on household members.
Since 2019, the questionnaire for the GHS series changed and the variables were also renamed. For correspondence between old names (GHS pre-2019) and new name (GHS post-2019), see the document ghs-2019-variables-renamed.
Facebook
TwitterThe Quarterly Labour Force Survey (QLFS) is a household-based sample survey conducted by Statistics South Africa (StatsSA). It collects data on the labour market activities of individuals aged 15 years or older who live in South Africa. Since 2008, StatsSA have produced an annual dataset based on the QLFS data, "Labour Market Dynamics in South Africa". The dataset is constructed using data from all all four QLFS datasets in the year. The dataset also includes a number of variables (including income) that are not available in any of the QLFS datasets from 2010.
National coverage
Individuals
The QLFS sample covers the non-institutional population except for those in workers' hostels. However, persons living in private dwelling units within institutions are enumerated. For example, within a school compound, one would enumerate the schoolmaster's house and teachers' accommodation because these are private dwellings. Students living in a dormitory on the school compound would, however, be excluded.
Sample survey data [ssd]
Each year the LMDSA is created by combining the QLFS waves for that year and then including some additional variables. The QLFS master frame for this LMDSA was based on the 2011 population census by Stas SA. The sampling is stratified by province, district, and geographic type (urban, traditional, farm). There are 3324 PSUs drawn each year, using probability proportional to size (PPS) sampling. In the second stage Dwelling Units (DUs) are systematically selected from PSUs. The 3324 PSU are split into four groups for the year, and at each quarter the DUs from the given group are replaced by substitute DUs from the same PSU or the next PSU on the list (in the same group). It should be noted that the sampling unit is the dwelling, and the unit of observation is the household. Therefore, if a household moves out of a dwelling after being in the sample for, two quarters and a new household moves in, the new household will be enumerated for two more quarters until the DU is rotated out. If no household moves into the sampled dwelling, the dwelling will be classified as vacant (or unoccupied).
Computer Assisted Telephone Interview [cati]
The survey questionnaire consists of the following sections: - Particulars of each person in the household - Economic activities in the last week for persons aged 15 years - Unemployment and economic inactivity for persons aged 15 years - Main work activity in the last week for persons aged 15 years - Earnings in the main job for employees, employers and own-account workers aged 15 years - Migration for all persons aged 15 years
The statistical release notes that missing values were "generally imputed" for item non-response but provides no detail on how Statistics SA did so.
Facebook
TwitterIn Southern Africa, women and men are equally present on social media . As of January 2024, ** percent of the social media users in the region were women and the remaining ** percent men. Southern Africa was the only area in the African continent with an equal proportion of women using social media.
Facebook
TwitterThis statistic shows the median age of the population in South Africa from 1950 to 2100*.The median age is the age that divides a population into two numerically equal groups; that is, half the people are younger than this age and half are older. It is a single index that summarizes the age distribution of a population. In 2020, the median age of the South African population was 27.3 years.
Facebook
TwitterIn April 2018, StatsSA launched the Governance Public Safety and Justice Survey (GPSJS) in response to the need for standardised international reporting standards on governance and access to justice that are recommended by the SDGs, ShaSA and Agenda 2063. In compliance with these standards, Stats SA discontinued the separate publication of the Victims of Crime Survey (VCS) and incorporated it within the new GPSJS series. Therefore, the GPSJS represents the new source of microdata on the experience and prevalence of particular kinds of crime within South Africa.
The GPSJS is a countrywide household-based survey which collects data on two types of crimes, namely, vehicle hijacking and home robbery. Business robbery is not covered by the survey. The survey includes information on victimisation experienced by individuals and households and their perspectives on community responses to crime. Additionally, the survey data includes information on legitimacy, voice, equity and discrimination. Therefore, GPSJS data can be used for research in the development of policies and strategies for governance, crime prevention, public safety and justice programmes. The main objectives of the survey are to:
• Provide information about the dynamics of crime from the perspective of households and the victims of crime.
• Explore public perceptions of the activities of the police, prosecutors, courts and correctional services in the prevention of crime and victimisation.
• Provide complimentary data on the level of crime within South Africa in addition to the statistics published annually by the South African Police Service.
NOTE: The GPSJS is a continuation of the VCS series, which ended with VCS 2017/18. Therefore, the VCS 2018/19 can be exctracted from GPSJS 2018/19 and is comparable to previous VCS's only where questions remained the same. Please see Data Quality Notes for more infomation on comparability.
The survey has national coverage.
Households and individuals
The target population of the survey consists of all private households in all nine provinces of South Africa, as well as residents in workers' hostels. The survey does not cover other collective living quarters such as students' hostels, old-age homes, hospitals, prisons and military barracks. It is only representative of non-institutionalised and non-military persons or households in South Africa.
Sample survey data [ssd]
The GPSJS 2020/21 uses the master sample (MS) sampling frame which has been developed as a general-purpose household survey frame that can be used by all other Stats SA household-based surveys having design requirements that are reasonably compatible with GPSJS. The GPSJS 2020/21 collection was drawn from the 2013 master sample. This master sample is based on information collected during Census 2011. In preparation for Census 2011, the country was divided into 103 576 enumeration areas (EAs). The census EAs, together with the auxiliary information for the EAs, were used as the frame units or building blocks for the formation of primary sampling units (PSUs) for the master sample, since they covered the entire country and had other information that is crucial for stratification and creation of PSUs.
There are 3 324 primary sampling units (PSUs) in the master sample with an expected sample of approximately 33 000 dwelling units (DUs). The number of PSUs in the current master sample (3 324) reflect an 8,0% increase in the size of the master sample compared to the previous (2008) master sample (which had 3 080 PSUs). The larger master sample of PSUs was selected to improve the precision (smaller coefficients of variation, known as CVs) of the GPSJS estimates.
Computer Assisted Telephone Interview [cati]
The GPSJS 2020/21 questionnaire is based on international reporting standards of governance, public safety and justice defined by the SDGs.
Sections 1 to 3 of the questionnaire relate to household crimes. A proxy respondent (preferably head of the household or acting head of household) answered on behalf of the household. Section 4 to 9 of the questionnaire relate to crimes experienced by individuals and were asked of a household member who was selected using the birthday section method. This methodology selects an individual who is 16 years or older, whose birthday is soonest after the survey date.
Comparability to VCS series:
While redesigning the VCS into the GPSJS, some questions were modified in order to align the series with international reporting demands (e.g. SDGs) and to improve the accuracy of victim reporting. This caused a break of series for affected questions, in particular questions on 12-month experience of crime. The question on 5-year experience of crime was not changed and hence there is no break of series. The 5-year trends can therefore be used as a proxy for the 12-month series as the two follow similar patterns. Similarity of shapes of the two series makes it possible to predict increase or decrease of crime during the past 12 months using the 5-year series.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Historical dataset showing South Africa tourist spending by year from 1995 to 2020.
Facebook
TwitterSouth Africa faced its first coronavirus (COVID-19) casualty on March 27, 2020. Ever since, the country has registered roughly **** thousand civilians who lost their lives to the pandemic. Moreover, throughout the outbreak, the largest daily death recorded was on January 19, 2021, when *** people were involved. As of October 24, 2021, South Africa was the most affected country on the continent, with over **** million cases of infections.
Facebook
TwitterThis cumulative dataset contains statistics on mortality and causes of death in South Africa covering the period 1997-2020. The mortality and causes of death dataset is part of a regular series published by Stats SA, based on data collected through the civil registration system. This dataset is the most recent cumulative round in the series which began with the separately available dataset Recorded Deaths 1996.
The main objective of this dataset is to outline emerging trends and differentials in mortality by selected socio-demographic and geographic characteristics for deaths that occurred in the registered year and over time. Reliable mortality statistics, are the cornerstone of national health information systems, and are necessary for population health assessment, health policy and service planning; and programme evaluation. They are essential for studying the occurrence and distribution of health-related events, their determinants and management of related health problems. These data are particularly critical for monitoring the Sustainable Development Goals (SDGs) and Agenda 2063 which share the same goal for a high standard of living and quality of life, sound health and well-being for all and at all ages. Mortality statistics are also required for assessing the impact of non-communicable diseases (NCD's), emerging infectious diseases, injuries and natural disasters.
The survey has national coverage.
Individuals
This dataset is based on information on mortality and causes of death from the South African civil registration system. It covers all death notification forms from the Department of Home Affairs for deaths that occurred in 1997-2020, that reached Stats SA during the 2021/2022 processing phase.
Administrative records
Other
The registration of deaths is captured using two instruments: form BI-1663 and form DHA-1663 (Notification/Register of death/stillbirth).
This cumulative dataset is part of a regular series published by Stats SA and includes all previous rounds in the series (excluding Recorded Deaths 1996). Stats SA only includes one variable to classify the occupation group of the deceased (OccupationGrp) in the current round (1997-2020). Prior to 2016, Stats SA included both occupation group (OccupationGrp) and industry classifcation (Industry) in all previous rounds. Therefore, DataFirst has made the 1997-2015 cumulative round available as a separately downloadable dataset which includes both occupation group and industry classification of the deceased spanning the years 1997-2015.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Unemployment Rate in South Africa decreased to 31.90 percent in the third quarter of 2025 from 33.20 percent in the second quarter of 2025. This dataset provides - South Africa Unemployment Rate - actual values, historical data, forecast, chart, statistics, economic calendar and news.
Facebook
TwitterNational coverage
households/individuals
survey
Quarterly: average based on 3 monthly data points
Sample size:
Facebook
TwitterThe population share with mobile internet access in South Africa was forecast to continuously increase between 2024 and 2029 by in total 16.7 percentage points. After the ninth consecutive increasing year, the mobile internet penetration is estimated to reach 38.27 percent and therefore a new peak in 2029. Notably, the population share with mobile internet access of was continuously increasing over the past years.The penetration rate refers to the share of the total population having access to the internet via a mobile broadband connection.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).
Facebook
TwitterThe National Household Travel Survey (NHTS) 2020 is the third round of the survey series designed to assess domestic transport and tourism travel patterns of South African households as well as their attitudes about transport. The NHTS collects data on general household characteristics, travel patterns of households, and attitudes about transport.
National coverage
Households and individuals
The target population of the survey consists of all private households in all nine provinces of South Africa and residents in workers' hostels. The survey does not cover other collective living quarters such as students' hostels, old-age homes, hospitals, prisons and military barracks.
Sample survey data [ssd]
The sample design for the NHTS was based on a master sample (MS) that used a two-stage, stratified design with probability-proportional-to-size (PPS) sampling of PSUs from within strata, and systematic sampling of dwelling units (DUs) from the sampled primary sampling units (PSUs). A self-weighting design at provincial level was used and MS stratification was divided into two levels, primary and secondary stratification. Primary stratification was defined by metropolitan and non-metropolitan geographic area type. During secondary stratification, the Census 2001 data were summarized at PSU level. The following variables were used for secondary stratification, household size, education, occupancy status, gender, industry and income.
Census enumeration areas (EAs) as delineated for Census 2001 formed the basis of the PSUs. The following additional rules were used: • Where possible, PSU sizes were kept between 100 and 500 dwelling units (DUs); • EAs with fewer than 25 DUs were excluded. • EAs with between 26 and 99 DUs were pooled to form larger PSUs and the criteria used was same settlement type; • Virtual splits were applied to large PSUs: 500 to 999 splits into two; 1000 to 1499 splits into three; and 1500 plus split into four PSUs; and • Informal PSUs were segmented.
A Randomized Probability Proportional to Size (RPPS) systematic sample of PSUs was drawn in each stratum, with the measure of size being the number of households in the PSU. Altogether approximately 3080 PSUs were selected. In each selected PSU a systematic sample of dwelling units was drawn. The number of DUs selected per PSU varies from PSU to PSU and depends on the Inverse Sampling Ratios (ISR) of each PSU.
Face-to-face [f2f]
The survey questionnaire consisted of 9 sections: Section 1: General health and functioning, social grants and social relief Section 2: General travel patterns Section 3: Education and education-related travel patterns Section 4: Work-related travel patterns (age 15 years and above) Section 5: Business trips Section 6: Other travel patterns Section 7: General household information Section 8: Household attitudes and perceptions about transport Section 9: Survey officer questions
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Key information about South Africa Real GDP Growth
Facebook
TwitterThe Quarterly Labour Force Survey (QLFS) is a household-based sample survey conducted by Statistics South Africa (Stats SA). It collects data on the labour market activities of individuals aged 15 years or older who live in South Africa.
National coverage
Individuals
The QLFS sample covers the non-institutional population of South Africa with one exception. The only institutional subpopulation included in the QLFS sample are individuals in worker's hostels. Persons living in private dwelling units within institutions are also enumerated. For example, within a school compound, one would enumerate the schoolmaster's house and teachers' accommodation because these are private dwellings. Students living in a dormitory on the school compound would, however, be excluded.
Sample survey data [ssd]
The Quarterly Labour Force Survey (QLFS) uses a master sample frame which has been developed as a general-purpose household survey frame that can be used by all other Stats SA household surveys that have reasonably compatible design requirement as the QLFS. The 2013 master sample is based on information collected during the 2011 population Census conducted by Stats SA. In preparation for Census 2011, the country was divided into 103 576 enumeration areas (EAs). The census EAs, together with the auxiliary information for the EAs were used as the frame units or building blocks for the formation of primary sampling units (PSUs) for the master sample since they covered the entire country and had other information that is crucial for stratification and creation of PSUs. There are 3 324 primary sampling units (PSUs) in the master sample with an expected sample of approximately 33 000 dwelling units (DUs). The number of PSUs in the current master sample (3 324) reflects an 8,0% increase in the size of the master sample compared to the previous (2007) master sample (which had 3 080 PSUs). The larger master sample of PSUs was selected to improve the precision (smaller CVs) of the QLFS estimates.
From the master sample frame, the QLFS takes draws employing a stratified two-stage design with probability proportional to size (PPS) sampling of PSUs in the first stage, and sampling of dwelling units (DUs) with systematic sampling in the second stage. The primary stratification occurred at provincial, metro/non-metro, mining and geography type while the secondary strata were created within the primary strata based on the demographic and socio-economic characteristics of the population. For each quarter of the QLFS, a ¼ of the sampled dwellings is rotated out of the sample. These dwellings are replaced by new dwellings from the same PSU or the next PSU on the list.
For more see the release document that is distributed with the data.
It should be noted that the Quarterly Labour Force Survey (QLFS) for Quarter 1 (January to March) of 2020 data collection was disrupted when Stats SA suspended face-to-face data collection for all its surveys on 19 March 2020 as a result of the COVID-19 pandemic and restricted movement. This was to ensure that the field staff and respondents were not exposed to the risk of contracting coronavirus and to contain its spread. As a result, some dwellings (621 or 2,0% of the 30 608 sampled dwelling units) were not interviewed which otherwise would have been interviewed. To compensate for this, Stats SA made use of the fact that the design of the QLFS is such that sampled dwelling units are in the sample for four successive quarters. So, for persons in dwelling units that were not visited as a result of the lockdown, imputations were done where possible using data from the previous quarter. For respondents who were not visited in the first quarter of 2020 but had information from the fourth quarter of 2019, their responses were carried over to the first quarter of 2020.
If the person was shown as unemployed or not economically active in the last quarter of 2019, that was the status assigned to them for the first quarter of 2020. If the person was shown as employed in the fourth quarter of 2019, the imputation was somewhat more complex. This was necessitated by the fact that that there are usually temporary jobs created in the fourth quarter of each year that do not continue into the following year. Accordingly, if the person started the job that he/she held in Q4: 2019 in some previous quarter, it was assumed that the job continued into Q1: 2020. On the other hand, if the job held in Q4: 2019 had only started in that quarter, that person was treated as non-respondent in Q1: 2020.
Face-to-face [f2f]
The survey questionnaire consists of five section: Section 1: Biographical information (marital status, language, migration, education, training, literacy, etc.) Section 2: Economic activities for persons aged 15 years and older Section 3: Unemployment and economic inactivity for persons aged 15 years and older Section 4: Main work activities in the last week for persons aged 15 years and older Section 5: Earnings in the main job for employees, employers and own-account workers aged 15 years and older
COVID 19 Affected data collection for QLFS 2020 Q1. Please see the sampling section for more on this.
Facebook
TwitterThe Quarterly Labour Force Survey (QLFS) is a household-based sample survey conducted by Statistics South Africa (Stats SA). It collects data on the labour market activities of individuals aged 15 years or older who live in South Africa.
National coverage
Individuals
The QLFS sample covers the non-institutional population of South Africa with one exception. The only institutional subpopulation included in the QLFS sample are individuals in worker's hostels. Persons living in private dwelling units within institutions are also enumerated. For example, within a school compound, one would enumerate the schoolmaster's house and teachers' accommodation because these are private dwellings. Students living in a dormitory on the school compound would, however, be excluded.
Sample survey data [ssd]
The sampling procedure for this quarter of the QLFS had to be changed from the norm due to covid-19. In this 2020 Q4 the sample used was the same sample as from Q3, Q2 and Q1 from 2020. Because the interviews were done telephonically, any sampling units that did not have telephones dropped out of the sample. This was adjusted for in the weighting procedure.
Computer Assisted Telephone Interview [cati]
The survey questionnaire consists of the following sections: - Biographical information (marital status, education, etc.) - Economic activities in the last week for persons aged 15 years and older - Unemployment and economic inactivity for persons aged 15 years and above - Main work activity in the last week for persons aged 15 years and above - Earnings in the main job for employees, employers and own-account workers aged 15 years and above
From 2010 the income data collected by South Africa's Quarterly Labour Force Survey is no longer provided in the QLFS dataset (except for a brief return in QLFS 2010 Q3 which may be an error). Possibly because the data is unreliable at the level of the quarter, Statistics South Africa now provides the income data from the QLFS in an annualised dataset called Labour Market Dynamics in South Africa (LMDSA). The datasets for LMDSA are available from DataFirst's website.
In general, imputation was used for item non-response (i.e., blanks within the questionnaire) and edit failures (i.e., invalid or inconsistent responses).
Facebook
TwitterNational coverage
households/individuals
survey
Yearly
Sample size:
Facebook
TwitterAs of March 06, 2022, the amount of positive coronavirus (COVID-19) tests grew by 1,147 in South Africa, reaching 3,684,319 cases in total. As of the same date, Gauteng was the most affected region with 1,196,591 confirmed cases, whereas KwaZulu-Natal and Western Cape counted 653,945 and 642,153 positive tested coronavirus cases, respectively.
Facebook
TwitterThe National Household Travel Survey 2020 is the third round of the survey series designed to assess domestic transport and tourism travel patterns of South African households as well as their attitudes about transport.
The survey has national coverage.
Households and individuals
The target population of the survey consists of all private households in all nine provinces of South Africa and residents in workers' hostels. The survey does not cover other collective living quarters such as students' hostels, old-age homes, hospitals, prisons and military barracks.
Sample survey data
The sample design for the NHTS was based on a master sample (MS) that used a two-stage, stratified design with probability-proportional-to-size (PPS) sampling of PSUs from within strata, and systematic sampling of dwelling units (DUs) from the sampled primary sampling units (PSUs). A self-weighting design at provincial level was used and MS stratification was divided into two levels, primary and secondary stratification. Primary stratification was defined by metropolitan and non-metropolitan geographic area type. During secondary stratification, the Census 2001 data were summarised at PSU level. The following variables were used for secondary stratification; household size, education, occupancy status, gender, industry and income.
Census enumeration areas (EAs) as delineated for Census 2001 formed the basis of the PSUs. The following additional rules were used: • Where possible, PSU sizes were kept between 100 and 500 dwelling units (DUs); • EAs with fewer than 25 DUs were excluded; • EAs with between 26 and 99 DUs were pooled to form larger PSUs and the criteria used was same settlement type; • Virtual splits were applied to large PSUs: 500 to 999 split into two; 1 000 to 1 499 split into three; and 1 500 plus split into four PSUs; and • Informal PSUs were segmented.
A Randomised Probability Proportional to Size (RPPS) systematic sample of PSUs was drawn in each stratum, with the measure of size being the number of households in the PSU. Altogether approximately 3 080 PSUs were selected. In each selected PSU a systematic sample of dwelling units was drawn. The number of DUs selected per PSU varies from PSU to PSU and depends on the Inverse Sampling Ratios (ISR) of each PSU.
Face-to-face [f2f]
Facebook
TwitterIn 1870, the average life expectancy in South Africa was 33.5 years from birth. This life expectancy would remain largely unchanged until the late-1910s, where life expectancy would drop to as low as thirty years as a result of the 1918 Spanish Flu epidemic. In the 1930s, life expectancy in South Africa would begin to steadily rise, peaking at over 63 years in 1995, as industrialization and greater access to healthcare and vaccinations led to significantly reduced child mortality rates across the region. However, life expectancy experienced a sudden drop beginning after 1995, as the HIV/AIDS epidemic spread throughout the country, beginning in the early 1990s. As the epidemic spread through the country, life expectancy would fall by almost 10 years, bottoming out below 54 years in 2005. Life expectancy would begin to rise again beginning in the early 2010s however, as access to HIV counselling and treatments, such as antiretroviral therapy, became more widely available throughout the region. Life expectancy in the country is estimated to be almost 64 years from birth in 2020; a return to the pre-HIV figures of the early 1990s.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The purpose of the SAM is to improve quality of database for modelling (multiplier analysis, price analysis, policy analysis and Computable General Equilibrium (CGE).
Facebook
TwitterThe GHS is an annual household survey which measures the living circumstances of South African households. The GHS collects data on education, health, and social development, housing, access to services and facilities, food security, and agriculture.
National coverage
Households and individuals
The survey covers all de jure household members (usual residents) of households in the nine provinces of South Africa, and residents in workers' hostels. The survey does not cover collective living quarters such as student hostels, old age homes, hospitals, prisons, and military barracks.
Sample survey data [ssd]
From 2015 the General Household Survey (GHS) uses a Master Sample (MS) frame developed in 2013 as a general-purpose sampling frame to be used for all Stats SA household-based surveys. This MS has design requirements that are reasonably compatible with the GHS. The 2013 Master Sample is based on information collected during the 2011 Census conducted by Stats SA. In preparation for Census 2011, the country was divided into 103576 enumeration areas (EAs). The census EAs, together with the auxiliary information for the EAs, were used as the frame units or building blocks for the formation of primary sampling units (PSUs) for the Master Sample, since they covered the entire country, and had other information that is crucial for stratification and creation of PSUs. There are 3324 primary sampling units (PSUs) in the Master Sample, with an expected sample of approximately 33000 dwelling units (DUs). The number of PSUs in the current Master Sample (3324) reflect an 8,0% increase in the size of the Master Sample compared to the previous (2008) Master Sample (which had 3080 PSUs). The larger Master Sample of PSUs was selected to improve the precision (smaller coefficients of variation, known as CVs) of the GHS estimates. The Master Sample is designed to be representative at provincial level and within provinces at metro/non-metro levels. Within the metros, the sample is further distributed by geographical type. The three geography types are Urban, Tribal and Farms. This implies, for example, that within a metropolitan area, the sample is representative of the different geography types that may exist within that metro.
The sample for the GHS is based on a stratified two-stage design with probability proportional to size (PPS) sampling of PSUs in the first stage, and sampling of dwelling units (DUs) with systematic sampling in the second stage. After allocating the sample to the provinces, the sample was further stratified by geography (primary stratification), and by population attributes using Census 2011 data (secondary stratification).
Computer Assisted Telephone Interview [cati]
Data was collected with a household questionnaire and a questionnaire administered to a household member to elicit information on household members.
Since 2019, the questionnaire for the GHS series changed and the variables were also renamed. For correspondence between old names (GHS pre-2019) and new name (GHS post-2019), see the document ghs-2019-variables-renamed.