CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
There are a number of sources for estimates of the size and distribution of ethnic group populations in England. These estimates vary in quality, accuracy, timeliness, and detail; in some cases, the underlying definition of what constitutes the resident population is different. This document outlines in some detail the major sources of ethnic group information currently available at the national and regional level. It also gives a brief summary of the estimates themselves.
The Bureau of the Census has released Census 2000 Summary File 1 (SF1) 100-Percent data. The file includes the following population items: sex, age, race, Hispanic or Latino origin, household relationship, and household and family characteristics. Housing items include occupancy status and tenure (whether the unit is owner or renter occupied). SF1 does not include information on incomes, poverty status, overcrowded housing or age of housing. These topics will be covered in Summary File 3. Data are available for states, counties, county subdivisions, places, census tracts, block groups, and, where applicable, American Indian and Alaska Native Areas and Hawaiian Home Lands. The SF1 data are available on the Bureau's web site and may be retrieved from American FactFinder as tables, lists, or maps. Users may also download a set of compressed ASCII files for each state via the Bureau's FTP server. There are over 8,000 data items available for each geographic area. The full listing of these data items is available here as a downloadable compressed database file named TABLES.ZIP. The uncompressed file is in FoxPro database file (dbf) format and may be imported into ACCESS, EXCEL, and other software.
While all of this information is useful, the Office of Community Planning and Development has downloaded selected information for all states and areas and is making this information available on the CPD web pages. The tables and data items selected are those used in the CDBG and HOME allocation formulas, plus topics most pertinent to the Comprehensive Housing Affordability Strategy (CHAS), the Consolidated Plan, and similar overall economic and community development plans. The information is contained in five compressed (zipped) dbf tables for each state. When uncompressed, the tables are ready for use with FoxPro and can be imported into ACCESS, EXCEL, and other spreadsheet, GIS and database software. The data are at the block group summary level.
The first two characters of the file name are the state abbreviation. The next two letters are BG for block group. The last part of the file name describes the contents. Each record is labeled with the code and name of the city and county in which it is located so that the data can be summarized to higher-level geography. The GEO file contains standard Census Bureau geographic identifiers for each block group, such as the metropolitan area code and congressional district code. The only data included in this table are total population and total housing units. POP1 and POP2 contain selected population variables, and selected housing items are in the HU file. The MA05 table is only for use by State CDBG grantees for reporting the racial composition of beneficiaries of Area Benefit activities. The complete package for a state consists of the dictionary file named TABLES and the five data files for the state. The logical record number (LOGRECNO) links the records across tables.
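The naming convention and the LOGRECNO link described above can be sketched in Python. This is a minimal illustration under stated assumptions: the file name "ALBGPOP1.dbf" is a hypothetical example of the pattern, the rows are assumed to be already loaded as dictionaries, and every field name other than LOGRECNO (TOTAL_POP, OWNER_OCC) is invented for the example.

```python
from collections import defaultdict

# Table suffixes named in the description above.
KNOWN_TABLES = {"GEO", "POP1", "POP2", "HU", "MA05"}


def parse_file_name(file_name):
    """Split a name such as 'ALBGPOP1.dbf' into (state, summary level, table)."""
    stem = file_name.rsplit(".", 1)[0].upper()
    state, level, table = stem[:2], stem[2:4], stem[4:]
    if level != "BG" or table not in KNOWN_TABLES:
        raise ValueError(f"unexpected file name: {file_name}")
    return state, level, table


def join_on_logrecno(*tables):
    """Merge rows from several tables that share the same LOGRECNO."""
    merged = defaultdict(dict)
    for rows in tables:
        for row in rows:
            merged[row["LOGRECNO"]].update(row)
    return dict(merged)


# Hypothetical rows standing in for records read from the GEO and HU tables.
geo_rows = [{"LOGRECNO": 1, "TOTAL_POP": 1200}, {"LOGRECNO": 2, "TOTAL_POP": 850}]
hu_rows = [{"LOGRECNO": 1, "OWNER_OCC": 300}, {"LOGRECNO": 2, "OWNER_OCC": 210}]

combined = join_on_logrecno(geo_rows, hu_rows)
print(parse_file_name("ALBGPOP1.dbf"))
print(combined[1])
```

The same join can be done in a spreadsheet or database tool after import; the point is only that LOGRECNO is the key shared by all five tables.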
This collection automatically captures metadata whose source is the Government of the Republic of Slovenia, Statistical Office of the Republic of Slovenia, and which corresponds to the source database entitled “Basic population groups by sex, by statistical definition of population from 1996, municipalities, Slovenia, 1999H2-2008H1”.
Actual data are available in PC-Axis format (.px). Additional links give access to the source portal page for viewing and selecting data, and to the PX-Win program, which can be downloaded free of charge. Both allow you to select data for display, change the format of the printout, and store it in different formats. They also let you view and print tables of unlimited size and produce some basic statistical analyses and graphics.
The study included four separate surveys:
1. The LSMS survey of the general population of Serbia in 2002
2. The survey of Family Income Support (MOP in Serbian) recipients in 2002
These two datasets are published together, separately from the 2003 datasets.
3. The LSMS survey of the general population of Serbia in 2003 (panel survey)
4. The survey of Roma from Roma settlements in 2003
These two datasets are published together.
Objectives
The LSMS is a multi-topic study of household living standards, based on international experience in designing and conducting this type of research. The basic survey was carried out in 2002 on a representative sample of households in Serbia (without Kosovo and Metohija). Its goal was to establish a poverty profile from comprehensive data on household welfare and to identify vulnerable groups. It also aimed to assess the targeting of safety net programs by collecting detailed information from individuals on participation in specific government social programs. This study was used as the basic document in developing the Poverty Reduction Strategy (PRS) in Serbia, which was adopted by the Government of the Republic of Serbia in October 2003.
The survey was repeated in 2003 on a panel sample (the households which participated in 2002 survey were re-interviewed).
Analysis of the take-up and profile of the population in 2003 was the first step towards formulating the monitoring system in the Poverty Reduction Strategy (PRS). The survey was conducted in accordance with the same methodological principles used in the 2002 survey, with the necessary changes limited to the content of certain modules and a reduction in sample size. The aim of the repeated survey was to obtain panel data to enable monitoring of the change in living standards over a period of one year, thus indicating whether poverty in Serbia had decreased or increased in the course of 2003. [Note: Panel data are the data obtained from the sample of households which participated in both surveys. These data made it possible to track the living standard of the same persons over a period of one year.]
Along with these two comprehensive surveys, conducted on nationally and regionally representative samples to give a picture of the general population, there were also two surveys with particular emphasis on vulnerable groups. In 2002, this was the survey of the living standard of Family Income Support recipients, with the aim of validating this state-supported social welfare program. In 2003, the survey of Roma from Roma settlements was conducted. All experience to date indicated that this was one of the most vulnerable groups on the territory of Serbia and Montenegro, yet no extensive research on poverty among the Roma population had been done. The aim of the survey was therefore to compare the poverty of this group with that of the general population and to establish which categories of the Roma population were at the greatest risk of poverty in 2003. However, it is necessary to stress that the LSMS of the Roma population covered the potentially most vulnerable Roma, while Roma integrated into the main population were not included in this study.
The surveys were conducted on the whole territory of Serbia (without Kosovo and Metohija).
Sample survey data [ssd]
The sample frame for both surveys of the general population (LSMS) in 2002 and 2003 consisted of all permanent residents of Serbia, without the population of Kosovo and Metohija, according to the definition of permanently resident population contained in the UN Recommendations for Population Censuses, which were applied in the 2002 Census of Population in the Republic of Serbia. Therefore, permanent residents were all persons living in the territory of Serbia for longer than one year, with the exception of diplomatic and consular staff.
The sample frame for the survey of Family Income Support recipients included all current recipients of this program on the territory of Serbia, based on the official list of recipients provided by the Ministry of Social Affairs.
Defining the Roma population from Roma settlements faced obstacles, since precise data on the total number of Roma in Serbia are not available. According to the most recent population Census, from 2002, there were 108,000 Roma citizens, but the Census data are thought to significantly underestimate the total Roma population. However, since no more precise data were available, this number was taken as the basis for the estimate of the Roma population from Roma settlements. Using the 2002 Census, settlements in which at least 7% of the total population declared themselves as belonging to Roma nationality were selected. A total of 83%, or 90,000, of self-declared Roma lived in the settlements defined in this way, and this number was taken as the sample frame for Roma from Roma settlements.
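The settlement selection rule can be illustrated with a short Python sketch. The settlement names and counts below are hypothetical; only the 7% threshold and the 108,000 and 90,000 figures come from the text above. Integer arithmetic is used for the threshold so that a settlement at exactly 7% is not lost to floating-point rounding.

```python
def select_settlements(settlements, min_percent=7):
    """Keep settlements where self-declared Roma are at least min_percent of residents."""
    return [s for s in settlements if s["roma"] * 100 >= min_percent * s["total"]]


def frame_size(selected):
    """Number of self-declared Roma living in the selected settlements."""
    return sum(s["roma"] for s in selected)


# Hypothetical census counts for three settlements.
settlements = [
    {"name": "A", "total": 1000, "roma": 150},  # 15.0% -> selected
    {"name": "B", "total": 5000, "roma": 100},  # 2.0%  -> not selected
    {"name": "C", "total": 800, "roma": 56},    # 7.0%  -> selected (at the threshold)
]

selected = select_settlements(settlements)
selected_names = [s["name"] for s in selected]
covered = frame_size(selected)

# Share of the census Roma total covered by the frame, using the figures in the text.
share_in_text = round(100 * 90_000 / 108_000)

print(selected_names, covered, share_in_text)
```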
Planned sample: In 2002 the planned size of the general population sample was 6,500 households. The sample was both nationally and regionally representative (representative for each individual stratum). In 2003 the planned panel sample size was 3,000 households. In order to preserve the representative quality of the sample, every other census block unit of the large sample realized in 2002 was kept. This preserved the identical allocation by strata. In each selected census block unit, the same households were interviewed as in the basic survey in 2002. The planned sample of Family Income Support recipients in 2002 and of Roma from Roma settlements in 2003 was 500 households for each group.
Sample type: In both national surveys the implemented sample was a two-stage stratified sample. Units of the first stage were enumeration districts, and units of the second stage were households. In the basic 2002 survey, enumeration districts were selected with probability proportional to the number of households, so that enumeration districts with a larger number of households had a higher probability of selection. In the repeated survey in 2003, first-stage units (census block units) were selected from the basic sample obtained in 2002 by including only even-numbered census block units. In practice this meant that every second census block unit from the previous survey was included in the sample. In each selected enumeration district, the same households interviewed in the previous round were included and interviewed. On finishing the survey in 2003, the cases were merged at the level of both households and members.
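The first-stage selection and the 2003 panel subsample can be sketched as follows. This is an illustration only: the description does not say which probability-proportional-to-size procedure was used, so systematic selection on cumulative household counts is assumed here, and the district identifiers and sizes are hypothetical. The second stage (choosing households within a district) is not shown.

```python
import random


def pps_systematic_sample(units, n, rng):
    """Select n first-stage units with probability proportional to size.

    units is a list of (unit_id, household_count) pairs. A unit larger than
    the sampling step could be selected more than once.
    """
    total = sum(size for _, size in units)
    step = total / n
    start = rng.random() * step
    targets = [start + i * step for i in range(n)]
    selected = []
    cumulative = 0
    index = 0
    for unit_id, size in units:
        cumulative += size
        # Take the unit once for every target falling inside its size interval.
        while index < n and targets[index] < cumulative:
            selected.append(unit_id)
            index += 1
    return selected


def panel_subsample(first_stage_units):
    """Keep every second unit (the even-numbered ones) for the repeated survey."""
    return first_stage_units[1::2]


rng = random.Random(2002)  # fixed seed so the sketch is repeatable
districts = [(f"ED{i:02d}", 50 + 10 * i) for i in range(20)]  # hypothetical sizes
sample_2002 = pps_systematic_sample(districts, 6, rng)
panel_2003 = panel_subsample(sample_2002)
print(sample_2002)
print(panel_2003)
```

Because the panel keeps every second unit of an already stratified sample, the allocation across strata stays the same, which is the property the design relies on.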
Stratification: Municipalities are stratified into the following six territorial strata: Vojvodina, Belgrade, Western Serbia, Central Serbia (Šumadija and Pomoravlje), Eastern Serbia and South-east Serbia. Primary units of selection are further stratified into enumeration districts which belong to urban type of settlements and enumeration districts which belong to rural type of settlement.
The sample of Family Income Support recipients consisted of cases chosen randomly from the official list of recipients provided by the Ministry of Social Affairs. The sample of Roma from Roma settlements was, as in the national survey, a two-stage stratified sample, but the first-stage units were settlements where the Roma population made up more than 7% of residents, and the second-stage units were Roma households. Settlements were stratified into three territorial strata: Vojvodina, Belgrade and Central Serbia.
Face-to-face [f2f]
In all surveys the same questionnaire with minimal changes was used. It included different modules, topically separate areas which had an aim of perceiving the living standard of households from different angles. Topic areas were the following:
1. Roster with demography.
2. Housing conditions and durables module with information on the age of durables owned by a household with a special block focused on collecting information on energy billing, payments, and usage.
3. Diary of food expenditures (weekly), including home production, gifts and transfers in kind.
4. Questionnaire of main expenditure-based recall periods sufficient to enable construction of annual consumption at the household level, including home production, gifts and transfers in kind.
5. Agricultural production for all households which cultivate 10+ acres of land or who breed cattle.
6. Participation and social transfers module with detailed breakdown by programs.
7. Labour Market module in line with a simplified version of the Labour Force Survey (LFS), with special additional questions to capture various informal sector activities, and providing information on earnings.
8. Health with a focus on utilization of services and expenditures (including informal payments).
9. Education module, which incorporated pre-school, compulsory primary education, secondary education and university education.
10. Special income block, focusing on sources of income not covered in other parts (with a focus on remittances).
During field work, interviewers kept a precise diary of interviews, recording both successful and unsuccessful visits. Particular attention was paid to reasons why some households were not interviewed. Separate marks were given for households which were not interviewed due to refusal and for cases when a given household could not be found on the territory of the chosen census block.
In 2002 a total of 7,491 households were contacted. Of this number a total of 6,386 households in 621 census rounds were interviewed. Interviewers did not manage to collect the data for 1,106 or 14.8% of selected households. Out of this number 634 households
The Ouagadougou Health and Demographic Surveillance System (Ouagadougou HDSS), located in five neighborhoods at the northern periphery of the capital of Burkina Faso, was established in 2008. Data on vital events (births, deaths, unions, migration events) are collected during household visits that have taken place every 10 months.
The areas were selected to contrast informal neighborhoods (40,000 residents) with formal areas (40,000 residents), with the aims of understanding the problems of the urban poor, and testing innovative programs that promote the well-being of this population. People living in informal areas tend to be marginalized in several ways: they are younger, poorer, less educated, farther from public services and more often migrants. Half of the residents live in the Sanitary District of Kossodo and the other half in the District of Sig-Nonghin.
The Ouaga HDSS has been used to study health inequalities, conduct a surveillance of typhoid fever, measure water quality in informal areas, study the link between fertility and school investments, test a non-governmental organization (NGO)-led program of poverty alleviation and test a community-led targeting of the poor eligible for benefits in the urban context. Key informants help maintain a good rapport with the community.
The areas researchers follow consist of 55 census tracts divided into 494 blocks. Researchers mapped all the census tracts and blocks using fieldworkers with handheld global positioning system (GPS) receivers and ArcGIS. During a first census (October 2008 to March 2009), the demographic surveillance system was explained to every head of household and a consent form was signed; during subsequent censuses, new households were enrolled in the same way.
Ouagadougou is the capital city of Burkina Faso and lies at the centre of this country, located in the middle of West Africa (12° North of the Equator and 1° West of the Prime Meridian).
Individual
Resident household members of households resident within the demographic surveillance area. Inmigrants (visitors) are defined by intention to become resident, but actual residence episodes of less than six months (180 days) are censored. Outmigrants are defined by intention to become resident elsewhere, but actual periods of non-residence less than six months (180 days) are censored. Children born to resident women are considered resident by default, irrespective of actual place of birth. The dataset contains the events of all individuals ever residents during the study period (03 Oct. 2009 to 31 Dec. 2014).
Event history data
This dataset contains rounds 0 to 7 of demographic surveillance data covering the period from 07 Oct. 2008 to 31 December 2014.
This dataset is not based on a sample, it contains information from the complete demographic surveillance area of Ouagadougou in Burkina Faso.
Response units (households) by Round:
Round Households
2008 4941
2009 19159
2010 21168
2011 12548
2012 24174
2013 22326
None
Proxy Respondent [proxy]
List of questionnaires:
Collective Housing Unit (UCH) Survey Form - Used to register characteristics of the house - Used to register sanitation installations - All houses registered as of the previous round are preloaded on the PDA or tablet.
Household registration (HHR) or update (HHU) Form - Used to register characteristics of the household - Used to update information about the composition of the household - All households registered as of previous rounds are preloaded on the PDA or tablet.
Household Membership Registration (HMR) or update (HMU) - Used to link individuals to households - Used to update information about household memberships and member status observations - All member status observations as of previous rounds are preloaded on the PDA or tablet.
Presences registration form (PDR) - Used to uniquely identify the presence of each individual in the household and to identify new individuals in the household - Mainly to ensure members with multiple household memberships are appropriately captured - All presence observations as of previous rounds are preloaded on the PDA or tablet.
Visitor registration form (VDR) - Used to register the characteristics of a new individual in the household - Used to capture internal migration - Uses a matching form to facilitate pairing of migrations
Out Migration notification form (MGN) - Used to record changes in the residency status of individuals or households - Migrants are tracked and updated in the database
Pregnancy history form (PGH) & pregnancy outcome notification form (PON) - Records details of pregnancies and their outcomes - Only if woman is a new member - Only if woman has never completed WHL or WGH - All member pregnancies without a pregnancy outcome as of previous rounds are preloaded on the PDA or tablet.
Death notification form (DTN) - Records all deaths that have recently occurred - Includes information about time, place, circumstances and possible cause of death
Updated Basic information Form (UBIF) - Used to change an individual's basic information
Health questionnaire (adults, women, child, elder) - Family planning - Chronic illnesses - Violence and accident - Mental health - Nutrition, alcohol, tobacco - Access to health services - Anthropometric measures - Physical limitations - Self-rated health - Food security
Variability of climate and water accessibility - accessibility to water - child health outcomes - gender outcomes - data on rainfall, temperatures, water quality
The data collection system is composed of two databases: - A temporary database, which contains data collected and transferred each day during the round. - A reference database, which contains all data of the Ouagadougou Health and Demographic Surveillance System, and into which the data of the temporary database are transferred at the end of each round. The temporary database is emptied at the end of the round, ready for a new round.
The data processing takes place in two ways:
1) When data are collected with PDAs or tablets and transferred by Wi-Fi, data consistency and plausibility are controlled by verification rules in the mobile application and in the database. In addition to these verifications, the data from the temporary database undergo validation. This validation is performed each week and produces a validation report for the data collection team. After the validation, if an error is due to a mistake in data collection, the field worker, equipped with a PDA or tablet, goes back to the field to revisit the household and correct the error. Once the correction is made, the field worker transfers the data again to the server through the wireless access points. If the error is due to data inconsistencies that might not be directly related to a data collection mistake, the case is referred to the scientific team of the main database, which can either resolve the inconsistency directly in the database or carry out a thorough investigation with supervisors in order to correct the error.
2) At the end of the round, the data from the temporary database are automatically transferred into the reference database by a transfer program. Once this transfer has succeeded, further validation is performed on the data in the database to ensure data consistency and plausibility. This again produces a validation report for the data collection team, and the same error correction process is followed.
Household response rates are as follows (assuming that if a household has not responded for 2 years following the last recorded visit to that household, that the household is lost to follow-up and no longer part of the response rate denominator):
Year Response Rate
2008 100%
2009 100%
2010 100%
2011 98%
2012 100%
2013 95%
Not applicable
CentreId MetricTable QMetric Illegal Legal Total Metric RunDate
BF041 MicroDataCleaned Starts 151624 2017-05-16 13:36
BF041 MicroDataCleaned Transitions 0 314778 314778 0 2017-05-16 13:36
BF041 MicroDataCleaned Ends 151624 2017-05-16 13:36
BF041 MicroDataCleaned SexValues 314778 2017-05-16 13:36
BF041 MicroDataCleaned DoBValues 314778 2017-05-16 13:36
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
List of rural municipalities within the meaning of “Eligibility to the GIP”, a global allocation of equipment paid to the department of Saône and Loire. Prefectural Order No. 2017103-001 of 13 April 2017.
Article D3334-8-1 of the General Code of Local and Regional Authorities: The following municipalities in metropolitan France are considered to be rural municipalities for the purposes of Articles L. 3334-10 and R. 3334-8:
- municipalities whose population does not exceed 2 000 inhabitants;
- municipalities whose population exceeds 2 000 inhabitants and does not exceed 5 000 inhabitants, if they do not belong to an urban unit or if they belong to an urban unit whose population does not exceed 5 000 inhabitants.
The urban reference unit is that defined by the National Institute of Statistics and Economic Studies. The population taken into account is the total population authenticated at the end of the population census.
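The eligibility rule quoted from Article D3334-8-1 can be written as a small Python function. This is a sketch of the rule as stated above, not an official implementation; the function and parameter names are hypothetical, and a municipality outside any urban unit is represented by passing None.

```python
from typing import Optional


def is_rural_municipality(population: int,
                          urban_unit_population: Optional[int] = None) -> bool:
    """Apply the two population tests quoted in the article above.

    urban_unit_population is None when the municipality does not belong
    to an urban unit.
    """
    if population <= 2000:
        return True
    if population <= 5000:
        return urban_unit_population is None or urban_unit_population <= 5000
    return False


print(is_rural_municipality(1500))          # small municipality
print(is_rural_municipality(3500))          # mid-sized, outside any urban unit
print(is_rural_municipality(3500, 12000))   # mid-sized, inside a large urban unit
```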
The Health and Demographic Surveillance System (HDSS) in Niakhar, a rural area of Senegal, is located 135 km east of Dakar. This HDSS was set up in 1962 by the Institut de Recherche pour le Développement (IRD) to address the shortcomings of the civil registration system and provide demographic indicators.
Some 65 villages were followed annually in the Niakhar area from 1962 to 1969. The study zone was reduced to eight villages from 1969 to 1983, and from then on the HDSS was extended to include 22 other villages, covering a total of 30 villages for a population estimated at 45,000 in December 2013. Thus 8 villages have been under demographic surveillance for almost 50 years and 30 villages for 30 years.
Vital events, migrations, marital changes, pregnancies, and immunizations are routinely recorded (every four months). The database also includes epidemiological, economic and environmental information coming from specific surveys. Data were collected through annual rounds from 1962 to 1987; rounds became weekly from 1987 to 1997; routine visits were conducted every three months between 1997 and 2007 and every four months since then.
The current objectives are 1) to obtain a long-term assessment of demographic and socio-economic indicators necessary for bio-medical and social sciences research, 2) to keep up epidemiological and environmental monitoring, 3) to provide a research platform for clinical and interdisciplinary research (medical, social and environmental sciences). Research projects during the last 5 years are listed in Table 2. The Niakhar HDSS has institutional affiliation with the Institut de Recherche pour le Développement (IRD, formerly ORSTOM).
The study zone of Niakhar is located in Senegal, at 14.5°N latitude and 16.5°W longitude, in the department of Fatick (Sine-Saloum), 135 km east of Dakar. The Niakhar study zone covers 203 square kilometres and is located in the continental Sahelian-Sudanese climatic zone. For thirty years the region has suffered from drought. The average annual rainfall decreased from 800 mm in the 1950s to 500 mm in the 1980s. Increasing amounts of precipitation have been observed since the mid-2000s, with an average annual rainfall of 600 mm between 2005 and 2010.
Individual
Members of households residing within the demographic surveillance area. Inmigrants are defined by intention to become resident, but actual residence episodes of less than 180 days are censored. Outmigrants are defined by intention to become resident elsewhere, but actual periods of non-residence of less than 180 days are censored, except for seasonal work migrants, workers with a resident wife, and pupils or students. Children born to resident women are considered resident by default, irrespective of actual place of birth. The dataset contains the events of all individuals ever resident during the study period (1 Jan 1990 to 31 Dec 2013).
The Niakhar HDSS collects the following basic data for each resident: individual, household and compound identifying information, mother and father identification, relationship to the head of household and spousal relationship. From 1983 to 2007, the HDSS routinely monitored deaths, pregnancies, births, miscarriages, stillbirths, weaning, migrations, changes of marital status, immunizations, and cases of measles and whooping cough. For the last 5 years, the HDSS has recorded only demographic events related to each resident, including cause of death. Verbal autopsies have been conducted after all deaths, except for those that occurred between 1999 and 2004, when only deaths of people aged 0-55 years were investigated. The Niakhar HDSS also registers visitors, as well as all the demographic events related to them in case of in-migration. Household characteristics (living conditions, domestic equipment, etc.) were collected in 1998 and 2003, and community equipment (schools, boreholes, etc.) in 2003. Economic and environmental data will be collected in 2013. Table 3 presents further details on the data items collected. The Niakhar HDSS interviewers collect data with tablet PCs that are loaded with the last updated database, linked to a user-friendly interface showing the household members and the questionnaire. Daily backups are performed on an external hard drive and weekly synchronizations are scheduled during the round, helping to update the database and check data consistency (e.g. residential moves within the study area or marriages). Applications are developed in Visual Basic .NET and the database is managed with Microsoft Access.
Event history data
This dataset contains rounds 1 to 18 of demographic surveillance data covering the period from 1 Jan 1983 to 31 December 2015.
From 1983 to 1987, data were collected through annual rounds during the dry season. Demographic events were collected by interviewers using a printed list of compound residents with their characteristics. From 1987 to 1997, rounds became weekly because of the need for continuous birth registration for vaccine trials. Annual censuses were carried out to check data collection, particularly with respect to in- and out-migration. Routine visits were conducted in the 30 villages of the study area every three months between 1997 and 2007, every four months between 2008 and 2012, and every six months since then.
This dataset is not based on a sample; it contains information from the complete demographic surveillance area.
None
Proxy Respondent [proxy]
List of questionnaires:
Compound Registration or update Form
Household Registration or update Form
Household Membership Registration or update Form
External Migration Registration Form
Internal Migration Registration Form
Individual Registration Form
Birth Registration Form
Death Registration Form
On data entry, data consistency and plausibility were checked by 455 data validation rules at database level. If a data validation failure was due to a data collection error, the questionnaire was referred back to the field for revisit and correction. If the error was due to data inconsistencies that could not be directly traced to a data collection error, the record was referred to the data quality team under the supervision of the senior database scientist. This team could request further field-level investigation by a team of trackers or could correct the inconsistency directly at database level.
No imputations were done on the resulting micro data set, except for:
a. If an out-migration (OMG) event is followed by a homestead entry event (ENT) and the gap between the OMG event and the ENT event is greater than 180 days, the ENT event was changed to an in-migration event (IMG).
b. If an out-migration (OMG) event is followed by a homestead entry event (ENT) and the gap between the OMG event and the ENT event is less than 180 days, the OMG event was changed to a homestead exit event (EXT) and the ENT event date changed to the day following the original OMG event.
c. If a homestead exit event (EXT) is followed by an in-migration event (IMG) and the gap between the EXT event and the IMG event is greater than 180 days, the EXT event was changed to an out-migration event (OMG).
d. If a homestead exit event (EXT) is followed by an in-migration event (IMG) and the gap between the EXT event and the IMG event is less than 180 days, the IMG event was changed to a homestead entry event (ENT) with a date equal to the day following the EXT event.
e. If the last recorded event for an individual is a homestead exit (EXT) and this event is more than 180 days prior to the end of the surveillance period, then the EXT event was changed to an out-migration event (OMG).
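Rules a to e can be sketched in Python for a single individual's date-ordered event list. This is an illustration under stated assumptions, not the HDSS's own code: the dictionary layout of an event is hypothetical, and because the rules do not say what happens when the gap is exactly 180 days, this sketch treats that case like a gap of less than 180 days.

```python
from datetime import date, timedelta

THRESHOLD = timedelta(days=180)
ONE_DAY = timedelta(days=1)


def apply_rules(events, surveillance_end):
    """Return a corrected copy of one individual's date-ordered events."""
    fixed = [dict(e) for e in events]
    for prev, nxt in zip(fixed, fixed[1:]):
        gap = nxt["date"] - prev["date"]
        pair = (prev["type"], nxt["type"])
        if pair == ("OMG", "ENT"):
            if gap > THRESHOLD:
                nxt["type"] = "IMG"                  # rule a
            else:
                prev["type"] = "EXT"                 # rule b
                nxt["date"] = prev["date"] + ONE_DAY
        elif pair == ("EXT", "IMG"):
            if gap > THRESHOLD:
                prev["type"] = "OMG"                 # rule c
            else:
                nxt["type"] = "ENT"                  # rule d
                nxt["date"] = prev["date"] + ONE_DAY
    last = fixed[-1] if fixed else None
    if last and last["type"] == "EXT" and surveillance_end - last["date"] > THRESHOLD:
        last["type"] = "OMG"                         # rule e
    return fixed


# Hypothetical history: a short absence (rule b) and a final exit (rule e).
history = [
    {"type": "ENU", "date": date(2009, 1, 1)},   # enumeration
    {"type": "OMG", "date": date(2010, 1, 1)},
    {"type": "ENT", "date": date(2010, 3, 1)},   # 59 days after the OMG
    {"type": "EXT", "date": date(2011, 1, 1)},
]
corrected = apply_rules(history, surveillance_end=date(2013, 12, 31))
corrected_types = [e["type"] for e in corrected]
print(corrected_types)
print(corrected[2]["date"])

# Rule a: a long gap turns the homestead entry into an in-migration.
long_gap = apply_rules(
    [{"type": "OMG", "date": date(2010, 1, 1)},
     {"type": "ENT", "date": date(2011, 1, 1)}],
    surveillance_end=date(2013, 12, 31),
)
```

The input list is copied before editing, so the original event history stays available for comparison with the corrected one.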
In the case of the village that was added (enumerated) in 2006, some individuals may have outmigrated from the original surveillance area and settled in the new village prior to the first enumeration. Where the records of such individuals have been linked, an individual can legitimately have an out-migration event (OMG) followed by an enumeration event (ENU). In a few cases a homestead exit event (EXT) was followed by an enumeration event. In these instances the EXT events were changed to out-migration events (OMG).
On average, the response rate is about 99% for each round over the years.
Not Applicable
CentreId MetricTable QMetric Illegal Legal Total Metric RunDate
SN013 MicroDataCleaned Starts 86883 2017-05-19 15:12
SN013 MicroDataCleaned Transitions 241970 241970 0 2017-05-19 15:12
SN013 MicroDataCleaned Ends 86883 2017-05-19 15:12
SN013 MicroDataCleaned SexValues 32 241938 241970 0 2017-05-19 15:12
SN013 MicroDataCleaned DoBValues 241970 2017-05-19 15:12
https://datafinder.stats.govt.nz/license/attribution-4-0-international/
Dataset contains counts and measures for individuals from the 2013, 2018, and 2023 Censuses. Data is available by statistical area 1.
The variables included in this dataset are for the census usually resident population count (unless otherwise stated). All data is for level 1 of the classification (unless otherwise stated).
The variables for part 1 of the dataset are:
Download lookup file for part 1 from Stats NZ ArcGIS Online or embedded attachment in Stats NZ geographic data service. Download data table (excluding the geometry column for CSV files) using the instructions in the Koordinates help guide.
Footnotes
Te Whata
Under the Mana Ōrite Relationship Agreement, Te Kāhui Raraunga (TKR) will be publishing Māori descent and iwi affiliation data from the 2023 Census in partnership with Stats NZ. This will be available on Te Whata, a TKR platform.
Geographical boundaries
Statistical standard for geographic areas 2023 (updated December 2023) has information about geographic boundaries as of 1 January 2023. Address data from 2013 and 2018 Censuses was updated to be consistent with the 2023 areas. Due to the changes in area boundaries and coding methodologies, 2013 and 2018 counts published in 2023 may be slightly different to those published in 2013 or 2018.
Subnational census usually resident population
The census usually resident population count of an area (subnational count) is a count of all people who usually live in that area and were present in New Zealand on census night. It excludes visitors from overseas, visitors from elsewhere in New Zealand, and residents temporarily overseas on census night. For example, a person who usually lives in Christchurch city and is visiting Wellington city on census night will be included in the census usually resident population count of Christchurch city.
Population counts
Stats NZ publishes a number of different population counts, each using a different definition and methodology. Population statistics – user guide has more information about different counts.
Caution using time series
Time series data should be interpreted with care due to changes in census methodology and differences in response rates between censuses. The 2023 and 2018 Censuses used a combined census methodology (using census responses and administrative data), while the 2013 Census used a full-field enumeration methodology (with no use of administrative data).
Study participation time series
In the 2013 Census study participation was only collected for the census usually resident population count aged 15 years and over.
About the 2023 Census dataset
For information on the 2023 dataset see Using a combined census model for the 2023 Census. We combined data from the census forms with administrative data to create the 2023 Census dataset, which meets Stats NZ's quality criteria for population structure information. Where we were confident that people who hadn’t completed a census form should be counted, we added real data about those real people to the dataset (this is known as admin enumeration). We also used data from the 2018 and 2013 Censuses, administrative data sources, and statistical imputation methods to fill in some missing characteristics of people and dwellings.
Data quality
The quality of data in the 2023 Census is assessed using the quality rating scale and the quality assurance framework to determine whether data is fit for purpose and suitable for release. Data quality assurance in the 2023 Census has more information.
Concept descriptions and quality ratings
Data quality ratings for 2023 Census variables has additional details about variables found within totals by topic, for example, definitions and data quality.
Disability indicator
This data should not be used as an official measure of disability prevalence. Disability prevalence estimates are only available from the 2023 Household Disability Survey. Household Disability Survey 2023: Final content has more information about the survey.
Activity limitations are measured using the Washington Group Short Set (WGSS). The WGSS asks about six basic activities that a person might have difficulty with: seeing, hearing, walking or climbing stairs, remembering or concentrating, washing all over or dressing, and communicating. A person was classified as disabled in the 2023 Census if there was at least one of these activities that they had a lot of difficulty with or could not do at all.
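The classification rule described above can be written as a short function. This is a sketch only: the domain keys and the response labels are assumed spellings of the WGSS answer categories, and `is_disabled` is a hypothetical name rather than anything published by Stats NZ.

```python
# Six WGSS activity domains, abbreviated from the description above.
WGSS_DOMAINS = (
    "seeing", "hearing", "walking", "remembering", "washing", "communicating",
)

# Response categories that count towards the disability indicator
# (assumed labels for "a lot of difficulty" and "could not do at all").
SEVERE_RESPONSES = {"a lot of difficulty", "cannot do at all"}


def is_disabled(responses):
    """Return True if at least one domain has a severe response.

    `responses` maps a domain name to the person's answer for that domain.
    Missing domains are treated as not severe in this sketch.
    """
    return any(responses.get(domain) in SEVERE_RESPONSES for domain in WGSS_DOMAINS)


person = {"seeing": "some difficulty", "walking": "a lot of difficulty"}
print(is_disabled(person))  # True: one domain has "a lot of difficulty"
```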
Using data for good
Stats NZ expects that census data is worked with for a positive purpose, as outlined in the Māori Data Governance Model (Data Iwi Leaders Group, 2023). This model states that “data should support transformative outcomes and should uplift and strengthen our relationships with each other and with our environments. The avoidance of harm is the minimum expectation for data use. Māori data should also contribute to iwi and hapū tino rangatiratanga”.
Confidentiality
The 2023 Census confidentiality rules have been applied to 2013, 2018, and 2023 data. These rules protect the confidentiality of individuals, families, households, dwellings, and undertakings in 2023 Census data. Counts are calculated using fixed random rounding to base 3 (FRR3) and suppression of ‘sensitive’ counts less than six, where tables report multiple geographic variables and/or small populations. Individual figures may not always sum to stated totals. Applying confidentiality rules to 2023 Census data and summary of changes since 2018 and 2013 Censuses has more information about 2023 Census confidentiality rules.
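To illustrate the idea behind fixed random rounding to base 3, here is a minimal sketch. It assumes the conventional unbiased scheme for random rounding (a count that is not a multiple of 3 moves to the nearer multiple with probability 2/3 and to the farther one with probability 1/3). It makes the result "fixed" by deriving the random draw from a hash of a cell identifier. The hash-based draw and the `cell_key` parameter are assumptions for illustration; the mechanism Stats NZ actually uses is described in its confidentiality documentation, not here.

```python
import hashlib


def frr3(count, cell_key):
    """Round a non-negative count to base 3, reproducibly for a given cell."""
    remainder = count % 3
    if remainder == 0:
        return count  # multiples of 3 are left unchanged
    lower = count - remainder
    upper = lower + 3
    # The nearer multiple gets probability 2/3, the farther one 1/3.
    nearer, farther = (lower, upper) if remainder == 1 else (upper, lower)
    # Deterministic draw in [0, 1) from the cell key and count (assumption).
    digest = hashlib.sha256(f"{cell_key}:{count}".encode()).digest()
    draw = int.from_bytes(digest[:8], "big") / 2**64
    return nearer if draw < 2 / 3 else farther


# The same cell always rounds the same way, and the result is a multiple of 3.
rounded = frr3(10, cell_key="SA1-example|age-group-3")
print(rounded, rounded == frr3(10, cell_key="SA1-example|age-group-3"))
```

Because each cell is rounded independently, rounded components need not add up to the rounded total, which is why individual figures may not sum to stated totals.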
Measures
Measures like averages, medians, and other quantiles are calculated from unrounded counts, with input noise added to or subtracted from each contributing value during measures calculations. Averages and medians based on less than six units (e.g. individuals, dwellings, households, families, or extended families) are suppressed. This suppression threshold changes for other quantiles. Where the cells have been suppressed, a placeholder value has been used.
Percentages
To calculate percentages, divide the figure for the category of interest by the figure for 'Total stated' where this applies.
Symbol
-997 Not available
-999 Confidential
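When calculating percentages from a downloaded table, the placeholder codes above need to be treated as missing rather than as counts. A minimal sketch, in which the function name and the choice to return `None` for missing values are assumptions:

```python
NOT_AVAILABLE = -997  # symbol: not available
CONFIDENTIAL = -999   # symbol: confidential (suppressed)
PLACEHOLDERS = {NOT_AVAILABLE, CONFIDENTIAL}


def percentage(category_count, total_stated):
    """Percentage of 'Total stated', or None if it cannot be calculated."""
    if category_count in PLACEHOLDERS or total_stated in PLACEHOLDERS:
        return None  # do not treat placeholder codes as real counts
    if total_stated == 0:
        return None  # avoid division by zero for empty areas
    return 100.0 * category_count / total_stated


print(percentage(30, 120))            # 25.0
print(percentage(CONFIDENTIAL, 120))  # None
```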
Inconsistencies in definitions
Please note that there may be differences in definitions between census classifications and those used for other data collections.
https://www.usa.gov/government-works
The Current Population Survey Civic Engagement and Volunteering (CEV) Supplement is the most robust longitudinal survey about volunteerism and other forms of civic engagement in the United States. Produced by AmeriCorps in partnership with the U.S. Census Bureau, the CEV takes the pulse of our nation’s civic health every two years. The data on this page was collected in September 2023. The next wave of the CEV will be administered in September 2025.
The CEV can generate reliable estimates at the national level, within states and the District of Columbia, and in the largest twelve Metropolitan Statistical Areas to support evidence-based decision making and efforts to understand how people make a difference in communities across the country.
Click on "Export" to download and review an excerpt from the 2023 CEV Analytic Codebook that shows the variables available in the analytic CEV datasets produced by AmeriCorps.
Click on "Show More" to download and review the following 2023 CEV data and resources provided as attachments:
1) 2023 CEV Dataset Fact Sheet – brief summary of technical aspects of the 2023 CEV dataset.
2) CEV FAQs – answers to frequently asked technical questions about the CEV.
3) Constructs and measures in the CEV.
4) 2023 CEV Analytic Data and Setup Files – analytic dataset in Stata (.dta), R (.rdata), SPSS (.sav), and Excel (.csv) formats, codebook for analytic dataset, and Stata code (.do) to convert raw dataset to analytic formatting produced by AmeriCorps. These files were updated on January 16, 2025 to correct erroneous missing values for the ssupwgt variable.
5) 2023 CEV Technical Documentation – codebook for raw dataset and full supplement documentation produced by U.S. Census Bureau.
6) 2023 CEV Raw Data and Read In Files – raw dataset in Stata (.dta) format, Stata code (.do) and dictionary file (.dct) to read ASCII dataset (.dat) into Stata using layout files (.lis).
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Open Science in (Higher) Education – data of the February 2017 survey
This data set contains:
Survey structure
The survey includes 24 questions, and its structure can be separated into five major themes: material used in courses (5), OER awareness, usage and development (6), collaborative tools used in courses (2), assessment and participation options (5), demographics (4). The last two questions are an open-text question about general issues on the topics and singular open education experiences, and a request to provide the respondent’s e-mail address for follow-up questions. The online survey was created with Limesurvey[1]. Several questions include filters, i.e. these questions were only shown if a participant chose a specific answer beforehand ([n/a] in Excel file, [.] in SPSS).
Demographic questions
Demographic questions asked about the current position, the discipline, birth year and gender. The classification of research disciplines was adapted to general disciplines at German higher education institutions. As we wanted to have a broad classification, we summarised several disciplines and came up with the following list, including the option “other” for respondents who do not feel confident with the proposed classification:
The current job position classification was also chosen according to common positions in Germany, including positions with a teaching responsibility at higher education institutions. Here, we also included the option “other” for respondents who do not feel confident with the proposed classification:
We chose a free-text (numerical) field for the respondent’s year of birth because we did not want to pre-classify respondents into age intervals. This leaves open different analyses of the answers and of possible correlations with respondents’ age. We did not ask about the country, as the survey was designed for academics in Germany.
Remark on OER question
Data from earlier surveys revealed that academics are confused about the proper definition of OER[2]. Some seem to understand OER as free resources, or only refer to open source software (Allen & Seaman, 2016, p. 11). Allen and Seaman (2016) decided to give a broad explanation of OER, avoiding details to not tempt the participant to claim “aware”. Thus, there is a danger of introducing a bias when giving an explanation. We decided not to give an explanation, but to keep this question simple. We assume that either someone knows about OER or not. If they had not heard of the term before, they probably do not use OER (at least not consciously) or create them.
Data collection
The target group of the survey was academics at German institutions of higher education, mainly universities and universities of applied sciences. To reach them, we sent the survey to various internal and external institutional mailing lists and via personal contacts. Included lists were discipline-based lists, lists deriving from higher education and higher education didactic communities, as well as lists from open science and OER communities. Additionally, personal e-mails were sent to presidents and contact persons from those communities, and Twitter was used to spread the survey.
The survey was online from Feb 6th to March 3rd 2017. E-mails were mainly sent at the beginning and around mid-term.
Data clearance
We received 360 responses, of which Limesurvey counted 208 as complete and 152 as incomplete. Two responses were marked as incomplete but turned out to be complete after checking, and we added them to the complete responses dataset. Thus, this data set includes 210 complete responses. Of the remaining 150 incomplete responses, 58 respondents did not answer the 1st question and 40 respondents discontinued after the 1st question. The data show a steady decline in responses over the course of the survey; we did not detect any particular question with a strikingly high dropout rate. We deleted the incomplete responses, and they are not in this data set.
For data privacy reasons, we deleted seven variables automatically assigned by Limesurvey: submitdate, lastpage, startlanguage, startdate, datestamp, ipaddr, refurl. We also deleted answers to question No 24 (email address).
References
Allen, E., & Seaman, J. (2016). Opening the Textbook: Educational Resources in U.S. Higher Education, 2015-16.
First results of the survey are presented in the poster:
Heck, Tamara, Blümel, Ina, Heller, Lambert, Mazarakis, Athanasios, Peters, Isabella, Scherp, Ansgar, & Weisel, Luzian. (2017). Survey: Open Science in Higher Education. Zenodo. http://doi.org/10.5281/zenodo.400561
Contact:
Open Science in (Higher) Education working group, see http://www.leibniz-science20.de/forschung/projekte/laufende-projekte/open-science-in-higher-education/.
[1] https://www.limesurvey.org
[2] The survey question about the awareness of OER gave a broad explanation, avoiding details to not tempt the participant to claim “aware”.
The 2022 Philippines National Demographic and Health Survey (NDHS) was implemented by the Philippine Statistics Authority (PSA). Data collection took place from May 2 to June 22, 2022.
The primary objective of the 2022 NDHS is to provide up-to-date estimates of basic demographic and health indicators. Specifically, the NDHS collected information on fertility, fertility preferences, family planning practices, childhood mortality, maternal and child health, nutrition, knowledge and attitudes regarding HIV/AIDS, violence against women, child discipline, early childhood development, and other health issues.
The information collected through the NDHS is intended to assist policymakers and program managers in designing and evaluating programs and strategies for improving the health of the country’s population. The 2022 NDHS also provides indicators anchored to the attainment of the Sustainable Development Goals (SDGs) and the new Philippine Development Plan for 2023 to 2028.
National coverage
The survey covered all de jure household members (usual residents), all women aged 15-49, and all children aged 0-4 resident in the household.
Sample survey data [ssd]
The sampling scheme provides data representative of the country as a whole, for urban and rural areas separately, and for each of the country’s administrative regions. The sample selection methodology for the 2022 NDHS was based on a two-stage stratified sample design using the Master Sample Frame (MSF) designed and compiled by the PSA. The MSF was constructed based on the listing of households from the 2010 Census of Population and Housing and updated based on the listing of households from the 2015 Census of Population. The first stage involved a systematic selection of 1,247 primary sampling units (PSUs) distributed by province or HUC. A PSU can be a barangay, a portion of a large barangay, or two or more adjacent small barangays.
In the second stage, an equal take of either 22 or 29 sample housing units was selected from each sampled PSU using systematic random sampling. In situations where a housing unit contained one to three households, all households were interviewed. In the rare situation where a housing unit contained more than three households, no more than three households were interviewed. The survey interviewers were instructed to interview only the preselected housing units. No replacements and no changes of the preselected housing units were allowed in the implementing stage in order to prevent bias. Survey weights were calculated, added to the data file, and applied so that weighted results are representative estimates of indicators at the regional and national levels.
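Systematic random sampling, as used for the second-stage selection described above, can be sketched as follows. This is a generic illustration of the technique with a fractional sampling interval, not the PSA's selection program. The function name, the `rng` parameter and the example list of housing units are hypothetical.

```python
import random


def systematic_sample(units, n, rng=None):
    """Select n units from an ordered list with a random start and fixed interval."""
    rng = rng or random.Random()
    total = len(units)
    if not 0 < n <= total:
        raise ValueError("n must be between 1 and the number of units")
    interval = total / n              # fractional sampling interval
    start = rng.random() * interval   # random start in [0, interval)
    # Take the unit at every interval step; clamp guards against float rounding.
    positions = (int(start + k * interval) for k in range(n))
    return [units[min(p, total - 1)] for p in positions]


# Example: an equal take of 22 housing units from a PSU listing of 150 units.
housing_units = [f"HU-{i:03d}" for i in range(1, 151)]
sample = systematic_sample(housing_units, 22, rng=random.Random(2022))
```

Because the interval is at least one unit, each step lands on a different housing unit, so the sample contains no repeats and is spread evenly across the listing.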
All women age 15–49 who were either usual residents of the selected households or visitors who stayed in the households the night before the survey were eligible to be interviewed. Among women eligible for an individual interview, one woman per household was selected for a module on women’s safety.
For further details on sample design, see APPENDIX A of the final report.
Computer Assisted Personal Interview [capi]
Two questionnaires were used for the 2022 NDHS: the Household Questionnaire and the Woman’s Questionnaire. The questionnaires, based on The DHS Program’s model questionnaires, were adapted to reflect the population and health issues relevant to the Philippines. Input was solicited from various stakeholders representing government agencies, academe, and international agencies. The survey protocol was reviewed by the ICF Institutional Review Board.
After all questionnaires were finalized in English, they were translated into six major languages: Tagalog, Cebuano, Ilocano, Bikol, Hiligaynon, and Waray. The Household and Woman’s Questionnaires were programmed into tablet computers to allow for computer-assisted personal interviewing (CAPI) for data collection purposes, with the capability to choose any of the languages for each questionnaire.
Processing the 2022 NDHS data began almost as soon as fieldwork started, and data security procedures were in place in accordance with confidentiality of information as provided by Philippine laws. As data collection was completed in each PSU or cluster, all electronic data files were transferred securely via SyncCloud to a server maintained by the PSA Central Office in Quezon City. These data files were registered and checked for inconsistencies, incompleteness, and outliers. The field teams were alerted to any inconsistencies and errors while still in the area of assignment. Timely generation of field check tables allowed for effective monitoring of fieldwork, including tracking questionnaire completion rates. Only the field teams, project managers, and NDHS supervisors in the provincial, regional, and central offices were given access to the CAPI system and the SyncCloud server.
A team of secondary editors in the PSA Central Office carried out secondary editing, which involved resolving inconsistencies and recoding “other” responses; the former was conducted during data collection, and the latter was conducted following the completion of the fieldwork. Data editing was performed using the CSPro software package. The secondary editing of the data was completed in August 2022. The final cleaning of the data set was carried out by data processing specialists from The DHS Program in September 2022.
A total of 35,470 households were selected for the 2022 NDHS sample, of which 30,621 were found to be occupied. Of the occupied households, 30,372 were successfully interviewed, yielding a response rate of 99%. In the interviewed households, 28,379 women age 15–49 were identified as eligible for individual interviews. Interviews were completed with 27,821 women, yielding a response rate of 98%.
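The response rates quoted above follow directly from the counts. A short check, in which the function name is illustrative:

```python
def response_rate(completed, eligible):
    """Response rate as a percentage of eligible units."""
    return 100.0 * completed / eligible


# Counts reported for the 2022 NDHS.
household_rate = response_rate(30_372, 30_621)  # interviewed / occupied households
women_rate = response_rate(27_821, 28_379)      # interviewed / eligible women

print(f"{household_rate:.1f}% {women_rate:.1f}%")
```

Rounded to whole percentages, these reproduce the reported 99% for households and 98% for women.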
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and in data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2022 Philippines National Demographic and Health Survey (2022 NDHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2022 NDHS is only one of many samples that could have been selected from the same population, using the same design and identical size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95% of all possible samples of identical size and design.
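As a small worked example of the relationship described above, the sketch below derives the standard error from a variance and forms the interval of plus or minus two standard errors. The estimate and variance used are made-up illustrative numbers, not NDHS results.

```python
import math


def confidence_interval(estimate, variance, multiplier=2.0):
    """Return (lower, upper) as estimate +/- multiplier * standard error."""
    standard_error = math.sqrt(variance)  # SE is the square root of the variance
    margin = multiplier * standard_error
    return estimate - margin, estimate + margin


# Illustrative values only: a proportion of 0.40 with variance 0.0004 (SE = 0.02).
lower, upper = confidence_interval(0.40, 0.0004)
print(f"{lower:.2f} to {upper:.2f}")  # 0.36 to 0.44
```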
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2022 NDHS sample was the result of a multistage stratified design, and, consequently, it was necessary to use more complex formulas. Sampling errors are computed in SAS using programs developed by ICF. These programs use the Taylor linearization method to estimate variances for survey estimates that are means, proportions, or ratios. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.
A more detailed description of estimates of sampling errors is presented in APPENDIX B of the survey report.
Data Quality Tables
See details of the data quality tables in Appendix C of the final report.
The primary objective of the 2018 NDHS is to provide up-to-date estimates of basic demographic and health indicators. Specifically, the NDHS collected information on fertility, awareness and use of family planning methods, breastfeeding practices, nutritional status of women and children, maternal and child health, adult and childhood mortality, women’s empowerment, domestic violence, female genital cutting, prevalence of malaria, awareness and behaviour regarding HIV/AIDS and other sexually transmitted infections (STIs), disability, and other health-related issues such as smoking.
The information collected through the 2018 NDHS is intended to assist policymakers and programme managers in evaluating and designing programmes and strategies for improving the health of the country’s population. The 2018 NDHS also provides indicators relevant to the Sustainable Development Goals (SDGs) for Nigeria.
National coverage
The survey covered all de jure household members (usual residents), all women aged 15-49 years resident in the household, and all children aged 0-5 years resident in the household.
Sample survey data [ssd]
The sampling frame used for the 2018 NDHS is the Population and Housing Census of the Federal Republic of Nigeria (NPHC), which was conducted in 2006 by the National Population Commission. Administratively, Nigeria is divided into states. Each state is subdivided into local government areas (LGAs), and each LGA is divided into wards. In addition to these administrative units, during the 2006 NPHC each locality was subdivided into convenient areas called census enumeration areas (EAs). The primary sampling unit (PSU), referred to as a cluster for the 2018 NDHS, is defined on the basis of EAs from the 2006 EA census frame. Although the 2006 NPHC did not provide the number of households and population for each EA, population estimates were published for 774 LGAs. A combination of information from cartographic material demarcating each EA and the LGA population estimates from the census was used to identify the list of EAs, estimate the number of households, and distinguish EAs as urban or rural for the survey sample frame. Before sample selection, all localities were classified separately into urban and rural areas based on predetermined minimum sizes of urban areas (cut-off points); consistent with the official definition in 2017, any locality with more than a minimum population size of 20,000 was classified as urban.
The sample for the 2018 NDHS was a stratified sample selected in two stages. Stratification was achieved by separating each of the 36 states and the Federal Capital Territory into urban and rural areas. In total, 74 sampling strata were identified. Samples were selected independently in every stratum via a two-stage selection. Implicit stratifications were achieved at each of the lower administrative levels by sorting the sampling frame before sample selection according to administrative order and by using a probability proportional to size selection during the first sampling stage.
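Probability proportional to size (PPS) selection from a sorted frame is commonly done systematically, which is also how sorting produces implicit stratification. The sketch below shows that general technique for one stratum. It is not the NPC's or ICF's selection program; the function name, the `rng` parameter and the example size measures are assumptions.

```python
import random
from bisect import bisect_right
from itertools import accumulate


def pps_systematic(sizes, n, rng=None):
    """Select n unit indices with probability proportional to size.

    `sizes` are the size measures of the units (for example, estimated
    households per EA), listed in the sorted order of the sampling frame.
    A unit larger than the sampling interval can be selected more than once.
    """
    rng = rng or random.Random()
    cumulative = list(accumulate(sizes))
    interval = cumulative[-1] / n
    start = rng.random() * interval  # random start in [0, interval)
    last = len(sizes) - 1
    # Each selection point falls inside exactly one unit's cumulative range.
    return [
        min(bisect_right(cumulative, start + k * interval), last)
        for k in range(n)
    ]


# Example: pick 3 EAs from a stratum of 8 EAs sorted in administrative order.
ea_sizes = [10, 20, 30, 40, 50, 60, 70, 80]
chosen = pps_systematic(ea_sizes, 3, rng=random.Random(2018))
```

Because the selection points are spread evenly over the cumulative size total, the chosen units are spread across the sorted frame, which is the effect implicit stratification aims for.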
For further details on sample selection, see Appendix A of the final report.
Computer Assisted Personal Interview [capi]
Four questionnaires were used for the 2018 NDHS: the Household Questionnaire, the Woman’s Questionnaire, the Man’s Questionnaire, and the Biomarker Questionnaire. The questionnaires, based on The DHS Program’s standard Demographic and Health Survey (DHS-7) questionnaires, were adapted to reflect the population and health issues relevant to Nigeria. Comments were solicited from various stakeholders representing government ministries and agencies, nongovernmental organisations, and international donors. In addition, information about the fieldworkers for the survey was collected through a self-administered Fieldworker Questionnaire.
The processing of the 2018 NDHS data began almost immediately after the fieldwork started. As data collection was completed in each cluster, all electronic data files were transferred via the IFSS to the NPC central office in Abuja. These data files were registered and checked for inconsistencies, incompleteness, and outliers. The field teams were alerted to any inconsistencies and errors. Secondary editing, carried out in the central office, involved resolving inconsistencies and coding the open-ended questions. The NPC data processor coordinated the exercise at the central office. The biomarker paper questionnaires were compared with electronic data files to check for any inconsistencies in data entry. Data entry and editing were carried out using the CSPro software package. The concurrent processing of the data offered a distinct advantage because it maximised the likelihood of the data being error-free and accurate. Timely generation of field check tables allowed for effective monitoring. The secondary editing of the data was completed in the second week of April 2019.
A total of 41,668 households were selected for the sample, of which 40,666 were occupied. Of the occupied households, 40,427 were successfully interviewed, yielding a response rate of 99%. In the households interviewed, 42,121 women age 15-49 were identified for individual interviews; interviews were completed with 41,821 women, yielding a response rate of 99%. In the subsample of households selected for the male survey, 13,422 men age 15-59 were identified and 13,311 were successfully interviewed, yielding a response rate of 99%.
The estimates from a sample survey are affected by two types of errors: nonsampling errors and sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2018 Nigeria Demographic and Health Survey (NDHS) to minimise this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2018 NDHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
Sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95% of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2018 NDHS sample is the result of a multistage stratified design, and, consequently, it was necessary to use more complex formulas. Sampling errors are computed in SAS, using programs developed by ICF. These programs use the Taylor linearisation method to estimate variances for survey estimates that are means, proportions, or ratios. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.
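The Jackknife repeated replication idea can be illustrated with a delete-one-cluster sketch. For simplicity the statistic here is a ratio r = Σy / Σx built from cluster totals, rather than a fertility or mortality rate, and stratification is ignored. The function name and the toy data are assumptions, and this is not the ICF SAS program.

```python
import math


def jackknife_ratio_se(cluster_totals):
    """Estimate r = sum(y) / sum(x) and its jackknife standard error.

    `cluster_totals` is a list of (y_total, x_total) pairs, one per cluster.
    Each replicate drops one cluster and recomputes the ratio.
    """
    k = len(cluster_totals)
    if k < 2:
        raise ValueError("at least two clusters are needed")
    y_all = sum(y for y, _ in cluster_totals)
    x_all = sum(x for _, x in cluster_totals)
    r = y_all / x_all
    pseudo_values = []
    for y, x in cluster_totals:
        r_without = (y_all - y) / (x_all - x)  # estimate with this cluster removed
        pseudo_values.append(k * r - (k - 1) * r_without)
    variance = sum((p - r) ** 2 for p in pseudo_values) / (k * (k - 1))
    return r, math.sqrt(variance)


# Toy data: (numerator total, denominator total) for four clusters.
ratio, se = jackknife_ratio_se([(12, 30), (9, 28), (15, 31), (10, 29)])
```

The same replicate-and-recompute pattern applies to more complex statistics: only the function that computes the estimate from the retained clusters changes.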
Note: A more detailed description of estimates of sampling errors is presented in APPENDIX B of the survey report.
Data Quality Tables
- Household age distribution
- Age distribution of eligible and interviewed women
- Age distribution of eligible and interviewed men
- Completeness of reporting
- Births by calendar years
- Reporting of age at death in days
- Reporting of age at death in months
- Standardisation exercise results from anthropometry training
- Height and weight data completeness and quality for children
- Height measurements from random subsample of measured children
- Sibship size and sex ratio of siblings
- Pregnancy-related mortality trends
- Data collection period
- Malaria prevalence according to rapid diagnostic test (RDT)
Note: See detailed data quality tables in APPENDIX C of the report.
The 2022 Ghana Demographic and Health Survey (2022 GDHS) is the seventh in the series of DHS surveys conducted by the Ghana Statistical Service (GSS) in collaboration with the Ministry of Health/Ghana Health Service (MoH/GHS) and other stakeholders, with funding from the United States Agency for International Development (USAID) and other partners.
The primary objective of the 2022 GDHS is to provide up-to-date estimates of basic demographic and health indicators. Specifically, the GDHS collected information on:
- Fertility levels and preferences, contraceptive use, antenatal and delivery care, maternal and child health, childhood mortality, childhood immunisation, breastfeeding and young child feeding practices, women’s dietary diversity, violence against women, gender, nutritional status of adults and children, awareness regarding HIV/AIDS and other sexually transmitted infections, tobacco use, and other indicators relevant for the Sustainable Development Goals
- Haemoglobin levels of women and children
- Prevalence of malaria parasitaemia (rapid diagnostic testing and thick slides for malaria parasitaemia in the field and microscopy in the lab) among children age 6–59 months
- Use of treated mosquito nets
- Use of antimalarial drugs for treatment of fever among children under age 5
The information collected through the 2022 GDHS is intended to assist policymakers and programme managers in designing and evaluating programmes and strategies for improving the health of the country’s population.
National coverage
The survey covered all de jure household members (usual residents), all women aged 15-49, men aged 15-59, and all children aged 0-4 resident in the household.
Sample survey data [ssd]
To achieve the objectives of the 2022 GDHS, a stratified representative sample of 18,450 households was selected in 618 clusters, which resulted in 15,014 interviewed women age 15–49 and 7,044 interviewed men age 15–59 (in one of every two households selected).
The sampling frame used for the 2022 GDHS is the updated frame prepared by the GSS based on the 2021 Population and Housing Census. The sampling procedure used in the 2022 GDHS was stratified two-stage cluster sampling, designed to yield representative results at the national level, for urban and rural areas, and for each of the country’s 16 regions for most DHS indicators. In the first stage, 618 target clusters were selected from the sampling frame using a probability proportional to size strategy for urban and rural areas in each region. The targeted number of clusters was then selected, with equal probability systematic random sampling, from the clusters selected in the first phase for urban and rural areas. In the second stage, after selection of the clusters, a household listing and map updating operation was carried out in all of the selected clusters to develop a list of households for each cluster. This list served as a sampling frame for selection of the household sample. The GSS organized a 5-day training course on listing procedures for listers and mappers with support from ICF. The listers and mappers were organized into 25 teams, each consisting of one lister and one mapper. The teams spent 2 months completing the listing operation. In addition to listing the households, the listers collected the geographical coordinates of each household using GPS dongles provided by ICF and in accordance with the instructions in the DHS listing manual. The household listing was carried out using tablet computers, with software provided by The DHS Program. A fixed number of 30 households in each cluster were randomly selected from the list for interviews.
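The two-stage selection described above can be sketched roughly as follows. This is an illustration only, not the GSS/ICF procedure: the frame, its household counts, the seed, and the function names `pps_systematic` and `select_households` are all hypothetical, and stratification by region and urban/rural area is left out for brevity.

```python
import random


def pps_systematic(sizes, n, rng):
    """Select n units with probability proportional to size (systematic PPS).

    sizes: positive measures of size, e.g. household counts per cluster.
    Returns the indices of the selected units in frame order.
    """
    total = sum(sizes)
    interval = total / n
    start = rng.random() * interval          # random start within the first interval
    selected = []
    cumulative = 0.0
    idx = 0
    for k in range(n):
        point = start + k * interval
        # Advance until the cumulative size covers the selection point.
        while idx < len(sizes) - 1 and cumulative + sizes[idx] <= point:
            cumulative += sizes[idx]
            idx += 1
        selected.append(idx)
    return selected


def select_households(listing, take, rng):
    """Second stage: draw a fixed number of households from a cluster listing."""
    return rng.sample(listing, min(take, len(listing)))


rng = random.Random(2022)
# Hypothetical frame: 200 clusters with 80 to 300 listed households each.
frame_sizes = [rng.randint(80, 300) for _ in range(200)]
clusters = pps_systematic(frame_sizes, 20, rng)
household_sample = {
    c: select_households(list(range(frame_sizes[c])), 30, rng) for c in clusters
}
```

With a fixed take of 30 households per cluster, 618 clusters give 618 × 30 = 18,540 selected households, which matches the total reported for the survey.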
For further details on sample design, see APPENDIX A of the final report.
Face-to-face computer-assisted interviews [capi]
Four questionnaires were used in the 2022 GDHS: the Household Questionnaire, the Woman’s Questionnaire, the Man’s Questionnaire, and the Biomarker Questionnaire. The questionnaires, based on The DHS Program’s model questionnaires, were adapted to reflect the population and health issues relevant to Ghana. In addition, a self-administered Fieldworker Questionnaire collected information about the survey’s fieldworkers.
The GSS organized a questionnaire design workshop with support from ICF and obtained input from government and development partners expected to use the resulting data. The DHS Program optional modules on domestic violence, malaria, and social and behavior change communication were incorporated into the Woman’s Questionnaire. ICF provided technical assistance in adapting the modules to the questionnaires.
DHS staff installed all central office programmes, data structure checks, secondary editing, and field check tables from 17–20 October 2022. Central office training was implemented using the practice data to test the central office system and field check tables. Seven GSS staff members (four male and three female) were trained on the functionality of the central office menu, including accepting clusters from the field, data editing procedures, and producing reports to monitor fieldwork.
From 27 February to 17 March, DHS staff visited the Ghana Statistical Service office in Accra to work with the GSS central office staff on finishing the secondary editing and to clean and finalize all data received from the 618 clusters.
A total of 18,540 households were selected for the GDHS sample, of which 18,065 were found to be occupied. Of the occupied households, 17,933 were successfully interviewed, yielding a response rate of 99%. In the interviewed households, 15,317 women age 15–49 were identified as eligible for individual interviews. Interviews were completed with 15,014 women, yielding a response rate of 98%. In the subsample of households selected for the male survey, 7,263 men age 15–59 were identified as eligible for individual interviews and 7,044 were successfully interviewed, yielding a response rate of 97%.
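The response rates follow directly from the counts quoted above. The short check below uses only those counts; the function name `response_rate` is an illustrative choice, and the rates are simple unweighted ratios.

```python
def response_rate(completed, eligible):
    """Completed interviews as a percentage of eligible units."""
    return 100.0 * completed / eligible


rates = {
    "households": response_rate(17_933, 18_065),  # interviewed / occupied
    "women": response_rate(15_014, 15_317),       # interviewed / eligible
    "men": response_rate(7_044, 7_263),           # interviewed / eligible
}

for group, rate in rates.items():
    print(f"{group}: {rate:.1f}%")
```

Rounded to whole percentages, these give 99% for households, 98% for women, and 97% for men.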
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2022 Ghana Demographic and Health Survey (2022 GDHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2022 GDHS is only one of many samples that could have been selected from the same population, using the same design and identical size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results. A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95% of all possible samples of identical size and design.
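As a small worked illustration of the relationship described above, the sketch below derives a standard error from a variance and builds the "plus or minus two standard errors" interval. The estimate and variance are hypothetical numbers chosen for the example, not GDHS results.

```python
import math


def standard_error(variance):
    """The standard error is the square root of the variance of the estimate."""
    return math.sqrt(variance)


def confidence_interval(estimate, se, multiplier=2.0):
    """Approximate 95% interval: estimate plus or minus two standard errors."""
    return (estimate - multiplier * se, estimate + multiplier * se)


# Hypothetical example: an estimated proportion of 0.40 with variance 0.0001.
se = standard_error(0.0001)
lower, upper = confidence_interval(0.40, se)
print(f"SE = {se:.3f}, interval = ({lower:.2f}, {upper:.2f})")
```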
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2022 GDHS sample was the result of a multistage stratified design, and, consequently, it was necessary to use more complex formulas. The computer software used to calculate sampling errors for the 2022 GDHS is an SAS program. This program used the Taylor linearization method to estimate variances for survey estimates that are means, proportions, or ratios. The Jackknife repeated replication method was used for variance estimation of more complex statistics such as fertility and mortality rates.
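To make the two variance estimators concrete, here is a minimal sketch of both applied to a ratio r = y/x built from cluster totals. It is not the SAS program used for the survey: the cluster totals and function names are hypothetical, finite population corrections are ignored, and the jackknife is shown on a simple ratio only for comparison, whereas the survey applies it to more complex statistics.

```python
import math


def taylor_se_ratio(strata):
    """Taylor linearization standard error of r = y / x.

    strata: dict mapping stratum name to a list of (y_total, x_total)
            pairs, one pair of weighted totals per cluster.
    """
    y = sum(yi for clusters in strata.values() for yi, _ in clusters)
    x = sum(xi for clusters in strata.values() for _, xi in clusters)
    r = y / x
    variance_sum = 0.0
    for clusters in strata.values():
        m = len(clusters)
        # Linearized residual for each cluster in the stratum.
        z = [yi - r * xi for yi, xi in clusters]
        z_mean = sum(z) / m
        variance_sum += m / (m - 1) * sum((zi - z_mean) ** 2 for zi in z)
    return r, math.sqrt(variance_sum) / x


def jackknife_se_ratio(strata):
    """Delete-one-cluster jackknife standard error of r = y / x."""
    clusters = [c for cl in strata.values() for c in cl]
    k = len(clusters)
    y = sum(yi for yi, _ in clusters)
    x = sum(xi for _, xi in clusters)
    r = y / x
    # Pseudo-values: k * r - (k - 1) * r computed without cluster i.
    pseudo = [k * r - (k - 1) * (y - yi) / (x - xi) for yi, xi in clusters]
    variance = sum((p - r) ** 2 for p in pseudo) / (k * (k - 1))
    return r, math.sqrt(variance)


# Hypothetical cluster totals: (numerator, denominator) per cluster.
example = {
    "urban": [(12, 30), (15, 28), (9, 31)],
    "rural": [(20, 29), (18, 30), (22, 27)],
}
r_taylor, se_taylor = taylor_se_ratio(example)
r_jack, se_jack = jackknife_se_ratio(example)
```

Both functions return the same point estimate; the standard errors generally differ somewhat because the Taylor version respects strata while this jackknife pools all clusters.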
A more detailed description of the estimates of sampling errors is presented in APPENDIX B of the survey report.
Data Quality Tables
Open Government Licence - Canada 2.0 https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
This table provides the age-standardized mortality rates per 100,000 population for three selected causes of death and for all causes combined. The three selected causes of death are Circulatory System, Neoplasms and External Causes (Injury). Age standardization is a technique applied to make rates comparable across groups with different age distributions. A simple rate is defined as the number of people with a particular condition divided by the whole population. An age-standardized rate starts from age-specific rates, each defined as the number of people with a condition divided by the population within that age group. Standardizing (adjusting) the rate across age groups allows a more accurate comparison between populations that have different age structures. Age standardization is typically done when comparing rates across time periods, different geographic areas, and/or population sub-groups (e.g. ethnic group). This indicator dataset contains information at both Local Geographic Area (for example, Lacombe, Red Deer - North, Calgary - West Bow, etc.) and Alberta levels. Local geographic area refers to 132 geographic areas created by Alberta Health (AH) and Alberta Health Services (AHS) based on census boundaries. This table is part of the "Alberta Health Primary Health Care - Community Profiles" report published February 2013.
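The difference between a simple (crude) rate and an age-standardized rate can be shown with a small example. The sketch assumes direct standardization, in which age-specific rates are weighted by a standard population; the dataset description does not say which method or standard population Alberta Health uses, and all counts below are hypothetical.

```python
def crude_rate(deaths, population, per=100_000):
    """Simple rate: all deaths divided by the whole population."""
    return per * sum(deaths) / sum(population)


def age_standardized_rate(deaths, population, standard_population, per=100_000):
    """Direct standardization: age-specific rates weighted by a standard population."""
    total_standard = sum(standard_population)
    rate = 0.0
    for d, p, s in zip(deaths, population, standard_population):
        rate += (d / p) * (s / total_standard)
    return per * rate


# Hypothetical age groups: young, middle, old.
standard = [4000, 3500, 2500]

# Two areas with identical age-specific rates but different age structures.
area_a = {"deaths": [5, 15, 40], "population": [5000, 3000, 2000]}
area_b = {"deaths": [2, 15, 100], "population": [2000, 3000, 5000]}

crude_a = crude_rate(area_a["deaths"], area_a["population"])
crude_b = crude_rate(area_b["deaths"], area_b["population"])
asr_a = age_standardized_rate(area_a["deaths"], area_a["population"], standard)
asr_b = age_standardized_rate(area_b["deaths"], area_b["population"], standard)
```

The crude rates differ (600 versus 1,170 per 100,000) only because area B has an older population. The standardized rates are the same for both areas (about 715 per 100,000), because the underlying age-specific rates are identical.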
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset contains a collection of reviews related to clothing products. It is designed to be a valuable resource for multilabel classification research. Each data entry includes annotated labels, enabling exploration of various aspects of reviewed clothing items. The dataset offers diverse perspectives, supporting the development and evaluation of robust classification models capable of predicting multiple characteristics of a clothing item. Its focus on multilabel classification aids in advancing machine learning applications within the fashion industry.
The dataset is typically provided in a CSV file format. Specific numbers for rows or records are not currently available. It is structured as a tabular collection of consumer reviews.
This dataset is ideal for:
* Multilabel classification research and model development.
* Building and evaluating machine learning algorithms to predict multiple attributes of clothing items from review text.
* Natural Language Processing (NLP) tasks, such as sentiment analysis, topic modelling, and feature extraction from consumer reviews.
* Data visualisation projects to understand consumer preferences and pain points in the fashion sector.
* Advancing the application of machine learning in the fashion industry.
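For readers new to multilabel classification, the sketch below shows how per-review label sets are usually turned into multi-hot vectors and how predictions can be scored with Hamming loss. The label names and the toy label sets are hypothetical; the dataset's actual columns and labels are not described here.

```python
def fit_label_index(label_sets):
    """Map each distinct label to a fixed column position."""
    labels = sorted({label for labels_ in label_sets for label in labels_})
    return {label: i for i, label in enumerate(labels)}


def multi_hot(label_set, index):
    """Encode a set of labels as a 0/1 vector, one column per known label."""
    vector = [0] * len(index)
    for label in label_set:
        vector[index[label]] = 1
    return vector


def hamming_loss(y_true, y_pred):
    """Fraction of label slots that are predicted incorrectly."""
    total = sum(len(row) for row in y_true)
    wrong = sum(
        t != p
        for true_row, pred_row in zip(y_true, y_pred)
        for t, p in zip(true_row, pred_row)
    )
    return wrong / total


# Hypothetical annotations for three reviews, and hypothetical predictions.
true_labels = [{"fit", "fabric"}, {"price"}, {"fit"}]
pred_labels = [{"fit"}, {"price"}, {"fit", "price"}]

index = fit_label_index(true_labels)
y_true = [multi_hot(s, index) for s in true_labels]
y_pred = [multi_hot(s, index) for s in pred_labels]
loss = hamming_loss(y_true, y_pred)
```

Unlike single-label accuracy, Hamming loss gives partial credit: a review with one of its two labels predicted correctly counts as one error out of three label slots, not as a complete miss.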
The dataset's regional coverage is global. It was listed on 11th June 2025. Specific time ranges for the reviews themselves are not detailed. While reviews may contain demographic mentions (e.g., petite sizing, body types), the dataset does not explicitly define demographic scope.
CC0
Original Data Source: Consumer Review of Clothing Product
Figure 7.1 provides the age-standardized mortality rates per 100,000 population for three selected causes of death and for all causes combined. The three selected causes of death are Circulatory System, Neoplasms and External Causes (Injury). Age standardization is a technique applied to make rates comparable across groups with different age distributions. A simple rate is defined as the number of people with a particular condition divided by the whole population. An age-standardized rate starts from age-specific rates, each defined as the number of people with a condition divided by the population within that age group. Standardizing (adjusting) the rate across age groups allows a more accurate comparison between populations that have different age structures. Age standardization is typically done when comparing rates across time periods, different geographic areas, and/or population sub-groups (e.g. ethnic group). This indicator dataset contains information at both Local Geographic Area (for example, Lacombe, Red Deer - North, Calgary - West Bow, etc.) and Alberta levels. Local geographic area refers to 132 geographic areas created by Alberta Health (AH) and Alberta Health Services (AHS) based on census boundaries. This table is part of the "Alberta Health Primary Health Care - Community Profiles" report published August 2022.
Open Government Licence - Canada 2.0 https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
This table provides the age-standardized mortality rates per 100,000 population for three selected causes of death and for all causes combined, for both the local geographic area and Alberta, for the most recent three-year period available. The three selected causes of death are Circulatory System, Neoplasms and External Causes (Injury). Age standardization is a technique applied to make rates comparable across groups with different age distributions. A simple rate is defined as the number of people with a particular condition divided by the whole population. An age-standardized rate starts from age-specific rates, each defined as the number of people with a condition divided by the population within that age group. Standardizing (adjusting) the rate across age groups allows a more accurate comparison between populations that have different age structures. Age standardization is typically done when comparing rates across time periods, different geographic areas, and/or population sub-groups (e.g. ethnic group). This indicator dataset contains information at both Local Geographic Area (for example, Lacombe, Red Deer - North, Calgary - West Bow, etc.) and Alberta levels. Local geographic area refers to 132 geographic areas created by Alberta Health (AH) and Alberta Health Services (AHS) based on census boundaries. This table is part of the "Alberta Health Primary Health Care - Community Profiles" report published March 2015.
This table provides the age-standardized mortality rates per 100,000 population for three selected causes of death and for all causes combined. The three selected causes of death are Circulatory System, Neoplasms and External Causes (Injury). Age standardization is a technique applied to make rates comparable across groups with different age distributions. A simple rate is defined as the number of people with a particular condition divided by the whole population. An age-standardized rate starts from age-specific rates, each defined as the number of people with a condition divided by the population within that age group. Standardizing (adjusting) the rate across age groups allows a more accurate comparison between populations that have different age structures. Age standardization is typically done when comparing rates across time periods, different geographic areas, and/or population sub-groups (e.g. ethnic group). This indicator dataset contains information at both Local Geographic Area (for example, Lacombe, Red Deer - North, Calgary - West Bow, etc.) and Alberta levels. Local geographic area refers to 132 geographic areas created by Alberta Health (AH) and Alberta Health Services (AHS) based on census boundaries. This table is part of the "Alberta Health Primary Health Care - Community Profiles" report published March 2019.
This table provides the age-standardized mortality rates per 100,000 population for three selected causes of death and for all causes combined. The three selected causes of death are Circulatory System, Neoplasms and External Causes (Injury). Age standardization is a technique applied to make rates comparable across groups with different age distributions. A simple rate is defined as the number of people with a particular condition divided by the whole population. An age-standardized rate starts from age-specific rates, each defined as the number of people with a condition divided by the population within that age group. Standardizing (adjusting) the rate across age groups allows a more accurate comparison between populations that have different age structures. Age standardization is typically done when comparing rates across time periods, different geographic areas, and/or population sub-groups (e.g. ethnic group). This indicator dataset contains information at both Local Geographic Area (for example, Lacombe, Red Deer - North, Calgary - West Bow, etc.) and Alberta levels. Local geographic area refers to 132 geographic areas created by Alberta Health (AH) and Alberta Health Services (AHS) based on census boundaries. This table is part of the "Alberta Health Primary Health Care - Community Profiles" report published March 2015.
https://datafinder.stats.govt.nz/license/attribution-4-0-international/
This dataset is the definitive set of annually released statistical area 1 (SA1) boundaries for 2022 as defined by Stats NZ (the custodian). This version contains 29,913 SA1 features.
SA1s were introduced as part of the Statistical Standard for Geographic Areas 2018 (SSGA18) which replaced the New Zealand Standard Areas Classification (NZSAC92). SA1 is an output geography that allows the release of more detailed information about population characteristics than is available at the meshblock level.
Built by joining meshblocks, SA1s have an ideal size range of 100–200 residents, and a maximum population of about 500. This is to minimise suppression of population data in multivariate statistics tables. SA1s either define or aggregate to define SA2s, urban rural areas, territorial authorities, and regional councils. Some SA1s that contain apartment blocks, retirement villages, and large non-residential facilities have more than 500 residents.
This generalised version has been simplified for rapid drawing and is designed for thematic or web mapping purposes.
Digital boundary data became freely available on 1 July 2007.
The SA1 classification can also be downloaded from the Stats NZ classification and concordance tool Ariā.
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
There are a number of sources for estimates of the size and distribution of ethnic group populations in England. These estimates vary in quality, accuracy, timeliness, and detail; in some cases, the underlying definition of what constitutes the resident population is different. This document outlines in some detail the major sources of ethnic group information currently available at the national and regional level. It also gives a brief summary of the estimates themselves.