6 datasets found
  1. Living Standards Survey V 2005-2006 - World Bank SHIP Harmonized Dataset -...

    • microdata.worldbank.org
    • catalog.ihsn.org
    Updated Dec 9, 2014
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ghana Statistical Service (GSS) (2014). Living Standards Survey V 2005-2006 - World Bank SHIP Harmonized Dataset - Ghana [Dataset]. https://microdata.worldbank.org/index.php/catalog/1064
    Explore at:
    Dataset updated
    Dec 9, 2014
    Dataset provided by
    Ghana Statistical Services
    Authors
    Ghana Statistical Service (GSS)
    Time period covered
    2005 - 2006
    Area covered
    Ghana
    Description

    Abstract

    Survey based Harmonized Indicators (SHIP) files are harmonized data files from household surveys that are conducted by countries in Africa. To ensure the quality and transparency of the data, it is critical to document the procedures of compiling consumption aggregation and other indicators so that the results can be duplicated with ease. This process enables consistency and continuity that make temporal and cross-country comparisons consistent and more reliable.

    Four harmonized data files are prepared for each survey to generate a set of harmonized variables that have the same variable names. Invariably, in each survey, questions are asked in a slightly different way, which poses challenges on consistent definition of harmonized variables. The harmonized household survey data present the best available variables with harmonized definitions, but not identical variables. The four harmonized data files are

    a) Individual level file (Labor force indicators in a separate file): This file has information on basic characteristics of individuals such as age and sex, literacy, education, health, anthropometry and child survival. b) Labor force file: This file has information on labor force including employment/unemployment, earnings, sectors of employment, etc. c) Household level file: This file has information on household expenditure, household head characteristics (age and sex, level of education, employment), housing amenities, assets, and access to infrastructure and services. d) Household Expenditure file: This file has consumption/expenditure aggregates by consumption groups according to Purpose (COICOP) of Household Consumption of the UN.

    Geographic coverage

    National

    Analysis unit

    • Individual level for datasets with suffix _I and _L
    • Household level for datasets with suffix _H and _E

    Universe

    The survey covered all de jure household members (usual residents).

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    Sampling Frame and Units As in all probability sample surveys, it is important that each sampling unit in the surveyed population has a known, non-zero probability of selection. To achieve this, there has to be an appropriate list, or sampling frame of the primary sampling units (PSUs).The universe defined for the GLSS 5 is the population living within private households in Ghana. The institutional population (such as schools, hospitals etc), which represents a very small percentage in the 2000 Population and Housing Census (PHC), is excluded from the frame for the GLSS 5.

    The Ghana Statistical Service (GSS) maintains a complete list of census EAs, together with their respective population and number of households as well as maps, with well defined boundaries, of the EAs. . This information was used as the sampling frame for the GLSS 5. Specifically, the EAs were defined as the primary sampling units (PSUs), while the households within each EA constituted the secondary sampling units (SSUs).

    Stratification In order to take advantage of possible gains in precision and reliability of the survey estimates from stratification, the EAs were first stratified into the ten administrative regions. Within each region, the EAs were further sub-divided according to their rural and urban areas of location. The EAs were also classified according to ecological zones and inclusion of Accra (GAMA) so that the survey results could be presented according to the three ecological zones, namely 1) Coastal, 2) Forest, and 3) Northern Savannah, and for Accra.

    Sample size and allocation The number and allocation of sample EAs for the GLSS 5 depend on the type of estimates to be obtained from the survey and the corresponding precision required. It was decided to select a total sample of around 8000 households nationwide.

    To ensure adequate numbers of complete interviews that will allow for reliable estimates at the various domains of interest, the GLSS 5 sample was designed to ensure that at least 400 households were selected from each region.

    A two-stage stratified random sampling design was adopted. Initially, a total sample of 550 EAs was considered at the first stage of sampling, followed by a fixed take of 15 households per EA. The distribution of the selected EAs into the ten regions or strata was based on proportionate allocation using the population.

    For example, the number of selected EAs allocated to the Western Region was obtained as: 1924577/18912079*550 = 56

    Under this sampling scheme, it was observed that the 400 households minimum requirement per region could be achieved in all the regions but not the Upper West Region. The proportionate allocation formula assigned only 17 EAs out of the 550 EAs nationwide and selecting 15 households per EA would have yielded only 255 households for the region. In order to surmount this problem, two options were considered: retaining the 17 EAs in the Upper West Region and increasing the number of selected households per EA from 15 to about 25, or increasing the number of selected EAs in the region from 17 to 27 and retaining the second stage sample of 15 households per EA.

    The second option was adopted in view of the fact that it was more likely to provide smaller sampling errors for the separate domains of analysis. Based on this, the number of EAs in Upper East and the Upper West were adjusted from 27 and 17 to 40 and 34 respectively, bringing the total number of EAs to 580 and the number of households to 8,700.

    A complete household listing exercise was carried out between May and June 2005 in all the selected EAs to provide the sampling frame for the second stage selection of households. At the second stage of sampling, a fixed number of 15 households per EA was selected in all the regions. In addition, five households per EA were selected as replacement samples.The overall sample size therefore came to 8,700 households nationwide.

    Mode of data collection

    Face-to-face [f2f]

  2. d

    HSRC Master Sample II - Dataset - B2FIND

    • demo-b2find.dkrz.de
    Updated Sep 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). HSRC Master Sample II - Dataset - B2FIND [Dataset]. http://demo-b2find.dkrz.de/dataset/e34fc48c-0f01-51a9-bf21-93dc96b59013
    Explore at:
    Dataset updated
    Sep 27, 2025
    Description

    Description: The 2005 HSRC Master Sample was used for SABSSM 2008 and 2012, the SANHANES study in 2012 and SASAS 2007-2010 (adjacent EAs) to obtain an understanding of geographical spread of HIV/AIDS, perceptions and attitudes of people and other health related studies over time. Abstract: A sample can be defined as a subset containing the characteristics of a larger population. Samples are used in statistical testing when population sizes are too large for the test to include all possible members or observations. A sample should represent the whole population and not reflect bias toward a specific attribute.[1] One of the most crucial aspects of sample design in household surveys is its frame. The sampling frame has significant implications on the cost and the quality of any survey, household or otherwise.[2] The sampling frame .... in a household survey must cover the entire target population. When that frame is used for multiple surveys or multiple rounds of the same survey it is known as a master sample frame or .... master sample.[3] A master sample is a sample drawn from a population for use on a number of future occasions, so as to avoid ad hoc sampling on each occasion. Sometimes the master sample is large and subsequent inquiries are based on a sub-sample from it.[4] The HSRC compiles master samples in order to construct samples for various HSRC research studies. The 2005 HSRC Master Sample was used for SABSSM 2008 and 2012, SASAS 2007-2010 and the SANHANES study in 2012 to obtain an understanding of geographical spread of HIV/AIDS, perceptions and attitudes of people and other health related studies over time. The 2005 HSRC Master Sample was created in the following way: South Africa was delineated into EAs according to municipality and province. Municipal boundaries were obtained from the Municipal Demarcation Board. An Enumeration area (EA) is the smallest geographical unit (piece of land) into which the country is divided for census or survey enumeration.[5] The concepts and definitions of terms used for Census 2001 comply in most instances with United Nations standards for censuses. A total of 1,000 census enumeration areas (EAs) from the 2001 population census were randomly selected using probability proportional to size and stratified by province, locality type and race in urban areas from a database of 80 787 EAs that were mapped using aerial photography to develop an HSRC master sample for selecting households. The ideal frame would be complete with respect to the target population if all of its members (the universe) are covered by the frame. Ideal characteristics of a master sample: The master frame should be as complete, accurate and current as practicable. A master sample frame for household surveys is typically developed from the most recent census, just as a regular sample frame is. Because the master frame may be used during an entire intercensal (between census) period, however, it will usually require periodic and regular updating such as every 2-3 years. This is in contrast to a regular frame which is more likely to be up-dated on an ad hoc basis and only when a particular survey is being planned[6] [1] http://www.investopedia.com/terms/s/sample.asp [2] http://unstats.un.org/unsd/demographic/meetings/egm/sampling_1203/docs/no_3.pdf [3] http://unstats.un.org/unsd/demographic/meetings/egm/sampling_1203/docs/no_3.pdf [4] A Dictionary of Statistical Terms, 5th edition, prepared for the International Statistical Institute by F.H.C. Marriott. Published for the International Statistical Institute by Longman Scientific and Technical. http://stats.oecd.org/glossary/detail.asp?ID=3708 [5] http://africageodownloads.info/128_mokgokolo.pdf [6] http://unstats.un.org/unsd/demographic/meetings/egm/sampling_1203/docs/no_3.pdf All enumeration areas (80 787 EAs) within the South African borders during the 2001 Census. The whole country was delimited into EAs according to municipality and province. Municipal boundaries were obtained from the Municipal Demarcation Board. A total of 1,000 census enumeration areas (EAs) from the 2001 population census were randomly selected using probability proportional to size and stratified by province, locality type and race in urban areas from a database of 80 787 EAs that were mapped in all surveys using aerial photography to develop all HSRC master sample for selecting households. The first digit represents the province The second and third digits represent the municipality

  3. Definition of the selected demographic and socioeconomic variables.

    • plos.figshare.com
    xls
    Updated Jan 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhenlong Li; Huan Ning; Fengrui Jing; M. Naser Lessani (2024). Definition of the selected demographic and socioeconomic variables. [Dataset]. http://doi.org/10.1371/journal.pone.0294430.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jan 19, 2024
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Zhenlong Li; Huan Ning; Fengrui Jing; M. Naser Lessani
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Definition of the selected demographic and socioeconomic variables.

  4. Default sim_abundance function call, with descriptions, default values and...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Paul M. Regular; Gregory J. Robertson; Keith P. Lewis; Jonathan Babyn; Brian Healey; Fran Mowbray (2023). Default sim_abundance function call, with descriptions, default values and associated parameter symbols of key arguments. [Dataset]. http://doi.org/10.1371/journal.pone.0232822.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Paul M. Regular; Gregory J. Robertson; Keith P. Lewis; Jonathan Babyn; Brian Healey; Fran Mowbray
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Default sim_abundance function call, with descriptions, default values and associated parameter symbols of key arguments.

  5. Characteristics of the datasets used to estimate minimum sample size.

    • plos.figshare.com
    xls
    Updated Feb 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matheus Scaketti; Patricia Sanae Sujii; Alessandro Alves-Pereira; Kaiser Dias Schwarcz; Ana Flávia Francisconi; Matheus Sartori Moro; Kauanne Karolline Moreno Martins; Thiago Araujo de Jesus; Guilherme Brener Ferreira de Souza; Maria Imaculada Zucchi (2025). Characteristics of the datasets used to estimate minimum sample size. [Dataset]. http://doi.org/10.1371/journal.pone.0316634.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Feb 13, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Matheus Scaketti; Patricia Sanae Sujii; Alessandro Alves-Pereira; Kaiser Dias Schwarcz; Ana Flávia Francisconi; Matheus Sartori Moro; Kauanne Karolline Moreno Martins; Thiago Araujo de Jesus; Guilherme Brener Ferreira de Souza; Maria Imaculada Zucchi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Characteristics of the datasets used to estimate minimum sample size.

  6. N

    Population Pyramid Dataset: Age Groups, Male and Female Population, and...

    • neilsberg.com
    csv, json
    Updated Feb 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). Population Pyramid Dataset: Age Groups, Male and Female Population, and Total Population for Demographics Analysis // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/527b28ab-f122-11ef-8c1b-3860777c1fe6/
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Feb 22, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Male and Female Population Under 5 Years, Male and Female Population over 85 years, Male and Female Total Population for Age Groups, Male and Female Population Between 5 and 9 years, Male and Female Population Between 10 and 14 years, Male and Female Population Between 15 and 19 years, Male and Female Population Between 20 and 24 years, Male and Female Population Between 25 and 29 years, Male and Female Population Between 30 and 34 years, Male and Female Population Between 35 and 39 years, and 9 more
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) -Year Estimates. To measure the three variables, namely (a) male population, (b) female population and (b) total population, we initially analyzed and categorized the data for each of the age groups. For age groups we divided it into roughly a 5 year bucket for ages between 0 and 85. For over 85, we aggregated data into a single group for all ages. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the data for the population pyramid, which represents the population distribution across age and gender, using estimates from the U.S. Census Bureau American Community Survey (ACS) -Year Estimates. It lists the male and female population for each age group, along with the total population for those age groups. Higher numbers at the bottom of the table suggest population growth, whereas higher numbers at the top indicate declining birth rates. Furthermore, the dataset can be utilized to understand the youth dependency ratio, old-age dependency ratio, total dependency ratio, and potential support ratio.

    Key observations

    • Youth dependency ratio, which is the number of children aged 0-14 per 100 persons aged 15-64, for , is .
    • Old-age dependency ratio, which is the number of persons aged 65 or over per 100 persons aged 15-64, for , is .
    • Total dependency ratio for is .
    • Potential support ratio, which is the number of youth (working age population) per elderly, for is .
    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) -Year Estimates.

    Age groups:

    • Under 5 years
    • 5 to 9 years
    • 10 to 14 years
    • 15 to 19 years
    • 20 to 24 years
    • 25 to 29 years
    • 30 to 34 years
    • 35 to 39 years
    • 40 to 44 years
    • 45 to 49 years
    • 50 to 54 years
    • 55 to 59 years
    • 60 to 64 years
    • 65 to 69 years
    • 70 to 74 years
    • 75 to 79 years
    • 80 to 84 years
    • 85 years and over

    Variables / Data Columns

    • Age Group: This column displays the age group for the population analysis. Total expected values are 18 and are define above in the age groups section.
    • Population (Male): The male population in the for the selected age group is shown in the following column.
    • Population (Female): The female population in the for the selected age group is shown in the following column.
    • Total Population: The total population of the for the selected age group is shown in the following column.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Population by Age. You can refer the same here

  7. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Ghana Statistical Service (GSS) (2014). Living Standards Survey V 2005-2006 - World Bank SHIP Harmonized Dataset - Ghana [Dataset]. https://microdata.worldbank.org/index.php/catalog/1064
Organization logo

Living Standards Survey V 2005-2006 - World Bank SHIP Harmonized Dataset - Ghana

Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Dec 9, 2014
Dataset provided by
Ghana Statistical Services
Authors
Ghana Statistical Service (GSS)
Time period covered
2005 - 2006
Area covered
Ghana
Description

Abstract

Survey based Harmonized Indicators (SHIP) files are harmonized data files from household surveys that are conducted by countries in Africa. To ensure the quality and transparency of the data, it is critical to document the procedures of compiling consumption aggregation and other indicators so that the results can be duplicated with ease. This process enables consistency and continuity that make temporal and cross-country comparisons consistent and more reliable.

Four harmonized data files are prepared for each survey to generate a set of harmonized variables that have the same variable names. Invariably, in each survey, questions are asked in a slightly different way, which poses challenges on consistent definition of harmonized variables. The harmonized household survey data present the best available variables with harmonized definitions, but not identical variables. The four harmonized data files are

a) Individual level file (Labor force indicators in a separate file): This file has information on basic characteristics of individuals such as age and sex, literacy, education, health, anthropometry and child survival. b) Labor force file: This file has information on labor force including employment/unemployment, earnings, sectors of employment, etc. c) Household level file: This file has information on household expenditure, household head characteristics (age and sex, level of education, employment), housing amenities, assets, and access to infrastructure and services. d) Household Expenditure file: This file has consumption/expenditure aggregates by consumption groups according to Purpose (COICOP) of Household Consumption of the UN.

Geographic coverage

National

Analysis unit

  • Individual level for datasets with suffix _I and _L
  • Household level for datasets with suffix _H and _E

Universe

The survey covered all de jure household members (usual residents).

Kind of data

Sample survey data [ssd]

Sampling procedure

Sampling Frame and Units As in all probability sample surveys, it is important that each sampling unit in the surveyed population has a known, non-zero probability of selection. To achieve this, there has to be an appropriate list, or sampling frame of the primary sampling units (PSUs).The universe defined for the GLSS 5 is the population living within private households in Ghana. The institutional population (such as schools, hospitals etc), which represents a very small percentage in the 2000 Population and Housing Census (PHC), is excluded from the frame for the GLSS 5.

The Ghana Statistical Service (GSS) maintains a complete list of census EAs, together with their respective population and number of households as well as maps, with well defined boundaries, of the EAs. . This information was used as the sampling frame for the GLSS 5. Specifically, the EAs were defined as the primary sampling units (PSUs), while the households within each EA constituted the secondary sampling units (SSUs).

Stratification In order to take advantage of possible gains in precision and reliability of the survey estimates from stratification, the EAs were first stratified into the ten administrative regions. Within each region, the EAs were further sub-divided according to their rural and urban areas of location. The EAs were also classified according to ecological zones and inclusion of Accra (GAMA) so that the survey results could be presented according to the three ecological zones, namely 1) Coastal, 2) Forest, and 3) Northern Savannah, and for Accra.

Sample size and allocation The number and allocation of sample EAs for the GLSS 5 depend on the type of estimates to be obtained from the survey and the corresponding precision required. It was decided to select a total sample of around 8000 households nationwide.

To ensure adequate numbers of complete interviews that will allow for reliable estimates at the various domains of interest, the GLSS 5 sample was designed to ensure that at least 400 households were selected from each region.

A two-stage stratified random sampling design was adopted. Initially, a total sample of 550 EAs was considered at the first stage of sampling, followed by a fixed take of 15 households per EA. The distribution of the selected EAs into the ten regions or strata was based on proportionate allocation using the population.

For example, the number of selected EAs allocated to the Western Region was obtained as: 1924577/18912079*550 = 56

Under this sampling scheme, it was observed that the 400 households minimum requirement per region could be achieved in all the regions but not the Upper West Region. The proportionate allocation formula assigned only 17 EAs out of the 550 EAs nationwide and selecting 15 households per EA would have yielded only 255 households for the region. In order to surmount this problem, two options were considered: retaining the 17 EAs in the Upper West Region and increasing the number of selected households per EA from 15 to about 25, or increasing the number of selected EAs in the region from 17 to 27 and retaining the second stage sample of 15 households per EA.

The second option was adopted in view of the fact that it was more likely to provide smaller sampling errors for the separate domains of analysis. Based on this, the number of EAs in Upper East and the Upper West were adjusted from 27 and 17 to 40 and 34 respectively, bringing the total number of EAs to 580 and the number of households to 8,700.

A complete household listing exercise was carried out between May and June 2005 in all the selected EAs to provide the sampling frame for the second stage selection of households. At the second stage of sampling, a fixed number of 15 households per EA was selected in all the regions. In addition, five households per EA were selected as replacement samples.The overall sample size therefore came to 8,700 households nationwide.

Mode of data collection

Face-to-face [f2f]

Search
Clear search
Close search
Google apps
Main menu