23 datasets found
  1. Rural Access Index by Country (2022 - 2023)

    • sdg-transformation-center-sdsn.hub.arcgis.com
    Updated Apr 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sustainable Development Solutions Network (2023). Rural Access Index by Country (2022 - 2023) [Dataset]. https://sdg-transformation-center-sdsn.hub.arcgis.com/datasets/d386abdab7d946aa8b1a0cd11496d91f
    Explore at:
    Dataset updated
    Apr 19, 2023
    Dataset authored and provided by
    Sustainable Development Solutions Networkhttps://www.unsdsn.org/
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Area covered
    Description

    The Rural Access Index (RAI) is a measure of access, developed by the World Bank in 2006. It was adopted as Sustainable Development Goal (SDG) indicator 9.1.1 in 2015, to measure the accessibility of rural populations. It is currently the only indicator for the SDGs that directly measures rural access.The RAI measures the proportion of the rural population that lives within 2 km of an all-season road. An all-season road is one that is motorable all year, but may be temporarily unavailable during inclement weather (Roberts, Shyam, & Rastogi, 2006). This dataset implements and expands on the most recent official methodology put forward by the World Bank, ReCAP's 2019 RAI Supplemental Guidelines. This is, to date, the only publicly available application of this method at a global scale.MethodologyReCAP's methodology provided new insight on what makes a road all-season and how this data should be handled: instead of removing unpaved roads from the network, the ones that are classified as unpaved are to be intersected with topographic and climatic conditions and, whenever there’s an overlap with excess precipitation and slope, a multiplying factor ranging from 0% to 100% is applied to the population that would access to that road. This present dataset developed by SDSN's SDG Transformation Centre proposes that authorities ability to maintain and remediate road conditions also be taken into account.Data sourcesThe indicator relies on four major items of geospatial data: land cover (rural or urban), population distribution, road network extent and the “all-season” status of those roads.Land cover data (urban/rural distinction)Since the indicator measures the acess rural populations, it's necessary to define what is and what isn't rural. This dataset uses the DegUrba Methodology, proposed by the United Nations Expert Group on Statistical Methodology for Delineating Cities and Rural Areas (United Nations Expert Group, 2019). This approach has been developed by the European Commission Global Human Settlement Layer (GHSL-SMOD) project, and is designed to instil some consistency into the definitions based on population density on a 1-km grid, but adjusted for local situations.Population distributionThe source for population distribution data is WorldPop. This uses national census data, projections and other ancillary data from countries to produce aggregated, 100 m2 population data. Road extentTwo widely recognized road datasets are used: the real-time updated crowd-sourced OpenStreetMap (OSM) or the GLOBIO’s 2018 GRIP database, which draws data from official national sources. The reasons for picking the latter are mostly related to its ability to provide information on the surface (pavement) of these roads, to the detriment of the timeliness of the data, which is restrained to the year 2018. Additionally, data from Microsoft Bing's recent Road Detection project is used to ensure completeness. This dataset is completely derived from machine learning methods applied over satellite imagery, and detected 1,165 km of roads missing from OSM.Roads’ all-season statusThe World Bank's original 2006 methodology defines the term all-season as “… a road that is motorable all year round by the prevailing means of rural transport, allowing for occasional interruptions of short duration”. ReCAP's 2019 methodology makes a case for passability equating to the all-season status of a road, along with the assumption that typically the wet season is when roads become impassable, especially so in steep roads that are more exposed to landslides.This dataset follows the ReCAP methodology by creating an passability index. The proposed use of passability factors relies on the following three aspects:• Surface type. Many rural roads in LICs (and even in large high-income countries including the USA and Australia) are unpaved. As mentioned before, unpaved roads deteriorate rapidly and in a different way to paved roads. They are very susceptible to water ingress to the surface, which softens the materials and makes them very vulnerable to the action of traffic. So, when a road surface becomes saturated and is subject to traffic, the deterioration is accelerated. • Climate. Precipitation has a significant effect on the condition of a road, especially on unpaved roads, which predominate in LICs and provide much of the extended connectivity to rural and poor areas. As mentioned above, the rainfall on a road is a significant factor in its deterioration, but the extent depends on the type of rainfall in terms of duration and intensity, and how well the roadside drainage copes with this. While ReCAP suggested the use of general climate zones, we argue that better spatial and temporal resolutions can be acquired through the Copernicus Programme precipitation data, which is made available freely at ~30km pixel size for each month of the year.• Terrain. The gradient and altitude of roads also has an effect on their accessibility. Steep roads become impassable more easily due to the potential for scour during heavy rainfall, and also due to slipperiness as a result of the road surface materials used. Here this is drawn from slope calculated from SRTM Digital Terrain data.• Road maintenance. The ability of local authorities to remediate damaged caused by precipitation and landslides is proposed as a correcting factor to the previous ones. Ideally this would be measured by the % of GDP invested in road construction and maintenance, but this isn't available for all countries. For this reason, GDP per capita is adopted as a proxy instead. The data range is normalized in such a way that a road maxed out in terms of precipitation and slope (accessibility score of 0.25) in a country at the top of the GDP per capita range is brought back at to the higher end of the accessibility score (0.95), while the accessibility score of a road meeting the same passability conditions in a country which GDP per capita is towards the lower end is kept unchanged.Data processingThe roads from the three aforementioned datasets (Bing, GRIP and OSM) are merged together to them is applied a 2km buffer. The populations falling exclusively on unpaved road buffers are multiplied by the resulting passability index, which is defined as the normalized sum of the aforementioned components, ranging from 0.25 to. 0.9, with 0.95 meaning 95% probability that the road is all-season. The index applied to the population data, so, when calculated, the RAI includes the probability that the roads which people are using in each area will be all-season or not. For example, an unpaved road in a flat area with low rainfall would have an accessibility factor of 0.95, as this road is designed to be accessible all year round and the environmental effects on its impassability are minimal.The code for generating this dataset is available on Github at: https://github.com/sdsna/rai

  2. Bolivia BO: Rural Population: % of Total Population

    • ceicdata.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com, Bolivia BO: Rural Population: % of Total Population [Dataset]. https://www.ceicdata.com/en/bolivia/population-and-urbanization-statistics/bo-rural-population--of-total-population
    Explore at:
    Dataset provided by
    CEIC Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 1, 2012 - Dec 1, 2023
    Area covered
    Bolivia
    Variables measured
    Population
    Description

    Bolivia BO: Rural Population: % of Total Population data was reported at 28.814 % in 2023. This records a decrease from the previous number of 29.170 % for 2022. Bolivia BO: Rural Population: % of Total Population data is updated yearly, averaging 42.940 % from Dec 1960 (Median) to 2023, with 64 observations. The data reached an all-time high of 63.238 % in 1960 and a record low of 28.814 % in 2023. Bolivia BO: Rural Population: % of Total Population data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s Bolivia – Table BO.World Bank.WDI: Population and Urbanization Statistics. Rural population refers to people living in rural areas as defined by national statistical offices. It is calculated as the difference between total population and urban population.;World Bank staff estimates based on the United Nations Population Division's World Urbanization Prospects: 2018 Revision.;Weighted average;

  3. Household Income, Consumption and Expenditure Survey 2004-2005 - World Bank...

    • dev.ihsn.org
    • catalog.ihsn.org
    • +1more
    Updated Apr 25, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Central Statistical Agency (CSA) (2019). Household Income, Consumption and Expenditure Survey 2004-2005 - World Bank SHIP Harmonized Dataset - Ethiopia [Dataset]. https://dev.ihsn.org/nada/catalog/study/ETH_2004_HICES_v01_M_v01_A_SHIP
    Explore at:
    Dataset updated
    Apr 25, 2019
    Dataset provided by
    Central Statistical Agencyhttps://ess.gov.et/
    Authors
    Central Statistical Agency (CSA)
    Time period covered
    2004 - 2005
    Area covered
    Ethiopia
    Description

    Abstract

    Survey based Harmonized Indicators (SHIP) files are harmonized data files from household surveys that are conducted by countries in Africa. To ensure the quality and transparency of the data, it is critical to document the procedures of compiling consumption aggregation and other indicators so that the results can be duplicated with ease. This process enables consistency and continuity that make temporal and cross-country comparisons consistent and more reliable.

    Four harmonized data files are prepared for each survey to generate a set of harmonized variables that have the same variable names. Invariably, in each survey, questions are asked in a slightly different way, which poses challenges on consistent definition of harmonized variables. The harmonized household survey data present the best available variables with harmonized definitions, but not identical variables. The four harmonized data files are

    a) Individual level file (Labor force indicators in a separate file): This file has information on basic characteristics of individuals such as age and sex, literacy, education, health, anthropometry and child survival. b) Labor force file: This file has information on labor force including employment/unemployment, earnings, sectors of employment, etc. c) Household level file: This file has information on household expenditure, household head characteristics (age and sex, level of education, employment), housing amenities, assets, and access to infrastructure and services. d) Household Expenditure file: This file has consumption/expenditure aggregates by consumption groups according to Purpose (COICOP) of Household Consumption of the UN.

    Geographic coverage

    National

    Analysis unit

    • Individual level for datasets with suffix _I and _L
    • Household level for datasets with suffix _H and _E

    Universe

    The survey covered all de jure household members (usual residents).

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    Sample Frame The list of households obtained from the 2001/2 Ethiopian Agricultural Sample Enumeration (EASE) was used as a frame to select EAs from the rural part of the country. On the other hand, the list consisting of households by EA, which was obtained from the 2004 Ethiopian Urban Economic Establishment Census, (EUEEC), was used as a frame in order to select sample enumeration areas for the urban HICE survey. A fresh list of households from each urban and rural EA was prepared at the beginning of the survey period. This list was, thus, used as a frame in order to select households from sample EAs.

    Sample Design For the purpose of the survey the country was divided into three broad categories. That is; rural, major urban center and other urban center categories.

    Category I: Rural: - This category consists of the rural areas of eight regional states and two administrative councils (Addis Ababa and Dire Dawa) of the country, except Gambella region. Each region was considered to be a domain (Reporting Level) for which major findings of the survey are reported. This category comprises 10 reporting levels. A stratified two-stage cluster sample design was used to select samples in which the primary sampling units (PSUs) were EAs. Twelve households per sample EA were selected as a Second Stage Sampling Unit (SSU) to which the survey questionnaire were administered.

    Category II:- Major urban centers:- In this category all regional capitals (except Gambella region) and four additional urban centers having higher population sizes as compared to other urban centers were included. Each urban center in this category was considered as a reporting level. However, each sub-city of Addis Ababa was considered to be a domain (reporting levels). Since there is a high variation in the standards of living of the residents of these urban centers (that may have a significant impact on the final results of the survey), each urban center was further stratified into the following three sub-strata. Sub-stratum 1:- Households having a relatively high standards of living Sub-stratum 2:- Households having a relatively medium standards of living and Sub-stratum 3:- Households having a relatively low standards of living. The category has a total of 14 reporting levels. A stratified two-stage cluster sample design was also adopted in this instance. The primary sampling units were EAs of each urban center. Allocation of sample EAs of a reporting level among the above mentioned strata were accomplished in proportion to the number of EAs each stratum consists of. Sixteen households from each sample EA were inally selected as a Secondary Sampling Unit (SSU).

    Category III: - Other urban centers: - Urban centers in the country other than those under category II were grouped into this category. Excluding Gambella region a domain of "other urban centers" is formed for each region. Consequently, 7 reporting levels were formed in this category. Harari, Addis Ababa and Dire Dawa do not have urban centers other than that grouped in category II. Hence, no domain was formed for these regions under this category. Unlike the above two categories a stratified three-stage cluster sample design was adopted to select samples from this category. The primary sampling units were urban centers and the second stage sampling units were EAs. Sixteen households from each EA were lastly selected at the third stage and the survey questionnaires administered for all of them.

    Mode of data collection

    Face-to-face [f2f]

  4. Cameroon CM: Rural Population

    • ceicdata.com
    Updated Feb 27, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com (2018). Cameroon CM: Rural Population [Dataset]. https://www.ceicdata.com/en/cameroon/population-and-urbanization-statistics/cm-rural-population
    Explore at:
    Dataset updated
    Feb 27, 2018
    Dataset provided by
    CEIC Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 1, 2012 - Dec 1, 2023
    Area covered
    Cameroon
    Variables measured
    Population
    Description

    Cameroon CM: Rural Population data was reported at 11,543,428.000 Person in 2023. This records an increase from the previous number of 11,403,216.000 Person for 2022. Cameroon CM: Rural Population data is updated yearly, averaging 7,039,305.000 Person from Dec 1960 (Median) to 2023, with 64 observations. The data reached an all-time high of 11,543,428.000 Person in 2023 and a record low of 4,440,039.000 Person in 1960. Cameroon CM: Rural Population data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s Cameroon – Table CM.World Bank.WDI: Population and Urbanization Statistics. Rural population refers to people living in rural areas as defined by national statistical offices. It is calculated as the difference between total population and urban population. Aggregation of urban and rural population may not add up to total population because of different country coverages.;World Bank staff estimates based on the United Nations Population Division's World Urbanization Prospects: 2018 Revision.;Sum;

  5. Living Standards Survey IV 1998-1999 - World Bank SHIP Harmonized Dataset -...

    • catalog.ihsn.org
    • dev.ihsn.org
    • +2more
    Updated Mar 29, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ghana Statistical Service (GSS) (2019). Living Standards Survey IV 1998-1999 - World Bank SHIP Harmonized Dataset - Ghana [Dataset]. https://catalog.ihsn.org/catalog/2359
    Explore at:
    Dataset updated
    Mar 29, 2019
    Dataset provided by
    Ghana Statistical Services
    Authors
    Ghana Statistical Service (GSS)
    Time period covered
    1998 - 1999
    Area covered
    Ghana
    Description

    Abstract

    Survey based Harmonized Indicators (SHIP) files are harmonized data files from household surveys that are conducted by countries in Africa. To ensure the quality and transparency of the data, it is critical to document the procedures of compiling consumption aggregation and other indicators so that the results can be duplicated with ease. This process enables consistency and continuity that make temporal and cross-country comparisons consistent and more reliable.

    Four harmonized data files are prepared for each survey to generate a set of harmonized variables that have the same variable names. Invariably, in each survey, questions are asked in a slightly different way, which poses challenges on consistent definition of harmonized variables. The harmonized household survey data present the best available variables with harmonized definitions, but not identical variables. The four harmonized data files are

    a) Individual level file (Labor force indicators in a separate file): This file has information on basic characteristics of individuals such as age and sex, literacy, education, health, anthropometry and child survival. b) Labor force file: This file has information on labor force including employment/unemployment, earnings, sectors of employment, etc. c) Household level file: This file has information on household expenditure, household head characteristics (age and sex, level of education, employment), housing amenities, assets, and access to infrastructure and services. d) Household Expenditure file: This file has consumption/expenditure aggregates by consumption groups according to Purpose (COICOP) of Household Consumption of the UN.

    Geographic coverage

    National

    Analysis unit

    • Individual level for datasets with suffix _I and _L
    • Household level for datasets with suffix _H and _E

    Universe

    The survey covered all de jure household members (usual residents).

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    SAMPLE DESIGN FOR ROUND 4 OF THE GLSS A nationally representative sample of households was selected in order to achieve the survey objectives.

    Sample Frame For the purposes of this survey the list of the 1984 population census Enumeration Areas (EAs) with population and household information was used as the sampling frame. The primary sampling units were the 1984 EAs with the secondary units being the households in the EAs. This frame, though quite old, was considered inadequate, it being the best available at the time. Indeed, this frame was used for the earlier rounds of the GLSS.

    Stratification In order to increase precision and reliability of the estimates, the technique of stratification was employed in the sample design, using geographical factors, ecological zones and location of residence as the main controls. Specifically, the EAs were first stratified according to the three ecological zones namely; Coastal, Forest and Savannah, and then within each zone further stratification was done based on the size of the locality into rural or urban.

    SAMPLE SELECTION EAs A two-stage sample was selected for the survey. At the first stage, 300 EAs were selected using systematic sampling with probability proportional to size method (PPS) where the size measure is the 1984 number of households in the EA. This was achieved by ordering the list of EAs with their sizes according to the strata. The size column was then cumulated, and with a random start and a fixed interval the sample EAs were selected.

    It was observed that some of the selected EAs had grown in size over time and therefore needed segmentation. In this connection, such EAs were divided into approximately equal parts, each segment constituting about 200 households. Only one segment was then randomly selected for listing of the households.

    Households At the second stage, a fixed number of 20 households was systematically selected from each selected EA to give a total of 6,000 households. Additional 5 households were selected as reserve to replace missing households. Equal number of households was selected from each EA in order to reflect the labour force focus of the survey.

    NOTE: The above sample selection procedure deviated slightly from that used for the earlier rounds of the GLSS, as such the sample is not self-weighting. This is because, 1. given the long period between 1984 and the GLSS 4 fieldwork the number of households in the various EAs are likely to have grown at different rates. 2. the listing exercise was not properly done as some of the selected EAs were not listed completely. Moreover, it was noted that the segmentation done for larger EAs during the listing was a bit arbitrary.

    Mode of data collection

    Face-to-face [f2f]

  6. Ghana GH: Rural Population

    • ceicdata.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com (2021). Ghana GH: Rural Population [Dataset]. https://www.ceicdata.com/en/ghana/population-and-urbanization-statistics/gh-rural-population
    Explore at:
    Dataset provided by
    CEIC Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 1, 2005 - Dec 1, 2016
    Area covered
    Ghana
    Variables measured
    Population
    Description

    Ghana GH: Rural Population data was reported at 12,857,780.000 Person in 2017. This records an increase from the previous number of 12,763,826.000 Person for 2016. Ghana GH: Rural Population data is updated yearly, averaging 9,077,181.000 Person from Dec 1960 (Median) to 2017, with 58 observations. The data reached an all-time high of 12,857,780.000 Person in 2017 and a record low of 5,105,497.000 Person in 1960. Ghana GH: Rural Population data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s Ghana – Table GH.World Bank.WDI: Population and Urbanization Statistics. Rural population refers to people living in rural areas as defined by national statistical offices. It is calculated as the difference between total population and urban population. Aggregation of urban and rural population may not add up to total population because of different country coverages.; ; World Bank staff estimates based on the United Nations Population Division's World Urbanization Prospects: 2018 Revision.; Sum;

  7. Canada CA: Rural Population: % of Total Population

    • ceicdata.com
    Updated Dec 15, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com (2022). Canada CA: Rural Population: % of Total Population [Dataset]. https://www.ceicdata.com/en/canada/population-and-urbanization-statistics/ca-rural-population--of-total-population
    Explore at:
    Dataset updated
    Dec 15, 2022
    Dataset provided by
    CEIC Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 1, 2012 - Dec 1, 2023
    Area covered
    Canada
    Variables measured
    Population
    Description

    Canada CA: Rural Population: % of Total Population data was reported at 18.138 % in 2023. This records a decrease from the previous number of 18.248 % for 2022. Canada CA: Rural Population: % of Total Population data is updated yearly, averaging 23.246 % from Dec 1960 (Median) to 2023, with 64 observations. The data reached an all-time high of 30.939 % in 1960 and a record low of 18.138 % in 2023. Canada CA: Rural Population: % of Total Population data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s Canada – Table CA.World Bank.WDI: Population and Urbanization Statistics. Rural population refers to people living in rural areas as defined by national statistical offices. It is calculated as the difference between total population and urban population.;World Bank staff estimates based on the United Nations Population Division's World Urbanization Prospects: 2018 Revision.;Weighted average;

  8. S

    Data from: A standardized dataset of built-up areas of China’s cities with...

    • scidb.cn
    Updated Jul 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jiang Huiping; Sun Zhongchang; Guo Huadong; Du Wenjie; Xing Qiang; Cai Guoyin (2021). A standardized dataset of built-up areas of China’s cities with populations over 300,000 for the period 1990–2015 [Dataset]. http://doi.org/10.11922/sciencedb.j00076.00004
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 7, 2021
    Dataset provided by
    Science Data Bank
    Authors
    Jiang Huiping; Sun Zhongchang; Guo Huadong; Du Wenjie; Xing Qiang; Cai Guoyin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description

    Here we used remote sensing data from multiple sources (time-series of Landsat and Sentinel images) to map the impervious surface area (ISA) at five-year intervals from 1990 to 2015, and then converted the results into a standardized dataset of the built-up area for 433 Chinese cities with 300,000 inhabitants or more, which were listed in the United Nations (UN) World Urbanization Prospects (WUP) database (including Mainland China, Hong Kong, Macao and Taiwan). We employed a range of spectral indices to generate the 1990–2015 ISA maps in urban areas based on remotely sensed data acquired from multiple sources. In this process, various types of auxiliary data were used to create the desired products for urban areas through manual segmentation of peri-urban and rural areas together with reference to several freely available products of urban extent derived from ISA data using automated urban–rural segmentation methods. After that, following the well-established rules adopted by the UN, we carried out the conversion to the standardized built-up area products from the 1990–2015 ISA maps in urban areas, which conformed to the definition of urban agglomeration area (UAA). Finally, we implemented data postprocessing to guarantee the spatial accuracy and temporal consistency of the final product.The standardized urban built-up area dataset (SUBAD–China) introduced here is the first product using the same definition of UAA adopted by the WUP database for 433 county and higher-level cities in China. The comparisons made with contemporary data produced by the National Bureau of Statistics of China, the World Bank and UN-habitat indicate that our results have a high spatial accuracy and good temporal consistency and thus can be used to characterize the process of urban expansion in China.The SUBAD–China contains 2,598 vector files in shapefile format containing data for all China's cities listed in the WUP database that have different urban sizes and income levels with populations over 300,000. Attached with it, we also provided the distribution of validation points for the 1990–2010 ISA products of these 433 Chinese cities in shapefile format and the confusion matrices between classified data and reference data during different time periods as a Microsoft Excel Open XML Spreadsheet (XLSX) file.Furthermore, The standardized built-up area products for such cities will be consistently updated and refined to ensure the quality of their spatiotemporal coverage and accuracy. The production of this dataset together with the usage of population counts derived from the WUP database will close some of the data gaps in the calculation of SDG11.3.1 and benefit other downstream applications relevant to a combined analysis of the spatial and socio-economic domains in urban areas.

  9. a

    Electricity Access, Africa

    • hub.arcgis.com
    Updated Jan 20, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UN Environment, Early Warning &Data Analytics (2016). Electricity Access, Africa [Dataset]. https://hub.arcgis.com/maps/9ec221b2a63745e586ac258e0827c6a5
    Explore at:
    Dataset updated
    Jan 20, 2016
    Dataset authored and provided by
    UN Environment, Early Warning &Data Analytics
    Area covered
    Description

    This map shows electricity access in Africa. The data source is from the International Energy Agency’s World Energy Outlook. The International Energy Agency’s World Energy Outlook first constructed a database on electrification rates for WEO-2002. The database once again was updated for WEO-2015, showing detailed data on national, urban and rural electrification.

    The general paucity of data on electricity access means that it must be gathered through a combination of sources, including: IEA energy statistics; a network of contacts spanning governments, multilateral development banks and country-level representatives of various international organisations; and, other publicly available statistics, such as US Agency for International Development (USAID) supported DHS survey data, the World Bank’s Living Standards Measurement Surveys (LSMS), the UN Economic Commission for Latin America and the Caribbean’s (ECLAC) statistical publications, and data from national statistics agencies. In the small number of cases where no data could be provided through these channels other sources were used. If electricity access data for 2013 was not available, data for the latest available year was used.

    For many countries, data on the urban and rural breakdown was collected, but if not available an estimate was made on the basis of pre-existing data or a comparison to the average correlation between urban and national electrification rates. Often only the percentage of households with a connection is known and assumptions about an average household size are used to determine access rates as a percentage of the population. To estimate the number of people without access, population data comes from OECD statistics in conjunction with the United Nations Population Division reports World Urbanization Prospects: the 2014 Revision Population Database, and World Population Prospects: the 2012 Revision. Electricity access data is adjusted to be consistent with demographic patterns of urban and rural population. Due to differences in definitions and methodology from different sources, data quality may vary from country to country. Where country data appeared contradictory, outdated or unreliable, the IEA Secretariat made estimates based on cross-country comparisons and earlier surveys.

  10. i

    Household Income, Consumption and Expenditure Survey 1999-2000 - World Bank...

    • dev.ihsn.org
    • catalog.ihsn.org
    • +2more
    Updated Apr 25, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Central Statistical Authority (CSA) (2019). Household Income, Consumption and Expenditure Survey 1999-2000 - World Bank SHIP Harmonized Dataset - Ethiopia [Dataset]. https://dev.ihsn.org/nada/catalog/study/ETH_2000_HICES_v01_M_v01_A_SHIP
    Explore at:
    Dataset updated
    Apr 25, 2019
    Dataset authored and provided by
    Central Statistical Authority (CSA)
    Time period covered
    1999 - 2000
    Area covered
    Ethiopia
    Description

    Abstract

    Survey based Harmonized Indicators (SHIP) files are harmonized data files from household surveys that are conducted by countries in Africa. To ensure the quality and transparency of the data, it is critical to document the procedures of compiling consumption aggregation and other indicators so that the results can be duplicated with ease. This process enables consistency and continuity that make temporal and cross-country comparisons consistent and more reliable.

    Four harmonized data files are prepared for each survey to generate a set of harmonized variables that have the same variable names. Invariably, in each survey, questions are asked in a slightly different way, which poses challenges on consistent definition of harmonized variables. The harmonized household survey data present the best available variables with harmonized definitions, but not identical variables. The four harmonized data files are

    a) Individual level file (Labor force indicators in a separate file): This file has information on basic characteristics of individuals such as age and sex, literacy, education, health, anthropometry and child survival. b) Labor force file: This file has information on labor force including employment/unemployment, earnings, sectors of employment, etc. c) Household level file: This file has information on household expenditure, household head characteristics (age and sex, level of education, employment), housing amenities, assets, and access to infrastructure and services. d) Household Expenditure file: This file has consumption/expenditure aggregates by consumption groups according to Purpose (COICOP) of Household Consumption of the UN.

    Geographic coverage

    National

    Analysis unit

    • Individual level for datasets with suffix _I and _L
    • Household level for datasets with suffix _H and _E

    Universe

    The survey covered all de jure household members (usual residents).

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    Sample Design The 1999/2000 Household Income, Consurnption, and Expendi.ture Survey covered both the urban and the sedentary rural parts of the country. The survey has not covered six zones in Somalia Region and two zones in Afar Region that are inhabited mainly by nomadic population. For the purpose of the survey, the country was divided into three categories . That is, the rural parts of the country and the urban areas that were divided into two broad categories taking into account sizes of their population. Category I: Rural parts of nine Regional States and two administrative regions were grouped in this category each of which were the survey dornains (reporting levels). These regions are Tigrai,Afar, Amhara, Oromia, Sornalia, Eenishangul-Gunuz, SNNP,Gambela, Flarari, Addis Ababa and Dire Dawa.

    Category II: All Regional capitals and five major urban centers of the country were grouped in this category. Each of the urban centers in this category was the survey domain (reporting level) for which separate survey results for rnajor survey characteristics were reported.

    Category III: Urban centers in the country other than the urban centers in category II were grouped in this category and formed a single reporting level. Other than the reporting levels defined in category II and category III one additional domain, namely total urban (country level) can be constructed by eombining the basic domains defined in the two categories. All in all 35'basie rural and urban domains (reporting levels) were defined for the survey. In addition to the above urban and rural domains, survey results are to be reported at regional and eountry levels by aggregating the survey results for the conesponding urban and rural areas. Definition of the survey dornains was based on both technical and resource considerations. More specifically, sample size for the domains were determined to enable provision of major indicators with reasonable precision subject to the resources that were available for the survey.

    Selection Scheme and Sample Size in Each Category CategoryI : A stratified two-stage sample design was used to select the sample in which the primary sampling units (PSUs) were EAs. Sample enumeration areas( EAs) from each domain were selected using systematic sampling that is probability proportional to the size being number of households obtained from the 1994 population and housing census.A total of 722 EAs were selected from the rural parts of the country. Within each sample EA a fresh list of households was prepared at the beginning of the survey's field work and for the administration of the survey questionnaire 12 households per sample EA for rural areas were systematically selected.

    Category II: In this category also,a stratified two-stage sample design was used to select the sample. Here a strata constitutes all the "Regional State Capitals" and the five "Major Urban Centers" in the country and are grouped as a strata in this category. The primary sampling units (PSUs) are the EA's in the Regional State Capitals and the five Major Urban Centers and excludes the special EAs (non-conventional households). Sample enumeration areas( EAs) from each strata were selected using systematic sampling probability proportional to size, size being number of households obtained from the 1994 population and housing census. A total of 373 EAs were selected from this domain of study. Within each sample EAs a fresh list of households was prepared at the beginning of the survey's field work and for the administration of the questionnaire 16 household per sample EA were systematically selected-

    Category III: Three-stage stratified sample design was adopted to select the sample from domains in category III. The PSUs were other urban centers selected using systematic sampling that is probability proportional to size; size being number of households obtained from the 1994 population and housing census. The secondary sampling units (SSUs) were EAs which were selected using systematic sampling that is probability proportional to size; size being number of households obtained from the 1994 population and housing census. A total of 169 sample EAs were selected from the sample of other urban centers and was determined by proportional allocation to their size of households from the 1994 census. Ultimately, 16 households within each of the sample EAs were selected systematically from a fresh list of households prepared at the beginning of the survey's fieldwork for the administration of the survey questionnaire.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The Household Income, Consumption and Expenditure Survey questionnaire contains the following forms: - Form 1: Area Identification and Household Characteristics - Form 2A: Quantity and value of weekly consumption of food and drinks consumed at home and tobacco/including quantity purchased, own produced, obtained, etc for first and second week. - Form 2B: Quantity and value of weekly consumption of food and drinks consumed at home and tobacco/including quantity purchased, own produced, obtained, etc for third and fourth week . - Form 3A: All transaction (income, expenditure and consumption) for the first and second weeks except what is collected in Forms 2A and 2B - Form 3B: All transaction (income, expenditure and consumption) for the third and fourth weeks except what is collected in Forms 2A and 2B - Form 4: All transaction (expenditure and consumption) for last 6 months for Household expenditure on some selected item groups - Form 5: Cash income and receipts received by household and type of tenure. The survey questionnaire is provided as external resource.

  11. a

    Electricity Access, Asia and the Pacific

    • hub.arcgis.com
    • sdgs-uneplive.opendata.arcgis.com
    Updated Jan 20, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UN Environment, Early Warning &Data Analytics (2016). Electricity Access, Asia and the Pacific [Dataset]. https://hub.arcgis.com/maps/286793bc9f1147da97e3accb6c52d5b5
    Explore at:
    Dataset updated
    Jan 20, 2016
    Dataset authored and provided by
    UN Environment, Early Warning &Data Analytics
    Area covered
    Description

    This map shows electricity access in Asia and the Pacific. The data source is from the International Energy Agency’s World Energy Outlook. The International Energy Agency’s World Energy Outlook first constructed a database on electrification rates for WEO-2002. The database once again was updated for WEO-2015, showing detailed data on national, urban and rural electrification.

    The general paucity of data on electricity access means that it must be gathered through a combination of sources, including: IEA energy statistics; a network of contacts spanning governments, multilateral development banks and country-level representatives of various international organisations; and, other publicly available statistics, such as US Agency for International Development (USAID) supported DHS survey data, the World Bank’s Living Standards Measurement Surveys (LSMS), the UN Economic Commission for Latin America and the Caribbean’s (ECLAC) statistical publications, and data from national statistics agencies. In the small number of cases where no data could be provided through these channels other sources were used. If electricity access data for 2013 was not available, data for the latest available year was used.

    For many countries, data on the urban and rural breakdown was collected, but if not available an estimate was made on the basis of pre-existing data or a comparison to the average correlation between urban and national electrification rates. Often only the percentage of households with a connection is known and assumptions about an average household size are used to determine access rates as a percentage of the population. To estimate the number of people without access, population data comes from OECD statistics in conjunction with the United Nations Population Division reports World Urbanization Prospects: the 2014 Revision Population Database, and World Population Prospects: the 2012 Revision. Electricity access data is adjusted to be consistent with demographic patterns of urban and rural population. Due to differences in definitions and methodology from different sources, data quality may vary from country to country. Where country data appeared contradictory, outdated or unreliable, the IEA Secretariat made estimates based on cross-country comparisons and earlier surveys.

  12. w

    Integrated Household Panel Survey 2010-2013-2016-2019 (Long-Term Panel, 102...

    • microdata.worldbank.org
    • catalog.ihsn.org
    • +1more
    Updated Jul 30, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Statistical Office (NSO) (2021). Integrated Household Panel Survey 2010-2013-2016-2019 (Long-Term Panel, 102 EAs) - Malawi [Dataset]. https://microdata.worldbank.org/index.php/catalog/3819
    Explore at:
    Dataset updated
    Jul 30, 2021
    Dataset authored and provided by
    National Statistical Office (NSO)
    Time period covered
    2010 - 2019
    Area covered
    Malawi
    Description

    Abstract

    The 2016 Integrated Household Panel Survey (IHPS) was launched in April 2016 as part of the Malawi Fourth Integrated Household Survey fieldwork operation. The IHPS 2016 targeted 1,989 households that were interviewed in the IHPS 2013 and that could be traced back to half of the 204 enumeration areas that were originally sampled as part of the Third Integrated Household Survey (IHS3) 2010/11. The 2019 IHPS was launched in April 2019 as part of the Malawi Fifth Integrated Household Survey fieldwork operations targeting the 2,508 households that were interviewed in 2016. The panel sample expanded each wave through the tracking of split-off individuals and the new households that they formed. Available as part of this project is the IHPS 2019 data, the IHPS 2016 data as well as the rereleased IHPS 2010 & 2013 data including only the subsample of 102 EAs with updated panel weights. Additionally, the IHPS 2016 was the first survey that received complementary financial and technical support from the Living Standards Measurement Study – Plus (LSMS+) initiative, which has been established with grants from the Umbrella Facility for Gender Equality Trust Fund, the World Bank Trust Fund for Statistical Capacity Building, and the International Fund for Agricultural Development, and is implemented by the World Bank Living Standards Measurement Study (LSMS) team, in collaboration with the World Bank Gender Group and partner national statistical offices. The LSMS+ aims to improve the availability and quality of individual-disaggregated household survey data, and is, at start, a direct response to the World Bank IDA18 commitment to support 6 IDA countries in collecting intra-household, sex-disaggregated household survey data on 1) ownership of and rights to selected physical and financial assets, 2) work and employment, and 3) entrepreneurship – following international best practices in questionnaire design and minimizing the use of proxy respondents while collecting personal information. This dataset is included here.

    Geographic coverage

    National coverage

    Analysis unit

    • Households
    • Individuals
    • Children under 5 years
    • Consumption expenditure commodities/items
    • Communities
    • Agricultural household/ Holder/ Crop

    Universe

    The IHPS 2016 and 2019 attempted to track all IHPS 2013 households stemming from 102 of the original 204 baseline panel enumeration areas as well as individuals that moved away from the 2013 dwellings between 2013 and 2016 as long as they were neither servants nor guests at the time of the IHPS 2013; were projected to be at least 12 years of age and were known to be residing in mainland Malawi but excluding those in Likoma Island and in institutions, including prisons, police compounds, and army barracks.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    A sub-sample of IHS3 2010 sample enumeration areas (EAs) (i.e. 204 EAs out of 768 EAs) was selected prior to the start of the IHS3 field work with the intention to (i) to track and resurvey these households in 2013 in accordance with the IHS3 fieldwork timeline and as part of the Integrated Household Panel Survey (IHPS 2013) and (ii) visit a total of 3,246 households in these EAs twice to reduce recall associated with different aspects of agricultural data collection. At baseline, the IHPS sample was selected to be representative at the national, regional, urban/rural levels and for each of the following 6 strata: (i) Northern Region - Rural, (ii) Northern Region - Urban, (iii) Central Region - Rural, (iv) Central Region - Urban, (v) Southern Region - Rural, and (vi) Southern Region - Urban. The IHPS 2013 main fieldwork took place during the period of April-October 2013, with residual tracking operations in November-December 2013.

    Given budget and resource constraints, for the IHPS 2016 the number of sample EAs in the panel was reduced to 102 out of the 204 EAs. As a result, the domains of analysis are limited to the national, urban and rural areas. Although the results of the IHPS 2016 cannot be tabulated by region, the stratification of the IHPS by region, urban and rural strata was maintained. The IHPS 2019 tracked all individuals 12 years or older from the 2016 households.

    Mode of data collection

    Computer Assisted Personal Interview [capi]

    Cleaning operations

    Data Entry Platform To ensure data quality and timely availability of data, the IHPS 2019 was implemented using the World Bank’s Survey Solutions CAPI software. To carry out IHPS 2019, 1 laptop computer and a wireless internet router were assigned to each team supervisor, and each enumerator had an 8–inch GPS-enabled Lenovo tablet computer that the NSO provided. The use of Survey Solutions allowed for the real-time availability of data as the completed data was completed, approved by the Supervisor and synced to the Headquarters server as frequently as possible. While administering the first module of the questionnaire the enumerator(s) also used their tablets to record the GPS coordinates of the dwelling units. Geo-referenced household locations from that tablet complemented the GPS measurements taken by the Garmin eTrex 30 handheld devices and these were linked with publically available geospatial databases to enable the inclusion of a number of geospatial variables - extensive measures of distance (i.e. distance to the nearest market), climatology, soil and terrain, and other environmental factors - in the analysis.

    Data Management The IHPS 2019 Survey Solutions CAPI based data entry application was designed to stream-line the data collection process from the field. IHPS 2019 Interviews were mainly collected in “sample” mode (assignments generated from headquarters) and a few in “census” mode (new interviews created by interviewers from a template) for the NSO to have more control over the sample. This hybrid approach was necessary to aid the tracking operations whereby an enumerator could quickly create a tracking assignment considering that they were mostly working in areas with poor network connection and hence could not quickly receive tracking cases from Headquarters.

    The range and consistency checks built into the application was informed by the LSMS-ISA experience with the IHS3 2010/11, IHPS 2013 and IHPS 2016. Prior programming of the data entry application allowed for a wide variety of range and consistency checks to be conducted and reported and potential issues investigated and corrected before closing the assigned enumeration area. Headquarters (the NSO management) assigned work to the supervisors based on their regions of coverage. The supervisors then made assignments to the enumerators linked to their supervisor account. The work assignments and syncing of completed interviews took place through a Wi-Fi connection to the IHPS 2019 server. Because the data was available in real time it was monitored closely throughout the entire data collection period and upon receipt of the data at headquarters, data was exported to Stata for other consistency checks, data cleaning, and analysis.

    Data Cleaning The data cleaning process was done in several stages over the course of fieldwork and through preliminary analysis. The first stage of data cleaning was conducted in the field by the field-based field teams utilizing error messages generated by the Survey Solutions application when a response did not fit the rules for a particular question. For questions that flagged an error, the enumerators were expected to record a comment within the questionnaire to explain to their supervisor the reason for the error and confirming that they double checked the response with the respondent. The supervisors were expected to sync the enumerator tablets as frequently as possible to avoid having many questionnaires on the tablet, and to enable daily checks of questionnaires. Some supervisors preferred to review completed interviews on the tablets so they would review prior to syncing but still record the notes in the supervisor account and reject questionnaires accordingly. The second stage of data cleaning was also done in the field, and this resulted from the additional error reports generated in Stata, which were in turn sent to the field teams via email or DropBox. The field supervisors collected reports for their assignments and in coordination with the enumerators reviewed, investigated, and collected errors. Due to the quick turn-around in error reporting, it was possible to conduct call-backs while the team was still operating in the EA when required. Corrections to the data were entered in the rejected questionnaires and sent back to headquarters.

    The data cleaning process was done in several stages over the course of the fieldwork and through preliminary analyses. The first stage was during the interview itself. Because CAPI software was used, as enumerators asked the questions and recorded information, error messages were provided immediately when the information recorded did not match previously defined rules for that variable. For example, if the education level for a 12 year old respondent was given as post graduate. The second stage occurred during the review of the questionnaire by the Field Supervisor. The Survey Solutions software allows errors to remain in the data if the enumerator does not make a correction. The enumerator can write a comment to explain why the data appears to be incorrect. For example, if the previously mentioned 12 year old was, in fact, a genius who had completed graduate studies. The next stage occurred when the data were transferred to headquarters where the NSO staff would again review the data for errors and verify the comments from the

  13. w

    National Panel Survey 2008-2015, Uniform Panel Dataset - Tanzania

    • microdata.worldbank.org
    • datacatalog.ihsn.org
    • +1more
    Updated Mar 17, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Bureau of Statistics (2021). National Panel Survey 2008-2015, Uniform Panel Dataset - Tanzania [Dataset]. https://microdata.worldbank.org/index.php/catalog/3814
    Explore at:
    Dataset updated
    Mar 17, 2021
    Dataset authored and provided by
    National Bureau of Statistics
    Time period covered
    2008 - 2015
    Area covered
    Tanzania
    Description

    Abstract

    Panel data possess several advantages over conventional cross-sectional and time-series data, including their power to isolate the effects of specific actions, treatments, and general policies often at the core of large-scale econometric development studies. While the concept of panel data alone provides the capacity for modeling the complexities of human behavior, the notion of universal panel data – in which time- and situation-driven variances leading to variations in tools, and thus results, are mitigated – can further enhance exploitation of the richness of panel information.

    This Basic Information Document (BID) provides a brief overview of the Tanzania National Panel Survey (NPS), but focuses primarily on the theoretical development and application of panel data, as well as key elements of the universal panel survey instrument and datasets generated by the four rounds of the NPS. As this Basic Information Document (BID) for the UPD does not describe in detail the background, development, or use of the NPS itself, the round-specific NPS BIDs should supplement the information provided here.

    The NPS Uniform Panel Dataset (UPD) consists of both survey instruments and datasets, meticulously aligned and engineered with the aim of facilitating the use of and improving access to the wealth of panel data offered by the NPS. The NPS-UPD provides a consistent and straightforward means of conducting not only user-driven analyses using convenient, standardized tools, but also for monitoring MKUKUTA, FYDP II, and other national level development indicators reported by the NPS.

    The design of the NPS-UPD combines the four completed rounds of the NPS – NPS 2008/09 (R1), NPS 2010/11 (R2), NPS 2012/13 (R3), and NPS 2014/15 (R4) – into pooled, module-specific survey instruments and datasets. The panel survey instruments offer the ease of comparability over time, with modifications and variances easily identifiable as well as those aspects of the questionnaire which have remained identical and offer consistent information. By providing all module-specific data over time within compact, pooled datasets, panel datasets eliminate the need for user-generated merges between rounds and present data in a clear, logical format, increasing both the usability and comprehension of complex data.

    Geographic coverage

    Designed for analysis of key indicators at four primary domains of inference, namely: Dar es Salaam, other urban, rural, Zanzibar.

    Analysis unit

    • Households
    • Individuals

    Universe

    The universe includes all households and individuals in Tanzania with the exception of those residing in military barracks or other institutions.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    While the same sample of respondents was maintained over the first three rounds of the NPS, longitudinal surveys tend to suffer from bias introduced by households leaving the survey over time; i.e. attrition. Although the NPS maintains a highly successful recapture rate (roughly 96% retention at the household level), minimizing the escalation of this selection bias, a refresh of longitudinal cohorts was done for the NPS 2014/15 to ensure proper representativeness of estimates while maintaining a sufficient primary sample to maintain cohesion within panel analysis. A newly completed Population and Housing Census (PHC) in 2012, providing updated population figures along with changes in administrative boundaries, emboldened the opportunity to realign the NPS sample and abate collective bias potentially introduced through attrition.

    To maintain the panel concept of the NPS, the sample design for NPS 2014/2015 consisted of a combination of the original NPS sample and a new NPS sample. A nationally representative sub-sample was selected to continue as part of the “Extended Panel” while an entirely new sample, “Refresh Panel”, was selected to represent national and sub-national domains. Similar to the sample in NPS 2008/2009, the sample design for the “Refresh Panel” allows analysis at four primary domains of inference, namely: Dar es Salaam, other urban areas on mainland Tanzania, rural mainland Tanzania, and Zanzibar. This new cohort in NPS 2014/2015 will be maintained and tracked in all future rounds between national censuses.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The format of the NPS-UPD survey instrument is similar to previously disseminated NPS survey instruments. Each module has a questionnaire and clearly identifies if the module collects information at the individual or household level. Within each module-specific questionnaire of the NPS-UPD survey instrument, there are five distinct sections, arranged vertically: (1) the UPD - “U” on the survey instrument, (2) R4, (3), R3, (4) R2, and (5) R1 – the latter 4 sections presenting each questionnaire in its original form at time of its respective dissemination.

    The uppermost section of each module’s questionnaire (“U”) represents the model universal panel questionnaire, with questions generated from the comprehensive listing of questions across all four rounds of the NPS and codes generated from the comprehensive collection of codes. The following sections are arranged vertically by round, considering R4 as most recent. While not all rounds will have data reported for each question in the UPD and not each question will have reports for each of the UPD codes listed, the NPS-UPD survey instrument represents the visual, all-inclusive set of information collected by the NPS over time.

    The four round-specific sections (R4, R3, R2, R1) are aligned with their UPD-equivalent question, visually presenting their contribution to compatibility with the UPD. Each round-specific section includes the original round-specific variable names, response codes and skip patterns (corresponding to their respective round-specific NPS data sets, and despite their variance from other rounds or from the comprehensive UPD code listing)4.

  14. w

    Measuring Income Inequality (Deininger and Squire) Database 1890-1996 -...

    • microdata.worldbank.org
    • catalog.ihsn.org
    • +1more
    Updated Oct 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Klaus W. Deininger and Lyn Squire (2023). Measuring Income Inequality (Deininger and Squire) Database 1890-1996 - Argentina, Australia, Austria...and 99 more [Dataset]. https://microdata.worldbank.org/index.php/catalog/1790
    Explore at:
    Dataset updated
    Oct 26, 2023
    Dataset authored and provided by
    Klaus W. Deininger and Lyn Squire
    Time period covered
    1890 - 1996
    Area covered
    Austria, Australia
    Description

    Abstract

    This file contains data on Gini coefficients, cumulative quintile shares, explanations regarding the basis on which the Gini coefficient was computed, and the source of the information. There are two data-sets, one containing the "high quality" sample and the other one including all the information (of lower quality) that had been collected.

    The database was constructed for the production of the following paper:

    Deininger, Klaus and Lyn Squire, "A New Data Set Measuring Income Inequality", The World Bank Economic Review, 10(3): 565-91, 1996.

    This article presents a new data set on inequality in the distribution of income. The authors explain the criteria they applied in selecting data on Gini coefficients and on individual quintile groups’ income shares. Comparison of the new data set with existing compilations reveals that the data assembled here represent an improvement in quality and a significant expansion in coverage, although differences in the definition of the underlying data might still affect intertemporal and international comparability. Based on this new data set, the authors do not find a systematic link between growth and changes in aggregate inequality. They do find a strong positive relationship between growth and reduction of poverty.

    Geographic coverage

    In what follows, we provide brief descriptions of main features for individual countries that are included in the data-base. Without being comprehensive, these notes are intended to indicate some of the considerations underlying our decision to include or exclude certain observations.

    Argentina Various permanent household surveys, all covering urban centers only, have been regularly conducted since 1972 and are quoted in a wide variety of sources and years, e.g., for 1980 (World Bank 1992), 1985 (Altimir 1994), and 1989 (World Bank 1992). Estimates for 1963, 1965, 1969/70, 1970/71, 1974, 1975, 1980, and 1981 (Altimir 1987) are based only on Greater Buenos Aires. Estimates for 1961, 1963, 1970 (Jain 1975) and for 1970 (van Ginneken 1984) have only limited geographic coverage and do not satisfy our minimum criteria.

    Despite the many urban surveys, there are no income distribution data that are representative of the population as a whole. References to national income distribution for the years 1953, 1959, and 1961(CEPAL 1968 in Altimir 1986 ) are based on extrapolation from national accounts and have therefore not been included. Data for 1953 and 1961 from Weisskoff (1970) , from Lecaillon (1984) , and from Cromwell (1977) are also excluded.

    Australia Household surveys, the result of which is reported in the statistical yearbook, have been conducted in 1968/9, 1975/6, 1978/9, 1981, 1985, 1986, 1989, and 1990.

    Data for 1962 (Cromwell, 1977) and 1966/67 (Sawyer 1976) were excluded as they covered only tax payers. Jain's data for 1970 was excluded because it covered income recipients only. Data from Podder (1972) for 1967/68, from Jain (1975) for the same year, from UN (1985) for 78/79, from Sunders and Hobbes (1993) for 1986 and for 1989 were excluded given the availability of the primary sources. Data from Bishop (1991) for 1981/82, from Buhman (1988) for 1981/82, from Kakwani (1986) for 1975/76, and from Sunders and Hobbes (1993) for 1986 were utilized to test for the effect of different definitions. The values for 1967 used by Persson and Tabellini and Alesina and Rodrik (based on Paukert and Jain) are close to the ones reported in the Statistical Yearbook for 1969.

    Austria: In addition to data referring to the employed population (Guger 1989), national household surveys for 1987 and 1991 are included in the LIS data base. As these data do not include income from self-employment, we do not report them in our high quality data-set.

    Bahamas Data for Ginis and shares are available for 1973, 1977, 1979, 1986, 1988, 1989, 1991, 1992, and 1993 in government reports on population censuses and household budget surveys, and for 1973 and 1975 from UN (1981). Estimates for 1970 (Jain 1975), 1973, 1975, 1977, and 1979 (Fields 1989) have been excluded given the availability of primary sources.

    Bangladesh Data from household surveys for 1973/74, 1976/77, 1977/78, 1981/82, and 1985/86 are available from the Statistical Yearbook, complemented by household-survey based information from Chen (1995) and the World Development Report. Household surveys with rural coverage for 1959, 1960, 1963/64, 1965, 1966/67 and 1968/69, and with urban coverage for 1963/64, 1965, 1966/67, and 1968/69 are also available from the Statistical yearbook. Data for 1963/64 ,1964 and 1966/67, (Jain 1975) are not included due to limited geographic coverage, We also excluded secondary sources for 1973/74, 1976/77, 1981/82 (Fields 1989), 1977 (UN 1981), 1983 (Milanovic 1994), and 1985/86 due to availability of the primary source.

    Barbados National household surveys have been conducted in 1951/52 and 1978/79 (Downs, 1988). Estimates based on personal tax returns, reported consistently for 1951-1981 (Holder and Prescott, 1989), had to be excluded as they exclude the non-wage earning population. Jain's figure (used by Alesina and Rodrik) is based on the same source.

    Belgium Household surveys with national coverage are available for 1978/79 (UN 1985), and for 1985, 1988, and 1992 (LIS 1995). Earlier data for 1969, 1973, 1975, 1976 and 1977 (UN 1981) refer to taxable households only and are not included.

    Bolivia The only survey with national coverage is the 1990 LSMS (World Development Report). Surveys for 1986 and 1989 cover the main cities only (Psacharopoulos et al. 1992) and are therefore not included. Data for 1968 (Cromwell 1977) do not refer to a clear definition and is therefore excluded.

    Botswana The only survey with national coverage was conducted in 1985-1986 (Chen et al 1993); surveys in 74/75 and 85/86 included rural areas only (UN 1981). We excluded Gini estimates for 1971/72 that refer to the economically active population only (Jain 1975), as well as 1974/75 and 1985/86 (Valentine 1993) due to lack of national coverage or consistency in definition.

    Brazil Data from 1960, 1970, 1974/75, 1976, 1977, 1978, 1980, 1982, 1983, 1985, 1987 and 1989 are available from the statistical yearbook, in addition to data for 1978 (Fields 1987) and for 1979 (Psacharopoulos et al. 1992). Other sources have been excluded as they were either not of national coverage, based on wage earners only, or because a more consistent source was available.

    Bulgaria: Data from household surveys are available for 1963-69 (in two year intervals), for 1970-90 (on an annual basis) from the Statistical yearbook and for 1991 - 93 from household surveys by the World Bank (Milanovic and Ying).

    Burkina Faso A priority survey has been undertaken in 1995.

    Central African Republic: Except for a household survey conducted in 1992, no information was available.

    Cameroon The only data are from a 1983/4 household budget survey (World Bank Poverty Assessment).

    Canada Gini- and share data for the 1950-61 (in irregular intervals), 1961-81 (biennially), and 1981-91 (annually) are available from official sources (Statistical Yearbook for years before 1971 and Income Distributions by Size in Canada for years since 1973, various issues). All other references seem to be based on these primary sources.

    Chad: An estimate for 1958 is available in the literature, and used by Alesina and Rodrik and Persson and Tabellini but was not included due to lack of primary sources.

    Chile The first nation-wide survey that included not only employment income was carried out in 1968 (UN 1981). This is complemented by household survey-based data for 1971 (Fields 1989), 1989, and 1994. Other data that refer either only to part of the population or -as in the case of a long series available from World Bank country operations- are not clearly based on primary sources, are excluded.

    China Annual household surveys from 1980 to 1992, conducted separately in rural and urban areas, were consolidated by Ying (1995), based on the statistical yearbook. Data from other secondary sources are excluded due to limited geographic and population coverage and data from Chen et al (1993) for 1985 and 1990 have not been included, to maintain consistency of sources..

    Colombia The first household survey with national coverage was conducted in 1970 (DANE 1970). In addition, there are data for 1971, 1972, 1974 CEPAL (1986), and for 1978, 1988/89, and 1991 (World Bank Poverty Assessment 1992 and Chen et al. 1995). Data referring to years before 1970 -including the 1964 estimate used in Persson and Tabellini were excluded, as were estimates for the wage earning population only.

    Costa Rica Data on Gini coefficients and quintile shares are available for 1961, 1971 (Cespedes 1973),1977 (OPNPE 1982), 1979 (Fields 1989), 1981 (Chen et al 1993), 1983 (Bourguignon and Morrison 1989), 1986 (Sauma-Fiatt 1990), and 1989 (Chen et al 1993). Gini coefficients for 1971 (Gonzalez-Vega and Cespedes in Rottenberg 1993), 1973 and 1985 (Bourguignon and Morrison 1989) cover urban areas only and were excluded.

    Cote d'Ivoire: Data based on national-level household surveys (LSMS) are available for 1985, 1986, 1987, 1988, and 1995. Information for the 1970s (Schneider 1991) is based on national accounting information and therefore excluded

    Cuba Official information on income distribution is limited. Data from secondary sources are available for 1953, 1962, 1973, and 1978, relying on personal wage income, i.e. excluding the population that is not economically active (Brundenius 1984).

    Czech Republic Household surveys for 1993 and 1994 were obtained from Milanovic and Ying. While it is in principle possible to go back further, splitting national level surveys for the former Czechoslovakia into their independent parts, we decided not to do so as the same argument could be used to

  15. Somaliland Household Survey 2013, Adapted for the Somali High Frequency...

    • microdata.worldbank.org
    • datacatalog.ihsn.org
    • +1more
    Updated May 28, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The World Bank Group (2019). Somaliland Household Survey 2013, Adapted for the Somali High Frequency Survey - Somalia [Dataset]. https://microdata.worldbank.org/index.php/catalog/2818
    Explore at:
    Dataset updated
    May 28, 2019
    Dataset provided by
    World Bank Grouphttp://www.worldbank.org/
    World Bankhttp://worldbank.org/
    Authors
    The World Bank Group
    Time period covered
    2013
    Area covered
    Somalia
    Description

    Abstract

    In 2013, the World Bank, in collaboration with the Ministry of Planning and Development implemented the 2013 Somaliland Household Survey (SHS 2013). Somaliland self-declared independence in 1991. The survey interviewed 852 urban and 873 rural households. The sample was drawn randomly based on a multi-level clustered design. This dataset contains information on consumption, income and household characteristics. The sample is representative of urban Somaliland, and parts of rural Somaliland. It does not include nomadic households or those affected by the ongoing conflict. The data and code reproduce some of the results from the original submission of the SHS 2013, but also comparable poverty estimates to those obtained with the Somali High Frequency Survey.

    Geographic coverage

    The SHS 2013 sample is representative of urban Somaliland, and parts of rural Somaliland.

    Analysis unit

    Household

    Universe

    The sample does not include nomadic households (which recent estimates suggest comprises 36% of the population), and omits households in areas affected by ongoing conflict

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The SHS 2013 interviewed 852 urban and 873 rural households. The frame used for selecting enumeration areas (EAs) was a mixed frame where the database of EAs for the UNFPA urban survey was used for all urban areas (in two strata: Hergaisa and Other Urban). The rural frame used the list of polling stations which was provided by the electoral commission.The sample frame used was the 2012 cartographic list of enumeration areas.

    The sample employs a stratified two-staged clustered design with the Primary Sampling Unit (PSU) being the enumeration area. Within each enumeration area, 9 households were selected for interviews. Then, a listing approach was used to select these 9 households randomly for interviews. Three primary strata were defined as: rural, Hergaisa and other urban areas. The population proportion varied by stratum, and the general agreement in informal discussions was that about 50% of the population was urban and 50% were rural.

    Sampling deviation

    A total of 26 EAs had to be replaced, and except in the case of Hergaisa, the replacements were rural polling stations. The most prevalent problem in the rural area were in the Sool, Sannag and Sahil zones, and these were identified as problems with security. A practical approach was undertaken by using the “nearest secure neighbor”. The idea was to assure that the sample polling station in the same district had similar characteristics to those of the insecure sample PSU, in order to maintain the geographic representativeness of the sample and reduce the bias from the PSU nonresponse

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The SHS 2013 questionnaire is available under the Related Materials tab

    Cleaning operations

    Accompanying Stata do-files for carrying out the household analysis using the SHS 2013 data are provided under the Related Materials tab.

  16. Socio-Economic Panel Survey 2021-2022 - Ethiopia

    • microdata.worldbank.org
    • datacatalog.ihsn.org
    • +1more
    Updated Jan 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ethiopian Statistical Service (ESS) (2024). Socio-Economic Panel Survey 2021-2022 - Ethiopia [Dataset]. https://microdata.worldbank.org/index.php/catalog/6161
    Explore at:
    Dataset updated
    Jan 25, 2024
    Dataset provided by
    Central Statistical Agencyhttps://ess.gov.et/
    Authors
    Ethiopian Statistical Service (ESS)
    Time period covered
    2021 - 2022
    Area covered
    Ethiopia
    Description

    Abstract

    The Ethiopia Socioeconomic Panel Survey (ESPS) is a collaborative project between the Ethiopian Statistical Service (ESS) and the World Bank Living Standards Measurement Study-Integrated Surveys on Agriculture (LSMS-ISA) team. The objective of the LSMS-ISA is to collect multi-topic, household-level panel data with a special focus on improving agriculture statistics and generating a clearer understanding of the link between agriculture and other sectors of the economy. The project also aims to build capacity, share knowledge across countries, and improve survey methodologies and technology. ESPS is a long-term project to collect panel data. The project responds to the data needs of the country, given the dependence of a high percentage of households on agriculture activities in the country. The ESPS collects information on household agricultural activities along with other information on the households like human capital, other economic activities, and access to services and resources. The ability to follow the same households over time makes the ESPS a new and powerful tool for studying and understanding the role of agriculture in household welfare over time as it allows analyses of how households add to their human and physical capital, how education affects earnings, and the role of government policies and programs on poverty, inter alia. The ESPS is the first-panel survey to be carried out by the Ethiopian Statistical Service that links a multi-topic household questionnaire with detailed data on agriculture.

    Geographic coverage

    National Regional Urban and Rural

    Analysis unit

    • Household
    • Individual
    • Community

    Universe

    The survey covered all de jure households excluding prisons, hospitals, military barracks, and school dormitories.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The sampling frame for the second phase ESPS panel survey is based on the updated 2018 pre-census cartographic database of enumeration areas by the Ethiopian Statistical Service (ESS). The sample is a two-stage stratified probability sample. The ESPS EAs in rural areas are the subsample of the AgSS EA sample. That means the first stage of sampling in the rural areas entailed selecting enumeration areas (i.e., the primary sampling units) using simple random sampling (SRS) from the sample of the 2018 AgSS enumeration areas (EAs). The first stage of sampling for urban areas is selecting EAs directly from the urban frame of EAs within each region using systematic PPS. This is designed to automatically result in a proportional allocation of the urban sample by zone within each region. Following the selection of sample EAs, they are allocated by urban rural strata using power allocation which is happened to be closer to proportional allocation.

    The second stage of sampling is the selection of households to be surveyed in each sampled EA using systematic random sampling. From the rural EAs, 10 agricultural households are selected as a subsample of the households selected for the AgSS, and 2 non-agricultural households are selected from the non-agriculture households list in that specific EA. The non-agriculture household selection follows the same sampling method i.e., systematic random sampling. One important issue to note in ESPS sampling is that the total number of agriculture households per EA remains at 10 even though there are less than 2 or no non-agriculture households are listed and sampled in that EA. For urban areas, a total of 15 households are selected per EA regardless of the households’ economic activity. The households are selected using systematic random sampling from the total households listed in that specific EA.

    The ESPS-5 kept all the ESPS-4 samples except for those in the Tigray region and a few other places. A more detailed description of the sample design is provided in Section 3 of the Basic Information Document provided under the Related Materials tab.

    Mode of data collection

    Computer Assisted Personal Interview [capi]

    Research instrument

    The ESPS-5 survey consisted of four questionnaires (household, community, post-planting, and post-harvest questionnaires), similar to those used in previous waves but revised based on the results of those waves and on the need for new data they revealed. The following new topics are included in ESPS-5:

    a. Dietary Quality: This module collected information on the household’s consumption of specified food items.

    b. Food Insecurity Experience Scale (FIES): In this round the survey has implemented FIES. The scale is based on the eight food insecurity experience questions on the Food Insecurity Experience Scale | Voices of the Hungry | Food and Agriculture Organization of the United Nations (fao.org).

    c. Basic Agriculture Information: This module is designed to collect minimal agriculture information from households. It is primarily for urban households. However, it was also used for a few rural households where it was not possible to implement the full agriculture module due to security reasons and administered for urban households. It asked whether they had undertaken any agricultural activity, such as crop farming and tending livestock) in the last 12 months. For crop farming, the questions were on land tenure, crop type, input use, and production. For livestock there were also questions on their size and type, livestock products, and income from sales of livestock or livestock products.

    d. Climate Risk Perception: This module was intended to elicit both rural and urban households perceptions, beliefs, and attitudes about different climate-related risks. It also asked where and how households were obtaining information on climate and weather-related events.

    e. Agriculture Mechanization and Video-Based Agricultural Extension: The rural area community questionnaire covered these areas rural areas. On mechanization the questions related to the penetration, availability and accessibility of agricultural machinery. Communities were also asked if they had received video-based extension services.

    Cleaning operations

    Final data cleaning was carried out on all data files. Only errors that could be clearly and confidently fixed by the team were corrected; errors that had no clear fix were left in the datasets. Cleaning methods for these errors are left up to the data user.

    Response rate

    ESPS-5 planned to interview 7,527 households from 565 enumeration areas (EAs) (Rural 316 EAs and Urban 249 EAs). However, due to the security situation in northern Ethiopia and to a lesser extent in the western part of the country, only a total of 4999 households from 438 EAs were interviewed for both the agriculture and household modules. The security situation in northern parts of Ethiopia meant that, in Tigray, ESPS-5 did not cover any of the EAs and households previously sampled. In Afar, while 275 households in 44 EAs had been covered by both the ESPS-4 agriculture and household modules, in ESPS-5 only 252 households in 22 EAs were covered by both modules. During the fifth wave, security was also a problem in both the Amhara and Oromia regions, so there was a comparable reduction in the number of households and EAs covered there.

    More detailed information is available in the BID.

  17. w

    Fifth Integrated Household Survey 2019-2020 - Malawi

    • microdata.worldbank.org
    • catalog.ihsn.org
    • +1more
    Updated Jan 16, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fifth Integrated Household Survey 2019-2020 - Malawi [Dataset]. https://microdata.worldbank.org/index.php/catalog/3818
    Explore at:
    Dataset updated
    Jan 16, 2024
    Dataset authored and provided by
    National Statistical Office (NSO)
    Time period covered
    2019 - 2020
    Area covered
    Malawi
    Description

    Abstract

    The Integrated Household Survey is one of the primary instruments implemented by the Government of Malawi through the National Statistical Office (NSO) roughly every 3-5 years to monitor and evaluate the changing conditions of Malawian households. The IHS data have, among other insights, provided benchmark poverty and vulnerability indicators to foster evidence-based policy formulation and monitor the progress of meeting the Millennium Development Goals (MDGs), the goals listed as part of the Malawi Growth and Development Strategy (MGDS) and now the Sustainable Development Goals (SDGs).

    Geographic coverage

    National coverage

    Analysis unit

    • Households
    • Individuals
    • Consumption expenditure commodities/items
    • Communities
    • Agricultural household/ Holder/ Crop
    • Market

    Universe

    Members of the following households are not eligible for inclusion in the survey: • All people who live outside the selected EAs, whether in urban or rural areas. • All residents of dwellings other than private dwellings, such as prisons, hospitals and army barracks. • Members of the Malawian armed forces who reside within a military base. (If such individuals reside in private dwellings off the base, however, they should be included among the households eligible for random selection for the survey.) • Non-Malawian diplomats, diplomatic staff, and members of their households. (However, note that non-Malawian residents who are not diplomats or diplomatic staff and are resident in private dwellings are eligible for inclusion in the survey. The survey is not restricted to Malawian citizens alone.) • Non-Malawian tourists and others on vacation in Malawi.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The IHS5 sampling frame is based on the listing information and cartography from the 2018 Malawi Population and Housing Census (PHC); includes the three major regions of Malawi, namely North, Center and South; and is stratified into rural and urban strata. The urban strata include the four major urban areas: Lilongwe City, Blantyre City, Mzuzu City, and the Municipality of Zomba. All other areas are considered as rural areas, and each of the 27 districts were considered as a separate sub-stratum as part of the main rural stratum. The sampling frame further excludes the population living in institutions, such as hospitals, prisons and military barracks. Hence, the IHS5 strata are composed of 32 districts in Malawi.

    A stratified two-stage sample design was used for the IHS5.

    Note: Detailed sample design information is presented in the "Fifth Integrated Household Survey 2019-2020, Basic Information Document" document.

    Mode of data collection

    Computer Assisted Personal Interview [capi]

    Research instrument

    HOUSEHOLD QUESTIONNAIRE The Household Questionnaire is a multi-topic survey instrument and is near-identical to the content and organization of the IHS3 and IHS4 questionnaires. It encompasses economic activities, demographics, welfare and other sectoral information of households. It covers a wide range of topics, dealing with the dynamics of poverty (consumption, cash and non-cash income, savings, assets, food security, health and education, vulnerability and social protection). Although the IHS5 household questionnaire covers a wide variety of topics in detail it intentionally excludes in-depth information on topics covered in other surveys that are part of the NSO’s statistical plan (such as maternal and child health issues covered at length in the Malawi Demographic and Health Survey).

    AGRICULTURE QUESTIONNAIRE All IHS5 households that are identified as being involved in agricultural or livestock activities were administered the agriculture questionnaire, which is primarily modelled after the IHS3 counterpart. The modules are expanding on the agricultural content of the IHS4, IHS3, IHS2, AISS, and other regional agricultural surveys, while remaining consistent with the NACAL topical coverage and methodology. The development of the agriculture questionnaire was done with input from the aforementioned stakeholders who provided input on the household questionnaire as well as outside researchers involved in research and policy discussions pertaining to the Malawian agriculture. The agriculture questionnaire allows, among other things, for extensive agricultural productivity analysis through the diligent estimation of land areas, both owned and cultivated, labor and non-labor input use and expenditures, and production figures for main crops, and livestock. Although one of the major foci of the agriculture data collection effort was to produce smallholder production estimates for major crops, it is also possible to disaggregate the data by gender and main geographical regions. The IHS5 cross-sectional households supply information on the last completed rainy season (2017/2018 or 2018/2019) and the last completed dry season (2018 or 2019) depending on the timing of their interview.

    FISHERIES QUESTIONNAIRE The design of the IHS5 fishery questionnaire is identical to the questionnaire designed for IHS3. The IHS3 fisheries questionnaire was informed by the design and piloting of a fishery questionnaire by the World Fish Center (WFC), which was supported by the LSMS-ISA project for the purpose of assembling a fishery questionnaire that could be integrated into multi-topic household-surveys. The WFC piloted the draft instrument in November 2009 in the Lower Shire region, and the NSO team considered the revised draft in designing the IHS5 fishery questionnaire.

    COMMUNITY QUESTIONNAIRE The content of the IHS5 Community Questionnaire follows the content of the IHS3 & IHS4 Community Questionnaires. A “community” is defined as the village or urban location surrounding the enumeration area selected for inclusion in the sample and which most residents recognize as being their community. The IHS5 community questionnaire was administered to each community associated with the cross-sectional EAs interviewed. Identical to the IHS3 and IHS4 approach, to a group of several knowledgeable residents such as the village headman, the headmaster of the local school, the agricultural field assistant, religious leaders, local merchants, health workers and long-term knowledgeable residents. The instrument gathers information on a range of community characteristics, including religious and ethnic background, physical infrastructure, access to public services, economic activities, communal resource management, organization and governance, investment projects, and local retail price information for essential goods and services.

    MARKET QUESTIONNAIRE The Market Survey consisted of one questionnaire which is composed of four modules. Module A: Market Identification, Module B: Seasonal Main Crops, Module C: Permanents Crops, and Module D: Food Consumption.

    Cleaning operations

    DATA ENTRY PLATFORM To ensure data quality and timely availability of data, the IHS5 was implemented using the World Bank’s Survey Solutions CAPI software. To carry out IHS5, 1 laptop computer and a wireless internet router were assigned to each team supervisor, and each enumerator had an 8–inch GPS-enabled Lenovo tablet computer. The use of Survey Solutions allowed for the real-time availability of data as the completed data was completed, approved by the Supervisor and synced to the Headquarters server as frequently as possible. While administering the first module of the questionnaire the enumerator(s) also used their tablets to record the GPS coordinates of the dwelling units. In Survey Solutions, Headquarters can then see the location of the dwellings plotted on a map of Malawi to better enable supervision from afar – checking both the number of interviews performed and the fact that the sample households lie within EA boundaries. Geo-referenced household locations from that tablet complemented the GPS measurements taken by the Garmin eTrex 30 handheld devices and these were linked with publically available geospatial databases to enable the inclusion of a number of geospatial variables - extensive measures of distance (i.e. distance to the nearest market), climatology, soil and terrain, and other environmental factors - in the analysis.

    The range and consistency checks built into the application was informed by the LSMS-ISA experience in previous IHS waves. Prior programming of the data entry application allowed for a wide variety of range and consistency checks to be conducted and reported and potential issues investigated and corrected before closing the assigned enumeration area. Headquarters (NSO management) assigned work to supervisors based on their regions of coverage. Supervisors then made assignments to the enumerators linked to their Supervisor account. The work assignments and syncing of completed interviews took place through a Wi-Fi connection to the IHS5 server. Because the data was available in real time it was monitored closely throughout the entire data collection period and upon receipt of the data at headquarters, data was exported to STATA for other consistency checks, data cleaning, and analysis.

    DATA MANAGEMENT The IHS5 Survey Solutions CAPI based data entry application was designed to stream-line the data collection process from the field. IHS5 Interviews were collected in “sample” mode (assignments generated from headquarters) as opposed to “census” mode (new interviews created by interviewers from a template) for the NSO to have more control over the sample.

    The range and consistency checks built into the application was informed by the LSMS-ISA experience in previous IHS waves. Prior programming of the data

  18. w

    Household Risk and Vulnerability Survey 2016, Wave 1 - Nepal

    • microdata.worldbank.org
    • catalog.ihsn.org
    Updated Oct 5, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hanan Jacoby (2017). Household Risk and Vulnerability Survey 2016, Wave 1 - Nepal [Dataset]. https://microdata.worldbank.org/index.php/catalog/2905
    Explore at:
    Dataset updated
    Oct 5, 2017
    Dataset provided by
    Thomas Walker
    Hanan Jacoby
    Time period covered
    2016
    Area covered
    Nepal
    Description

    Abstract

    The objective of this three-year panel survey is to provide the Government of Nepal with empirical evidence on the patterns of exposure to shocks at the household level and on the vulnerability of households’ welfare to these shocks. It covers 6,000 households in non-metropolitan areas of Nepal, which were interviewed in mid 2016. Being a relatively comprehensive and representative (rural) sample household survey, it can also be used for other research into living conditions of Nepali households in rural areas. This is the entire dataset for the first wave of the survey. The same households will be reinterviewed in mid 2017 and mid 2018. The survey dataset contains a multi-topic survey which was completed for each of the 6,000 households, and a community survey fielded to a senior community representative at the village development committee (VDC) level in each of the 400 PSUs.

    Geographic coverage

    All non-metropolitan areas in Nepal. Non-metropolitan areas are as defined by the 2010 Census.

    Analysis unit

    Household, following the NLSS definition.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The sample frame was all households in non-metropolitan areas per the 2010 Census definition, excluding households in the Kathmandu valley (Kathmandu, Lalitpur and Bhaktapur districts). The country was segmented into 11 analytical strata, defined to correspond to those used in the NLSS III (excluding the three urban strata used there). To increase the concentration of sampled households, 50 of the 75 districts in Nepal were selected with probability proportional to size (the measure of size being the number of households). PSUs were selected with probability proportional to size from the entire list of wards in the 50 selected districts, one stratum at a time. The number of PSUs per stratum is proportional to the stratum's population share, and corresponds closely to the allocations used in the LFS-II and NLSS-III (adjusted for different overall numbers of PSUs in those surveys).

    In each of the selected PSUs (administrative wards), survey teams compiled a list of households in the ward based on existing administrative records, and cross-checked with local leaders. The number of households shown in the list was compared to the ward population in the 2010 Census, adjusted for likely population growth. Where the listed population deviated by more than 10% from the projected population based on the Census data, the team conducted a full listing of households in the ward. 15 households were selected at random from the ward list for interviewing, and a further 5 households were selected as potential replacements.

    Sampling deviation

    During the fieldwork, one PSU in Lapu VDC was inaccessible due to weather, and was replaced by a ward in Hastichaur VDC using PPS sampling on that stratum (excluding the already selected PSUs). All other sampled PSUs were reached, and a full sample of 6,000 households was interviewed in the first wave.

    Mode of data collection

    Computer Assisted Personal Interview [capi]

    Research instrument

    The household questionnaire contained 16 modules: the household roster; education; health; housing and access to facilities; food expenses and home production; non-food expenditures and inventory of durable goods; jobs and time use; wage jobs; farming and livestock; non-agriculture enterprises/activities; migration; credit, savings, and financial assets; private assistance; public assistance; shocks; and anthropometrics (for children less than 5 years). Where possible, the style of questions was kept similar to those used in the NLSS-III questionnaire for comparability reasons. In some cases, new modules needed to be developed. The shocks questionnaire was developed by the World Bank team. A food security module was added based on the design recommended by USAID, and a psychosocial questionnaire was also developed by social development specialists in the World Bank. The section on government and other assistance was also redesigned to cover a broader range of programs and elicit information on details such as experience with enrollment and frequency of payment.

    The community questionnaire was fielded to a senior community representative at the VDC level in each of the 400 PSUs. The purpose of the community questionnaire was to obtain further details on access to services in each PSU, to gather information on shocks at the community level, and to collect market price data. The questionnaire had six modules: respondent details; community characteristics; access to facilities; educational facilities; community shocks, household shocks; and market price.

    Cleaning operations

    These are the raw data entered and checked by the survey firm, formatted to conform to the original questionnaire numbering system and confidentialized. The data were cleaned for spelling errors and translation of Nepali phrases, and suspicious values were checked by calling respondents. No other transformations have taken place.

    Response rate

    Of the 6,000 originally sampled households, 5,191 agreed to be interviewed. Of the 13.5% of households that were not interviewed, 11.1% were resident but could not be located by the team after two attempts, 0.9% were found to have outmigrated, and 1.4% refused. The 809 replacement households were drawn in order from the randomized list created during sampling (see above).

  19. w

    Living Standards Survey 2018-2019 - Nigeria

    • microdata.worldbank.org
    • catalog.ihsn.org
    • +1more
    Updated Jan 12, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Living Standards Survey 2018-2019 - Nigeria [Dataset]. https://microdata.worldbank.org/index.php/catalog/3827
    Explore at:
    Dataset updated
    Jan 12, 2021
    Dataset authored and provided by
    National Bureau of Statistics (NBS)
    Time period covered
    2018 - 2019
    Area covered
    Nigeria
    Description

    Abstract

    The main objectives of the 2018/19 NLSS are: i) to provide critical information for production of a wide range of socio-economic and demographic indicators, including for benchmarking and monitoring of SDGs; ii) to monitor progress in population’s welfare; iii) to provide statistical evidence and measure the impact on households of current and anticipated government policies. In addition, the 2018/19 NLSS could be utilized to improve other non-survey statistical information, e.g. to determine and calibrate the contribution of final consumption expenditures of households to GDP; to update the weights and determine the basket for the national Consumer Price Index (CPI); to improve the methodology and dissemination of micro-economic and welfare statistics in Nigeria.

    The 2018/19 NLSS collected a comprehensive and diverse set of socio-economic and demographic data pertaining to the basic needs and conditions under which households live on a day to day basis. The 2018/19 NLSS questionnaire includes wide-ranging modules, covering demographic indicators, education, health, labour, expenditures on food and non-food goods, non-farm enterprises, household assets and durables, access to safety nets, housing conditions, economic shocks, exposure to crime and farm production indicators.

    Geographic coverage

    National coverage

    Analysis unit

    • Households
    • Individuals
    • Communities

    Universe

    The survey covered all de jure households excluding prisons, hospitals, military barracks, and school dormitories.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The 2018/19 NLSS sample is designed to provide representative estimates for the 36 states and the Federal Capital Territory (FCT), Abuja. By extension. The sample is also representative at the national and zonal levels. Although the sample is not explicitly stratified by urban and rural areas, it is possible to obtain urban and rural estimates from the NLSS data at the national level. At all stages, the relative proportion of urban and rural EAs as has been maintained.

    Before designing the sample for the 2018/19 NLSS, the results from the 2009/10 HNLSS were analysed to extract the sampling properties (variance, design effect, etc.) and estimate the required sample size to reach a desired precision for poverty estimates in the 2018/19 NLSS.

    EA SELECTION: The sampling frame for the 2018/19 NLSS was based on the national master sample developed by the NBS, referred to as the NISH2 (Nigeria Integrated Survey of Households 2). This master sample was based on the enumeration areas (EAs) defined for the 2006 Nigeria Census Housing and Population conducted by National Population Commission (NPopC). The NISH2 was developed by the NBS to use as a frame for surveys with state-level domains. NISH2 EAs were drawn from another master sample that NBS developed for surveys with LGA-level domains (referred to as the “LGA master sample”). The NISH2 contains 200 EAs per state composed of 20 replicates of 10 sample EAs for each state, selected systematically from the full LGA master sample. Since the 2018/19 NLSS required domains at the state-level, the NISH2 served as the sampling frame for the survey.

    Since the NISH2 is composed of state-level replicates of 10 sample EAs, a total of 6 replicates were selected from the NISH2 for each state to provide a total sample of 60 EAs per state. The 6 replicates selected for the 2018/19 NLSS in each state were selected using random systematic sampling. This sampling procedure provides a similar distribution of the sample EAs within each state as if one systematic sample of 60 EAs had been selected directly from the census frame of EAs.

    A fresh listing of households was conducted in the EAs selected for the 2018/19 NLSS. Throughout the course of the listing, 139 of the selected EAs (or about 6%) were not able to be listed by the field teams. The primary reason the teams were not able to conduct the listing in these EAs was due to security issues in the country. The fieldwork period of the 2018/19 NLSS saw events related to the insurgency in the north east of the country, clashes between farmers and herdsman, and roving groups of bandits. These events made it impossible for the interviewers to visit the EAs in the villages and areas affected by these conflict events. In addition to security issues, some EAs had been demolished or abandoned since the 2006 census was conducted. In order to not compromise the sample size and thus the statistical power of the estimates, it was decided to replace these 139 EAs. Additional EAs from the same state and sector were randomly selected from the remaining NISH2 EAs to replace each EA that could not be listed by the field teams. This necessary exclusion of conflict affected areas implies that the sample is representative of areas of Nigeria that were accessible during the 2018/19 NLSS fieldwork period. The sample will not reflect conditions in areas that were undergoing conflict at that time. This compromise was necessary to ensure the safety of interviewers.

    HOUSEHOLD SELECTION: Following the listing, the 10 households to be interviewed were selected from the listed households. These households were selected systemically after sorting by the order in which the households were listed. This systematic sampling helped to ensure that the selected households were well dispersed across the EA and thereby limit the potential for clustering of the selected households within an EA.

    Occasionally, interviewers would encounter selected households that were not able to be interviewed (e.g. due to migration, refusal, etc.). In order to preserve the sample size and statistical power, households that could not be interviewed were replaced with an additional randomly selected household from the EA. Replacement households had to be requested by the field teams on a case-by-case basis and the replacement household was sent by the CAPI managers from NBS headquarters. Interviewers were required to submit a record for each household that was replaced, and justification given for their replacement. These replaced households are included in the disseminated data. However, replacements were relatively rare with only 2% of sampled households not able to be interviewed and replaced.

    Sampling deviation

    Although a sample was initially drawn for Borno state, the ongoing insurgency in the state presented severe challenges in conducting the survey there. The situation in the state made it impossible for the field teams to reach large areas of the state without compromising their safety. Given this limitation it was clear that a representative sample for Borno was not possible. However, it was decided to proceed with conducting the survey in areas that the teams could access in order to collect some information on the parts of the state that were accessible.

    The limited area that field staff could safely operate in in Borno necessitated an alternative sample selection process from the other states. The EA selection occurred in several stages. Initially, an attempt was made to limit the frame to selected LGAs that were considered accessible. However, after selection of the EAs from the identified LGAs, it was reported by the NBS listing teams that a large share of the selected EAs were not safe for them to visit. Therefore, an alternative approach was adopted that would better ensure the safety of the field team but compromise further the representativeness of the sample. First, the list of 788 EAs in the LGA master sample for Borno were reviewed by NBS staff in Borno and the EAs they deemed accessible were identified. The team identified 359 EAs (46%) that were accessible. These 359 EAs served as the frame for the Borno sample and 60 EAs were randomly selected from this frame. However, throughout the course of the NLSS fieldwork, additional insurgency related events occurred which resulted in 7 of the 60 EAs being inaccessible when they were to be visited. Unlike for the main sample, these EAs were not replaced. Therefore, 53 EAs were ultimately covered from the Borno sample. The listing and household selection process that followed was the same as for the rest of the states.

    Mode of data collection

    Computer Assisted Personal Interview [capi]

    Research instrument

    Two sets of questionnaires – household and community – were used to collect information in the NLSS2018/19. The Household Questionnaire was administered to all households in the sample. The Community Questionnaire was administered to the community to collect information on the socio-economic indicators of the enumeration areas where the sample households reside.

    Household Questionnaire: The Household Questionnaire provides information on demographics; education; health; labour; food and non-food expenditure; household nonfarm income-generating activities; food security and shocks; safety nets; housing conditions; assets; information and communication technology; agriculture and land tenure; and other sources of household income.

    Community Questionnaire: The Community Questionnaire solicits information on access to transported and infrastructure; community organizations; resource management; changes in the community; key events; community needs, actions and achievements; and local retail price information.

    Cleaning operations

    CAPI: The 2018/19 NLSS was conducted using the Survey Solutions Computer Assisted Person Interview (CAPI) platform. The Survey Solutions software was developed and maintained by the Development Economics Data Group (DECDG) at the World Bank. Each interviewer and supervisor was given a tablet

  20. w

    National Panel Survey 2020-21, Wave 5 - Tanzania

    • microdata.worldbank.org
    • catalog.ihsn.org
    Updated Mar 3, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Bureau of Statistics (2023). National Panel Survey 2020-21, Wave 5 - Tanzania [Dataset]. https://microdata.worldbank.org/index.php/catalog/5639
    Explore at:
    Dataset updated
    Mar 3, 2023
    Dataset authored and provided by
    National Bureau of Statistics
    Time period covered
    2020 - 2022
    Area covered
    Tanzania
    Description

    Abstract

    The main objective of the NPS is to provide high-quality household-level data to the Tanzanian government and other stakeholders for monitoring poverty dynamics, tracking the progress of the Five Year Development Plan (FYDP) II poverty reduction strategy and its predecessor plans, and evaluating the impact of other major, national-level government policy initiatives. As an integrated survey covering a number of different socioeconomic factors, it compliments other more narrowly focused survey efforts, such as the Demographic and Health Survey (DHS) on health, the Integrated Labour Force Survey (ILFS) on labour markets, the Household Budget Survey (HBS) on expenditure, and the National Sample Census of Agriculture (NSCA). Secondly, as a panel household survey in which the same households are revisited over time, the NPS allows for the study of poverty and welfare transitions and the determinants of living standard changes.

    Geographic coverage

    Designed for analysis of key indicators at four primary domains of inference, namely: Dar es Salaam, Other Urban, Rural, Zanzibar,

    Analysis unit

    Households; Individuals

    Sampling procedure

    The NPS is based on a stratified, multi-stage cluster sample design which recognizes four analytical strata: Dar es Salaam, Other Urban areas in Mainland, Rural areas in Mainland, and Zanzibar. The sample design for the NPS 2020/21 targeted the sub-sample of households from the initial NPS 2014/15 cohort considered the “Refresh Panel”. These specific households had never previously been a part of the NPS sample design. This sample consisted of 3,352 households from 419 clusters in the NPS 2014/15 that were tracked and interviewed in the NPS 2020/21. An additional “Booster Sample” of 545 households from major cities and urban areas (specifically, Mbeya, Arusha, Mwanza, Tanga, and Dodoma) was also interviewed to allow for improved estimates in urban centres.

    In previous NPS rounds, the sample design included complete households that could not be interviewed in a particular year but were found in later rounds, excluding those households that had refused to be interviewed (i.e. a household that was interviewed in Round 1, lost in Round 2, and found again in Round 3). This situation does not exist in the NPS 2020/21 as they have only been included in, at most, two rounds.

    The eligibility requirement for inclusion of a household in this round of the NPS and all others is defined as any household having at least one member aged 15 years and above, excluding live-in servants. Households with at least one eligible member were completely interviewed, including any non-eligible members present in the household.

    Additionally, the final sample for NPS 2020/21 included any split-off household or eligible members identified during data collection (i.e. a previous NPS member who had moved or started another household in between rounds). Marriage and migration are the most common reasons for households splitting over time. Ultimately, the final sample size for NPS 2020/21 was 23,592 individuals in 4,709 households. Of these, 4,164 households allow for panel analysis as they have been found and interviewed in both NPS 2014/15 and NPS 2020/21, while the remaining 545 (in the “Booster Sample”) will only have data available in the NPS 2020/21. The complete cohort interviewed in NPS 2020/21 will be maintained and tracked in all future waves of the NPS.

    Mode of data collection

    Computer Assisted Personal Interview [capi]

    Research instrument

    The NPS 2020/21 consists of four survey instruments: a Household Questionnaire, Agriculture Questionnaire, Livestock Questionnaire, and a Community Questionnaire. A detailed description of the questionnaires is provided in the Survey Instruments section of the Basic Information Document (available under Downloads). All questionnaires are in English and available for download.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sustainable Development Solutions Network (2023). Rural Access Index by Country (2022 - 2023) [Dataset]. https://sdg-transformation-center-sdsn.hub.arcgis.com/datasets/d386abdab7d946aa8b1a0cd11496d91f
Organization logo

Rural Access Index by Country (2022 - 2023)

Explore at:
Dataset updated
Apr 19, 2023
Dataset authored and provided by
Sustainable Development Solutions Networkhttps://www.unsdsn.org/
License

Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically

Area covered
Description

The Rural Access Index (RAI) is a measure of access, developed by the World Bank in 2006. It was adopted as Sustainable Development Goal (SDG) indicator 9.1.1 in 2015, to measure the accessibility of rural populations. It is currently the only indicator for the SDGs that directly measures rural access.The RAI measures the proportion of the rural population that lives within 2 km of an all-season road. An all-season road is one that is motorable all year, but may be temporarily unavailable during inclement weather (Roberts, Shyam, & Rastogi, 2006). This dataset implements and expands on the most recent official methodology put forward by the World Bank, ReCAP's 2019 RAI Supplemental Guidelines. This is, to date, the only publicly available application of this method at a global scale.MethodologyReCAP's methodology provided new insight on what makes a road all-season and how this data should be handled: instead of removing unpaved roads from the network, the ones that are classified as unpaved are to be intersected with topographic and climatic conditions and, whenever there’s an overlap with excess precipitation and slope, a multiplying factor ranging from 0% to 100% is applied to the population that would access to that road. This present dataset developed by SDSN's SDG Transformation Centre proposes that authorities ability to maintain and remediate road conditions also be taken into account.Data sourcesThe indicator relies on four major items of geospatial data: land cover (rural or urban), population distribution, road network extent and the “all-season” status of those roads.Land cover data (urban/rural distinction)Since the indicator measures the acess rural populations, it's necessary to define what is and what isn't rural. This dataset uses the DegUrba Methodology, proposed by the United Nations Expert Group on Statistical Methodology for Delineating Cities and Rural Areas (United Nations Expert Group, 2019). This approach has been developed by the European Commission Global Human Settlement Layer (GHSL-SMOD) project, and is designed to instil some consistency into the definitions based on population density on a 1-km grid, but adjusted for local situations.Population distributionThe source for population distribution data is WorldPop. This uses national census data, projections and other ancillary data from countries to produce aggregated, 100 m2 population data. Road extentTwo widely recognized road datasets are used: the real-time updated crowd-sourced OpenStreetMap (OSM) or the GLOBIO’s 2018 GRIP database, which draws data from official national sources. The reasons for picking the latter are mostly related to its ability to provide information on the surface (pavement) of these roads, to the detriment of the timeliness of the data, which is restrained to the year 2018. Additionally, data from Microsoft Bing's recent Road Detection project is used to ensure completeness. This dataset is completely derived from machine learning methods applied over satellite imagery, and detected 1,165 km of roads missing from OSM.Roads’ all-season statusThe World Bank's original 2006 methodology defines the term all-season as “… a road that is motorable all year round by the prevailing means of rural transport, allowing for occasional interruptions of short duration”. ReCAP's 2019 methodology makes a case for passability equating to the all-season status of a road, along with the assumption that typically the wet season is when roads become impassable, especially so in steep roads that are more exposed to landslides.This dataset follows the ReCAP methodology by creating an passability index. The proposed use of passability factors relies on the following three aspects:• Surface type. Many rural roads in LICs (and even in large high-income countries including the USA and Australia) are unpaved. As mentioned before, unpaved roads deteriorate rapidly and in a different way to paved roads. They are very susceptible to water ingress to the surface, which softens the materials and makes them very vulnerable to the action of traffic. So, when a road surface becomes saturated and is subject to traffic, the deterioration is accelerated. • Climate. Precipitation has a significant effect on the condition of a road, especially on unpaved roads, which predominate in LICs and provide much of the extended connectivity to rural and poor areas. As mentioned above, the rainfall on a road is a significant factor in its deterioration, but the extent depends on the type of rainfall in terms of duration and intensity, and how well the roadside drainage copes with this. While ReCAP suggested the use of general climate zones, we argue that better spatial and temporal resolutions can be acquired through the Copernicus Programme precipitation data, which is made available freely at ~30km pixel size for each month of the year.• Terrain. The gradient and altitude of roads also has an effect on their accessibility. Steep roads become impassable more easily due to the potential for scour during heavy rainfall, and also due to slipperiness as a result of the road surface materials used. Here this is drawn from slope calculated from SRTM Digital Terrain data.• Road maintenance. The ability of local authorities to remediate damaged caused by precipitation and landslides is proposed as a correcting factor to the previous ones. Ideally this would be measured by the % of GDP invested in road construction and maintenance, but this isn't available for all countries. For this reason, GDP per capita is adopted as a proxy instead. The data range is normalized in such a way that a road maxed out in terms of precipitation and slope (accessibility score of 0.25) in a country at the top of the GDP per capita range is brought back at to the higher end of the accessibility score (0.95), while the accessibility score of a road meeting the same passability conditions in a country which GDP per capita is towards the lower end is kept unchanged.Data processingThe roads from the three aforementioned datasets (Bing, GRIP and OSM) are merged together to them is applied a 2km buffer. The populations falling exclusively on unpaved road buffers are multiplied by the resulting passability index, which is defined as the normalized sum of the aforementioned components, ranging from 0.25 to. 0.9, with 0.95 meaning 95% probability that the road is all-season. The index applied to the population data, so, when calculated, the RAI includes the probability that the roads which people are using in each area will be all-season or not. For example, an unpaved road in a flat area with low rainfall would have an accessibility factor of 0.95, as this road is designed to be accessible all year round and the environmental effects on its impassability are minimal.The code for generating this dataset is available on Github at: https://github.com/sdsna/rai

Search
Clear search
Close search
Google apps
Main menu