56 datasets found
  1. h

    2015 Population Census of Japan: Anonymous data Coding table and data layout...

    • d-repo.ier.hit-u.ac.jp
    • jdcat.jsps.go.jp
    application/x-yaml +5
    Updated Nov 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    独立行政法人統計センター; 一橋大学 (2023). 2015 Population Census of Japan: Anonymous data Coding table and data layout [Dataset]. https://d-repo.ier.hit-u.ac.jp/records/2007155/file_details/tokumei_kokuH27.xlsx?filename=tokumei_kokuH27.xlsx&file_order=0
    Explore at:
    xlsx, txt, csv, pdf, text/x-shellscript, application/x-yamlAvailable download formats
    Dataset updated
    Nov 8, 2023
    Authors
    独立行政法人統計センター; 一橋大学
    Time period covered
    Oct 1, 2015
    Area covered
    Japan, 日本
    Description

    平成27年に実施した国勢調査の母集団から、匿名データ提供の対象データとして更に1%(約125万レコード)リサンプリングを行ったもの。

  2. UCI Adult Census Data Dataset

    • kaggle.com
    zip
    Updated Aug 10, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sagnik (2020). UCI Adult Census Data Dataset [Dataset]. https://www.kaggle.com/datasets/sagnikpatra/uci-adult-census-data-dataset
    Explore at:
    zip(745972 bytes)Available download formats
    Dataset updated
    Aug 10, 2020
    Authors
    Sagnik
    Description

    The Us Adult income dataset was extracted by Barry Becker from the 1994 US Census Database. The data set consists of anonymous information such as occupation, age, native country, race, capital gain, capital loss, education, work class and more. I encountered it during my course, and I wish to share it here because it is a good starter example for data pre-processing and machine learning practices.

    Fields

    The dataset contains 16 columns Target filed: Income -- The income is divide into two classes: 50K Number of attributes: 14 -- These are the demographics and other features to describe a person

    We can explore the possibility in predicting income level based on the individual’s personal information.

    Acknowledgements

    This dataset named “adult” is found in the UCI machine learning repository

  3. The 1970 Census of the Federal Republic Germany - IPUMS Subset - Germany

    • microdata.worldbank.org
    • catalog.ihsn.org
    Updated Aug 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Federal Statistical Office (Statistisches Bundesamt) (2025). The 1970 Census of the Federal Republic Germany - IPUMS Subset - Germany [Dataset]. https://microdata.worldbank.org/index.php/catalog/2129
    Explore at:
    Dataset updated
    Aug 1, 2025
    Dataset provided by
    Statistisches Bundesamthttp://www.destatis.de/
    IPUMS
    Time period covered
    1970
    Area covered
    Germany
    Description

    Analysis unit

    Persons West Germany; persons not organized into households

    UNITS IDENTIFIED: - Dwellings: yes - Vacant Units: no - Households: no - Individuals: yes - Group quarters: yes

    UNIT DESCRIPTIONS: - Dwellings: A dwelling, in terms of a survey unit, is defined as a self-contained set of rooms intended for living purposes which enable persons to keep their own household. - Households: A household consists of all persons who live in the same dwelling and have a common housekeeping budget. Also. persons living and keeping house alone as well as lodgers are counted as households. - Group quarters: Persons in institutions, homes or the like are covered if they live there, are registered at that address with the police or relevant authorities and, fully or partly, make use of communal catering arrangements or of any joint facilities.

    Universe

    Total population entitled to reside in households

    Kind of data

    Population and Housing Census [hh/popcen]

    Sampling procedure

    MICRODATA SOURCE: Federal Statistical Office (Statistisches Bundesamt)

    SAMPLE SIZE (person records): 3094845.

    SAMPLE DESIGN: A randomized 50% sub-sample was drawn from the 10% sample of the factually anonymous Scientific-Use-File (SUF) by using the ending numbers method. This yields the Public Use File, which is 5% sample of the total 1970 FRG population.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    2 questionnaires: (f1) population census questionnaire: 90% of the population received a questionnaire comprising 18 questions, the remaining 10% received a questionnaire containing 39 questions; (f2) local unit questionnaire

  4. B

    2021 Census of Population [Canada] Public Use Microdata File (PUMF):...

    • borealisdata.ca
    • dataone.org
    • +1more
    Updated Nov 7, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statistics Canada (2025). 2021 Census of Population [Canada] Public Use Microdata File (PUMF): Individuals File [Dataset]. http://doi.org/10.5683/SP3/UIHWYC
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 7, 2025
    Dataset provided by
    Borealis
    Authors
    Statistics Canada
    License

    https://borealisdata.ca/api/datasets/:persistentId/versions/7.2/customlicense?persistentId=doi:10.5683/SP3/UIHWYChttps://borealisdata.ca/api/datasets/:persistentId/versions/7.2/customlicense?persistentId=doi:10.5683/SP3/UIHWYC

    Area covered
    Canada
    Description

    The Individuals File, 2021 Census Public Use Microdata Files (PUMF) provides data on the characteristics of the Canadian population. The file contains a 2.7% sample of anonymous responses to the 2021 Census questionnaire. The files have been carefully scrutinized to ensure the complete confidentiality of the individual responses and geographic identifiers have been restricted to provinces/territories and metropolitan areas. With 144 variables, this comprehensive tool is excellent for policy analysts, pollsters, social researchers and anyone interested in modelling and performing statistical regression analysis using the Census.

  5. h

    2005 Population Census of Japan: Anonymous data Coding table and data layout...

    • d-repo.ier.hit-u.ac.jp
    application/x-yaml +5
    Updated Nov 8, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    独立行政法人統計センター; 一橋大学 (2023). 2005 Population Census of Japan: Anonymous data Coding table and data layout [Dataset]. https://d-repo.ier.hit-u.ac.jp/records/2007153/file_details/somu_spec.xls?filename=somu_spec.xls&file_order=4
    Explore at:
    xlsx, txt, application/x-yaml, text/x-shellscript, pdf, csvAvailable download formats
    Dataset updated
    Nov 8, 2023
    Authors
    独立行政法人統計センター; 一橋大学
    Time period covered
    Oct 1, 2005
    Area covered
    日本, Japan
    Description

    平成17年に実施した国勢調査の母集団から、匿名データ提供の対象データとして更に1%(約124万レコード)リサンプリングを行ったもの。

  6. 2

    1991 UK Census

    • datacatalogue.ukdataservice.ac.uk
    Updated Jan 6, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UK Data Service (2023). 1991 UK Census [Dataset]. http://doi.org/10.5255/UKDA-SN-7212-1
    Explore at:
    Dataset updated
    Jan 6, 2023
    Dataset provided by
    UK Data Servicehttps://ukdataservice.ac.uk/
    Area covered
    Ireland, Northern Ireland, Scotland, Wales, England
    Description

    The UK censuses took place on 21st April 1991. They were run by the Census Office for Northern Ireland, General Register Office for Scotland, and the Office of Population and Surveys for both England and Wales. The UK comprises the countries of England, Wales, Scotland and Northern Ireland.

    Statistics from the UK censuses help paint a picture of the nation and how we live. They provide a detailed snapshot of the population and its characteristics, and underpin funding allocation to provide public services.

    The Northern Ireland Individual SAR is a 2% sample of individuals which was drawn from the full set of 1991 Census records. It was released to the then Census Microdata Unit (now the Centre for Census and Survey Research) in May 1994 who then undertook quality assurance work and produced documentation and additional derived variables.

    The dataset contains 31,967 person records and 53 variables. A number of protections are in place to ensure the anonymity of cases in the data including the low sampling fraction, grouping of some rare categories, limitation of geographical detail and record reordering so that the cases are not ordered geographically.

    Once the Household SAR (held under SN 7213) had been removed to avoid an overlap between the two files, remaining records were stratified into groups of 99 and two individuals were chosen from each group. Individuals in communal establishments were stratified geographically into groups of 50 people and one person was chosen at random from each group. Unlike Great Britain, 100% of Northern Ireland records were coded.

    For many variables, the codes that are used in the Northern Ireland SARs differ from those used in the GB SARs. Where this is the case, the Northern Ireland coding follows on numerically from the GB coding but it will not necessarily start at 1 and have a value for every succeeding integer. For example, 'household family type' uses quite different coding to the nearest GB equivalent 'family type': the GB codes run from 00 through to 08, whilst the Northern Ireland codes pick up from 09 and run through to 23.

    Further information, including guides and other documentation, may be found on the Cathie Marsh Centre for Survey Research Samples of Anonymised Records (SARS) website.

  7. D

    Sales Tax by Census Block

    • data.sfgov.org
    csv, xlsx, xml
    Updated Oct 28, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Sales Tax by Census Block [Dataset]. https://data.sfgov.org/Economy-and-Community/Sales-Tax-by-Census-Block/6k2u-yz39
    Explore at:
    xml, xlsx, csvAvailable download formats
    Dataset updated
    Oct 28, 2025
    License

    ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
    License information was derived automatically

    Description

    A. SUMMARY This dataset contains sales tax collected in San Francisco for calendar years 2015 through the quarter preceding the most recent one. Sales tax is aggregated, or summed, using Census Block Boundaries. However, some geographic boundaries have been combined to maintain the anonymity of businesses based on Taxation Code Section 7056. See “How to use this dataset” below for more details on how the data has been aggregated. Sales tax is collected by businesses on many types of transactions and regulated by the California Department of Tax and Fee Administration.

    B. HOW THE DATASET IS CREATED Data is collected by HDL. The data is then aggregated based on the criteria outlined in the "How to use this dataset" section.

    C. UPDATE PROCESS This dataset will be updated quarterly.

    D. HOW TO USE THIS DATASET This dataset can be used to analyze sales tax data over time across geographic boundary in San Francisco. Due to data privacy protection regulations for businesses, sales tax data is not available for all geographic boundary. For example, boundaries where there are less than 4 businesses paying sales tax or a single business that pays 80% or more of the total sales tax have been combined with neighboring geographic boundary to protect the confidentiality of affected businesses.

    Because of this aggregation, some Census Block groups in this dataset may change in future years as the number of businesses in a particular Census Block change. The historical data changes based on audit findings and amended returns. If Census Block groupings change, it will happen when the dataset is updated - on a quarterly basis. These new blocks will be backfilled to previous years.

    Additionally, business payers with multiple locations (for example chain stores) are excluded because sales tax cannot be tied back to the location where it was collected.

    Finally, census blocks in the area field are from the 2020 census. A dataset of the 2020 census blocks can be found here.

    E. RELATED DATASETS Sales Tax by Supervisor District Sales Tax by Census Block Sales Tax by Analysis Neighborhood Sales Tax by Zip Code

  8. Data from: A visitor-enriched census in the U.S. cities using large-scale...

    • figshare.com
    application/gzip
    Updated Jun 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Meicheng Xiong; Di Zhu; David van Riper (2025). A visitor-enriched census in the U.S. cities using large-scale mobile positioning data. [Dataset]. http://doi.org/10.6084/m9.figshare.28537322.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Jun 9, 2025
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Meicheng Xiong; Di Zhu; David van Riper
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United States
    Description

    Census data, as a traditional data source of resident socio-demographics, provides valuable information for decision-makers, researchers, and the public. While numerous efforts have been made to develop more comprehensive data products based on published census datasets, most approaches treat census units as static and independent entities, overlooking their interactions. In this paper, we introduce the visitor census dataset, a semantically enriched census that incorporates human visitations extracted from large-scale mobile positioning data. We identified and validated the potential home locations of 3.58 million anonymous mobile phone users across seven U.S. metropolitan statistical areas in July 2021 and utilized home detection results to enrich the socio-demographic profile of the places users visited. The proposed data generation framework is adaptive, allowing future integration of diverse socio-demographic features at varying spatial and temporal scales. Overall, this visitor-based census represents an effort to enrich resident-based census knowledge by incorporating mobilities and spatial interactions in human digital traces, bridging the gap between aggregated and individual analysis, as well as between conventional census and mobile phone data.Seven MSAs include Angeles–Long Beach–Anaheim (LA), Houston–Pasadena–The Woodlands (Houston), Atlanta–Sandy Springs–Roswell (Atlanta), Miami–Fort Lauderdale–West Palm Beach (Miami), Seattle–Tacoma–Bellevue (Seattle), Denver–Aurora–Centennial (Denver), and Minneapolis-Saint. Paul (Twin Cities). There are ten files for each MSA:Visitor-based aggregation census table (e.g., Atlanta_visitor_July2021.csv): one fileVisit-based aggregation census table (e.g., Atlanta_visit_July2021.csv): one fileOne-week (July 19-25, 2021) intermediate home-visit table (e.g., Atlanta_homevisit_July21.csv.gz): seven filesGeographic boundary file at the CBG level (e.g., Atlanta_cbg.geojson): one file

  9. 2

    2021 Census of population: England, Wales, Northern Ireland

    • datacatalogue.ukdataservice.ac.uk
    • beta.ukdataservice.ac.uk
    Updated Feb 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UK Data Service (2025). 2021 Census of population: England, Wales, Northern Ireland [Dataset]. http://doi.org/10.5257/census/aggregate-2021-1
    Explore at:
    Dataset updated
    Feb 28, 2025
    Dataset provided by
    UK Data Servicehttps://ukdataservice.ac.uk/
    Area covered
    Northern Ireland, England and Wales
    Description
    The 2021 UK Census was the 23rd official census of the United Kingdom. The UK Census is generally conducted once every 10 years, and the 2021 censuses of England, Wales, and Northern Ireland took place on 21 March 2021. In Scotland, the decision was made to move the census to March 2022 because of the impact of the coronavirus pandemic (see SNs 9461 and 9462). The censuses were administered by the Office for National Statistics (ONS), the Northern Ireland Statistics and Research Agency (NISRA) and National Records of Scotland (NRS), respectively.

    Census 2021 was the first census with a digital-first design, encouraging participants to respond online rather than on a paper questionnaire. Support was given to people who could not respond online, including paper questionnaires, telephone contact centres, field force support, and an extended collection period.

    Topics covered in the 2021 UK Census included:

    • demography and migration
    • ethnic group, national identity, language and religion
    • labour market and travel to work
    • housing
    • education
    • health, disability, and unpaid care
    • Welsh and other languages
    • UK armed forces veterans
    • sexual orientation and gender identity.

    A census of population is held every ten years in the UK, in England and Wales it is undertaken by the Office for National Statistics (ONS), in Scotland by the National Records of Scotland (NRS) and in Northern Ireland by the Northern Ireland Statistics and Research Agency (NISRA).


    In England, Wales and Northern Ireland the latest census was taken on Sunday 21st March 2021. Due to issues around COVID-19, the census in Scotland was held a year later on 28th June 2022.


    The census asks questions about you, your household and your home. In doing so, it helps to build a detailed snapshot of our society. Information from the census helps the government and local authorities to plan and fund local services, such as education, doctors' surgeries and roads.


    Topics covered by the data released by the Census agencies include -


    Demography and migration, UK armed forces veterans, ethnicity, national identity, language, religion, labour market, housing, sexual orientation, gender identity, education, health, disability and unpaid care.


    The data in this series covers aggregate data at geographies from country level down to Output Area. Due to disclosure control (data can be blurred, changed or withheld to protect anonymity) not all datasets are available at all levels.


  10. g

    San Francisco Sales Tax by Census Block (2018 - 2023) | gimi9.com

    • gimi9.com
    Updated May 29, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). San Francisco Sales Tax by Census Block (2018 - 2023) | gimi9.com [Dataset]. https://gimi9.com/dataset/data-gov_san-francisco-sales-tax-by-census-block-2018-2023/
    Explore at:
    Dataset updated
    May 29, 2024
    Area covered
    San Francisco
    Description

    A. SUMMARY This dataset contains sales tax collected in San Francisco for calendar years 2018 through 2023 (CY 2018 to 2023). Sales tax is aggregated, or summed, at the census block level. However, some census blocks have been combined to maintain the anonymity of businesses based on Taxation Code Section 7056. See “How to use this dataset” below for more details on how the data has been aggregated. Sales tax is collected by businesses on many types of transactions and regulated by the California Department of Tax and Fee Administration. B. HOW THE DATASET IS CREATED Data is collected by HDL. The data is then aggregated based on the criteria outlined in the "How to use this dataset" section. C. UPDATE PROCESS This dataset will be updated annually. D. HOW TO USE THIS DATASET This dataset can be used to analyze sales tax data over time across census blocks in San Francisco. Due to data privacy protection regulations for businesses, sales tax data is not available for all census blocks. Census blocks where there are less than 4 businesses paying sales tax or a single business that pays 80% or more of the total sales tax have been combined with neighboring Census Blocks to protect the confidentiality of affected businesses. Because of this aggregation, some Census Block groups in this dataset may change in future years as the number of businesses in a particular Census Block changes. The historical data changes based on audit findings and amended returns. If census block groupings change, it will happen when the dataset is updated - on an annual basis. These new blocks will be backfilled to previous years. Additionally, business payers with multiple locations (for example chain stores) are excluded because sales tax cannot be tied back to the location where it was collected. Finally, census blocks in the area field are from 2010 (GEOID10) and not from 2020. A map of this dataset can be viewed here.

  11. Zip4 record with attributes.

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jayakrishnan Ajayakumar; Andrew Curtis; Jacqueline Curtis (2023). Zip4 record with attributes. [Dataset]. http://doi.org/10.1371/journal.pone.0285552.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Jayakrishnan Ajayakumar; Andrew Curtis; Jacqueline Curtis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    There are many public health situations within the United States that require fine geographical scale data to effectively inform response and intervention strategies. However, a condition for accessing and analyzing such data, especially when multiple institutions are involved, is being able to preserve a degree of spatial privacy and confidentiality. Hospitals and state health departments, who are generally the custodians of these fine-scale health data, are sometimes understandably hesitant to collaborate with each other due to these concerns. This paper looks at the utility and pitfalls of using Zip4 codes, a data layer often included as it is believed to be “safe”, as a source for sharing fine-scale spatial health data that enables privacy preservation while maintaining a suitable precision for spatial analysis. While the Zip4 is widely supplied, researchers seldom utilize it. Nor is its spatial characteristics known by data guardians. To address this gap, we use the context of a near-real time spatial response to an emerging health threat to show how the Zip4 aggregation preserves an underlying spatial structure making it potentially suitable dataset for analysis. Our results suggest that based on the density of urbanization, Zip4 centroids are within 150 meters of the real location almost 99% of the time. Spatial analysis experiments performed on these Zip4 data suggest a far more insightful geographic output than if using more commonly used aggregation units such as street lines and census block groups. However, this improvement in analytical output comes at a spatial privy cost as Zip4 centroids have a higher potential of compromising spatial anonymity with 73% of addresses having a spatial k anonymity value less than 5 when compared to other aggregations. We conclude that while offers an exciting opportunity to share data between organizations, researchers and analysts need to be made aware of the potential for serious confidentiality violations.

  12. Demographic and Health Survey 2011 - Ethiopia

    • microdata.worldbank.org
    • catalog.ihsn.org
    • +1more
    Updated May 27, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ministry of Health (MOH) (2019). Demographic and Health Survey 2011 - Ethiopia [Dataset]. https://microdata.worldbank.org/index.php/catalog/1381
    Explore at:
    Dataset updated
    May 27, 2019
    Dataset provided by
    Central Statistical Agencyhttps://ess.gov.et/
    Ministry of Health (MOH)
    Time period covered
    2010 - 2011
    Area covered
    Ethiopia
    Description

    Abstract

    The 2011 Ethiopia Demographic and Health Survey (EDHS) was conducted by the Central Statistical Agency (CSA) under the auspices of the Ministry of Health.

    The principal objective of the 2011 Ethiopia Demographic and Health Survey (EDHS) is to provide current and reliable data on fertility and family planning behaviour, child mortality, adult and maternal mortality, children’s nutritional status, use of maternal and child health services, knowledge of HIV/AIDS, and prevalence of HIV/AIDS and anaemia. The specific objectives are these: - Collect data at the national level that will allow the calculation of key demographic rates; - Analyse the direct and indirect factors that determine fertility levels and trends; - Measure the levels of contraceptive knowledge and practice of women and men by family planning method, urban-rural residence, and region of the country; - Collect high-quality data on family health, including immunisation coverage among children, prevalence and treatment of diarrhoea and other diseases among children under ge five, and maternity care indicators, including antenatal visits and assistance at delivery; - Collect data on infant and child mortality and maternal mortality; - Obtain data on child feeding practices, including breastfeeding, and collect anthropometric measures to assess the nutritional status of women and children; - Collect data on knowledge and attitudes of women and men about sexually transmitted diseases and HIV/AIDS and evaluate patterns of recent behaviour regarding condom use; - Conduct haemoglobin testing on women age 15-49 and children 6-59 months to provide information on the prevalence of anaemia among these groups; - Carry out anonymous HIV testing on women and men of reproductive age to provide information on the prevalence of HIV.

    This information is essential for informed policy decisions, planning, monitoring, and evaluation of programmes on health in general and reproductive health in particular at both the national and regional levels. A long-term objective of the survey is to strengthen the technical capacity of the Central Statistical Agency to plan, conduct, process, and analyse data from complex national population and health surveys.

    Moreover, the 2011 EDHS provides national and regional estimates on population and health that are comparable to data collected in similar surveys in other developing countries and to Ethiopia’s two previous DHS surveys, conducted in 2000 and 2005. Data collected in the 2011 EDHS add to the large and growing international database of demographic and health indicators.

    The survey was intentionally planned to be fielded at the beginning of the last term of the MDG reporting period to provide data for the assessment of the Millennium Development Goals (MDGs).

    The survey interviewed a nationally representative population in about 18,500 households, and all women age 15-49 and all men age 15-59 in these households. In this report key indicators relating to family planning, fertility levels and determinants, fertility preferences, infant, child, adult and maternal mortality, maternal and child health, nutrition, women’s empowerment, and knowledge of HIV/AIDS are provided for the nine regional states and two city administrations. In addition, this report also provides data by urban and rural residence at the country level.

    Major stakeholders from various government, non-government, and UN organizations have been involved and have contributed in the technical, managerial, and operational aspects of the survey.

    Geographic coverage

    A nationally representative sample of 17,817 households was selected.

    Universe

    All women 15-49 who were usual residents or who slept in the selected households the night before the survey were eligible for the survey. A male survey was also conducted. All men 15-49 who were usual residents or who slept in the selected households the night before the survey were eligible for the male survey.

    Kind of data

    Sample survey data

    Sampling procedure

    The sample for the 2011 EDHS was designed to provide population and health indicators at the national (urban and rural) and regional levels. The sample design allowed for specific indicators, such as contraceptive use, to be calculated for each of Ethiopia's 11 geographic/administrative regions (the nine regional states and two city administrations). The 2007 Population and Housing Census, conducted by the CSA, provided the sampling frame from which the 2011 EDHS sample was drawn.

    Administratively, regions in Ethiopia are divided into zones, and zones, into administrative units called weredas. Each wereda is further subdivided into the lowest administrative unit, called kebele. During the 2007 census each kebele was subdivided into census enumeration areas (EAs), which were convenient for the implementation of the census. The 2011 EDHS sample was selected using a stratified, two-stage cluster design, and EAs were the sampling units for the first stage. The sample included 624 EAs, 187 in urban areas and 437 in rural areas.

    Households comprised the second stage of sampling. A complete listing of households was carried out in each of the 624 selected EAs from September 2010 through January 2011. Sketch maps were drawn for each of the clusters, and all conventional households were listed. The listing excluded institutional living arrangements and collective quarters (e.g., army barracks, hospitals, police camps, and boarding schools). A representative sample of 17,817 households was selected for the 2011 EDHS. Because the sample is not self-weighting at the national level, all data in this report are weighted unless otherwise specified.

    In the Somali region, in 18 of the 65 selected EAs listed households were not interviewed for various reasons, such as drought and security problems, and 10 of the 65 selected EAs were not listed due to security reasons. Therefore, the data for Somali may not be totally representative of the region as a whole. However, national-level estimates are not affected, as the percentage of the population in the EAs not covered in the Somali region is proportionally very small.

    SAMPLING FRAME

    The sampling frame used for 2011 EDHS is the Population and Housing Census (PHC) conducted in 2007 provided by the Central Statistical Agency (CSA, 2008). CSA has an electronic file consisting of 81,654 Enumeration Areas (EA) created for the 2007 census in 10 of its 11 geographic regions. An EA is a geographic area consisting of a convenient number of dwelling units which served as counting unit for the census. The frame file contains information about the location, the type of residence, and the number of residential households for each of the 81,654 EAs. Sketch maps are also available for each EA which delimitate the geographic boundaries of the EA. The 2007 PHC conducted in the Somali region used a different methodology due to difficulty of access. Therefore, the sampling frame for the Somali region is in a different file and in different format. Due to security concerns in the Somali region, in the beginning it was decided that 2011 EDHS would be conducted only in three of nine zones in the Somali region: Shinile, Jijiga, and Liben, same as in the 2000 and 2005 EDHS. However, a later decision was made to include three other zones: Afder, Gode and Warder. This was the first time that these three zones were included in a major nationwide survey such as the 2011 EDHS. The sampling frame for the 2011 EDHS consists of a total of 85,057 EAs.

    The sampling frame excluded some special EAs with disputed boundaries. These EAs represent only 0.1% of the total population.

    Ethiopia is divided into 11 geographical regions. Each region is sub-divided into zones, each zone into Waredas, each Wareda into towns, and each town into Kebeles. Among the 85,057 EAs, 17,548 (21 percent) are in urban areas and 67,509 (79 percent) are in rural areas. The average size of EA in number of households is 169 in an urban EA and 180 in a rural EA, with an overall average of 178 households per EA. Table A.2 shows the distributions of households in the sampling frame, by region and residence. The data show that 81 percent of the Ethiopia’s households are concentrated in three regions: Amhara, Oromiya and SNNP, while 4 percent of all households are in the five smallest regions: Afar, Benishangul-Gumuz, Gambela, Harari and Dire Dawa.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The 2011 EDHS used three questionnaires: the Household Questionnaire, the Woman’s Questionnaire, and the Man’s Questionnaire. These questionnaires were adapted from model survey instruments developed for the MEASURE DHS project to reflect the population and health issues relevant to Ethiopia. Issues were identified at a series of meetings with the various stakeholders. In addition to English, the questionnaires were translated into three major languages—Amharigna, Oromiffa, and Tigrigna.

    The Household Questionnaire was used to list all the usual members and visitors of selected households. Basic information was collected on the characteristics of each person listed, including age, sex, education, and relationship to the head of the household. For children under age 18, survival status of the parents was determined. The data on the age and sex of household members obtained in the Household Questionnaire were used to identify women and men who were eligible for the individual interview. The Household Questionnaire also collected information on characteristics of the household’s dwelling unit, such as the source of water, type of toilet facilities, materials used for the floor of the house, and ownership of various consumer

  13. a

    Decoding Home Values: The Power of Education vs. Race, Ethnicity, and Gender...

    • chi-phi-nmcdc.opendata.arcgis.com
    Updated Jul 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    New Mexico Community Data Collaborative (2023). Decoding Home Values: The Power of Education vs. Race, Ethnicity, and Gender [Dataset]. https://chi-phi-nmcdc.opendata.arcgis.com/datasets/decoding-home-values-the-power-of-education-vs-race-ethnicity-and-gender
    Explore at:
    Dataset updated
    Jul 25, 2023
    Dataset authored and provided by
    New Mexico Community Data Collaborative
    Description

    A detailed explanation of how this dataset was put together, including data sources and methodologies, follows below.Please see the "Terms of Use" section below for the Data DictionaryDATA ACQUISITION AND CLEANING PROCESSThis dataset was built from 5 separate datasets queried during the months of April and May 2023 from the Census Microdata System (link below):https://data.census.gov/mdat/#/All datasets include information on Property Value (VALP) by: Educational Attainment (SCHL), Gender (SEX), a specified race or ethnicity (RAC or HISP), and are grouped by Public Use Microdata Areas (PUMAS). PUMAS are geographic areas created by the Census bureau; they are weighted by land area and population to facilitate data analysis. Data also Included totals for the state of New Mexico, so 19 total geographies are represented. Datasets were downloaded separately by race and ethnicity because this was the only way to obtain the VALP, SCHL, and SEX variables intersectionally with race or ethnicity data. Datasets were downloaded separately by race and ethnicity because this was the only way to obtain the VALP, SCHL, and SEX variables intersectionally with race or ethnicity data. Cleaning each dataset started with recoding the SCHL and HISP variables - details on recoding can be found below.After recoding, each dataset was transposed so that PUMAS were rows and SCHL, VALP, SEX, and Race or Ethnicity variables were the columns.Median values were calculated in every case that recoding was necessary. As a result, all Property Values in this dataset reflect median values.At times the ACS data downloaded with zeros instead of the 'null' values in initial query results. The VALP variable also included a "-1" variable to reflect N/A values (details in variable notes). Both zeros and "-1" values were removed before calculating median values, both to keep the data true to the original query and to generate accurate median values.Recoding the SCHL variable resulted in 5 rows for each PUMA, reflecting the different levels of educational attainment in each region. Columns grouped variables by race or ethnicity and gender. Cell values were property values.All 5 datasets were joined after recoding and cleaning the data. Original datasets all include 95 rows with 5 separate Educational Attainment variables for each PUMA, including New Mexico State totals.Because 1 row was needed for each PUMA in order to map this data, the data was split by Educational Attainment (SCHL), resulting in 110 columns reflecting median property values for each race or ethnicity by gender and level of educational attainment.A short, unique 2 to 5 letter alias was created for each PUMA area in anticipation of needing a unique identifier to join the data with. GIS AND MAPPING PROCESSA PUMA shapefile was downloaded from the ACS site. The Shapefile can be downloaded here: https://tigerweb.geo.census.gov/arcgis/rest/services/TIGERweb/PUMA_TAD_TAZ_UGA_ZCTA/MapServerThe DBF from the PUMA shapefile was exported to Excel; this shapefile data included needed geographic information for mapping such as: GEOID, PUMACE. The UIDs created for each PUMA were added to the shapefile data; the PUMA shapfile data and ACS data were then joined on UID in JMP.The data table was joined to the shapefile in ARC GiIS, based on PUMA region (specifically GEOID text).The resulting shapefile was exported as a GDB (geodatabase) in order to keep 'Null' values in the data. GDBs are capable of including a rule allowing null values where shapefiles are not. This GDB was uploaded to NMCDCs Arc Gis platform. SYSTEMS USEDMS Excel was used for data cleaning, recoding, and deriving values. Recoding was done directly in the Microdata system when possible - but because the system is was in beta at the time of use some features were not functional at times.JMP was used to transpose, join, and split data. ARC GIS Desktop was used to create the shapefile uploaded to NMCDC's online platform. VARIABLE AND RECODING NOTESTIMEFRAME: Data was queried for the 5 year period of 2015 to 2019 because ACS changed its definiton for and methods of collecting data on race and ethinicity in 2020. The change resulted in greater aggregation and les granular data on variables from 2020 onward.Note: All Race Data reflects that respondants identified as the specified race alone or in combination with one or more other races.VARIABLE:ACS VARIABLE DEFINITIONACS VARIABLE NOTESDETAILS OR URL FOR RAW DATA DOWNLOADRACBLKBlack or African American ACS Query: RACBLK, SCHL, SEX, VALP 2019 5yrRACAIANAmerican Indian and Alaska Native ACS Query: RACAIAN, SCHL, SEX, VALP 2019 5yrRACASNAsian ACS Query: RACASN, SCHL, SEX, VALP 2019 5yrRACWHTWhite ACS Query: RACWHT, SCHL, SEX, VALP 2019 5yrHISPHispanic Origin ACS Query: HISP ORG, SCHL, SEX, VALP 2019 5yrHISP RECODE: 24 original separate variablesThe Hispanic Origin (HISP) variable originally included 24 subcategories reflecting Mexican, Central American, South American, and Caribbean Latino, and Spanish identities from each Latin American counry. 7 recoded VariablesThese 24 variables were recoded (grouped) into 7 simpler categories for data analysis: Not Spanish/Hispanic/Latino, Mexican, Caribbean Latino, Central American, South American, Spaniard, All other Spanish/Hispanic/Latino Female. Not Spanish/Hispanic/Latino was not really used in the final dataset as the race datasets provided that information.SCHLEducational Attainment25 original separate variablesThe Educational Attainment (SCHL) variable originally included 25 subcategories reflecting the education levels of adults (over 18) surveyed by the ACS. These include: Kindergarten, Grades 1 through 12 separately, 12th grade with no diploma, Highschool Diploma, GED or credential, less than 1 year of college, more than 1 year of college with no degree, Associate's Degree, Bachelor's Degree, Master's Degree, Professional Degree, and Doctorate Degree.SCHL RECODE: 5 recoded variablesThese 25 variables were recoded (grouped) into 5 simpler categories for data analysis: No High School Diploma, High School Diploma or GED, Some College, Bachelor's Degree, and Advanced or Professional DegreeSEXGender2 variables1 - Male, 2 - FemaleVALPProperty Value1 variableValues were rounded and top-coded by ACS for anonymity. The "-1" variable is defined as N/A (GQ/ Vacant lots except 'for sale only' and 'sold, not occupied' / not owned or being bought.) This variable reflects the median value of property owned by individuals of each race, ethnicity, gender, and educational attainment category.PUMAPublic Use Microdata Area18 PUMAsPUMAs in New Mexico can be viewed here:https://nmcdc.maps.arcgis.com/apps/mapviewer/index.html?webmap=d9fed35f558948ea9051efe9aa529eafData includes 19 total regions: 18 Pumas and NM State TotalsNOTES AND RESOURCESThe following resources and documentation were used to navigate the ACS PUMS system and to answer questions about variables:Census Microdata API User Guide:https://www.census.gov/data/developers/guidance/microdata-api-user-guide.Additional_Concepts.html#list-tab-1433961450Accessing PUMS Data:https://www.census.gov/programs-surveys/acs/microdata/access.htmlHow to use PUMS on data.census.govhttps://www.census.gov/programs-surveys/acs/microdata/mdat.html2019 PUMS Documentation:https://www.census.gov/programs-surveys/acs/microdata/documentation.2019.html#list-tab-13709392012014 to 2018 ACS PUMS Data Dictionary:https://www2.census.gov/programs-surveys/acs/tech_docs/pums/data_dict/PUMS_Data_Dictionary_2014-2018.pdf2019 PUMS Tiger/Line Shapefileshttps://www.census.gov/cgi-bin/geo/shapefiles/index.php?year=2019&layergroup=Public+Use+Microdata+Areas Note 1: NMCDC attemepted to contact analysts with the ACS system to clarify questions about variables, but did not receive a timely response. Documentation was then consulted.Note 2: All relevant documentation was reviewed and seems to imply that all survey questions were answered by adults, age 18 or over. Youth who have inherited property could potentially be reflected in this data.Dataset and feature service created in May 2023 by Renee Haley, Data Specialist, NMCDC.

  14. i

    Demographic and Health Survey 2005 - Ethiopia

    • datacatalog.ihsn.org
    • catalog.ihsn.org
    • +1more
    Updated Jul 6, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Population and Housing Census Commissions Office (PHCCO) (2017). Demographic and Health Survey 2005 - Ethiopia [Dataset]. https://datacatalog.ihsn.org/catalog/163
    Explore at:
    Dataset updated
    Jul 6, 2017
    Dataset authored and provided by
    Population and Housing Census Commissions Office (PHCCO)
    Time period covered
    2005
    Area covered
    Ethiopia
    Description

    Abstract

    The 2005 Ethiopia Demographic and Health Survey (2005 EDHS) is part of the worldwide MEASURE DHS project which is funded by the United States Agency for International Development (USAID).

    The principal objective of the 2005 Ethiopia Demographic and Health Survey (DHS) is to provide current and reliable data on fertility and family planning behaviour, child mortality, adult and maternal mortality, children’s nutritional status, the utilization of maternal and child health services, knowledge of HIV/AIDS and prevalence of HIV/AIDS and anaemia.

    The specific objectives are to: - collect data at the national level which will allow the calculation of key demographic rates; - analyze the direct and indirect factors which determine the level and trends of fertility; - measure the level of contraceptive knowledge and practice of women and men by method, urban-rural residence, and region; - collect high quality data on family health including immunization coverage among children, prevalence and treatment of diarrhoea and other diseases among children under five, and maternity care indicators including antenatal visits and assistance at delivery; - collect data on infant and child mortality and maternal and adult mortality; - obtain data on child feeding practices including breastfeeding and collect anthropometric measures to use in assessing the nutritional status of women and children; - collect data on knowledge and attitudes of women and men about sexually transmitted diseases and HIV/AIDS and evaluate patterns of recent behaviour regarding condom use; - conduct haemoglobin testing on women age 15-49 and children under age five years in a subsample of the households selected for the survey to provide information on the prevalence of anaemia among women in the reproductive ages and young children; - collect samples for anonymous HIV testing from women and men in the reproductive ages to provide information on the prevalence of HIV among the adult population.

    This information is essential for informed policy decisions, planning, monitoring, and evaluation of programs on health in general and reproductive health in particular at both the national and regional levels. A long-term objective of the survey is to strengthen the technical capacity of the Central Statistical Agency to plan, conduct, process, and analyse data from complex national population and health surveys. Moreover, the 2005 Ethiopia DHS provides national and regional estimates on population and health that are comparable to data collected in similar surveys in other developing countries. The first ever Demographic and Health Survey (DHS) in Ethiopia was conducted in the year 2000 as part of the worldwide DHS programme. Data from the 2005 Ethiopia DHS survey, the second such survey, add to the vast and growing international database on demographic and health variables.

    Wherever possible, the 2005 EDHS data is compared with data from the 2000 EDHS. In addition, where applicable, the 2005 EDHS is compared with the 1990 NFFS, which also sampled women age 15-49. Husbands of currently married women were also covered in this survey. However, for security and other reasons, the NFFS excluded from its coverage Eritrea, Tigray, Asseb, and Ogaden autonomous regions. In addition, fieldwork could not be carried out for Northern Gondar, Southern Gondar, Northern Wello, and Southern Wello due to security reasons. Thus, any comparison between the EDHS and the NFFS has to be interpreted with caution.

    Geographic coverage

    National

    Analysis unit

    • Household
    • Children under five years
    • Women age 15-49
    • Men age 15-59

    Kind of data

    Sample survey data

    Sampling procedure

    The 2005 EDHS sample was designed to provide estimates for the health and demographic variables of interest for the following domains: Ethiopia as a whole; urban and rural areas of Ethiopia (each as a separate domain); and 11 geographic areas (9 regions and 2 city administrations), namely: Tigray; Affar; Amhara; Oromiya; Somali; Benishangul-Gumuz; Southern Nations, Nationalities and Peoples (SNNP); Gambela; Harari; Addis Ababa and Dire Dawa. In general, a DHS sample is stratified, clustered and selected in two stages. In the 2005 EDHS a representative sample of approximately 14,500 households from 540 clusters was selected. The sample was selected in two stages. In the first stage, 540 clusters (145 urban and 395 rural) were selected from the list of enumeration areas (EA) from the 1994 Population and Housing Census sample frame.

    In the census frame, each of the 11 administrative areas is subdivided into zones and each zone into weredas. In addition to these administrative units, each wereda was subdivided into convenient areas called census EAs. Each EA was either totally urban or rural and the EAs were grouped by administrative wereda. Demarcated cartographic maps as well as census household and population data were also available for each census EA. The 1994 Census provided an adequate frame for drawing the sample for the 2005 EDHS. As in the 2000 EDHS, the 2005 EDHS sampled three of seven zones in the Somali Region (namely, Jijiga, Shinile and Liben). In the Affar Region the incomplete frame used in 2000 was improved adding a list of villages not previously included, to improve the region's representativeness in the survey. However, despite efforts to cover the settled population, there may be some bias in the representativeness of the regional estimates for both the Somali and Affar regions, primarily because the census frame excluded some areas in these regions that had a predominantly nomadic population.

    The 540 EAs selected for the EDHS are not distributed by region proportionally to the census population. Thus, the sample for the 2005 EDHS must be weighted to produce national estimates. As part of the second stage, a complete household listing was carried out in each selected cluster. The listing operation lasted for three months from November 2004 to January 2005. Between 24 and 32 households from each cluster were then systematically selected for participation in the survey.

    Because of the way the sample was designed, the number of cases in some regions appear small since they are weighted to make the regional distribution nationally representative. Throughout this report, numbers in the tables reflect weighted numbers. To ensure statistical reliability, percentages based on 25 to 49 unweighted cases are shown in parentheses and percentages based on fewer than 25 unweighted cases are suppressed.

    Note: See detailed sample implementation table in APPENDIX A of the survey report.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    In order to adapt the standard DHS core questionnaires to the specific socio-cultural settings and needs in Ethiopia, its contents were revised through a technical committee composed of senior and experienced demographers of PHCCO. After the draft questionnaires were prepared in English, copies of the household, women’s and men’s questionnaires were distributed to relevant institutions and individual researchers for comments. A one-day workshop was organized on November 22, 2004 at the Ghion Hotel in Addis Ababa to discuss the contents of the questionnaire. Over 50 participants attended the national workshop and their comments and suggestions collected. Based on these comments, further revisions were made on the contents of the questionnaires. Some additional questions were included at the request of MOH, the Fistula Hospital, and USAID. The questionnaires were finalized in English and translated into the three main local languages: Amharic, Oromiffa and Tigrigna. In addition, the DHS core interviewer’s manual for the Women’s and Men’s Questionnaires, the supervisor’s and editor’s manual, and the HIV and anaemia field manual were modified and translated into Amharic.

    The Household Questionnaire was used to list all the usual members and visitors in the selected households. Some basic information was collected on the characteristics of each person listed, including age, sex, education, and relationship to the head of the household. The main purpose of the Household Questionnaire was to identify women and men who were eligible for the individual interview. The Household Questionnaire also collected information on characteristics of the household’s dwelling unit, such as the source of water, type of toilet facilities, materials used for the floor and roof of the house, ownership of various durable goods, and ownership and use of mosquito nets. In addition, this questionnaire was used to record height and weight measurements of women age 15-49 and children under the age of five, households eligible for collection of blood samples, and the respondents’ consent to voluntarily give blood samples.

    The Women’s Questionnaire was used to collect information from all women age 15-49 years and covered the following topics. - Household and respondent characteristics - Fertility levels and preferences - Knowledge and use of family planning - Childhood mortality - Maternity care - Childhood illness, treatment, and preventative actions - Anaemia levels among women and children - Breastfeeding practices - Nutritional status of women and young children - Malaria prevention and treatment - Marriage and sexual activity - Awareness and behaviour regarding AIDS and STIs - Harmful traditional practices - Maternal mortality

    The Men’s Questionnaire was administered to all men age 15-59 years living in every second household in the sample. The Men’s Questionnaire collected similar information contained in the Women’s Questionnaire, but was shorter because it did not contain questions on reproductive

  15. US Adult Income

    • kaggle.com
    zip
    Updated Jul 14, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    John Olafenwa (2017). US Adult Income [Dataset]. https://www.kaggle.com/forums/f/4741/us-adult-income
    Explore at:
    zip(719385 bytes)Available download formats
    Dataset updated
    Jul 14, 2017
    Authors
    John Olafenwa
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    United States
    Description

    US Adult Census data relating income to social factors such as Age, Education, race etc.

    The Us Adult income dataset was extracted by Barry Becker from the 1994 US Census Database. The data set consists of anonymous information such as occupation, age, native country, race, capital gain, capital loss, education, work class and more. Each row is labelled as either having a salary greater than ">50K" or "<=50K".

    This Data set is split into two CSV files, named adult-training.txt and adult-test.txt.

    The goal here is to train a binary classifier on the training dataset to predict the column income_bracket which has two possible values ">50K" and "<=50K" and evaluate the accuracy of the classifier with the test dataset.

    Note that the dataset is made up of categorical and continuous features. It also contains missing values The categorical columns are: workclass, education, marital_status, occupation, relationship, race, gender, native_country

    The continuous columns are: age, education_num, capital_gain, capital_loss, hours_per_week

    This Dataset was obtained from the UCI repository, it can be found on

    https://archive.ics.uci.edu/ml/datasets/census+income, http://mlr.cs.umass.edu/ml/machine-learning-databases/adult/

    USAGE This dataset is well suited to developing and testing wide linear classifiers, deep neutral network classifiers and a combination of both. For more info on Combined Deep and Wide Model classifiers, refer to the Research Paper by Google https://arxiv.org/abs/1606.07792

    Refer to this kernel for sample usage : https://www.kaggle.com/johnolafenwa/wage-prediction

    Complete Tutorial is available from http://johnolafenwa.blogspot.com.ng/2017/07/machine-learning-tutorial-1-wage.html?m=1

  16. g

    Passer-by census in Schönbornstraße | gimi9.com

    • gimi9.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Passer-by census in Schönbornstraße | gimi9.com [Dataset]. https://gimi9.com/dataset/eu_passantenzaehlung_schoenbornstrasse-wuerzburg/
    Explore at:
    License

    Data licence Germany – Attribution – Version 2.0https://www.govdata.de/dl-de/by-2-0
    License information was derived automatically

    Description

    Against the background of digitalisation, e-commerce and the aftermath of global pandemics, pedestrian frequencies are an important indicator of the attractiveness of a location. This measure indicates the number of people who have entered a delimited location within a certain period of time. As one of many Smart City solutions, Würzburg also has sensors that are able to measure the number of passers-by completely in compliance with data protection and anonymously with the help of laser technology. As one of the first locations in Germany, the city of Würzburg has been publishing the hourly passer-by frequency in Schönbornstraße transparently and openly with the help of laser technology since April 2019. In October 2022, Kaiserstraße and Spiegelstraße were added as additional locations. The aim of these measurements is to record the pulse rate of the city centre in order to create a realistic situational picture and differentiated analyses of the number of passers-by. In addition, you will find in our Open Data Portal a monthly overview of the passers-by in Schönbornstraße since April 2019 and a comparison of our three locations since October 2022.

  17. a

    Community Focus Areas 2023 RTP

    • data-wfrc.opendata.arcgis.com
    • data.wfrc.utah.gov
    Updated Jun 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wasatch Front Regional Council (2023). Community Focus Areas 2023 RTP [Dataset]. https://data-wfrc.opendata.arcgis.com/datasets/community-focus-areas-2023-rtp
    Explore at:
    Dataset updated
    Jun 15, 2023
    Dataset authored and provided by
    Wasatch Front Regional Council
    Area covered
    Description

    WFRC Community Focus Areas (2023)Geographic Representation Units WFRC’s Community Focus Areas (CFAs) are geographic areas for which additional consideration may be given within the planning and programming processes for future transportation, economic development, and other projects administered through WFRC. CFAs are used by WFRC in support of meeting the Council-established goal of promoting “inclusive engagement in transportation planning processes and equitable access to affordable and reliable transportation options.” CFAs are designated from Census block group geographic zones that meet the criteria described below. Census block groups are used as these are the smallest geographic areas for which more detailed household characteristics like employment, income, vehicle ownership, commute trip, and English language proficiency are available. WFRC recognizes the limitations of geography-based analysis, as proper planning work considers together the needs of individuals, groups and sectors, and geographic areas. However, geography-based analyses offer a useful starting point for the consideration and prioritization of projects that will serve specific community needs.2023 Community Focus Area Criteria UpdateFor the 2023 RTP planning cycle, WFRC will use two factors in designating geography-based CFAs: 1) concentration of low-income households and 2) concentration of persons identifying as members of racial and ethnic minority groups. The geography for these factors can be identified from consistent and regularly updated data sources maintained by the U.S. Census Bureau. WFRC will also make data available that conveys, while maintaining individual anonymity, the geographic distribution of additional measures including concentrations of persons with disabilities, households with limited English language proficiency, households that do not own a vehicle, older residents (65+ years of age), and younger residents (0-17 years of age). While the application of these factors within the planning process is less straightforward because of their higher statistical margins of error and comparatively even distribution within the region, these additional factors remain valuable as planning context. Low Income Focus Areas, Methodology for IdentificationThe block group-level data from the 2020 Census American Community Survey (ACS) 5-year dataset (Table C17002: Ratio of Income to Poverty Level), is used to determine the percentage of the population within each block group that are in households that have a ratio of income to federal poverty threshold of equal to or less than 1, i.e., their income is below the poverty level. The federal poverty threshold is set differently for households, considering their household size and age of household members.Census block groups in which more than 20% of the households whose income is less than or equal to the federal poverty threshold are included in the WFRC CFAs and designated as Low-Income focus areas. Racial and Ethnic Minority Focus AreasThe block group-level data from the 2020 ACS 5-year dataset (Table B03002: Hispanic or Latino Origin By Race) is used to determine the percentage of the population that did not self-identify their race and ethnicity as “White alone.” The average census block group area in the Wasatch Front urbanized areas has 24.2% of its population that identifies as Black or African American alone, American Indian, and Alaska Native alone, Asian alone, Native Hawaiian and other Pacific Islander alone, some other race alone, two or more races, or of Hispanic or Latino origin.Census blocks in which more than 40%2 of the population identifies as one or more of the racial or ethnic groups listed above are included in the WFRC CFAs and designated as Racial and Ethnic Minority focus areas.Excluding Predominantly Non-Residential Areas from CFAsSome census block groups that meet one or both of the CFA criteria described above contain large, non-residential areas or low density residential areas. Such census block areas may have small residential neighborhoods surrounded by predominantly commercial or industrial land uses, or large areas of public land or as-yet undeveloped lands. For this reason, WFRC staff may adjust the boundaries of an CFA whose census block group population density is less than 500 persons per square mile, to exclude areas of those block groups that have large, predominantly non-residential land uses.Community Focus Area Update FrequencyThe geography for WFRC CFAs will be updated not less than every four years, preceding the project phasing period of the Regional Transportation Planning update cycle. The update will use the most recent version of the 5 year ACS dataset. The next update is expected in the summer of 2026 (the beginning of the 4th year for the 2027 RTP development process) and is expected to use the 2024 5-year ACS results that average results across 2020-2024.Footnotes:1. The 2019 version of WFRC CFAs used ‘Zero Car Households’ as a third factor. This factor is no longer included because of its geographic and statistical fluctuation over time in data reported by the American Community Survey. Additionally, ‘Zero Car households’ was observed to have a strong relationship with the other two CFA designation factors.2. The percentage threshold specified here is approximately one standard deviation above the regional mean for this indicator. Assuming a statistically normal distribution, approximately 16% of the overall set (i.e. census blocks, in this case) would fall above a one standard deviation threshold.3. Table B03002 includes information from both 'Race' and 'Hispanic or Latino Origin' identification questions asked as part of the Census Bureau's American Community Survey.

  18. f

    An example of voter data with real, Zip4, Street Segment, Census Block...

    • figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jayakrishnan Ajayakumar; Andrew Curtis; Jacqueline Curtis (2023). An example of voter data with real, Zip4, Street Segment, Census Block Group, and Zip centroid details. [Dataset]. http://doi.org/10.1371/journal.pone.0285552.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Jayakrishnan Ajayakumar; Andrew Curtis; Jacqueline Curtis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    An example of voter data with real, Zip4, Street Segment, Census Block Group, and Zip centroid details.

  19. d

    2016 Census of Population [Canada] Public Use Microdata File (PUMF):...

    • search.dataone.org
    Updated Dec 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statistics Canada (2023). 2016 Census of Population [Canada] Public Use Microdata File (PUMF): Hierarchical File [Dataset]. http://doi.org/10.5683/SP3/FZSVE0
    Explore at:
    Dataset updated
    Dec 28, 2023
    Dataset provided by
    Borealis
    Authors
    Statistics Canada
    Area covered
    Canada
    Description

    This Hierarchical File, 2016 Census Public Use Microdata File (PUMF) product provides access to non-aggregated data covering a sample of 1% of the Canadian households. It is a comprehensive social, demographic and economic database about Canada and its people, and contains a wealth of characteristics on the population. The file enables the study of individuals in relation to their census families, economic families and households. Geographic identifiers have been restricted to the provinces, the three territories grouped into a region called Northern Canada and selected metropolitan areas (Toronto, Montréal, Vancouver, Edmonton and Calgary) to ensure respondents’ anonymity. This comprehensive file is excellent tool for policy analysts, pollsters, social researchers and anyone interested in modeling and performing statistical regression analysis using 2016 Census microdata.

  20. G

    Annual School Meal Census

    • dtechtive.com
    • find.data.gov.scot
    csv
    Updated Feb 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Glasgow City Council (uSmart) (2024). Annual School Meal Census [Dataset]. https://dtechtive.com/datasets/39576
    Explore at:
    csv(0.0203 MB), csv(0.0134 MB), csv(0.0138 MB), csv(0.0177 MB), csv(0.0195 MB), csv(0.0188 MB), csv(0.0162 MB), csv(0.0218 MB), csv(0.0159 MB), csv(0.0051 MB), csv(0.0156 MB)Available download formats
    Dataset updated
    Feb 2, 2024
    Dataset provided by
    Glasgow City Council (uSmart)
    Description

    Data presented here is an extract of data taken from the Annual School Meal Census in publicly funded schools in Scotland. The data shows the provision of school meals (including free school meals) for each school in the Glasgow local authority area. The dataset forms part of a time series and is available for the years 2003 through to 2014. Full datasets can be downloaded from The Scottish Government. The data is graduated to school level and data includes: the numbers on the school roll; counts of pupils entitled to free school meals; counts of pupils present on the day of the survey, counts of pupils taking a school meal (free or not) on the day of the survey; and counts of pupils taking a free school meal on the day of the survey. In order to protect the identity of pupils a * used in the dataset denotes the number of pupils is 4 or less (zero included) or where such a figure could be worked out. A * * used in the dataset denotes where the difference between the number of pupils on the register and pupils with FME is 4 or less. Some of the datasets also include information on breakfast clubs, the provision of fresh fruit and water and the anonymity of the free school meal application process. The School Meal Census is carried out annually. For individual dataset errata or qualifications users should consult the background data or notes of the individual datasets. Licence: None fsm2003-2014.zip - https://dataservices.open.glasgow.gov.uk/Download/Organisation/728522f0-86da-48c6-8f75-1649934eb8a4/Dataset/7a0f701f-b55f-463f-a748-d62d6adf9979/File/51a57ba3-cf7b-4738-90ff-81d48dac9820/Version/7e43a6c6-cff0-4be2-b6e4-cd2e9c2707a5

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
独立行政法人統計センター; 一橋大学 (2023). 2015 Population Census of Japan: Anonymous data Coding table and data layout [Dataset]. https://d-repo.ier.hit-u.ac.jp/records/2007155/file_details/tokumei_kokuH27.xlsx?filename=tokumei_kokuH27.xlsx&file_order=0

2015 Population Census of Japan: Anonymous data Coding table and data layout

Explore at:
xlsx, txt, csv, pdf, text/x-shellscript, application/x-yamlAvailable download formats
Dataset updated
Nov 8, 2023
Authors
独立行政法人統計センター; 一橋大学
Time period covered
Oct 1, 2015
Area covered
Japan, 日本
Description

平成27年に実施した国勢調査の母集団から、匿名データ提供の対象データとして更に1%(約125万レコード)リサンプリングを行ったもの。

Search
Clear search
Close search
Google apps
Main menu