Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
MSM = men who have sex with men; IDU = injection drug users.§Age was determined at the time of acquisition of the first chronological sample collected from an individual patient that was included in the analysis.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Note. Groups: SA = subacute, CH = chronic, CG = control group. Pt = patient; M/F = male/female. NIHSS: National Institutes of Health Stroke Scale. Stroke etiology: i = ischemic, h = hemorrhagic stroke. V&TDS: visual and tactile double stimulation. CAV screen: CAV visual field screening. CAV-ET: CAV extinction test. NET Score: for subtests 1 to 8 and for the whole test battery. Mean (M) and standard deviation (SD) given for patients and healthy controls.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
By [source]
This dataset contains the power to help us better understand the prevalence and treatment outcomes of childhood allergies over an extended period of time. Not only does it publicize the number of individuals currently suffering from asthma, atopic dermatitis, allergic rhinitis and food allergies through retrospective data as reported by healthcare providers - but it also features a set of columns which allow us to gain valuable insights into how these outcomes differ across different demographics such as gender, race and ethnicity. By further examining this data, we can start to recognize patterns in trends among the diagnosed cases - paving way for new treatments and prevention strategies that could prevent severe allergic reactions for many children all around the world
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
Assess what kind of questions you want to answer using this data - do you want to focus on one particular type of allergy or analyze them together? Do you want a descriptive analysis or would an analysis that looks for correlations between conditions be more appropriate?
Once you have determined your research question(s), identify what variables from the dataset are pertinent to your inquiry and assess any outliers that might need further investigation or filtering out during your analysis. Also consider any independent variables or confounding factors which might affect your results as well as any existing hypotheses related to the topic that might help guide your research project expectations
Be aware of potential sources of bias when using self-reported healthcare provider information such as difficulties in disease identification (i.e allergies may be misdiagnosed). Additionally note that many allergy cases may go unreported/unrecorded due issues such as lack access/awareness about healthcare etc). A good way combat bias is by sample size - use largest possible datasets whenever available!
Begin collecting relevant data from columns pertaining medical history (allergy diagnosis start & end date etc.), patient demographic information (gender factor ,ethnicity factor etc.), treatment trends & outcomes( first Asthma RX date , last asthma RX date , NUM asthma rx etc ). To get the most insights outta thisdata all these factors must be taken into account – if there isn’t enough evidence then explore other reliable sources too
Structure & organize collected data so they can me easily accessed later – maybe create separate sheets/tabs with different categories i.e patient/treatment information OR create individual sheets for each subject depending upon how much info needs collecting .Designing formulaic functions will not only make life easier but critically save time & energy when it comes analyzing vast amounts data stored within workbook ! Remember larger sample sizes provide more
- Use the dataset to identify risk factors or patterns in childhood allergies that can inform preventative and treatment measures.
- Investigate the correlation between demographic characteristics (e.g., age, gender) and diagnosis or severity of childhood allergies by using cross-tabs or other statistical techniques on the data provided in this dataset.
- Analyze longitudinal trends in treatment outcomes for various types of childhood allergy, such as asthma, atopic dermatitis and food allergy by comparing patient results over time (i.e., looking at pre-treatment diagnosis and post-treatment diagnoses)
If you use this dataset in your research, please credit the original authors. Data Source
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
File: food-allergy-analysis-Zenodo.csv | Column name | Description | |:----------------------------|:--------------------------------------------------------------| | BIRTH_YEAR | Year of birth of the patient. (Integer) | | GENDER_FACTOR ...
Population Health Management Market Size 2025-2029
The population health management market size is forecast to increase by USD 19.40 billion at a CAGR of 10.7% between 2024 and 2029.
The Population Health Management Market is experiencing significant growth, driven by the increasing adoption of healthcare IT solutions and the rising focus on personalized medicine. The implementation of electronic health records (EHRs) and other digital health technologies has enabled healthcare providers to collect and analyze large amounts of patient data, facilitating proactive care and population health management. Moreover, the trend towards personalized medicine, which aims to tailor healthcare treatments to individual patients based on their unique genetic makeup and health history, is further fueling the demand for PHM solutions. However, the high cost of installing and implementing these platforms poses a significant challenge for market growth.
Despite this, the potential benefits of PHM, including improved patient outcomes, reduced healthcare costs, and enhanced population health, make it an attractive area for investment and innovation. Companies seeking to capitalize on these opportunities must navigate the challenges of data privacy and security, interoperability, and integration with existing healthcare systems. By addressing these challenges and focusing on delivering actionable insights from patient data, PHM solution providers can help healthcare organizations optimize their resources, improve patient care, and ultimately, improve population health.
What will be the Size of the Population Health Management Market during the forecast period?
Request Free Sample
The market is experiencing significant growth, driven by the increasing focus on accountable care organizations (ACOs) and payer organizations to improve health outcomes and reduce costs. Healthcare professionals are leveraging big data, data analytics services, and clinical data integration to develop personalized care plans and implement intervention strategies for various populations. Telehealth services have become essential in population health management, enabling care coordination, health promotion, and health navigation for patients. Health equity is a critical factor in population health management, with a growing emphasis on addressing disparities and ensuring equal access to care.
Data security and interoperability standards are essential in population health management, as healthcare providers exchange sensitive patient data for risk adjustment, care pathways, and quality reporting. Data mining and data visualization tools are used to identify health behavior changes and lifestyle modifications, leading to better health outcomes. Consumer health technology, such as patient engagement tools and wearable technology, are playing an increasingly important role in population health management. Health coaching and evidence-based medicine are intervention strategies used to prevent diseases and improve health outcomes. In summary, the market in the US is characterized by the adoption of precision medicine, health literacy, clinical guidelines, and personalized care plans.
The market is driven by the need for care coordination, data analytics, and patient engagement to improve health outcomes and reduce costs. The use of data security, data mining, and interoperability standards ensures the effective exchange and utilization of health data.
How is this Population Health Management Industry segmented?
The population health management industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD billion' for the period 2025-2029, as well as historical data from 2019-2023 for the following segments.
Component
Software
Services
End-user
Large enterprises
SMEs
Delivery Mode
On-Premise
Cloud-Based
Web-Based
On-Premise
Cloud-Based
End-Use
Providers
Payers
Employer Groups
Government Bodies
Providers
Payers
Employer Groups
Geography
North America
US
Canada
Europe
France
Germany
Italy
UK
APAC
China
India
Japan
South Korea
Rest of World
By Component Insights
The software segment is estimated to witness significant growth during the forecast period.
The market's software segment is experiencing significant growth and innovation. Healthcare organizations are utilizing these solutions to effectively manage and enhance the health outcomes of diverse populations. The software component incorporates various tools that collect, analyze, and utilize health data for informed decision-making. Population health management platforms gather data from multiple sources, such as electronic health records, claims data, and patient-generated data. These platforms employ advanced analytics to generate valuable insi
The 110th Congressional District Summary File (Sample) (110CDSAMPLE) contains the sample data, which is the information compiled from the questions asked of a sample of all people and housing units. Population items include basic population totals; urban and rural; households and families; marital status; grandparents as caregivers; language and ability to speak English; ancestry; place of birth, citizenship status, and year of entry; migration; place of work; journey to work (commuting); school enrollment and educational attainment; veteran status; disability; employment status; industry, occupation, and class of worker; income; and poverty status. Housing items include basic housing totals; urban and rural; number of rooms; number of bedrooms; year moved into unit; household size and occupants per room; units in structure; year structure built; heating fuel; telephone service; plumbing and kitchen facilities; vehicles available; value of home; monthly rent; and shelter costs. The file contains subject content identical to that shown in Summary File 3 (SF 3).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the Excel population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Excel. The dataset can be utilized to understand the population distribution of Excel by age. For example, using this dataset, we can identify the largest age group in Excel.
Key observations
The largest age group in Excel, AL was for the group of age 45 to 49 years years with a population of 74 (15.64%), according to the ACS 2018-2022 5-Year Estimates. At the same time, the smallest age group in Excel, AL was the 85 years and over years with a population of 2 (0.42%). Source: U.S. Census Bureau American Community Survey (ACS) 2018-2022 5-Year Estimates
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2018-2022 5-Year Estimates
Age groups:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Excel Population by Age. You can refer the same here
The State Legislative District Summary File (Sample) (SLDSAMPLE) contains the sample data, which is the information compiled from the questions asked of a sample of all people and housing units. Population items include basic population totals; urban and rural; households and families; marital status; grandparents as caregivers; language and ability to speak English; ancestry; place of birth, citizenship status, and year of entry; migration; place of work; journey to work (commuting); school enrollment and educational attainment; veteran status; disability; employment status; industry, occupation, and class of worker; income; and poverty status. Housing items include basic housing totals; urban and rural; number of rooms; number of bedrooms; year moved into unit; household size and occupants per room; units in structure; year structure built; heating fuel; telephone service; plumbing and kitchen facilities; vehicles available; value of home; monthly rent; and shelter costs. The file contains subject content identical to that shown in Summary File 3 (SF 3).
This data package contains claims-based data about beneficiaries of Medicare program services including Inpatient, Outpatient, related to Chronic Conditions, Skilled Nursing Facility, Home Health Agency, Hospice, Carrier, Durable Medical Equipment (DME) and data related to Prescription Drug Events. It is necessary to mention that the values are estimated and counted, by using a random sample of fee-for-service Medicare claims.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset provides detailed medical and demographic information for 5,000 patients diagnosed with various types of eye cancer, including Melanoma, Retinoblastoma, and Lymphoma. It includes fields such as:
Patient Demographics: Age (1–90), Gender, Country
Clinical Details: Cancer type, Laterality (Left/Right/Bilateral), Stage at diagnosis
Diagnosis & Treatment: Date of diagnosis, Type of treatment (Surgery, Radiation, Chemotherapy), Treatment intensity (e.g., radiation dose in Gy, number of chemo sessions)
Outcomes: Survival time in months, Outcome status (In Remission, Active, Deceased)
Genetics & History: Genetic markers (e.g., BRAF mutation), Family history of eye cancer
This dataset is designed for use in machine learning projects, statistical modeling, and healthcare research. It is suitable for tasks like classification (e.g., predicting outcomes), regression (e.g., survival analysis), clustering, and exploratory data analysis.
Potential Uses:
Survival prediction models
Outcome analysis based on treatment types
Correlation analysis between genetic markers and cancer type
Geo-demographic trends in eye cancer incidence
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Prevalence of health conditions in entire sample.
Our consumer data is gathered and aggregated via surveys, digital services, and public data sources. We use powerful profiling algorithms to collect and ingest only fresh and reliable data points.
Our comprehensive data enrichment solution includes a variety of data sets that can help you address gaps in your customer data, gain a deeper understanding of your customers, and power superior client experiences.
Consumer Graph Schema & Reach: Our data reach represents the total number of counts available within various categories and comprises attributes such as country location, MAU, DAU & Monthly Location Pings:
Data Export Methodology: Since we collect data dynamically, we provide the most updated data and insights via a best-suited method on a suitable interval (daily/weekly/monthly).
Consumer Graph Use Cases:
360-Degree Customer View:Get a comprehensive image of customers by the means of internal and external data aggregation.
Data Enrichment:Leverage Online to offline consumer profiles to build holistic audience segments to improve campaign targeting using user data enrichment
Fraud Detection: Use multiple digital (web and mobile) identities to verify real users and detect anomalies or fraudulent activity.
Advertising & Marketing:Understand audience demographics, interests, lifestyle, hobbies, and behaviors to build targeted marketing campaigns.
Using Factori Consumer Data graph you can solve use cases like:
Acquisition Marketing Expand your reach to new users and customers using lookalike modeling with your first party audiences to extend to other potential consumers with similar traits and attributes.
Lookalike Modeling
Build lookalike audience segments using your first party audiences as a seed to extend your reach for running marketing campaigns to acquire new users or customers
And also, CRM Data Enrichment, Consumer Data Enrichment B2B Data Enrichment B2C Data Enrichment Customer Acquisition Audience Segmentation 360-Degree Customer View Consumer Profiling Consumer Behaviour Data
The 2013 Turkey Demographic and Health Survey (TDHS-2013) is a nationally representative sample survey. The primary objective of the TDHS-2013 is to provide data on socioeconomic characteristics of households and women between ages 15-49, fertility, childhood mortality, marriage patterns, family planning, maternal and child health, nutritional status of women and children, and reproductive health. The survey obtained detailed information on these issues from a sample of women of reproductive age (15-49). The TDHS-2013 was designed to produce information in the field of demography and health that to a large extent cannot be obtained from other sources.
Specifically, the objectives of the TDHS-2013 included: - Collecting data at the national level that allows the calculation of some demographic and health indicators, particularly fertility rates and childhood mortality rates, - Obtaining information on direct and indirect factors that determine levels and trends in fertility and childhood mortality, - Measuring the level of contraceptive knowledge and practice by contraceptive method and some background characteristics, i.e., region and urban-rural residence, - Collecting data relative to maternal and child health, including immunizations, antenatal care, and postnatal care, assistance at delivery, and breastfeeding, - Measuring the nutritional status of children under five and women in the reproductive ages, - Collecting data on reproductive-age women about marriage, employment status, and social status
The TDHS-2013 information is intended to provide data to assist policy makers and administrators to evaluate existing programs and to design new strategies for improving demographic, social and health policies in Turkey. Another important purpose of the TDHS-2013 is to sustain the flow of information for the interested organizations in Turkey and abroad on the Turkish population structure in the absence of a reliable and sufficient vital registration system. Additionally, like the TDHS-2008, TDHS-2013 is accepted as a part of the Official Statistic Program.
National coverage
The survey covered all de jure household members (usual residents), children age 0-5 years and women age 15-49 years resident in the household.
Sample survey data [ssd]
The sample design and sample size for the TDHS-2013 makes it possible to perform analyses for Turkey as a whole, for urban and rural areas, and for the five demographic regions of the country (West, South, Central, North, and East). The TDHS-2013 sample is of sufficient size to allow for analysis on some of the survey topics at the level of the 12 geographical regions (NUTS 1) which were adopted at the second half of the year 2002 within the context of Turkey’s move to join the European Union.
In the selection of the TDHS-2013 sample, a weighted, multi-stage, stratified cluster sampling approach was used. Sample selection for the TDHS-2013 was undertaken in two stages. The first stage of selection included the selection of blocks as primary sampling units from each strata and this task was requested from the TURKSTAT. The frame for the block selection was prepared using information on the population sizes of settlements obtained from the 2012 Address Based Population Registration System. Settlements with a population of 10,000 and more were defined as “urban”, while settlements with populations less than 10,000 were considered “rural” for purposes of the TDHS-2013 sample design. Systematic selection was used for selecting the blocks; thus settlements were given selection probabilities proportional to their sizes. Therefore more blocks were sampled from larger settlements.
The second stage of sample selection involved the systematic selection of a fixed number of households from each block, after block lists were obtained from TURKSTAT and were updated through a field operation; namely the listing and mapping fieldwork. Twentyfive households were selected as a cluster from urban blocks, and 18 were selected as a cluster from rural blocks. The total number of households selected in TDHS-2013 is 14,490.
The total number of clusters in the TDHS-2013 was set at 642. Block level household lists, each including approximately 100 households, were provided by TURKSTAT, using the National Address Database prepared for municipalities. The block lists provided by TURKSTAT were updated during the listing and mapping activities.
All women at ages 15-49 who usually live in the selected households and/or were present in the household the night before the interview were regarded as eligible for the Women’s Questionnaire and were interviewed. All analysis in this report is based on de facto women.
Note: A more technical and detailed description of the TDHS-2013 sample design, selection and implementation is presented in Appendix B of the final report of the survey.
Face-to-face [f2f]
Two main types of questionnaires were used to collect the TDHS-2013 data: the Household Questionnaire and the Individual Questionnaire for all women of reproductive age. The contents of these questionnaires were based on the DHS core questionnaire. Additions, deletions and modifications were made to the DHS model questionnaire in order to collect information particularly relevant to Turkey. Attention also was paid to ensuring the comparability of the TDHS-2013 findings with previous demographic surveys carried out by the Hacettepe Institute of Population Studies. In the process of designing the TDHS-2013 questionnaires, national and international population and health agencies were consulted for their comments.
The questionnaires were developed in Turkish and translated into English.
TDHS-2013 questionnaires were returned to the Hacettepe University Institute of Population Studies by the fieldwork teams for data processing as soon as interviews were completed in a province. The office editing staff checked that the questionnaires for all selected households and eligible respondents were returned from the field. A total of 29 data entry staff were trained for data entry activities of the TDHS-2013. The data entry of the TDHS-2013 began in late September 2013 and was completed at the end of January 2014.
The data were entered and edited on microcomputers using the Census and Survey Processing System (CSPro) software. CSPro is designed to fulfill the census and survey data processing needs of data-producing organizations worldwide. CSPro is developed by MEASURE partners, the U.S. Bureau of the Census, ICF International’s DHS Program, and SerPro S.A. CSPro allows range, skip, and consistency errors to be detected and corrected at the data entry stage. During the data entry process, 100% verification was performed by entering each questionnaire twice using different data entry operators and comparing the entered data.
In all, 14,490 households were selected for the TDHS-2013. At the time of the listing phase of the survey, 12,640 households were considered occupied and, thus, eligible for interview. Of the eligible households, 93 percent (11,794) households were successfully interviewed. The main reasons the field teams were unable to interview some households were because some dwelling units that had been listed were found to be vacant at the time of the interview or the household was away for an extended period.
In the interviewed 11,794 households, 10,840 women were identified as eligible for the individual interview, aged 15-49 and were present in the household on the night before the interview. Interviews were successfully completed with 9,746 of these women (90 percent). Among the eligible women not interviewed in the survey, the principal reason for nonresponse was the failure to find the women at home after repeated visits to the household.
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors, and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the TDHS-2013 to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the TDHS-2013 is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall
https://www.icpsr.umich.edu/web/ICPSR/studies/36998/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/36998/terms
The American Community Survey (ACS) is an ongoing statistical survey that samples a small percentage of the population every year -- giving communities the information they need to plan investments and services. The 5-year public use microdata sample (PUMS) for 2012-2016 is a subset of the 2012-2012 ACS sample. It contains the same sample as the combined PUMS 1-year files for 2012, 2013, 2014, 2015 and 2016. This data collection provides a person-level subset of 133,781 respondents whose occupations were coded as arts-related in the 2011-2015 ACS PUMS. The 2012-2016 PUMS is the seventh 5-year file published by the ACS. This data collection contains five years of data for the population from households and the group quarters (GQ) population. The GQ population and population from households are all weighted to agree with the ACS counts which are an average over the five year period (2012-2016). The ACS sample was selected from all counties across the nation. The ACS provides social, housing, and economic characteristics for demographic groups covering a broad spectrum of geographic areas in the United States. For a more detailed list of variables of what these categories include please see the decriptions of variables section.
The 1998 Ghana Demographic and Health Survey (GDHS) is the latest in a series of national-level population and health surveys conducted in Ghana and it is part of the worldwide MEASURE DHS+ Project, designed to collect data on fertility, family planning, and maternal and child health.
The primary objective of the 1998 GDHS is to provide current and reliable data on fertility and family planning behaviour, child mortality, children’s nutritional status, and the utilisation of maternal and child health services in Ghana. Additional data on knowledge of HIV/AIDS are also provided. This information is essential for informed policy decisions, planning and monitoring and evaluation of programmes at both the national and local government levels.
The long-term objectives of the survey include strengthening the technical capacity of the Ghana Statistical Service (GSS) to plan, conduct, process, and analyse the results of complex national sample surveys. Moreover, the 1998 GDHS provides comparable data for long-term trend analyses within Ghana, since it is the third in a series of demographic and health surveys implemented by the same organisation, using similar data collection procedures. The GDHS also contributes to the ever-growing international database on demographic and health-related variables.
National
Sample survey data
The major focus of the 1998 GDHS was to provide updated estimates of important population and health indicators including fertility and mortality rates for the country as a whole and for urban and rural areas separately. In addition, the sample was designed to provide estimates of key variables for the ten regions in the country.
The list of Enumeration Areas (EAs) with population and household information from the 1984 Population Census was used as the sampling frame for the survey. The 1998 GDHS is based on a two-stage stratified nationally representative sample of households. At the first stage of sampling, 400 EAs were selected using systematic sampling with probability proportional to size (PPS-Method). The selected EAs comprised 138 in the urban areas and 262 in the rural areas. A complete household listing operation was then carried out in all the selected EAs to provide a sampling frame for the second stage selection of households. At the second stage of sampling, a systematic sample of 15 households per EA was selected in all regions, except in the Northern, Upper West and Upper East Regions. In order to obtain adequate numbers of households to provide reliable estimates of key demographic and health variables in these three regions, the number of households in each selected EA in the Northern, Upper West and Upper East regions was increased to 20. The sample was weighted to adjust for over sampling in the three northern regions (Northern, Upper East and Upper West), in relation to the other regions. Sample weights were used to compensate for the unequal probability of selection between geographically defined strata.
The survey was designed to obtain completed interviews of 4,500 women age 15-49. In addition, all males age 15-59 in every third selected household were interviewed, to obtain a target of 1,500 men. In order to take cognisance of non-response, a total of 6,375 households nation-wide were selected.
Note: See detailed description of sample design in APPENDIX A of the survey report.
Face-to-face
Three types of questionnaires were used in the GDHS: the Household Questionnaire, the Women’s Questionnaire, and the Men’s Questionnaire. These questionnaires were based on model survey instruments developed for the international MEASURE DHS+ programme and were designed to provide information needed by health and family planning programme managers and policy makers. The questionnaires were adapted to the situation in Ghana and a number of questions pertaining to on-going health and family planning programmes were added. These questionnaires were developed in English and translated into five major local languages (Akan, Ga, Ewe, Hausa, and Dagbani).
The Household Questionnaire was used to enumerate all usual members and visitors in a selected household and to collect information on the socio-economic status of the household. The first part of the Household Questionnaire collected information on the relationship to the household head, residence, sex, age, marital status, and education of each usual resident or visitor. This information was used to identify women and men who were eligible for the individual interview. For this purpose, all women age 15-49, and all men age 15-59 in every third household, whether usual residents of a selected household or visitors who slept in a selected household the night before the interview, were deemed eligible and interviewed. The Household Questionnaire also provides basic demographic data for Ghanaian households. The second part of the Household Questionnaire contained questions on the dwelling unit, such as the number of rooms, the flooring material, the source of water and the type of toilet facilities, and on the ownership of a variety of consumer goods.
The Women’s Questionnaire was used to collect information on the following topics: respondent’s background characteristics, reproductive history, contraceptive knowledge and use, antenatal, delivery and postnatal care, infant feeding practices, child immunisation and health, marriage, fertility preferences and attitudes about family planning, husband’s background characteristics, women’s work, knowledge of HIV/AIDS and STDs, as well as anthropometric measurements of children and mothers.
The Men’s Questionnaire collected information on respondent’s background characteristics, reproduction, contraceptive knowledge and use, marriage, fertility preferences and attitudes about family planning, as well as knowledge of HIV/AIDS and STDs.
A total of 6,375 households were selected for the GDHS sample. Of these, 6,055 were occupied. Interviews were completed for 6,003 households, which represent 99 percent of the occupied households. A total of 4,970 eligible women from these households and 1,596 eligible men from every third household were identified for the individual interviews. Interviews were successfully completed for 4,843 women or 97 percent and 1,546 men or 97 percent. The principal reason for nonresponse among individual women and men was the failure of interviewers to find them at home despite repeated callbacks.
Note: See summarized response rates by place of residence in Table 1.1 of the survey report.
The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors, and (2) sampling errors. Nonsampling errors are the results of shortfalls made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 1998 GDHS to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 1998 GDHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 1998 GDHS sample is the result of a two-stage stratified design, and, consequently, it was necessary to use more complex formulae. The computer software used to calculate sampling errors for the 1998 GDHS is the ISSA Sampling Error Module. This module uses the Taylor linearization method of variance estimation for survey estimates that are means or proportions. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.
Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Age distribution of eligible and interviewed men - Completeness of reporting - Births by calendar years - Reporting of age at death in days - Reporting of age at death in months
Note: See detailed tables in APPENDIX C of the survey report.
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
A. SUMMARY This archived dataset includes data for population characteristics that are no longer being reported publicly. The date on which each population characteristic type was archived can be found in the field “data_loaded_at”.
B. HOW THE DATASET IS CREATED Data on the population characteristics of COVID-19 cases are from: * Case interviews * Laboratories * Medical providers These multiple streams of data are merged, deduplicated, and undergo data verification processes.
Race/ethnicity * We include all race/ethnicity categories that are collected for COVID-19 cases. * The population estimates for the "Other" or “Multi-racial” groups should be considered with caution. The Census definition is likely not exactly aligned with how the City collects this data. For that reason, we do not recommend calculating population rates for these groups.
Gender * The City collects information on gender identity using these guidelines.
Skilled Nursing Facility (SNF) occupancy * A Skilled Nursing Facility (SNF) is a type of long-term care facility that provides care to individuals, generally in their 60s and older, who need functional assistance in their daily lives. * This dataset includes data for COVID-19 cases reported in Skilled Nursing Facilities (SNFs) through 12/31/2022, archived on 1/5/2023. These data were identified where “Characteristic_Type” = ‘Skilled Nursing Facility Occupancy’.
Sexual orientation * The City began asking adults 18 years old or older for their sexual orientation identification during case interviews as of April 28, 2020. Sexual orientation data prior to this date is unavailable. * The City doesn’t collect or report information about sexual orientation for persons under 12 years of age. * Case investigation interviews transitioned to the California Department of Public Health, Virtual Assistant information gathering beginning December 2021. The Virtual Assistant is only sent to adults who are 18+ years old. https://www.sfdph.org/dph/files/PoliciesProcedures/COM9_SexualOrientationGuidelines.pdf">Learn more about our data collection guidelines pertaining to sexual orientation.
Comorbidities * Underlying conditions are reported when a person has one or more underlying health conditions at the time of diagnosis or death.
Homelessness Persons are identified as homeless based on several data sources: * self-reported living situation * the location at the time of testing * Department of Public Health homelessness and health databases * Residents in Single-Room Occupancy hotels are not included in these figures. These methods serve as an estimate of persons experiencing homelessness. They may not meet other homelessness definitions.
Single Room Occupancy (SRO) tenancy * SRO buildings are defined by the San Francisco Housing Code as having six or more "residential guest rooms" which may be attached to shared bathrooms, kitchens, and living spaces. * The details of a person's living arrangements are verified during case interviews.
Transmission Type * Information on transmission of COVID-19 is based on case interviews with individuals who have a confirmed positive test. Individuals are asked if they have been in close contact with a known COVID-19 case. If they answer yes, transmission category is recorded as contact with a known case. If they report no contact with a known case, transmission category is recorded as community transmission. If the case is not interviewed or was not asked the question, they are counted as unknown.
C. UPDATE PROCESS This dataset has been archived and will no longer update as of 9/11/2023.
D. HOW TO USE THIS DATASET Population estimates are only available for age groups and race/ethnicity categories. San Francisco population estimates for race/ethnicity and age groups can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS).
This dataset includes many different types of characteristics. Filter the “Characteristic Type” column to explore a topic area. Then, the “Characteristic Group” column shows each group or category within that topic area and the number of cases on each date.
New cases are the count of cases within that characteristic group where the positive tests were collected on that specific specimen collection date. Cumulative cases are the running total of all San Francisco cases in that characteristic group up to the specimen collection date listed.
This data may not be immediately available for recently reported cases. Data updates as more information becomes available.
To explore data on the total number of cases, use the ARCHIVED: COVID-19 Cases Over Time dataset.
E. CHANGE LOG
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Population: City: Age 15 to 64: Anhui data was reported at 13.705 Person th in 2023. This records an increase from the previous number of 13.631 Person th for 2022. Population: City: Age 15 to 64: Anhui data is updated yearly, averaging 8.738 Person th from Dec 1997 (Median) to 2023, with 27 observations. The data reached an all-time high of 11,787.862 Person th in 2020 and a record low of 3.193 Person th in 2002. Population: City: Age 15 to 64: Anhui data remains active status in CEIC and is reported by National Bureau of Statistics. The data is categorized under China Premium Database’s Socio-Demographic – Table CN.GA: Population: Sample Survey: By Age and Region: City.
The 1997 Jordan Population and Family Health Survey (JPFHS) is a national sample survey carried out by the Department of Statistics (DOS) as part of its National Household Surveys Program (NHSP). The JPFHS was specifically aimed at providing information on fertility, family planning, and infant and child mortality. Information was also gathered on breastfeeding, on maternal and child health care and nutritional status, and on the characteristics of households and household members. The survey will provide policymakers and planners with important information for use in formulating informed programs and policies on reproductive behavior and health.
National
Sample survey data
SAMPLE DESIGN AND IMPLEMENTATION
The 1997 JPFHS sample was designed to produce reliable estimates of major survey variables for the country as a whole, for urban and rural areas, for the three regions (each composed of a group of governorates), and for the three major governorates, Amman, Irbid, and Zarqa.
The 1997 JPFHS sample is a subsample of the master sample that was designed using the frame obtained from the 1994 Population and Housing Census. A two-stage sampling procedure was employed. First, primary sampling units (PSUs) were selected with probability proportional to the number of housing units in the PSU. A total of 300 PSUs were selected at this stage. In the second stage, in each selected PSU, occupied housing units were selected with probability inversely proportional to the number of housing units in the PSU. This design maintains a self-weighted sampling fraction within each governorate.
UPDATING OF SAMPLING FRAME
Prior to the main fieldwork, mapping operations were carried out and the sample units/blocks were selected and then identified and located in the field. The selected blocks were delineated and the outer boundaries were demarcated with special signs. During this process, the numbers on buildings and housing units were updated, listed and documented, along with the name of the owner/tenant of the unit or household and the name of the household head. These activities took place between January 7 and February 28, 1997.
Note: See detailed description of sample design in APPENDIX A of the survey report.
Face-to-face
The 1997 JPFHS used two questionnaires, one for the household interview and the other for eligible women. Both questionnaires were developed in English and then translated into Arabic. The household questionnaire was used to list all members of the sampled households, including usual residents as well as visitors. For each member of the household, basic demographic and social characteristics were recorded and women eligible for the individual interview were identified. The individual questionnaire was developed utilizing the experience gained from previous surveys, in particular the 1983 and 1990 Jordan Fertility and Family Health Surveys (JFFHS).
The 1997 JPFHS individual questionnaire consists of 10 sections: - Respondent’s background - Marriage - Reproduction (birth history) - Contraception - Pregnancy, breastfeeding, health and immunization - Fertility preferences - Husband’s background, woman’s work and residence - Knowledge of AIDS - Maternal mortality - Height and weight of children and mothers.
Fieldwork and data processing activities overlapped. After a week of data collection, and after field editing of questionnaires for completeness and consistency, the questionnaires for each cluster were packaged together and sent to the central office in Amman where they were registered and stored. Special teams were formed to carry out office editing and coding.
Data entry started after a week of office data processing. The process of data entry, editing, and cleaning was done by means of the ISSA (Integrated System for Survey Analysis) program DHS has developed especially for such surveys. The ISSA program allows data to be edited while being entered. Data entry was completed on November 14, 1997. A data processing specialist from Macro made a trip to Jordan in November and December 1997 to identify problems in data entry, editing, and cleaning, and to work on tabulations for both the preliminary and final report.
A total of 7,924 occupied housing units were selected for the survey; from among those, 7,592 households were found. Of the occupied households, 7,335 (97 percent) were successfully interviewed. In those households, 5,765 eligible women were identified, and complete interviews were obtained with 5,548 of them (96 percent of all eligible women). Thus, the overall response rate of the 1997 JPFHS was 93 percent. The principal reason for nonresponse among the women was the failure of interviewers to find them at home despite repeated callbacks.
Note: See summarized response rates by place of residence in Table 1.1 of the survey report.
The estimates from a sample survey are subject to two types of errors: nonsampling errors and sampling errors. Nonsampling errors are the result of mistakes made in implementing data collection and data processing (such as failure to locate and interview the correct household, misunderstanding questions either by the interviewer or the respondent, and data entry errors). Although during the implementation of the 1997 JPFHS numerous efforts were made to minimize this type of error, nonsampling errors are not only impossible to avoid but also difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The respondents selected in the 1997 JPFHS constitute only one of many samples that could have been selected from the same population, given the same design and expected size. Each of those samples would have yielded results differing somewhat from the results of the sample actually selected. Sampling errors are a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, since the 1997 JDHS-II sample resulted from a multistage stratified design, formulae of higher complexity had to be used. The computer software used to calculate sampling errors for the 1997 JDHS-II was the ISSA Sampling Error Module, which uses the Taylor linearization method of variance estimation for survey estimates that are means or proportions. The Jackknife repeated replication method is used for variance estimation of more complex statistics, such as fertility and mortality rates.
Note: See detailed estimate of sampling error calculation in APPENDIX B of the survey report.
Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Completeness of reporting - Births by calendar years - Reporting of age at death in days - Reporting of age at death in months
Note: See detailed tables in APPENDIX C of the survey report.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Population: County: Age 15 to 64: Jiangxi data was reported at 10.872 Person th in 2023. This records a decrease from the previous number of 10.920 Person th for 2022. Population: County: Age 15 to 64: Jiangxi data is updated yearly, averaging 15.771 Person th from Dec 1997 (Median) to 2023, with 27 observations. The data reached an all-time high of 19,227.826 Person th in 2000 and a record low of 10.216 Person th in 2019. Population: County: Age 15 to 64: Jiangxi data remains active status in CEIC and is reported by National Bureau of Statistics. The data is categorized under China Premium Database’s Socio-Demographic – Table CN.GA: Population: Sample Survey: By Age and Region: Rural.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Population: County: Age 65 and Above: Guizhou data was reported at 2.907 Person th in 2023. This records an increase from the previous number of 2.794 Person th for 2022. Population: County: Age 65 and Above: Guizhou data is updated yearly, averaging 1.948 Person th from Dec 1997 (Median) to 2023, with 27 observations. The data reached an all-time high of 2,675.335 Person th in 2020 and a record low of 1.455 Person th in 1999. Population: County: Age 65 and Above: Guizhou data remains active status in CEIC and is reported by National Bureau of Statistics. The data is categorized under China Premium Database’s Socio-Demographic – Table CN.GA: Population: Sample Survey: By Age and Region: Rural.
This data collection contains a stratified 1-percent sample of households, with separate records for each household, each "sample line" respondent, and each person in the household. These records were encoded from microfilm copies of original handwritten enumeration schedules from the 1950 Census of Population. Geographic identification of the location of the sampled households includes Census regions and divisions, states (except Alaska and Hawaii), Standard Metropolitan Areas (SMAs), and State Economic Areas (SEAs). The data collection was constructed from and consists of 20 independently-drawn subsamples stored in 20 discrete physical files. The 1950 Census had both a complete-count and a sample component. Individuals selected for the sample component were asked a set of additional questions. Only households with a sample line person were included in the 1950 Public Use Microdata Sample. The collection also contains records of group quarters members who were also on the Census sample line. Each household record contains variables describing the location and composition of the household. The sample line records contain variables describing demographic characteristics such as nativity, marital status, number of children, veteran status, education, income, and occupation. The person records contain demographic variables such as nativity, marital status, family membership, and occupation. (Source: downloaded from ICPSR 7/13/10)
Please Note: This dataset is part of the historical CISER Data Archive Collection and is also available at ICPSR at https://doi.org/10.3886/ICPSR08251.v1. We highly recommend using the ICPSR version as they may make this dataset available in multiple data formats in the future.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
MSM = men who have sex with men; IDU = injection drug users.§Age was determined at the time of acquisition of the first chronological sample collected from an individual patient that was included in the analysis.