22 datasets found

Income of individuals by age group, sex and income source, Canada, provinces...
www150.statcan.gc.ca
ouvert.canada.ca
+2more
Updated May 1, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Government of Canada, Statistics Canada (2025). Income of individuals by age group, sex and income source, Canada, provinces and selected census metropolitan areas [Dataset]. http://doi.org/10.25318/1110023901-eng
Explore at:
Unique identifier
https://doi.org/10.25318/1110023901-eng
Dataset updated
May 1, 2025
Dataset provided by
Statistics Canadahttps://statcan.gc.ca/en
Area covered
Canada
Description
Income of individuals by age group, sex and income source, Canada, provinces and selected census metropolitan areas, annual.
Single-earner and dual-earner census families by number of children
www150.statcan.gc.ca
ouvert.canada.ca
+2more
Updated Jul 18, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Government of Canada, Statistics Canada (2025). Single-earner and dual-earner census families by number of children [Dataset]. http://doi.org/10.25318/1110002801-eng
Explore at:
Unique identifier
https://doi.org/10.25318/1110002801-eng
Dataset updated
Jul 18, 2025
Dataset provided by
Statistics Canadahttps://statcan.gc.ca/en
Area covered
Canada
Description
Families of tax filers; Single-earner and dual-earner census families by number of children (final T1 Family File; T1FF).
High income tax filers in Canada
www150.statcan.gc.ca
open.canada.ca
Updated Oct 28, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Government of Canada, Statistics Canada (2024). High income tax filers in Canada [Dataset]. http://doi.org/10.25318/1110005501-eng
Explore at:
Unique identifier
https://doi.org/10.25318/1110005501-eng
Dataset updated
Oct 28, 2024
Dataset provided by
Statistics Canadahttps://statcan.gc.ca/en
Area covered
Canada
Description
This table presents income shares, thresholds, tax shares, and total counts of individual Canadian tax filers, with a focus on high income individuals (95% income threshold, 99% threshold, etc.). Income thresholds are based on national threshold values, regardless of selected geography; for example, the number of Nova Scotians in the top 1% will be calculated as the number of taxfiling Nova Scotians whose total income exceeded the 99% national income threshold. Different definitions of income are available in the table namely market, total, and after-tax income, both with and without capital gains.
t
Tucson Equity Priority Index (TEPI): Ward 6 Census Block Groups
teds.tucsonaz.gov
Updated Feb 4, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Tucson (2025). Tucson Equity Priority Index (TEPI): Ward 6 Census Block Groups [Dataset]. https://teds.tucsonaz.gov/maps/cotgis::tucson-equity-priority-index-tepi-ward-6-census-block-groups
Explore at:
Dataset updated
Feb 4, 2025
Dataset authored and provided by
City of Tucson
Area covered

Description
For detailed information, visit the Tucson Equity Priority Index StoryMap.Download the Data DictionaryWhat is the Tucson Equity Priority Index (TEPI)?The Tucson Equity Priority Index (TEPI) is a tool that describes the distribution of socially vulnerable demographics. It categorizes the dataset into 5 classes that represent the differing prioritization needs based on the presence of social vulnerability: Low (0-20), Low-Moderate (20-40), Moderate (40-60), Moderate-High (60-80) High (80-100). Each class represents 20% of the dataset’s features in order of their values. The features within the Low (0-20) classification represent the areas that, when compared to all other locations in the study area, have the lowest need for prioritization, as they tend to have less socially vulnerable demographics. The features that fall into the High (80-100) classification represent the 20% of locations in the dataset that have the greatest need for prioritization, as they tend to have the highest proportions of socially vulnerable demographics. How is social vulnerability measured?The Tucson Equity Priority Index (TEPI) examines the proportion of vulnerability per feature using 11 demographic indicators:Income Below Poverty: Households with income at or below the federal poverty level (FPL), which in 2023 was $14,500 for an individual and $30,000 for a family of fourUnemployment: Measured as the percentage of unemployed persons in the civilian labor forceHousing Cost Burdened: Homeowners who spend more than 30% of their income on housing expenses, including mortgage, maintenance, and taxesRenter Cost Burdened: Renters who spend more than 30% of their income on rentNo Health Insurance: Those without private health insurance, Medicare, Medicaid, or any other plan or programNo Vehicle Access: Households without automobile, van, or truck accessHigh School Education or Less: Those highest level of educational attainment is a High School diploma, equivalency, or lessLimited English Ability: Those whose ability to speak English is "Less Than Well."People of Color: Those who identify as anything other than Non-Hispanic White Disability: Households with one or more physical or cognitive disabilities Age: Groups that tend to have higher levels of vulnerability, including children (those below 18), and seniors (those 65 and older)An overall percentile value is calculated for each feature based on the total proportion of the above indicators in each area. How are the variables combined?These indicators are divided into two main categories that we call Thematic Indices: Economic and Personal Characteristics. The two thematic indices are further divided into five sub-indices called Tier-2 Sub-Indices. Each Tier-2 Sub-Index contains 2-3 indicators. Indicators are the datasets used to measure vulnerability within each sub-index. The variables for each feature are re-scaled using the percentile normalization method, which converts them to the same scale using values between 0 to 100. The variables are then combined first into each of the five Tier-2 Sub-Indices, then the Thematic Indices, then the overall TEPI using the mean aggregation method and equal weighting. The resulting dataset is then divided into the five classes, where:High Vulnerability (80-100%): Representing the top classification, this category includes the highest 20% of regions that are the most socially vulnerable. These areas require the most focused attention. Moderate-High Vulnerability (60-80%): This upper-middle classification includes areas with higher levels of vulnerability compared to the median. While not the highest, these areas are more vulnerable than a majority of the dataset and should be considered for targeted interventions. Moderate Vulnerability (40-60%): Representing the middle or median quintile, this category includes areas of average vulnerability. These areas may show a balanced mix of high and low vulnerability. Detailed examination of specific indicators is recommended to understand the nuanced needs of these areas. Low-Moderate Vulnerability (20-40%): Falling into the lower-middle classification, this range includes areas that are less vulnerable than most but may still exhibit certain vulnerable characteristics. These areas typically have a mix of lower and higher indicators, with the lower values predominating. Low Vulnerability (0-20%): This category represents the bottom classification, encompassing the lowest 20% of data points. Areas in this range are the least vulnerable, making them the most resilient compared to all other features in the dataset.
N
Income Distribution by Quintile: Mean Household Income in Winchester, VA //...
neilsberg.com
csv, json
Updated Mar 3, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2025). Income Distribution by Quintile: Mean Household Income in Winchester, VA // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/winchester-va-median-household-income/
Explore at:
csv, jsonAvailable download formats
Dataset updated
Mar 3, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Virginia, Winchester
Variables measured
Income Level, Mean Household Income
Measurement technique
The data presented in this dataset is derived from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. It delineates income distributions across income quintiles (mentioned above) following an initial analysis and categorization. Subsequently, we adjusted these figures for inflation using the Consumer Price Index retroactive series via current methods (R-CPI-U-RS). For additional information about these estimations, please contact us via email at research@neilsberg.com
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset presents the mean household income for each of the five quintiles in Winchester, VA, as reported by the U.S. Census Bureau. The dataset highlights the variation in mean household income across quintiles, offering valuable insights into income distribution and inequality.

Key observations

Income disparities: The mean income of the lowest quintile (20% of households with the lowest income) is 14,125, while the mean income for the highest quintile (20% of households with the highest income) is 215,015. This indicates that the top earners earn 15 times compared to the lowest earners.

*Top 5%: * The mean household income for the wealthiest population (top 5%) is 344,621, which is 160.28% higher compared to the highest quintile, and 2439.79% higher compared to the lowest quintile.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Income Levels:

Lowest Quintile

Second Quintile

Third Quintile

Fourth Quintile

Highest Quintile

Top 5 Percent

Variables / Data Columns

Income Level: This column showcases the income levels (As mentioned above).

Mean Household Income: Mean household income, in 2023 inflation-adjusted dollars for the specific income level.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Winchester median household income. You can refer the same here
A Gold Standard Corpus for Activity Information (GoSCAI)
zenodo.org
Updated May 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zenodo (2025). A Gold Standard Corpus for Activity Information (GoSCAI) [Dataset]. http://doi.org/10.5281/zenodo.15528545
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.15528545
Dataset updated
May 30, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Description
A Gold Standard Corpus for Activity Information

Dataset Title: A Gold Standard Corpus for Activity Information (GoSCAI)

Dataset Curators: The Epidemiology & Biostatistics Section of the NIH Clinical Center Rehabilitation Medicine Department

Dataset Version: 1.0 (May 16, 2025)

Dataset Citation and DOI: NIH CC RMD Epidemiology & Biostatistics Section. (2025). A Gold Standard Corpus for Activity Information (GoSCAI) [Data set]. Zenodo. doi: 10.5281/zenodo.15528545

EXECUTIVE SUMMARY

This data statement is for a gold standard corpus of de-identified clinical notes that have been annotated for human functioning information based on the framework of the WHO's International Classification of Functioning, Disability and Health (ICF). The corpus includes 484 notes from a single institution within the United States written in English in a clinical setting. This dataset was curated for the purpose of training natural language processing models to automatically identify, extract, and classify information on human functioning at the whole-person, or activity, level.

CURATION RATIONALE

This dataset is curated to be a publicly available resource for the development and evaluation of methods for the automatic extraction and classification of activity-level functioning information as defined in the ICF. The goals of data curation are to 1) create a corpus of a size that can be manually deidentified and annotated, 2) maximize the density and diversity of functioning information of interest, and 3) allow public dissemination of the data.

LANGUAGE VARIETIES

Language Region: en-US

Prose Description: English as written by native and bilingual English speakers in a clinical setting

LANGUAGE USER DEMOGRAPHIC

The language users represented in this dataset are medical and clinical professionals who work in a research hospital setting. These individuals hold professional degrees corresponding to their respective specialties. Specific demographic characteristics of the language users such as age, gender, or race/ethnicity were not collected.

ANNOTATOR DEMOGRAPHIC

The annotator group consisted of five people, 33 to 76 years old, including four females and one male. Socioeconomically, they came from the middle and upper-middle income classes. Regarding first language, three annotators had English as their first language, one had Chinese, and one had Spanish. Proficiency in English, the language of the data being annotated, was native for three of the annotators and bilingual for the other two. The annotation team included clinical rehabilitation domain experts with backgrounds in occupational therapy, physical therapy, and individuals with public health and data science expertise. Prior to annotation, all annotators were trained on the specific annotation process using established guidelines for the given domain, and annotators were required to achieve a specified proficiency level prior to annotating notes in this corpus.

LINGUISTIC SITUATION AND TEXT CHARACTERISTICS

The notes in the dataset were written as part of clinical care within a U.S. research hospital between May 2008 and November 2019. These notes were written by health professionals asynchronously following the patient encounter to document the interaction and support continuity of care. The intended audience of these notes were clinicians involved in the patients' care. The included notes come from nine disciplines - neuropsychology, occupational therapy, physical medicine (physiatry), physical therapy, psychiatry, recreational therapy, social work, speech language pathology, and vocational rehabilitation. The notes were curated to support research on natural language processing for functioning information between 2018 and 2024.

PREPROCESSING AND DATA FORMATTING

The final corpus was derived from a set of clinical notes extracted from the hospital electronic medical record (EMR) for the purpose of clinical research. The original data include character-based digital content originally. We work in ASCII 8 or UNICODE encoding, and therefore part of our pre-processing includes running encoding detection and transformation from encodings such as Windows-1252 or ISO-8859 format to our preferred format.

On the larger corpus, we applied sampling to match our curation rationale. Given the resource constraints of manual annotation, we set out to create a dataset of 500 clinical notes, which would exclude notes over 10,000 characters in length.

To promote density and diversity, we used five note characteristics as sampling criteria. We used the text length as expressed in number of characters. Next, we considered the discipline group as derived from note type metadata and describes which discipline a note originated from: occupational and vocational therapy (OT/VOC), physical therapy (PT), recreation therapy (RT), speech and language pathology (SLP), social work (SW), or miscellaneous (MISC, including psychiatry, neurology and physiatry). These disciplines were selected for collecting the larger corpus because their notes are likely to include functioning information. Existing information extraction tools were used to obtain annotation counts in four areas of functioning and provided a note’s annotation count, annotation density (annotation count divided by text length), and domain count (number of domains with at least 1 annotation).

We used stratified sampling across the 6 discipline groups to ensure discipline diversity in the corpus. Because of low availability, 50 notes were sampled from SLP with relaxed criteria, and 90 notes each from the 5 other discipline groups with stricter criteria. Sampled SLP notes were those with the highest annotation density that had an annotation count of at least 5 and a domain count of at least 2. Other notes were sampled by highest annotation count and lowest text length, with a minimum annotation count of 15 and minimum domain count of 3.

The notes in the resulting sample included certain types of PHI and PII. To prepare for public dissemination, all sensitive or potentially identifying information was manually annotated in the notes and replaced with substituted content to ensure readability and enough context needed for machine learning without exposing any sensitive information. This de-identification effort was manually reviewed to ensure no PII or PHI exposure and correct any resulting readability issues. Notes about pediatric patients were excluded. No intent was made to sample multiple notes from the same patient. No metadata is provided to group notes other than by note type, discipline, or discipline group. The dataset is not organized beyond the provided metadata, but publications about models trained on this dataset should include information on the train/test splits used.

All notes were sentence-segmented and tokenized using the spaCy en_core_web_lg model with additional rules for sentence segmentation customized to the dataset. Notes are stored in an XML format readable by the GATE annotation software (https://gate.ac.uk/family/developer.html), which stores annotations separately in annotation sets.

CAPTURE QUALITY

As the clinical notes were extracted directly from the EMR in text format, the capture quality was determined to be high. The clinical notes did not have to be converted from other data formats, which means this dataset is free from noise introduced by conversion processes such as optical character recognition.

LIMITATIONS

Because of the effort required to manually deidentify and annotate notes, this corpus is limited in terms of size and representation. The curation decisions skewed note selection towards specific disciplines and note types to increase the likelihood of encountering information on functioning. Some subtypes of functioning occur infrequently in the data, or not at all. The deidentification of notes was done in a manner to preserve natural language as it would occur in the notes, but some information is lost, e.g. on rare diseases.

METADATA

Information on the manual annotation process is provided in the annotation guidelines for each of the four domains:

- Communication & Cognition (https://zenodo.org/records/13910167)

- Mobility (https://zenodo.org/records/11074838)

- Self-Care & Domestic Life (SCDL) (https://zenodo.org/records/11210183)

- Interpersonal Interactions & Relationships (IPIR) (https://zenodo.org/records/13774684)

Inter-annotator agreement was established on development datasets described in the annotation guidelines prior to the annotation of this gold standard corpus.

The gold standard corpus consists of 484 documents, which include 35,147 sentences in total. The distribution of annotated information is provided in the table below.

<td style="width: 1.75in; padding: 0in 5.4pt 0in

Domain

Number of Annotated Sentences

% of All Sentences

Mean Number of Annotated Sentences per Document

Communication & Cognition

6033

17.2%
t
Tucson Equity Priority Index (TEPI): Ward 2 Census Block Groups
teds.tucsonaz.gov
Updated Feb 4, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Tucson (2025). Tucson Equity Priority Index (TEPI): Ward 2 Census Block Groups [Dataset]. https://teds.tucsonaz.gov/datasets/tucson-equity-priority-index-tepi-ward-2-census-block-groups/about
Explore at:
Dataset updated
Feb 4, 2025
Dataset authored and provided by
City of Tucson
Area covered

Description
For detailed information, visit the Tucson Equity Priority Index StoryMap.Download the Data DictionaryWhat is the Tucson Equity Priority Index (TEPI)?The Tucson Equity Priority Index (TEPI) is a tool that describes the distribution of socially vulnerable demographics. It categorizes the dataset into 5 classes that represent the differing prioritization needs based on the presence of social vulnerability: Low (0-20), Low-Moderate (20-40), Moderate (40-60), Moderate-High (60-80) High (80-100). Each class represents 20% of the dataset’s features in order of their values. The features within the Low (0-20) classification represent the areas that, when compared to all other locations in the study area, have the lowest need for prioritization, as they tend to have less socially vulnerable demographics. The features that fall into the High (80-100) classification represent the 20% of locations in the dataset that have the greatest need for prioritization, as they tend to have the highest proportions of socially vulnerable demographics. How is social vulnerability measured?The Tucson Equity Priority Index (TEPI) examines the proportion of vulnerability per feature using 11 demographic indicators:Income Below Poverty: Households with income at or below the federal poverty level (FPL), which in 2023 was $14,500 for an individual and $30,000 for a family of fourUnemployment: Measured as the percentage of unemployed persons in the civilian labor forceHousing Cost Burdened: Homeowners who spend more than 30% of their income on housing expenses, including mortgage, maintenance, and taxesRenter Cost Burdened: Renters who spend more than 30% of their income on rentNo Health Insurance: Those without private health insurance, Medicare, Medicaid, or any other plan or programNo Vehicle Access: Households without automobile, van, or truck accessHigh School Education or Less: Those highest level of educational attainment is a High School diploma, equivalency, or lessLimited English Ability: Those whose ability to speak English is "Less Than Well."People of Color: Those who identify as anything other than Non-Hispanic White Disability: Households with one or more physical or cognitive disabilities Age: Groups that tend to have higher levels of vulnerability, including children (those below 18), and seniors (those 65 and older)An overall percentile value is calculated for each feature based on the total proportion of the above indicators in each area. How are the variables combined?These indicators are divided into two main categories that we call Thematic Indices: Economic and Personal Characteristics. The two thematic indices are further divided into five sub-indices called Tier-2 Sub-Indices. Each Tier-2 Sub-Index contains 2-3 indicators. Indicators are the datasets used to measure vulnerability within each sub-index. The variables for each feature are re-scaled using the percentile normalization method, which converts them to the same scale using values between 0 to 100. The variables are then combined first into each of the five Tier-2 Sub-Indices, then the Thematic Indices, then the overall TEPI using the mean aggregation method and equal weighting. The resulting dataset is then divided into the five classes, where:High Vulnerability (80-100%): Representing the top classification, this category includes the highest 20% of regions that are the most socially vulnerable. These areas require the most focused attention. Moderate-High Vulnerability (60-80%): This upper-middle classification includes areas with higher levels of vulnerability compared to the median. While not the highest, these areas are more vulnerable than a majority of the dataset and should be considered for targeted interventions. Moderate Vulnerability (40-60%): Representing the middle or median quintile, this category includes areas of average vulnerability. These areas may show a balanced mix of high and low vulnerability. Detailed examination of specific indicators is recommended to understand the nuanced needs of these areas. Low-Moderate Vulnerability (20-40%): Falling into the lower-middle classification, this range includes areas that are less vulnerable than most but may still exhibit certain vulnerable characteristics. These areas typically have a mix of lower and higher indicators, with the lower values predominating. Low Vulnerability (0-20%): This category represents the bottom classification, encompassing the lowest 20% of data points. Areas in this range are the least vulnerable, making them the most resilient compared to all other features in the dataset.
t
Tucson Equity Priority Index (TEPI): Pima County Block Groups
teds.tucsonaz.gov
tucson-equity-data-hub-cotgis.hub.arcgis.com
Updated Jul 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Tucson (2024). Tucson Equity Priority Index (TEPI): Pima County Block Groups [Dataset]. https://teds.tucsonaz.gov/maps/cotgis::tucson-equity-priority-index-tepi-pima-county-block-groups
Explore at:
Dataset updated
Jul 23, 2024
Dataset authored and provided by
City of Tucson
Area covered

Description
For detailed information, visit the Tucson Equity Priority Index StoryMap.Download the Data DictionaryWhat is the Tucson Equity Priority Index (TEPI)?The Tucson Equity Priority Index (TEPI) is a tool that describes the distribution of socially vulnerable demographics. It categorizes the dataset into 5 classes that represent the differing prioritization needs based on the presence of social vulnerability: Low (0-20), Low-Moderate (20-40), Moderate (40-60), Moderate-High (60-80) High (80-100). Each class represents 20% of the dataset’s features in order of their values. The features within the Low (0-20) classification represent the areas that, when compared to all other locations in the study area, have the lowest need for prioritization, as they tend to have less socially vulnerable demographics. The features that fall into the High (80-100) classification represent the 20% of locations in the dataset that have the greatest need for prioritization, as they tend to have the highest proportions of socially vulnerable demographics. How is social vulnerability measured?The Tucson Equity Priority Index (TEPI) examines the proportion of vulnerability per feature using 11 demographic indicators:Income Below Poverty: Households with income at or below the federal poverty level (FPL), which in 2023 was $14,500 for an individual and $30,000 for a family of fourUnemployment: Measured as the percentage of unemployed persons in the civilian labor forceHousing Cost Burdened: Homeowners who spend more than 30% of their income on housing expenses, including mortgage, maintenance, and taxesRenter Cost Burdened: Renters who spend more than 30% of their income on rentNo Health Insurance: Those without private health insurance, Medicare, Medicaid, or any other plan or programNo Vehicle Access: Households without automobile, van, or truck accessHigh School Education or Less: Those highest level of educational attainment is a High School diploma, equivalency, or lessLimited English Ability: Those whose ability to speak English is "Less Than Well."People of Color: Those who identify as anything other than Non-Hispanic White Disability: Households with one or more physical or cognitive disabilities Age: Groups that tend to have higher levels of vulnerability, including children (those below 18), and seniors (those 65 and older)An overall percentile value is calculated for each feature based on the total proportion of the above indicators in each area. How are the variables combined?These indicators are divided into two main categories that we call Thematic Indices: Economic and Personal Characteristics. The two thematic indices are further divided into five sub-indices called Tier-2 Sub-Indices. Each Tier-2 Sub-Index contains 2-3 indicators. Indicators are the datasets used to measure vulnerability within each sub-index. The variables for each feature are re-scaled using the percentile normalization method, which converts them to the same scale using values between 0 to 100. The variables are then combined first into each of the five Tier-2 Sub-Indices, then the Thematic Indices, then the overall TEPI using the mean aggregation method and equal weighting. The resulting dataset is then divided into the five classes, where:High Vulnerability (80-100%): Representing the top classification, this category includes the highest 20% of regions that are the most socially vulnerable. These areas require the most focused attention. Moderate-High Vulnerability (60-80%): This upper-middle classification includes areas with higher levels of vulnerability compared to the median. While not the highest, these areas are more vulnerable than a majority of the dataset and should be considered for targeted interventions. Moderate Vulnerability (40-60%): Representing the middle or median quintile, this category includes areas of average vulnerability. These areas may show a balanced mix of high and low vulnerability. Detailed examination of specific indicators is recommended to understand the nuanced needs of these areas. Low-Moderate Vulnerability (20-40%): Falling into the lower-middle classification, this range includes areas that are less vulnerable than most but may still exhibit certain vulnerable characteristics. These areas typically have a mix of lower and higher indicators, with the lower values predominating. Low Vulnerability (0-20%): This category represents the bottom classification, encompassing the lowest 20% of data points. Areas in this range are the least vulnerable, making them the most resilient compared to all other features in the dataset.
e
International Social Survey Programme: Social Inequality I-IV - ISSP...
b2find.eudat.eu
Updated Aug 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). International Social Survey Programme: Social Inequality I-IV - ISSP 1987-1992-1999-2009 - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/6e0445b8-5c31-5026-97cc-26198850f081
Explore at:
Dataset updated
Aug 11, 2025
Description
A comprehensive overview on the contents, the structure and basiccoding rules of both data files can be found in the following guide: Guide for the ISSP ´Social Inequality´ cumulation of the years 1987,1992, 1999 and 2009 Attitudes to social inequality. Themes: Importance of social background and other factors asprerequisites for personal success in society (wealthy family,well-educated parents, good education, ambitions, natural ability, hardwork, knowing the right people, political connections, person´s raceand religion, the part of a country a person comes from, gender andpolitical beliefs); chances to increase personal standard of living(social mobility); corruption as criteria for social mobility;importance of differentiated payment; higher payment with acceptance ofincreased responsibility; higher payment as incentive for additionalqualification of workers; avoidability of inequality of society;increased income expectation as motivation for taking up studies; goodprofits for entrepreneurs as best prerequisite for increase in generalstandard of living; insufficient solidarity of the average populationas reason for the persistence of social inequalities; opinion about ownsalary: actual occupational earning is adequate; income differences aretoo large in the respondent´s country; responsibility of government toreduce income differences; government should provide chances for poorchildren to go to university; jobs for everyone who wants one;government should provide a decent living standard for the unemployedand spend less on benefits for poor people; demand for basic income forall; opinion on taxes for people with high incomes; judgement on totaltaxation for recipients of high, middle and low incomes; justificationof better medical supply and better education for richer people;perception of class conflicts between social groups in the country(poor and rich people, working class and middle class, unemployed andemployed people, management and workers, farmers and city people,people at the top of society and people at the bottom, young people andolder people); salary criteria (scale: job responsibility, years ofeducation and training, supervising others, needed support for familiyand children, quality of job performance or hard work at the job);feeling of a just payment; perceived and desired social structure ofcountry; self-placement within social structure of society; number ofbooks in the parental home in the respondent´s youth (culturalresources); self-assessment of social class; level of status ofrespondent´s job compared to father (social mobility); self-employment,employee of a private company or business or government, occupation(ILO, ISCO 1988), type of job of respondent´s father in therespondent´s youth; mother´s occupation (ILO, ISCO 1988) in therespondent´s youth; respondent´s type of job in first and current(last) job; self-employment of respondent´ first job or worked forsomeone else. Demograpy: sex; age; marital status; steady life partner; education ofrespondent: years of schooling and highest education level; currentemployment status; hours worked weekly; occupation (ILO, ISCO 1988);self-employment; supervising function at work; working-type: workingfor private or public sector or self-employed; if self-employed: numberof employees; trade union membership; highest education level of fatherand mother; education of spouse or partner: years of schooling andhighest education level; current employment status of spouse orpartner; occupation of spouse or partner (ILO, ISCO 1988);self-employment of spouse or partner; size of household; householdcomposition (children and adults); type of housing; party affiliation(left-right (derived from affiliation to a certain party); partyaffiliation (derived from question on left-right placement); partypreference; participation in last election; perceived position of partyvoted for on left-right-scale; attendance of religious services;religious main groups (derived); self-placement on a top-bottom scale;region. Additionally coded: several country variables; weighting factor.
🎓 Elite College Admissions
kaggle.com
Updated Jul 31, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
mexwell (2024). 🎓 Elite College Admissions [Dataset]. https://www.kaggle.com/datasets/mexwell/elite-college-admissions
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 31, 2024
Dataset provided by
Kaggle
Authors
mexwell
Description
We know that students at elite universities tend to be from high-income families, and that graduates are more likely to end up in high-status or high-income jobs. But very little public data has been available on university admissions practices. This dataset, collected by Opportunity Insights, gives extensive detail on college application and admission rates for 139 colleges and universities across the United States, including data on the incomes of students. How do admissions practices vary by institution, and are wealthy students overrepresented?

Motivation

Education equality is one of the most contested topics in society today. It can be defined and explored in many ways, from accessible education to disabled/low-income/rural students to the cross-generational influence of doctorate degrees and tenure track positions. One aspect of equality is the institutions students attend. Consider the “Ivy Plus” universities, which are all eight Ivy League schools plus MIT, Stanford, Duke, and Chicago. Although less than half of one percent of Americans attend Ivy-Plus colleges, they account for more than 10% of Fortune 500 CEOs, a quarter of U.S. Senators, half of all Rhodes scholars, and three-fourths of Supreme Court justices appointed in the last half-century.

A 2023 study (Chetty et al, 2023) tried to understand how these elite institutions affect educational equality:

Do highly selective private colleges amplify the persistence of privilege across generations by taking students from high-income families and helping them obtain high-status, high-paying leadership positions? Conversely, to what extent could such colleges diversify the socioeconomic backgrounds of society’s leaders by changing their admissions policies?

To answer these questions, they assembled a dataset documenting the admission and attendance rate for 13 different income bins for 139 selective universities around the country. They were able to access and link not only student SAT/ACT scores and high school grades, but also parents’ income through their tax records, students’ post-college graduate school enrollment or employment (including earnings, employers, and occupations), and also for some selected colleges, their internal admission ratings for each student. This dataset covers students in the entering classes of 2010–2015, or roughly 2.4 million domestic students.

They found that children from families in the top 1% (by income) are more than twice as likely to attend an Ivy-Plus college as those from middle-class families with comparable SAT/ACT scores, and two-thirds of this gap can be attributed to higher admission rates with similar scores, with the remaining third due to the differences in rates of application and matriculation (enrollment conditional on admission). This is not a shocking conclusion, but we can further explore elite college admissions by socioeconomic status to understand the differences between elite private colleges and public flagships admission practices, and to reflect on the privilege we have here and to envision what a fairer higher education system could look like.

Data

The data has been aggregated by university and by parental income level, grouped into 13 income brackets. The income brackets are grouped by percentile relative to the US national income distribution, so for instance the 75.0 bin represents parents whose incomes are between the 70th and 80th percentile. The top two bins overlap: the 99.4 bin represents parents between the 99 and 99.9th percentiles, while the 99.5 bin represents parents in the top 1%.

Each row represents students’ admission and matriculation outcomes from one income bracket at a given university. There are 139 colleges covered in this dataset.

The variables include an array of different college-level-income-binned estimates for things including attendance rate (both raw and reweighted by SAT/ACT scores), application rate, and relative attendance rate conditional on application, also with respect to specific test score bands for each college and in/out-of state. Colleges are categorized into six tiers: Ivy Plus, other elite schools (public and private), highly selective public/private, and selective public/private, with selectivity generally in descending order. It also notes whether a college is public and/or flagship, where “flagship” means public flagship universities. Furthermore, they also report the relative application rate for each income bin within specific test bands, which are 50-point bands that had the most attendees in each school tier/category.

Several values are reported in “test-score-reweighted” form. These values control for SAT score: they are calculated separately for each SAT score value, then averaged with weights based on the distribution of SAT scores at the institution.

Note that since private schools typically don’t differentiate between in-...
e
Nottingham Elites 1900-1950: Evaluating Participation in Civil Society -...
b2find.eudat.eu
Updated Oct 22, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). Nottingham Elites 1900-1950: Evaluating Participation in Civil Society - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/5e51b218-6de9-56e1-956f-709c103aadd9
Explore at:
Dataset updated
Oct 22, 2023
Area covered
Nottingham
Description
Abstract copyright UK Data Service and data collection copyright owner. With few dissenting voices, the historiography of twentieth-century British civil society has been relayed through a prism of continuing and escalating elite disengagement. Within a paradigm of declinism, academics, politicians, and social commentators have contrasted a nineteenth and early twentieth century past, offering a richness of social commitment, against a present characterized by lowering standards in urban governance and civic disengagement. Put simply, as we entered the twentieth century the right sorts of people were no longer volunteering. Yet the data for such claims is insubstantial for we lack detailed empirical studies of social trends of urban volunteering across the first fifty years of the twentieth century. This dataset fills that void. It offers details of those involved in local politics, who were magistrates or poor law guardians, or who helped manage or represent one of 34 voluntary associations serving one ‘typical’ large city - Nottingham - and the surrounding county between 1900 and 1950. The sample covers a range of voluntary activities from the smallest to the largest of charities and associations. Three quarters of people captured by the data set lived within the city boundary. The clear majority of those sampled were middle class, only 10 per cent being working class, and 1.5 per cent upper class. Within this middle class there were major disparities in wealth, income, status, lifestyle, and self-view. Broken down, about 29 per cent of the sample overall were upper middle class, 43 per cent middle middle class, and 17 per cent lower middle class. Middle-class numbers in Nottingham, at about 22.5 per cent of the population, were roughly comparable with other Northern or Midland industrial cities. Its occupational distribution also approximately mirrored that of England.

Integrated Agent-based Modelling and Simulation of Transportation Demand and...

zenodo.org

bin

Updated Jun 19, 2024

Facebook

Twitter

Click to copy link

Link copied

Cite

Yuan Liao; Yuan Liao; Çağlar Tozluoğlu; Çağlar Tozluoğlu; Kaniska Ghosh; Kaniska Ghosh; Swapnil Dhamal; Swapnil Dhamal; Frances Sprei; Frances Sprei; Sonia Yeh; Sonia Yeh (2024). Integrated Agent-based Modelling and Simulation of Transportation Demand and Mobility Patterns in Sweden [Dataset]. http://doi.org/10.5281/zenodo.10648078

Explore at:

binAvailable download formats

Unique identifier

https://doi.org/10.5281/zenodo.10648078

Dataset updated

Jun 19, 2024

Dataset provided by

Zenodohttp://zenodo.org/

Authors

Yuan Liao; Yuan Liao; Çağlar Tozluoğlu; Çağlar Tozluoğlu; Kaniska Ghosh; Kaniska Ghosh; Swapnil Dhamal; Swapnil Dhamal; Frances Sprei; Frances Sprei; Sonia Yeh; Sonia Yeh

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Area covered

Sweden

Description

About

The Synthetic Sweden Mobility (SySMo) model provides a simplified yet statistically realistic microscopic representation of the real population of Sweden. The agents in this synthetic population contain socioeconomic attributes, household characteristics, and corresponding activity plans for an average weekday. This agent-based modelling approach derives the transportation demand from the agents’ planned activities using various transport modes (e.g., car, public transport, bike, and walking).

This open data repository contains four datasets:

(1) Synthetic Agents,

(2) Activity Plans of the Agents,

(3) Travel Trajectories of the Agents, and

(4) Road Network (EPSG: 3006)

(OpenStreetMap data were retrieved on August 28, 2023, from https://download.geofabrik.de/europe.html, and GTFS data were retrieved on September 6, 2023 from https://samtrafiken.se/)

The database can serve as input to assess the potential impacts of new transportation technologies, infrastructure changes, and policy interventions on the mobility patterns of the Swedish population.

Methodology

This dataset contains statistically simulated 10.2 million agents representing the population of Sweden, their socio-economic characteristics and the activity plan for an average weekday. For preparing data for the MATSim simulation, we randomly divided all the agents into 10 batches. Each batch's agents are then simulated in MATSim using the multi-modal network combining road networks and public transit data in Sweden using the package pt2matsim (https://github.com/matsim-org/pt2matsim).

The agents' daily activity plans along with the road network serve as the primary inputs in the MATSim environment which ensures iterative replanning while aiming for a convergence on optimal activity plans for all the agents. Subsequently, the individual mobility trajectories of the agents from the MATSim simulation are retrieved.

The activity plans of the individual agents extracted from the MATSim simulation output data are then further processed. All agents with negative utility score and negative activity time corresponding to at least one activity are filtered out as the ‘infeasible’ agents. The dataset ‘Synthetic Agents’ contains all synthetic agents regardless of their ‘feasibility’ (0=excluded & 1=included in plans and trajectories). In the other datasets, only agents with feasible activity plans are included.

The simulation setup adheres to the MATSim 13.0 benchmark scenario, with slight adjustments. The strategy for replanning integrates BestScore (60%), TimeAllocationMutator (30%), and ReRoute (10%)— the percentages denote the proportion of agents utilizing these strategies. In each iteration of the simulation, the agents adopt these strategies to adjust their activity plans. The "BestScore" strategy retains the plan with the highest score from the previous iteration, selecting the most successful strategy an agent has employed up until that point. The "TimeAllocationMutator" modifies the end times of activities by introducing random shifts within a specified range, allowing for the exploration of different schedules. The "ReRoute" strategy enables agents to alter their current routes, potentially optimizing travel based on updated information or preferences. These strategies are detailed further in W. Axhausen et al. (2016) work, which provides comprehensive insights into their implementation and impact within the context of transport simulation modeling.

Data Description

(1) Synthetic Agents

This dataset contains all agents in Sweden and their socioeconomic characteristics.

The attribute ‘feasibility’ has two categories: feasible agents (73%), and infeasible agents (27%). Infeasible agents are agents with negative utility score and negative activity time corresponding to at least one activity.

File name: 1_syn_pop_all.parquet

Column	Description	Data type	Unit
PId	Agent ID	Integer	-
Deso	Zone code of Demographic statistical areas (DeSO)¹	String	-
kommun	Municipality code	Integer	-
marital	Marital Status (single/ couple/ child)	String	-
sex	Gender (0 = Male, 1 = Female)	Integer	-
age	Age	Integer	-
HId	A unique identifier for households	Integer	-
HHtype	Type of households (single/ couple/ other)	String	-
HHsize	Number of people living in the households	Integer	-
num_babies	Number of children less than six years old in the household	Integer	-
employment	Employment Status (0 = Not Employed, 1 = Employed)	Integer	-
studenthood	Studenthood Status (0 = Not Student, 1 = Student)	Integer	-
income_class	Income Class (0 = No Income, 1 = Low Income, 2 = Lower-middle Income, 3 = Upper-middle Income, 4 = High Income)	Integer	-
num_cars	Number of cars owned by an individual	Integer	-
HHcars	Number of cars in the household	Integer	-
feasibility	Status of the individual (1=feasible, 0=infeasible)	Integer	-

1 https://www.scb.se/vara-tjanster/oppna-data/oppna-geodata/deso--demografiska-statistikomraden/

(2) Activity Plans of the Agents

The dataset contains the car agents’ (agents that use cars on the simulated day) activity plans for a simulated average weekday.

File name: 2_plans_i.parquet, i = 0, 1, 2, ..., 8, 9. (10 files in total)

Column	Description	Data type	Unit
act_purpose	Activity purpose (work/ home/ school/ other)	String	-
PId	Agent ID	Integer	-
act_end	End time of activity (0:00:00 – 23:59:59)	String	hour:minute:seco nd
act_id	Activity index of each agent	Integer	-
mode	Transport mode to reach the activity location	String	-
POINT_X	Coordinate X of activity location (SWEREF99TM)	Float	metre
POINT_Y	Coordinate Y of activity location (SWEREF99TM)	Float	metre
dep_time	Departure time (0:00:00 – 23:59:59)	String	hour:minute:seco nd
score	Utility score of the simulation day as obtained from MATSim	Float	-
trav_time	Travel time to reach the activity location	String	hour:minute:seco nd
trav_time_min	Travel time in decimal minute	Float	minute
act_time	Activity duration in decimal minute	Float	minute
distance	Travel distance between the origin and the destination	Float	km
speed	Travel speed to reach the activity location	Float	km/h

(3) Travel Trajectories of the Agents

This dataset contains the driving trajectories of all the agents on the road network, and the public transit vehicles used by these agents, including buses, ferries, trams etc. The files are produced by MATSim simulations and organised into 10 *.parquet’ files (representing different batches of simulation) corresponding to each plan file.

File name: 3_events_i.parquet, i = 0, 1, 2, ..., 8, 9. (10 files in total)

Column	Description	Data type	Unit
time

I
India Proportion of People Living Below 50 Percent Of Median Income: %
ceicdata.com
Updated Mar 15, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com, India Proportion of People Living Below 50 Percent Of Median Income: % [Dataset]. https://www.ceicdata.com/en/india/social-poverty-and-inequality/proportion-of-people-living-below-50-percent-of-median-income-
Explore at:
Dataset updated
Mar 15, 2017
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Dec 1, 1987 - Dec 1, 2021
Area covered
India
Description
India Proportion of People Living Below 50 Percent Of Median Income: % data was reported at 9.800 % in 2021. This records a decrease from the previous number of 10.000 % for 2020. India Proportion of People Living Below 50 Percent Of Median Income: % data is updated yearly, averaging 6.200 % from Dec 1977 (Median) to 2021, with 14 observations. The data reached an all-time high of 10.300 % in 2019 and a record low of 5.100 % in 2004. India Proportion of People Living Below 50 Percent Of Median Income: % data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s India – Table IN.World Bank.WDI: Social: Poverty and Inequality. The percentage of people in the population who live in households whose per capita income or consumption is below half of the median income or consumption per capita. The median is measured at 2017 Purchasing Power Parity (PPP) using the Poverty and Inequality Platform (http://www.pip.worldbank.org). For some countries, medians are not reported due to grouped and/or confidential data. The reference year is the year in which the underlying household survey data was collected. In cases for which the data collection period bridged two calendar years, the first year in which data were collected is reported.;World Bank, Poverty and Inequality Platform. Data are based on primary household survey data obtained from government statistical agencies and World Bank country departments. Data for high-income economies are mostly from the Luxembourg Income Study database. For more information and methodology, please see http://pip.worldbank.org.;;The World Bank’s internationally comparable poverty monitoring database now draws on income or detailed consumption data from more than 2000 household surveys across 169 countries. See the Poverty and Inequality Platform (PIP) for details (www.pip.worldbank.org).
e
International Social Survey Programme: Social Inequality IV - ISSP 2009 -...
b2find.eudat.eu
Updated Dec 9, 2011
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2011). International Social Survey Programme: Social Inequality IV - ISSP 2009 - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/51028182-bd0e-5fb6-8417-568be8384b67
Explore at:
Dataset updated
Dec 9, 2011
Description
Soziale Ungleichheit. Themen: Soziale Herkunft, Verdienst, Diskriminierung, Korruption und gute Beziehungen als Voraussetzung für Erfolg in der Gesellschaft (wohlhabende Familie, gut ausgebildete Eltern, gute Ausbildung, Ehrgeiz, harte Arbeit, die richtigen Leute kennen, politische Verbindungen, Bestechungen, Rasse und Religion bzw. Geschlecht einer Person); Meinung zur Angleichung der Bildungschancen im Land (Korruption als Mittel für soziale Mobilität, nur Studenten aus den besten Schulen haben gute Chancen, eine Hochschulausbildung zu erhalten, nur Reiche können sich die Kosten für den Besuch einer Universität leisten, gleiche Chancen für alle für den Hochschulzugang, unabhängig von Geschlecht, ethnischer Zugehörigkeit oder sozialer Herkunft); Meinung zum eigenen Gehalt: ausreichendes Einkommen, Einschätzung des tatsächlichen und des angemessenen Einkommens für ausgewählte Berufsgruppen: Arzt, Vorsitzender eines großen nationalen Unternehmens, Verkäufer, Hilfsarbeiter in einer Fabrik, Minister in der nationalen Regierung; zu große Einkommensunterschiede im eigenen Land; Verantwortlichkeit der Regierung zur Verringerung von Einkommensunterschieden; Forderung nach staatlich garantiertem angemessenen Lebensstandard für Arbeitslose anstelle von Sozialleistungen für Arme; Forderung nach höheren Steuern für Menschen mit hohem Einkommen; Einstellung zu Steuern für Menschen mit hohem Einkommen; Rechtfertigung von besserer medizinischer Versorgung und Bildung für Menschen mit höherem Einkommen; Wahrnehmung von Klassenkonflikten zwischen sozialen Gruppen in dem Land (Arm und Reich, Arbeiterklasse und Mittelschicht, Arbeitgeber und Arbeitnehmer, Menschen an der Spitze der Gesellschaft und Menschen am unteren Rand); Selbsteinschätzung der Herkunftsfamilie des Befragten auf einer Oben-Unten-Skala; Vergleich der persönlichen sozialen Lage mit der des Vaters (soziale Mobilität); Gehaltkriterien (Skala: Verantwortung, Bildung, benötigte Unterstützung für Familien und Kinder, Qualität der Arbeitsleistung oder harte Arbeit); Gefühl von gerechter Bezahlung; Charakterisierung des tatsächlichen und des gewünschten sozialen Systems des Landes, gemessen an der Einstufung auf einem Pyramidendiagramm (Bild der Gesellschaft). Demographie: Geschlecht; Alter; Familienstand; Zusammenleben mit einem Partner; Jahre der Schulbildung; höchster Bildungsabschluss; länderspezifischer Bildungsgrad; derzeitiger Erwerbsstatus (Befragter und Partner); Wochenarbeitszeit, Beruf (ISCO 88) (Befragter und Partner); Vorgesetztenfunktion bei der Arbeit, Erwerbstätigkeit im privaten oder öffentliche Sektor oder Selbständigkeit (Befragte und Partner); Selbständige wurden gefragt: Zahl der Mitarbeiter; Mitgliedschaft in einer Gewerkschaft; Einkommen des Befragten (länderspezifisch); Familieneinkommen (länderspezifisch), Haushaltsgröße; Haushaltszusammensetzung, Parteipräferenz (links-rechts); länderspezifische Parteipräferenz; Wahlbeteiligung an der letzten Wahl; Konfession; religiöse Hauptgruppe; Kirchgangshäufigkeit; Selbsteinschätzung auf einer Oben-Unten-Skala; Region (länderspezifisch), Ortsgröße (länderspezifisch); Urbanisierungsgrad; Herkunftsland oder ethnische Gruppenzugehörigkeit; Erwerbsstatus und Beruf von Vater und Mutter während der Jugend des Befragten (ISCO 88); Anzahl der Bücher im Elternhaus während der Jugend der Befragten (kulturelle Ressourcen); berufliche Stellung im ersten und derzeitigen Job (ISCO 88 und Arbeitstyp); Selbsteinschätzung der sozialen Klasse; geschätzter Betrag des Familienvermögens (Geld und Vermögenswerte); Arbeitsorientierung: Selbst-Charakterisierung derzeit und in der Jugend der Befragten bezüglich seiner Leistung am Arbeitsplatz bzw. in der Schule. Zusätzlich verkodet wurde: Art der Datenerhebung; Gewichtungsfaktor; case substitution. Social inequality. Themes: Importance of social background, merit, discrimination, corruption and good relations as prerequisites for success in society (wealthy family, well-educated parents, good education, ambitions, hard working, knowing the right people, political connections, giving bribes, person´s race and religion, gender); attitude towards equality of educational opportunity in one´s country (corruption as criteria for social mobility, only students from the best secondary schools have a good chance to obtain a university education, only rich people can afford the costs of attending university, same chances for everyone to enter university, regardless of gender, ethnicity or social background); opinion about own salary: actual occupational earning is adequate; estimation of actual and reasonable earnings for occupational groups: doctor, chairman of a large national corporation, shop assistant, unskilled worker in a factory, cabinet minister in the national government; income differences are too large in the respondent´s country; responsibility of government to reduce income differences; government should provide a decent standard of living for the unemployed and spend less on benefits for poor people; demand for higher taxes for people with high incomes; opinion on taxes for people with high income; justification of better medical supply and better education for people with higher income; perception of class conflicts between social groups in the country (poor and rich people, working class and middle class, management and workers, people at the top of society and people at the bottom); self-assessment and assessment of the family the respondent grew up in on a top-bottom-scale; social position compared to father (social mobility); salary criteria (scale: responsibility, education, needed support for family and children, quality of job performance or hard work at the job); feeling of a just payment; characterisation of the actual and the desired social system of the country, measured by classification on pyramid diagrams (image of society). Demography: sex; age; marital status; steady life partner; years of schooling; highest education level; country specific education and degree; current employment status (respondent and partner); hours worked weekly; occupation (ISCO 1988) (respondent and partner); supervising function at work; working for private or public sector or self-employed (respondent and partner); if self-employed: number of employees; trade union membership; earnings of respondent (country specific); family income (country specific); size of household; household composition; party affiliation (left-right); country specific party affiliation; participation in last election; religious denomination; religious main groups; attendance of religious services; self-placement on a top-bottom scale; region (country specific); size of community (country specific); type of community: urban-rural area; country of origin or ethnic group affiliation; occupation status and profession of respondent´s father and mother during the youth of the respondent (ISCO 88); number of books in the parental home during the youth of the respondent (cultural resources); occupational status and profession in the first job and the current job (ISCO 88 and working type); self-assessment of the social class; estimated amount of family wealth (monetary value of assets); work orientation: self-characterisation at this time and in the youth of the respondent concerning his performance at work respectively at school. Additionally coded: administrative mode of data-collection; weighting factor; case substitution.
e
Attendance of the FanZone Lille of the Euro 2016 according to the living...
data.europa.eu
ogc api features
Updated Jun 4, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Métropole Européenne de Lille (2024). Attendance of the FanZone Lille of the Euro 2016 according to the living situation [Dataset]. https://data.europa.eu/data/datasets/frequentation-fanzone-lille-euro-2016-selon-situation-vie/embed
Explore at:
ogc api featuresAvailable download formats
Dataset updated
Jun 4, 2024
Dataset authored and provided by
Métropole Européenne de Lille
License
https://www.etalab.gouv.fr/licence-ouverte-open-licencehttps://www.etalab.gouv.fr/licence-ouverte-open-licence
Description
The data is based on mobile streams on Orange relay antennas.

** Attendance at the various MEL Euro 2016 venues:**

FanZone of Lille

Indoor stadium

Outdoor Stadium

All the different zones

These data are derived from the geolocation of mobile flows analysed by Orange, and were aimed at: the adequacy between population concentrations and the security means deployed at Euro 2016.

_ _

_ _

The different socio-professional classes according to this dataset (SOURCE: ORANGE):

** **

Dynamic Urban

Young executives in the city center, overequipped in NICT

** **

Easy family urban

High-income families, living in the city center and having a high consumption of NICT

** **

Urban middle class

Young people, students, with average income

** **

Popular

Social diversity, low-income families

** **

Unfavoured urban

Low-income, NICT-refractory families or low-resource clients

** **

Peri-urban growing

Intermediate CSP families, social mix

** **

Easy family member

Families with good incomes, living in recent pavilions, equipped with NICT

** **

Dynamic rural

Intermediate CSPs, growing

** **

Rural worker

CSP modest, workers with modest incomes, living in old pavilions, little equipped with NICT

** **

Traditional rural

Older population, living in old pavilions, little equipped with NICT

** **

Secondary residence

Older population, holiday location

** **

** **

GLOSSARY FOR ALL EURO 2016 DATA GAMES (SOURCE: ORANGE)

** **

Study area

Area covered by the Orange network, where data from the mobile network was collected.

** **

Residents

Persons whose billing address is that of the department (North).

** **

Resident excursionists

Residents of the department who did not sleep in the study area in the evening and the day before the study day

** **

French Tourists

People billed in France but not in the department

** **

French tourist excursionists

French tourists who did not sleep in the study area in the evening and the day before the study day

** **

Foreign Tourists

Persons with a foreign motive

** **

Foreign tourist excursionists

Foreign tourists who did not sleep in the study area in the evening and the day before the study day

** **

Night places

Breakdown of the department into several zones, on which are counted the nights spent on the department

** **

Visitors

Persons present at least 1 hour between 00:00 and 00:00 in the study area

** **

The "#" in the data

When the data is less than or equal to 20 people, the CNIL does not allow to enter the value. Thus, between 1 and 20 people, the data is replaced by "#". When the data is "0", it means that no mobile has been captured.
H
Milwaukee Area Renters Study (MARS)
dataverse.harvard.edu
search.dataone.org
Updated Nov 6, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Matthew Desmond (2016). Milwaukee Area Renters Study (MARS) [Dataset]. http://doi.org/10.7910/DVN/BLUU3U
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/BLUU3U
Dataset updated
Nov 6, 2016
Dataset provided by
Harvard Dataverse
Authors
Matthew Desmond
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
Milwaukee
Description
Designed to collect new data related to housing, poverty, and urban life, the Milwaukee Area Renters Study (MARS) is an in-person survey of 1,086 households in Milwaukee. One person per household, usually an adult leaseholder, was interviewed. The MARS instrument was comprised of more than 250 unique items and administered in-person in English and Spanish. The University of Wisconsin Survey Center supervised data collection, which took place between 2009 and 2011. The MARS sample was limited to renters. Nationwide, the majority of low-income families live in rental housing, and most receive no federal housing assistance. Except in exceptional cities with very high housing costs, the rental population is comprised of some upper- and middle-class households who prefer renting and most of the cities’ low-income households who are excluded both from public housing and homeownership. To focus on urban renters in the private market, then, is to focus on the lived experience of most low-income families living in cities. MARS was funded by the John D. and Catherine T. MacArthur Foundation, through its “How Housing Matters” initiative.
i
Richest Zip Codes in Missouri
incomebyzipcode.com
Updated Dec 18, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cubit Planning, Inc. (2024). Richest Zip Codes in Missouri [Dataset]. https://www.incomebyzipcode.com/missouri
Explore at:
Dataset updated
Dec 18, 2024
Dataset authored and provided by
Cubit Planning, Inc.
License
https://www.incomebyzipcode.com/terms#TERMShttps://www.incomebyzipcode.com/terms#TERMS
Area covered
Missouri
Description
A dataset listing the richest zip codes in Missouri per the most current US Census data, including information on rank and average income.
e
The motives and methods of middle-class international property investors -...
b2find.eudat.eu
Updated Mar 30, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2014). The motives and methods of middle-class international property investors - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/d995594f-daf7-59cf-939e-0cf148f2127d
Explore at:
Dataset updated
Mar 30, 2014
Description
This data collection consists of 18 interview transcripts meant to explore the rationales and methods by which investors in Hong Kong buy properties in the UK. The life and impact of the residential choices of the 'super rich' has been a major strand in research by the research team. This work advanced the proposition that the upper-tier of income groups living in cities tend to exploit particular forms of service provision (such as education, cultural life and personal services), are largely distanced from the mundane flow of social life in urban areas and tend to be withdrawn from the civic life of cities more generally. Some of this work is underpinned by the literature on, for example, gated communities, but it has surprisingly been under-used as the guiding framework for close empirical work in affluent neighbourhoods, perhaps largely as a result of the perceived difficulty of working with such individuals. This project will allow us to generate insights into how super-rich neighbourhoods operate, how people come to live there and the social and economic tensions and trade-offs that exist as such processes are allowed to run. As many people question the role and value of wealth and identify inequality as a growing social problem this research will feed into public conversations and policymaker concerns about how socially vital cities can be maintained when capital investment may undermine such objectives on one level (the creation of neighbourhoods that are both exclusive and often 'abandoned' for large parts of the year), while potentially fulfilling broader ambitions at others (over tax receipts for example).Social research has tended not to focus on the super-rich, largely because they are hard to locate, and even harder to collaborate with in research. In this project we seek to address these concerns by focusing extensive research effort on the question of where and how the super-rich live and invest in the property markets of the cities of Hong Kong and London. We see these cities as exemplary in assisting in the construction of further insights and knowledge in how the super-rich seek residential investment opportunities, how they live there when they are 'at home' in such residences and how these patterns of investment shape the social, political and economic life of these cities more broadly. Given that the super-rich make such decisions on the basis of tax incentives and the attraction of major cultural infrastructure (such as galleries and theatre) we have proposed a program of research capable of offering an inside account of the practices that go to make-up these investment patterns including processes of searching for suitable property, its financing, the kinds of property deemed to be suitable and an analysis of how estate agents and city authorities seek to capitalise and retain the potentially highly mobile investment by the super-rich. In economic terms the life and functioning of rich neighbourhood spaces appears intuitively important. For example, attractive and safe spaces for captains of industry, senior figures in political and non-government organizations are often regarded as major markers of urban vitality and the foundation of social networks that may make-up the broader glue of civic and political society. Yet we know very little about how such neighbourhoods operate, who they attract and how they are linked to other cities and their neighbourhoods globally. Our aim in this research is to grapple with what might be described as the 'problem' of these super-rich neighbourhoods - sometime called the 'alpha territory' - and undertake research that will help us to understand more about the advantages and disadvantages of these kinds of property investment. The research was carried out using semi-structured interviews and participant observation at property fairs and development sites in Hong Kong and different cities in the UK. Moreover, semi-structured interviews were conducted to explore the rationales and methods by which investors in Hong Kong buy properties in the UK. Participants were recruited using searches for relevant key actors as well as accessing personal and professional networks that enabled snowballing techniques to elicit further contacts. Interviews were conducted with individual investors, local government officials, planning officers, inward investment agencies, city government officials and estate agents. Interviews were conducted in both English and Cantonese.
i
Richest Zip Codes in New Jersey
incomebyzipcode.com
Updated Dec 18, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cubit Planning, Inc. (2024). Richest Zip Codes in New Jersey [Dataset]. https://www.incomebyzipcode.com/newjersey
Explore at:
Dataset updated
Dec 18, 2024
Dataset authored and provided by
Cubit Planning, Inc.
License
https://www.incomebyzipcode.com/terms#TERMShttps://www.incomebyzipcode.com/terms#TERMS
Area covered
New Jersey
Description
A dataset listing the richest zip codes in New Jersey per the most current US Census data, including information on rank and average income.
i
Richest Zip Codes in West Virginia
incomebyzipcode.com
Updated Dec 18, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cubit Planning, Inc. (2024). Richest Zip Codes in West Virginia [Dataset]. https://www.incomebyzipcode.com/westvirginia
Explore at:
Dataset updated
Dec 18, 2024
Dataset authored and provided by
Cubit Planning, Inc.
License
https://www.incomebyzipcode.com/terms#TERMShttps://www.incomebyzipcode.com/terms#TERMS
Area covered
West Virginia
Description
A dataset listing the richest zip codes in West Virginia per the most current US Census data, including information on rank and average income.

Facebook

Twitter

Click to copy link

Link copied

Cite

Government of Canada, Statistics Canada (2025). Income of individuals by age group, sex and income source, Canada, provinces and selected census metropolitan areas [Dataset]. http://doi.org/10.25318/1110023901-eng

Income of individuals by age group, sex and income source, Canada, provinces and selected census metropolitan areas

1110023901

Explore at:

Unique identifier

https://doi.org/10.25318/1110023901-eng

Dataset updated

May 1, 2025

Dataset provided by

Statistics Canadahttps://statcan.gc.ca/en

Area covered

Canada

Description

Income of individuals by age group, sex and income source, Canada, provinces and selected census metropolitan areas, annual.

Clear search

Close search

Google apps

Main menu

Domain	Number of Annotated Sentences	% of All Sentences	Mean Number of Annotated Sentences per Document
Communication & Cognition	6033	17.2%

Income of individuals by age group, sex and income source, Canada, provinces...

Single-earner and dual-earner census families by number of children

High income tax filers in Canada

Tucson Equity Priority Index (TEPI): Ward 6 Census Block Groups

Income Distribution by Quintile: Mean Household Income in Winchester, VA //...

About this dataset

Content

Inspiration

Recommended for further research

A Gold Standard Corpus for Activity Information (GoSCAI)

A Gold Standard Corpus for Activity Information

EXECUTIVE SUMMARY

CURATION RATIONALE

LANGUAGE VARIETIES

LANGUAGE USER DEMOGRAPHIC

ANNOTATOR DEMOGRAPHIC

LINGUISTIC SITUATION AND TEXT CHARACTERISTICS

PREPROCESSING AND DATA FORMATTING

CAPTURE QUALITY

LIMITATIONS

METADATA

Tucson Equity Priority Index (TEPI): Ward 2 Census Block Groups

Tucson Equity Priority Index (TEPI): Pima County Block Groups

International Social Survey Programme: Social Inequality I-IV - ISSP...

🎓 Elite College Admissions

Motivation

Data

Nottingham Elites 1900-1950: Evaluating Participation in Civil Society -...

Integrated Agent-based Modelling and Simulation of Transportation Demand and...

About

Methodology

Data Description

(1) Synthetic Agents

(2) Activity Plans of the Agents

(3) Travel Trajectories of the Agents

India Proportion of People Living Below 50 Percent Of Median Income: %

International Social Survey Programme: Social Inequality IV - ISSP 2009 -...

Attendance of the FanZone Lille of the Euro 2016 according to the living...

Milwaukee Area Renters Study (MARS)

Richest Zip Codes in Missouri

The motives and methods of middle-class international property investors -...

Richest Zip Codes in New Jersey

Richest Zip Codes in West Virginia

Income of individuals by age group, sex and income source, Canada, provinces and selected census metropolitan areasSee More Versions

1110023901

Income of individuals by age group, sex and income source, Canada, provinces and selected census metropolitan areas