92 datasets found

Percentage of Population within 1 5 & 10km Coastal Buffers
pacificdata.org
csv, gpkg +1
Updated Aug 12, 2019
Cite
SPC Statistics for Development Division (SDD) (2019). Percentage of Population within 1 5 & 10km Coastal Buffers [Dataset]. https://pacificdata.org/data/dataset/percentage-of-population-within-1-5-10km-coastal-buffers
Explore at:
Available download formats: gpkg (278528), zipped shapefile (146506), csv (846)
Dataset updated
Aug 12, 2019
Dataset provided by
SPC Statistics for Development Division (SDD)
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
A collaborative project between SPC, the World Fish Centre and the University of Wollongong has produced the first detailed population estimates of people living close to the coast in the 22 Pacific Island Countries and Territories (PICTs). These estimates are stratified into 1, 5, and 10km zones. More information about this dataset: https://sdd.spc.int/mapping-coastal
Poverty Rate
data.ccrpc.org
data.cuuats.cloud.ccrpc.org
csv
Updated Oct 17, 2024
Cite
Champaign County Regional Planning Commission (2024). Poverty Rate [Dataset]. https://data.ccrpc.org/dataset/poverty-rate
Explore at:
Available download formats: csv (393)
Dataset updated
Oct 17, 2024
Dataset provided by
Champaign County Regional Planning Commission
Description
This poverty rate data shows what percentage of the measured population* falls below the poverty line. Poverty is closely related to income: different “poverty thresholds” are in place for different sizes and types of household. A family or individual is considered to be below the poverty line if that family or individual’s income falls below their relevant poverty threshold. For more information on how poverty is measured by the U.S. Census Bureau (the source for this indicator’s data), visit the U.S. Census Bureau’s poverty webpage.

The poverty rate is an important piece of information when evaluating an area’s economic health and well-being. The poverty rate can also be illustrative when considered in the contexts of other indicators and categories. As a piece of data, it is too important and too useful to omit from any indicator set.

The poverty rate for all individuals in the measured population in Champaign County has hovered around 20% since 2005. However, it reached its lowest rate in 2021 at 14.9%, and its second lowest rate in 2023 at 16.3%. Although the American Community Survey (ACS) data shows fluctuations between years, given their margins of error, none of the differences between consecutive years’ estimates are statistically significant, making it impossible to identify a trend.

Poverty rate data was sourced from the U.S. Census Bureau’s American Community Survey 1-Year Estimates, which are released annually.

As with any datasets that are estimates rather than exact counts, it is important to take into account the margins of error (listed in the column beside each figure) when drawing conclusions from the data.
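
As an illustration of taking margins of error into account, the sketch below applies a commonly used two-estimate significance test. It assumes the margins of error are published at the 90% confidence level (z = 1.645), as is standard for ACS tables. The two rates (14.9% and 16.3%) come from the text above; the margins of error passed in are hypothetical placeholders, not values from the dataset.

```python
import math

Z_90 = 1.645  # assumption: margins of error are given at the 90% confidence level


def is_significant_difference(est_a, moe_a, est_b, moe_b, z=Z_90):
    """Return True if two estimates differ significantly at the level implied by z."""
    # Convert each margin of error back to a standard error.
    se_a = moe_a / z
    se_b = moe_b / z
    # Standard error of the difference between two independent estimates.
    se_diff = math.sqrt(se_a ** 2 + se_b ** 2)
    if se_diff == 0:
        return est_a != est_b
    return abs(est_a - est_b) / se_diff > z


# Rates are from the description; the margins of error (2.0 and 2.1) are hypothetical.
print(is_significant_difference(14.9, 2.0, 16.3, 2.1))
```

With these placeholder margins the difference of 1.4 percentage points is smaller than the combined uncertainty, so the function reports no significant difference. Real conclusions require the margins of error listed in the dataset itself.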

Due to the impact of the COVID-19 pandemic, instead of providing the standard 1-year data products, the Census Bureau released experimental estimates from the 1-year data in 2020. This includes a limited number of data tables for the nation, states, and the District of Columbia. The Census Bureau states that the 2020 ACS 1-year experimental tables use an experimental estimation methodology and should not be compared with other ACS data. For these reasons, and because data is not available for Champaign County, no data for 2020 is included in this Indicator.

For interested data users, the 2020 ACS 1-Year Experimental data release includes a dataset on Poverty Status in the Past 12 Months by Age.

*According to the U.S. Census Bureau document “How Poverty is Calculated in the ACS,” poverty status is calculated for everyone but those in the following groups: “people living in institutional group quarters (such as prisons or nursing homes), people in military barracks, people in college dormitories, living situations without conventional housing, and unrelated individuals under 15 years old.”

Sources:
U.S. Census Bureau; American Community Survey, 2023 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using data.census.gov; (17 October 2024).
U.S. Census Bureau; American Community Survey, 2022 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using data.census.gov; (25 September 2023).
U.S. Census Bureau; American Community Survey, 2021 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using data.census.gov; (16 September 2022).
U.S. Census Bureau; American Community Survey, 2019 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using data.census.gov; (8 June 2021).
U.S. Census Bureau; American Community Survey, 2018 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using data.census.gov; (8 June 2021).
U.S. Census Bureau; American Community Survey, 2017 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (13 September 2018).
U.S. Census Bureau; American Community Survey, 2016 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (14 September 2017).
U.S. Census Bureau; American Community Survey, 2015 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (19 September 2016).
U.S. Census Bureau; American Community Survey, 2014 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (16 March 2016).
U.S. Census Bureau; American Community Survey, 2013 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (16 March 2016).
U.S. Census Bureau; American Community Survey, 2012 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (16 March 2016).
U.S. Census Bureau; American Community Survey, 2011 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (16 March 2016).
U.S. Census Bureau; American Community Survey, 2010 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (16 March 2016).
U.S. Census Bureau; American Community Survey, 2009 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (16 March 2016).
U.S. Census Bureau; American Community Survey, 2008 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (16 March 2016).
U.S. Census Bureau; American Community Survey, 2007 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (16 March 2016).
U.S. Census Bureau; American Community Survey, 2006 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (16 March 2016).
U.S. Census Bureau; American Community Survey, 2005 American Community Survey 1-Year Estimates, Table S1701; generated by CCRPC staff; using American FactFinder; (16 March 2016).
2022 American Community Survey: 1-Year Estimates - Public Use Microdata...
catalog.data.gov
Updated Oct 20, 2023
+ more versions
Cite
U.S. Census Bureau (2023). 2022 American Community Survey: 1-Year Estimates - Public Use Microdata Sample [Dataset]. https://catalog.data.gov/dataset/2022-american-community-survey-1-year-estimates-public-use-microdata-sample
Dataset updated
Oct 20, 2023
Dataset provided by
United States Census Bureau http://census.gov/
Description
The American Community Survey (ACS) Public Use Microdata Sample (PUMS) contains a sample of responses to the ACS. The ACS PUMS dataset includes variables for nearly every question on the survey, as well as many new variables that were derived after the fact from multiple survey responses (such as poverty status). Each record in the file represents a single person, or, in the household-level dataset, a single housing unit. In the person-level file, individuals are organized into households, making possible the study of people within the contexts of their families and other household members. Individuals living in Group Quarters, such as nursing facilities or college facilities, are also included on the person file. ACS PUMS data are available at the nation, state, and Public Use Microdata Area (PUMA) levels. PUMAs are special non-overlapping areas that partition each state into contiguous geographic units containing roughly 100,000 people each. ACS PUMS files for an individual year, such as 2022, contain data on approximately one percent of the United States population.
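
To make the record layout described above concrete, here is a minimal sketch of how person-level records can be grouped into households. The field names (`household_id`, `person_number`, `age`) and all values are hypothetical and chosen for readability; they are not the actual PUMS column names.

```python
from collections import defaultdict

# Hypothetical person-level records; field names and values are illustrative only.
person_records = [
    {"household_id": "H001", "person_number": 1, "age": 41},
    {"household_id": "H001", "person_number": 2, "age": 39},
    {"household_id": "H002", "person_number": 1, "age": 67},
]


def group_by_household(records):
    """Collect person records under the household identifier they share."""
    households = defaultdict(list)
    for record in records:
        households[record["household_id"]].append(record)
    return dict(households)


households = group_by_household(person_records)
for household_id, members in households.items():
    print(household_id, len(members))
```

Grouping on a shared household identifier is what makes it possible to study each person in the context of the other members of the same household.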
Austin, TX annual income distribution by work experience and gender dataset...
neilsberg.com
csv, json
Updated Jan 9, 2024
+ more versions
Cite
Neilsberg Research (2024). Austin, TX annual income distribution by work experience and gender dataset (Number of individuals ages 15+ with income, 2022) [Dataset]. https://www.neilsberg.com/research/datasets/23642d83-981b-11ee-99cf-3860777c1fe6/
Explore at:
Available download formats: json, csv
Dataset updated
Jan 9, 2024
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Austin, Texas
Variables measured
Income for Male Population, Income for Female Population, Income for Male Population working full time, Income for Male Population working part time, Income for Female Population working full time, Income for Female Population working part time, Number of males working full time for a given income bracket, Number of males working part time for a given income bracket, Number of females working full time for a given income bracket, Number of females working part time for a given income bracket
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2022 1-Year Estimates. To portray the number of individuals of both genders (male and female) within each income bracket, we conducted an initial analysis and categorization of the American Community Survey data. Households are categorized, and median incomes are reported based on the self-identified gender of the head of the household. For additional information about these estimations, please contact us via email at research@neilsberg.com
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset presents a detailed breakdown of the count of individuals within distinct income brackets, categorized by gender (men and women) and employment type, full-time (FT) and part-time (PT), offering insights into the income landscape within Austin. The dataset can be used to examine gender-based income distribution within the Austin population, aiding data analysis and decision-making.

Key observations

Employment patterns: Within Austin, among individuals aged 15 years and older with income, there were 396.34 thousand men and 357.85 thousand women in the workforce. Among them, 264.47 thousand men were engaged in full-time, year-round employment, while 197,239 women were in full-time, year-round roles.

Annual income under $24,999: Of the male population working full-time, 7.12% fell within the income range of under $24,999, while 7.23% of the female population working full-time was represented in the same income bracket.

Annual income above $100,000: 38.97% of men in full-time roles earned incomes exceeding $100,000, while 25.99% of women in full-time positions earned within this income bracket.

Refer to the research insights for more key observations on more income brackets (annual income under $24,999, between $25,000 and $49,999, between $50,000 and $74,999, between $75,000 and $99,999, and above $100,000) and employment types (full-time year-round and part-time).

Austin, TX gender and employment-based income distribution analysis (Ages 15+): https://i.neilsberg.com/ch/austin-tx-income-distribution-by-gender-and-employment-type.jpeg

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2022 1-Year Estimates.

Income brackets:

$1 to $2,499 or loss

$2,500 to $4,999

$5,000 to $7,499

$7,500 to $9,999

$10,000 to $12,499

$12,500 to $14,999

$15,000 to $17,499

$17,500 to $19,999

$20,000 to $22,499

$22,500 to $24,999

$25,000 to $29,999

$30,000 to $34,999

$35,000 to $39,999

$40,000 to $44,999

$45,000 to $49,999

$50,000 to $54,999

$55,000 to $64,999

$65,000 to $74,999

$75,000 to $99,999

$100,000 or more

Variables / Data Columns

Income Bracket: This column lists 20 income brackets ranging from $1 to $100,000+.

Full-Time Males: The count of males employed full-time year-round and earning within a specified income bracket

Part-Time Males: The count of males employed part-time and earning within a specified income bracket

Full-Time Females: The count of females employed full-time year-round and earning within a specified income bracket

Part-Time Females: The count of females employed part-time and earning within a specified income bracket
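
The 20 fine-grained brackets can be rolled up into the five broad ranges used in the key observations. The sketch below shows one way to do that and to compute percentage shares for a single column. The counts are hypothetical placeholders rather than values from the dataset, and the function names are illustrative.

```python
# Lower bounds (in dollars) of the 20 income brackets listed in this dataset.
BRACKET_LOWER_BOUNDS = [
    1, 2500, 5000, 7500, 10000, 12500, 15000, 17500, 20000, 22500,
    25000, 30000, 35000, 40000, 45000, 50000, 55000, 65000, 75000, 100000,
]

# Broad ranges from the key observations: (label, lower bound in dollars).
BROAD_RANGES = [
    ("under $24,999", 1),
    ("$25,000 to $49,999", 25000),
    ("$50,000 to $74,999", 50000),
    ("$75,000 to $99,999", 75000),
    ("$100,000 or more", 100000),
]


def broad_range_for(lower_bound):
    """Return the label of the broad range that contains a bracket's lower bound."""
    label = BROAD_RANGES[0][0]
    for name, start in BROAD_RANGES:
        if lower_bound >= start:
            label = name
    return label


def broad_shares(counts):
    """Percent share per broad range for 20 counts aligned with BRACKET_LOWER_BOUNDS."""
    if len(counts) != len(BRACKET_LOWER_BOUNDS):
        raise ValueError("expected one count per income bracket")
    totals = {name: 0 for name, _ in BROAD_RANGES}
    for lower, count in zip(BRACKET_LOWER_BOUNDS, counts):
        totals[broad_range_for(lower)] += count
    grand_total = sum(totals.values())
    if grand_total == 0:
        raise ValueError("counts sum to zero")
    return {name: round(100 * value / grand_total, 2) for name, value in totals.items()}


# Hypothetical counts for one column (for example, Full-Time Males); not real data.
example_counts = [10] * 10 + [40] * 5 + [50] * 3 + [100, 250]
shares = broad_shares(example_counts)
print(shares)
```

Because the underlying figures are estimates, any shares computed this way inherit the sampling variability of the source counts.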

Employment type classifications include:

Full-time, year-round: A full-time, year-round worker is a person who worked full time (35 or more hours per week) and 50 or more weeks during the previous calendar year.

Part-time: A part-time worker is a person who worked less than 35 hours per week during the previous calendar year.
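
The two definitions above can be written as a small rule. This is a sketch of the stated thresholds only. The text does not say how a person with full-time hours but fewer than 50 weeks is classified, so the sketch returns a separate "other" label for that case as an assumption.

```python
def classify_worker(hours_per_week, weeks_worked):
    """Classify a worker using the thresholds stated in the definitions."""
    if hours_per_week < 35:
        return "part-time"
    if weeks_worked >= 50:
        return "full-time, year-round"
    # Assumption: full-time hours with fewer than 50 weeks is not covered by the text.
    return "other"


print(classify_worker(40, 52))
print(classify_worker(20, 52))
print(classify_worker(40, 30))
```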

Good to know

Margin of Error

Data in the dataset are based on estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presenting these estimates in your research.

Custom data

If you need custom data for a research project, report, or presentation, you can contact our research staff at research@neilsberg.com to assess the feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

The Neilsberg Research team curates, analyzes, and publishes demographic and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights are made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Austin median household income by gender.
Geonames - All Cities with a population > 1000
public.opendatasoft.com
data.smartidf.services
+2 more
csv, excel, geojson +1
Updated Mar 10, 2024
+ more versions
Cite
(2024). Geonames - All Cities with a population > 1000 [Dataset]. https://public.opendatasoft.com/explore/dataset/geonames-all-cities-with-a-population-1000/
Explore at:
Available download formats: csv, json, geojson, excel
Dataset updated
Mar 10, 2024
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
All cities with a population > 1000 or seats of administrative divisions (ca. 80,000)

Sources and Contributions

Sources: GeoNames aggregates over a hundred different data sources.
Ambassadors: GeoNames Ambassadors help in many countries.
Wiki: A wiki allows users to view the data, quickly fix errors, and add missing places.
Donations and Sponsoring: Costs for running GeoNames are covered by donations and sponsoring.

Enrichment: add country name
Global Startup Success Dataset
kaggle.com
Updated Mar 1, 2025
Cite
Hamna Kaleem (2025). Global Startup Success Dataset [Dataset]. https://www.kaggle.com/datasets/hamnakaleemds/global-startup-success-dataset
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 1, 2025
Dataset provided by
Kaggle http://kaggle.com/
Authors
Hamna Kaleem
License
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
📊 Dataset Features

This dataset includes 5,000 startups from 10 countries and contains 15 key features:

Startup Name: Name of the startup
Founded Year: Year the startup was founded
Country: Country where the startup is based
Industry: Industry category (Tech, FinTech, AI, etc.)
Funding Stage: Stage of investment (Seed, Series A, etc.)
Total Funding ($M): Total funding received (in million $)
Number of Employees: Number of employees in the startup
Annual Revenue ($M): Annual revenue in million dollars
Valuation ($B): Startup's valuation in billion dollars
Success Score: Score from 1 to 10 based on growth
Acquired?: Whether the startup was acquired (Yes/No)
IPO?: Did the startup go public? (Yes/No)
Customer Base (Millions): Number of active customers
Tech Stack: Technologies used by the startup
Social Media Followers: Total followers on social platforms

Analysis Ideas

📈 What Can You Do with This Dataset? Here are some exciting analyses you can perform:

Predict Startup Success: Train a machine learning model to predict the success score.
Industry Trends: Analyze which industries get the most funding.
Valuation vs. Funding: Explore the correlation between funding and valuation.
Acquisition Analysis: Investigate the factors that contribute to startups being acquired.
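
As a sketch of the "Valuation vs. Funding" idea, the snippet below computes a Pearson correlation coefficient by hand. The two lists are hypothetical stand-ins for the Total Funding ($M) and Valuation ($B) columns, not values taken from the dataset.

```python
import math


def pearson_correlation(xs, ys):
    """Pearson correlation coefficient between two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return covariance / math.sqrt(var_x * var_y)


# Hypothetical values: funding in million dollars, valuation in billion dollars.
funding_musd = [5.0, 20.0, 50.0, 120.0, 300.0]
valuation_busd = [0.02, 0.1, 0.3, 0.9, 2.5]
r = pearson_correlation(funding_musd, valuation_busd)
print(round(r, 3))
```

A coefficient near 1 indicates a strong positive linear relationship in the sample. It does not by itself show that funding causes higher valuations.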
‘California Housing Prices Data (5 new features!)’ analyzed by Analyst-2
analyst-2.ai
Updated Jul 28, 2021
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘California Housing Prices Data (5 new features!)’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-california-housing-prices-data-5-new-features-230f/d4c4de7c/?iid=000-393&v=presentation
Dataset updated
Jul 28, 2021
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
California
Description
Analysis of ‘California Housing Prices Data (5 new features!)’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/fedesoriano/california-housing-prices-data-extra-features on 28 January 2022.

--- Dataset description provided by original source is as follows ---

Similar Datasets:

Boston House Prices: LINK

Context

This dataset is a modified version of the California Housing Data used in the paper Pace, R. Kelley, and Ronald Barry. "Sparse spatial autoregressions." Statistics & Probability Letters 33.3 (1997): 291-297. It serves as an excellent introduction to implementing machine learning algorithms because it requires rudimentary data cleaning, has an easily understandable list of variables and sits at an optimal size between being too toyish and too cumbersome.

The data contains information from the 1990 California census. So although it may not help you with predicting current housing prices like the Zillow Zestimate dataset, it does provide an accessible introductory dataset for teaching people about the basics of machine learning.

Modifications with respect to the original data

This dataset includes 5 extra features defined by me: "Distance to coast", "Distance to Los Angeles", "Distance to San Diego", "Distance to San Jose", and "Distance to San Francisco". These extra features try to account for the distance to the nearest coast and the distance to the centre of the largest cities in California.

The distances were calculated using the Haversine formula with the Longitude and Latitude:

d = 2 r arcsin( sqrt( sin^2((phi_2 - phi_1) / 2) + cos(phi_1) cos(phi_2) sin^2((lambda_2 - lambda_1) / 2) ) )

where:

phi_1 and phi_2 are the Latitudes of point 1 and point 2, respectively

lambda_1 and lambda_2 are the Longitudes of point 1 and point 2, respectively

r is the radius of the Earth (6371km)
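
A minimal sketch of the Haversine calculation with the symbols defined above is shown below. It returns kilometres, whereas the dataset's distance columns are stated in metres. The city coordinates in the usage lines are approximate values chosen for illustration, not the reference points used by the dataset author.

```python
import math

EARTH_RADIUS_KM = 6371.0  # radius r stated in the description


def haversine_km(lat1, lon1, lat2, lon2, radius_km=EARTH_RADIUS_KM):
    """Great-circle distance in kilometres between two points given in degrees."""
    phi_1 = math.radians(lat1)
    phi_2 = math.radians(lat2)
    d_phi = phi_2 - phi_1
    d_lambda = math.radians(lon2 - lon1)
    a = (math.sin(d_phi / 2) ** 2
         + math.cos(phi_1) * math.cos(phi_2) * math.sin(d_lambda / 2) ** 2)
    # Clamp to 1.0 to guard against floating-point overshoot before asin.
    return 2 * radius_km * math.asin(math.sqrt(min(1.0, a)))


# Approximate city-centre coordinates (assumptions for illustration only).
los_angeles = (34.05, -118.24)
san_francisco = (37.77, -122.42)
distance_km = haversine_km(*los_angeles, *san_francisco)
print(round(distance_km), "km;", round(distance_km * 1000), "m")
```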

Content

The data pertains to the houses found in a given California district and some summary stats about them based on the 1990 census data. The columns are as follows; their names are largely self-explanatory:

1) Median House Value: Median house value for households within a block (measured in US Dollars) [$]
2) Median Income: Median income for households within a block of houses (measured in tens of thousands of US Dollars) [10k$]
3) Median Age: Median age of a house within a block; a lower number is a newer building [years]
4) Total Rooms: Total number of rooms within a block
5) Total Bedrooms: Total number of bedrooms within a block
6) Population: Total number of people residing within a block
7) Households: Total number of households, a group of people residing within a home unit, for a block
8) Latitude: A measure of how far north a house is; a higher value is farther north [°]
9) Longitude: A measure of how far west a house is; values are negative in California, so a more negative value is farther west [°]
10) Distance to coast: Distance to the nearest coast point [m]
11) Distance to Los Angeles: Distance to the centre of Los Angeles [m]
12) Distance to San Diego: Distance to the centre of San Diego [m]
13) Distance to San Jose: Distance to the centre of San Jose [m]
14) Distance to San Francisco: Distance to the centre of San Francisco [m]

Source

This data was entirely modified and cleaned by me. The original data (without the distance features) was initially featured in the following paper: Pace, R. Kelley, and Ronald Barry. "Sparse spatial autoregressions." Statistics & Probability Letters 33.3 (1997): 291-297.

The original dataset can be found under the following link: https://www.dcc.fc.up.pt/~ltorgo/Regression/cal_housing.html

--- Original source retains full ownership of the source dataset ---
Skin Characteristics Dataset 1
figshare.com
xlsx
Updated Jan 19, 2016
Cite
Stephen Elliott; Adam Graham (2016). Skin Characteristics Dataset 1 [Dataset]. http://doi.org/10.6084/m9.figshare.1155434.v1
Explore at:
Available download formats: xlsx
Unique identifier
https://doi.org/10.6084/m9.figshare.1155434.v1
Dataset updated
Jan 19, 2016
Dataset provided by
figshare
Authors
Stephen Elliott; Adam Graham
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is a part of the Skin Characteristics Dataset composed of 3 parts. The individual datasets include a unique subjectID for each sample, human subject demographics (age, gender, ethnicity), fingerprint image quality, fingerprint minutiae, and skin characteristics (temperature, moisture, oiliness, and elasticity). Each dataset is a separate set of data which includes some of the same individuals. NOTE: subjectIDs are unique to the dataset. Ex: subjectID 1 in Dataset 1 is not the same as subjectID 1 in Dataset 2 or Dataset 3.
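
Because subjectIDs are only unique within each part, combining the parts needs a key that also records which dataset a row came from. The sketch below shows that idea with a composite key. The rows, the `moisture` field name, and its values are hypothetical.

```python
# Hypothetical rows; field names other than subjectID and all values are illustrative.
dataset_1 = [{"subjectID": 1, "moisture": 41.0}]
dataset_2 = [{"subjectID": 1, "moisture": 58.5}]


def combine(datasets):
    """Merge rows from several datasets under a (dataset name, subjectID) key."""
    combined = {}
    for name, rows in datasets.items():
        for row in rows:
            combined[(name, row["subjectID"])] = row
    return combined


combined = combine({"Dataset 1": dataset_1, "Dataset 2": dataset_2})
print(sorted(combined))
```

Keying on subjectID alone would silently overwrite one subject's row with another's when the parts are merged.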
NYC Open Data
kaggle.com
zip
Updated Mar 20, 2019
Cite
NYC Open Data (2019). NYC Open Data [Dataset]. https://www.kaggle.com/nycopendata/new-york
Explore at:
Available download formats: zip (0 bytes)
Dataset updated
Mar 20, 2019
Dataset authored and provided by
NYC Open Data
License
https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

NYC Open Data is an opportunity to engage New Yorkers in the information that is produced and used by City government. We believe that every New Yorker can benefit from Open Data, and Open Data can benefit from every New Yorker. Source: https://opendata.cityofnewyork.us/overview/

Content

Thanks to NYC Open Data, which makes public data generated by city agencies available for public use, and Citi Bike, we've incorporated over 150 GB of data in 5 open datasets into Google BigQuery Public Datasets, including:

Over 8 million 311 service requests from 2012-2016

More than 1 million motor vehicle collisions 2012-present

Citi Bike stations and 30 million Citi Bike trips 2013-present

Over 1 billion Yellow and Green Taxi rides from 2009-present

Over 500,000 sidewalk trees surveyed decennially in 1995, 2005, and 2015

This dataset is deprecated and not being updated.

Fork this kernel to get started with this dataset.

Acknowledgements

https://opendata.cityofnewyork.us/

https://cloud.google.com/blog/big-data/2017/01/new-york-city-public-datasets-now-available-on-google-bigquery

This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - https://data.cityofnewyork.us/ - and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.

By accessing datasets and feeds available through NYC Open Data, the user agrees to all of the Terms of Use of NYC.gov as well as the Privacy Policy for NYC.gov. The user also agrees to any additional terms of use defined by the agencies, bureaus, and offices providing data. Public data sets made available on NYC Open Data are provided for informational purposes. The City does not warranty the completeness, accuracy, content, or fitness for any particular purpose or use of any public data set made available on NYC Open Data, nor are any such warranties to be implied or inferred with respect to the public data sets furnished therein.

The City is not liable for any deficiencies in the completeness, accuracy, content, or fitness for any particular purpose or use of any public data set, or application utilizing such data set, provided by any third party.

Banner Photo by @bicadmedia from Unsplash.

Inspiration

On which New York City streets are you most likely to find a loud party?

Can you find the Virginia Pines in New York City?

Where was the only collision caused by an animal that injured a cyclist?

What’s the Citi Bike record for the Longest Distance in the Shortest Time (on a route with at least 100 rides)?

https://cloud.google.com/blog/big-data/2017/01/images/148467900588042/nyc-dataset-6.png
2011 American Community Survey: 1-Year Estimates - Public Use Microdata...
catalog.data.gov
Updated Sep 8, 2023
+ more versions
Cite
U.S. Census Bureau (2023). 2011 American Community Survey: 1-Year Estimates - Public Use Microdata Sample [Dataset]. https://catalog.data.gov/dataset/2011-american-community-survey-1-year-estimates-public-use-microdata-sample
Dataset updated
Sep 8, 2023
Dataset provided by
United States Census Bureau http://census.gov/
Description
The American Community Survey (ACS) Public Use Microdata Sample (PUMS) contains a sample of responses to the ACS. The ACS PUMS dataset includes variables for nearly every question on the survey, as well as many new variables that were derived after the fact from multiple survey responses (such as poverty status). Each record in the file represents a single person, or, in the household-level dataset, a single housing unit. In the person-level file, individuals are organized into households, making possible the study of people within the contexts of their families and other household members. Individuals living in Group Quarters, such as nursing facilities or college facilities, are also included on the person file. ACS PUMS data are available at the nation, state, and Public Use Microdata Area (PUMA) levels. PUMAs are special non-overlapping areas that partition each state into contiguous geographic units containing roughly 100,000 people each. ACS PUMS files for an individual year, such as 2020, contain data on approximately one percent of the United States population.
Los Angeles, CA annual income distribution by work experience and gender...
neilsberg.com
csv, json
Updated Jan 9, 2024
+ more versions
Cite
Neilsberg Research (2024). Los Angeles, CA annual income distribution by work experience and gender dataset (Number of individuals ages 15+ with income, 2022) [Dataset]. https://www.neilsberg.com/research/datasets/23e5ec76-981b-11ee-99cf-3860777c1fe6/
Explore at:
Available download formats: csv, json
Dataset updated
Jan 9, 2024
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Los Angeles, California
Variables measured
Income for Male Population, Income for Female Population, Income for Male Population working full time, Income for Male Population working part time, Income for Female Population working full time, Income for Female Population working part time, Number of males working full time for a given income bracket, Number of males working part time for a given income bracket, Number of females working full time for a given income bracket, Number of females working part time for a given income bracket
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2022 1-Year Estimates. To portray the number of individuals of both genders (male and female) within each income bracket, we conducted an initial analysis and categorization of the American Community Survey data. Households are categorized, and median incomes are reported based on the self-identified gender of the head of the household. For additional information about these estimations, please contact us via email at research@neilsberg.com
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset presents a detailed breakdown of the count of individuals within distinct income brackets, categorized by gender (men and women) and employment type, full-time (FT) and part-time (PT), offering insights into the income landscape within Los Angeles. The dataset can be used to examine gender-based income distribution within the Los Angeles population, aiding data analysis and decision-making.

Key observations

Employment patterns: Within Los Angeles, among individuals aged 15 years and older with income, there were 1.41 million men and 1.31 million women in the workforce. Among them, 767.32 thousand men were engaged in full-time, year-round employment, while 582.95 thousand women were in full-time, year-round roles.

Annual income under $24,999: Of the male population working full-time, 10.10% fell within the income range of under $24,999, while 10.29% of the female population working full-time was represented in the same income bracket.

Annual income above $100,000: 27.42% of men in full-time roles earned incomes exceeding $100,000, while 23.14% of women in full-time positions earned within this income bracket.

Refer to the research insights for more key observations on more income brackets (annual income under $24,999, between $25,000 and $49,999, between $50,000 and $74,999, between $75,000 and $99,999, and above $100,000) and employment types (full-time year-round and part-time).

Figure: Los Angeles, CA gender and employment-based income distribution analysis (Ages 15+). Image: https://i.neilsberg.com/ch/los-angeles-ca-income-distribution-by-gender-and-employment-type.jpeg

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2022 1-Year Estimates.

Income brackets:

$1 to $2,499 or loss

$2,500 to $4,999

$5,000 to $7,499

$7,500 to $9,999

$10,000 to $12,499

$12,500 to $14,999

$15,000 to $17,499

$17,500 to $19,999

$20,000 to $22,499

$22,500 to $24,999

$25,000 to $29,999

$30,000 to $34,999

$35,000 to $39,999

$40,000 to $44,999

$45,000 to $49,999

$50,000 to $54,999

$55,000 to $64,999

$65,000 to $74,999

$75,000 to $99,999

$100,000 or more

Variables / Data Columns

Income Bracket: This column showcases 20 income brackets ranging from $1 to $100,000+.

Full-Time Males: The count of males employed full-time year-round and earning within a specified income bracket

Part-Time Males: The count of males employed part-time and earning within a specified income bracket

Full-Time Females: The count of females employed full-time year-round and earning within a specified income bracket

Part-Time Females: The count of females employed part-time and earning within a specified income bracket

Employment type classifications include:

Full-time, year-round: A full-time, year-round worker is a person who worked full time (35 or more hours per week) and 50 or more weeks during the previous calendar year.

Part-time: A part-time worker is a person who worked less than 35 hours per week during the previous calendar year.
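
The two employment-type definitions above can be read as a simple classification rule. The following is a minimal sketch, assuming hypothetical inputs `hours_per_week` and `weeks_worked`; the function name and the "other" label (for workers matching neither definition, such as full-time hours for fewer than 50 weeks) are illustrative and not part of the dataset.

```python
def classify_worker(hours_per_week: float, weeks_worked: int) -> str:
    """Classify a worker using the two definitions quoted in the description."""
    # Full-time, year-round: 35 or more hours per week and 50 or more weeks.
    if hours_per_week >= 35 and weeks_worked >= 50:
        return "full-time, year-round"
    # Part-time: less than 35 hours per week.
    if hours_per_week < 35:
        return "part-time"
    # Neither definition applies (assumed label, not defined by the source).
    return "other"


print(classify_worker(40, 52))
print(classify_worker(20, 52))
print(classify_worker(40, 30))
```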

Good to know

Margin of Error

Data in the dataset are based on estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presenting these estimates in your research.

Custom data

If you need custom data for any of your research projects, reports, or presentations, you can contact our research staff at research@neilsberg.com to discuss the feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

The Neilsberg Research team curates, analyzes, and publishes demographic and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights are made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is part of the main dataset for Los Angeles median household income by gender. You can refer to the same here.
Stock Portfolio Data with Prices and Indices
kaggle.com
Updated Mar 23, 2025
Cite
Nikita Manaenkov (2025). Stock Portfolio Data with Prices and Indices [Dataset]. http://doi.org/10.34740/kaggle/dsv/11140976
Explore at:
Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
Unique identifier
https://doi.org/10.34740/kaggle/dsv/11140976
Dataset updated
Mar 23, 2025
Dataset provided by
Kaggle
Authors
Nikita Manaenkov
License
https://www.gnu.org/licenses/gpl-3.0.html
Description
This dataset consists of five CSV files that provide detailed data on a stock portfolio and related market performance over the last 5 years. It includes portfolio positions, stock prices, and major U.S. market indices (NASDAQ, S&P 500, and Dow Jones). The data is essential for conducting portfolio analysis, financial modeling, and performance tracking.

1. Portfolio

This file contains the portfolio composition with details about individual stock positions, including the quantity of shares, sector, and their respective weights in the portfolio. The data also includes the stock's closing price.

Columns:

Ticker: The stock symbol (e.g., AAPL, TSLA)

Quantity: The number of shares in the portfolio

Sector: The sector the stock belongs to (e.g., Technology, Healthcare)

Close: The closing price of the stock

Weight: The weight of the stock in the portfolio (as a percentage of total portfolio)
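
As an illustration of how the Weight column could relate to Quantity and Close, here is a minimal sketch that assumes weights are market-value shares (Quantity × Close divided by the portfolio total) expressed on a 0 to 100 scale. The description does not state the exact formula, and the tickers and figures below are made up.

```python
def portfolio_weights(positions):
    """Return each ticker's share of total market value, in percent.

    `positions` is a list of dicts with the keys Ticker, Quantity and Close,
    mirroring the column names listed above.
    """
    values = {p["Ticker"]: p["Quantity"] * p["Close"] for p in positions}
    total = sum(values.values())
    return {ticker: 100.0 * value / total for ticker, value in values.items()}


# Hypothetical positions, not taken from the dataset.
positions = [
    {"Ticker": "AAA", "Quantity": 10, "Close": 50.0},   # market value 500
    {"Ticker": "BBB", "Quantity": 5, "Close": 100.0},   # market value 500
    {"Ticker": "CCC", "Quantity": 20, "Close": 50.0},   # market value 1000
]
weights = portfolio_weights(positions)
print(weights)
```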

2. Portfolio Prices

This file contains historical pricing data for the stocks in the portfolio. It includes daily open, high, low, close prices, adjusted close prices, returns, and volume of traded stocks.

Columns:

Date: The date of the data point

Ticker: The stock symbol

Open: The opening price of the stock on that day

High: The highest price reached on that day

Low: The lowest price reached on that day

Close: The closing price of the stock

Adjusted: The adjusted closing price after stock splits and dividends

Returns: Daily percentage return based on close prices

Volume: The volume of shares traded that day
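
The Returns column is described as a daily percentage return based on close prices. A minimal sketch of that calculation follows, assuming a simple (not logarithmic) return expressed in percent; whether the file stores percentages or fractions is not stated, and the prices below are invented.

```python
def daily_returns(closes):
    """Simple day-over-day percentage return from a list of closing prices.

    The first day has no previous close, so its return is None.
    """
    returns = [None]
    for prev, curr in zip(closes, closes[1:]):
        returns.append((curr - prev) / prev * 100.0)
    return returns


# Hypothetical closing prices for three consecutive trading days.
closes = [100.0, 102.0, 99.96]
returns = daily_returns(closes)
print(returns)
```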

3. NASDAQ

This file contains historical pricing data for the NASDAQ Composite index, providing similar data as in the Portfolio Prices file, but for the NASDAQ market index.

Columns:

Date: The date of the data point

Ticker: The stock symbol (for NASDAQ index, this will be "IXIC")

Open: The opening price of the index

High: The highest value reached on that day

Low: The lowest value reached on that day

Close: The closing value of the index

Adjusted: The adjusted closing value after any corporate actions

Returns: Daily percentage return based on close values

Volume: The volume of shares traded

4. S&P 500

This file contains similar historical pricing data, but for the S&P 500 index, providing insights into the performance of the top 500 U.S. companies.

Columns:

Date: The date of the data point

Ticker: The stock symbol (for S&P 500 index, this will be "SPX")

Open: The opening price of the index

High: The highest value reached on that day

Low: The lowest value reached on that day

Close: The closing value of the index

Adjusted: The adjusted closing value after any corporate actions

Returns: Daily percentage return based on close values

Volume: The volume of shares traded

5. Dow Jones

This file contains similar historical pricing data for the Dow Jones Industrial Average, providing insights into one of the most widely followed stock market indices in the world.

Columns:

Date: The date of the data point

Ticker: The stock symbol (for Dow Jones index, this will be "DJI")

Open: The opening price of the index

High: The highest value reached on that day

Low: The lowest value reached on that day

Close: The closing value of the index

Adjusted: The adjusted closing value after any corporate actions

Returns: Daily percentage return based on close values

Volume: The volume of shares traded

Personal Portfolio Data

This data is retrieved using a custom framework that fetches real-time and historical stock data from Yahoo Finance. It provides the portfolio’s data based on user-specific stock holdings and performance, allowing for personalized analysis. The personal framework ensures the portfolio data is automatically retrieved and updated with the latest stock prices, returns, and performance metrics.

This part of the dataset would typically involve data specific to a particular user’s stock positions, weights, and performance, which can be integrated with the other files for portfolio performance analysis.
2023 Census totals by topic for individuals by statistical area 1 – part 2
datafinder.stats.govt.nz
csv, dwg, geodatabase +6
Updated Dec 9, 2024
Cite
Stats NZ (2024). 2023 Census totals by topic for individuals by statistical area 1 – part 2 [Dataset]. https://datafinder.stats.govt.nz/layer/120792-2023-census-totals-by-topic-for-individuals-by-statistical-area-1-part-2/
Explore at:
Available download formats: csv, shapefile, pdf, geodatabase, kml, geopackage / sqlite, mapinfo tab, mapinfo mif, dwg
Dataset updated
Dec 9, 2024
Dataset provided by
Statistics New Zealand (http://www.stats.govt.nz/)
Authors
Stats NZ
License
https://datafinder.stats.govt.nz/license/attribution-4-0-international/
Area covered
Description
Dataset contains counts and measures for individuals from the 2013, 2018, and 2023 Censuses. Data is available by statistical area 1.

The variables included in this dataset are for the census usually resident population count (unless otherwise stated). All data is for level 1 of the classification.

The variables for part 2 of the dataset are:

Individual home ownership for the census usually resident population count aged 15 years and over

Usual residence 1 year ago indicator

Usual residence 5 years ago indicator

Years at usual residence

Average years at usual residence

Years since arrival in New Zealand for the overseas-born census usually resident population count

Average years since arrival in New Zealand for the overseas-born census usually resident population count

Study participation

Main means of travel to education, by usual residence address for the census usually resident population who are studying

Main means of travel to education, by education address for the census usually resident population who are studying

Highest qualification for the census usually resident population count aged 15 years and over

Post-school qualification in New Zealand indicator for the census usually resident population count aged 15 years and over

Highest secondary school qualification for the census usually resident population count aged 15 years and over

Post-school qualification level of attainment for the census usually resident population count aged 15 years and over

Sources of personal income (total responses) for the census usually resident population count aged 15 years and over

Total personal income for the census usually resident population count aged 15 years and over

Median ($) total personal income for the census usually resident population count aged 15 years and over

Work and labour force status for the census usually resident population count aged 15 years and over

Job search methods (total responses) for the unemployed census usually resident population count aged 15 years and over

Status in employment for the employed census usually resident population count aged 15 years and over

Unpaid activities (total responses) for the census usually resident population count aged 15 years and over

Hours worked in employment per week for the employed census usually resident population count aged 15 years and over

Average hours worked in employment per week for the employed census usually resident population count aged 15 years and over

Industry, by usual residence address for the employed census usually resident population count aged 15 years and over

Industry, by workplace address for the employed census usually resident population count aged 15 years and over

Occupation, by usual residence address for the employed census usually resident population count aged 15 years and over

Occupation, by workplace address for the employed census usually resident population count aged 15 years and over

Main means of travel to work, by usual residence address for the employed census usually resident population count aged 15 years and over

Main means of travel to work, by workplace address for the employed census usually resident population count aged 15 years and over

Sector of ownership for the employed census usually resident population count aged 15 years and over

Individual unit data source.

Download lookup file for part 2 from Stats NZ ArcGIS Online or embedded attachment in Stats NZ geographic data service. Download data table (excluding the geometry column for CSV files) using the instructions in the Koordinates help guide.

Footnotes

Te Whata

Under the Mana Ōrite Relationship Agreement, Te Kāhui Raraunga (TKR) will be publishing Māori descent and iwi affiliation data from the 2023 Census in partnership with Stats NZ. This will be available on Te Whata, a TKR platform.

Geographical boundaries

Statistical standard for geographic areas 2023 (updated December 2023) has information about geographic boundaries as of 1 January 2023. Address data from 2013 and 2018 Censuses was updated to be consistent with the 2023 areas. Due to the changes in area boundaries and coding methodologies, 2013 and 2018 counts published in 2023 may be slightly different to those published in 2013 or 2018.

Subnational census usually resident population

The census usually resident population count of an area (subnational count) is a count of all people who usually live in that area and were present in New Zealand on census night. It excludes visitors from overseas, visitors from elsewhere in New Zealand, and residents temporarily overseas on census night. For example, a person who usually lives in Christchurch city and is visiting Wellington city on census night will be included in the census usually resident population count of Christchurch city.

Population counts

Stats NZ publishes a number of different population counts, each using a different definition and methodology. Population statistics – user guide has more information about different counts.

Caution using time series

Time series data should be interpreted with care due to changes in census methodology and differences in response rates between censuses. The 2023 and 2018 Censuses used a combined census methodology (using census responses and administrative data), while the 2013 Census used a full-field enumeration methodology (with no use of administrative data).

Study participation time series

In the 2013 Census study participation was only collected for the census usually resident population count aged 15 years and over.

About the 2023 Census dataset

For information on the 2023 dataset see Using a combined census model for the 2023 Census. We combined data from the census forms with administrative data to create the 2023 Census dataset, which meets Stats NZ's quality criteria for population structure information. We added real data about real people to the dataset where we were confident the people who hadn’t completed a census form (which is known as admin enumeration) will be counted. We also used data from the 2018 and 2013 Censuses, administrative data sources, and statistical imputation methods to fill in some missing characteristics of people and dwellings.

Data quality

The quality of data in the 2023 Census is assessed using the quality rating scale and the quality assurance framework to determine whether data is fit for purpose and suitable for release. Data quality assurance in the 2023 Census has more information.

Concept descriptions and quality ratings

Data quality ratings for 2023 Census variables has additional details about variables found within totals by topic, for example, definitions and data quality.

Disability indicator

This data should not be used as an official measure of disability prevalence. Disability prevalence estimates are only available from the 2023 Household Disability Survey. Household Disability Survey 2023: Final content has more information about the survey.

Activity limitations are measured using the Washington Group Short Set (WGSS). The WGSS asks about six basic activities that a person might have difficulty with: seeing, hearing, walking or climbing stairs, remembering or concentrating, washing all over or dressing, and communicating. A person was classified as disabled in the 2023 Census if there was at least one of these activities that they had a lot of difficulty with or could not do at all.
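
The classification rule described above can be sketched in a few lines. This is only an illustration: the domain keys and response labels below are hypothetical stand-ins for the six WGSS activities and their answer categories, not the coding used in the census data.

```python
# Hypothetical keys for the six WGSS activities listed above.
WGSS_DOMAINS = (
    "seeing",
    "hearing",
    "walking_or_climbing_stairs",
    "remembering_or_concentrating",
    "washing_or_dressing",
    "communicating",
)

# Responses that count towards the disability indicator (assumed labels).
SEVERE_RESPONSES = {"a lot of difficulty", "cannot do at all"}


def is_disabled(responses: dict) -> bool:
    """True if at least one activity has a severe response."""
    return any(responses.get(domain) in SEVERE_RESPONSES for domain in WGSS_DOMAINS)


print(is_disabled({"seeing": "some difficulty", "hearing": "a lot of difficulty"}))
print(is_disabled({"seeing": "some difficulty", "walking_or_climbing_stairs": "no difficulty"}))
```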

Using data for good

Stats NZ expects that, when working with census data, it is done so with a positive purpose, as outlined in the Māori Data Governance Model (Data Iwi Leaders Group, 2023). This model states that "data should support transformative outcomes and should uplift and strengthen our relationships with each other and with our environments. The avoidance of harm is the minimum expectation for data use. Māori data should also contribute to iwi and hapū tino rangatiratanga”.

Confidentiality

The 2023 Census confidentiality rules have been applied to 2013, 2018, and 2023 data. These rules protect the confidentiality of individuals, families, households, dwellings, and undertakings in 2023 Census data. Counts are calculated using fixed random rounding to base 3 (FRR3) and suppression of ‘sensitive’ counts less than six, where tables report multiple geographic variables and/or small populations. Individual figures may not always sum to stated totals. Applying confidentiality rules to 2023 Census data and summary of changes since 2018 and 2013 Censuses has more information about 2023 Census confidentiality rules.
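
To give a feel for why individual figures may not sum to stated totals, here is a rough sketch of random rounding to base 3. It assumes the commonly described scheme in which a count that is already a multiple of 3 is left unchanged and any other count moves to the nearer multiple with probability 2/3 and to the farther one with probability 1/3. The "fixed" behaviour is imitated by deriving the draw from a hash of a hypothetical cell key, so the same cell always rounds the same way. None of this is Stats NZ's actual implementation, and the suppression of small counts is not modelled.

```python
import hashlib


def fixed_random_round_base3(count: int, cell_key: str) -> int:
    """Round `count` to a multiple of 3, deterministically per (cell_key, count)."""
    remainder = count % 3
    if remainder == 0:
        return count  # already a multiple of 3, left unchanged
    lower = count - remainder
    upper = lower + 3
    # remainder 1 is closer to the lower multiple, remainder 2 to the upper one
    nearest, farthest = (lower, upper) if remainder == 1 else (upper, lower)
    # Deterministic pseudo-random draw in [0, 1) derived from the key and count.
    digest = hashlib.sha256(f"{cell_key}:{count}".encode()).digest()
    draw = int.from_bytes(digest[:8], "big") / 2**64
    return nearest if draw < 2 / 3 else farthest


# Hypothetical cell key; repeated calls return the same rounded value.
print(fixed_random_round_base3(10, "area-A|variable-X"))
print(fixed_random_round_base3(10, "area-A|variable-X"))
```

Because each cell is rounded independently, the rounded parts of a table need not add up to the rounded total, which matches the caveat in the paragraph above.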

Measures

Measures like averages, medians, and other quantiles are calculated from unrounded counts, with input noise added to or subtracted from each contributing value
Weekly United States COVID-19 Cases and Deaths by State - ARCHIVED
data.cdc.gov
data.virginia.gov
+1more
application/rdfxml +5
Updated Oct 6, 2022
Cite
CDC COVID-19 Response (2022). Weekly United States COVID-19 Cases and Deaths by State - ARCHIVED [Dataset]. https://data.cdc.gov/Case-Surveillance/Weekly-United-States-COVID-19-Cases-and-Deaths-by-/pwn4-m3yp
Explore at:
Available download formats: csv, application/rdfxml, xml, tsv, json, application/rssxml
Dataset updated
Oct 6, 2022
Dataset provided by
Centers for Disease Control and Prevention (http://www.cdc.gov/)
Authors
CDC COVID-19 Response
License
https://www.usa.gov/government-works
Area covered
United States
Description
Reporting of new Aggregate Case and Death Count data was discontinued May 11, 2023, with the expiration of the COVID-19 public health emergency declaration. This dataset will receive a final update on June 1, 2023, to reconcile historical data through May 10, 2023, and will remain publicly available.

Aggregate Data Collection Process

Since the start of the COVID-19 pandemic, data have been gathered through a robust process with the following steps:

A CDC data team reviews and validates the information obtained from jurisdictions’ state and local websites via an overnight data review process.

If more than one official county data source exists, CDC uses a comprehensive data selection process comparing each official county data source, and takes the highest case and death counts respectively, unless otherwise specified by the state.

CDC compiles these data and posts the finalized information on COVID Data Tracker.

County level data is aggregated to obtain state and territory specific totals.
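
The selection and aggregation steps above can be sketched as follows. This is a simplified illustration, not CDC's pipeline: the record layout and the state and county names are hypothetical, and the "unless otherwise specified by the state" exception is not modelled.

```python
from collections import defaultdict


def select_county_counts(reports):
    """Keep the highest case count and highest death count per county.

    `reports` is an iterable of (state, county, cases, deaths) tuples, possibly
    with several official sources for the same county. Cases and deaths are
    maximised independently, as the description says "respectively".
    """
    best = {}
    for state, county, cases, deaths in reports:
        prev_cases, prev_deaths = best.get((state, county), (0, 0))
        best[(state, county)] = (max(prev_cases, cases), max(prev_deaths, deaths))
    return best


def state_totals(county_counts):
    """Sum the selected county counts into per-state (cases, deaths) totals."""
    totals = defaultdict(lambda: [0, 0])
    for (state, _county), (cases, deaths) in county_counts.items():
        totals[state][0] += cases
        totals[state][1] += deaths
    return {state: tuple(pair) for state, pair in totals.items()}


# Hypothetical reports: two sources for county "Alpha", one for "Beta".
reports = [
    ("XX", "Alpha", 100, 3),
    ("XX", "Alpha", 104, 2),
    ("XX", "Beta", 50, 1),
]
counties = select_county_counts(reports)
totals = state_totals(counties)
print(counties)
print(totals)
```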

This process is collaborative, with CDC and jurisdictions working together to ensure the accuracy of COVID-19 case and death numbers. County counts provide the most up-to-date numbers on cases and deaths by report date. CDC may retrospectively update counts to correct data quality issues.

Methodology Changes

Several differences exist between the current, weekly-updated dataset and the archived version:

Source: The current Weekly-Updated Version is based on county-level aggregate count data, while the Archived Version is based on State-level aggregate count data.

Confirmed/Probable Cases/Death breakdown:  While the probable cases and deaths are included in the total case and total death counts in both versions (if applicable), they were reported separately from the confirmed cases and deaths by jurisdiction in the Archived Version.  In the current Weekly-Updated Version, the counts by jurisdiction are not reported by confirmed or probable status (See Confirmed and Probable Counts section for more detail).

Time Series Frequency: The current Weekly-Updated Version contains weekly time series data (i.e., one record per week per jurisdiction), while the Archived Version contains daily time series data (i.e., one record per day per jurisdiction).

Update Frequency: The current Weekly-Updated Version is updated weekly, while the Archived Version was updated twice daily up to October 20, 2022.

Important note: The counts reflected during a given time period in this dataset may not match the counts reflected for the same time period in the archived dataset noted above. Discrepancies may exist due to differences between county and state COVID-19 case surveillance and reconciliation efforts.

Confirmed and Probable Counts

In this dataset, counts by jurisdiction are not displayed by confirmed or probable status. Instead, confirmed and probable cases and deaths are included in the Total Cases and Total Deaths columns, when available. Not all jurisdictions report probable cases and deaths to CDC. Confirmed and probable case definition criteria are described here:

Council of State and Territorial Epidemiologists (ymaws.com).

Deaths

CDC reports death data on other sections of the website: CDC COVID Data Tracker: Home, CDC COVID Data Tracker: Cases, Deaths, and Testing, and NCHS Provisional Death Counts. Information presented on the COVID Data Tracker pages is based on the same source (total case counts) as the present dataset; however, NCHS Death Counts are based on death certificates that use information reported by physicians, medical examiners, or coroners in the cause-of-death section of each certificate. Data from each of these pages are considered provisional (not complete and pending verification) and are therefore subject to change. Counts from previous weeks are continually revised as more records are received and processed.

Number of Jurisdictions Reporting

There are currently 60 public health jurisdictions reporting cases of COVID-19. This includes the 50 states, the District of Columbia, New York City, the U.S. territories of American Samoa, Guam, the Commonwealth of the Northern Mariana Islands, Puerto Rico, and the U.S. Virgin Islands, as well as three independent countries in compacts of free association with the United States: the Federated States of Micronesia, the Republic of the Marshall Islands, and the Republic of Palau. New York State’s reported case and death counts do not include New York City’s counts, as the two jurisdictions separately report nationally notifiable conditions to CDC.

CDC COVID-19 data are available to the public as summary or aggregate count files, including total counts of cases and deaths, available by state and by county. These and other data on COVID-19 are available from multiple public locations, such as:

https://www.cdc.gov/coronavirus/2019-ncov/cases-updates/cases-in-us.html

https://www.cdc.gov/covid-data-tracker/index.html

https://www.cdc.gov/coronavirus/2019-ncov/covid-data/covidview/index.html

https://www.cdc.gov/coronavirus/2019-ncov/php/open-america/surveillance-data-analytics.html

Additional COVID-19 public use datasets, including line-level (patient-level) data, are available at: https://data.cdc.gov/browse?tags=covid-19.

Archived Data Notes:

November 3, 2022: Due to a reporting cadence issue, case rates for Missouri counties are calculated based on 11 days’ worth of case count data in the Weekly United States COVID-19 Cases and Deaths by State data released on November 3, 2022, instead of the customary 7 days’ worth of data.

November 10, 2022: Due to a reporting cadence change, case rates for Alabama counties are calculated based on 13 days’ worth of case count data in the Weekly United States COVID-19 Cases and Deaths by State data released on November 10, 2022, instead of the customary 7 days’ worth of data.

November 10, 2022: Per the request of the jurisdiction, cases and deaths among non-residents have been removed from all Hawaii county totals throughout the entire time series. Cumulative case and death counts reported by CDC will no longer match Hawaii’s COVID-19 Dashboard, which still includes non-resident cases and deaths. 

November 17, 2022: Two new columns, weekly historic cases and weekly historic deaths, were added to this dataset on November 17, 2022. These columns reflect case and death counts that were reported that week but were historical in nature and not reflective of the current burden within the jurisdiction. These historical cases and deaths are not included in the new weekly case and new weekly death columns; however, they are reflected in the cumulative totals provided for each jurisdiction. These data are used to account for artificial increases in case and death totals due to batched reporting of historical data.

December 1, 2022: Due to cadence changes over the Thanksgiving holiday, case rates for all Ohio counties are reported as 0 in the data released on December 1, 2022.

January 5, 2023: Due to North Carolina’s holiday reporting cadence, aggregate case and death data will contain 14 days’ worth of data instead of the customary 7 days. As a result, case and death metrics will appear higher than expected in the January 5, 2023, weekly release.

January 12, 2023: Due to data processing delays, Mississippi’s aggregate case and death data will be reported as 0. As a result, case and death metrics will appear lower than expected in the January 12, 2023, weekly release.

January 19, 2023: Due to a reporting cadence issue, Mississippi’s aggregate case and death data will be calculated based on 14 days’ worth of data instead of the customary 7 days in the January 19, 2023, weekly release.

January 26, 2023: Due to a reporting backlog of historic COVID-19 cases, case rates for two Michigan counties (Livingston and Washtenaw) were higher than expected in the January 19, 2023 weekly release.

January 26, 2023: Due to a backlog of historic COVID-19 cases being reported this week, aggregate case and death counts in Charlotte County and Sarasota County, Florida, will appear higher than expected in the January 26, 2023 weekly release.

January 26, 2023: Due to data processing delays, Mississippi’s aggregate case and death data will be reported as 0 in the weekly release posted on January 26, 2023.

February 2, 2023: As of the data collection deadline, CDC observed an abnormally large increase in aggregate COVID-19 cases and deaths reported for Washington State. In response, totals for new cases and new deaths released on February 2, 2023, have been displayed as zero at the state level until the issue is addressed with state officials. CDC is working with state officials to address the issue.

February 2, 2023: Due to a decrease reported in cumulative case counts by Wyoming, case rates will be reported as 0 in the February 2, 2023, weekly release. CDC is working with state officials to verify the data submitted.

February 16, 2023: Due to data processing delays, Utah’s aggregate case and death data will be reported as 0 in the weekly release posted on February 16, 2023. As a result, case and death metrics will appear lower than expected and should be interpreted with caution.

February 16, 2023: Due to a reporting cadence change, Maine’s
COVID19 - The New York Times
kaggle.com
zip
Updated May 18, 2020
Cite
Google BigQuery (2020). COVID19 - The New York Times [Dataset]. https://www.kaggle.com/datasets/bigquery/covid19-nyt
Explore at:
Available download formats: zip (0 bytes)
Dataset updated
May 18, 2020
Dataset provided by
BigQuery (https://cloud.google.com/bigquery)
Authors
Google BigQuery
Description
Context

This is the US Coronavirus data repository from The New York Times. This data includes COVID-19 cases and deaths reported by state and county. The New York Times compiled this data based on reports from state and local health agencies. More information on the data repository is available here. For additional reporting and data visualizations, see The New York Times’ U.S. coronavirus interactive site.

Sample Queries

Query 1

Which US counties have the most confirmed cases per capita? This query determines which counties have the most cases per 100,000 residents. Note that this may differ from similar queries of other datasets because of differences in reporting lag, methodologies, or other dataset differences.

SELECT
  covid19.county,
  covid19.state_name,
  total_pop AS county_population,
  confirmed_cases,
  ROUND(confirmed_cases / total_pop * 100000, 2) AS confirmed_cases_per_100000,
  deaths,
  ROUND(deaths / total_pop * 100000, 2) AS deaths_per_100000
FROM `bigquery-public-data.covid19_nyt.us_counties` covid19
JOIN `bigquery-public-data.census_bureau_acs.county_2017_5yr` acs
  ON covid19.county_fips_code = acs.geo_id
WHERE date = DATE_SUB(CURRENT_DATE(), INTERVAL 1 day)
  AND covid19.county_fips_code != "00000"
ORDER BY confirmed_cases_per_100000 DESC
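
For readers working outside BigQuery, the per-capita arithmetic in Query 1 can be reproduced in plain Python. The sketch below uses made-up county figures and a hypothetical helper name. Note that Python's `round` and BigQuery's `ROUND` can treat exact halfway cases differently, so the last digit may occasionally differ.

```python
def per_100k(count: int, population: int) -> float:
    """Rate per 100,000 residents, rounded to two decimals as in Query 1."""
    return round(count / population * 100_000, 2)


# Hypothetical counties: (name, population, confirmed cases, deaths).
counties = [
    ("County A", 50_000, 250, 3),
    ("County B", 200_000, 400, 10),
]

# Rank counties by confirmed cases per 100,000 residents, highest first.
ranked = sorted(
    (
        (name, per_100k(cases, pop), per_100k(deaths, pop))
        for name, pop, cases, deaths in counties
    ),
    key=lambda row: row[1],
    reverse=True,
)
print(ranked)
```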

Query 2

How do I calculate the number of new COVID-19 cases per day? This query determines the total number of new cases in each state for each day available in the dataset.

SELECT
  b.state_name,
  b.date,
  MAX(b.confirmed_cases - a.confirmed_cases) AS daily_confirmed_cases
FROM (
  SELECT
    state_name AS state,
    state_fips_code,
    confirmed_cases,
    DATE_ADD(date, INTERVAL 1 day) AS date_shift
  FROM `bigquery-public-data.covid19_nyt.us_states`
  WHERE confirmed_cases + deaths > 0
) a
JOIN `bigquery-public-data.covid19_nyt.us_states` b
  ON a.state_fips_code = b.state_fips_code
  AND a.date_shift = b.date
GROUP BY b.state_name, date
ORDER BY date DESC
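
Query 2 works by joining each day's cumulative count to the previous day's count and subtracting. The same idea is sketched below in plain Python for a single state, using invented figures; as with the self-join, a day is skipped when the previous calendar day has no record. The query's extra filter on `confirmed_cases + deaths > 0` is not reproduced here.

```python
from datetime import date, timedelta


def daily_new_cases(cumulative):
    """Difference each day's cumulative count against the previous calendar day.

    `cumulative` maps a date to the cumulative confirmed cases for one state.
    """
    new_cases = {}
    for day, total in cumulative.items():
        previous = cumulative.get(day - timedelta(days=1))
        if previous is not None:
            new_cases[day] = total - previous
    return new_cases


# Hypothetical cumulative counts with a gap on 4 March.
cumulative = {
    date(2020, 3, 1): 10,
    date(2020, 3, 2): 15,
    date(2020, 3, 3): 15,
    date(2020, 3, 5): 30,
}
new_cases = daily_new_cases(cumulative)
print(new_cases)
```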
Dataset of books called Some people : poems
workwithdata.com
Updated Apr 17, 2025
Cite
Work With Data (2025). Dataset of books called Some people : poems [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=book&fop0=%3D&fval0=Some+people+%3A+poems
Explore at:
Dataset updated
Apr 17, 2025
Dataset authored and provided by
Work With Data
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is about books. It has 1 row and is filtered where the book is Some people : poems. It features 7 columns including author, publication date, language, and book publisher.
People In Paintings Dataset
universe.roboflow.com
zip
Updated Aug 1, 2024
Cite
Roboflow 100 (2024). People In Paintings Dataset [Dataset]. https://universe.roboflow.com/roboflow-100/people-in-paintings/dataset/1
Explore at:
Available download formats: zip
Dataset updated
Aug 1, 2024
Dataset provided by
Roboflow (https://roboflow.com/)
Authors
Roboflow 100
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
People In Paintings Bounding Boxes
Description
This dataset was originally created by Raya Al. To see the current project, which may have been updated since this version, please go here: https://universe.roboflow.com/raya-al/french-paintings-dataset-d2vbe.

This dataset is part of RF100, an Intel-sponsored initiative to create a new object detection benchmark for model generalizability.

Access the RF100 Github repo: https://github.com/roboflow-ai/roboflow-100-benchmark
Black_People_Face_Recognition
huggingface.co
Updated May 30, 2024
Cite
AxonLabs (2024). Black_People_Face_Recognition [Dataset]. https://huggingface.co/datasets/AxonData/Black_People_Face_Recognition
Explore at:
Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
Dataset updated
May 30, 2024
Authors
AxonLabs
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Black people Face Detection Dataset: 3M+ Identities

Large human faces dataset for face recognition models (10M+ images)

Share your feedback with us and receive additional samples for free! 😊 The full version of the dataset is available for commercial usage: leave a request on our website, Axon Labs, to purchase the dataset 💰

Dataset targeting 1:N and 1:1 NIST face recognition tests. Dataset contains 3M individuals, each with 3-5 images containing their faces The… See the full description on the dataset page: https://huggingface.co/datasets/AxonData/Black_People_Face_Recognition.
United States Unemployment Rate
tradingeconomics.com
pt.tradingeconomics.com
csv, excel, json, xml
Cite
TRADING ECONOMICS, United States Unemployment Rate [Dataset]. https://tradingeconomics.com/united-states/unemployment-rate
Explore at:
Available download formats: excel, xml, csv, json
Dataset authored and provided by
TRADING ECONOMICS
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 31, 1948 - May 31, 2025
Area covered
United States
Description
The unemployment rate in the United States remained unchanged at 4.20 percent in May. This dataset provides the latest reported value for the United States Unemployment Rate, plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.
‘Young People Survey’ analyzed by Analyst-2
analyst-2.ai
Updated Aug 27, 2016
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2016). ‘Young People Survey’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-young-people-survey-40db/latest
Dataset updated
Aug 27, 2016
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Young People Survey’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/miroslavsabo/young-people-survey on 13 February 2022.

--- Dataset description provided by original source is as follows ---

Introduction

In 2013, students of the Statistics class at FSEV UK (https://fses.uniba.sk/en/) were asked to invite their friends to participate in this survey.

The data file (responses.csv) consists of 1010 rows and 150 columns (139 integer and 11 categorical).

For convenience, the original variable names were shortened in the data file. See the columns.csv file if you want to match the data with the original names.

The data contain missing values.
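
A first look at the missing values could be sketched as follows. This is a minimal illustration using only the Python standard library; the three-column sample and its column names are hypothetical stand-ins for responses.csv, and it assumes that a missing answer appears as an empty cell.

```python
import csv
import io

# Hypothetical excerpt standing in for responses.csv (assumption:
# missing answers are stored as empty cells).
SAMPLE = """Music,Movies,Age
5,4,20
,5,19
4,,
3,2,22
"""

def missing_per_column(handle):
    """Count data rows and empty cells per column of a CSV file object."""
    reader = csv.DictReader(handle)
    counts = {name: 0 for name in reader.fieldnames}
    rows = 0
    for row in reader:
        rows += 1
        for name in reader.fieldnames:
            # Treat None (short row) and blank strings as missing.
            if not (row.get(name) or "").strip():
                counts[name] += 1
    return rows, counts

rows, missing = missing_per_column(io.StringIO(SAMPLE))
for name, count in missing.items():
    print(f"{name}: {count} of {rows} values missing")
```

To run it on the real file, the in-memory sample could be replaced with open("responses.csv", newline="", encoding="utf-8"), assuming the file is in the working directory.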

The survey was presented to participants in both electronic and written form.

The original questionnaire was in the Slovak language and was later translated into English.

All participants were of Slovakian nationality, aged between 15 and 30.

The variables can be split into the following groups:

Music preferences (19 items)

Movie preferences (12 items)

Hobbies & interests (32 items)

Phobias (10 items)

Health habits (3 items)

Personality traits, views on life, & opinions (57 items)

Spending habits (7 items)

Demographics (10 items)

Research questions

Many different techniques can be used to answer many questions, e.g.

Clustering: Given the music preferences, do people make up any clusters of similar behavior?

Hypothesis testing: Do women fear certain phenomena significantly more than men? Do left-handed people have different interests than right-handed people?

Predictive modeling: Can we predict spending habits of a person from his/her interests and movie or music preferences?

Dimension reduction: Can we describe a large number of human interests by a smaller number of latent concepts?

Correlation analysis: Are there any connections between music and movie preferences?

Visualization: How to effectively visualize a lot of variables in order to gain some meaningful insights from the data?

(Multivariate) Outlier detection: A small number of participants often cheat and answer the questions randomly. Can you identify them? Hint: Local outlier factor may help.

Missing values analysis: Are there any patterns in missing responses? What is the optimal way of imputing the values in surveys?

Recommendations: If some of a user's interests are known, can we predict the others? Or, if we know what a person listens to, can we predict which kinds of movies he/she might like?
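
As an illustration of the outlier-detection hint above, the following is a minimal sketch of the local outlier factor in plain Python. It is a simplified variant, not a reference implementation: each point gets exactly k neighbours (ties broken by row order), and the five toy respondents with three answers each are invented for the example. Scores near 1 indicate a respondent whose answers resemble those of their neighbours; clearly larger scores flag candidates for a closer look.

```python
import math

def lof_scores(points, k=2):
    """Simplified local outlier factor: exactly k neighbours per point."""
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]
    # Indices of the k nearest neighbours of each point, excluding itself.
    neighbours = [
        sorted((j for j in range(n) if j != i), key=lambda j: dist[i][j])[:k]
        for i in range(n)
    ]
    # Distance from each point to its k-th nearest neighbour.
    k_dist = [dist[i][neighbours[i][-1]] for i in range(n)]

    def local_reachability_density(i):
        reach = [max(k_dist[j], dist[i][j]) for j in neighbours[i]]
        mean_reach = sum(reach) / k
        # Assumption: more than k identical rows is rare; guard anyway.
        return math.inf if mean_reach == 0 else 1.0 / mean_reach

    lrd = [local_reachability_density(i) for i in range(n)]
    return [sum(lrd[j] for j in neighbours[i]) / (k * lrd[i]) for i in range(n)]

# Hypothetical answers on the 1-5 scale; the last row is far from the rest.
answers = [(1, 1, 2), (1, 2, 2), (2, 1, 2), (2, 2, 1), (5, 5, 5)]
scores = lof_scores(answers, k=2)
for row, score in zip(answers, scores):
    print(row, round(score, 2))
# The last respondent receives the highest score.
```

On the full survey, missing values would have to be imputed or the affected rows dropped before distances can be computed, and k would need to be chosen considerably larger than 2.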

Past research

(in Slovak) Sleziak, P. - Sabo, M.: Gender differences in the prevalence of specific phobias. Forum Statisticum Slovacum. 2014, Vol. 10, No. 6. [Differences (gender + whether people lived in village/town) in the prevalence of phobias.]

Sabo, Miroslav. Multivariate Statistical Methods with Applications. Diss. Slovak University of Technology in Bratislava, 2014. [Clustering of variables (music preferences, movie preferences, phobias) + Clustering of people w.r.t. their interests.]

Questionnaire

MUSIC PREFERENCES

I enjoy listening to music.: Strongly disagree 1-2-3-4-5 Strongly agree (integer)

I prefer.: Slow paced music 1-2-3-4-5 Fast paced music (integer)

Dance, Disco, Funk: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Folk music: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Country: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Classical: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Musicals: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Pop: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Rock: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Metal, Hard rock: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Punk: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Hip hop, Rap: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Reggae, Ska: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Swing, Jazz: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Rock n Roll: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Alternative music: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Latin: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Techno, Trance: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Opera: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

MOVIE PREFERENCES

I really enjoy watching movies.: Strongly disagree 1-2-3-4-5 Strongly agree (integer)

Horror movies: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Thriller movies: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Comedies: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Romantic movies: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Sci-fi movies: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

War movies: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Tales: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Cartoons: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Documentaries: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Western movies: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

Action movies: Don't enjoy at all 1-2-3-4-5 Enjoy very much (integer)

HOBBIES & INTERESTS

History: Not interested 1-2-3-4-5 Very interested (integer)

Psychology: Not interested 1-2-3-4-5 Very interested (integer)

Politics: Not interested 1-2-3-4-5 Very interested (integer)

Mathematics: Not interested 1-2-3-4-5 Very interested (integer)

Physics: Not interested 1-2-3-4-5 Very interested (integer)

Internet: Not interested 1-2-3-4-5 Very interested (integer)

PC Software, Hardware: Not interested 1-2-3-4-5 Very interested (integer)

Economy, Management: Not interested 1-2-3-4-5 Very interested (integer)

Biology: Not interested 1-2-3-4-5 Very interested (integer)

Chemistry: Not interested 1-2-3-4-5 Very interested (integer)

Poetry reading: Not interested 1-2-3-4-5 Very interested (integer)

Geography: Not interested 1-2-3-4-5 Very interested (integer)

Foreign languages: Not interested 1-2-3-4-5 Very interested (integer)

Medicine: Not interested 1-2-3-4-5 Very interested (integer)

Law: Not interested 1-2-3-4-5 Very interested (integer)

Cars: Not interested 1-2-3-4-5 Very interested (integer)

Art: Not interested 1-2-3-4-5 Very interested (integer)

Religion: Not interested 1-2-3-4-5 Very interested (integer)

Outdoor activities: Not interested 1-2-3-4-5 Very interested (integer)

Dancing: Not interested 1-2-3-4-5 Very interested (integer)

Playing musical instruments: Not interested 1-2-3-4-5 Very interested (integer)

Poetry writing: Not interested 1-2-3-4-5 Very interested (integer)

Sport and leisure activities: Not interested 1-2-3-4-5 Very interested (integer)

Sport at competitive level: Not interested 1-2-3-4-5 Very interested (integer)

Gardening: Not interested 1-2-3-4-5 Very interested (integer)

Celebrity lifestyle: Not interested 1-2-3-4-5 Very interested (integer)

Shopping: Not interested 1-2-3-4-5 Very interested (integer)

Science and technology: Not interested 1-2-3-4-5 Very interested (integer)

Theatre: Not interested 1-2-3-4-5 Very interested (integer)

Socializing: Not interested 1-2-3-4-5 Very interested (integer)

Adrenaline sports: Not interested 1-2-3-4-5 Very interested (integer)

Pets: Not interested 1-2-3-4-5 Very interested (integer)

PHOBIAS

Flying: Not afraid at all 1-2-3-4-5 Very afraid of (integer)

Thunder, lightning: Not afraid at all 1-2-3-4-5 Very afraid of (integer)

Darkness: Not afraid at all 1-2-3-4-5 Very afraid of (integer)

Heights: Not afraid at all 1-2-3-4-5 Very afraid of (integer)

Spiders: Not afraid at all 1-2-3-4-5 Very afraid of (integer)

Snakes: Not afraid at all 1-2-3-4-5 Very afraid of (integer)

Rats, mice: Not afraid at all 1-2-3-4-5 Very afraid of (integer)

Ageing: Not afraid at all 1-2-3-4-5 Very afraid of (integer)

Dangerous dogs: Not afraid at all 1-2-3-4-5 Very afraid of (integer)

Public speaking: Not afraid at all 1-2-3-4-5 Very afraid of (integer)

HEALTH HABITS

Smoking habits: Never smoked - Tried smoking - Former smoker - Current smoker (categorical)

Drinking: Never - Social drinker - Drink a lot (categorical)

I live a very healthy lifestyle.: Strongly disagree 1-2-3-4-5 Strongly agree (integer)

PERSONALITY TRAITS, VIEWS ON LIFE & OPINIONS

I take notice of what goes on around me.: Strongly disagree 1-2-3-4-5 Strongly agree (integer)

I try to do tasks as soon as possible and not leave them until last minute.: Strongly disagree 1-2-3-4-5 Strongly agree (integer)

I always make a list so I don't forget anything.: Strongly disagree 1-2-3-4-5 Strongly agree (integer)

I often study or work even in my spare time.: Strongly disagree 1-2-3-4-5 Strongly agree (integer)

I look at things from all different angles before I go ahead.: Strongly disagree 1-2-3-4-5 Strongly agree (integer)

I believe that bad people will suffer one day and good people will be rewarded.: Strongly disagree 1-2-3-4-5 Strongly agree (integer)

I am reliable at work and always complete all tasks given to me.: Strongly disagree 1-2-3-4-5 Strongly agree (integer)

I always keep my promises.: Strongly disagree 1-2-3-4-5 Strongly agree (integer)

**I can fall for someone very quickly and then
