Unadjusted decennial census data from 1950-2000 and projected figures from 2010-2040: summary table of New York City population numbers and percentage share by Borough, including school-age (5 to 17), 65 and Over, and total population.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
NYC Open Data is an opportunity to engage New Yorkers in the information that is produced and used by City government. We believe that every New Yorker can benefit from Open Data, and Open Data can benefit from every New Yorker. Source: https://opendata.cityofnewyork.us/overview/
Thanks to NYC Open Data, which makes public data generated by city agencies available for public use, and Citi Bike, we've incorporated over 150 GB of data in 5 open datasets into Google BigQuery Public Datasets, including:
Over 8 million 311 service requests from 2012-2016
More than 1 million motor vehicle collisions 2012-present
Citi Bike stations and 30 million Citi Bike trips 2013-present
Over 1 billion Yellow and Green Taxi rides from 2009-present
Over 500,000 sidewalk trees surveyed decennially in 1995, 2005, and 2015
This dataset is deprecated and not being updated.
Fork this kernel to get started with this dataset.
https://opendata.cityofnewyork.us/
This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - https://data.cityofnewyork.us/ - and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
By accessing datasets and feeds available through NYC Open Data, the user agrees to all of the Terms of Use of NYC.gov as well as the Privacy Policy for NYC.gov. The user also agrees to any additional terms of use defined by the agencies, bureaus, and offices providing data. Public data sets made available on NYC Open Data are provided for informational purposes. The City does not warranty the completeness, accuracy, content, or fitness for any particular purpose or use of any public data set made available on NYC Open Data, nor are any such warranties to be implied or inferred with respect to the public data sets furnished therein.
The City is not liable for any deficiencies in the completeness, accuracy, content, or fitness for any particular purpose or use of any public data set, or application utilizing such data set, provided by any third party.
Banner Photo by @bicadmedia from Unplash.
On which New York City streets are you most likely to find a loud party?
Can you find the Virginia Pines in New York City?
Where was the only collision caused by an animal that injured a cyclist?
What’s the Citi Bike record for the Longest Distance in the Shortest Time (on a route with at least 100 rides)?
https://cloud.google.com/blog/big-data/2017/01/images/148467900588042/nyc-dataset-6.png" alt="enter image description here">
https://cloud.google.com/blog/big-data/2017/01/images/148467900588042/nyc-dataset-6.png
This dataset provides benefit, program, and resource information for over 80 health and human services available to NYC residents in all eleven local law languages. The data is kept up-to-date, including the most recent applications, eligibility requirements, and application dates. Information in this dataset is used on ACCESS NYC, Generation NYC, and Growing Up NYC. Reach out to products@nycopportunity.nyc.gov if you have any questions about this dataset. This data makes it easier for NYC residents to discover and be aware of multiple benefits they may be eligible for. NYC Opportunity Product team works with 15+ government agencies to collect and update this data. Each record in the dataset represents a benefit or program. Blank fields are NULL values in this dataset. The data can be used to develop new websites or directory resources to help residents to discover benefits they need. For access to the multilingual version of this dataset, please follow this link: https://data.cityofnewyork.us/City-Government/Benefits-and-Programs-Multilingual-Dataset/yjpx-srhp
Population Numbers By New York City Neighborhood Tabulation Areas The data was collected from Census Bureaus' Decennial data dissemination (SF1). Neighborhood Tabulation Areas (NTAs), are aggregations of census tracts that are subsets of New York City's 55 Public Use Microdata Areas (PUMAs). Primarily due to these constraints, NTA boundaries and their associated names may not definitively represent neighborhoods. This report shows change in population from 2000 to 2010 for each NTA. Compiled by the Population Division – New York City Department of City Planning.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the New York population over the last 20 plus years. It lists the population for each year, along with the year on year change in population, as well as the change in percentage terms for each year. The dataset can be utilized to understand the population change of New York across the last two decades. For example, using this dataset, we can identify if the population is declining or increasing. If there is a change, when the population peaked, or if it is still growing and has not reached its peak. We can also compare the trend with the overall trend of United States population over the same period of time.
Key observations
In 2023, the population of New York was 8.26 million, a 0.93% decrease year-by-year from 2022. Previously, in 2022, New York population was 8.34 million, a decline of 1.49% compared to a population of 8.46 million in 2021. Over the last 20 plus years, between 2000 and 2023, population of New York increased by 242,826. In this period, the peak population was 8.74 million in the year 2020. The numbers suggest that the population has already reached its peak and is showing a trend of decline. Source: U.S. Census Bureau Population Estimates Program (PEP).
When available, the data consists of estimates from the U.S. Census Bureau Population Estimates Program (PEP).
Data Coverage:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for New York Population by Year. You can refer the same here
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
There are a number of Kaggle datasets that provide spatial data around New York City. For many of these, it may be quite interesting to relate the data to the demographic and economic characteristics of nearby neighborhoods. I hope this data set will allow for making these comparisons without too much difficulty.
Exploring the data and making maps could be quite interesting as well.
This dataset contains two CSV files:
nyc_census_tracts.csv
This file contains a selection of census data taken from the ACS DP03 and DP05 tables. Things like total population, racial/ethnic demographic information, employment and commuting characteristics, and more are contained here. There is a great deal of additional data in the raw tables retrieved from the US Census Bureau website, so I could easily add more fields if there is enough interest.
I obtained data for individual census tracts, which typically contain several thousand residents.
census_block_loc.csv
For this file, I used an online FCC census block lookup tool to retrieve the census block code for a 200 x 200 grid containing
New York City and a bit of the surrounding area. This file contains the coordinates and associated census block codes along
with the state and county names to make things a bit more readable to users.
Each census tract is split into a number of blocks, so one must extract the census tract code from the block code.
The data here was taken from the American Community Survey 2015 5-year estimates (https://factfinder.census.gov/faces/nav/jsf/pages/index.xhtml).
The census block coordinate data was taken from the FCC Census Block Conversions API (https://www.fcc.gov/general/census-block-conversions-api)
As public data from the US government, this is not subject to copyright within the US and should be considered public domain.
Many residents of New York City speak more than one language; a number of them speak and understand non-English languages more fluently than English. This dataset, derived from the Census Bureau's American Community Survey (ACS), includes information on over 1.7 million limited English proficient (LEP) residents and a subset of that population called limited English proficient citizens of voting age (CVALEP) at the Community District level. There are 59 community districts throughout NYC, with each district being represented by a Community Board.
This dataset contains information about NYC Business Solutions service, a service offered by the Department of Small Business Services (SBS) aimed at giving New Yorkers free services to start, operate and grow their businesses. Each row in the dataset represents the number of public housing residents on a City Council District-level who receive or utilize this service. For datasets related to other services provided to NYCHA residents, view the data collection “Services available to NYCHA Residents - Local Law 163”.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the Non-Hispanic population of Manhattan by race. It includes the distribution of the Non-Hispanic population of Manhattan across various race categories as identified by the Census Bureau. The dataset can be utilized to understand the Non-Hispanic population distribution of Manhattan across relevant racial categories.
Key observations
Of the Non-Hispanic population in Manhattan, the largest racial group is White alone with a population of 40,666 (83.14% of the total Non-Hispanic population).
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.
Racial categories include:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Manhattan Population by Race & Ethnicity. You can refer the same here
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘NYC Business Solutions for NYCHA Residents by City Council District - Local Law 163’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/4f451d71-4f1d-4a58-a207-88319c381957 on 27 January 2022.
--- Dataset description provided by original source is as follows ---
This dataset contains information about NYC Business Solutions service, a service offered by the Department of Small Business Services (SBS) aimed at giving New Yorkers free services to start, operate and grow their businesses. Each row in the dataset represents the number of public housing residents on a City Council District-level who receive or utilize this service. For datasets related to other services provided to NYCHA residents, view the data collection “Services available to NYCHA Residents - Local Law 163”.
--- Original source retains full ownership of the source dataset ---
This dataset contains information on antibody testing for COVID-19: the number of people who received a test, the number of people with positive results, the percentage of people tested who tested positive, and the rate of testing per 100,000 people, stratified by ZIP Code Tabulation Area (ZCTA) neighborhood poverty group. These data can also be accessed here: https://github.com/nychealth/coronavirus-data/blob/master/totals/antibody-by-poverty.csv Exposure to COVID-19 can be detected by measuring antibodies to the disease in a person’s blood, which can indicate that a person may have had an immune response to the virus. Antibodies are proteins produced by the body’s immune system that can be found in the blood. People can test positive for antibodies after they have been exposed, sometimes when they no longer test positive for the virus itself. It is important to note that the science around COVID-19 antibody tests is evolving rapidly and there is still much uncertainty about what individual antibody test results mean for a single person and what population-level antibody test results mean for understanding the epidemiology of COVID-19 at a population level. These data only provide information on people tested. People receiving an antibody test do not reflect all people in New York City; therefore, these data may not reflect antibody prevalence among all New Yorkers. Increasing instances of screening programs further impact the generalizability of these data, as screening programs influence who and how many people are tested over time. Examples of screening programs in NYC include: employers screening their workers (e.g., hospitals), and long-term care facilities screening their residents. In addition, there may be potential biases toward people receiving an antibody test who have a positive result because people who were previously ill are preferentially seeking testing, in addition to the testing of persons with higher exposure (e.g., health care workers, first responders.) Neighborhood-level poverty groups were classified in a manner consistent with Health Department practices to describe and monitor disparities in health in NYC. Neighborhood poverty measures are defined as the percentage of people earning below the Federal Poverty Threshold (FPT) within a ZCTA. The standard cut-points for defining categories of neighborhood-level poverty in NYC are: • Low: <10% of residents in ZCTA living below the FPT • Medium: 10% to <20% • High: 20% to <30% • Very high: ≥30% residents living below the FPT The ZCTAs used for classification reflect the first non-missing address within NYC for each person reported with an antibody test result. Rates were calculated using interpolated intercensal population estimates updated in 2019. These rates differ from previously reported rates based on the 2000 Census or previous versions of population estimates. The Health Department produced these population estimates based on estimates from the U.S. Census Bureau and NYC Department of City Planning. Rates for poverty were calculated using direct standardization for age at diagnosis and weighting by the US 2000 standard population. Antibody tests are categorized based on the date of specimen collection and are aggregated by full weeks starting each Sunday and ending on Saturday. For example, a person whose blood was collected for antibody testing on Wednesday, May 6 would be categorized as tested during the week ending May 9. A person tested twice in one week would only be counted once in that week. This dataset includes testing data beginning April 5, 2020. Data are updated daily, and the dataset preserves historical records and source data changes, so each extract date reflects the current copy of the data as of that date. For example, an extract date of 11/04/2020 and extract date of 11/03/2020 will both contain all records as they were as of that extract date. Without filtering or grouping by extract date, an analysis will almost certain
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the Non-Hispanic population of New York by race. It includes the distribution of the Non-Hispanic population of New York across various race categories as identified by the Census Bureau. The dataset can be utilized to understand the Non-Hispanic population distribution of New York across relevant racial categories.
Key observations
Of the Non-Hispanic population in New York, the largest racial group is White alone with a population of 2.67 million (43.74% of the total Non-Hispanic population).
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.
Racial categories include:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for New York Population by Race & Ethnicity. You can refer the same here
Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
License information was derived automatically
NYC Neighborhoods polygons and correlated data with their respective Postal Codes, Assembly Districts, Community Districts, Congressional Districts, Council Districts and State Senate Districts created by Ontodia. There are hundreds of neighborhoods in New York City's five boroughs, each with unique characteristics and histories. Many historical neighborhood names are derived from the names of the previously independent villages, towns, and cities that were incorporated into into the City of New York in the consolidation of 1898. Other neighborhood names have been introduced by real estate developers and urban planners, sometimes contentiously. Boundaries of neighborhoods are notoriously fuzzy, although many boundaries are widely agreed upon. Complicating the definition of neighborhood further, boundaries may overlap, some neighborhoods may function as a micro-neighborhood within another neighborhood, or a larger district which can be made up of multiple neighborhoods. Names and boundaries of neighborhoods shift over time; they are determined by the collective conscious of the people who live, work, and play in these places. There is never an official version of neighborhoods, but the concept is deeply meaningful to many people. In many cases a New Yorker is just as proud to claim identity with a particular neighborhood, and visitors plan their trips around visits to specific neighborhoods. To display data about neighborhoods on NYCpedia we created our own neighborhood boundaries, 264 in all. In order to display a continuous map with no overlap some boundaries have been stretched or shrunk, and neighborhoods have been omitted in this version. We intend to expand our work developing neighborhood polygon files (all released with open source license) and also to collect and organize as many meaningful alternative versions of neighborhood boundaries as possible. If you are a map geek or software developer who builds apps about New York City you can find the shapefile and geoJSON of the NYCpedia neighborhoods on Data Wrangler. Drop us a line if you see any errors, or if you have suggestions for how to improve our conception of NYC geography.
Table of Census Demographics represented at the NYC City Council district level
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset is now updated annually here.
This dataset contains the salary, pay rate, and total compensation of every New York City employee. In this dataset this information is provided for the 2014, 2015, 2016, and 2017 fiscal years, and provides a transparent lens into who gets paid how much and for what.
Note that fiscal years in the New York City budget cycle start on July 1st and end on June 30th (see here). That means that this dataset contains, in its sum, compensation information for all City of New York employees for the period July 1, 2014 to June 30, 2017.
This dataset provides columns for fiscal year, employee name, the city department they work for, their job title, and various fields describing their compensation. The most important of these fields is "Regular Gross Pay", which provides that employee's total compensation.
This information was published as-is by the City of New York.
This datasets contains information about NYCHA residents’ use of:
a) NYC Financial Empowerment Centers: a program that provides free, one-on-one professional financial counseling and coaching to all NYC residents. Each row in the dataset represents the number of NYCHA residents on a Borough-level who utilized this service;
b) EmpoweredNYC: is an initiative to assist New Yorkers with disabilities and their families to better manage their finances and become more financially stable. Each row in the dataset represents the number of NYCHA residents on a Borough-level who utilized this service;
c) Student Loan Debt clinic: is an initiative to help New Yorkers understand their student loans and how to repay them. Each row in the dataset represents the number of NYCHA residents on a Borough-level who utilized this service; and
d) Ready to Rent: a program providing free one-on-one financial counseling to New Yorkers seeking to apply for affordable housing units through HPD’s Housing Connect lottery. Each row in the dataset represents the number of NYCHA residents on a Borough-level who utilized this service.
The dataset is part of the annual report compiled by the Mayor’s Office of Operations as mandated by the Local Law 163 of 2016 on different services provided to NYCHA residents. See other datasets in this report by searching the keyword “Services available to NYCHA Residents - Local Law 163 (2016)” on the Open Data Portal.
This dataset shows daily citywide counts of persons tested by nucleic acid amplification tests (NAAT, also known as a molecular test; e.g. a PCR test) for SARS-CoV-2 , counts of persons with positive tests, and the percent positivity. Also included is a calculation of the average percent positivity over a 7-day period. NAAT tests work through direct detection of the virus’s genetic material, and typically involve collecting a nasal swab. These tests are highly accurate and recommended for diagnosing current COVID-19 infection. After specimen collection, molecular tests are processed in a laboratory, and results are electronically reported to the New York State (NYS) Electronic Clinical Laboratory Results System (ECLRS). Test results for NYC residents are then sent electronically to NYC DOHMH. There is typically a lag of a few days between when a specimen is collected and when a result is reported to NYC DOHMH. Data is sourced from electronic laboratory reporting from NYS ECLRS. All identifying health information is excluded from the dataset.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘NYC Business Solutions for NYCHA Residents by Borough - Local Law 163’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/104ce120-d4eb-4528-9944-6d2a7fcda917 on 13 February 2022.
--- Dataset description provided by original source is as follows ---
This dataset contains information about NYC Business Solutions service, a service offered by the Department of Small Business Services (SBS) aimed at giving New Yorkers free services to start, operate and grow their businesses. Each row in the dataset represents the number of public housing residents on a Borough-level who receive or utilize this service.
For datasets related to other services provided to NYCHA residents, view the data collection “Services available to NYCHA Residents - Local Law 163”.
--- Original source retains full ownership of the source dataset ---
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Daily inmates in custody with attributes (custody level, mental health designation, race, gender, age, leagal status, sealed status, security risk group membership, top charge, and infraction flag). This data set excludes Sealed Cases. Resulting summaries may differ slightly from other published statistics.
This is a dataset hosted by the City of New York. The city has an open data platform found here and they update their information according the amount of data that is brought in. Explore New York City using Kaggle and all of the data sources available through the City of New York organization page!
This dataset is maintained using Socrata's API and Kaggle's API. Socrata has assisted countless organizations with hosting their open data and has been an integral part of the process of bringing more data to the public.
Cover photo by Fredrik Öhlander on Unsplash
Unsplash Images are distributed under a unique Unsplash License.
This map answers the question "What is the most common, or predominant, education level for people in this area?" The map shows predominant educational attainment in each census tract. Darker colors indicate a greater gap between the predominant group and the next largest group.The U.S. Census Bureau asks citizens to indicate how far they went in formal education. The database includes seven different columns, each representing a count of population by that education level. A simple routine in compares the seven columns of information, and finds which one has the highest value, writing that to a string field. Each tract's transparency is set by a transparency field added to the data.Predominance maps can be created in ArcGIS Online by adding two fields, calculating their values, and setting up the renderer based on those two fields. See this blog by Jim Herries for details on how to create a predominance map in ArcGIS Online from any feature layer.See this GitHub repo by Jennifer Bell for a script you can run in ArcMap as a script tool, to calculate predominance for any columns of data you have.
Unadjusted decennial census data from 1950-2000 and projected figures from 2010-2040: summary table of New York City population numbers and percentage share by Borough, including school-age (5 to 17), 65 and Over, and total population.