description: Population Numbers By New York City Neighborhood Tabulation Areas The data was collected from Census Bureaus' Decennial data dissemination (SF1). Neighborhood Tabulation Areas (NTAs), are aggregations of census tracts that are subsets of New York City's 55 Public Use Microdata Areas (PUMAs). Primarily due to these constraints, NTA boundaries and their associated names may not definitively represent neighborhoods. This report shows change in population from 2000 to 2010 for each NTA. Compiled by the Population Division New York City Department of City Planning.; abstract: Population Numbers By New York City Neighborhood Tabulation Areas The data was collected from Census Bureaus' Decennial data dissemination (SF1). Neighborhood Tabulation Areas (NTAs), are aggregations of census tracts that are subsets of New York City's 55 Public Use Microdata Areas (PUMAs). Primarily due to these constraints, NTA boundaries and their associated names may not definitively represent neighborhoods. This report shows change in population from 2000 to 2010 for each NTA. Compiled by the Population Division New York City Department of City Planning.
New York City Population By Community Districts The data was collected from Census Bureaus' Decennial data dissemination (SF1) for the years 1970, 1980, 1990, 2000 and 2010. Compiled by the Population Division – New York City Department of City Planning
Unadjusted decennial census data from 1950-2000 and projected figures from 2010-2040: summary table of New York City population numbers and percentage share by Borough, including school-age (5 to 17), 65 and Over, and total population.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
There are a number of Kaggle datasets that provide spatial data around New York City. For many of these, it may be quite interesting to relate the data to the demographic and economic characteristics of nearby neighborhoods. I hope this data set will allow for making these comparisons without too much difficulty.
Exploring the data and making maps could be quite interesting as well.
This dataset contains two CSV files:
nyc_census_tracts.csv
This file contains a selection of census data taken from the ACS DP03 and DP05 tables. Things like total population, racial/ethnic demographic information, employment and commuting characteristics, and more are contained here. There is a great deal of additional data in the raw tables retrieved from the US Census Bureau website, so I could easily add more fields if there is enough interest.
I obtained data for individual census tracts, which typically contain several thousand residents.
census_block_loc.csv
For this file, I used an online FCC census block lookup tool to retrieve the census block code for a 200 x 200 grid containing
New York City and a bit of the surrounding area. This file contains the coordinates and associated census block codes along
with the state and county names to make things a bit more readable to users.
Each census tract is split into a number of blocks, so one must extract the census tract code from the block code.
The data here was taken from the American Community Survey 2015 5-year estimates (https://factfinder.census.gov/faces/nav/jsf/pages/index.xhtml).
The census block coordinate data was taken from the FCC Census Block Conversions API (https://www.fcc.gov/general/census-block-conversions-api)
As public data from the US government, this is not subject to copyright within the US and should be considered public domain.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘New York City Population By Neighborhood Tabulation Areas’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/903f4a4a-0402-4ff8-b8bd-f1d6ad856d0c on 13 February 2022.
--- Dataset description provided by original source is as follows ---
Population Numbers By New York City Neighborhood Tabulation Areas
The data was collected from Census Bureaus' Decennial data dissemination (SF1). Neighborhood Tabulation Areas (NTAs), are aggregations of census tracts that are subsets of New York City's 55 Public Use Microdata Areas (PUMAs). Primarily due to these constraints, NTA boundaries and their associated names may not definitively represent neighborhoods. This report shows change in population from 2000 to 2010 for each NTA. Compiled by the Population Division – New York City Department of City Planning.
--- Original source retains full ownership of the source dataset ---
This dataset contains information on antibody testing for COVID-19: the number of people who received a test, the number of people with positive results, the percentage of people tested who tested positive, and the rate of testing per 100,000 people, stratified by ZIP Code Tabulation Area (ZCTA) neighborhood poverty group. These data can also be accessed here: https://github.com/nychealth/coronavirus-data/blob/master/totals/antibody-by-poverty.csv Exposure to COVID-19 can be detected by measuring antibodies to the disease in a person’s blood, which can indicate that a person may have had an immune response to the virus. Antibodies are proteins produced by the body’s immune system that can be found in the blood. People can test positive for antibodies after they have been exposed, sometimes when they no longer test positive for the virus itself. It is important to note that the science around COVID-19 antibody tests is evolving rapidly and there is still much uncertainty about what individual antibody test results mean for a single person and what population-level antibody test results mean for understanding the epidemiology of COVID-19 at a population level. These data only provide information on people tested. People receiving an antibody test do not reflect all people in New York City; therefore, these data may not reflect antibody prevalence among all New Yorkers. Increasing instances of screening programs further impact the generalizability of these data, as screening programs influence who and how many people are tested over time. Examples of screening programs in NYC include: employers screening their workers (e.g., hospitals), and long-term care facilities screening their residents. In addition, there may be potential biases toward people receiving an antibody test who have a positive result because people who were previously ill are preferentially seeking testing, in addition to the testing of persons with higher exposure (e.g., health care workers, first responders.) Neighborhood-level poverty groups were classified in a manner consistent with Health Department practices to describe and monitor disparities in health in NYC. Neighborhood poverty measures are defined as the percentage of people earning below the Federal Poverty Threshold (FPT) within a ZCTA. The standard cut-points for defining categories of neighborhood-level poverty in NYC are: • Low: <10% of residents in ZCTA living below the FPT • Medium: 10% to <20% • High: 20% to <30% • Very high: ≥30% residents living below the FPT The ZCTAs used for classification reflect the first non-missing address within NYC for each person reported with an antibody test result. Rates were calculated using interpolated intercensal population estimates updated in 2019. These rates differ from previously reported rates based on the 2000 Census or previous versions of population estimates. The Health Department produced these population estimates based on estimates from the U.S. Census Bureau and NYC Department of City Planning. Rates for poverty were calculated using direct standardization for age at diagnosis and weighting by the US 2000 standard population. Antibody tests are categorized based on the date of specimen collection and are aggregated by full weeks starting each Sunday and ending on Saturday. For example, a person whose blood was collected for antibody testing on Wednesday, May 6 would be categorized as tested during the week ending May 9. A person tested twice in one week would only be counted once in that week. This dataset includes testing data beginning April 5, 2020. Data are updated daily, and the dataset preserves historical records and source data changes, so each extract date reflects the current copy of the data as of that date. For example, an extract date of 11/04/2020 and extract date of 11/03/2020 will both contain all records as they were as of that extract date. Without filtering or grouping by extract date, an analysis will almost certain
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
NYC Open Data is an opportunity to engage New Yorkers in the information that is produced and used by City government. We believe that every New Yorker can benefit from Open Data, and Open Data can benefit from every New Yorker. Source: https://opendata.cityofnewyork.us/overview/
Thanks to NYC Open Data, which makes public data generated by city agencies available for public use, and Citi Bike, we've incorporated over 150 GB of data in 5 open datasets into Google BigQuery Public Datasets, including:
Over 8 million 311 service requests from 2012-2016
More than 1 million motor vehicle collisions 2012-present
Citi Bike stations and 30 million Citi Bike trips 2013-present
Over 1 billion Yellow and Green Taxi rides from 2009-present
Over 500,000 sidewalk trees surveyed decennially in 1995, 2005, and 2015
This dataset is deprecated and not being updated.
Fork this kernel to get started with this dataset.
https://opendata.cityofnewyork.us/
This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - https://data.cityofnewyork.us/ - and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
By accessing datasets and feeds available through NYC Open Data, the user agrees to all of the Terms of Use of NYC.gov as well as the Privacy Policy for NYC.gov. The user also agrees to any additional terms of use defined by the agencies, bureaus, and offices providing data. Public data sets made available on NYC Open Data are provided for informational purposes. The City does not warranty the completeness, accuracy, content, or fitness for any particular purpose or use of any public data set made available on NYC Open Data, nor are any such warranties to be implied or inferred with respect to the public data sets furnished therein.
The City is not liable for any deficiencies in the completeness, accuracy, content, or fitness for any particular purpose or use of any public data set, or application utilizing such data set, provided by any third party.
Banner Photo by @bicadmedia from Unplash.
On which New York City streets are you most likely to find a loud party?
Can you find the Virginia Pines in New York City?
Where was the only collision caused by an animal that injured a cyclist?
What’s the Citi Bike record for the Longest Distance in the Shortest Time (on a route with at least 100 rides)?
https://cloud.google.com/blog/big-data/2017/01/images/148467900588042/nyc-dataset-6.png" alt="enter image description here">
https://cloud.google.com/blog/big-data/2017/01/images/148467900588042/nyc-dataset-6.png
This resource is a member of a series. The TIGER/Line shapefiles and related database files (.dbf) are an extract of selected geographic and cartographic information from the U.S. Census Bureau's Master Address File / Topologically Integrated Geographic Encoding and Referencing (MAF/TIGER) Database (MTDB). The MTDB represents a seamless national file with no overlaps or gaps between parts, however, each TIGER/Line shapefile is designed to stand alone as an independent data set, or they can be combined to cover the entire nation. Block Groups (BGs) are clusters of blocks within the same census tract. Each census tract contains at least one BG, and BGs are uniquely numbered within census tracts. BGs have a valid code range of 0 through 9. BGs have the same first digit of their 4-digit census block number from the same decennial census. For example, tabulation blocks numbered 3001, 3002, 3003,.., 3999 within census tract 1210.02 are also within BG 3 within that census tract. BGs coded 0 are intended to only include water area, no land area, and they are generally in territorial seas, coastal water, and Great Lakes water areas. Block groups generally contain between 600 and 3,000 people. A BG usually covers a contiguous area but never crosses county or census tract boundaries. They may, however, cross the boundaries of other geographic entities like county subdivisions, places, urban areas, voting districts, congressional districts, and American Indian / Alaska Native / Native Hawaiian areas. The BG boundaries in this release are those that were delineated as part of the Census Bureau's Participant Statistical Areas Program (PSAP) for the 2020 Census.
Table of ACS Demographics and profile represented at the NTA level. NTAs are aggregations of census tracts that are subsets of New York City's 55 Public Use Microdata Areas (PUMAs)
Table of Census Demographics represented at the NYC City Council district level
https://www.newyork-demographics.com/terms_and_conditionshttps://www.newyork-demographics.com/terms_and_conditions
A dataset listing New York cities by population for 2024.
This dataset contains information on antibody testing for COVID-19: the number of people who received a test, the number of people with positive results, the percentage of people tested who tested positive, and the rate of testing per 100,000 people, stratified by modified ZIP Code Tabulation Area (ZCTA) of residence. Modified ZCTA reflects the first non-missing address within NYC for each person reported with an antibody test result. This unit of geography is similar to ZIP codes but combines census blocks with smaller populations to allow more stable estimates of population size for rate calculation. It can be challenging to map data that are reported by ZIP Code. A ZIP Code doesn’t refer to an area, but rather a collection of points that make up a mail delivery route. Furthermore, there are some buildings that have their own ZIP Code, and some non-residential areas with ZIP Codes. To deal with the challenges of ZIP Codes, the Health Department uses ZCTAs which solidify ZIP codes into units of area. Often, data reported by ZIP code are actually mapped by ZCTA. The ZCTA geography was developed by the U.S. Census Bureau. These data can also be accessed here: https://github.com/nychealth/coronavirus-data/blob/master/totals/antibody-by-modzcta.csv Exposure to COVID-19 can be detected by measuring antibodies to the disease in a person’s blood, which can indicate that a person may have had an immune response to the virus. Antibodies are proteins produced by the body’s immune system that can be found in the blood. People can test positive for antibodies after they have been exposed, sometimes when they no longer test positive for the virus itself. It is important to note that the science around COVID-19 antibody tests is evolving rapidly and there is still much uncertainty about what individual antibody test results mean for a single person and what population-level antibody test results mean for understanding the epidemiology of COVID-19 at a population level. These data only provide information on people tested. People receiving an antibody test do not reflect all people in New York City; therefore, these data may not reflect antibody prevalence among all New Yorkers. Increasing instances of screening programs further impact the generalizability of these data, as screening programs influence who and how many people are tested over time. Examples of screening programs in NYC include: employers screening their workers (e.g., hospitals), and long-term care facilities screening their residents. In addition, there may be potential biases toward people receiving an antibody test who have a positive result because people who were previously ill are preferentially seeking testing, in addition to the testing of persons with higher exposure (e.g., health care workers, first responders) Rates were calculated using interpolated intercensal population estimates updated in 2019. These rates differ from previously reported rates based on the 2000 Census or previous versions of population estimates. The Health Department produced these population estimates based on estimates from the U.S. Census Bureau and NYC Department of City Planning. Antibody tests are categorized based on the date of specimen collection and are aggregated by full weeks starting each Sunday and ending on Saturday. For example, a person whose blood was collected for antibody testing on Wednesday, May 6 would be categorized as tested during the week ending May 9. A person tested twice in one week would only be counted once in that week. This dataset includes testing data beginning April 5, 2020. Data are updated daily, and the dataset preserves historical records and source data changes, so each extract date reflects the current copy of the data as of that date. For example, an extract date of 11/04/2020 and extract date of 11/03/2020 will both contain all records as they were as of that extract date. Without filtering or grouping by extract date, an analysis wi
Selected demographic, social, economic, and housing estimates data by community district/PUMA (Public Use Micro Data Sample Area). Three year estimates of population data from the Census Bureau's American Community Survey
2020 Census Tracts from the US Census for New York City. These boundary files are derived from the US Census Bureau's TIGER data products and have been geographically modified to fit the New York City base map. All previously released versions of this data are available at BYTES of the BIG APPLE- Archive.
The New York City Department of Health and Mental Hygiene (NYC DOHMH) has shared vital statistics data (birth and mortality data) online. Birth data includes demographic information on the mother, including age, race, and education. Mortality data includes demographic information on the deceased, such as age, sex, race, and education. The publicly-available birth and death micro-SAS datasets provide aggregate data on the community district, zip code, and census tract levels. Researchers may also complete an application process to request line-listed and de-identified vital statistics data from NYC DOHMH.
Many residents of New York City speak more than one language; a number of them speak and understand non-English languages more fluently than English. This dataset, derived from the Census Bureau's American Community Survey (ACS), includes information on over 1.7 million limited English proficient (LEP) residents and a subset of that population called limited English proficient citizens of voting age (CVALEP) at the Community District level. There are 59 community districts throughout NYC, with each district being represented by a Community Board.
http://reference.data.gov.uk/id/open-government-licencehttp://reference.data.gov.uk/id/open-government-licence
A range of indicators for a selection of cities from the New York City Global City database.
Dataset includes the following:
Geography
City Area (km2)
Metro Area (km2)
People
City Population (millions)
Metro Population (millions)
Foreign Born
Annual Population Growth
Economy
GDP Per Capita (thousands $, PPP rates, per resident)
Primary Industry
Secondary Industry
Share of Global 500 Companies (%)
Unemployment Rate
Poverty Rate
Transportation
Public Transportation
Mass Transit Commuters
Major Airports
Major Ports
Education
Students Enrolled in Higher Education
Percent of Population with Higher Education (%)
Higher Education Institutions
Tourism
Total Tourists Annually (millions)
Foreign Tourists Annually (millions)
Domestic Tourists Annually (millions)
Annual Tourism Revenue ($US billions)
Hotel Rooms (thousands)
Health
Infant Mortality (Deaths per 1,000 Births)
Life Expectancy in Years (Male)
Life Expectancy in Years (Female)
Physicians per 100,000 People
Number of Hospitals
Anti-Smoking Legislation
Culture
Number of Museums
Number of Cultural and Arts Organizations
Environment
Green Spaces (km2)
Air Quality
Laws or Regulations to Improve Energy Efficiency
Retrofitted City Vehicle Fleet
Bike Share Program
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset expands on my earlier New York City Census Data dataset. It includes data from the entire country instead of just New York City. The expanded data will allow for much more interesting analyses and will also be much more useful at supporting other data sets.
The data here are taken from the DP03 and DP05 tables of the 2015 American Community Survey 5-year estimates. The full datasets and much more can be found at the American Factfinder website. Currently, I include two data files:
The two files have the same structure, with just a small difference in the name of the id column. Counties are political subdivisions, and the boundaries of some have been set for centuries. Census tracts, however, are defined by the census bureau and will have a much more consistent size. A typical census tract has around 5000 or so residents.
The Census Bureau updates the estimates approximately every year. At least some of the 2016 data is already available, so I will likely update this in the near future.
The data here were collected by the US Census Bureau. As a product of the US federal government, this is not subject to copyright within the US.
There are many questions that we could try to answer with the data here. Can we predict things such as the state (classification) or household income (regression)? What kinds of clusters can we find in the data? What other datasets can be improved by the addition of census data?
The Council has numerous standing committees that practice oversight of New York City functions, including human services, infrastructure, and government affairs. Each committee is headed by a Council Member (the Chair), includes at least five members, and meets at least once a month. In addition, the Council has several subcommittees, which are convened to review and make recommendations regarding topics of particular interest. After proposed legislation is heard by its appropriate Committee, it may be voted on and approved at that Committee. If the legislation is passed by Committee, is then sent to be considered by the whole Council. Council Members are assigned to committees through a process that the entire Council votes on. The NYC City Council Committee Membership dataset is drawn from the City Council's legislative API and updated weekly. Committee membership by City Council Members changes infrequently. This dataset includes committee membership starting Jan 1, 2018. Committee Descriptions: https://council.nyc.gov/committees/ Github: https://github.com/NewYorkCityCouncil/districts/tree/master/district_data/committees More info and API Key for Legislative API: https://council.nyc.gov/legislation/api/ Legislative API endpoints utilized for Committee Membership: 1)http://webapi.legistar.com/Help/Api/GET-v1-Client-Bodies https://webapi.legistar.com/v1/nyc/bodies/?token={}&$filter=(BodyTypeName+eq+'Committee'+or+BodyTypeName+eq+'Subcommittee'+or+BodyTypeName+eq+'Land Use')+and+BodyActiveFlag+eq+1 2)http://webapi.legistar.com/Help/Api/GET-v1-Client-Bodies-BodyId-OfficeRecords https://webapi.legistar.com/v1/nyc/bodies/{}/officerecords/?token={}&$filter=OfficeRecordStartDate+ge+datetime'{}'+and+OfficeRecordEndDate+eq+datetime'{}
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘New York City REACH Members’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/new-york-city/new-york-city-reach-members on 12 November 2021.
--- Dataset description provided by original source is as follows ---
The location and facility information for doctors who participate in NYC Regional Electronic Adoption Center for Health (REACH), which assists providers in adopting technology and methods for electronic health records.
This is a dataset hosted by the City of New York. The city has an open data platform found here and they update their information according the amount of data that is brought in. Explore New York City using Kaggle and all of the data sources available through the City of New York organization page!
This dataset is maintained using Socrata's API and Kaggle's API. Socrata has assisted countless organizations with hosting their open data and has been an integral part of the process of bringing more data to the public.
--- Original source retains full ownership of the source dataset ---
description: Population Numbers By New York City Neighborhood Tabulation Areas The data was collected from Census Bureaus' Decennial data dissemination (SF1). Neighborhood Tabulation Areas (NTAs), are aggregations of census tracts that are subsets of New York City's 55 Public Use Microdata Areas (PUMAs). Primarily due to these constraints, NTA boundaries and their associated names may not definitively represent neighborhoods. This report shows change in population from 2000 to 2010 for each NTA. Compiled by the Population Division New York City Department of City Planning.; abstract: Population Numbers By New York City Neighborhood Tabulation Areas The data was collected from Census Bureaus' Decennial data dissemination (SF1). Neighborhood Tabulation Areas (NTAs), are aggregations of census tracts that are subsets of New York City's 55 Public Use Microdata Areas (PUMAs). Primarily due to these constraints, NTA boundaries and their associated names may not definitively represent neighborhoods. This report shows change in population from 2000 to 2010 for each NTA. Compiled by the Population Division New York City Department of City Planning.