New York City Population by Community Districts. The data was collected from the Census Bureau's decennial data dissemination (SF1) for the years 1970, 1980, 1990, 2000 and 2010. Compiled by the Population Division – New York City Department of City Planning.
An index of pedestrian volumes tracking the long-term trends of neighborhood commercial corridors. Data is collected at 114 locations, including 100 on-street locations (primarily retail corridors), 13 East River and Harlem River bridge locations, and the Hudson River Greenway. Screenline sampling is conducted during May and September on the sidewalk, mid-block (or mid-bridge), on both sides of the street where applicable. Pedestrian volumes at 50 sample locations around the City are combined to create the Pedestrian Volume Index for the Mayor’s Management Report. Metadata: http://www.nyc.gov/html/dot/downloads/pdf/bi-annual-ped-count-readme.pdf
The dataset comes from CouncilStat, which is used by many NYC Council district offices to enter and track constituent cases that can range from issues around affordable housing to potholes and pedestrian safety. This dataset aggregates the information that individual staff have input. However, district staffs handle a wide range of complex issues. Each office uses the program differently and thus records cases differently, so comparisons between accounts may be difficult. Not all offices use the program. For more info - http://labs.council.nyc/districts/data/
The New York State Department of Health (NYS DOH) shares de-identified and aggregated metrics on the NYS Medicaid program through the Health Data NY catalog and as summary statistics on the DOH website. Datasets vary by subject/scope, unit of analysis, years of data collection, and update frequency. Publicly-available datasets in the Health Data NY catalog address topics including:
For a fee, researchers at NYU Langone Health may acquire NYS Medicaid claims data by submitting a study proposal to the Health Evaluation and Analytics Lab (HEAL). For more information, click on the link to the NYS Medicaid Claims File under the Related Datasets section or search for the NYS Medicaid Claims File in the NYU Data Catalog.
The NYC KIDS Survey is a population-based telephone survey conducted by the Health Department. The survey provides robust data on the health of children aged 13 years or younger (2017: children aged 0-13 years; 2019: children aged 1-13 years) in New York City, including citywide and borough estimates, on a broad range of topics including physical and mental health, health care access, and school and childcare enrollment and learning. For more information, visit https://www1.nyc.gov/site/doh/data/data-sets/child-chs.page
The New York Times Annotated Corpus contains over 1.8 million articles written and published by the New York Times between January 1, 1987 and June 19, 2007 with article metadata provided by the New York Times Newsroom, the New York Times Indexing Service and the online production staff at nytimes.com. The corpus includes:
Over 1.8 million articles (excluding wire services articles that appeared during the covered period).
Over 650,000 article summaries written by library scientists.
Over 1,500,000 articles manually tagged by library scientists with tags drawn from a normalized indexing vocabulary of people, organizations, locations and topic descriptors.
Over 275,000 algorithmically-tagged articles that have been hand verified by the online production staff at nytimes.com.
As part of the New York Times' indexing procedures, most articles are manually summarized and tagged by a staff of library scientists. This collection contains over 650,000 article-summary pairs which may prove to be useful in the development and evaluation of algorithms for automated document summarization. Also, over 1.5 million documents have at least one tag. Articles are tagged for persons, places, organizations, titles and topics using a controlled vocabulary that is applied consistently across articles. For instance if one article mentions "Bill Clinton" and another refers to "President William Jefferson Clinton", both articles will be tagged with "CLINTON, BILL".
https://creativecommons.org/publicdomain/zero/1.0/
There are a number of Kaggle datasets that provide spatial data around New York City. For many of these, it may be quite interesting to relate the data to the demographic and economic characteristics of nearby neighborhoods. I hope this data set will allow for making these comparisons without too much difficulty.
Exploring the data and making maps could be quite interesting as well.
This dataset contains two CSV files:
nyc_census_tracts.csv
This file contains a selection of census data taken from the ACS DP03 and DP05 tables. Things like total population, racial/ethnic demographic information, employment and commuting characteristics, and more are contained here. There is a great deal of additional data in the raw tables retrieved from the US Census Bureau website, so I could easily add more fields if there is enough interest.
I obtained data for individual census tracts, which typically contain several thousand residents.
census_block_loc.csv
For this file, I used an online FCC census block lookup tool to retrieve the census block code for a 200 x 200 grid containing
New York City and a bit of the surrounding area. This file contains the coordinates and associated census block codes along
with the state and county names to make things a bit more readable to users.
Each census tract is split into a number of blocks, so one must extract the census tract code from the block code.
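The tract extraction described above can be sketched as follows. This is a minimal sketch, assuming the block codes are standard 15-digit census block FIPS codes (2-digit state, 3-digit county, 6-digit tract, 4-digit block), so that the tract code is the first 11 digits. The function name is hypothetical, and whether the CSV stores codes as text or as integers is not stated here, so both are handled.

```python
def tract_from_block(block_code):
    """Return the 11-digit census tract code for a 15-digit block code.

    Assumes the standard FIPS layout: state (2) + county (3) +
    tract (6) + block (4). Integer input is zero-padded back to
    15 digits, since a leading zero in the state code is lost when
    the code is read as a number.
    """
    text = str(block_code).strip().zfill(15)
    if len(text) != 15 or not text.isdigit():
        raise ValueError(f"not a 15-digit block code: {block_code!r}")
    return text[:11]


# Example with an illustrative code (state 36, county 005).
print(tract_from_block("360050001001000"))
```

The resulting tract code can then be used as a join key against the tract-level file, provided both sides are compared as zero-padded strings.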
The data here was taken from the American Community Survey 2015 5-year estimates (https://factfinder.census.gov/faces/nav/jsf/pages/index.xhtml).
The census block coordinate data was taken from the FCC Census Block Conversions API (https://www.fcc.gov/general/census-block-conversions-api)
As public data from the US government, this is not subject to copyright within the US and should be considered public domain.
Annual Average Daily Traffic (AADT) is an estimate of the average daily traffic along a defined segment of roadway. This value is calculated from short term counts taken along the same section which are then factored to produce the estimate of AADT. Because of this process, the most recent AADT for any given roadway will always be for the previous year. Data is available for all New York State Routes and roads that are part of the Federal Aid System.
https://creativecommons.org/publicdomain/zero/1.0/
Since 1998, the New York City Police Department (NYPD) has been tasked with the collection and maintenance of crime data for incidents that occur in New York City public schools. The NYPD has provided this data to the New York City Department of Education (DOE). The DOE has compiled this data by schools and locations for the information of our parents and students, our teachers and staff, and the general public. In some instances, several Department of Education learning communities co-exist within a single building. In other instances, a single school has locations in several different buildings. In either of these instances, the data presented here is aggregated by building location rather than by school, since safety is always a building-wide issue. We use “consolidated locations” throughout the presentation of the data to indicate the numbers of incidents in buildings that include more than one learning community.
This is a dataset hosted by the City of New York. The city has an open data platform and updates its information according to the amount of data that is brought in. Explore New York City using Kaggle and all of the data sources available through the City of New York organization page!
This dataset is maintained using Socrata's API and Kaggle's API. Socrata has assisted countless organizations with hosting their open data and has been an integral part of the process of bringing more data to the public.
Data extracted from records of tickets on file with NYS DMV. The tickets were issued to motorists for violations of: NYS Vehicle & Traffic Law (VTL), Thruway Rules and Regulations, Tax Law, Transportation Law, Parks and Recreation Regulations, Local New York City Traffic Ordinances, and NYS Penal Law pertaining to the involvement of a motor vehicle in acts of assault, homicide, manslaughter and criminal negligence resulting in injury or death.
The Council has numerous standing committees that practice oversight of New York City functions, including human services, infrastructure, and government affairs. Each committee is headed by a Council Member (the Chair), includes at least five members, and meets at least once a month. In addition, the Council has several subcommittees, which are convened to review and make recommendations regarding topics of particular interest. After proposed legislation is heard by its appropriate Committee, it may be voted on and approved at that Committee. If the legislation is passed by Committee, it is then sent to be considered by the whole Council. Council Members are assigned to committees through a process that the entire Council votes on. The NYC City Council Committee Membership dataset is drawn from the City Council's legislative API and updated weekly. Committee membership by City Council Members changes infrequently. This dataset includes committee membership starting Jan 1, 2018.
Committee Descriptions: https://council.nyc.gov/committees/
Github: https://github.com/NewYorkCityCouncil/districts/tree/master/district_data/committees
More info and API Key for Legislative API: https://council.nyc.gov/legislation/api/
Legislative API endpoints utilized for Committee Membership:
1) http://webapi.legistar.com/Help/Api/GET-v1-Client-Bodies
https://webapi.legistar.com/v1/nyc/bodies/?token={}&$filter=(BodyTypeName+eq+'Committee'+or+BodyTypeName+eq+'Subcommittee'+or+BodyTypeName+eq+'Land Use')+and+BodyActiveFlag+eq+1
2) http://webapi.legistar.com/Help/Api/GET-v1-Client-Bodies-BodyId-OfficeRecords
https://webapi.legistar.com/v1/nyc/bodies/{}/officerecords/?token={}&$filter=OfficeRecordStartDate+ge+datetime'{}'+and+OfficeRecordEndDate+eq+datetime'{}
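As an illustration of how the first endpoint's `{}` placeholder might be filled in, here is a minimal sketch that builds the bodies request URL from an API token. The filter string is copied from the URL listed above. The function name and the named `{token}` placeholder are assumptions made for this example, and the sketch only builds the URL without sending a request.

```python
from urllib.parse import quote

# Template copied from the first endpoint listed above; the anonymous
# {} placeholder is given a name here purely for readability.
BODIES_TEMPLATE = (
    "https://webapi.legistar.com/v1/nyc/bodies/"
    "?token={token}"
    "&$filter=(BodyTypeName+eq+'Committee'"
    "+or+BodyTypeName+eq+'Subcommittee'"
    "+or+BodyTypeName+eq+'Land Use')"
    "+and+BodyActiveFlag+eq+1"
)


def build_bodies_url(token):
    """Return the URL for listing active committees and subcommittees.

    The token is percent-encoded so that any reserved characters in it
    cannot alter the query string.
    """
    return BODIES_TEMPLATE.format(token=quote(token, safe=""))


# "YOUR_API_KEY" is a stand-in, not a real token.
print(build_bodies_url("YOUR_API_KEY"))
```

The second endpoint takes a body ID, a token, and two dates in the same way; it is left out of the sketch because its template is cut off in the listing above.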
A list of all closed allegations made against uniformed members of the New York Police Department since the year 2000. A single complaint may include multiple allegations involving multiple victims / alleged victims and multiple officers. A single allegation is between one complainant and one officer. The term "Victim / Alleged Victim" refers to the person claiming harm by one or more allegations of police misconduct.
The dataset is part of a database of all public police misconduct records the Civilian Complaint Review Board (CCRB) maintains on complaints against New York Police Department uniformed members of service received in CCRB's jurisdiction since the year 2000, when CCRB's database was first built. This data is published as four tables:
Civilian Complaint Review Board: Police Officers
Civilian Complaint Review Board: Complaints Against Police Officers
Civilian Complaint Review Board: Allegations Against Police Officers
Civilian Complaint Review Board: Penalties
A single complaint can include multiple allegations, and those allegations may include multiple subject officers and multiple complainants.
Public records exclude complaints and allegations that were closed as Mediated, Mediation Attempted, Administrative Closure, Conciliated (for some complaints prior to the year 2000), or closed as Other Possible Misconduct Noted.
This database is inclusive of prior datasets held on Open Data (previously maintained as "Civilian Complaint Review Board (CCRB) - Complaints Received," "Civilian Complaint Review Board (CCRB) - Complaints Closed," and "Civilian Complaint Review Board (CCRB) - Allegations Closed") but includes information and records made public by the June 2020 repeal of New York Civil Rights law 50-a, which precipitated a full revision of what CCRB data could be considered public.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Community Districts (Water Areas Included)’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/c41c97e9-eaf4-429d-9c86-b59ad8c2e873 on 13 February 2022.
--- Dataset description provided by original source is as follows ---
GIS data: Community Districts (Water areas included)
Community Districts are mandated by the city charter to review and monitor quality of life issues for New York City (NYC) neighborhoods. NYC is currently comprised of 59 community districts. The first byte is a borough code and the second and third bytes are the community district number. There are also 12 Joint Interest Areas (JIAs). The JIAs are major parks and airports and are not contained within any community district. This dataset is being provided by the Department of City Planning (DCP) for informational purposes only. DCP does not warrant the completeness, accuracy, content, or fitness for any particular purpose or use of the dataset, nor are any such warranties to be implied or inferred with respect to the dataset as furnished on the website. DCP and the City are not liable for any deficiencies in the completeness, accuracy, content, or fitness for any particular purpose or use of the dataset, or applications utilizing the dataset, provided by any third party.
All previously released versions of this data are available at BYTES of the BIG APPLE- Archive
--- Original source retains full ownership of the source dataset ---
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
This dataset provides a survey of Irish writers publishing in The New Yorker magazine from 1940 to 1980.
Methodology
I conduct the survey through archival research and secondary references. The primary sources are The New Yorker's digital archive and the New Yorker Records housed in the New York Public Library.
Parameters
The timeframe of the survey concerns The New Yorker’s international expansion in the middle decades of the twentieth century. It starts from 1940 and ends in 1980, when the magazine industry’s cultural impact was eclipsed by the popularity of TV. For the purpose of the project, I focus on "fiction" contributions. Verse and shorter writings (such as column pieces and book reviews) are not included. Therefore, Maeve Brennan's shorter pieces under her alias "the long-winded lady" and Patricia Collinge's shorter contributions are not included in the quantitative survey. This survey includes both Irish and Irish-American writers. One key criterion of the selection is the writer’s connection with Ireland and Irish culture. Irish-American writers whose works are more concerned with (Irish-)America than with Ireland itself are excluded from the survey. Therefore, Elizabeth Cullinan and J.P. Donleavy are included, while John O’Hara and Mary McCarthy are not.
Notes for Users
The list is presented in the chronological order of the contributions’ appearance in the magazine. The date format follows the international convention (ISO 8601), thus: year/month/day. The date refers to the publication of The New Yorker issues. The New Yorker is a weekly, and the timeframe of the project spans four decades. This means that there are thousands of back issues under examination. I acknowledge the possibility that there are Irish writers whose contributions in the magazine escaped my attention. If there is any omission, I would appreciate the user’s input to update the survey.
It is hoped that this survey will help researchers investigate the Irish connections with one of America’s most influential publications. Teachers, students, and the general public may also use this list as a guide to better appreciate these fascinating Irish stories.
Pursuant to New York City’s Housing Maintenance Code, the Department of Housing Preservation and Development (HPD) issues violations for conditions in rental dwelling units and buildings that have been verified to violate the New York City Housing Maintenance Code (HMC) or the New York State Multiple Dwelling Law (MDL).
Each row in this dataset contains discrete information about one violation of the New York City Housing Maintenance Code or New York State Multiple Dwelling Law. Each violation is identified using a unique Violation ID. These laws are in place to provide requirements for the maintenance of residential dwelling units within New York City.
Violations are issued by Housing Inspectors after a physical inspection is conducted (except for class I violations which are generally administratively issued). Violations are issued in four classes: Class A (non-hazardous), Class B (hazardous), Class C (immediately hazardous) and Class I (information orders). For more information on violations, see https://www1.nyc.gov/site/hpd/owners/compliance-clear-violations.page
The base data for this file is all violations open as of October 1, 2012. Violation data is updated daily. The daily update includes both new violations and updates to the status of previously issued violations. An open violation is a violation which is still active on the Department records. See the status table for determining how to filter for open violations versus closed violations, and within open violations for a more detailed current status.
The property owner may or may not have corrected the physical condition if the status is open. The violation status is closed when the violation is observed/verified as corrected by HPD or as certified by the landlord. The processes for having violations dismissed are described at http://www1.nyc.gov/site/hpd/owners/compliance-clear-violations.page
Using other HPD datasets, such as the Building File or the Registration File, a user can link together violations issued for given buildings or for given owners.
GIS data: Community Districts (Water areas included)
Community Districts are mandated by the city charter to review and monitor quality of life issues for New York City (NYC) neighborhoods. NYC is currently comprised of 59 community districts. The first byte is a borough code and the second and third bytes are the community district number. There are also 12 Joint Interest Areas (JIAs). The JIAs are major parks and airports and are not contained within any community district. This dataset is being provided by the Department of City Planning (DCP) for informational purposes only. DCP does not warrant the completeness, accuracy, content, or fitness for any particular purpose or use of the dataset, nor are any such warranties to be implied or inferred with respect to the dataset as furnished on the website. DCP and the City are not liable for any deficiencies in the completeness, accuracy, content, or fitness for any particular purpose or use of the dataset, or applications utilizing the dataset, provided by any third party.
All previously released versions of this data are available at BYTES of the BIG APPLE- Archive
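The three-character district code described above can be split into its two parts as in the following minimal sketch. The borough numbering used here (1 Manhattan, 2 Bronx, 3 Brooklyn, 4 Queens, 5 Staten Island) is the usual city convention and is an assumption, since the description does not spell it out. The function name is hypothetical.

```python
# Assumed borough numbering; not stated in the dataset description.
BOROUGHS = {
    1: "Manhattan",
    2: "Bronx",
    3: "Brooklyn",
    4: "Queens",
    5: "Staten Island",
}


def parse_boro_cd(code):
    """Split a 3-character code into (borough name, district number).

    The first character is the borough code and the remaining two
    characters are the community district number.
    """
    text = str(code).strip()
    if len(text) != 3 or not text.isdigit():
        raise ValueError(f"expected a 3-digit code, got {code!r}")
    borough_code = int(text[0])
    if borough_code not in BOROUGHS:
        raise ValueError(f"unknown borough code in {code!r}")
    return BOROUGHS[borough_code], int(text[1:])


print(parse_boro_cd(101))
print(parse_boro_cd("412"))
```

Joint Interest Areas use the same three-character layout, so the function returns a borough and a number for them too. Telling them apart from the 59 community districts needs a rule that the description does not give.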
This map shows the percentage of premature deaths of individuals less than 75 years old by county. Counties are shaded based on quartile distribution. The lighter shaded counties have a lower percentage of premature deaths. The darker shaded counties have a higher percentage of premature deaths. New York State Community Health Indicator Reports (CHIRS) were developed in 2012, and are updated annually to consolidate and improve data linkages for the health indicators included in the County Health Assessment Indicators (CHAI) for all communities in New York. The CHIRS present data for more than 300 health indicators that are organized by 15 different health topics. Data are provided for all 62 New York State counties, 11 regions (including New York City), the State excluding New York City, and New York State. For more information, check out: http://www.health.ny.gov/statistics/chac/indicators/. The "About" tab contains additional details concerning this dataset.
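To make the quartile shading concrete, here is a minimal sketch that assigns each county to one of four classes, with class 1 the lightest shade and class 4 the darkest. The exact cut-point method used for the map is not stated, so the "inclusive" quantile method below is an assumption. The county names and values are placeholders, not real data.

```python
from bisect import bisect_left
from statistics import quantiles


def quartile_classes(value_by_county):
    """Map each county to a quartile class from 1 (lowest) to 4 (highest).

    Three cut points split the values into four groups; a value equal
    to a cut point falls into the lower class.
    """
    cuts = quantiles(list(value_by_county.values()), n=4, method="inclusive")
    return {
        county: bisect_left(cuts, value) + 1
        for county, value in value_by_county.items()
    }


# Placeholder values for illustration only.
sample = {"County A": 10, "County B": 20, "County C": 30,
          "County D": 40, "County E": 50}
for county, shade in quartile_classes(sample).items():
    print(county, shade)
```

A mapping tool would then assign one color per class, so counties in class 4 appear darkest.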
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This map shows the incidence rate per 100,000 for all cancer types by county. Counties are shaded based on quartile distribution. The lighter shaded counties have lower cancer incidence rates. The darker shaded counties have higher cancer incidence rates. New York State Community Health Indicator Reports (CHIRS) were developed in 2012, and are updated annually to consolidate and improve data linkages for the health indicators included in the County Health Assessment Indicators (CHAI) for all communities in New York. The CHIRS present data for more than 300 health indicators that are organized by 15 different health topics. Data are provided for all 62 New York State counties, 11 regions (including New York City), the State excluding New York City, and New York State. For more information, check out: http://www.health.ny.gov/statistics/chac/indicators/. The "About" tab contains additional details concerning this dataset.