There has been little research on United States homicide rates from a long-term perspective, primarily because there has been no consistent data series on a particular place preceding the Uniform Crime Reports (UCR), which began its first full year in 1931. To fill this research gap, this project created a data series on homicides per capita for New York City that spans two centuries. The goal was to create a site-specific, individual-based data series that could be used to examine major social shifts related to homicide, such as mass immigration, urban growth, war, demographic changes, and changes in laws. Data were also gathered on various other sites, particularly in England, to allow for comparisons on important issues, such as the post-World War II wave of violence. The basic approach to the data collection was to obtain the best possible estimate of annual counts and the most complete information on individual homicides. The annual count data (Parts 1 and 3) were derived from multiple sources, including the Federal Bureau of Investigation's Uniform Crime Reports and Supplementary Homicide Reports, as well as other official counts from the New York City Police Department and the City Inspector in the early 19th century. The data include a combined count of murder and manslaughter because charge bargaining often blurs this legal distinction. The individual-level data (Part 2) were drawn from coroners' indictments held by the New York City Municipal Archives, and from daily newspapers. Duplication was avoided by keeping a record for each victim. The estimation technique known as "capture-recapture" was used to estimate homicides not listed in either source. Part 1 variables include counts of New York City homicides, arrests, and convictions, as well as the homicide rate, race or ethnicity and gender of victims, type of weapon used, and source of data. 
Part 2 includes the date of the murder, the age, sex, and race of the offender and victim, and whether the case led to an arrest, trial, conviction, execution, or pardon. Part 3 contains annual homicide counts and rates for various comparison sites including Liverpool, London, Kent, Canada, Baltimore, Los Angeles, Seattle, and San Francisco.
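The capture-recapture step mentioned above can be illustrated with a two-source estimate. The sketch below uses Chapman's form of the Lincoln-Petersen estimator, which is an assumption: the description does not say which estimator the project used. The function names and the counts in the example are hypothetical.

```python
def chapman_estimate(n_source_a: int, n_source_b: int, n_both: int) -> float:
    """Estimate the total number of cases from two overlapping lists.

    n_source_a: cases found in the first source (e.g. coroners' records)
    n_source_b: cases found in the second source (e.g. newspapers)
    n_both:     cases found in both sources
    """
    if n_both > min(n_source_a, n_source_b):
        raise ValueError("overlap cannot exceed either source count")
    # Chapman's bias-corrected version of the Lincoln-Petersen estimator
    # (assumed here; the dataset description names no specific estimator).
    return (n_source_a + 1) * (n_source_b + 1) / (n_both + 1) - 1


def estimate_missed(n_source_a: int, n_source_b: int, n_both: int) -> float:
    """Estimated number of cases that appear in neither source."""
    observed = n_source_a + n_source_b - n_both
    return chapman_estimate(n_source_a, n_source_b, n_both) - observed


# Hypothetical counts for a single year (not taken from the dataset).
total = chapman_estimate(n_source_a=59, n_source_b=49, n_both=29)
missed = estimate_missed(n_source_a=59, n_source_b=49, n_both=29)
```

The estimate relies on the two sources being roughly independent; if newspapers tended to report the same cases that reached the coroner, the number of missed cases would be understated.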
This dataset includes all valid felony, misdemeanor, and violation crimes reported to the New York City Police Department (NYPD) for all complete quarters so far this year (2017). For additional details, please see the attached data dictionary in the ‘About’ section.
Daily count of NYC residents who tested positive for SARS-CoV-2, who were hospitalized with COVID-19, and deaths among COVID-19 patients.
Note that this dataset currently pulls from https://raw.githubusercontent.com/nychealth/coronavirus-data/master/trends/data-by-day.csv on a daily basis.
The leading causes of death by sex and ethnicity in New York City since 2007. Cause of death is derived from the NYC death certificate, which is issued for every death that occurs in New York City.
Report last ran: 09/24/2019
List of every shooting incident that occurred in NYC during the current calendar year. This data is manually extracted every quarter and reviewed by the Office of Management Analysis and Planning before being posted on the NYPD website. Each record represents a shooting incident in NYC and includes information about the event, the location and time of occurrence. Information related to suspect and victim demographics is also included. This data can be used by the public to explore the nature of police enforcement activity. Please refer to the attached data footnotes for additional information about this dataset.
This dataset shows actual locations of newsstands throughout New York City, with actual contact information; however, I've added a few columns of fake data representing magazine sales at each newsstand. The numbers are randomly generated and fake.
This dataset provides highly detailed (block-level) views of various demographics for Manhattan, New York City. It includes information on age, race, sex, income, housing, and various other attributes. The data comes from the 2000 US Census and was joined to the Census TIGER/Line files to create the output. Enjoy!
Note: Data elements were retired from HERDS on 10/6/23 and this dataset was archived.
This dataset includes the cumulative number and percent of healthcare facility-reported fatalities for patients with lab-confirmed COVID-19 disease by reporting date and age group. This dataset does not include fatalities related to COVID-19 disease that did not occur at a hospital, nursing home, or adult care facility. The primary goal of publishing this dataset is to provide users with information about healthcare facility fatalities among patients with lab-confirmed COVID-19 disease.
The information in this dataset is also updated daily on the NYS COVID-19 Tracker at https://www.ny.gov/covid-19tracker.
The data source for this dataset is the daily COVID-19 survey through the New York State Department of Health (NYSDOH) Health Electronic Response Data System (HERDS). Hospitals, nursing homes, and adult care facilities are required to complete this survey daily. The information from the survey is used for statewide surveillance, planning, resource allocation, and emergency response activities. Hospitals began reporting for the HERDS COVID-19 survey in March 2020, while Nursing Homes and Adult Care Facilities began reporting in April 2020. It is important to note that fatalities related to COVID-19 disease that occurred prior to the first publication dates are also included.
The fatality numbers in this dataset are calculated by assigning age groups to each patient based on the patient age, then summing the patient fatalities within each age group, as of each reporting date. The statewide total fatality numbers are calculated by summing the number of fatalities across all age groups, by reporting date. The fatality percentages are calculated by dividing the number of fatalities in each age group by the statewide total number of fatalities, by reporting date. The fatality numbers represent the cumulative number of fatalities that have been reported as of each reporting date.
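The calculation described above (assign age groups, sum fatalities per group, divide by the statewide total) can be sketched as follows. This is a minimal illustration, not the NYSDOH implementation: the 10-year age bands, the function names, and the sample ages are all assumptions.

```python
from collections import Counter


def age_group(age: int) -> str:
    """Assign a patient age to a group (hypothetical 10-year bands, 90+ open-ended)."""
    if age >= 90:
        return "90+"
    low = (age // 10) * 10
    return f"{low}-{low + 9}"


def fatality_summary(patient_ages):
    """Return {age_group: (count, percent_of_total)} for one reporting date.

    patient_ages holds the ages of all fatalities reported up to and
    including that date, so the counts are cumulative.
    """
    counts = Counter(age_group(age) for age in patient_ages)
    statewide_total = sum(counts.values())
    return {
        group: (n, round(100 * n / statewide_total, 1))
        for group, n in counts.items()
    }


# Made-up ages, for illustration only.
summary = fatality_summary([45, 67, 68, 71, 72, 79, 83, 91])
```

Running the same function on the cumulative list of fatalities for each reporting date would give one row per age group per date, which matches the layout the description implies.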
This file contains COVID-19 death counts, death rates, and percent of total deaths by jurisdiction of residence. The data is grouped by different time periods, including 3-month period, weekly, and total (cumulative since January 1, 2020). United States death counts and rates include the 50 states, plus the District of Columbia and New York City. New York state estimates exclude New York City. Puerto Rico is included in HHS Region 2 estimates. Deaths included are those with confirmed or presumed COVID-19, coded to ICD–10 code U07.1.
The number of deaths reported in this file is the total number of COVID-19 deaths received and coded as of the date of analysis and may not represent all deaths that occurred in that period. Counts of deaths occurring before or after the reporting period are not included in the file. Data during recent periods are incomplete because of the lag in time between when the death occurred and when the death certificate is completed, submitted to NCHS, and processed for reporting purposes. This delay can range from 1 week to 8 weeks or more, depending on the jurisdiction and cause of death. Death counts should not be compared across states. Data timeliness varies by state. Some states report deaths on a daily basis, while other states report deaths weekly or monthly.
The ten (10) United States Department of Health and Human Services (HHS) regions include the following jurisdictions. Region 1: Connecticut, Maine, Massachusetts, New Hampshire, Rhode Island, Vermont; Region 2: New Jersey, New York, New York City, Puerto Rico; Region 3: Delaware, District of Columbia, Maryland, Pennsylvania, Virginia, West Virginia; Region 4: Alabama, Florida, Georgia, Kentucky, Mississippi, North Carolina, South Carolina, Tennessee; Region 5: Illinois, Indiana, Michigan, Minnesota, Ohio, Wisconsin; Region 6: Arkansas, Louisiana, New Mexico, Oklahoma, Texas; Region 7: Iowa, Kansas, Missouri, Nebraska; Region 8: Colorado, Montana, North Dakota, South Dakota, Utah, Wyoming; Region 9: Arizona, California, Hawaii, Nevada; Region 10: Alaska, Idaho, Oregon, Washington.
Rates were calculated using the population estimates for 2021, which are estimated as of July 1, 2021 based on the Blended Base produced by the US Census Bureau in lieu of the April 1, 2020 decennial population count. The Blended Base consists of the blend of Vintage 2020 postcensal population estimates, 2020 Demographic Analysis Estimates, and 2020 Census PL 94-171 Redistricting File (see https://www2.census.gov/programs-surveys/popest/technical-documentation/methodology/2020-2021/methods-statement-v2021.pdf). Rates are based on deaths occurring in the specified week/month and are age-adjusted to the 2000 standard population using the direct method (see https://www.cdc.gov/nchs/data/nvsr/nvsr70/nvsr70-08-508.pdf). These rates differ from annual age-adjusted rates, typically presented in NCHS publications based on a full year of data and annualized weekly/monthly age-adjusted rates which have been adjusted to allow comparison with annual rates. Annualized rates present deaths per year per 100,000 population that would be expected if the observed period-specific (weekly/monthly) rate prevailed for a full year.
Sub-national death counts between 1-9 are suppressed in accordance with NCHS data confidentiality standards. Rates based on death counts less than 20 are suppressed in accordance with NCHS standards of reliability as specified in NCHS Data Presentation Standards for Proportions (available from: https://www.cdc.gov/nchs/data/series/sr_02/sr02_175.pdf).
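The rate calculations and suppression rules described above can be sketched as follows. This is an illustration under stated assumptions, not NCHS code: the function names are hypothetical, the annualization assumes a simple scaling by days in the period, and the three age groups and their numbers are made up.

```python
from typing import Optional, Sequence


def age_adjusted_rate(
    deaths: Sequence[float],
    populations: Sequence[float],
    standard_populations: Sequence[float],
) -> float:
    """Direct method: weight each age-specific rate by its share of the
    standard population. Returns deaths per 100,000."""
    if not (len(deaths) == len(populations) == len(standard_populations)):
        raise ValueError("all inputs need one entry per age group")
    standard_total = sum(standard_populations)
    weighted = sum(
        (d / p) * (s / standard_total)
        for d, p, s in zip(deaths, populations, standard_populations)
    )
    return weighted * 100_000


def annualize(period_rate: float, days_in_period: int, days_in_year: int = 365) -> float:
    """Scale a period rate to the rate expected if it prevailed for a full year
    (assumed here to be a simple ratio of days)."""
    return period_rate * days_in_year / days_in_period


def suppress_count(count: int) -> Optional[int]:
    """Counts between 1 and 9 are suppressed (returned as None)."""
    return None if 1 <= count <= 9 else count


def suppress_rate(rate: float, count: int) -> Optional[float]:
    """Rates based on fewer than 20 deaths are suppressed (returned as None)."""
    return None if count < 20 else rate


# Made-up weekly figures for three age groups, for illustration only.
deaths = [2, 10, 30]
populations = [100_000, 50_000, 20_000]
standard = [50, 30, 20]  # hypothetical standard population shares

weekly_rate = age_adjusted_rate(deaths, populations, standard)
annualized_rate = annualize(weekly_rate, days_in_period=7)
published_rate = suppress_rate(annualized_rate, count=sum(deaths))
```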
This is the US Coronavirus data repository from The New York Times. This data includes COVID-19 cases and deaths reported by state and county. The New York Times compiled this data based on reports from state and local health agencies. More information on the data repository is available here. For additional reporting and data visualizations, see The New York Times’ U.S. coronavirus interactive site.
Which US counties have the most confirmed cases per capita? This query determines which counties have the most cases per 100,000 residents. Note that this may differ from similar queries of other datasets because of differences in reporting lag, methodologies, or other dataset differences.
SELECT
  covid19.county,
  covid19.state_name,
  total_pop AS county_population,
  confirmed_cases,
  ROUND(confirmed_cases / total_pop * 100000, 2) AS confirmed_cases_per_100000,
  deaths,
  ROUND(deaths / total_pop * 100000, 2) AS deaths_per_100000
FROM
  `bigquery-public-data.covid19_nyt.us_counties` covid19
JOIN
  -- County populations come from the 2017 5-year ACS estimates.
  `bigquery-public-data.census_bureau_acs.county_2017_5yr` acs
  ON covid19.county_fips_code = acs.geo_id
WHERE
  -- Use yesterday's cumulative counts.
  date = DATE_SUB(CURRENT_DATE(), INTERVAL 1 DAY)
  AND covid19.county_fips_code != "00000"
ORDER BY
  confirmed_cases_per_100000 DESC
How do I calculate the number of new COVID-19 cases per day?
This query determines the total number of new cases in each state for each day available in the dataset.
SELECT
  b.state_name,
  b.date,
  MAX(b.confirmed_cases - a.confirmed_cases) AS daily_confirmed_cases
FROM (
  -- Shift each row forward one day so it lines up with the following day's row.
  SELECT
    state_name AS state,
    state_fips_code,
    confirmed_cases,
    DATE_ADD(date, INTERVAL 1 DAY) AS date_shift
  FROM
    `bigquery-public-data.covid19_nyt.us_states`
  WHERE
    confirmed_cases + deaths > 0
) a
JOIN
  `bigquery-public-data.covid19_nyt.us_states` b
  ON a.state_fips_code = b.state_fips_code
  AND a.date_shift = b.date
GROUP BY
  b.state_name, b.date
ORDER BY
  b.date DESC
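The same day-over-day difference can be sketched outside BigQuery. The Python below is a minimal illustration, assuming rows of (state, date, cumulative confirmed cases); the function name and the sample rows are hypothetical, and the query's filter on `confirmed_cases + deaths > 0` is left out for brevity.

```python
from datetime import date, timedelta


def daily_new_cases(rows):
    """rows: iterable of (state, day, cumulative_confirmed_cases).

    Returns {(state, day): new_cases} for every day that also has a record
    for the preceding day, mirroring the self-join on date + 1 day.
    """
    cumulative = {(state, day): cases for state, day, cases in rows}
    result = {}
    for (state, day), cases in cumulative.items():
        previous = cumulative.get((state, day - timedelta(days=1)))
        if previous is not None:
            result[(state, day)] = cases - previous
    return result


# Made-up cumulative counts, for illustration only.
rows = [
    ("New York", date(2020, 3, 1), 1),
    ("New York", date(2020, 3, 2), 11),
    ("New York", date(2020, 3, 4), 106),  # no record for March 3
    ("Washington", date(2020, 3, 1), 18),
    ("Washington", date(2020, 3, 2), 27),
]
new_cases = daily_new_cases(rows)
```

As in the query, a day with no record for the day before produces no output row, so gaps in the source data show up as missing days.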
Notice of data discontinuation: Since the start of the pandemic, AP has reported case and death counts from data provided by Johns Hopkins University. Johns Hopkins University has announced that they will stop their daily data collection efforts after March 10. As Johns Hopkins stops providing data, the AP will also stop collecting daily numbers for COVID cases and deaths. The HHS and CDC now collect and visualize key metrics for the pandemic. AP advises using those resources when reporting on the pandemic going forward.
The AP is using data collected by the Johns Hopkins University Center for Systems Science and Engineering as our source for outbreak caseloads and death counts for the United States and globally.
The Hopkins data is available at the county level in the United States. The AP has paired this data with population figures and county rural/urban designations, and has calculated caseload and death rates per 100,000 people. Be aware that caseloads may reflect the availability of tests -- and the ability to turn around test results quickly -- rather than actual disease spread or true infection rates.
This data is from the Hopkins dashboard that is updated regularly throughout the day. Like all organizations dealing with data, Hopkins is constantly refining and cleaning up their feed, so there may be brief moments where data does not appear correctly. At this link, you’ll find the Hopkins daily data reports, and a clean version of their feed.
The AP is updating this dataset hourly at 45 minutes past the hour.
To learn more about AP's data journalism capabilities for publishers, corporations and financial institutions, go here or email kromano@ap.org.
Use AP's queries to filter the data or to join to other datasets we've made available to help cover the coronavirus pandemic
Filter cases by state here
Rank states by their status as current hotspots. Calculates the 7-day rolling average of new cases per capita in each state: https://data.world/associatedpress/johns-hopkins-coronavirus-case-tracker/workspace/query?queryid=481e82a4-1b2f-41c2-9ea1-d91aa4b3b1ac
Find recent hotspots within your state by running a query to calculate the 7-day rolling average of new cases by capita in each county: https://data.world/associatedpress/johns-hopkins-coronavirus-case-tracker/workspace/query?queryid=b566f1db-3231-40fe-8099-311909b7b687&showTemplatePreview=true
Join county-level case data to an earlier dataset released by AP on local hospital capacity here. To find out more about the hospital capacity dataset, see the full details.
Pull the 100 counties with the highest per-capita confirmed cases here
Rank all the counties by the highest per-capita rate of new cases in the past 7 days here. Be aware that because this ranks per-capita caseloads, very small counties may rise to the very top, so take into account raw caseload figures as well.
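The hotspot queries above rest on a 7-day rolling average of new cases per capita. A minimal sketch of that calculation, assuming a list of cumulative daily counts ordered oldest first; the function name and the numbers are hypothetical and this is not AP's query logic.

```python
def rolling_new_cases_per_100k(cumulative_cases, population, window=7):
    """Trailing `window`-day average of new cases per 100,000 residents.

    cumulative_cases: daily cumulative counts, oldest first.
    Returns one value for each day that has a full window of new-case counts.
    """
    # Day-over-day differences turn cumulative counts into new cases.
    new_cases = [b - a for a, b in zip(cumulative_cases, cumulative_cases[1:])]
    averages = []
    for end in range(window, len(new_cases) + 1):
        mean_new = sum(new_cases[end - window:end]) / window
        averages.append(mean_new * 100_000 / population)
    return averages


# Made-up cumulative counts for a county of 100,000 people.
cumulative = [0, 10, 20, 30, 40, 50, 60, 70, 140]
averages = rolling_new_cases_per_100k(cumulative, population=100_000)
```

Because the result is per capita, a handful of cases in a very small county can produce a large value, which is why the raw caseload should be read alongside it.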
The AP has designed an interactive map to track COVID-19 cases reported by Johns Hopkins.
@(https://datawrapper.dwcdn.net/nRyaf/15/)
Johns Hopkins timeseries data
- Johns Hopkins pulls data regularly to update their dashboard. Once a day, around 8pm EDT, Johns Hopkins adds the counts for all areas they cover to the timeseries file. These counts are snapshots of the latest cumulative counts provided by the source on that day. This can lead to inconsistencies if a source updates their historical data for accuracy, either increasing or decreasing the latest cumulative count.
- Johns Hopkins periodically edits their historical timeseries data for accuracy. They provide a file documenting all errors in their timeseries files that they have identified and fixed here
This data should be credited to Johns Hopkins University COVID-19 tracking project
This file shows bars and clubs in the New York City MSA. Locations were pulled from multiple data sources. This isn't a full listing of bars in the NYC area, but every bar listed has a user rating. This dataset has been migrated from our Geocommons platform, and lacks a description from the original posting user. This is not a Fortiusone provided dataset. Please keep this in mind, and make of the dataset what you will. Thank you for visiting Finder!
This dataset displays the locations of all the public libraries in the state of New York. The data includes the name of the library, the name of the library system, and the library's address, phone number, and lat/lon coordinates. The data came from publiclibraries.com, which is an updated directory of all the public libraries throughout the United States.
Calls for Service to NYPD's 911 system
This dataset documents entries into the NYPD 911 system, ICAD. The data is collected from the ICAD system, which call takers and dispatchers use to communicate with callers and the NYPD. Each record represents an entry into the system. The data includes entries generated by members of the public as well as self-initiated entries by NYPD Members of Service. The data can be used to explore the issues the NYPD responds to.
This is a dataset of 424 newsstands in New York City compiled from superpages.com. Attributes include name, address, business categories, and phone number. Please note that this is not all of the newsstands in New York City and that some of the newsstands listed may not actually be a newsstand or may no longer be in business.
The New York Times is releasing a series of data files with cumulative counts of coronavirus cases in the United States, at the state and county level, over time. We are compiling this time series data from state and local governments and health departments in an attempt to provide a complete record of the ongoing outbreak.
Since late January, The Times has tracked cases of coronavirus in real time as they were identified after testing. Because of the widespread shortage of testing, however, the data is necessarily limited in the picture it presents of the outbreak.
We have used this data to power our maps and reporting tracking the outbreak, and it is now being made available to the public in response to requests from researchers, scientists and government officials who would like access to the data to better understand the outbreak.
The data begins with the first reported coronavirus case in Washington State on Jan. 21, 2020. We will publish regular updates to the data in this repository.
This dataset shows city-level data for cities across the United States, with attributes for different crimes. Cities are shown as latitude and longitude points. Attributes include murder, manslaughter, violent crimes, arson, motor vehicle theft, property crimes, aggravated assault, burglary, larceny, theft, and rape. Data was provided by the Federal Bureau of Investigation. Source: FBI URL: http://www.fbi.gov/ucr/cius2006/data/table_08.html
This dataset displays the locations of all the adult correctional facilities in the state of New York as of March 2008. This includes both female and male institutions.
U.S. Government Works: https://www.usa.gov/government-works
License information was derived automatically
This dataset contains the number of cases, number of in-hospital/30-day deaths, and observed, expected, and risk-adjusted mortality rates for cardiac surgery and percutaneous coronary interventions (PCI) by hospital. Regions represent where the hospitals are located. The initial Health Data NY dataset includes patients discharged between January 1, 2008, and December 31, 2010. Analyses of risk-adjusted mortality rates and associated risk factors are provided for 2010 and for the three-year period from 2008 through 2010. For PCI, analyses of all cases, non-emergency cases (which represent the majority of procedures), and emergency cases are included. Data from subsequent years' reports will be appended to this dataset. For more information, check out http://www.health.ny.gov/health_care/consumer_information/cardiac_surgery/ or go to the “About” tab.
This dataset was retrieved from the U.S. Department of Justice Federal Bureau of Investigation Criminal Justice Information Services Division website on February 29, 2008. "This table provides the estimated number of offenses and the actual number of offenses reported in the Metropolitan Statistical Areas (MSA), cities outside metropolitan areas, and nonmetropolitan counties, and the rate (per 100,000 inhabitants) for each grouping, and the estimated population for each state." However, to simplify the dataset, I included only the state total and the rate for each state. Data for Puerto Rico was not included. "The FBI collects these data through the Uniform Crime Reporting (UCR) Program". Please see the Data Declaration for further information on the data set. Values of -1 represent no value.