MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for "vibhorag101/suicide_prediction_dataset_phr"
The dataset contains text with binary labels for suicide or non-suicide.
The dataset was cleaned and following steps were applied
Converted to lowercase
Removed numbers and special characters.
Removed URLs, Emojis and accented characters.
Removed any word contractions.
Remove any extra white spaces and any extra spaces after a single space.
Removed any consecutive characters repeated more than 3 times.
Tokenised the… See the full description on the dataset page: https://huggingface.co/datasets/vibhorag101/suicide_prediction_dataset_phr.
This dataset contains counts of deaths for California counties based on information entered on death certificates. Final counts are derived from static data and include out-of-state deaths to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all deaths that occurred during the time period. Deaths involving injuries from external or environmental forces, such as accidents, homicide and suicide, often require additional investigation that tends to delay certification of the cause and manner of death. This can result in significant under-reporting of these deaths in provisional data.
The final data tables include both deaths that occurred in each California county regardless of the place of residence (by occurrence) and deaths to residents of each California county (by residence), whereas the provisional data table only includes deaths that occurred in each county regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by age, gender, race-ethnicity, and death place type. Deaths due to all causes (ALL) and selected underlying cause of death categories are provided. See temporal coverage for more information on which combinations are available for which years.
The cause of death categories are based solely on the underlying cause of death as coded by the International Classification of Diseases. The underlying cause of death is defined by the World Health Organization (WHO) as "the disease or injury which initiated the train of events leading directly to death, or the circumstances of the accident or violence which produced the fatal injury." It is a single value assigned to each death based on the details as entered on the death certificate. When more than one cause is listed, the order in which they are listed can affect which cause is coded as the underlying cause. This means that similar events could be coded with different underlying causes of death depending on variations in how they were entered. Consequently, while underlying cause of death provides a convenient comparison between cause of death categories, it may not capture the full impact of each cause of death as it does not always take into account all conditions contributing to the death.
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
Fire Incident data includes all fire incident responses. This includes emergency medical services (EMS) calls, fires, rescue incidents, and all other services handled by the Fire Department.
The source of this data is the City of Cincinnati's computer aided dispatch (CAD) database.
This data is updated daily.
DISCLAIMER: In compliance with privacy laws, all Public Safety datasets are anonymized and appropriately redacted prior to publication on the City of Cincinnati’s Open Data Portal. This means that for all public safety datasets: (1) the last two digits of all addresses have been replaced with “XX,” and in cases where there is a single digit street address, the entire address number is replaced with "X"; and (2) Latitude and Longitude have been randomly skewed to represent values within the same block area (but not the exact location) of the incident.
Dataset Card for "vibhorag101/suicide_prediction_dataset_phr"
The dataset contains text with binary labels for suicide or non-suicide. The dataset was cleaned minimally, as BERT depends on contextually sensitive information, which can worsely effect its performance. Removed numbers Removed URLs, Emojis, and accented characters. Remove any extra white spaces and any extra spaces after a single space. Removed any consecutive characters repeated more than 3 times. The rows with more… See the full description on the dataset page: https://huggingface.co/datasets/vibhorag101/phr_suicide_prediction_dataset_clean_light.
Rank, number of deaths, percentage of deaths, and age-specific mortality rates for the leading causes of death, by age group and sex, 2000 to most recent year.
Non-natural deaths of inmates in custody.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for "vibhorag101/suicide_prediction_dataset_phr"
The dataset contains text with binary labels for suicide or non-suicide.
The dataset was cleaned and following steps were applied
Converted to lowercase
Removed numbers and special characters.
Removed URLs, Emojis and accented characters.
Removed any word contractions.
Remove any extra white spaces and any extra spaces after a single space.
Removed any consecutive characters repeated more than 3 times.
Tokenised the… See the full description on the dataset page: https://huggingface.co/datasets/vibhorag101/suicide_prediction_dataset_phr.