SEER Limited-Use cancer incidence data with associated population data. Geographic areas available are county and SEER registry. The Surveillance, Epidemiology, and End Results (SEER) Program of the National Cancer Institute collects and distributes high quality, comprehensive cancer data from a number of population-based cancer registries. Data include patient demographics, primary tumor site, morphology, stage at diagnosis, first course of treatment, and follow-up for vital status. The SEER Program is the only comprehensive source of population-based information in the United States that includes stage of cancer at the time of diagnosis and survival rates within each stage.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains Age-Adjusted Rate, Confidence Interval, Average Annual Count, and Trend field information for US States for the average 5 year span from 2012 to 2016.Data is segmented by sex and age, with fields describing the sex and age group tabulated.For more information, visit statecancerprofiles.cancer.gov Data NotationsState Cancer Registries may provide more current or more local data.† Incidence rates (cases per 100,000 population per year) are age-adjusted to the 2000 US standard population seer.cancer.gov/stdpopulations/stdpop.19ages.html. Rates are for invasive cancer only (except for bladder cancer which is invasive and in situ) or unless otherwise specified. Rates calculated using SEER*Stat. [seer.cancer.gov/seerstat]Population counts for denominators are based on Census populations as modified [seer.cancer.gov/popdata] by NCI. The 1969-2016 US Population Data File [seer.cancer.gov/popdata] is used for SEER and NPCR incidence rates.‡ Incidence data come from different sources. Due to different years of data availability, most of the trends are AAPCs based on APCs but some are APCs calculated in SEER*Stat. Please refer to the source for each area for additional information. Rates and trends are computed using different standards for malignancy. For more information see malignant.html.^ All Stages refers to any stage in the Surveillance, Epidemiology, and End Results (SEER) summary stage [seer.cancer.gov/tools/ssm].Healthy People 2020 Objectives [www.healthypeople.gov]provided by the Centers for Disease Control and Prevention [www.cdc.gov]. Michigan Data do not include cases diagnosed in other states for those states in which the data exchange agreement specifically prohibits the release of data to third parties.Trend Data not available for Nevada.Data Source Field Key:(1) Source: CDC's National Program of Cancer Registries Cancer Surveillance System (NPCR-CSS) November 2018 data submission and SEER November 2018 submission as published in United States Cancer Statistics nccd.cdc.gov/uscs Source: State Cancer Registry and the CDC's National Program of Cancer Registries Cancer Surveillance System (NPCR-CSS) November 2018 data submission. State rates include rates from metropolitan areas funded by SEER [seer.cancer.gov/registries].(6) Source: State Cancer Registry and the CDC's National Program of Cancer Registries Cancer Surveillance System (NPCR-CSS) November 2018 data submission.(7) Source: SEER November 2018 submission.8 Source: Incidence data provided by the SEER Program. [seer.cancer.gov] AAPCs are calculated by the Joinpoint Regression Program [surveillance.cancer.gov/joinpoint] and are based on APCs. Data are age-adjusted to the 2000 US standard population www.seer.cancer.gov/stdpopulations/single_age.html. Rates are for invasive cancer only (except for bladder cancer which is invasive and in situ) or unless otherwise specified. Population counts for denominators are based on Census populations as modified by NCI. The 1969-2017 US Population Data [seer.cancer.gov/popdata] File is used with SEER November 2018 data. Please note that the data comes from different sources. Due to different years [statecancerprofiles.cancer.gov/historicaltrend/differences.html] of data availability, most of the trends are AAPCs based on APCs but some are APCs calculated in SEER*Stat. [seer.cancer.gov/seerstat] Please refer to the source for each graph for additional information. Some data are not available [http://statecancerprofiles.cancer.gov/datanotavailable.html] for combinations of geography, cancer site, age, and race/ethnicity.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
This submission includes publicly available data extracted in its original form. Please reference the Related Publication listed here for source and citation information. Cancer statistics from SEER data by race, age, sex, stage, and cancer subtypes, covering 48% of the U.S. population. If you have questions about the underlying data stored here, please contact the National Cancer Institute at NCIinfo@nih.gov. If you have questions or recommendations related to this metadata entry and extracted data, please contact the CAFE Data Management team at: climatecafe@bu.edu "SEER*Explorer is an interactive website that provides easy access to a wide range of SEER cancer statistics. It provides detailed statistics for a cancer site by sex, race, calendar year, age, and for a selected number of cancer sites, by stage and histology."[Quote from: https://seer.cancer.gov/statistics-network/]
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Users can access data about cancer statistics in the United States including but not limited to searches by type of cancer and race, sex, ethnicity, age at diagnosis, and age at death. Background Surveillance Epidemiology and End Results (SEER) database’s mission is to provide information on cancer statistics to help reduce the burden of disease in the U.S. population. The SEER database is a project to the National Cancer Institute. The SEER database collects information on incidence, prevalence, and survival from specific geographic areas representing 28 percent of the United States population. User functionality Users can access a variety of reso urces. Cancer Stat Fact Sheets allow users to look at summaries of statistics by major cancer type. Cancer Statistic Reviews are available from 1975-2008 in table format. Users are also able to build their own tables and graphs using Fast Stats. The Cancer Query system provides more flexibility and a larger set of cancer statistics than F ast Stats but requires more input from the user. State Cancer Profiles include dynamic maps and graphs enabling the investigation of cancer trends at the county, state, and national levels. SEER research data files and SEER*Stat software are available to download through your Internet connection (SEER*Stat’s client-server mode) or via discs shipped directly to you. A signed data agreement form is required to access the SEER data Data Notes Data is available in different formats depending on which type of data is accessed. Some data is available in table, PDF, and html formats. Detailed information about the data is available under “Data Documentation and Variable Recodes”.
This series of files links two large population-based sources providing detailed data about Medicare beneficiaries with cancer. The SEER (Surveillance, Epidemiology, and End Results) program consists of clinical, demographic, and cause of death information collected from tumor registries beginning in January 1, 1973. The Medicare contribution includes all claims for covered health care services from beneficiaries’ time of eligibility until death. Linkage is processed biennially by SEER and Centers for Medicare and Medicaid Services (CMS) staff. 95% of individuals age 65 and older are included in the SEER files. Due to privacy concerns, access to this database requires an application, SEER-Medicare Data Use Agreement (DUA), and documentation of institutional review board approval. Additionally, the National Cancer Institute’s information technology contractor assesses a processing fee the amount of which is dependent upon the type and number of files requested.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Abstract:
This dataset of breast cancer patients was obtained from the 2017 November update of the SEER Program of the NCI, which provides information on population-based cancer statistics. The dataset involved female patients with infiltrating duct and lobular carcinoma breast cancer (SEER primary cites recode NOS histology codes 8522/3) diagnosed in 2006-2010. Patients with unknown tumor size, examined regional LNs, regional positive LNs, and patients whose survival months were less than 1 month were excluded; thus, 4024 patients were ultimately included.
Inspiration:
This dataset uploaded to U-BRITE for "AI against CANCER DATA SCIENCE HACKATHON"
https://cancer.ubrite.org/hackathon-2021/
Acknowledgements
JING TENG, January 18, 2019, "SEER Breast Cancer Data", IEEE Dataport, doi: https://dx.doi.org/10.21227/a9qy-ph35.
https://ieee-dataport.org/open-access/seer-breast-cancer-data
U-BRITE last update date: 07/21/2021
This dataset tracks the updates made on the dataset "Cancer Incidence - Surveillance, Epidemiology, and End Results (SEER) Registries Limited-Use" as a repository for previous versions of the data and metadata.
Rate: Number of new cases of breast cancer (per 100,000) diagnosed at the regional or distant stage among females.
Definition: Age-adjusted incidence rate of invasive breast cancer per 100,000 female population.
Data Sources:
(1) NJ State Cancer Registry, Dec 31, 2015 Analytic File, using NCI SEER*Stat ver 8.2.1 (www.seer.cancer.gov/seerstat)
(2) NJ population estimates as calculated by the NCI's SEER Program, released January 2015, http://www.seer.cancer.gov/popdata/download.html.
The State Cancer Profiles (SCP) web site provides statistics to help guide and prioritize cancer control activities at the state and local levels. SCP is a collaborative effort using local and national level cancer data from the Centers for Disease Control and Prevention's National Program of Cancer Registries (NPCR) and National Cancer Institute's Surveillance, Epidemiology and End Results Registries (SEER). SCP address select types of cancer and select behavioral risk factors for which there are evidence-based control interventions. The site provides incidence, mortality and prevalence comparison tables as well as interactive graphs and maps and support data. The graphs and maps provide visual support for deciding where to focus cancer control efforts.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
E)Your Dataset
This dataset of breast cancer patients was obtained from the 2017 November update of the SEER Program of the
NCI, which provides information on population-based cancer statistics. The dataset contains the following
attributes:
Table.1 Data Dictionary
Attribute Description
Patient ID Unique identification for each patient
Month of Birth A patient’s month of birth
Age A patient’s month of birth in years
Sex A patient’s genomic sex
Occupation The field of a patient’s job role
T Stage The T stage in breast cancer refers to the size of the tumour from T1, T2,
T3 and T4
N Stage
Used to indicate if the breast cancer has spread to surrounding lymph
nodes (N), with a higher number representing a greater number of lymph
nodes impacted, from N1, N2 and N3.
6th Stage Breast Imaging Reporting and Data System or BI-RADS
Differentiated How the cancer cells look and are growing compared with normal cells.
Grade Breast Cancer Grades (Nottingham Grading System)
A Stage
Breast cancer is staged based on how far it has spread.
Regional: The cancer has spread to nearby lymph nodes or tissues.
Distant: The cancer has spread to distant parts of the body, such as the
lungs, liver, or bones
Tumour Size Tumor size measured in millimeters
Estrogen Status Cancer cells have estrogen hormone receptors or not.
Progesterone Status Cancer cells have progesterone hormone receptors or not.
Regional Node Examined Count of examined regional lymph nodes for cancer spread
Regional Node Positive Count of cancer positive regional lymph nodes to contain metastases
Survival Months Survival months based on date of last contact.
Mortality Status
Any patient that dies after the follow-up cut-off date is recoded to alive
as of the cut-off date. If date of last contact > study cutoff date, vital
status recoded = alive.
Note: For general knowledge, further information about the collection of patients’ data can be found at
https://ieee-dataport.org/open-access/seer-breast-cancer-data
M. A. Aldraimli
5DATA002W.2
2024/2025
The survival calculations can be found at
https://seer.cancer.gov/survivaltime/
https://seer.cancer.gov/survivaltime/SurvivalTimeCalculation.pdf
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains Cancer Incidence data for Breast Cancer (All Stages^) including: Age-Adjusted Rate, Confidence Interval, Average Annual Count, and Trend field information for US States for the average 5 year span from 2016 to 2020.Data are for females segmented by age (All Ages, Ages Under 50, Ages 50 & Over, Ages Under 65, and Ages 65 & Over), with field names and aliases describing the sex and age group tabulated.For more information, visit statecancerprofiles.cancer.govData NotationsState Cancer Registries may provide more current or more local data.TrendRising when 95% confidence interval of average annual percent change is above 0.Stable when 95% confidence interval of average annual percent change includes 0.Falling when 95% confidence interval of average annual percent change is below 0.† Incidence rates (cases per 100,000 population per year) are age-adjusted to the 2000 US standard population (19 age groups: <1, 1-4, 5-9, ... , 80-84, 85+). Rates are for invasive cancer only (except for bladder cancer which is invasive and in situ) or unless otherwise specified. Rates calculated using SEER*Stat. Population counts for denominators are based on Census populations as modified by NCI. The US Population Data File is used for SEER and NPCR incidence rates.‡ Incidence Trend data come from different sources. Due to different years of data availability, most of the trends are AAPCs based on APCs but some are APCs calculated in SEER*Stat. Please refer to the source for each area for additional information.Rates and trends are computed using different standards for malignancy. For more information see malignant.^ All Stages refers to any stage in the Surveillance, Epidemiology, and End Results (SEER) summary stage.Data Source Field Key(1) Source: National Program of Cancer Registries and Surveillance, Epidemiology, and End Results SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Based on the 2022 submission.(5) Source: National Program of Cancer Registries and Surveillance, Epidemiology, and End Results SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Based on the 2022 submission.(6) Source: National Program of Cancer Registries SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention (based on the 2022 submission).(7) Source: SEER November 2022 submission.(8) Source: Incidence data provided by the SEER Program. AAPCs are calculated by the Joinpoint Regression Program and are based on APCs. Data are age-adjusted to the 2000 US standard population (19 age groups: <1, 1-4, 5-9, ... , 80-84,85+). Rates are for invasive cancer only (except for bladder cancer which is invasive and in situ) or unless otherwise specified. Population counts for denominators are based on Census populations as modified by NCI. The US Population Data File is used with SEER November 2022 data.Some data are not available, see Data Not Available for combinations of geography, cancer site, age, and race/ethnicity.Data for the United States does not include data from Nevada.Data for the United States does not include Puerto Rico.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains Cancer Incidence data for Lung Cancer (All Stages^) including: Age-Adjusted Rate, Confidence Interval, Average Annual Count, and Trend field information for US States for the average 5 year span from 2016 to 2020.Data are segmented by sex (Both Sexes, Male, and Female) and age (All Ages, Ages Under 50, Ages 50 & Over, Ages Under 65, and Ages 65 & Over), with field names and aliases describing the sex and age group tabulated.For more information, visit statecancerprofiles.cancer.govData NotationsState Cancer Registries may provide more current or more local data.TrendRising when 95% confidence interval of average annual percent change is above 0.Stable when 95% confidence interval of average annual percent change includes 0.Falling when 95% confidence interval of average annual percent change is below 0.† Incidence rates (cases per 100,000 population per year) are age-adjusted to the 2000 US standard population (19 age groups: <1, 1-4, 5-9, ... , 80-84, 85+). Rates are for invasive cancer only (except for bladder cancer which is invasive and in situ) or unless otherwise specified. Rates calculated using SEER*Stat. Population counts for denominators are based on Census populations as modified by NCI. The US Population Data File is used for SEER and NPCR incidence rates.‡ Incidence Trend data come from different sources. Due to different years of data availability, most of the trends are AAPCs based on APCs but some are APCs calculated in SEER*Stat. Please refer to the source for each area for additional information.Rates and trends are computed using different standards for malignancy. For more information see malignant.^ All Stages refers to any stage in the Surveillance, Epidemiology, and End Results (SEER) summary stage.Data Source Field Key(1) Source: National Program of Cancer Registries and Surveillance, Epidemiology, and End Results SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Based on the 2022 submission.(5) Source: National Program of Cancer Registries and Surveillance, Epidemiology, and End Results SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Based on the 2022 submission.(6) Source: National Program of Cancer Registries SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention (based on the 2022 submission).(7) Source: SEER November 2022 submission.(8) Source: Incidence data provided by the SEER Program. AAPCs are calculated by the Joinpoint Regression Program and are based on APCs. Data are age-adjusted to the 2000 US standard population (19 age groups: <1, 1-4, 5-9, ... , 80-84,85+). Rates are for invasive cancer only (except for bladder cancer which is invasive and in situ) or unless otherwise specified. Population counts for denominators are based on Census populations as modified by NCI. The US Population Data File is used with SEER November 2022 data.Some data are not available, see Data Not Available for combinations of geography, cancer site, age, and race/ethnicity.Data for the United States does not include data from Nevada.Data for the United States does not include Puerto Rico.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Data support a paper of this title:
A Geotemporospatial and Causal Inference Epidemiological Exploration of Substance and Cannabinoid Exposure as Drivers of Rising US Pediatric Cancer Rates
Data represent a compilation of various data inputs from numerous sources including the National Cancer Institute SEER*Stat National Program of Cancer Registries and Surveillance, Epidemiology, and End Results SEER*Stat Database: NPCR and SEER Incidence – U.S. Cancer Statistics Public Use Research Database, 2019 submission (2001-2017), United States Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Released June 2020. Available at www.cdc.gov/cancer/public-use program; the National survey of Drug Use and Health conducted by the Substance Abuse and Mental Health Services Administration; and the US Census bureau.
Data also include inverse probability weights for cannabis exposure.
Data also include their geospatial linkage network constructed for all US states which makes Alaska and Hawaii spatially connected to the contiguous USA.
Data also include the R script used to conduct and prepare the analysis.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains Cancer Incidence data for Prostate Cancer(All Stages^) including: Age-Adjusted Rate, Confidence Interval, Average Annual Count, and Trend field information for US States for the average 5 year span from 2016 to 2020.Data are for males segmented age (All Ages, Ages Under 50, Ages 50 & Over, Ages Under 65, and Ages 65 & Over), with field names and aliases describing the sex and age group tabulated.For more information, visit statecancerprofiles.cancer.govData NotationsState Cancer Registries may provide more current or more local data.TrendRising when 95% confidence interval of average annual percent change is above 0.Stable when 95% confidence interval of average annual percent change includes 0.Falling when 95% confidence interval of average annual percent change is below 0.† Incidence rates (cases per 100,000 population per year) are age-adjusted to the 2000 US standard population (19 age groups: <1, 1-4, 5-9, ... , 80-84, 85+). Rates are for invasive cancer only (except for bladder cancer which is invasive and in situ) or unless otherwise specified. Rates calculated using SEER*Stat. Population counts for denominators are based on Census populations as modified by NCI. The US Population Data File is used for SEER and NPCR incidence rates.‡ Incidence Trend data come from different sources. Due to different years of data availability, most of the trends are AAPCs based on APCs but some are APCs calculated in SEER*Stat. Please refer to the source for each area for additional information.Rates and trends are computed using different standards for malignancy. For more information see malignant.^ All Stages refers to any stage in the Surveillance, Epidemiology, and End Results (SEER) summary stage.Data Source Field Key(1) Source: National Program of Cancer Registries and Surveillance, Epidemiology, and End Results SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Based on the 2022 submission.(5) Source: National Program of Cancer Registries and Surveillance, Epidemiology, and End Results SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Based on the 2022 submission.(6) Source: National Program of Cancer Registries SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention (based on the 2022 submission).(7) Source: SEER November 2022 submission.(8) Source: Incidence data provided by the SEER Program. AAPCs are calculated by the Joinpoint Regression Program and are based on APCs. Data are age-adjusted to the 2000 US standard population (19 age groups: <1, 1-4, 5-9, ... , 80-84,85+). Rates are for invasive cancer only (except for bladder cancer which is invasive and in situ) or unless otherwise specified. Population counts for denominators are based on Census populations as modified by NCI. The US Population Data File is used with SEER November 2022 data.Some data are not available, see Data Not Available for combinations of geography, cancer site, age, and race/ethnicity.Data for the United States does not include data from Nevada.Data for the United States does not include Puerto Rico.
NSSP Emergency Department (ED) Visit Trajectories by State and Sub-State Regions- COVID-19, Flu, RSV, Combined. This dataset provides the percentage of emergency department patient visits for the specified pathogen of all ED patient visits for the specified geographic part of the country that were observed for the given week from data submitted to the National Syndromic Surveillance Program (NSSP). In addition, the trend over time is characterized as increasing, decreasing or no change, with exceptions for when there are no data available, the data are too sparse, or there are not enough data to compute a trend. These data are to provide awareness of how the weekly trend is changing for the given geographic region.
Note that the reported sub-state trends are from Health Service Areas (HSA) and the data reported from the health care facilities located within the given HSA. Health Service Areas are regions of one or more counties that align to patterns of care seeking. The HSA level data are reported for each county in the HSA.
More information on HSAs is available here.
For the emergency department time series, trajectory classifications reported on for sub-state (HSA) emergency department time series, trajectory classifications are based on approximations of the first derivative (slope) of trends that are smoothed using generalized additive models (GAMs). To determine time intervals in which the slope is sufficiently changing (i.e., rate of change distinguishable from 0), 95% confidence intervals for the slope approximations are calculated and assessed. Weeks with a 95% confidence interval not containing 0 are classified as increasing if the slope estimate is positive and decreasing if the slope estimate is negative. Weeks with a 95% confidence interval containing 0 are classified as stable. In the scenario that an HSA's time series is determined to be too sparse (i.e., many weeks with percentages of 0%), a model is not fit, and the HSA is classified as “sparse”.
For additional information, please see: Companion Guide: NSSP Emergency Department Data on Respiratory Illness
Updated once per week on Fridays.
Official employee count for Seer as reported in SEC filings for the period ending 2024-12-31.
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Annual percent change and average annual percent change in age-standardized cancer incidence rates since 1984 to the most recent diagnosis year. The table includes a selection of commonly diagnosed invasive cancers, as well as in situ bladder cancer. Cases are defined using the Surveillance, Epidemiology and End Results (SEER) Groups for Primary Site based on the World Health Organization International Classification of Diseases for Oncology, Third Edition (ICD-O-3) from 1992 to the most recent data year and on the International Classification of Diseases, ninth revision (ICD-9) from 1984 to 1991.
https://www.usa.gov/government-works/https://www.usa.gov/government-works/
COVID-19 Cases and Deaths by Race/Ethnicity
COVID-19 cases and associated deaths that have been reported among Connecticut residents, broken down by race and ethnicity. All data in this report are preliminary; data for previous dates will be updated as new reports are received and data errors are corrected. Deaths reported to either the Office of the Chief Medical Examiner (OCME) or Department of Public Health (DPH) are included in the COVID-19 update.
The following data show the number of COVID-19 cases and associated deaths per 100,000 population by race and ethnicity. Crude rates represent the total cases or deaths per 100,000 people. Age-adjusted rates consider the age of the person at diagnosis or death when estimating the rate and use a standardized population to provide a fair comparison between population groups with different age distributions. Age-adjustment is important in Connecticut as the median age of among the non-Hispanic white population is 47 years, whereas it is 34 years among non-Hispanic blacks, and 29 years among Hispanics. Because most non-Hispanic white residents who died were over 75 years of age, the age-adjusted rates are lower than the unadjusted rates. In contrast, Hispanic residents who died tend to be younger than 75 years of age which results in higher age-adjusted rates.
The age-adjusted rates are directly standardized using the 2018 ASRH Connecticut population estimate denominators (available here: https://portal.ct.gov/DPH/Health-Information-Systems--Reporting/Population/Annual-State--County-Population-with-Demographics).
Rates are standardized to the 2000 US Millions Standard population (data available here: https://seer.cancer.gov/stdpopulations/). Standardization was done using 19 age groups (0, 1-4, 5-9, 10-14, ..., 80-84, 85 years and older). More information about direct standardization for age adjustment is available here: https://www.cdc.gov/nchs/data/statnt/statnt06rv.pdf Categories are mutually exclusive. The category “multiracial” includes people who answered ‘yes’ to more than one race category. Counts may not add up to total case counts as data on race and ethnicity may be missing. Age-adjusted rates calculated only for groups with more than 20 deaths. Abbreviation: NH=Non-Hispanic.
This dataset will be updated on a daily basis. Data are subject to future revision as reporting changes.
Starting in July 2020, this dataset will be updated every weekday.
Additional notes: A delay in the data pull schedule occurred on 06/23/2020. Data from 06/22/2020 was processed on 06/23/2020 at 3:30 PM. The normal data cycle resumed with the data for 06/23/2020.
A network outage on 05/19/2020 resulted in a change in the data pull schedule. Data from 5/19/2020 was processed on 05/20/2020 at 12:00 PM. Data from 5/20/2020 was processed on 5/20/2020 8:30 PM. The normal data cycle resumed on 05/20/2020 with the 8:30 PM data pull. As a result of the network outage, the timestamp on the datasets on the Open Data Portal differs from the timestamp in DPH's daily PDF reports.
Thanks to catalog.data.gov.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Clinical data from the US Surveillance, Epidemiology, and End Results (SEER) Program from 2010-2015 (https://seer.cancer.gov/) was extracted and analyzed as training set, data from 2016-2017 was adopted as internal validation set. Data from The Cancer Genome Atlas Program (TCGA) (https://portal.gdc.cancer.gov/) and prognosis data from Gastrointestinal surgery Department, Third Affiliated Hospital of Sun Yat-sen University were applied as external validation sets.
Screening criteria for gastric cancer cases were as follow: exclusion of cases with only autopsy or death certificate, cases where initial tumor location was not stomach, patients with stage 0 and stage IV, cases without radical surgery, non-adenocarcinoma cases, death cases within one month after operation, and cases with unknown lymph node information and AJCC TNM stage.
The study analyzed various factors such as age of diagnosis (<50 years, 50-69 years, >69 years), gender, race (white, black, other), AJCC T stage (T1-T4b), AJCC TNM stage (I-III), primary tumor location (stomach body, antrum/pylorus, cardia/fundus, greater gastric recurve, lesser gastric recurve, overlapping area, NOS), Clinical features such as tumor size (≥5cm,<5cm, unknown), tumor grade (I-IV), chemotherapy, radiotherapy, number of lymph nodes retrieved and number of metastases, and lymph node positive rate. The populations of American Indian/Alaskan and Asian/Pacific Islander were classified as "other" due to small sample sizes. Tumor grade was also analyzed, with grades I-IV representing highly differentiated, moderately differentiated, poorly differentiated, and signed-ring cell carcinoma, respectively. Overall survival (OS) is the time from cancer diagnosis to death from any cause, while disease-specific survival (DSS) is the time from cancer diagnosis to death specifically due to the disease.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘COVID-19 Cases and Deaths by Race/Ethnicity’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/3fdc6593-c708-4a6a-8073-5ca862caa279 on 27 January 2022.
--- Dataset description provided by original source is as follows ---
COVID-19 cases and associated deaths that have been reported among Connecticut residents, broken down by race and ethnicity. All data in this report are preliminary; data for previous dates will be updated as new reports are received and data errors are corrected. Deaths reported to the either the Office of the Chief Medical Examiner (OCME) or Department of Public Health (DPH) are included in the COVID-19 update.
The following data show the number of COVID-19 cases and associated deaths per 100,000 population by race and ethnicity. Crude rates represent the total cases or deaths per 100,000 people. Age-adjusted rates consider the age of the person at diagnosis or death when estimating the rate and use a standardized population to provide a fair comparison between population groups with different age distributions. Age-adjustment is important in Connecticut as the median age of among the non-Hispanic white population is 47 years, whereas it is 34 years among non-Hispanic blacks, and 29 years among Hispanics. Because most non-Hispanic white residents who died were over 75 years of age, the age-adjusted rates are lower than the unadjusted rates. In contrast, Hispanic residents who died tend to be younger than 75 years of age which results in higher age-adjusted rates.
The population data used to calculate rates is based on the CT DPH population statistics for 2019, which is available online here: https://portal.ct.gov/DPH/Health-Information-Systems--Reporting/Population/Population-Statistics. Prior to 5/10/2021, the population estimates from 2018 were used.
Rates are standardized to the 2000 US Millions Standard population (data available here: https://seer.cancer.gov/stdpopulations/). Standardization was done using 19 age groups (0, 1-4, 5-9, 10-14, ..., 80-84, 85 years and older). More information about direct standardization for age adjustment is available here: https://www.cdc.gov/nchs/data/statnt/statnt06rv.pdf
Categories are mutually exclusive. The category “multiracial” includes people who answered ‘yes’ to more than one race category. Counts may not add up to total case counts as data on race and ethnicity may be missing. Age adjusted rates calculated only for groups with more than 20 deaths. Abbreviation: NH=Non-Hispanic.
Data on Connecticut deaths were obtained from the Connecticut Deaths Registry maintained by the DPH Office of Vital Records. Cause of death was determined by a death certifier (e.g., physician, APRN, medical examiner) using their best clinical judgment. Additionally, all COVID-19 deaths, including suspected or related, are required to be reported to OCME. On April 4, 2020, CT DPH and OCME released a joint memo to providers and facilities within Connecticut providing guidelines for certifying deaths due to COVID-19 that were consistent with the CDC’s guidelines and a reminder of the required reporting to OCME.25,26 As of July 1, 2021, OCME had reviewed every case reported and performed additional investigation on about one-third of reported deaths to better ascertain if COVID-19 did or did not cause or contribute to the death. Some of these investigations resulted in the OCME performing postmortem swabs for PCR testing on individuals whose deaths were suspected to be due to COVID-19, but antemortem diagnosis was unable to be made.31 The OCME issued or re-issued about 10% of COVID-19 death certificates and, when appropriate, removed COVID-19 from the death certificate. For standardization and tabulation of mortality statistics, written cause of death statements made by the certifiers on death certificates are sent to the National Center for Health Statistics (NCHS) at the CDC which assigns cause of death codes according to the International Causes of Disease 10th Revision (ICD-10) classification system.25,26 COVID-19 deaths in this report are defined as those for which the death certificate has an ICD-10 code of U07.1 as either a primary (underlying) or a contributing cause of death. More infor
--- Original source retains full ownership of the source dataset ---
SEER Limited-Use cancer incidence data with associated population data. Geographic areas available are county and SEER registry. The Surveillance, Epidemiology, and End Results (SEER) Program of the National Cancer Institute collects and distributes high quality, comprehensive cancer data from a number of population-based cancer registries. Data include patient demographics, primary tumor site, morphology, stage at diagnosis, first course of treatment, and follow-up for vital status. The SEER Program is the only comprehensive source of population-based information in the United States that includes stage of cancer at the time of diagnosis and survival rates within each stage.