Link Function: information
The NCI DIS 3D database is a collection of 3D structures for over 400,000 drugs. The database is an extension of the NCI Drug Information System. The structural information stored in the DIS is only the connection table for each drug. The connection table is just a list of which atoms are connected and how they are connected. It is essentially a searcheable database of three-dimensional structures has been developed from the chemistry database of the NCI Drug Information System (DIS), a file of about 450,000 primarily organic compounds which have been tested by NCI for anticancer activity. The DIS database is very similar in size and content to the proprietary databases used in the pharmaceutical industry; its development began in the 1950s; and this history led to a number of problems in the generation of 3D structures. This information can be searched to find drugs that share similar patterns of connections, which can correlate with similar biological activity. But the cellular targets for drug action, as well as the drugs themselves, are 3 dimensional objects and advances in computer hardware and software have reached the point where they can be represented as such. In many cases the important points of interaction between a drug and its target can be represented by a 3D arrangement of a small number of atoms. Such a group of atoms is called a pharmacophore. The pharmacophore can be used to search 3D databases and drugs that match the pharmacophore could have similar biological activity, but have very different patterns of atomic connections. Having a diverse set of lead compounds increases the chances of finding an active compound with acceptable properties for clinical development. Sponsor: The ICBG are supported by the Cooperative Agreement mechanism, with funds from nine components of the NIH, the National Science Foundation, and the Foreign Agricultural Service of the USDA.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Users can access data about cancer statistics in the United States including but not limited to searches by type of cancer and race, sex, ethnicity, age at diagnosis, and age at death. Background Surveillance Epidemiology and End Results (SEER) database’s mission is to provide information on cancer statistics to help reduce the burden of disease in the U.S. population. The SEER database is a project to the National Cancer Institute. The SEER database collects information on incidence, prevalence, and survival from specific geographic areas representing 28 percent of the United States population. User functionality Users can access a variety of reso urces. Cancer Stat Fact Sheets allow users to look at summaries of statistics by major cancer type. Cancer Statistic Reviews are available from 1975-2008 in table format. Users are also able to build their own tables and graphs using Fast Stats. The Cancer Query system provides more flexibility and a larger set of cancer statistics than F ast Stats but requires more input from the user. State Cancer Profiles include dynamic maps and graphs enabling the investigation of cancer trends at the county, state, and national levels. SEER research data files and SEER*Stat software are available to download through your Internet connection (SEER*Stat’s client-server mode) or via discs shipped directly to you. A signed data agreement form is required to access the SEER data Data Notes Data is available in different formats depending on which type of data is accessed. Some data is available in table, PDF, and html formats. Detailed information about the data is available under “Data Documentation and Variable Recodes”.
SEER Limited-Use cancer incidence data with associated population data. Geographic areas available are county and SEER registry. The Surveillance, Epidemiology, and End Results (SEER) Program of the National Cancer Institute collects and distributes high quality, comprehensive cancer data from a number of population-based cancer registries. Data include patient demographics, primary tumor site, morphology, stage at diagnosis, first course of treatment, and follow-up for vital status. The SEER Program is the only comprehensive source of population-based information in the United States that includes stage of cancer at the time of diagnosis and survival rates within each stage.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Characteristics of the study population, by age group.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
As of my last update in January 2022, I don't have access to specific real-time datasets, including a specific "US cancer analysis dataset." However, there are several well-known sources where you might find such datasets:
Surveillance, Epidemiology, and End Results (SEER) Program: SEER is a comprehensive source of cancer statistics in the United States, operated by the National Cancer Institute (NCI). They provide a wide range of cancer-related data including incidence, mortality, survival, and population-based data on cancer cases.
National Program of Cancer Registries (NPCR): This program, also managed by the Centers for Disease Control and Prevention (CDC), collects cancer incidence data at the state level.
CDC WONDER: The CDC's Wide-ranging Online Data for Epidemiologic Research (WONDER) platform provides access to a wide array of public health-related datasets, including cancer statistics.
National Cancer Database (NCDB): This database, jointly sponsored by the American College of Surgeons and the American Cancer Society, contains hospital registry data from over 1,500 Commission on Cancer (CoC)-accredited facilities.
National Health Interview Survey (NHIS): While not specific to cancer, the NHIS collects data on health and health-related behaviors, which may include information on cancer screenings, risk factors, and prevalence.
Behavioral Risk Factor Surveillance System (BRFSS): Similar to NHIS, BRFSS collects state-based, cross-sectional data about U.S. residents regarding their health-related risk behaviors, chronic health conditions, and use of preventive services, which may include cancer-related data.
National Health and Nutrition Examination Survey (NHANES): NHANES collects data on the health and nutritional status of a nationally representative sample of the U.S. population through interviews, physical examinations, and laboratory tests, which may include cancer-related information.
When accessing these datasets, it's essential to review their documentation thoroughly to understand the variables available, the methodology of data collection, any limitations or biases, and the terms of use. Additionally, many of these datasets require approval or registration before access is granted.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Characteristics for young breast cancer patients, NCDB 2007–2013.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Overall patient characteristics, time to treatment, and 5-year survival.
The United States Cancer Statistics (USCS) online databases in WONDER provide cancer incidence and mortality data for the United States for the years since 1999, by year, state and metropolitan areas (MSA), age group, race, ethnicity, sex, childhood cancer classifications and cancer site. Report case counts, deaths, crude and age-adjusted incidence and death rates, and 95% confidence intervals for rates. The USCS data are the official federal statistics on cancer incidence from registries having high-quality data and cancer mortality statistics for 50 states and the District of Columbia. USCS are produced by the Centers for Disease Control and Prevention (CDC) and the National Cancer Institute (NCI), in collaboration with the North American Association of Central Cancer Registries (NAACCR). Mortality data are provided by the Centers for Disease Control and Prevention (CDC), National Center for Health Statistics (NCHS), National Vital Statistics System (NVSS).
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains Cancer Incidence data for Breast Cancer (Late Stage^) including: Age-Adjusted Rate, Confidence Interval, Average Annual Count, and Trend field information for US States for the average 5 year span from 2016 to 2020.Data are for females segmented by age (All Ages, Ages Under 50, Ages 50 & Over, Ages Under 65, and Ages 65 & Over), with field names and aliases describing the sex and age group tabulated.For more information, visit statecancerprofiles.cancer.govData NotationsState Cancer Registries may provide more current or more local data.TrendRising when 95% confidence interval of average annual percent change is above 0.Stable when 95% confidence interval of average annual percent change includes 0.Falling when 95% confidence interval of average annual percent change is below 0.† Incidence rates (cases per 100,000 population per year) are age-adjusted to the 2000 US standard population (19 age groups: <1, 1-4, 5-9, ... , 80-84, 85+). Rates are for invasive cancer only (except for bladder cancer which is invasive and in situ) or unless otherwise specified. Rates calculated using SEER*Stat. Population counts for denominators are based on Census populations as modified by NCI. The US Population Data File is used for SEER and NPCR incidence rates.‡ Incidence Trend data come from different sources. Due to different years of data availability, most of the trends are AAPCs based on APCs but some are APCs calculated in SEER*Stat. Please refer to the source for each area for additional information.Rates and trends are computed using different standards for malignancy. For more information see malignant.^ Late Stage is defined as cases determined to be regional or distant. Due to changes in stage coding, Combined Summary Stage (2004+) is used for data from Surveillance, Epidemiology, and End Results (SEER) databases and Merged Summary Stage is used for data from National Program of Cancer Registries databases. Due to the increased complexity with staging, other staging variables maybe used if necessary.Data Source Field Key(1) Source: National Program of Cancer Registries and Surveillance, Epidemiology, and End Results SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Based on the 2022 submission.(5) Source: National Program of Cancer Registries and Surveillance, Epidemiology, and End Results SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Based on the 2022 submission.(6) Source: National Program of Cancer Registries SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention (based on the 2022 submission).(7) Source: SEER November 2022 submission.(8) Source: Incidence data provided by the SEER Program. AAPCs are calculated by the Joinpoint Regression Program and are based on APCs. Data are age-adjusted to the 2000 US standard population (19 age groups: <1, 1-4, 5-9, ... , 80-84,85+). Rates are for invasive cancer only (except for bladder cancer which is invasive and in situ) or unless otherwise specified. Population counts for denominators are based on Census populations as modified by NCI. The US Population Data File is used with SEER November 2022 data.Some data are not available, see Data Not Available for combinations of geography, cancer site, age, and race/ethnicity.Data for the United States does not include data from Nevada.Data for the United States does not include Puerto Rico.
The Australian Cancer Database (ACD) contains all new cases of cancer in Australia since 1 January 1982, (excluding basal cell carcinoma and sqamous cell carcinoma of the skin). All Australian states and teritories have legislation requiring mandatory reporting of new cases of cancer to jurisdictional cancer registries, who supply the data annually to AIHW. The main purposes of the ACD are: - to be the national cancer database and the source of national cancer statistics published by AIHW; - to identify duplicates and advise the state and territory cancer registries of these; - to work with the state and territory cancer registries to standardise collection of cancer data and classification.
https://www.shibatadb.com/license/data/proprietary/v1.0/license.txthttps://www.shibatadb.com/license/data/proprietary/v1.0/license.txt
Yearly citation counts for the publication titled "Radiation therapy dose is associated with improved survival for unresected anaplastic thyroid carcinoma: Outcomes from the National Cancer Data Base".
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Supplemental materials
https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/
The Cancer Moonshot Biobank is a National Cancer Institute initiative to support current and future investigations into drug resistance and sensitivity and other NCI-sponsored cancer research initiatives, with an aim of improving researchers' understanding of cancer and how to intervene in cancer initiation and progression. During the course of this study, biospecimens (blood and tissue removed during medical procedures) and associated data will be collected longitudinally from at least 1000 patients across at least 10 cancer types, who represent the demographic diversity of the U.S. and receiving standard of care cancer treatment at multiple NCI Community Oncology Research Program (NCORP) sites.
This collection contains de-identified radiology and histopathology imaging procured from subjects in NCI’s Cancer Moonshot Biobank - Prostate Cancer (CMB-PCA) cohort. Associated genomic, phenotypic and clinical data will be hosted by The Database of Genotypes and Phenotypes (dbGaP) and other NCI databases. A summary of Cancer Moonshot Biobank imaging efforts can be found on the Cancer Moonshot Biobank Imaging page.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Background: Limited data exists demonstrating the clinical benefit of proton radiotherapy (PRT) in breast cancer. Using the National Cancer Database, we evaluated predictors associated with PRT use for patients with breast cancer. An exploratory analysis also investigates the impact of PRT on overall survival (OS).Methods: Patients with non-metastatic breast cancer treated with adjuvant radiotherapy from 2004 to 2014 were identified. Patients were stratified based on receipt of PRT or non-PRT (i.e., photons ± electrons). A logistic regression model was used to determine predictors for PRT utilization. For OS, Multivariable analysis (MVA) was performed using Cox proportional hazard model.Results: A total of 724,492 patients were identified: 871 received PRT and 723,621 received non-PRT. 58.3% of the PRT patients were group stage 0–1. Median follow-up time was 62.2 months. On multivariate logistic analysis, the following factors were found to be significant for receipt of PRT (all p < 0.05): academic facility (odds ratio [OR] = 2.50), South (OR = 2.01) and West location (OR = 12.43), left-sided (OR = 1.21), ER-positive (OR = 1.59), and mastectomy (OR = 1.47); pT2-T4 disease predicted for decrease use (OR = 0.79). PRT was not associated with OS on MVA for all patients: Hazard Ratio: 0.85, p = 0.168. PRT remained not significant on MVA after stratifying for subsets likely associated with higher heart radiation doses, including: left-sided (p = 0.140), inner-quadrant (p = 0.173), mastectomy (p = 0.095), node positivity (p = 0.680), N2-N3 disease (p = 0.880), and lymph node irradiation (LNI) (p = 0.767).Conclusions: Receipt of PRT was associated with left-sided, ER+ tumors, mastectomy, South and West location, and academic facilities, but not higher group stages or LNI. PRT was not associated with OS, including in subsets likely at risk for higher heart doses. Further studies are required to determine non-OS benefits of PRT. In the interim, given the high cost of protons, only well-selected patients should receive PRT unless enrolled on a clinical trial.
This series of files links two large population-based sources providing detailed data about Medicare beneficiaries with cancer. The SEER (Surveillance, Epidemiology, and End Results) program consists of clinical, demographic, and cause of death information collected from tumor registries beginning in January 1, 1973. The Medicare contribution includes all claims for covered health care services from beneficiaries’ time of eligibility until death. Linkage is processed biennially by SEER and Centers for Medicare and Medicaid Services (CMS) staff. 95% of individuals age 65 and older are included in the SEER files. Due to privacy concerns, access to this database requires an application, SEER-Medicare Data Use Agreement (DUA), and documentation of institutional review board approval. Additionally, the National Cancer Institute’s information technology contractor assesses a processing fee the amount of which is dependent upon the type and number of files requested.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Characteristics of the study population by type of cancer.
https://www.shibatadb.com/license/data/proprietary/v1.0/license.txthttps://www.shibatadb.com/license/data/proprietary/v1.0/license.txt
Yearly citation counts for the publication titled "Survival in patients with primary dermatofibrosarcoma protuberans: National Cancer Database analysis".
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
The Welsh Cancer Intelligence & Surveillance Unit (WCISU) is the National Cancer Registry for Wales and its primary role is to record, store and report on all incidence of cancer for the resident population of Wales wherever they are treated. Cancer registration in Wales began almost five decades ago and today’s electronic database which holds records going back to 1972 contains in the region of 686,000 records.
WCISU collects data about occurrences of cancer in Welsh residents via direct or indirect submissions from Welsh Hospitals.
Staging of malignant melanoma (ICD 10 code C43), breast (C50), colorectal (C18-C20) and cervix (C53) started in 2001 since this was when we started receiving pathological information. Staging for all other cancers started in 2010.
Treatment information started in 1995.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains Cancer Incidence data for Lung Cancer (All Stages^) including: Age-Adjusted Rate, Confidence Interval, Average Annual Count, and Trend field information for US States for the average 5 year span from 2016 to 2020.Data are segmented by sex (Both Sexes, Male, and Female) and age (All Ages, Ages Under 50, Ages 50 & Over, Ages Under 65, and Ages 65 & Over), with field names and aliases describing the sex and age group tabulated.For more information, visit statecancerprofiles.cancer.govData NotationsState Cancer Registries may provide more current or more local data.TrendRising when 95% confidence interval of average annual percent change is above 0.Stable when 95% confidence interval of average annual percent change includes 0.Falling when 95% confidence interval of average annual percent change is below 0.† Incidence rates (cases per 100,000 population per year) are age-adjusted to the 2000 US standard population (19 age groups: <1, 1-4, 5-9, ... , 80-84, 85+). Rates are for invasive cancer only (except for bladder cancer which is invasive and in situ) or unless otherwise specified. Rates calculated using SEER*Stat. Population counts for denominators are based on Census populations as modified by NCI. The US Population Data File is used for SEER and NPCR incidence rates.‡ Incidence Trend data come from different sources. Due to different years of data availability, most of the trends are AAPCs based on APCs but some are APCs calculated in SEER*Stat. Please refer to the source for each area for additional information.Rates and trends are computed using different standards for malignancy. For more information see malignant.^ All Stages refers to any stage in the Surveillance, Epidemiology, and End Results (SEER) summary stage.Data Source Field Key(1) Source: National Program of Cancer Registries and Surveillance, Epidemiology, and End Results SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Based on the 2022 submission.(5) Source: National Program of Cancer Registries and Surveillance, Epidemiology, and End Results SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Based on the 2022 submission.(6) Source: National Program of Cancer Registries SEER*Stat Database - United States Department of Health and Human Services, Centers for Disease Control and Prevention (based on the 2022 submission).(7) Source: SEER November 2022 submission.(8) Source: Incidence data provided by the SEER Program. AAPCs are calculated by the Joinpoint Regression Program and are based on APCs. Data are age-adjusted to the 2000 US standard population (19 age groups: <1, 1-4, 5-9, ... , 80-84,85+). Rates are for invasive cancer only (except for bladder cancer which is invasive and in situ) or unless otherwise specified. Population counts for denominators are based on Census populations as modified by NCI. The US Population Data File is used with SEER November 2022 data.Some data are not available, see Data Not Available for combinations of geography, cancer site, age, and race/ethnicity.Data for the United States does not include data from Nevada.Data for the United States does not include Puerto Rico.
Link Function: information