60 datasets found

Agreement Workflow Tool
catalog.data.gov
Updated Mar 8, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Social Security Administration (2025). Agreement Workflow Tool [Dataset]. https://catalog.data.gov/dataset/agreement-workflow-tool
Explore at:
Dataset updated
Mar 8, 2025
Dataset provided by
Social Security Administrationhttp://www.ssa.gov/
Description
AWT is a component of the Verification and Information Exchange Workload System (VIEWS). It is an online application which replicates the reimbursable agreement documents for review, approval and signature. AWT is available within the agency nationwide with users in various components within Headquarters, Office of Central Operations and Regional Offices across the country.
COVID-19 Case Surveillance Restricted Access Detailed Data
data.cdc.gov
data.virginia.gov
+1more
application/rdfxml +5
Updated Nov 20, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CDC Data, Analytics and Visualization Task Force (2020). COVID-19 Case Surveillance Restricted Access Detailed Data [Dataset]. https://data.cdc.gov/w/mbd7-r32t/tdwk-ruhb?cur=IxdiCK2HbvP&from=fR_yXV11V1R
Explore at:
tsv, csv, json, application/rdfxml, application/rssxml, xmlAvailable download formats
Dataset updated
Nov 20, 2020
Dataset provided by
Centers for Disease Control and Preventionhttp://www.cdc.gov/
Authors
CDC Data, Analytics and Visualization Task Force
License
https://www.usa.gov/government-workshttps://www.usa.gov/government-works
Description
Note: Reporting of new COVID-19 Case Surveillance data will be discontinued July 1, 2024, to align with the process of removing SARS-CoV-2 infections (COVID-19 cases) from the list of nationally notifiable diseases. Although these data will continue to be publicly available, the dataset will no longer be updated.

Authorizations to collect certain public health data expired at the end of the U.S. public health emergency declaration on May 11, 2023. The following jurisdictions discontinued COVID-19 case notifications to CDC: Iowa (11/8/21), Kansas (5/12/23), Kentucky (1/1/24), Louisiana (10/31/23), New Hampshire (5/23/23), and Oklahoma (5/2/23). Please note that these jurisdictions will not routinely send new case data after the dates indicated. As of 7/13/23, case notifications from Oregon will only include pediatric cases resulting in death.

This case surveillance publicly available dataset has 33 elements for all COVID-19 cases shared with CDC and includes demographics, geography (county and state of residence), any exposure history, disease severity indicators and outcomes, and presence of any underlying medical conditions and risk behaviors. This dataset requires a registration process and a data use agreement.

CDC has three COVID-19 case surveillance datasets:
COVID-19 Case Surveillance Public Use Data with Geography: Public use, patient-level dataset with clinical data (including symptoms), demographics, and county and state of residence. (19 data elements)
COVID-19 Case Surveillance Public Use Data: Public use, patient-level dataset with clinical and symptom data and demographics, with no geographic data. (12 data elements)
COVID-19 Case Surveillance Restricted Access Detailed Data: Restricted access, patient-level dataset with clinical and symptom data, demographics, and state and county of residence. Access requires a registration process and a data use agreement. (33 data elements)
Requesting Access to the COVID-19 Case Surveillance Restricted Access Detailed Data Please review the following documents to determine your interest in accessing the COVID-19 Case Surveillance Restricted Access Detailed Data file: 1) CDC COVID-19 Case Surveillance Restricted Access Detailed Data: Summary, Guidance, Limitations Information, and Restricted Access Data Use Agreement Information 2) Data Dictionary for the COVID-19 Case Surveillance Restricted Access Detailed Data The next step is to complete the Registration Information and Data Use Restrictions Agreement (RIDURA). Once complete, CDC will review your agreement. After access is granted, Ask SRRG (eocevent394@cdc.gov) will email you information about how to access the data through GitHub. If you have questions about obtaining access, email eocevent394@cdc.gov.
Overview

The COVID-19 case surveillance database includes individual-level data reported to U.S. states and autonomous reporting entities, including New York City and the District of Columbia (D.C.), as well as U.S. territories and affiliates. On April 5, 2020, COVID-19 was added to the Nationally Notifiable Condition List and classified as “immediately notifiable, urgent (within 24 hours)” by a Council of State and Territorial Epidemiologists (CSTE) Interim Position Statement (Interim-20-ID-01). CSTE updated the position statement on August 5, 2020, to clarify the interpretation of antigen detection tests and serologic test results within the case classification (Interim-20-ID-02). The statement also recommended that all states and territories enact laws to make COVID-19 reportable in their jurisdiction, and that jurisdictions conducting surveillance should submit case notifications to CDC. COVID-19 case surveillance data are collected by jurisdictions and reported voluntarily to CDC.

COVID-19 case surveillance data are collected by jurisdictions and are shared voluntarily with CDC. For more information, visit: https://www.cdc.gov/coronavirus/2019-ncov/covid-data/about-us-cases-deaths.html.

The deidentified data in the restricted access dataset include demographic characteristics, state and county of residence, any exposure history, disease severity indicators and outcomes, clinical data, laboratory diagnostic test results, and comorbidities.

All data elements can be found on the COVID-19 case report form located at www.cdc.gov/coronavirus/2019-ncov/downloads/pui-form.pdf.

COVID-19 case reports have been routinely submitted using standardized case reporting forms.

On April 5, 2020, CSTE released an Interim Position Statement with national surveillance case definitions for COVID-19 included. Current versions of these case definitions are available here: https://ndc.services.cdc.gov/case-definitions/coronavirus-disease-2019-2021/.

CSTE updated the position statement on August 5, 2020, to clarify the interpretation of antigen detection tests and serologic test results within the case classification. All cases reported on or after were requested to be shared by public health departments to CDC using the standardized case definitions for lab-confirmed or probable cases.

On May 5, 2020, the standardized case reporting form was revised. Case reporting using this new form is ongoing among U.S. states and territories.

Data are Considered Provisional

The COVID-19 case surveillance data are dynamic; case reports can be modified at any time by the jurisdictions sharing COVID-19 data with CDC. CDC may update prior cases shared with CDC based on any updated information from jurisdictions. For instance, as new information is gathered about previously reported cases, health departments provide updated data to CDC. As more information and data become available, analyses might find changes in surveillance data and trends during a previously reported time window. Data may also be shared late with CDC due to the volume of COVID-19 cases.
Annual finalized data: To create the final NNDSS data used in the annual tables, CDC works carefully with the reporting jurisdictions to reconcile the data received during the year until each state or territorial epidemiologist confirms that the data from their area are correct.

Access Addressing Gaps in Public Health Reporting of Race and Ethnicity for COVID-19, a report from the Council of State and Territorial Epidemiologists, to better understand the challenges in completing race and ethnicity data for COVID-19 and recommendations for improvement.

Data Limitations

To learn more about the limitations in using case surveillance data, visit FAQ: COVID-19 Data and Surveillance.

Data Quality Assurance Procedures

CDC’s Case Surveillance Section routinely performs data quality assurance procedures (i.e., ongoing corrections and logic checks to address data errors). To date, the following data cleaning steps have been implemented:
Questions that have been left unanswered (blank) on the case report form are reclassified to a Missing value, if applicable to the question. For example, in the question "Was the individual hospitalized?" where the possible answer choices include "Yes," "No," or "Unknown," the blank value is recoded to "Missing" because the case report form did not include a response to the question.
Logic checks are performed for date data. If an illogical date has been provided, CDC reviews the data with the reporting jurisdiction. For example, if a symptom onset date in the future is reported to CDC, this value is set to null until the reporting jurisdiction updates the date appropriately.
Additional data quality processing to recode free text data is ongoing. Data on symptoms, race, ethnicity, and healthcare worker status have been prioritized.

Data Suppression

To prevent release of data that could be used to identify people, data cells are suppressed for low frequency (<11 COVID-19 case records with a given values). Suppression includes low frequency combinations of case month, geographic characteristics (county and state of residence), and demographic characteristics (sex, age group, race, and ethnicity). Suppressed values are re-coded to the NA answer option; records with data suppression are never removed.

Additional COVID-19 Data

COVID-19 data are available to the public as summary or aggregate count files, including total counts of cases and deaths by state and by county. These and other COVID-19 data are available from multiple public locations:
g
COVID-19 Case Surveillance Public Use Data
gimi9.com
paperswithcode.com
+6more
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
COVID-19 Case Surveillance Public Use Data [Dataset]. https://gimi9.com/dataset/data-gov_covid-19-case-surveillance-public-use-data
Explore at:
Description
Beginning March 1, 2022, the "COVID-19 Case Surveillance Public Use Data" will be updated on a monthly basis. This case surveillance public use dataset has 12 elements for all COVID-19 cases shared with CDC and includes demographics, any exposure history, disease severity indicators and outcomes, presence of any underlying medical conditions and risk behaviors, and no geographic data. CDC has three COVID-19 case surveillance datasets: COVID-19 Case Surveillance Public Use Data with Geography: Public use, patient-level dataset with clinical data (including symptoms), demographics, and county and state of residence. (19 data elements) COVID-19 Case Surveillance Public Use Data: Public use, patient-level dataset with clinical and symptom data and demographics, with no geographic data. (12 data elements) COVID-19 Case Surveillance Restricted Access Detailed Data: Restricted access, patient-level dataset with clinical and symptom data, demographics, and state and county of residence. Access requires a registration process and a data use agreement. (32 data elements) The following apply to all three datasets: Data elements can be found on the COVID-19 case report form located at www.cdc.gov/coronavirus/2019-ncov/downloads/pui-form.pdf. Data are considered provisional by CDC and are subject to change until the data are reconciled and verified with the state and territorial data providers. Some data cells are suppressed to protect individual privacy. The datasets will include all cases with the earliest date available in each record (date received by CDC or date related to illness/specimen collection) at least 14 days prior to the creation of the previously updated datasets. This 14-day lag allows case reporting to be stabilized and ensures that time-dependent outcome data are accurately captured. Datasets are updated monthly. Datasets are created using CDC’s operational Policy on Public Health Research and Nonresearch Data Management and Access and include protections designed to protect individual privacy. For more information about data collection and reporting, please see https://wwwn.cdc.gov/nndss/data-collection.html For more information about the COVID-19 case surveillance data, please see https://www.cdc.gov/coronavirus/2019-ncov/covid-data/faq-surveillance.html Overview The COVID-19 case surveillance database includes individual-level data reported to U.S. states and autonomous reporting entities, including New York City and the District of Columbia (D.C.), as well as U.S. territories and affiliates. On April 5, 2020, COVID-19 was added to the Nationally Notifiable Condition List and classified as “immediately notifiable, urgent (within 24 hours)” by a Council of State and Territorial Epidemiologists (CSTE) Interim Position Statement (Interim-20-ID-01). CSTE updated the position statement on August 5, 2020 to clarify the interpretation of antigen detection tests and serologic test results within the case classification. The statement also recommended that all states and territories enact laws to make COVID-19 reportable in their jurisdiction, and that jurisdictions conducting surveillance should submit case notifications to CDC. COVID-19 case surveillance data are collected by jurisdictions and reported volun
COVID-19 Case Surveillance Restricted Access Detailed Data
catalog.data.gov
s.cnmilf.com
Updated May 2, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Centers for Disease Control and Prevention (2021). COVID-19 Case Surveillance Restricted Access Detailed Data [Dataset]. https://catalog.data.gov/dataset/covid-19-case-surveillance-restricted-access-detailed-data-63ce4
Explore at:
Dataset updated
May 2, 2021
Dataset provided by
Centers for Disease Control and Preventionhttp://www.cdc.gov/
Description
This case surveillance publicly available dataset has 32 elements for all COVID-19 cases shared with CDC and includes demographics, geography (county and state of residence), any exposure history, disease severity indicators and outcomes, and presence of any underlying medical conditions and risk behaviors. This dataset requires a registration process and a data use agreement. CDC has three COVID-19 case surveillance datasets: COVID-19 Case Surveillance Public Use Data with Geography: Public use, patient-level dataset with clinical data (including symptoms), demographics, and county and state of residence. (19 data elements) COVID-19 Case Surveillance Public Use Data: Public use, patient-level dataset with clinical and symptom data and demographics, with no geographic data. (12 data elements) COVID-19 Case Surveillance Restricted Access Data: Restricted access, patient-level dataset with clinical (including symptoms), demographics, and county and state of residence. Access requires a registration process and a data use agreement. (32 data elements) Requesting Access to the COVID-19 Case Surveillance Restricted Access Detailed Data Please review the following documents to determine your interest in accessing the COVID-19 Case Surveillance Restricted Access Detailed Data file: 1) CDC COVID-19 Case Surveillance Restricted Access Detailed Data: Summary, Guidance, Limitations Information, and Restricted Access Data Use Agreement Information 2) Data Dictionary for the COVID-19 Case Surveillance Restricted Access Detailed Data The next step is to complete the Registration Information and Data Use Restrictions Agreement (RIDURA). Once complete, CDC will review your agreement. After access is granted, Ask SRRG (eocevent394@cdc.gov) will email you information about how to access the data through GitHub. If you have questions about obtaining access, email eocevent394@cdc.gov. Overview The COVID-19 case surveillance database includes patient-level data reported by U.S. states and autonomous reporting entities, including New York City, the District of Columbia, as well as U.S. territories and affiliates. On April 5, 2020, COVID-19 was added to the Nationally Notifiable Condition List and classified as “immediately notifiable, urgent (within 24 hours)” by a Council of State and Territorial Epidemiologists (CSTE) Interim Position Statement (Interim-20-ID-01). CSTE updated the position statement on August 5, 2020, to clarify the interpretation of antigen detection tests and serologic test results within the case classification. The statement also recommended that all states and territories enact laws to make COVID-19 reportable in their jurisdiction, and that jurisdictions conducting surveillance should submit case notifications to CDC. COVID-19 case surveillance data are collected by jurisdictions and are shared voluntarily with CDC. For more information, visit: <a href="https://wwwn.cdc.gov/nndss/conditions/coronavirus-disease-2019-c
e
Simple download service (Atom) of the dataset: Signed contracts for the...
data.europa.eu
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Simple download service (Atom) of the dataset: Signed contracts for the engagement of CAD/CTE linear elements — Department of Tarn-et-Garonne [Dataset]. https://data.europa.eu/data/datasets/fr-120066022-srv-de473d67-28d0-4fd2-a343-d724bb374506
Explore at:
inspire download serviceAvailable download formats
Description
Linear elements committed to environmental actions of a signed DAC (Sustainable Agriculture Contract), contracts related to the 2006-2010 programme, these contracts were completed in 2012. This layer constitutes the localisation plan of linear elements engaged in signed CADs or in the process of being signed. These data are imported from CartoCAD, the ADASEA input software, according to a frequency and mode of shipments defined locally between the DDAF and ADASEA. — Elements intended to be used for “business” uses in ACTE, and in PacDdaf to term Their perennial and legal nature, however, allows for other uses, within the limits of rights and restrictions of use. — Consolidated description sheet on the occasion of the June 2005 meeting of the CNV
d
ESS-DIVE Reporting Format for Dataset Package Metadata
search.dataone.org
knb.ecoinformatics.org
+1more
Updated Jun 11, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Deb Agarwal; Shreyas Cholia; Valerie C. Hendrix; Robert Crystal-Ornelas; Cory Snavely; Joan Damerow; Charuleka Varadharajan (2022). ESS-DIVE Reporting Format for Dataset Package Metadata [Dataset]. http://doi.org/10.15485/1866026
Explore at:
Unique identifier
https://doi.org/10.15485/1866026
Dataset updated
Jun 11, 2022
Dataset provided by
ESS-DIVE
Authors
Deb Agarwal; Shreyas Cholia; Valerie C. Hendrix; Robert Crystal-Ornelas; Cory Snavely; Joan Damerow; Charuleka Varadharajan
Time period covered
Jan 1, 2017
Description
ESS-DIVE’s (Environmental Systems Science Data Infrastructure for a Virtual Ecosystem) dataset metadata reporting format is intended to compile information about a dataset (e.g., title, description, funding sources) that can enable reuse of data submitted to the ESS-DIVE data repository. The files contained in this dataset include instructions (dataset_metadata_guide.md and README.md) that can be used to understand the types of metadata ESS-DIVE collects. The data dictionary (dd.csv) follows ESS-DIVE’s file-level metadata reporting format and includes brief descriptions about each element of the dataset metadata reporting format. This dataset also includes a terminology crosswalk (dataset_metadata_crosswalk.csv) that shows how ESS-DIVE’s metadata reporting format maps onto other existing metadata standards and reporting formats. Data contributors to ESS-DIVE can provide this metadata by manual entry using a web form or programmatically via ESS-DIVE’s API (Application Programming Interface). A metadata template (dataset_metadata_template.docx or dataset_metadata_template.pdf) can be used to collaboratively compile metadata before providing it to ESS-DIVE. Since being incorporated into ESS-DIVE’s data submission user interface, ESS-DIVE’s dataset metadata reporting format, has enabled features like automated metadata quality checks, and dissemination of ESS-DIVE datasets onto other data platforms including Google Dataset Search and DataCite.
The Phantom of Bern: repeated scans of two volunteers with eight different...
openneuro.org
Updated Apr 26, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
M. Rebsamen; D. Romascano; M. Capiglioni; R. Wiest; P. Radojewski; C. Rummel (2023). The Phantom of Bern: repeated scans of two volunteers with eight different combinations of MR sequence parameters [Dataset]. http://doi.org/10.18112/openneuro.ds004560.v1.0.0
Explore at:
Unique identifier
https://doi.org/10.18112/openneuro.ds004560.v1.0.0
Dataset updated
Apr 26, 2023
Dataset provided by
OpenNeurohttps://openneuro.org/
Authors
M. Rebsamen; D. Romascano; M. Capiglioni; R. Wiest; P. Radojewski; C. Rummel
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
Bern
Description
The Phantom of Bern: repeated scans of two volunteers with eight different combinations of MR sequence parameters

The Phantom of Bern consists of eight same-session re-scans of T1-weighted MRI with different combinations of sequence parameters, acquired on two healthy subjects. The subjects have agreed in writing to the publication of these data, including the original anonymized DICOM files and waving the requirement of defacing. Usage is permitted under the terms of the data usage agreement stated below.

CONTENT

The BIDS directory is organized as follows:

└── PhantomOfBern/ ├─ code/ │ ├─ derivatives/ │ ├─ dldirect_v1-0-0/ │ │ ├─ results/ # Folder with flattened subject/session inputs and outputs of DL+DiReCT │ │ └─ stats2table/ # Folder with tables summarizing all DL+DiReCT outputs │ ├─ freesurfer_v6-0-0/ │ │ ├─ results/ # Folder with flattened subject/session inputs and outputs of freesurfer │ │ └─ stats2table/ # Folder with tables summarizing all freesurfer outputs │ └─ siena_v2-6/ │ ├─ SIENA_results.csv # Siena's main output │ └─ ... # Flattened subject/session inputs and outputs of SIENA │ ├─ sourcedata/ │ ├─ POBHC0001/ │ │ └─ 17473A/ │ │ └─ ... # Anonymized DICOM folders │ └─ POBHC0002/ │ └─ 14610A/ │ └─ ... # Anonymized DICOM folders │ ├─ sub-<label>/ │ └─ ses-<label>/ │ └─ anat/ # Folder with scan's json and nifti files ├─ ...

ACKNOWLEDGEMENT

The dataset can be cited as:

M. Rebsamen, D. Romascano, M. Capiglioni, R. Wiest, P. Radojewski, C. Rummel. The Phantom of Bern: repeated scans of two volunteers with eight different combinations of MR sequence parameters. OpenNeuro, 2023.

If you use these data, please also cite the original paper:

M. Rebsamen, M. Capiglioni, R. Hoepner, A. Salmen, R. Wiest, P. Radojewski, C. Rummel. Growing importance of brain morphometry analysis in the clinical routine: The hidden impact of MR sequence parameters. Journal of Neuroradiology, 2023.

DATA USE AGREEMENT

The Phantom of Bern is distributed under the following terms, to which you agree by downloading and/or using the dataset:

Any intentional identification of a subject or disclosure of his or her confidential information violates the promise of confidentiality given to the providers of the information. Therefore, all users of the dataset agree:

To use these datasets solely for research and development or statistical purposes and not for investigation of specific subjects

To make no use of the identity of any subject discovered inadvertently, and to advise the providers of any such discovery (crummel@web.de)

When publicly presenting any results or algorithms that benefited from the use of the Phantom of Bern, you should acknowledge it, see above. Papers, book chapters, books, posters, oral presentations, and all other printed and digital presentations of results derived from the Phantom of Bern data should cite the publications listed above.

Redistribution of data (complete or in parts) in any manner without explicit inclusion of this data use agreement is prohibited.

Usage of the data for testing commercial tools is explicitly allowed. Usage for military purposes is prohibited.

The original collector and provider of the data (see acknowledgement) and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.

FUNDING

This work was supported by the Swiss National Science Foundation under grant numbers 204593 (ScanOMetrics) and CRSII5_180365 (The Swiss-First Study).
HCUP State Emergency Department Databases (SEDD) - Restricted Access File
catalog.data.gov
healthdata.gov
+1more
Updated Feb 22, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Agency for Healthcare Research and Quality, Department of Health & Human Services (2025). HCUP State Emergency Department Databases (SEDD) - Restricted Access File [Dataset]. https://catalog.data.gov/dataset/hcup-state-emergency-department-databases-sedd-restricted-access-file
Explore at:
Dataset updated
Feb 22, 2025
Dataset provided by
Agency for Healthcare Research and Qualityhttp://www.ahrq.gov/
United States Department of Health and Human Serviceshttp://www.hhs.gov/
Description
The Healthcare Cost and Utilization Project (HCUP) State Emergency Department Databases (SEDD) contain the universe of emergency department visits in participating States. The data are translated into a uniform format to facilitate multi-State comparisons and analyses. The SEDD consist of data from hospital-based emergency department visits that do not result in an admission. The SEDD include all patients, regardless of the expected payer including but not limited to Medicare, Medicaid, private insurance, self-pay, or those billed as ‘no charge’. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality (AHRQ), HCUP data inform decision making at the national, State, and community levels. The SEDD contain clinical and resource use information included in a typical discharge abstract, with safeguards to protect the privacy of individual patients, physicians, and facilities (as required by data sources). Data elements include but are not limited to: diagnoses, procedures, admission and discharge status, patient demographics (e.g., sex, age, race), total charges, length of stay, and expected payment source, including but not limited to Medicare, Medicaid, private insurance, self-pay, or those billed as ‘no charge’. In addition to the core set of uniform data elements common to all SEDD, some include State-specific data elements. The SEDD exclude data elements that could directly or indirectly identify individuals. For some States, hospital and county identifiers are included that permit linkage to the American Hospital Association Annual Survey File and the Bureau of Health Professions' Area Resource File except in States that do not allow the release of hospital identifiers. Restricted access data files are available with a data use agreement and brief online security training.
HCUP State Inpatient Databases (SID) - Restricted Access File
catalog.data.gov
healthdata.gov
+2more
Updated Feb 22, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Agency for Healthcare Research and Quality, Department of Health & Human Services (2025). HCUP State Inpatient Databases (SID) - Restricted Access File [Dataset]. https://catalog.data.gov/dataset/hcup-state-inpatient-databases-sid-restricted-access-file
Explore at:
Dataset updated
Feb 22, 2025
Dataset provided by
Agency for Healthcare Research and Qualityhttp://www.ahrq.gov/
United States Department of Health and Human Serviceshttp://www.hhs.gov/
Description
The Healthcare Cost and Utilization Project (HCUP) State Inpatient Databases (SID) are a set of hospital databases that contain the universe of hospital inpatient discharge abstracts from data organizations in participating States. The data are translated into a uniform format to facilitate multi-State comparisons and analyses. The SID are based on data from short term, acute care, nonfederal hospitals. Some States include discharges from specialty facilities, such as acute psychiatric hospitals. The SID include all patients, regardless of payer and contain clinical and resource use information included in a typical discharge abstract, with safeguards to protect the privacy of individual patients, physicians, and hospitals (as required by data sources). Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality (AHRQ), HCUP data inform decision making at the national, State, and community levels. The SID contain clinical and resource-use information that is included in a typical discharge abstract, with safeguards to protect the privacy of individual patients, physicians, and hospitals (as required by data sources). Data elements include but are not limited to: diagnoses, procedures, admission and discharge status, patient demographics (e.g., sex, age), total charges, length of stay, and expected payment source, including but not limited to Medicare, Medicaid, private insurance, self-pay, or those billed as ‘no charge’. In addition to the core set of uniform data elements common to all SID, some include State-specific data elements. The SID exclude data elements that could directly or indirectly identify individuals. For some States, hospital and county identifiers are included that permit linkage to the American Hospital Association Annual Survey File and county-level data from the Bureau of Health Professions' Area Resource File except in States that do not allow the release of hospital identifiers. Restricted access data files are available with a data use agreement and brief online security training.
HCUP Nationwide Readmissions Database (NRD)- Restricted Access Files
catalog.data.gov
healthdata.gov
+2more
Updated Jul 26, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Agency for Healthcare Research and Quality, Department of Health & Human Services (2023). HCUP Nationwide Readmissions Database (NRD)- Restricted Access Files [Dataset]. https://catalog.data.gov/dataset/healthcare-cost-and-utilization-project-nationwide-readmissions-database-nrd
Explore at:
Dataset updated
Jul 26, 2023
Dataset provided by
Agency for Healthcare Research and Qualityhttp://www.ahrq.gov/
United States Department of Health and Human Serviceshttp://www.hhs.gov/
Description
The Healthcare Cost and Utilization Project (HCUP) Nationwide Readmissions Database (NRD) is a unique and powerful database designed to support various types of analyses of national readmission rates for all payers and the uninsured. The NRD includes discharges for patients with and without repeat hospital visits in a year and those who have died in the hospital. Repeat stays may or may not be related. The criteria to determine the relationship between hospital admissions is left to the analyst using the NRD. This database addresses a large gap in health care data - the lack of nationally representative information on hospital readmissions for all ages. Outcomes of interest include national readmission rates, reasons for returning to the hospital for care, and the hospital costs for discharges with and without readmissions. Unweighted, the NRD contains data from approximately 18 million discharges each year. Weighted, it estimates roughly 35 million discharges. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality, HCUP data inform decision making at the national, State, and community levels. The NRD is drawn from HCUP State Inpatient Databases (SID) containing verified patient linkage numbers that can be used to track a person across hospitals within a State, while adhering to strict privacy guidelines. The NRD is not designed to support regional, State-, or hospital-specific readmission analyses. The NRD contains more than 100 clinical and non-clinical data elements provided in a hospital discharge abstract. Data elements include but are not limited to: diagnoses, procedures, patient demographics (e.g., sex, age), expected source of payer, regardless of expected payer, including but not limited to Medicare, Medicaid, private insurance, self-pay, or those billed as ‘no charge, discharge month, quarter, and year, total charges, length of stay, and data elements essential to readmission analyses. The NIS excludes data elements that could directly or indirectly identify individuals. Restricted access data files are available with a data use agreement and brief online security training.
d
Data from: Purchase Orders and Contracts
catalog.data.gov
data.brla.gov
+1more
Updated Mar 22, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Purchase Orders and Contracts [Dataset]. https://catalog.data.gov/dataset/purchase-orders-and-contracts
Explore at:
Dataset updated
Mar 22, 2025
Dataset provided by
data.brla.gov
Description
Listing of all purchase orders and contracts issued to procure goods and/or services within City-Parish. In the City-Parish, a PO/Contract is made up of two components: a header and one or many detail items that comprise the overarching PO/Contract. The header contains information that pertains to the entire PO/Contract. This includes, but is not limited to, the total amount of the PO/Contract, the department requesting the purchase and the vendor providing the goods or services. The detail item(s) contain information that is specific to the individual item ordered or service procured through the PO/Contract. The item/service description, item/service quantity and the cost of the item is located within the PO/Contract details. There may be one or many detail items on an individual PO/Contract. For example, a Purchase Order for a computer equipment may include three items: the computer, the monitor and the base software package. Both header information and detail item information are included in this dataset in order to provide a comprehensive view of the PO/Contract data. The Record Type field indicates whether the record is a header record (H) or detail item record (D). In the computer purchase example from above, the system would display 4 records – one header record and 3 detail item records. It should be noted header information will be duplicated on all detail items. No detail item information will be displayed on the header record. ***In October of 2017, the City-Parish switched to a new system used to track PO/Contracts. This data contains all PO/Contracts entered in or after October 2017. For prior year data, please see the Legacy Purchase Order dataset https://data.brla.gov/Government/Legacy-Purchase-Orders/54bn-2sqf
e
Simple download service (Atom) of the dataset: Agriculture — Contracts...
data.europa.eu
wfs
Updated Mar 28, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2019). Simple download service (Atom) of the dataset: Agriculture — Contracts signed for ad hoc CAD/CTE elements in Loir-et-Cher [Dataset]. https://data.europa.eu/data/datasets/fr-120066022-srv-46eb2faa-9baa-4f29-a68d-e31cc7c07f0f
Explore at:
wfsAvailable download formats
Dataset updated
Mar 28, 2019
Area covered
Loir-et-Cher
Description
This layer constitutes the localisation plan of linear elements engaged in signed CADs or in the process of being signed.
Health Insurance Marketplace
kaggle.com
zip
Updated May 1, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
US Department of Health and Human Services (2017). Health Insurance Marketplace [Dataset]. https://www.kaggle.com/hhs/health-insurance-marketplace
Explore at:
zip(868821924 bytes)Available download formats
Dataset updated
May 1, 2017
Dataset authored and provided by
US Department of Health and Human Services
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
The Health Insurance Marketplace Public Use Files contain data on health and dental plans offered to individuals and small businesses through the US Health Insurance Marketplace.

Exploration Ideas

To help get you started, here are some data exploration ideas:

How do plan rates and benefits vary across states?

How do plan benefits relate to plan rates?

How do plan rates vary by age?

How do plans vary across insurance network providers?

See this forum thread for more ideas, and post there if you want to add your own ideas or answer some of the open questions!

Data Description

This data was originally prepared and released by the Centers for Medicare & Medicaid Services (CMS). Please read the CMS Disclaimer-User Agreement before using this data.

Here, we've processed the data to facilitate analytics. This processed version has three components:

1. Original versions of the data

The original versions of the 2014, 2015, 2016 data are available in the "raw" directory of the download and "../input/raw" on Kaggle Scripts. Search for "dictionaries" on this page to find the data dictionaries describing the individual raw files.

2. Combined CSV files that contain

In the top level directory of the download ("../input" on Kaggle Scripts), there are six CSV files that contain the combined at across all years:

BenefitsCostSharing.csv

BusinessRules.csv

Network.csv

PlanAttributes.csv

Rate.csv

ServiceArea.csv

Additionally, there are two CSV files that facilitate joining data across years:

Crosswalk2015.csv - joining 2014 and 2015 data

Crosswalk2016.csv - joining 2015 and 2016 data

3. SQLite database

The "database.sqlite" file contains tables corresponding to each of the processed CSV files.

The code to create the processed version of this data is available on GitHub.
National Inpatient Sample (NIS) - Restricted Access Files
catalog.data.gov
data.virginia.gov
+1more
Updated Feb 22, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Agency for Healthcare Research and Quality, Department of Health & Human Services (2025). National Inpatient Sample (NIS) - Restricted Access Files [Dataset]. https://catalog.data.gov/dataset/hcup-national-nationwide-inpatient-sample-nis-restricted-access-file
Explore at:
Dataset updated
Feb 22, 2025
Dataset provided by
Agency for Healthcare Research and Qualityhttp://www.ahrq.gov/
United States Department of Health and Human Serviceshttp://www.hhs.gov/
Description
The Healthcare Cost and Utilization Project (HCUP) National Inpatient Sample (NIS) is the largest publicly available all-payer inpatient care database in the United States. The NIS is designed to produce U.S. regional and national estimates of inpatient utilization, access, cost, quality, and outcomes. Unweighted, it contains data from more than 7 million hospital stays each year. Weighted, it estimates more than 35 million hospitalizations nationally. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality (AHRQ), HCUP data inform decision making at the national, State, and community levels. Starting with the 2012 data year, the NIS is a sample of discharges from all hospitals participating in HCUP, covering more than 97 percent of the U.S. population. For prior years, the NIS was a sample of hospitals. The NIS allows for weighted national estimates to identify, track, and analyze national trends in health care utilization, access, charges, quality, and outcomes. The NIS's large sample size enables analyses of rare conditions, such as congenital anomalies; uncommon treatments, such as organ transplantation; and special patient populations, such as the uninsured. NIS data are available since 1988, allowing analysis of trends over time. The NIS inpatient data include clinical and resource use information typically available from discharge abstracts with safeguards to protect the privacy of individual patients, physicians, and hospitals (as required by data sources). Data elements include but are not limited to: diagnoses, procedures, discharge status, patient demographics (e.g., sex, age), total charges, length of stay, and expected payment source, including but not limited to Medicare, Medicaid, private insurance, self-pay, or those billed as ‘no charge’. The NIS excludes data elements that could directly or indirectly identify individuals. Restricted access data files are available with a data use agreement and brief online security training.
d
Tsunami Model Boundaries - Dataset - data.govt.nz - discover and use data
catalogue.data.govt.nz
Updated Jan 31, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Tsunami Model Boundaries - Dataset - data.govt.nz - discover and use data [Dataset]. https://catalogue.data.govt.nz/dataset/tsunami-model-boundaries
Explore at:
Dataset updated
Jan 31, 2024
License
Attribution-NoDerivs 4.0 (CC BY-ND 4.0)https://creativecommons.org/licenses/by-nd/4.0/
License information was derived automatically
Description
Purpose: This data has been published to support the Waikato Regional Hazards Portal. This data is part of the Coastal hazards theme.Statement: Model boundaries for the maximum credible event tsunami modelling carried out for the Coromandel (published between 2013 and 2017), Firth of Thames (published 2019) and Opoutere, Kawhia Harbour, Marokopa, Awakino and Mokau (published 2022), by eCoast limited. Preliminary modelling (to show tsunami susceptible areas) carried out for other populated parts of the West Coast (published 2015), by eCoast limited. The model boundaries show which areas were included in the modelling. Areas that are outside of the tsunami model boundaries have not been modelled but may still be at risk of tsunami inundation. For the Coromandel, Firth of Thames, Kawhia Harbour, Marokopa, Awakino and Mokau the modelling was carried out using the Community Model Interface for Tsunamis (ComMIT) numerical modelling tool. For the remaining parts of the West Coast, only preliminary modelling and investigation has been undertaken (as the preliminary results showed that wave heights are likely to be relatively small, and there will likely be time for an official tsunami warning). For more information on tsunami evacuation, including Community Response Plans, the Emergency Management staff for the relevant District Council should be contacted. More detailed information on the numerical modelling carried out by eCoast Limited can be found on the Waikato Regional Council tsunami webpages.WRC Spatial Information carried out GIS work in 2022 and 2023 to create the bounding boxes representing the model boundaries for five new models for Port Charles, Opoutere and Wharekawa Harbour, Kawhia Harbour, Marokopa, and Mokau/Awakino. This work also involved splitting out the new model areas from existing preliminary model areas where they overlapped for Kawhia Harbour, Marokopa, and Mokau/Awakino.Intended Use Scale: Data should not be used at scales greater than 1:1,000Data Use Limits:As per statement above and disclaimers below.Data Specific Disclaimers:Any data sets or maps derived from this information (either published or unpublished) should comply with and contain the following disclaimer: “This map has been prepared using data supplied by the eCoast under contract to Waikato Regional Council. Unless otherwise agreed in writing, all liability of eCoast to any party other than Waikato Regional Council in respect of the map is expressly excluded”."The presence of a hazard zone on this map does not guarantee the existence of such a hazard, nor does the lack of information preclude the existence of a hazard or risk. Waikato Regional Council, while providing the information in good faith, accepts no responsibility for any loss, damage, injury, or loss in value of any person, property, service or otherwise resulting from hazards or knowledge of hazards in the Waikato Region."The standard Waikato Regional Council Disclaimer applies to this data “While Waikato Regional Council has exercised all reasonable skill and care in controlling the contents of this information, Waikato Regional Council accepts no liability in contract, tort or otherwise howsoever, for any loss, damage, injury or expense (whether direct, indirect or consequential) arising out of the provision of this information or its use by you.”WRC Metadata: HAZARD.sdeadmin.HAZ_TSUNAMI_MODEL_BOUNDARIES - HAZARDS - Regional Hazards Portal - GIS LayersTerms of Use:This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License.Should users wish to distribute or publish derivatives or modified versions of this data then they must seek a data use agreement through WRC first.See the Waikato Regional Hazards Portal Terms of use here Waikato Regional Hazards Portal - Terms of use | Waikato Regional Council.
CanadaBuys contract history
ouvert.canada.ca
open.canada.ca
csv, xml
Updated Mar 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Public Services and Procurement Canada (2025). CanadaBuys contract history [Dataset]. https://ouvert.canada.ca/data/dataset/4fe645a1-ffcd-40c1-9385-2c771be956a4
Explore at:
csv, xmlAvailable download formats
Dataset updated
Mar 1, 2025
Dataset provided by
Public Services and Procurement Canadahttp://www.pwgsc.gc.ca/
License
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Description
This dataset contains the history of contracts awarded by Public Works and Government Services Canada (PWGSC), published online since January 2009 on behalf of federal departments and agencies. It replaces and builds upon the original Contract History dataset, which has been archived and will no longer be updated. CanadaBuys became the authoritative source of this information as of June 2023. Visit the How procurement works page on the CanadaBuys website to learn more. And to browse related information online, see the Contract history tab of the CanadaBuys Tender opportunities page. Additional supporting documentation was created for updates to the CanadaBuys tender notices and CanadaBuys awards notices datasets. This page can be referenced for additional context to the update of this dataset as well. >NOTE: While the CanadaBuys online portal includes contract history from across multiple levels of government, the data files in this related dataset only include contracts associated with federal government organizations. --- This dataset is a compilation of information from multiple procurement systems. Not all of these data sources capture information in the same format, so some data fields will remain blank where a particular system does not provide that information. The following list describes the resources associated with this CanadaBuys contract history dataset. --- (1) CanadaBuys data dictionary: This XML file offers a description of each data field contained in the contract history files linked below, as well as other related procurement datasets published by CanadaBuys. Use this as a guide for understanding the data elements in these files. This dictionary is updated as needed to reflect changes to the data elements. (2) All CanadaBuys contract history, 2023-06-01 onwards: This file contains up to date information on all contracts awarded after the implementation of CanadaBuys. This includes any contracts initially awarded on or after June 1, 2023, when CanadaBuys became the system of record for the consolidated Contract History for the Government of Canada. This file is refreshed monthly, generally in the 3rd week of each month to include any updates or amendments from the preceding month. (3) Contract history, YYYY-YYYY: These files contain the history of any contract initially awarded in a given fiscal year (and any of its future amendments). A fiscal year for the Government of Canada starts on April 1 until March 31 of the subsequent year. These files are refreshed monthly, generally in the 3rd week of the month, to include any new updates or amendments. Contract history records in these files can have any initial award date between April 1 of a given year and March 31 of the subsequent year (displayed in the field labelled “contractAwardDate-dateAttributionContrat”). This information originally began to be published in January 2009 so the earliest file only covers the final quarter of that fiscal year. New contract history data files will be added on April 1 for each fiscal year. > Note: A known data quality issue exists where in past years (predominately before 2021) records detailing contract amendments placed the amendment date in the contract award date field of that record. This was partially due to the fact that the previous version of the Contract history dataset (now archived) did not have a separate field to capture the amendment date apart from the initial contract award date. This issue unfortunately means that filtering on the contract award date field to produce the fiscal year specific datafiles does not properly capture the data, as intended. We are investigating this issue at present, and if there is a way for this to be corrected, we will do so in a later release and will provide an update to the description of this dataset accordingly. (4) Legacy contract history, 2009-01 to 2023-05 (prior to CanadaBuys): This file contains the history of contracts initially awarded prior to CanadaBuys becoming the system of record for contract history data, which occurred on 1 June 2023. This file will be refreshed monthly, generally in the 3rd week of the month, to include any new updates or amendments.
HCUP Nationwide Emergency Department Database (NEDS) Restricted Access File
data.virginia.gov
healthdata.gov
+2more
Updated Jul 26, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Agency for Healthcare Research and Quality, Department of Health & Human Services (2023). HCUP Nationwide Emergency Department Database (NEDS) Restricted Access File [Dataset]. https://data.virginia.gov/dataset/hcup-nationwide-emergency-department-database-neds-restricted-access-file
Explore at:
Dataset updated
Jul 26, 2023
Dataset provided by
Agency for Healthcare Research and Qualityhttp://www.ahrq.gov/
United States Department of Health and Human Serviceshttp://www.hhs.gov/
Description
The Healthcare Cost and Utilization Project (HCUP) Nationwide Emergency Department Sample (NEDS) is the largest all-payer emergency department (ED) database in the United States. yielding national estimates of hospital-owned ED visits. Unweighted, it contains data from over 30 million ED visits each year. Weighted, it estimates roughly 145 million ED visits nationally. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality, HCUP data inform decision making at the national, State, and community levels.

Sampled from the HCUP State Inpatient Databases (SID) and State Emergency Department Databases (SEDD), the HCUP NEDS can be used to create national and regional estimates of ED care. The SID contain information on patients initially seen in the ED and subsequently admitted to the same hospital. The SEDD capture information on ED visits that do not result in an admission (i.e., treat-and-release visits and transfers to another hospital). Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality, HCUP data inform decision making at the national, State, and community levels.

The NEDS contain information about geographic characteristics, hospital characteristics, patient characteristics, and the nature of visits (e.g., common reasons for ED visits, including injuries). The NEDS contains clinical and resource use information included in a typical discharge abstract, with safeguards to protect the privacy of individual patients, physicians, and hospitals (as required by data sources). It includes ED charge information for over 85% of patients, regardless of expected payer, including but not limited to Medicare, Medicaid, private insurance, self-pay, or those billed as ‘no charge’. The NEDS excludes data elements that could directly or indirectly identify individuals, hospitals, or states.Restricted access data files are available with a data use agreement and brief online security training.
i
Data from: A Large-Scale Dataset of Twitter Chatter about Online Learning...
ieee-dataport.org
Updated Aug 9, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nirmalya Thakur (2022). A Large-Scale Dataset of Twitter Chatter about Online Learning during the Current COVID-19 Omicron Wave [Dataset]. http://doi.org/10.21227/z882-rt97
Explore at:
Unique identifier
https://doi.org/10.21227/z882-rt97
Dataset updated
Aug 9, 2022
Dataset provided by
IEEE Dataport
Authors
Nirmalya Thakur
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Please cite the following paper when using this dataset:N. Thakur, “A Large-Scale Dataset of Twitter Chatter about Online Learning during the Current COVID-19 Omicron Wave,” Journal of Data, vol. 7, no. 8, p. 109, Aug. 2022, doi: 10.3390/data7080109AbstractThe COVID-19 Omicron variant, reported to be the most immune evasive variant of COVID-19, is resulting in a surge of COVID-19 cases globally. This has caused schools, colleges, and universities in different parts of the world to transition to online learning. As a result, social media platforms such as Twitter are seeing an increase in conversations, centered around information seeking and sharing, related to online learning. Mining such conversations, such as Tweets, to develop a dataset can serve as a data resource for interdisciplinary research related to the analysis of interest, views, opinions, perspectives, attitudes, and feedback towards online learning during the current surge of COVID-19 cases caused by the Omicron variant. Therefore this work presents a large-scale public Twitter dataset of conversations about online learning since the first detected case of the COVID-19 Omicron variant in November 2021. The dataset is compliant with the privacy policy, developer agreement, and guidelines for content redistribution of Twitter and the FAIR principles (Findability, Accessibility, Interoperability, and Reusability) principles for scientific data management.Data DescriptionThe dataset comprises a total of 52,984 Tweet IDs (that correspond to the same number of Tweets) about online learning that were posted on Twitter from 9th November 2021 to 13th July 2022. The earliest date was selected as 9th November 2021, as the Omicron variant was detected for the first time in a sample that was collected on this date. 13th July 2022 was the most recent date as per the time of data collection and publication of this dataset.The dataset consists of 9 .txt files. An overview of these dataset files along with the number of Tweet IDs and the date range of the associated tweets is as follows. Filename: TweetIDs_November_2021.txt (No. of Tweet IDs: 1283, Date Range of the associated Tweet IDs: November 1, 2021 to November 30, 2021)Filename: TweetIDs_December_2021.txt (No. of Tweet IDs: 10545, Date Range of the associated Tweet IDs: December 1, 2021 to December 31, 2021)Filename: TweetIDs_January_2022.txt (No. of Tweet IDs: 23078, Date Range of the associated Tweet IDs: January 1, 2022 to January 31, 2022)Filename: TweetIDs_February_2022.txt (No. of Tweet IDs: 4751, Date Range of the associated Tweet IDs: February 1, 2022 to February 28, 2022)Filename: TweetIDs_March_2022.txt (No. of Tweet IDs: 3434, Date Range of the associated Tweet IDs: March 1, 2022 to March 31, 2022)Filename: TweetIDs_April_2022.txt (No. of Tweet IDs: 3355, Date Range of the associated Tweet IDs: April 1, 2022 to April 30, 2022)Filename: TweetIDs_May_2022.txt (No. of Tweet IDs: 3120, Date Range of the associated Tweet IDs: May 1, 2022 to May 31, 2022)Filename: TweetIDs_June_2022.txt (No. of Tweet IDs: 2361, Date Range of the associated Tweet IDs: June 1, 2022 to June 30, 2022)Filename: TweetIDs_July_2022.txt (No. of Tweet IDs: 1057, Date Range of the associated Tweet IDs: July 1, 2022 to July 13, 2022)The dataset contains only Tweet IDs in compliance with the terms and conditions mentioned in the privacy policy, developer agreement, and guidelines for content redistribution of Twitter. The Tweet IDs need to be hydrated to be used. For hydrating this dataset the Hydrator application (link to download and a step-by-step tutorial on how to use Hydrator) may be used.The list of all the synonyms or terms that were used for the dataset development is as follows:COVID-19: Omicron, COVID, COVID19, coronavirus, coronaviruspandemic, COVID-19, corona, coronaoutbreak, omicron variant, SARS CoV-2, corona virusonline learning: online education, online learning, remote education, remote learning, e-learning, elearning, distance learning, distance education, virtual learning, virtual education, online teaching, remote teaching, virtual teaching, online class, online classes, remote class, remote classes, distance class, distance classes, virtual class, virtual classes, online course, online courses, remote course, remote courses, distance course, distance courses, virtual course, virtual courses, online school, virtual school, remote school, online college, online university, virtual college, virtual university, remote college, remote university, online lecture, virtual lecture, remote lecture, online lectures, virtual lectures, remote lectures
e
Simple download service (Atom) of the dataset: Contracts signed for the...
data.europa.eu
unknown
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Simple download service (Atom) of the dataset: Contracts signed for the engagement of CAD/CTE surface elements in Haut-Rhin [Dataset]. https://data.europa.eu/data/datasets/fr-120066022-srv-1082e0e6-6eb0-414e-8471-d498e01ecfa6
Explore at:
unknownAvailable download formats
Description
This layer constitutes the localisation plan of the surface elements engaged in signed CADs or in the process of being signed. These data are imported from CartoCAD, the ADASEA input software, according to a frequency and mode of shipments defined locally between the DDAF and ADASEA.
CT-FAN-21 corpus: A dataset for Fake News Detection
zenodo.org
Updated Oct 23, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gautam Kishore Shahi; Julia Maria Struß; Thomas Mandl; Gautam Kishore Shahi; Julia Maria Struß; Thomas Mandl (2022). CT-FAN-21 corpus: A dataset for Fake News Detection [Dataset]. http://doi.org/10.5281/zenodo.4714517
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.4714517
Dataset updated
Oct 23, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Gautam Kishore Shahi; Julia Maria Struß; Thomas Mandl; Gautam Kishore Shahi; Julia Maria Struß; Thomas Mandl
Description
Data Access: The data in the research collection provided may only be used for research purposes. Portions of the data are copyrighted and have commercial value as data, so you must be careful to use it only for research purposes. Due to these restrictions, the collection is not open data. Please download the Agreement at Data Sharing Agreement and send the signed form to fakenewstask@gmail.com .

Citation

Please cite our work as

@article{shahi2021overview, title={Overview of the CLEF-2021 CheckThat! lab task 3 on fake news detection}, author={Shahi, Gautam Kishore and Stru{\ss}, Julia Maria and Mandl, Thomas}, journal={Working Notes of CLEF}, year={2021} }

Problem Definition: Given the text of a news article, determine whether the main claim made in the article is true, partially true, false, or other (e.g., claims in dispute) and detect the topical domain of the article. This task will run in English.

Subtask 3A: Multi-class fake news detection of news articles (English) Sub-task A would detect fake news designed as a four-class classification problem. The training data will be released in batches and roughly about 900 articles with the respective label. Given the text of a news article, determine whether the main claim made in the article is true, partially true, false, or other. Our definitions for the categories are as follows:

False - The main claim made in an article is untrue.

Partially False - The main claim of an article is a mixture of true and false information. The article contains partially true and partially false information but cannot be considered 100% true. It includes all articles in categories like partially false, partially true, mostly true, miscaptioned, misleading etc., as defined by different fact-checking services.

True - This rating indicates that the primary elements of the main claim are demonstrably true.

Other- An article that cannot be categorised as true, false, or partially false due to lack of evidence about its claims. This category includes articles in dispute and unproven articles.

Subtask 3B: Topical Domain Classification of News Articles (English) Fact-checkers require background expertise to identify the truthfulness of an article. The categorisation will help to automate the sampling process from a stream of data. Given the text of a news article, determine the topical domain of the article (English). This is a classification problem. The task is to categorise fake news articles into six topical categories like health, election, crime, climate, election, education. This task will be offered for a subset of the data of Subtask 3A.

Input Data

The data will be provided in the format of Id, title, text, rating, the domain; the description of the columns is as follows:

Task 3a

ID- Unique identifier of the news article

Title- Title of the news article

text- Text mentioned inside the news article

our rating - class of the news article as false, partially false, true, other

Task 3b

public_id- Unique identifier of the news article

Title- Title of the news article

text- Text mentioned inside the news article

domain - domain of the given news article(applicable only for task B)

Output data format

Task 3a

public_id- Unique identifier of the news article

predicted_rating- predicted class

Sample File

public_id, predicted_rating 1, false 2, true

Task 3b

public_id- Unique identifier of the news article

predicted_domain- predicted domain

Sample file

public_id, predicted_domain 1, health 2, crime

Additional data for Training

To train your model, the participant can use additional data with a similar format; some datasets are available over the web. We don't provide the background truth for those datasets. For testing, we will not use any articles from other datasets. Some of the possible source:

Fakenews Classification Datasets

Fake News Detection Challenge KDD 2020

FakeNewsNet

IMPORTANT!

Fake news article used for task 3b is a subset of task 3a.

We have used the data from 2010 to 2021, and the content of fake news is mixed up with several topics like election, COVID-19 etc.

Evaluation Metrics

This task is evaluated as a classification task. We will use the F1-macro measure for the ranking of teams. There is a limit of 5 runs (total and not per day), and only one person from a team is allowed to submit runs.

Submission Link: https://competitions.codalab.org/competitions/31238

Related Work

Shahi GK. AMUSED: An Annotation Framework of Multi-modal Social Media Data. arXiv preprint arXiv:2010.00502. 2020 Oct 1.https://arxiv.org/pdf/2010.00502.pdf

G. K. Shahi and D. Nandini, “FakeCovid – a multilingualcross-domain fact check news dataset for covid-19,” inWorkshop Proceedings of the 14th International AAAIConference on Web and Social Media, 2020. http://workshop-proceedings.icwsm.org/abstract?id=2020_14

Shahi, G. K., Dirkson, A., & Majchrzak, T. A. (2021). An exploratory study of covid-19 misinformation on twitter. Online Social Networks and Media, 22, 100104. doi: 10.1016/j.osnem.2020.100104

Facebook

Twitter

Click to copy link

Link copied

Cite

Social Security Administration (2025). Agreement Workflow Tool [Dataset]. https://catalog.data.gov/dataset/agreement-workflow-tool

Agreement Workflow Tool

Explore at:

Dataset updated

Mar 8, 2025

Dataset provided by

Social Security Administrationhttp://www.ssa.gov/

Description

AWT is a component of the Verification and Information Exchange Workload System (VIEWS). It is an online application which replicates the reimbursable agreement documents for review, approval and signature. AWT is available within the agency nationwide with users in various components within Headquarters, Office of Central Operations and Regional Offices across the country.

Clear search

Close search

Google apps

Main menu

Agreement Workflow Tool

COVID-19 Case Surveillance Restricted Access Detailed Data

CDC has three COVID-19 case surveillance datasets:

Overview

Data are Considered Provisional

Data Limitations

Data Quality Assurance Procedures

Data Suppression

Additional COVID-19 Data

COVID-19 Case Surveillance Public Use Data

COVID-19 Case Surveillance Restricted Access Detailed Data

Simple download service (Atom) of the dataset: Signed contracts for the...

ESS-DIVE Reporting Format for Dataset Package Metadata

The Phantom of Bern: repeated scans of two volunteers with eight different...

CONTENT

ACKNOWLEDGEMENT

DATA USE AGREEMENT

FUNDING

HCUP State Emergency Department Databases (SEDD) - Restricted Access File

HCUP State Inpatient Databases (SID) - Restricted Access File

HCUP Nationwide Readmissions Database (NRD)- Restricted Access Files

Data from: Purchase Orders and Contracts

Simple download service (Atom) of the dataset: Agriculture — Contracts...

Health Insurance Marketplace

Exploration Ideas

Data Description

1. Original versions of the data

2. Combined CSV files that contain

3. SQLite database

National Inpatient Sample (NIS) - Restricted Access Files

Tsunami Model Boundaries - Dataset - data.govt.nz - discover and use data

CanadaBuys contract history

HCUP Nationwide Emergency Department Database (NEDS) Restricted Access File

Data from: A Large-Scale Dataset of Twitter Chatter about Online Learning...

Simple download service (Atom) of the dataset: Contracts signed for the...

CT-FAN-21 corpus: A dataset for Fake News Detection

Agreement Workflow ToolSee More Versions

Agreement Workflow Tool