7 datasets found
  1. otter_dude

    • huggingface.co
    Updated Aug 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    IBM Research (2023). otter_dude [Dataset]. https://huggingface.co/datasets/ibm-research/otter_dude
    Explore at:
    Dataset updated
    Aug 16, 2023
    Dataset provided by
    IBMhttp://ibm.com/
    IBM Research
    Authors
    IBM Research
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Otter DUDe Dataset Card

    Otter DUDe includes 1,452,568 instances of drug-target interactions.

      Dataset details
    
    
    
    
    
      DUDe
    

    DUDe comprises a collection of 22,886 active compounds and their corresponding affinities towards 102 targets. For our study, we utilized a preprocessed version of the DUDe, which includes 1,452,568 instances of drug-target interactions. To prevent any data leakage, we eliminated the negative interactions and the overlapping triples with the TDC DTI… See the full description on the dataset page: https://huggingface.co/datasets/ibm-research/otter_dude.

  2. Magnifying Side-Channel Leakage of Lattice-Based Cryptosystems with Chosen...

    • zenodo.org
    • data.europa.eu
    zip
    Updated Apr 30, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Z. Xu; O Pemberton; S. Roy; D. Oswald; Z. Xu; O Pemberton; S. Roy; D. Oswald (2021). Magnifying Side-Channel Leakage of Lattice-Based Cryptosystems with Chosen Ciphertexts: The Case Study of Kyber (Dataset) [Dataset]. http://doi.org/10.5281/zenodo.4726798
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 30, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Z. Xu; O Pemberton; S. Roy; D. Oswald; Z. Xu; O Pemberton; S. Roy; D. Oswald
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains data to reproduce results from the paper "Magnifying Side-Channel Leakage of Lattice-Based Cryptosystems with Chosen Ciphertexts: The Case Study of Kyber."

    Abstract

    In this paper, we propose EM side-channel attacks with carefully constructed ciphertext on Kyber, a lattice-based key encapsulation mechanism, which is a candidate of NIST Post-Quantum Cryptography standardization project. We demonstrate that specially chosen ciphertexts allow an adversary to modulate the leakage of a target device and enable full key extraction with a small number of traces through simple power analysis. Compared to prior research, our techniques require a lower number of traces and avoid the need for template attacks. We practically evaluate our methods using both a clean reference implementation of Kyber and the ARM-optimized pqm4 library. For the reference implementation, we target the leakage of the output of the inverse NTT computation and recover the full key with only four traces. For the pqm4 implementation, we develop a message-recovery attack that leads to extraction of the full secret-key with between eight and 960 traces (or 184 traces for recovering 98% of the secret-key), depending on the compiler optimization level. We discuss the relevance of our findings to other lattice-based schemes and explore potential countermeasures.

  3. OCT Dataset for Segmentation of Atherosclerotic Plaque Morphological...

    • zenodo.org
    csv, zip
    Updated Jan 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Viacheslav Danilov; Viacheslav Danilov; Vladislav Laptev; Vladislav Laptev; Kirill Klyshnikov; Kirill Klyshnikov; Evgeny Ovcharenko; Evgeny Ovcharenko; Nikita Kochergin; Nikita Kochergin (2025). OCT Dataset for Segmentation of Atherosclerotic Plaque Morphological Features [Dataset]. http://doi.org/10.5281/zenodo.14478210
    Explore at:
    zip, csvAvailable download formats
    Dataset updated
    Jan 30, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Viacheslav Danilov; Viacheslav Danilov; Vladislav Laptev; Vladislav Laptev; Kirill Klyshnikov; Kirill Klyshnikov; Evgeny Ovcharenko; Evgeny Ovcharenko; Nikita Kochergin; Nikita Kochergin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Objectives: The primary goal of this dataset is to enable the automated segmentation and quantification of atherosclerotic plaque features in OCT images. Cardiovascular disease, with atherosclerosis at its core, remains a global health challenge. Accurate identification of vulnerable plaques is crucial for preventing acute cardiovascular events such as myocardial infarction and stroke. OCT imaging provides high-resolution insights into plaque morphology but is often constrained by manual interpretation challenges. This dataset, curated with diverse annotations of key plaque morphological features, aims to facilitate the development and evaluation of machine learning models for precise plaque analysis. By advancing segmentation capabilities, this dataset contributes to improved diagnostics and therapeutic strategies in cardiovascular care.

    Ethical Approval: The dataset complies with ethical standards, adhering to the Declaration of Helsinki. Ethical approval was granted by the Local Ethical Committee of the Research Institute for Complex Issues of Cardiovascular Diseases (Kemerovo, Russia) under protocol code 2022/06 (approved on June 30, 2022). All participants provided informed consent. Data collection involved patients aged 18 years or older, ensuring balanced gender representation and inclusion of various comorbid conditions for comprehensive clinical relevance (refer to Table 1).

    Description: The dataset consists of OCT images acquired from 103 patients across two cardiovascular research centers. These images, collected over one year, represent a diverse array of imaging devices and patient demographics. The dataset includes 25,698 annotated slices, each capturing key plaque morphological features. These features include lumen (LM), fibrous cap (FC), lipid core (LC), and vasa vasorum (VV). The images vary in dimensions from 704 x 704 to 1024 x 1024 pixels, reflecting differences in anatomical characteristics and imaging conditions. Annotations were performed using Supervisely, with meticulous double-verification processes to ensure accuracy.

    Annotation Method: Two cardiologists annotated the dataset, identifying plaque features using binary masks. The annotations underwent a review and double-verification by a senior cardiologist and technical specialist, enhancing precision and consistency. The morphological features segmented include the vascular lumen, fibrous cap, lipid core, and vasa vasorum, each providing critical insights into plaque stability and cardiovascular risk.

    Dataset Split: A 5-fold cross-validation technique was employed for dataset splitting, ensuring robust model evaluation while preventing data leakage. Approximately 80% of images were allocated for training in each fold, with the remaining 20% reserved for testing (refer to Table 2). This method allowed a balanced and comprehensive assessment of segmentation performance across the dataset.

    Access to the Study: Further information about this study, including curated source code, dataset details, and trained models, can be accessed through the following repositories:

    Table 1. Baseline characteristics of patients included in the study.

    Parameter

    Value

    Sex:

    Male, n (%)

    77 (74.7)

    Female, n (%)

    26 (25.3)

    Median Age, years [min – max]

    69 [43 – 83]

    Arterial hypertension, n (%)

    92 (89.3)

    Diabetes Mellitus, n (%)

    22 (21.4)

    Myocardial Infarction, n (%)

    22 (21.4)

    Polyvascular Disease, n (%)

    29 (28.2)

    Angina Pectoris:

    Silent ischemia, n (%)

    9 (8.7)

    Functional class 1, n (%)

    24 (23.3)

    Functional class 2, n (%)

    55 (53.4)

    Functional class 3, n (%)

    15 (14.6)

    Table 2. Image and plaque morphological feature distributions across folds and subsets.

    FoldSubsetLMFCLCVVTotal objectsTotal images
    1Train17264561055763282877816901
    1Test45441616161612278984492
    2Train17554570956902372919017207
    2Test42541517150221374864186
    3Train17220560055654072879216962
    3Test4588162616274378844431
    4Train17813572456864162963917473
    4Test3995150215063470373920
    5Train17381626162514123040517029
    5Test44279659413863714364

  4. f

    Full 6.5-day CW measurement for Drift Detection

    • uvaauas.figshare.com
    zip
    Updated Feb 5, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kostas Papagiannopoulos (2024). Full 6.5-day CW measurement for Drift Detection [Dataset]. http://doi.org/10.21942/uva.24949077.v10
    Explore at:
    zipAvailable download formats
    Dataset updated
    Feb 5, 2024
    Dataset provided by
    University of Amsterdam / Amsterdam University of Applied Sciences
    Authors
    Kostas Papagiannopoulos
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Full 6.5-day CW measurement for Drift Detection

    Experiment:

    -The dataset was captured using the Chipwhisperer CW308, with the target device being STM32F3 with an ARM Cortex-M4

    -The target device performed an AES-128 encryption while we measured the leakage traces

    -The experiment lasted for approximately 6.5 days

    -The data is organized in parts of 100k traces each. Each 100k-sized part was captured in approx. 38 minutes. Each trace has 5k time samples (features). The original experiment has a total of 254 parts of 100k traces each.

    -Every 100k-trace data part is called tracesi.mat and comes together with labeli.mat, for indexes i = 1, 2, ..., 254

    -The labeli.mat is the value of a single sboxoutput of AES-128 i.e. the label ranges in the set {0,1,...,255}. We assume that successfully recovering the sboxoutput implies successfully recovering the respective key byte of AES-128.

    We also have a reduced version of the dataset available here:

    https://doi.org/10.21942/uva.25061858

  5. Supplementary Datasets to "Increasing emissions of HCFC-123 and HCFC-124 may...

    • zenodo.org
    bin, pdf, zip
    Updated Jun 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Luke Western; Luke Western; Stephen Bourguet; Stephen Bourguet; MOLLY CROTWELL; MOLLY CROTWELL; Lei Hu; Lei Hu; Paul Krummel; Paul Krummel; Hélène De Longueville; Hélène De Longueville; Alistair Manning; Jens Mühle; Jens Mühle; Dominique Rust; Dominique Rust; Isaac Vimont; Isaac Vimont; Martin Vollmer; Martin Vollmer; Minde An; Minde An; jgor arduini; jgor arduini; Andreas Engel; Andreas Engel; Paul Fraser; Paul Fraser; Anita Ganesan; Anita Ganesan; Christina Harth; Christina Harth; Chris Lunder; Chris Lunder; Michela Maione; Michela Maione; Stephen Montzka; Stephen Montzka; David Nance; Simon O'Doherty; Simon O'Doherty; Sunyoung Park; Sunyoung Park; Stefan Reimann; Stefan Reimann; Peter Salameh; Schmidt Roland; Kieran Stanley; Kieran Stanley; Thomas Wagenhäuser; Thomas Wagenhäuser; Dickon Young; Dickon Young; Matthew Rigby; Matthew Rigby; Ronald Prinn; Ronald Prinn; Ray Weiss; Ray Weiss; Alistair Manning; David Nance; Peter Salameh; Schmidt Roland (2025). Supplementary Datasets to "Increasing emissions of HCFC-123 and HCFC-124 may be due to leakage during HFC-125 production" [Dataset]. http://doi.org/10.5281/zenodo.15595387
    Explore at:
    zip, pdf, binAvailable download formats
    Dataset updated
    Jun 24, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Luke Western; Luke Western; Stephen Bourguet; Stephen Bourguet; MOLLY CROTWELL; MOLLY CROTWELL; Lei Hu; Lei Hu; Paul Krummel; Paul Krummel; Hélène De Longueville; Hélène De Longueville; Alistair Manning; Jens Mühle; Jens Mühle; Dominique Rust; Dominique Rust; Isaac Vimont; Isaac Vimont; Martin Vollmer; Martin Vollmer; Minde An; Minde An; jgor arduini; jgor arduini; Andreas Engel; Andreas Engel; Paul Fraser; Paul Fraser; Anita Ganesan; Anita Ganesan; Christina Harth; Christina Harth; Chris Lunder; Chris Lunder; Michela Maione; Michela Maione; Stephen Montzka; Stephen Montzka; David Nance; Simon O'Doherty; Simon O'Doherty; Sunyoung Park; Sunyoung Park; Stefan Reimann; Stefan Reimann; Peter Salameh; Schmidt Roland; Kieran Stanley; Kieran Stanley; Thomas Wagenhäuser; Thomas Wagenhäuser; Dickon Young; Dickon Young; Matthew Rigby; Matthew Rigby; Ronald Prinn; Ronald Prinn; Ray Weiss; Ray Weiss; Alistair Manning; David Nance; Peter Salameh; Schmidt Roland
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Supplementary Datasets to "Increasing emissions of HCFC-123 and HCFC-124 may be due to leakage during HFC-125 production"

    This data repository contains supplmentary datasets to the manuscript "Increasing emissions of HCFC-123 and HCFC-124 may be due to leakage during HFC-125 production" by Western et al., submitted to ACP.

    There are six supplementary data sets in this folder:

    1) factorylocations_2024_translated.pdf contains a translation of production data and companies producing HFC-125 (and other HFCs) in China. The original citation is "Ministry of Ecology and Environment of the People’s Republic of China: 2024 Hydrofluorocarbon Production and Import Quotas Announced [in Chinese], https://www.mee.gov.cn/xxgk2018/xxgk/xxgk05/202402/W020240318587718307908.pdf, 2024".

    2) GlobalEmissions.zip contains the global emissions and mole fraction trends derived from the 12-box model used in this work. This is for HCFC-123 and HCFC-124 for both the NOAA and AGAGE networks. Inputs to the 12-box model inversion scripts are also included, which can be found at https://github.com/mrghg/py12box_invert (most recent version). These files are csv files.

    3) HCFC124_bank_feedstock_modelling_3_31_2025.mat contains the outputs from the estimation of the separation of HCFC-124 emitted from banks and from HFC-125 production. This is a matlab file, but can be read using free software, such as Python.

    4) US_HCFC-124 emissions.zip contains the estimated emissions of HCFC-124 for the USA and files containing information on the observations used to do this.

    5) Europe.zip contains netcdf files output from the InTEM and RHIME models that were used to derive European emissions. Each model has 2 netfdf files, one which contains the output emissions and the other the input/output mole fractions.

    6) EastAsia.zip contains the output files from the InTEM and RHIME models that were used to derive emissions from East Asia. InTEM outputs are text files and RHIME outputs are netcdf files.

    The most recent observations from the AGAGE network can be found https://www-air.larc.nasa.gov/missions/agage/data/" target="_blank" rel="noopener">here and from the NOAA network can be found https://gml.noaa.gov/aftp/data/hats/" target="_blank" rel="noopener">here.

  6. Data from: Are Optical Gas Imaging Technologies Effective For Methane Leak...

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    • +1more
    zip
    Updated Jun 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arvind P. Ravikumar; Jingfan Wang; Adam R. Brandt (2023). Are Optical Gas Imaging Technologies Effective For Methane Leak Detection? [Dataset]. http://doi.org/10.1021/acs.est.6b03906.s003
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 5, 2023
    Dataset provided by
    ACS Publications
    Authors
    Arvind P. Ravikumar; Jingfan Wang; Adam R. Brandt
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Concerns over mitigating methane leakage from the natural gas system have become ever more prominent in recent years. Recently, the U.S. Environmental Protection Agency proposed regulations requiring use of optical gas imaging (OGI) technologies to identify and repair leaks. In this work, we develop an open-source predictive model to accurately simulate the most common OGI technology, passive infrared (IR) imaging. The model accurately reproduces IR images of controlled methane release field experiments as well as reported minimum detection limits. We show that imaging distance is the most important parameter affecting IR detection effectiveness. In a simulated well-site, over 80% of emissions can be detected from an imaging distance of 10 m. Also, the presence of “superemitters” greatly enhance the effectiveness of IR leak detection. The minimum detectable limits of this technology can be used to selectively target “superemitters”, thereby providing a method for approximate leak-rate quantification. In addition, model results show that imaging backdrop controls IR imaging effectiveness: land-based detection against sky or low-emissivity backgrounds have higher detection efficiency compared to aerial measurements. Finally, we show that minimum IR detection thresholds can be significantly lower for gas compositions that include a significant fraction nonmethane hydrocarbons.

  7. z

    Jacobaea vulgaris and meadow image classification dataset (binary)

    • zenodo.org
    zip
    Updated Jun 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    University of Rostock (2024). Jacobaea vulgaris and meadow image classification dataset (binary) [Dataset]. http://doi.org/10.5281/zenodo.12207476
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 21, 2024
    Dataset provided by
    University of Rostock
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    General Information

    Instances in the Jacobaea vulgaris class: 895
    Instances in the Meadow class: 9141
    Image sizes from 77x77 to 817x817 pixels on three color channels (RGB)

    Data Generation and Source

    The images in this dataset were taken as part of the project “UAV-basiertes Grünlandmonitoring auf Bestands- und Einzelpflanzenebene” (engl. “UAV-based Grassland Monitoring at Population and Individual Plant Level”), financed by the Authority for Economy, Transport, and Innovation of Hamburg.
    In September 2018, flights with an octocopter were conducted over two extensively used grassland areas in the urban area of Hamburg. The multicopter flew in a height of circa 11 meters and took pictures with a ground resolution of approximately 3,18 mm/pixel. Additional information about the process of image generation for this dataset are to be found in the relevant papers written by P. Zacharias: 1) UAV-basiertes Grünland-Monitoring und Schadpflanzenkartierung mit offenen Geodaten [p. 45–53] and 2) UAV-basiertes Grünlandmonitoring auf Bestands- und Einzelpflanzenebene.

    Additionally, to the images of Jacobaea vulgaris taken by the UAV, the dataset includes images of Jacobaea vulgaris plants from the internet (included in the total 895 images; e.g. images 'jkk0523.jpg', 'jkk0527.jpg'). Furthermore, some of the images of the Jacobaea vulgaris plants have been rotated, further cropped or a filter has been applied. The exact number of augmentations made is unknown. As there are augmented images included in the datasets -which makes the dataset useful for training and validation- a use of the dataset for testing purposes is not recommended due to the risk of data leakage.

    Data License

    The dataset is licensed under the license CC BY 4.0. The attributor of the data is the Chair of Geodesy and Geoinformatics at the University of Rostock. The data was created within the scope of the project 'UAV-based Grassland Monitoring at Population and Individual Plant Level', financed by the Authority for Economy, Transport, and Innovation of Hamburg.

  8. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
IBM Research (2023). otter_dude [Dataset]. https://huggingface.co/datasets/ibm-research/otter_dude
Organization logo

otter_dude

ibm-research/otter_dude

Explore at:
Dataset updated
Aug 16, 2023
Dataset provided by
IBMhttp://ibm.com/
IBM Research
Authors
IBM Research
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

Otter DUDe Dataset Card

Otter DUDe includes 1,452,568 instances of drug-target interactions.

  Dataset details





  DUDe

DUDe comprises a collection of 22,886 active compounds and their corresponding affinities towards 102 targets. For our study, we utilized a preprocessed version of the DUDe, which includes 1,452,568 instances of drug-target interactions. To prevent any data leakage, we eliminated the negative interactions and the overlapping triples with the TDC DTI… See the full description on the dataset page: https://huggingface.co/datasets/ibm-research/otter_dude.

Search
Clear search
Close search
Google apps
Main menu