29 datasets found

CICIDS 2019 Dataset
kaggle.com
Updated Mar 2, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
tarundhamor (2021). CICIDS 2019 Dataset [Dataset]. https://www.kaggle.com/tarundhamor/cicids-2019-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 2, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
tarundhamor
Description
Dataset

This dataset was created by tarundhamor

Contents
CIC-IDS-2017 V2
zenodo.org
zip
Updated Nov 26, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Akshayraj Madhubalan; Akshayraj Madhubalan; Amit Gautam; Amit Gautam; Priya Tiwary; Priya Tiwary (2024). CIC-IDS-2017 V2 [Dataset]. http://doi.org/10.5281/zenodo.10141593
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.10141593
Dataset updated
Nov 26, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Akshayraj Madhubalan; Akshayraj Madhubalan; Amit Gautam; Amit Gautam; Priya Tiwary; Priya Tiwary
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The CIC-IDS-V2 is an extended version of the original CIC-IDS 2017 dataset. The dataset is normalised and 1 new class called "Comb" is added which is a combination of synthesised data of multiple non-benign classes.

To cite the dataset, please reference the original paper with DOI: 10.1109/SmartNets61466.2024.10577645. The paper is published in IEEE SmartNets and can be accessed here.

Citation info:

Madhubalan, Akshayraj & Gautam, Amit & Tiwary, Priya. (2024). Blender-GAN: Multi-Target Conditional Generative Adversarial Network for Novel Class Synthetic Data Generation. 1-7. 10.1109/SmartNets61466.2024.10577645.

This dataset was made by Abluva Inc, a Palo Alto based, research-driven Data Protection firm. Our data protection platform empowers customers to secure data through advanced security mechanisms such as Fine Grained Access control and sophisticated depersonalization algorithms (e.g. Pseudonymization, Anonymization and Randomization). Abluva's Data Protection solutions facilitate data democratization within and outside the organizations, mitigating the concerns related to theft and compliance. The innovative intrusion detection algorithm by Abluva employs patented technologies for an intricately balanced approach that excludes normal access deviations, ensuring intrusion detection without disrupting the business operations. Abluva’s Solution enables organizations to extract further value from their data by enabling secure Knowledge Graphs and deploying Secure Data as a Service among other novel uses of data. Committed to providing a safe and secure environment, Abluva empowers organizations to unlock the full potential of their data.
h
CIC-IDS-2017
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
makekali, CIC-IDS-2017 [Dataset]. https://huggingface.co/datasets/makekali/CIC-IDS-2017
Explore at:
Authors
makekali
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
makekali/CIC-IDS-2017 dataset hosted on Hugging Face and contributed by the HF Datasets community
i
CICIDS2017 and UNBSW-NB15
ieee-dataport.org
Updated Dec 13, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
xinpeng chen (2023). CICIDS2017 and UNBSW-NB15 [Dataset]. https://ieee-dataport.org/documents/cicids2017-and-unbsw-nb15
Explore at:
Dataset updated
Dec 13, 2023
Authors
xinpeng chen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
DoS
CICIDS sample
kaggle.com
Updated Dec 10, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Srimadhaven T (2021). CICIDS sample [Dataset]. https://www.kaggle.com/srimadhavent/cicids-sample/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 10, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Srimadhaven T
Description
Dataset

This dataset was created by Srimadhaven T

Contents
h
CICIDS-2017-plus
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
bert van keulen, CICIDS-2017-plus [Dataset]. https://huggingface.co/datasets/bvk/CICIDS-2017-plus
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
bert van keulen
Description
bvk/CICIDS-2017-plus dataset hosted on Hugging Face and contributed by the HF Datasets community
f
Labels of normal and attack classes in the CICIDS-2017 dataset.
plos.figshare.com
xls
Updated Jul 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Islam Zada; Esraa Omran; Salman Jan; Hessa Alfraihi; Seetah Alsalamah; Abdullah Alshahrani; Shaukat Hayat; Nguyen Phi (2025). Labels of normal and attack classes in the CICIDS-2017 dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0328050.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0328050.t001
Dataset updated
Jul 21, 2025
Dataset provided by
PLOS ONE
Authors
Islam Zada; Esraa Omran; Salman Jan; Hessa Alfraihi; Seetah Alsalamah; Abdullah Alshahrani; Shaukat Hayat; Nguyen Phi
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Labels of normal and attack classes in the CICIDS-2017 dataset.
CICIDS-2017 TUE
kaggle.com
zip
Updated May 14, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sweety (2020). CICIDS-2017 TUE [Dataset]. https://www.kaggle.com/sweety18/cicids2017-tue
Explore at:
zip(40046227 bytes)Available download formats
Dataset updated
May 14, 2020
Authors
Sweety
Description
Dataset

This dataset was created by Sweety

Contents
f
Attack types and descriptions in CICIDS datasets.
plos.figshare.com
xls
Updated Jul 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Congyuan Xu; Donghui Li; Zihao Liu; Jun Yang; Qinfeng Shen; Ningbing Tong (2025). Attack types and descriptions in CICIDS datasets. [Dataset]. http://doi.org/10.1371/journal.pone.0327161.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0327161.t001
Dataset updated
Jul 2, 2025
Dataset provided by
PLOS ONE
Authors
Congyuan Xu; Donghui Li; Zihao Liu; Jun Yang; Qinfeng Shen; Ningbing Tong
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Deep learning methods have achieved remarkable progress in network intrusion detection. However, their performance often deteriorates significantly in real-world scenarios characterized by limited attack samples and substantial domain shifts. To address this challenge, we propose a novel few-shot intrusion detection method that integrates multi-domain feature fusion with a bidirectional cross-attention mechanism. Specifically, the method adopts a dual-branch feature extractor to jointly capture spatial and frequency domain characteristics of network traffic. The frequency domain features are obtained via two-dimensional discrete cosine transform (2D-DCT), which helps to highlight the spectral structure and improve feature discriminability. To bridge the semantic gap between support and query samples under few-shot conditions, we design a dual-domain bidirectional cross-attention module that enables deep, task-specific alignment across spatial and frequency domains. Additionally, we introduce a hierarchical feature encoding module based on a modified Mamba architecture, which leverages state space modeling to capture long-range dependencies and temporal patterns in traffic sequences. Extensive experiments on two benchmark datasets, CICIDS2017 and CICIDS2018, demonstrate that the proposed method achieves accuracy of 99.03% and 98.64% under the 10-shot setting, outperforming state-of-the-art methods. Moreover, the method exhibits strong cross-domain generalization, achieving over 95.13% accuracy in cross-domain scenarios, thereby proving its robustness and practical applicability in real-world, dynamic network environments.
h
CIC-IDS-2017
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fikri Mulyana Setiawan, CIC-IDS-2017 [Dataset]. https://huggingface.co/datasets/fikrimulyana/CIC-IDS-2017
Explore at:
Authors
Fikri Mulyana Setiawan
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
fikrimulyana/CIC-IDS-2017 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
cic-ids-2017
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Son Nguyen, cic-ids-2017 [Dataset]. https://huggingface.co/datasets/sonnh-tech1/cic-ids-2017
Explore at:
Authors
Son Nguyen
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
sonnh-tech1/cic-ids-2017 dataset hosted on Hugging Face and contributed by the HF Datasets community
cic-ids-2018
kaggle.com
Updated Jan 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
santosh_3007 (2025). cic-ids-2018 [Dataset]. https://www.kaggle.com/datasets/santosh3007/cic-ids-2018/versions/1
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 5, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
santosh_3007
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset

This dataset was created by santosh_3007

Released under MIT

Contents
m
CSE-CIC-IDS2018
data.mendeley.com
Updated Feb 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Abdisalam Mohamed (2024). CSE-CIC-IDS2018 [Dataset]. http://doi.org/10.17632/29hdbdzx2r.1
Explore at:
Unique identifier
https://doi.org/10.17632/29hdbdzx2r.1
Dataset updated
Feb 5, 2024
Authors
Abdisalam Mohamed
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
A cleaned version of CSE-CIC-IDS2018 dataset
i
SCVIC-CIDS-2021: Collaborative Feature Maps of Networks and Hosts for...
ieee-dataport.org
Updated Sep 2, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Burak Kantarci (2023). SCVIC-CIDS-2021: Collaborative Feature Maps of Networks and Hosts for Intrusion Detection [Dataset]. https://ieee-dataport.org/documents/scvic-cids-2021-collaborative-feature-maps-networks-and-hosts-intrusion-detection
Explore at:
Dataset updated
Sep 2, 2023
Authors
Burak Kantarci
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Portugal
m
CIC-DDoS2019 Dataset
data.mendeley.com
Updated Mar 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Md Alamin Talukder (2023). CIC-DDoS2019 Dataset [Dataset]. http://doi.org/10.17632/ssnc74xm6r.1
Explore at:
Unique identifier
https://doi.org/10.17632/ssnc74xm6r.1
Dataset updated
Mar 3, 2023
Authors
Md Alamin Talukder
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Distributed Denial of Service (DDoS) attack is a menace to network security that aims at exhausting the target networks with malicious traffic. Although many statistical methods have been designed for DDoS attack detection, designing a real-time detector with low computational overhead is still one of the main concerns. On the other hand, the evaluation of new detection algorithms and techniques heavily relies on the existence of well-designed datasets. In this paper, first, we review the existing datasets comprehensively and propose a new taxonomy for DDoS attacks. Secondly, we generate a new dataset, namely CICDDoS2019, which remedies all current shortcomings. Thirdly, using the generated dataset, we propose a new detection and family classification approach based on a set of network flow features. Finally, we provide the most important feature sets to detect different types of DDoS attacks with their corresponding weights.

The dataset offers an extended set of Distributed Denial of Service attacks, most of which employ some form of amplification through reflection. The dataset shares its feature set with the other CIC NIDS datasets, IDS2017, IDS2018 and DoS2017

original paper link: https://ieeexplore.ieee.org/abstract/document/8888419 kaggle dataset link: https://www.kaggle.com/datasets/dhoogla/cicddos2019
i
SCVIC-CIDS-2022: Bridging Networks and Hosts via Machine Learning-Based...
ieee-dataport.org
Updated Sep 14, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Burak Kantarci (2022). SCVIC-CIDS-2022: Bridging Networks and Hosts via Machine Learning-Based Intrusion Detection [Dataset]. https://ieee-dataport.org/documents/scvic-cids-2022-bridging-networks-and-hosts-machine-learning-based-intrusion-detection
Explore at:
Dataset updated
Sep 14, 2022
Authors
Burak Kantarci
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
M. Simsek
h
cic-ids-2017-textual
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shiva Prasad Gyawali, cic-ids-2017-textual [Dataset]. https://huggingface.co/datasets/gyawalishiva/cic-ids-2017-textual
Explore at:
Authors
Shiva Prasad Gyawali
Description
gyawalishiva/cic-ids-2017-textual dataset hosted on Hugging Face and contributed by the HF Datasets community
f
Features of the CIC-IDS 2017 network intrusion dataset.
plos.figshare.com
xls
Updated Apr 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sanchit Vashisht; Shalli Rani; Mohammad Shabaz (2025). Features of the CIC-IDS 2017 network intrusion dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0321224.t002
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0321224.t002
Dataset updated
Apr 14, 2025
Dataset provided by
PLOS ONE
Authors
Sanchit Vashisht; Shalli Rani; Mohammad Shabaz
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Features of the CIC-IDS 2017 network intrusion dataset.
n
Composed Encrypted Malicious Traffic Dataset for machine learning based...
narcis.nl
data.mendeley.com
Updated Oct 6, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Wang, Z (via Mendeley Data) (2021). Composed Encrypted Malicious Traffic Dataset for machine learning based encrypted malicious traffic analysis. [Dataset]. http://doi.org/10.17632/ztyk4h3v6s.1
Explore at:
Unique identifier
https://doi.org/10.17632/ztyk4h3v6s.1
Dataset updated
Oct 6, 2021
Dataset provided by
Data Archiving and Networked Services (DANS)
Authors
Wang, Z (via Mendeley Data)
Description
This is a traffic dataset which contains balance size of encrypted malicious and legitimate traffic for encrypted malicious traffic detection. The dataset is a secondary csv feature data which is composed of five public traffic datasets. Our dataset is composed based on three criteria: The first criterion is to combine widely considered public datasets which contain both encrypted malicious and legitimate traffic in existing works, such as the Malwares Capture Facility Project dataset and the CICIDS-2017 dataset. The second criterion is to ensure the data balance, i.e., balance of malicious and legitimate network traffic and similar size of network traffic contributed by each individual dataset. Thus, approximate proportions of malicious and legitimate traffic from each selected public dataset are extracted by using random sampling. We also ensured that there will be no traffic size from one selected public dataset that is much larger than other selected public datasets. The third criterion is that our dataset includes both conventional devices' and IoT devices' encrypted malicious and legitimate traffic, as these devices are increasingly being deployed and are working in the same environments such as offices, homes, and other smart city settings.

Based on the criteria, 5 public datasets are selected. After data pre-processing, details of each selected public dataset and the final composed dataset are shown in “Dataset Statistic Analysis Document”. The document summarized the malicious and legitimate traffic size we selected from each selected public dataset, proportions of selected traffic size from each selected public dataset with respect to the total traffic size of the composed dataset (% w.r.t the composed dataset), proportions of selected encrypted traffic size from each selected public dataset (% from selected public dataset), and total traffic size of the composed dataset. From the table, we are able to observe that each public dataset equally contributes to approximately 20% of the composed dataset, except for CICDS-2012 (due to its limited number of encrypted malicious traffic). This achieves a balance across individual datasets and reduces bias towards traffic belonging to any dataset during learning. We can also observe that the size of malicious and legitimate traffic are almost the same, thus achieving class balance. The datasets now made available were prepared aiming at encrypted malicious traffic detection. Since the dataset is used for machine learning model training, a sample of train and test sets are also provided. The train and test datasets are separated based on 1:4 and stratification is applied during data split. Such datasets can be used directly for machine or deep learning model training based on selected features.
CIC-IDS-2018-processed
kaggle.com
Updated Jul 14, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rachit Das (2023). CIC-IDS-2018-processed [Dataset]. https://www.kaggle.com/datasets/exterminator11/shortened-cic-2018
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 14, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Rachit Das
Description
Dataset

This dataset was created by Rachit Das

Contents

Facebook

Twitter

Click to copy link

Link copied

Cite

tarundhamor (2021). CICIDS 2019 Dataset [Dataset]. https://www.kaggle.com/tarundhamor/cicids-2019-dataset

CICIDS 2019 Dataset

Explore at:

54 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Mar 2, 2021

Dataset provided by

Kagglehttp://kaggle.com/

Authors

tarundhamor

Description

Dataset

This dataset was created by tarundhamor

Clear search

Close search

Google apps

Main menu

CICIDS 2019 Dataset

Dataset

Contents

CIC-IDS-2017 V2

CIC-IDS-2017

CICIDS2017 and UNBSW-NB15

CICIDS sample

Dataset

Contents

CICIDS-2017-plus

Labels of normal and attack classes in the CICIDS-2017 dataset.

CICIDS-2017 TUE

Dataset

Contents

Attack types and descriptions in CICIDS datasets.

CIC-IDS-2017

cic-ids-2017

cic-ids-2018

Dataset

Contents

CSE-CIC-IDS2018

SCVIC-CIDS-2021: Collaborative Feature Maps of Networks and Hosts for...

CIC-DDoS2019 Dataset

SCVIC-CIDS-2022: Bridging Networks and Hosts via Machine Learning-Based...

cic-ids-2017-textual

Features of the CIC-IDS 2017 network intrusion dataset.

Composed Encrypted Malicious Traffic Dataset for machine learning based...

CIC-IDS-2018-processed

Dataset

Contents

CICIDS 2019 Dataset

Dataset

Contents