2 datasets found
  1. h

    UNSW-NB15

    • huggingface.co
    Updated Mar 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Witold Wydmański (2023). UNSW-NB15 [Dataset]. https://huggingface.co/datasets/wwydmanski/UNSW-NB15
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 19, 2023
    Authors
    Witold Wydmański
    Description

    Source

    https://www.kaggle.com/datasets/dhoogla/unswnb15?resource=download

      Dataset
    

    This is an academic intrusion detection dataset. All the credit goes to the original authors: dr. Nour Moustafa and dr. Jill Slay. Please cite their original paper and all other appropriate articles listed on the UNSW-NB15 page. The full dataset also offers the pcap, BRO and Argus files along with additional documentation. The modifications to the predesignated train-test sets are… See the full description on the dataset page: https://huggingface.co/datasets/wwydmanski/UNSW-NB15.

  2. UNSW-NB15

    • kaggle.com
    Updated Sep 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    StrGenIx | Laurens D'hooge (2024). UNSW-NB15 [Dataset]. http://doi.org/10.34740/kaggle/dsv/9350725
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 9, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    StrGenIx | Laurens D'hooge
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    This is an academic intrusion detection dataset. All the credit goes to the original authors: dr. Nour Moustafa and dr. Jill Slay.

    Please cite their original paper and all other appropriate articles listed on the UNSW-NB15 page.

    The full dataset also offers the pcap, BRO and Argus files along with additional documentation.

    V1: Original CSVs obtained from here V2: Cleaning -> parquet V3: Reorganize to save storage, only keep original CSVs in V1/V2 V4: Update to remove contaminating features [presentation] & [conference article]

    My modifications to the predesignated train-test sets are minimal and designed to decrease disk storage and increase performance & reliability.

    In its current iteration, the dataset can be loaded trivially with pd.read_parquet(). All data types are already set correctly and there are 0 records with missing information. Reading parquet files does require fastparquet and / or pyarrow

    Exploratory Data Analysis (EDA) through classification with very simple models to .877 AUROC.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Witold Wydmański (2023). UNSW-NB15 [Dataset]. https://huggingface.co/datasets/wwydmanski/UNSW-NB15

UNSW-NB15

wwydmanski/UNSW-NB15

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 19, 2023
Authors
Witold Wydmański
Description

Source

https://www.kaggle.com/datasets/dhoogla/unswnb15?resource=download

  Dataset

This is an academic intrusion detection dataset. All the credit goes to the original authors: dr. Nour Moustafa and dr. Jill Slay. Please cite their original paper and all other appropriate articles listed on the UNSW-NB15 page. The full dataset also offers the pcap, BRO and Argus files along with additional documentation. The modifications to the predesignated train-test sets are… See the full description on the dataset page: https://huggingface.co/datasets/wwydmanski/UNSW-NB15.

Search
Clear search
Close search
Google apps
Main menu