8 datasets found
  1. h

    isom5240-td-application-traffic-analysis

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gordon, isom5240-td-application-traffic-analysis [Dataset]. https://huggingface.co/datasets/slliac/isom5240-td-application-traffic-analysis
    Explore at:
    Authors
    Gordon
    Description

    Split:

    application: 38 samples

      Class Distribution:
    

    car (ID: 3): 67 (46.2%) motorcycle (ID: 4): 14 (9.7%) airplane (ID: 5): 62 (42.8%) truck (ID: 8): 2 (1.4%)

      Annotation Files:
    

    Latest: application/application_labels.json Timestamped: application/application_labels_20250309_212205.json

      Split:
    

    application: 49 samples

      Class Distribution:
    

    car (ID: 3): 120 (60.0%) motorcycle (ID: 4): 14 (7.0%) airplane (ID: 5): 62 (31.0%) truck (ID: 8):… See the full description on the dataset page: https://huggingface.co/datasets/slliac/isom5240-td-application-traffic-analysis.

  2. VRiV (Vehicle Recognition in Videos) Dataset

    • kaggle.com
    zip
    Updated Dec 5, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Landry KEZEBOU (2021). VRiV (Vehicle Recognition in Videos) Dataset [Dataset]. https://www.kaggle.com/landrykezebou/vriv-vehicle-recognition-in-videos-dataset
    Explore at:
    zip(2383870377 bytes)Available download formats
    Dataset updated
    Dec 5, 2021
    Authors
    Landry KEZEBOU
    Description

    Context

    The concept of searching and localizing vehicles from live traffic videos based on descriptive textual input has yet to be explored in the scholarly literature. Endowing Intelligent Transportation Systems (ITS) with such a capability could help solve crimes on roadways. While artificial intelligence (AI) can be a powerful tool for this data intensive application, existing state-of-the-art AI models struggle with fine-grain vehicle recognition. Typically, only reporting model performance on still input image data, often captured at high resolution and at pristine quality. These settings are not reflective of real-world operating conditions and thus, recognition accuracies typically cannot be replicated on video data. One major impediment to the advancement of fine-grain vehicle recognition models is the lack of video testbench datasets with annotated ground-truth data. Additionally, to the best of our knowledge, no metrics currently exist for evaluating the robustness and performance efficiency of a vehicle recognition model on live videos, and even less so for vehicle search and localization models. In this paper, we address these challenges by proposing V-Localize, a novel artificial intelligence framework for vehicle search and continuous localization captured from live traffic videos based on input textual descriptions. An efficient hashgraph algorithm is introduced to process input text (such as a sentence, paragraph, or report) to extract detailed target information used to query the recognition and localization model. This work further introduces two novel datasets that will help advance AI research in these challenging areas. These datasets include: a) the most diverse and large-scale Vehicle Color Recognition (VCoR) dataset with 15 colors classes -- twice as many as the number of color classes in the largest existing such dataset -- to facilitate finer-grain recognition with color information; and b) a Vehicle Recognition in Video (VRiV) dataset, which is a first of its kind video test-bench dataset for evaluating the performance of vehicle recognition models in live videos rather than still image data. The VRiV dataset will open new avenues for AI researchers to investigate innovative approaches that were previously intractable due to the lack of a traffic vehicle recognition annotated test-bench video dataset. Finally, to address the gap in the field, 5 novel metrics are introduced in this paper for adequately accessing the performance of vehicle recognition models in live videos. Ultimately, the proposed metrics could also prove intuitively effective at quantitative model evaluation in other video recognition applications. The novel metrics and VRiV test-bench dataset introduced in this paper are specifically aimed at advancing state-of-the-art research for vehicle recognition in videos. Likewise, the proposed novel vehicle search and continuous localization framework could prove assistive in cases such as of amber alerts or hit-and-run incidents. One major advantage of the proposed system is that it can be integrated into intelligent transportation system software to help aid law-enforcement.

    Image Acquisition

    The proposed Vehicle Recognition in Video (VRiV) dataset is the first of its kind and is aimed at developing, improving, and analyzing performance of vehicle search and recognition models on live videos. The lack of such a dataset has limited performance analysis of modern fine-grain vehicle recognition systems to only still image input data, making them less suitable for video applications. The VRiV dataset is introduced to help bridge this gap and foster research in this direction. The proposed VRiV dataset consists of up to 47 video sequences averaging about 38.5 seconds per video. The videos are recorded in a traffic setting focusing on vehicles of volunteer candidates whose ground truth make, model, year and color information are known. For security reasons and safety of participants, experiments are conducted on streets/road with low traffic density. For each video, there is a target vehicle with known ground truth information, and there are other vehicles either moving in traffic or parked on side streets, to simulate real-world traffic scenario. The goal is for the algorithm to be able to search, recognize and continuously localize just the specific target vehicle of interest for the corresponding video based on the search query. It is worth noting that the ground truth information about other vehicles in the videos are not known. The 47 videos in the testbench dataset are distributed across 7 distinct makes and 17 model designs as shown in Figure 10. The videos are also annotated to include ground truth bounding boxes for the specific target vehicles in corresponding videos. The dataset includes more than 46k annotated frames averaging about 920 frames per video. This dataset will be made available on Kaggle, and new videos will be added as they become available.

    Content

    There is one main zip file available for download. The zip file contains 94 files. 1) 47 video files 2) 47 ground-truth annotated files which identifies locations where the vehicle of interest is in the frame. Each video file is labelled with the corresponding vehicle brand name, model, year, and color information.

    Terms and Conditions

    • Videos provided in this dataset are freely available for research and education purposes only. Please be sure to properly credit the authors by citing the article below.
    • Be sure to upvote this dataset if you find it useful by scrolling up and clicking the ^ sign at the top-right corner of the cover image of this page.
    • Be sure to blur out all plate numbers before publishing any of the contents available in this dataset.

    Acknowledgements

    Any publication using this database must reference to the following journal manuscript:

    Note: if the link is broken, please use http instead of https.

    In Chrome, use the steps recommended in the following website to view the webpage if it appears to be broken https://www.technipages.com/chrome-enabledisable-not-secure-warning

    VCoR dataset: https://www.kaggle.com/landrykezebou/vcor-vehicle-color-recognition-dataset VRiV dataset: https://www.kaggle.com/landrykezebou/vriv-vehicle-recognition-in-videos-dataset

    For any enquires regarding the VCoR dataset, contact: landrykezebou@gmail.com

  3. Z

    AIT Alert Data Set

    • data.niaid.nih.gov
    • zenodo.org
    Updated Oct 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Landauer, Max (2024). AIT Alert Data Set [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_8263180
    Explore at:
    Dataset updated
    Oct 14, 2024
    Dataset provided by
    Skopik, Florian
    Landauer, Max
    Wurzenberger, Markus
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains the AIT Alert Data Set (AIT-ADS), a collection of synthetic alerts suitable for evaluation of alert aggregation, alert correlation, alert filtering, and attack graph generation approaches. The alerts were forensically generated from the AIT Log Data Set V2 (AIT-LDSv2) and origin from three intrusion detection systems, namely Suricata, Wazuh, and AMiner. The data sets comprise eight scenarios, each of which has been targeted by a multi-step attack with attack steps such as scans, web application exploits, password cracking, remote command execution, privilege escalation, etc. Each scenario and attack chain has certain variations so that attack manifestations and resulting alert sequences vary in each scenario; this means that the data set allows to develop and evaluate approaches that compute similarities of attack chains or merge them into meta-alerts. Since only few benchmark alert data sets are publicly available, the AIT-ADS was developed to address common issues in the research domain of multi-step attack analysis; specifically, the alert data set contains many false positives caused by normal user behavior (e.g., user login attempts or software updates), heterogeneous alert formats (although all alerts are in JSON format, their fields are different for each IDS), repeated executions of attacks according to an attack plan, collection of alerts from diverse log sources (application logs and network traffic) and all components in the network (mail server, web server, DNS, firewall, file share, etc.), and labels for attack phases. For more information on how this alert data set was generated, check out our paper accompanying this data set [1] or our GitHub repository. More information on the original log data set, including a detailed description of scenarios and attacks, can be found in [2].

    The alert data set contains two files for each of the eight scenarios, and a file for their labels:

    _aminer.json contains alerts from AMiner IDS

    _wazuh.json contains alerts from Wazuh IDS and Suricata IDS

    labels.csv contains the start and end times of attack phases in each scenario

    Beside false positive alerts, the alerts in the AIT-ADS correspond to the following attacks:

    Scans (nmap, WPScan, dirb)

    Webshell upload (CVE-2020-24186)

    Password cracking (John the Ripper)

    Privilege escalation

    Remote command execution

    Data exfiltration (DNSteal) and stopped service

    The total number of alerts involved in the data set is 2,655,821, of which 2,293,628 origin from Wazuh, 306,635 origin from Suricata, and 55,558 origin from AMiner. The numbers of alerts in each scenario are as follows. fox: 473,104; harrison: 593,948; russellmitchell: 45,544; santos: 130,779; shaw: 70,782; wardbeck: 91,257; wheeler: 616,161; wilson: 634,246.

    Acknowledgements: Partially funded by the European Defence Fund (EDF) projects AInception (101103385) and NEWSROOM (101121403), and the FFG project PRESENT (FO999899544). Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union. The European Union cannot be held responsible for them.

    If you use the AIT-ADS, please cite the following publications:

    [1] Landauer, M., Skopik, F., Wurzenberger, M. (2024): Introducing a New Alert Data Set for Multi-Step Attack Analysis. Proceedings of the 17th Cyber Security Experimentation and Test Workshop. [PDF]

    [2] Landauer M., Skopik F., Frank M., Hotwagner W., Wurzenberger M., Rauber A. (2023): Maintainable Log Datasets for Evaluation of Intrusion Detection Systems. IEEE Transactions on Dependable and Secure Computing, vol. 20, no. 4, pp. 3466-3482. [PDF]

  4. r

    Round Rock Traffic Counts Viewer by Year

    • geohub.roundrocktexas.gov
    Updated Jun 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Round Rock (2022). Round Rock Traffic Counts Viewer by Year [Dataset]. https://geohub.roundrocktexas.gov/datasets/CORR::round-rock-traffic-counts-viewer-by-year
    Explore at:
    Dataset updated
    Jun 16, 2022
    Dataset authored and provided by
    City of Round Rock
    Area covered
    Round Rock
    Description

    This web app contains the data for the traffic counts for the years 2016 through 2022 for the Transportation department in the City of Round Rock, located in Williamson County, Texas. This layer is part of an original dataset provided and maintained by the City of Round Rock GIS/IT Department and the Transportation Department.The data in this layer are represented as points and polygons.The web map connected to this web app can be found on Round Rock Traffic CountsThis time enabled map shows the traffic counts for for traffic zones within the city of Round Rock. The time sliding application within this map allows us to see the traffic counts by AADT each year between 2016 and 2022.

  5. S

    Snow Route Parking Restrictions

    • splitgraph.com
    • data.cityofchicago.org
    • +2more
    Updated Dec 6, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Chicago (2021). Snow Route Parking Restrictions [Dataset]. https://www.splitgraph.com/cityofchicago/snow-route-parking-restrictions-i6k4-giaj/
    Explore at:
    application/openapi+json, application/vnd.splitgraph.image, jsonAvailable download formats
    Dataset updated
    Dec 6, 2021
    Dataset authored and provided by
    City of Chicago
    Description

    No parking is allowed on Snow Routes when there is two or more inches on the ground regardless of date or time.For more information, please see https://www.chicago.gov/city/en/depts/streets/provdrs/traffic/svcs/winter-snow-parking-restrictions.html.

    ​​​​​This dataset is in a forma​​t for spatial datasets that is inherently tabular but allows for a map as a derived view. Please click the indicated link below for such a map.

    To export the data in either tabular or geographic format, please use the Export button on this dataset.

    Splitgraph serves as an HTTP API that lets you run SQL queries directly on this data to power Web applications. For example:

    See the Splitgraph documentation for more information.

  6. h

    ai-waf-dataset

    • huggingface.co
    Updated May 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MUNEEB UR RAHMAN (2025). ai-waf-dataset [Dataset]. https://huggingface.co/datasets/muneeburrahman/ai-waf-dataset
    Explore at:
    Dataset updated
    May 18, 2025
    Authors
    MUNEEB UR RAHMAN
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Synthetic HTTP Requests Dataset for AI WAF Training

    This dataset is synthetically generated and contains a diverse set of HTTP requests, labeled as either 'benign' or 'malicious'. It is designed for training and evaluating Web Application Firewalls (WAFs), particularly those based on AI/ML models. The dataset aims to provide a comprehensive collection of both common and sophisticated attack vectors, alongside a wide array of legitimate traffic patterns.

      Dataset… See the full description on the dataset page: https://huggingface.co/datasets/muneeburrahman/ai-waf-dataset.
    
  7. Mobile internet penetration in Europe 2024, by country

    • statista.com
    Updated Feb 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2025). Mobile internet penetration in Europe 2024, by country [Dataset]. https://www.statista.com/topics/779/mobile-internet/
    Explore at:
    Dataset updated
    Feb 5, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Description

    Switzerland is leading the ranking by population share with mobile internet access , recording 95.06 percent. Following closely behind is Ukraine with 95.06 percent, while Moldova is trailing the ranking with 46.83 percent, resulting in a difference of 48.23 percentage points to the ranking leader, Switzerland. The penetration rate refers to the share of the total population having access to the internet via a mobile broadband connection.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).

  8. Mobile internet usage reach in North America 2020-2029

    • statista.com
    Updated Feb 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2025). Mobile internet usage reach in North America 2020-2029 [Dataset]. https://www.statista.com/topics/779/mobile-internet/
    Explore at:
    Dataset updated
    Feb 5, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Description

    The population share with mobile internet access in North America was forecast to increase between 2024 and 2029 by in total 2.9 percentage points. This overall increase does not happen continuously, notably not in 2028 and 2029. The mobile internet penetration is estimated to amount to 84.21 percent in 2029. Notably, the population share with mobile internet access of was continuously increasing over the past years.The penetration rate refers to the share of the total population having access to the internet via a mobile broadband connection.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the population share with mobile internet access in countries like Caribbean and Europe.

  9. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Gordon, isom5240-td-application-traffic-analysis [Dataset]. https://huggingface.co/datasets/slliac/isom5240-td-application-traffic-analysis

isom5240-td-application-traffic-analysis

slliac/isom5240-td-application-traffic-analysis

Explore at:
Authors
Gordon
Description

Split:

application: 38 samples

  Class Distribution:

car (ID: 3): 67 (46.2%) motorcycle (ID: 4): 14 (9.7%) airplane (ID: 5): 62 (42.8%) truck (ID: 8): 2 (1.4%)

  Annotation Files:

Latest: application/application_labels.json Timestamped: application/application_labels_20250309_212205.json

  Split:

application: 49 samples

  Class Distribution:

car (ID: 3): 120 (60.0%) motorcycle (ID: 4): 14 (7.0%) airplane (ID: 5): 62 (31.0%) truck (ID: 8):… See the full description on the dataset page: https://huggingface.co/datasets/slliac/isom5240-td-application-traffic-analysis.

Search
Clear search
Close search
Google apps
Main menu