8 datasets found

h
isom5240-td-application-traffic-analysis
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gordon, isom5240-td-application-traffic-analysis [Dataset]. https://huggingface.co/datasets/slliac/isom5240-td-application-traffic-analysis
Explore at:
Authors
Gordon
Description
Split:

application: 38 samples

Class Distribution:

car (ID: 3): 67 (46.2%) motorcycle (ID: 4): 14 (9.7%) airplane (ID: 5): 62 (42.8%) truck (ID: 8): 2 (1.4%)

Annotation Files:

Latest: application/application_labels.json Timestamped: application/application_labels_20250309_212205.json

Split:

application: 49 samples

Class Distribution:

car (ID: 3): 120 (60.0%) motorcycle (ID: 4): 14 (7.0%) airplane (ID: 5): 62 (31.0%) truck (ID: 8):… See the full description on the dataset page: https://huggingface.co/datasets/slliac/isom5240-td-application-traffic-analysis.
VRiV (Vehicle Recognition in Videos) Dataset
kaggle.com
zip
Updated Dec 5, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Landry KEZEBOU (2021). VRiV (Vehicle Recognition in Videos) Dataset [Dataset]. https://www.kaggle.com/landrykezebou/vriv-vehicle-recognition-in-videos-dataset
Explore at:
zip(2383870377 bytes)Available download formats
Dataset updated
Dec 5, 2021
Authors
Landry KEZEBOU
Description
Context

The concept of searching and localizing vehicles from live traffic videos based on descriptive textual input has yet to be explored in the scholarly literature. Endowing Intelligent Transportation Systems (ITS) with such a capability could help solve crimes on roadways. While artificial intelligence (AI) can be a powerful tool for this data intensive application, existing state-of-the-art AI models struggle with fine-grain vehicle recognition. Typically, only reporting model performance on still input image data, often captured at high resolution and at pristine quality. These settings are not reflective of real-world operating conditions and thus, recognition accuracies typically cannot be replicated on video data. One major impediment to the advancement of fine-grain vehicle recognition models is the lack of video testbench datasets with annotated ground-truth data. Additionally, to the best of our knowledge, no metrics currently exist for evaluating the robustness and performance efficiency of a vehicle recognition model on live videos, and even less so for vehicle search and localization models. In this paper, we address these challenges by proposing V-Localize, a novel artificial intelligence framework for vehicle search and continuous localization captured from live traffic videos based on input textual descriptions. An efficient hashgraph algorithm is introduced to process input text (such as a sentence, paragraph, or report) to extract detailed target information used to query the recognition and localization model. This work further introduces two novel datasets that will help advance AI research in these challenging areas. These datasets include: a) the most diverse and large-scale Vehicle Color Recognition (VCoR) dataset with 15 colors classes -- twice as many as the number of color classes in the largest existing such dataset -- to facilitate finer-grain recognition with color information; and b) a Vehicle Recognition in Video (VRiV) dataset, which is a first of its kind video test-bench dataset for evaluating the performance of vehicle recognition models in live videos rather than still image data. The VRiV dataset will open new avenues for AI researchers to investigate innovative approaches that were previously intractable due to the lack of a traffic vehicle recognition annotated test-bench video dataset. Finally, to address the gap in the field, 5 novel metrics are introduced in this paper for adequately accessing the performance of vehicle recognition models in live videos. Ultimately, the proposed metrics could also prove intuitively effective at quantitative model evaluation in other video recognition applications. The novel metrics and VRiV test-bench dataset introduced in this paper are specifically aimed at advancing state-of-the-art research for vehicle recognition in videos. Likewise, the proposed novel vehicle search and continuous localization framework could prove assistive in cases such as of amber alerts or hit-and-run incidents. One major advantage of the proposed system is that it can be integrated into intelligent transportation system software to help aid law-enforcement.

Image Acquisition

The proposed Vehicle Recognition in Video (VRiV) dataset is the first of its kind and is aimed at developing, improving, and analyzing performance of vehicle search and recognition models on live videos. The lack of such a dataset has limited performance analysis of modern fine-grain vehicle recognition systems to only still image input data, making them less suitable for video applications. The VRiV dataset is introduced to help bridge this gap and foster research in this direction. The proposed VRiV dataset consists of up to 47 video sequences averaging about 38.5 seconds per video. The videos are recorded in a traffic setting focusing on vehicles of volunteer candidates whose ground truth make, model, year and color information are known. For security reasons and safety of participants, experiments are conducted on streets/road with low traffic density. For each video, there is a target vehicle with known ground truth information, and there are other vehicles either moving in traffic or parked on side streets, to simulate real-world traffic scenario. The goal is for the algorithm to be able to search, recognize and continuously localize just the specific target vehicle of interest for the corresponding video based on the search query. It is worth noting that the ground truth information about other vehicles in the videos are not known. The 47 videos in the testbench dataset are distributed across 7 distinct makes and 17 model designs as shown in Figure 10. The videos are also annotated to include ground truth bounding boxes for the specific target vehicles in corresponding videos. The dataset includes more than 46k annotated frames averaging about 920 frames per video. This dataset will be made available on Kaggle, and new videos will be added as they become available.

Content

There is one main zip file available for download. The zip file contains 94 files. 1) 47 video files 2) 47 ground-truth annotated files which identifies locations where the vehicle of interest is in the frame. Each video file is labelled with the corresponding vehicle brand name, model, year, and color information.

Terms and Conditions

Videos provided in this dataset are freely available for research and education purposes only. Please be sure to properly credit the authors by citing the article below.

Be sure to upvote this dataset if you find it useful by scrolling up and clicking the ^ sign at the top-right corner of the cover image of this page.

Be sure to blur out all plate numbers before publishing any of the contents available in this dataset.

Acknowledgements

Any publication using this database must reference to the following journal manuscript:

Panetta, Karen, Landry Kezebou, Victor Oludare, James Intriligator, and Sos Agaian. 2021. "Artificial Intelligence for Text-Based Vehicle Search, Recognition, and Continuous Localization in Traffic Videos" AI 2, no. 4: 684-704. https://doi.org/10.3390/ai2040041

open access : https://www.mdpi.com/2673-2688/2/4/41

Note: if the link is broken, please use http instead of https.

In Chrome, use the steps recommended in the following website to view the webpage if it appears to be broken https://www.technipages.com/chrome-enabledisable-not-secure-warning

VCoR dataset: https://www.kaggle.com/landrykezebou/vcor-vehicle-color-recognition-dataset VRiV dataset: https://www.kaggle.com/landrykezebou/vriv-vehicle-recognition-in-videos-dataset

For any enquires regarding the VCoR dataset, contact: landrykezebou@gmail.com
Z
AIT Alert Data Set
data.niaid.nih.gov
zenodo.org
Updated Oct 14, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Landauer, Max (2024). AIT Alert Data Set [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_8263180
Explore at:
Dataset updated
Oct 14, 2024
Dataset provided by
Skopik, Florian
Landauer, Max
Wurzenberger, Markus
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This repository contains the AIT Alert Data Set (AIT-ADS), a collection of synthetic alerts suitable for evaluation of alert aggregation, alert correlation, alert filtering, and attack graph generation approaches. The alerts were forensically generated from the AIT Log Data Set V2 (AIT-LDSv2) and origin from three intrusion detection systems, namely Suricata, Wazuh, and AMiner. The data sets comprise eight scenarios, each of which has been targeted by a multi-step attack with attack steps such as scans, web application exploits, password cracking, remote command execution, privilege escalation, etc. Each scenario and attack chain has certain variations so that attack manifestations and resulting alert sequences vary in each scenario; this means that the data set allows to develop and evaluate approaches that compute similarities of attack chains or merge them into meta-alerts. Since only few benchmark alert data sets are publicly available, the AIT-ADS was developed to address common issues in the research domain of multi-step attack analysis; specifically, the alert data set contains many false positives caused by normal user behavior (e.g., user login attempts or software updates), heterogeneous alert formats (although all alerts are in JSON format, their fields are different for each IDS), repeated executions of attacks according to an attack plan, collection of alerts from diverse log sources (application logs and network traffic) and all components in the network (mail server, web server, DNS, firewall, file share, etc.), and labels for attack phases. For more information on how this alert data set was generated, check out our paper accompanying this data set [1] or our GitHub repository. More information on the original log data set, including a detailed description of scenarios and attacks, can be found in [2].

The alert data set contains two files for each of the eight scenarios, and a file for their labels:

_aminer.json contains alerts from AMiner IDS

_wazuh.json contains alerts from Wazuh IDS and Suricata IDS

labels.csv contains the start and end times of attack phases in each scenario

Beside false positive alerts, the alerts in the AIT-ADS correspond to the following attacks:

Scans (nmap, WPScan, dirb)

Webshell upload (CVE-2020-24186)

Password cracking (John the Ripper)

Privilege escalation

Remote command execution

Data exfiltration (DNSteal) and stopped service

The total number of alerts involved in the data set is 2,655,821, of which 2,293,628 origin from Wazuh, 306,635 origin from Suricata, and 55,558 origin from AMiner. The numbers of alerts in each scenario are as follows. fox: 473,104; harrison: 593,948; russellmitchell: 45,544; santos: 130,779; shaw: 70,782; wardbeck: 91,257; wheeler: 616,161; wilson: 634,246.

Acknowledgements: Partially funded by the European Defence Fund (EDF) projects AInception (101103385) and NEWSROOM (101121403), and the FFG project PRESENT (FO999899544). Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union. The European Union cannot be held responsible for them.

If you use the AIT-ADS, please cite the following publications:

[1] Landauer, M., Skopik, F., Wurzenberger, M. (2024): Introducing a New Alert Data Set for Multi-Step Attack Analysis. Proceedings of the 17th Cyber Security Experimentation and Test Workshop. [PDF]

[2] Landauer M., Skopik F., Frank M., Hotwagner W., Wurzenberger M., Rauber A. (2023): Maintainable Log Datasets for Evaluation of Intrusion Detection Systems. IEEE Transactions on Dependable and Secure Computing, vol. 20, no. 4, pp. 3466-3482. [PDF]
r
Round Rock Traffic Counts Viewer by Year
geohub.roundrocktexas.gov
Updated Jun 16, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Round Rock (2022). Round Rock Traffic Counts Viewer by Year [Dataset]. https://geohub.roundrocktexas.gov/datasets/CORR::round-rock-traffic-counts-viewer-by-year
Explore at:
Dataset updated
Jun 16, 2022
Dataset authored and provided by
City of Round Rock
Area covered
Round Rock
Description
This web app contains the data for the traffic counts for the years 2016 through 2022 for the Transportation department in the City of Round Rock, located in Williamson County, Texas. This layer is part of an original dataset provided and maintained by the City of Round Rock GIS/IT Department and the Transportation Department.The data in this layer are represented as points and polygons.The web map connected to this web app can be found on Round Rock Traffic CountsThis time enabled map shows the traffic counts for for traffic zones within the city of Round Rock. The time sliding application within this map allows us to see the traffic counts by AADT each year between 2016 and 2022.
S
Snow Route Parking Restrictions
splitgraph.com
data.cityofchicago.org
+2more
Updated Dec 6, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Chicago (2021). Snow Route Parking Restrictions [Dataset]. https://www.splitgraph.com/cityofchicago/snow-route-parking-restrictions-i6k4-giaj/
Explore at:
application/openapi+json, application/vnd.splitgraph.image, jsonAvailable download formats
Dataset updated
Dec 6, 2021
Dataset authored and provided by
City of Chicago
Description
No parking is allowed on Snow Routes when there is two or more inches on the ground regardless of date or time.For more information, please see https://www.chicago.gov/city/en/depts/streets/provdrs/traffic/svcs/winter-snow-parking-restrictions.html.

This dataset is in a format for spatial datasets that is inherently tabular but allows for a map as a derived view. Please click the indicated link below for such a map.

To export the data in either tabular or geographic format, please use the Export button on this dataset.

Splitgraph serves as an HTTP API that lets you run SQL queries directly on this data to power Web applications. For example:

See the Splitgraph documentation for more information.
h
ai-waf-dataset
huggingface.co
Updated May 18, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MUNEEB UR RAHMAN (2025). ai-waf-dataset [Dataset]. https://huggingface.co/datasets/muneeburrahman/ai-waf-dataset
Explore at:
Dataset updated
May 18, 2025
Authors
MUNEEB UR RAHMAN
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Synthetic HTTP Requests Dataset for AI WAF Training

This dataset is synthetically generated and contains a diverse set of HTTP requests, labeled as either 'benign' or 'malicious'. It is designed for training and evaluating Web Application Firewalls (WAFs), particularly those based on AI/ML models. The dataset aims to provide a comprehensive collection of both common and sophisticated attack vectors, alongside a wide array of legitimate traffic patterns.

Dataset… See the full description on the dataset page: https://huggingface.co/datasets/muneeburrahman/ai-waf-dataset.
Mobile internet penetration in Europe 2024, by country
statista.com
Updated Feb 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista Research Department (2025). Mobile internet penetration in Europe 2024, by country [Dataset]. https://www.statista.com/topics/779/mobile-internet/
Explore at:
Dataset updated
Feb 5, 2025
Dataset provided by
Statistahttp://statista.com/
Authors
Statista Research Department
Description
Switzerland is leading the ranking by population share with mobile internet access , recording 95.06 percent. Following closely behind is Ukraine with 95.06 percent, while Moldova is trailing the ranking with 46.83 percent, resulting in a difference of 48.23 percentage points to the ranking leader, Switzerland. The penetration rate refers to the share of the total population having access to the internet via a mobile broadband connection.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).
Mobile internet usage reach in North America 2020-2029
statista.com
Updated Feb 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista Research Department (2025). Mobile internet usage reach in North America 2020-2029 [Dataset]. https://www.statista.com/topics/779/mobile-internet/
Explore at:
Dataset updated
Feb 5, 2025
Dataset provided by
Statistahttp://statista.com/
Authors
Statista Research Department
Description
The population share with mobile internet access in North America was forecast to increase between 2024 and 2029 by in total 2.9 percentage points. This overall increase does not happen continuously, notably not in 2028 and 2029. The mobile internet penetration is estimated to amount to 84.21 percent in 2029. Notably, the population share with mobile internet access of was continuously increasing over the past years.The penetration rate refers to the share of the total population having access to the internet via a mobile broadband connection.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the population share with mobile internet access in countries like Caribbean and Europe.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Gordon, isom5240-td-application-traffic-analysis [Dataset]. https://huggingface.co/datasets/slliac/isom5240-td-application-traffic-analysis

isom5240-td-application-traffic-analysis

slliac/isom5240-td-application-traffic-analysis

Explore at:

Authors

Gordon

Description

Split:

application: 38 samples

  Class Distribution:

car (ID: 3): 67 (46.2%) motorcycle (ID: 4): 14 (9.7%) airplane (ID: 5): 62 (42.8%) truck (ID: 8): 2 (1.4%)

  Annotation Files:

Latest: application/application_labels.json Timestamped: application/application_labels_20250309_212205.json

  Split:

application: 49 samples

  Class Distribution:

car (ID: 3): 120 (60.0%) motorcycle (ID: 4): 14 (7.0%) airplane (ID: 5): 62 (31.0%) truck (ID: 8):… See the full description on the dataset page: https://huggingface.co/datasets/slliac/isom5240-td-application-traffic-analysis.

Clear search

Close search

Google apps

Main menu

isom5240-td-application-traffic-analysis

VRiV (Vehicle Recognition in Videos) Dataset

Context

Image Acquisition

Content

Terms and Conditions

Acknowledgements

AIT Alert Data Set

Round Rock Traffic Counts Viewer by Year

Snow Route Parking Restrictions

ai-waf-dataset

Mobile internet penetration in Europe 2024, by country

Mobile internet usage reach in North America 2020-2029

isom5240-td-application-traffic-analysis

slliac/isom5240-td-application-traffic-analysis