46 datasets found

o
Traffic Flow and Incident Data
opendatabay.com
.undefined
Updated May 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Synthetic IDD (2025). Traffic Flow and Incident Data [Dataset]. https://www.opendatabay.com/data/synthetic/edbd578a-d940-4c06-8ce6-4a11d7bba766
Explore at:
.undefinedAvailable download formats
Dataset updated
May 30, 2025
Dataset authored and provided by
Synthetic IDD
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
Transportation
Description
Synthetic Traffic Flow and Incident Data dataset by Synthetic IDD offers a comprehensive collection of synthetic traffic metrics and incident reports from around the globe. With 100 million lines of data, this dataset provides an extensive resource for researchers, urban planners, and developers interested in understanding traffic patterns, congestion points, and incident occurrences.

Usage:

This dataset can be used for a variety of purposes, including but not limited to: - Analyzing traffic patterns and congestion hotspots globally - Building predictive models for traffic management and incident prediction - Researching the impact of road conditions and incidents on traffic flow - Developing applications for real-time traffic monitoring and navigation

License:

The dataset is provided under the CC0 (Public Domain) license, allowing users to freely use, modify, and distribute the data without any restrictions.
S
Synthetic Traffic Generator Report
archivemarketresearch.com
doc, pdf, ppt
Updated Feb 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Archive Market Research (2025). Synthetic Traffic Generator Report [Dataset]. https://www.archivemarketresearch.com/reports/synthetic-traffic-generator-16603
Explore at:
ppt, pdf, docAvailable download formats
Dataset updated
Feb 10, 2025
Dataset authored and provided by
Archive Market Research
License
https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The global synthetic traffic generator market size was valued at USD 832 million in 2025 and is projected to reach USD 1,333 million by 2033, exhibiting a CAGR of 8.2% during the forecast period. The growth of the market is attributed to the increasing demand for network performance testing and system evaluation, network security, and the need for efficient and cost-effective testing solutions. The market is expected to be driven by the adoption of software-defined networking (SDN) and network function virtualization (NFV), which enable the creation of dynamic and scalable networks. Additionally, the growing awareness of the importance of network security and the need for comprehensive testing to identify and mitigate vulnerabilities is expected to further contribute to the market growth. The key players in the synthetic traffic generator market include Keysight Technologies, BittWare, SolarWinds Worldwide, LLC, NagleCode, LLC, Apposite Technologies, East Coast Datacom Inc, ostinato.org, EasyTrafficBot UG, SparkTraffic, Northwest Performance Software, Inc., among others. These players are focusing on developing innovative solutions to meet the evolving needs of their customers and to maintain their competitive advantage. Partnerships, acquisitions, and new product launches are some of the key strategies adopted by these players to expand their market presence. The market is also witnessing the emergence of new startups and small businesses, which are offering innovative and cost-effective solutions to cater to the diverse needs of customers.
i
Synthetic Traffic Generator Market Report
imrmarketreports.com
Updated May 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Swati Kalagate; Akshay Patil; Vishal Kumbhar (2025). Synthetic Traffic Generator Market Report [Dataset]. https://www.imrmarketreports.com/reports/synthetic-traffic-generator-market
Explore at:
Dataset updated
May 2025
Dataset provided by
IMR Market Reports
Authors
Swati Kalagate; Akshay Patil; Vishal Kumbhar
License
https://www.imrmarketreports.com/privacy-policy/https://www.imrmarketreports.com/privacy-policy/
Description
Synthetic Traffic Generator comes with extensive industry analysis of development components, patterns, flows, and sizes. The report calculates present and past market values to forecast potential market management during the forecast period between 2025 - 2033.
Data from: HIKARI-2021: Generating Network Intrusion Detection Dataset Based...
zenodo.org
explore.openaire.eu
+1more
bin, csv, zip
Updated Apr 16, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Andrey Ferriyan; Achmad Husni Thamrin; Keiji Takeda; Jun Murai; Andrey Ferriyan; Achmad Husni Thamrin; Keiji Takeda; Jun Murai (2022). HIKARI-2021: Generating Network Intrusion Detection Dataset Based on Real and Encrypted Synthetic Attack Traffic [Dataset]. http://doi.org/10.5281/zenodo.5111946
Explore at:
bin, csv, zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.5111946
Dataset updated
Apr 16, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Andrey Ferriyan; Achmad Husni Thamrin; Keiji Takeda; Jun Murai; Andrey Ferriyan; Achmad Husni Thamrin; Keiji Takeda; Jun Murai
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Available datasets from the paper Generating Encrypted Network Traffic for Intrusion Detection Datasets.

To produce the dataset follow the technical detail in github
e
Synset Signset UK: Synthetic image data set for traffic sign recognition
data.europa.eu
binary data
Updated Oct 19, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e. V. (2024). Synset Signset UK: Synthetic image data set for traffic sign recognition [Dataset]. https://data.europa.eu/data/datasets/773196217178836992/embed
Explore at:
binary dataAvailable download formats
Dataset updated
Oct 19, 2024
Dataset authored and provided by
Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e. V.
License
http://dcat-ap.de/def/licenses/cc-byhttp://dcat-ap.de/def/licenses/cc-by
Area covered
United Kingdom
Description
The Synset Signset Germany dataset contains a total of 211,000 synthetically generated images of current German traffic signs (including the 2020 update) for machine learning methods (ML) in the area of application (task) of traffic sign recognition.

The dataset contains 211 German traffic sign classes with 500 images each, and is divided into two sub-datasets, which were generated with different rendering engines. In addition to the classification annotations, the data set also contains label images for segmentation of traffic signs, binary masks, as well as extensive information on image and scene properties, in particular on image artifacts.

The dataset was presented in September 2024 by Anne Sielemann, Lena Lörcher, Max-Lion Schumacher, Stefan Wolf, Jens Ziehn, Masoud Roschani and Jürgen Beyerer in the publication: Sielemann, A., Loercher, L., Schumacher, M., Wolf, S., Roschani, M., Ziehn, J., and Beyerer, J. (2024). Synset Signset UK: A Synthetic Dataset for German Traffic Sign Recognition. In 2024 IEEE 27th International Conference on Intelligent Transportation Systems (ITSC).

The forms of traffic signs are based on the picture board of traffic signs in the Federal Republic of Germany since 2017 on Wikipedia (https://en.wikipedia.org/wiki/Bildtafel_der_Verkehrszeichen_in_der_Bundesrepublik_Deutschland_seit_2017).

The data was generated with the simulation environment OCTANE (www.octane.org). One subset uses the Cycles Raytracer of the Blender project (www.cycles-renderer.org), the other (otherwise identical) subset uses the 3D rasterization engine OGRE3D (www.ogre3d.org).

The dataset's website provides detailed information on the generation process and model assumptions. The dataset is therefore also intended to be used for the suitability analysis of simulated, synthetic datasets.

The data set was developed as part of the Fraunhofer PREPARE program in the "ML4Safety" project with the funding code PREPARE 40-02702, as well as funded by the "New Vehicle and System Technologies" funding program of the Federal Ministry for Economic Affairs and Climate Protection of the Federal Republic of Germany (BMWK) as part of the "AVEAS" research project (www.aveas.org).

The generative generation of textures with dirt and wear of the traffic signs was trained on real data of traffic signs, which was collected with the kind support of the Civil Engineering Office Karlsruhe.
Network Digital Twin-Generated Dataset for Machine Learning-based Detection...
zenodo.org
zip
Updated Jun 23, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Amit Karamchandani Batra; Amit Karamchandani Batra; Javier Nuñez Fuente; Luis de la Cal García; Luis de la Cal García; Yenny Moreno Meneses; Alberto Mozo Velasco; Alberto Mozo Velasco; Antonio Pastor Perales; Antonio Pastor Perales; Diego R. López; Diego R. López; Javier Nuñez Fuente; Yenny Moreno Meneses (2025). Network Digital Twin-Generated Dataset for Machine Learning-based Detection of Benign and Malicious Heavy Hitter Flows [Dataset]. http://doi.org/10.5281/zenodo.14841650
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.14841650
Dataset updated
Jun 23, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Amit Karamchandani Batra; Amit Karamchandani Batra; Javier Nuñez Fuente; Luis de la Cal García; Luis de la Cal García; Yenny Moreno Meneses; Alberto Mozo Velasco; Alberto Mozo Velasco; Antonio Pastor Perales; Antonio Pastor Perales; Diego R. López; Diego R. López; Javier Nuñez Fuente; Yenny Moreno Meneses
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jul 11, 2024
Description
Overview

This record provides a dataset created as part of the study presented in the following publication and is made publicly available for research purposes. The associated article provides a comprehensive description of the dataset, its structure, and the methodology used in its creation. If you use this dataset, please cite the following article published in the journal IEEE Communications Magazine:

A. Karamchandani, J. Nunez, L. de-la-Cal, Y. Moreno, A. Mozo, and A. Pastor, “On the Applicability of Network Digital Twins in Generating Synthetic Data for Heavy Hitter Discrimination,” IEEE Communications Magazine, pp. 2–8, 2025, DOI: 10.1109/MCOM.003.2400648.

More specifically, the record contains several synthetic datasets generated to differentiate between benign and malicious heavy hitter flows within a realistic virtualized network environment. Heavy Hitter flows, which include high-volume data transfers, can significantly impact network performance, leading to congestion and degraded quality of service. Distinguishing legitimate heavy hitter activity from malicious Distributed Denial-of-Service traffic is critical for network management and security, yet existing datasets lack the granularity needed for training machine learning models to effectively make this distinction.

To address this, a Network Digital Twin (NDT) approach was utilized to emulate realistic network conditions and traffic patterns, enabling automated generation of labeled data for both benign and malicious HH flows alongside regular traffic.

Feature Set:

The feature set includes the following flow statistics commonly used in the literature on network traffic classification:

The protocol used for the connection, identifying whether it is TCP, UDP, ICMP, or OSPF.

The time (relative to the connection start) of the most recent packet sent from source to destination at the time of each snapshot.

The time (relative to the connection start) of the most recent packet sent from destination to source at the time of each snapshot.

The cumulative count of data packets sent from source to destination at the time of each snapshot.

The cumulative count of data packets sent from destination to source at the time of each snapshot.

The cumulative bytes sent from source to destination at the time of each snapshot.

The cumulative bytes sent from destination to source at the time of each snapshot.

The time difference between the first packet sent from source to destination and the first packet sent from destination to source.

Dataset Variations:

To accommodate diverse research needs and scenarios, the dataset is provided in the following variations:

All at Once:

Contains a synthetic dataset where all traffic types, including benign, normal, and malicious DDoS heavy hitter (HH) flows, are combined into a single dataset.

This version represents a holistic view of the traffic environment, simulating real-world scenarios where all traffic occurs simultaneously.

Balanced Traffic Generation:

Represents a balanced traffic dataset with an equal proportion of benign, normal, and malicious DDoS traffic.

Designed for scenarios where a balanced dataset is needed for fair training and evaluation of machine learning models.

DDoS at Intervals:

Contains traffic data where malicious DDoS HH traffic occurs at specific time intervals, mimicking real-world attack patterns.

Useful for studying the impact and detection of intermittent malicious activities.

Only Benign HH Traffic:

Includes only benign HH traffic flows.

Suitable for training and evaluating models to identify and differentiate benign heavy hitter traffic patterns.

Only DDoS Traffic:

Contains only malicious DDoS HH traffic.

Helps in isolating and analyzing attack characteristics for targeted threat detection.

Only Normal Traffic:

Comprises only regular, non-HH traffic flows.

Useful for understanding baseline network behavior in the absence of heavy hitters.

Unbalanced Traffic Generation:

Features an unbalanced dataset with varying proportions of benign, normal, and malicious traffic.

Simulates real-world scenarios where certain types of traffic dominate, providing insights into model performance in unbalanced conditions.

For each variation, the output of the different packet aggregators is provided separated in its respective folder.

Each variation was generated using the NDT approach to demonstrate its flexibility and ensure the reproducibility of our study's experiments, while also contributing to future research on network traffic patterns and the detection and classification of heavy hitter traffic flows. The dataset is designed to support research in network security, machine learning model development, and applications of digital twin technology.
Z
Traffic Detection Datasets
data.niaid.nih.gov
zenodo.org
Updated Jan 28, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sottovia, Paolo (2021). Traffic Detection Datasets [Dataset]. https://data.niaid.nih.gov/resources?id=ZENODO_4471356
Explore at:
Dataset updated
Jan 28, 2021
Dataset provided by
Foroni, Daniele
Maropaki, Stella
Bortoli, Stefano
Sottovia, Paolo
Description
Public (anonymized) road traffic detection datasets from Huawei Munich Research Center.

Traffic detection datasets from a variety of traffic sensors (i.e. induction loops). The data is useful for traffic indexing, dominant flows detection, forecasting traffic patterns, and adjusting stop-light control parameters, i.e. cycle length, offset and split times.

There are three datasets:

RM is a real-life dataset collected in the period of March 2020 from traffic detectors in an area of a Chinese city. The detectors were located in 6 intersections monitoring the traffic on 44 road edges. Each detector was collecting data every 1 second for all the road edges in its radius. In total, 2894174 detections were collected, from 337089 different vehicles.

COM is a synthetic dataset that generated in the same road network as the real-life data RM and RD, with the difference that we included detectors in the 2 intersections where the real scenario didn’t have. Then, we generated equally random trips over the road network. In total we generated 18910 detections from 7500 vehicles.

GRID was created using the SUMO Simulation of Urban Mobility [2]. We randomly generated trips on a 10𝑥10 intersections grid road network using a utility from SUMO that equally generates trips over the road network. Then, running the simulation and using TraCI Traffic Control Interface library [3] we read the simulation data and collect the detections. The simulation collected data include 1806141 detections from 135618 different vehicles.

The datasets were used in the Querying Top-k Dominant Traffic Flows on Large Urban Road Networks [1] paper.

[1] Stella Maropaki, Paolo Sottovia, and Stefano Bortoli. 2021. Querying Top-k Dominant Traffic Flows on Large Urban Road Networks. In 2021 24th International Conference on Extending Database Technology (EDBT).
g
Synset Signset UK: Synthetic image data set for traffic sign recognition
gimi9.com
Updated Dec 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Synset Signset UK: Synthetic image data set for traffic sign recognition [Dataset]. https://gimi9.com/dataset/eu_773196217178836992
Explore at:
Dataset updated
Dec 2, 2024
Area covered
United Kingdom
Description
🇩🇪 독일
Z
Synthetic Aircraft Trajectory Dataset
data.niaid.nih.gov
zenodo.org
Updated Sep 21, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Murad, Abdulmajid (2024). Synthetic Aircraft Trajectory Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_13767131
Explore at:
Dataset updated
Sep 21, 2024
Dataset provided by
Ruocco, Massimiliano
Murad, Abdulmajid
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset comprises synthetically generated aircraft trajectories for multiple specific airport pairs across Europe. Generated using advanced machine learning techniques, the dataset includes high-resolution spatial and temporal information for each trajectory. It features key flight parameters such as latitude, longitude, altitude, and time, along with synthetic identifiers for each flight. This dataset is ideal for air traffic management research, flight path analysis, and the development of predictive models in aviation.
Traffic Congestion Prediction
kaggle.com
Updated Apr 3, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Şahide ŞEKER (2025). Traffic Congestion Prediction [Dataset]. https://www.kaggle.com/datasets/sahideseker/traffic-congestion-prediction
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 3, 2025
Dataset provided by
Kaggle
Authors
Şahide ŞEKER
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
🇬🇧 English:

This synthetic dataset provides location-based traffic congestion levels on an hourly basis over the last 30 days. It can be used to train time series models like LSTM and XGBoost to forecast traffic intensity.

Use this dataset to:

Train time series models to predict congestion levels

Analyze traffic patterns based on location and time

Develop AI-powered traffic management systems

Features:

location: Name of the location or neighborhood

date: Date in YYYY-MM-DD format

time: Hour of the day (e.g., 08:00)

congestion_level: Congestion score between 0 (low) and 10 (high)

🇹🇷 Türkçe:

Bu sentetik veri seti, son 30 güne ait saatlik trafik yoğunluğu bilgilerini lokasyon bazlı olarak sunar. Trafik yoğunluğunu tahmin etmeye yönelik zaman serisi modellerinin eğitimi için uygundur.

Bu veri seti ile:

LSTM ve XGBoost gibi modellerle trafik tahmini yapılabilir

Lokasyon ve saate göre trafik analizi yapılabilir

Trafik yönetim sistemleri geliştirilebilir

Özellikler:

location: Lokasyon adı

date: Tarih bilgisi (YYYY-MM-DD)

time: Günün saati (örn. 08:00)

congestion_level: 0 (düşük) ile 10 (yüksek) arasında trafik yoğunluğu skoru
Grand Traffic Auto (GTA) Dataset: a Collection of Synthetic Images for...
zenodo.org
Updated May 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ciampi Luca; Ciampi Luca; Santiago Carlos; Santiago Carlos; Costeira Joao Paulo; Costeira Joao Paulo; Claudio Gennaro Claudio; Claudio Gennaro Claudio; Amato Giuseppe; Amato Giuseppe (2025). Grand Traffic Auto (GTA) Dataset: a Collection of Synthetic Images for Vehicle Detection, Segmentation and Counting [Dataset]. http://doi.org/10.5281/zenodo.6560038
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.6560038
Dataset updated
May 8, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Ciampi Luca; Ciampi Luca; Santiago Carlos; Santiago Carlos; Costeira Joao Paulo; Costeira Joao Paulo; Claudio Gennaro Claudio; Claudio Gennaro Claudio; Amato Giuseppe; Amato Giuseppe
License
Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
License information was derived automatically
Description
The Dataset

A synthetic dataset for vehicle detection, segmentation, and counting. It comprises images extracted from the highly photo-realistic video game Grand Theft Auto V developed by Rockstar North. Each image is labeled by the game engine providing pixel-wise masks and bounding boxes localizing vehicle instances.
The dataset includes about 10k synthetic images depicting several urban scenarios with various background scenes, lighting, camera positions, traffic densities, and weather conditions. In total, we labeled more than 411,000 vehicles.

We provide two annotation files:

coco_annotations.json --> JSON file that follows the golden standard MS COCO data format (for more info see https://cocodataset.org/#format-data). All the vehicles are labeled with the COCO category 'car'. It is suitable for vehicle detection and instance segmentation.

dot_annotations.csv --> CSV file that contains xy coordinates of the centroids of the vehicles. Dot annotation is commonly used for the visual counting task.

Citing our work

If you found this dataset useful, please cite the following paper

@inproceedings{Ciampi_visapp_2021, doi = {10.5220/0010303401850195}, url = {https://doi.org/10.5220%2F0010303401850195}, year = 2021, publisher = {{SCITEPRESS} - Science and Technology Publications}, author = {Luca Ciampi and Carlos Santiago and Joao Costeira and Claudio Gennaro and Giuseppe Amato}, title = {Domain Adaptation for Traffic Density Estimation}, booktitle = {Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications} }

and this Zenodo Dataset

@dataset{ciampi_gta_6560038, author={Luca Ciampi and Carlos Santiago and Joao Costeira and Claudio Gennaro and Giuseppe Amato}, title = {{Grand Traffic Auto (GTA) Dataset: a Collection of Synthetic Images for Vehicle Detection, Segmentation and Counting}}, month = may, year = 2022, publisher = {Zenodo}, version = {1.0.0}, doi = {10.5281/zenodo.6560038}, url = {https://doi.org/10.5281/zenodo.6560038} }

Contact Information

If you would like further information about the dataset or if you experience any issues downloading files, please contact us at luca.ciampi@isti.cnr.it
C
DBTR - Road (Synthetic route) - (STR_PS_GLI)
ckan.mobidatalab.eu
pdf, wms
Updated May 3, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GeoDatiGovIt RNDT (2023). DBTR - Road (Synthetic route) - (STR_PS_GLI) [Dataset]. https://ckan.mobidatalab.eu/dataset/dbtr-road-synthetic-route-str_ps_gli
Explore at:
wms, pdfAvailable download formats
Dataset updated
May 3, 2023
Dataset provided by
GeoDatiGovIt RNDT
Description
Road (Synthetic route)
h
syntra-experiment-dataset
huggingface.co
Updated Nov 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NovelSense UG (2023). syntra-experiment-dataset [Dataset]. http://doi.org/10.57967/hf/1350
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.57967/hf/1350
Dataset updated
Nov 16, 2023
Dataset authored and provided by
NovelSense UG
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
About

This is the SYNTRA Experiment Dataset. It is a sample dataset from the NovelSense SYNTRA EU Hubs 4 Data experiment (https://euhubs4data.eu/experiments/syntra/). The experiment supported the development of a web application reachable under https://syntra.app. The dataset is a synthetic traffic infrastructure dataset e.g. for use for the validation, trainig and optimization of your traffic AI models.

Datset description

The dataset has been created by generating 14… See the full description on the dataset page: https://huggingface.co/datasets/NovelSense/syntra-experiment-dataset.
C
DBTR2013 - Road (Synthetic course) - (STR_PS_GLI)
ckan.mobidatalab.eu
pdf, wms
Updated Apr 28, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GeoDatiGovIt RNDT (2023). DBTR2013 - Road (Synthetic course) - (STR_PS_GLI) [Dataset]. https://ckan.mobidatalab.eu/dataset/dbtr2013-road-synthetic-route-str_ps_gli
Explore at:
pdf, wmsAvailable download formats
Dataset updated
Apr 28, 2023
Dataset provided by
GeoDatiGovIt RNDT
Description
Road (Synthetic route)
Z
Open synthetic data on travel and charging demand of battery electric cars:...
data.niaid.nih.gov
Updated Feb 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tozluoglu, Caglar (2023). Open synthetic data on travel and charging demand of battery electric cars: An agent-based simulation on three charging behavior archetypes [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7549846
Explore at:
Dataset updated
Feb 9, 2023
Dataset provided by
Sprei, Frances
Liao, Yuan
Dhamal, Swapnil
Yeh, Sonia
Tozluoglu, Caglar
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Background

Battery electric vehicles (BEVs) are crucial for a sustainable transportation system. As more people adopt BEVs, it becomes increasingly important to accurately assess the demand for charging infrastructure. However, much of the current research on charging infrastructure relies on outdated assumptions, such as the assumption that all BEV owners have access to home chargers and the "Liquid-fuel" mental model. To address this issue, we simulate the travel and charging demand on three charging behavior archetypes. We use a large synthetic population of Sweden, including detailed individual characteristics, such as dwelling types (detached house vs. apartment) and activity plans (for an average weekday). This data repository aims to provide the BEV simulation's input, assumptions, and output so that other studies can use them to study sizing and location design of charging infrastructure, grid impact, etc.

A journal paper published in Transportation Research Part D: Transport and Environment details the method to create the data (particularly Section 2.2 BEV simulation).

https://doi.org/10.1016/j.trd.2023.103645

Methodology

This data product is centered on the 1.7 million inhabitants of the Västra Götaland (VG) region, which includes the second largest city in Sweden, Gothenburg. We specifically simulated 284,000 car agents who live in VG, representing 35% of all car users and 18% of the total population in the region. They spend their simulation day (representing an average weekday) in a variety of locations throughout Sweden.

This open data repository contains the core model inputs and outputs. The numbers in parentheses correspond to the data sets. We use individual agents' activity plans (1) and travel trajectories from MATSim simulation for the BEV simulation (2), in which we consider overnight charger access (3), car fleet composition referencing the current private car fleet in Sweden (4), and Swedish road network with slope information (5) with realistic BEV charging & discharging dynamics. For the BEV simulation, we tested ten scenarios of charging behavior archetypes and fast charging powers (6). The output includes the time history of travel trajectories and charging of the simulated BEVs across the different scenarios (7).

Data description

The current data product covers seven data files.

(1) Agents' experienced activity plans

File name: 1_activity_plans.csv

Column

Description

Data type

Unit

person

Agent ID

Integer

-

act_id

Activity index of each agent

Integer

-

deso

Zone code of Demographic statistical areas (DeSO)1

String

-

POINT_X

Coordinate X of activity location (SWEREF99TM)

Float

meter

POINT_Y

Coordinate Y of activity location (SWEREF99TM)

Float

meter

act_purpose

Activity purpose (work, home, other)

String

-

mode

Transport mode to reach the activity location (car)

String

-

dep_time

Departure time in decimal hour (0-23.99)

Float

hour

trav_time

Travel time to reach the activity location

String

hour:minute:second

trav_time_min

Travel time in decimal minute

Float

minute

speed

Travel speed to reach the activity location

Float

km/h

distance

Travel distance between the origin and the destination

Float

km

act_start

Start time of activity in minute (0-1439)

Integer

minute

act_time

Activity duration in decimal minute

Float

minute

act_end

End time of activity in decimal hour (0-23.99)

Float

hour

score

Utility score of the simulation day given by MATSim

Float

-

1 https://www.scb.se/vara-tjanster/oppna-data/oppna-geodata/deso--demografiska-statistikomraden/

(2) Travel trajectories

File name: 2_input_zip

Produced by MATSim simulation, the zip folder contains ten files (events_batch_X.csv.gz, X=1, 2, …, 10) of input events for the BEV simulation. They are the moving trajectories of the car agents in their simulation days.

Column

Description

Data type

Unit

time

Time in second in a simulation day (0-86399)

Integer

Second

type

Event type defined by MATSim simulation2

String

-

person

Agent ID

Integer

-

link

Nearest road link consistent with (5)

String

-

vehicle

Vehicle ID identical to person

Integer

-

2 One typical episode of MATSim simulation events: Activity ends (actend) -> Agent’s vehicle enters traffic (vehicle enters traffic) -> Agent’s vehicle moves from previous road segment to its next connected one (left link) -> Agent’s vehicle leaves traffic for activity (vehicle leaves traffic) -> Activity starts (actstart)

(3) Overnight charger access

File name: 3_home_charger_access.csv

Column

Description

Data type

Unit

person

Agent ID

Integer

-

home_charger

Whether an agent has access to a home garage charger/living in a detached house (0=no, 1=yes)

Integer

-

(4) Car fleet composition

File name: 4_car_fleet.csv

Column

Description

Data type

Unit

person

Agent ID

Integer

-

income_class

Income group (0=None, 1=below 180K, 2=180K-300K, 3=300K-420K, 4=above 420K)

Integer

-

car

Car model class (B=40 kWh, C=60 kWh, D=100 kWh)

String

-

(5) Road network with slope information

File name: 5_road_network_with_slope.shp (5 files in total)

Column

Description

Data type

Unit

length

The length of road link

Float

meter

freespeed

Free speed

Float

km/h

capacity

Number of vehicles

Integer

-

permlanes

Number of lanes

Integer

-

oneway

Whether the segment is one-way (0=no, 1=yes)

Integer

-

modes

Transport mode (car)

String

-

link_id

Link ID

String

-

from_node

Start node of the link

String

-

to_node

End node of the link

String

-

count

Aggregated traffic (number of cars travelled per day)

Integer

-

slope

Slope in percent from -6% to 6%

Float

-

geometry

LINESTRING (SWEREF99TM)

geometry

meter

(6) Simulation scenarios specifying the parameter sets

File name: 6_scenarios.txt

Parameter set

(paraset)

Strategy 1

Strategy 2

Strategy 3

Fast charging power (kW)

Minimum parking time for charging (min)

Intermediate charging power (kW)

0

0.2

0.2

0.9

150

5

22

1

0.2

0.2

0.9

50

5

22

2

0.3

0.3

0.9

150

5

22

3

0.3

0.3

0.9

50

5

22

(7) Time history of travel trajectories and charging of the simulated BEVs

File name: 7_output.zip

Produced by the BEV simulation, the zip folder contains four files (parasetX.csv.gz, X=1, 2, 3, 4) corresponding to the four parameter sets specified in (6). They are the moving trajectories of the car agents with simulated energy and charging time history in their simulation days.

Column

Description

Data type

Unit

person

Agent ID

Integer

-

home_charger

Whether an agent has access to a home garage charger/living in a detached house (0=no, 1=yes)

Integer

-

car

Car model class (B=40 kWh, C=60 kWh, D=100 kWh)

String

-

seq

Sequence ID of time history by agent

Integer

-

time

Time (0-86399)

Integer

Second

purpose

Valid for activities (home, work, school,
p
Traffic weaver: generator półsyntetycznego ruchu sieciowego na podstawie...
dona.pwr.edu.pl
Updated 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Piotr T Lechowicz; Aleksandra M Knapińska; Adam Włodarczyk; Krzysztof Walkowiak (2024). Traffic weaver: generator półsyntetycznego ruchu sieciowego na podstawie uśrednionych szeregów czasowych. [Dataset]. http://doi.org/10.1016/j.softx.2024.101946
Explore at:
Unique identifier
https://doi.org/10.1016/j.softx.2024.101946
Dataset updated
2024
Authors
Piotr T Lechowicz; Aleksandra M Knapińska; Adam Włodarczyk; Krzysztof Walkowiak
Description
Library of Wroclaw University of Science and Technology scientific output (DONA database)
f
Zeus-GameOver-synthetic-dataset
figshare.com
zip
Updated Jun 10, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dhruba Jyoti Borah (2023). Zeus-GameOver-synthetic-dataset [Dataset]. http://doi.org/10.6084/m9.figshare.20832001.v3
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.20832001.v3
Dataset updated
Jun 10, 2023
Dataset provided by
figshare
Authors
Dhruba Jyoti Borah
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
The repository contains a synthetic Zeus GameOver dataset generated in a testbed. This is a compressed file containing the Zeus GameOver botnet traffic flow simulation. A Zeus bot software was created by studying the characteristics of Zeus GameOver from technical reports "ZeuS-P2P monitoring and analysis", by CERT Polska, published in June 2013 (https://www.cert.pl/en/uploads/2015/12/2013-06-p2p-rap_en.pdf), as well as "An analysis of the Zeus peer-to-peer protocol" by Dennis Andriesse and Herbert Bos, technical report, VU University Amsterdam, The Netherlands, April 2014 (URL:https://syssec.mistakenot.net/papers/zeus-tech-report-2013.pdf). A testbed has been set up with 101 virtual hosts. Each host has a piece of bot software installed. The bots then communicate with one another. The network traffic was captured for 24 hours. tcpdump tool is used to capture the raw trafffic. The captured traffic is then used to generate netflow records using the nprobe tool. The source and destination IP addresses are then extracted from the resulting flow dataset. The dataset uploaded here is a text file. The file contains the communication information of the bot nodes. It has two fields: source IP address and destination IP address
t
Synset Boulevard: Synthetic image dataset for Vehicle Make and Model...
service.tib.eu
Updated Feb 5, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Synset Boulevard: Synthetic image dataset for Vehicle Make and Model Recognition (VMMR) - Vdataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/govdata_725679870677258240
Explore at:
Dataset updated
Feb 5, 2025
Description
The Synset Boulevard dataset contains a total of 259,200 synthetically generated images of cars from a frontal traffic camera perspective, annotated by vehicle makes, models and years of construction for machine learning methods (ML) in the scope (task) of vehicle make and model recognition (VMMR). The data set contains 162 vehicle models from 43 brands with 200 images each, as well as 8 sub-data sets each to be able to investigate different imaging qualities. In addition to the classification annotations, the data set also contains label images for semantic segmentation, as well as information on image and scene properties, as well as vehicle color. The dataset was presented in May 2024 by Anne Sielemann, Stefan Wolf, Masoud Roschani, Jens Ziehn and Jürgen Beyerer in the publication: Sielemann, A., Wolf, S., Roschani, M., Ziehn, J. and Beyerer, J. (2024). Synset Boulevard: A Synthetic Image Dataset for VMMR. In 2024 IEEE International Conference on Robotics and Automation (ICRA). The model information is based on information from the ADAC online database (www.adac.de/rund-ums-fahrzeug/autokatalog/marken-modelle). The data was generated using the simulation environment OCTANE (www.octane.org), which uses the Cycles ray tracer of the Blender project. The dataset's website provides detailed information on the generation process and model assumptions. The dataset is therefore also intended to be used for the suitability analysis of simulated, synthetic datasets. The data set was developed as part of the Fraunhofer PREPARE program in the "ML4Safety" project with the funding code PREPARE 40-02702, as well as funded by the "Invest BW" funding program of the Ministry of Economic Affairs, Labour and Tourism as part of the "FeinSyn" research project.
u
Analysis of zero-day attacks and ransomware
researchdata.up.ac.za
txt
Updated Feb 22, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mike Wa Nkongolo (2024). Analysis of zero-day attacks and ransomware [Dataset]. http://doi.org/10.25403/UPresearchdata.25215530.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.25403/UPresearchdata.25215530.v1
Dataset updated
Feb 22, 2024
Dataset provided by
University of Pretoria
Authors
Mike Wa Nkongolo
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Cybersecurity faces challenges in identifying and mitigating undefined network vulnerabilities, critical for preventing zero-day attacks. The absence of datasets for distinguishing normal versus abnormal network behavior hinders the development of proactive detection strategies. An obstacle in proactive prevention methods is the absence of comprehensive datasets for contrasting normal versus abnormal network behaviours. Such dataset enabling such contrasts would significantly expedite threat anomaly mitigation. The thesis "Ensemble learning and genetic algorithm for the detection of novel network threat anomaly using the UGRansome Dataset"; introduces UGRansome, a dataset for anomaly detection in network traffic. This dataset comprises a comprehensive set of malware features designed for detecting and quantifying zero-day attacks. It was created by integrating similar attributes from both the UGR'16 and ransomware datasets, following a process of development and validation. Malicious behavior is categorized into normal and abnormal patterns, further characterized through supervised learning techniques, which include anomaly, signature, and synthetic signature stratifications. Despite significant advancements in intrusion detection and prevention systems, the need for detecting and quantifying zero-day attacks, including ransomware, persists. Therefore, the development of a specialized analytical approach tailored for quantifying zero-day attacks within cybersecurity datasets is crucial to effectively address the evolving threat landscape posed by advanced persistent threats.
Z
CURE-TSR: Challenging Unreal and Real Environments for Traffic Sign...
data.niaid.nih.gov
explore.openaire.eu
+1more
Updated Jun 28, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mohit Prabhushankar (2020). CURE-TSR: Challenging Unreal and Real Environments for Traffic Sign Recognition [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3903065
Explore at:
Dataset updated
Jun 28, 2020
Dataset provided by
Dogancan Temel
Gukyeong Kwon
Ghassan AlRegib
Mohit Prabhushankar
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
As one of the research directions at OLIVES Lab @ Georgia Tech, we focus on the robustness of data-driven algorithms under diverse challenging conditions where trained models can possibly be depolyed. To achieve this goal, we introduced a large-sacle (>2M images) traffic sign recognition dataset (CURE-TSR) which is among the most comprehensive datasets with controlled synthetic challenging conditions. Traffic sign images in the CURE-TSR dataset were cropped from the CURE-TSD dataset, which includes around 1.7 million real-world and simulator images with more than 2 million traffic sign instances. Real-world images were obtained from the BelgiumTS video sequences and simulated images were generated with the Unreal Engine 4 game development tool. Sign types include speed limit, goods vehicles, no overtaking, no stopping, no parking, stop, bicycle, hump, no left, no right, priority to, no entry, yield, and parking. Unreal and real sequences were processed with state-of-the-art visual effect software Adobe(c) After Effects to simulate challenging conditions, which include rain, snow, haze, shadow, darkness, brightness, blurriness, dirtiness, colorlessness, sensor and codec errors. Please refer to our GitHub page for code, papers, and more information.

Instructions:

The name format of the provided images are as follows: "sequenceType_signType_challengeType_challengeLevel_Index.bmp"

sequenceType: 01 - Real data 02 - Unreal data

signType: 01 - speed_limit 02 - goods_vehicles 03 - no_overtaking 04 - no_stopping 05 - no_parking 06 - stop 07 - bicycle 08 - hump 09 - no_left 10 - no_right 11 - priority_to 12 - no_entry 13 - yield 14 - parking

challengeType: 00 - No challenge 01 - Decolorization 02 - Lens blur 03 - Codec error 04 - Darkening 05 - Dirty lens 06 - Exposure 07 - Gaussian blur 08 - Noise 09 - Rain 10 - Shadow 11 - Snow 12 - Haze

challengeLevel: A number in between [01-05] where 01 is the least severe and 05 is the most severe challenge.

Index: A number shows different instances of traffic signs in the same conditions.

Facebook

Twitter

Click to copy link

Link copied

Cite

Synthetic IDD (2025). Traffic Flow and Incident Data [Dataset]. https://www.opendatabay.com/data/synthetic/edbd578a-d940-4c06-8ce6-4a11d7bba766

Traffic Flow and Incident Data

Explore at:

30 scholarly articles cite this dataset (View in Google Scholar)

.undefinedAvailable download formats

Dataset updated

May 30, 2025

Dataset authored and provided by

Synthetic IDD

License

CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically

Area covered

Transportation

Description

Synthetic Traffic Flow and Incident Data dataset by Synthetic IDD offers a comprehensive collection of synthetic traffic metrics and incident reports from around the globe. With 100 million lines of data, this dataset provides an extensive resource for researchers, urban planners, and developers interested in understanding traffic patterns, congestion points, and incident occurrences.

Usage:

This dataset can be used for a variety of purposes, including but not limited to: - Analyzing traffic patterns and congestion hotspots globally - Building predictive models for traffic management and incident prediction - Researching the impact of road conditions and incidents on traffic flow - Developing applications for real-time traffic monitoring and navigation

License:

The dataset is provided under the CC0 (Public Domain) license, allowing users to freely use, modify, and distribute the data without any restrictions.

Clear search

Close search

Google apps

Main menu

Traffic Flow and Incident Data

Usage:

License:

Synthetic Traffic Generator Report

Synthetic Traffic Generator Market Report

Data from: HIKARI-2021: Generating Network Intrusion Detection Dataset Based...

Synset Signset UK: Synthetic image data set for traffic sign recognition

Network Digital Twin-Generated Dataset for Machine Learning-based Detection...

Overview

Feature Set:

Dataset Variations:

Traffic Detection Datasets

Synset Signset UK: Synthetic image data set for traffic sign recognition

Synthetic Aircraft Trajectory Dataset

Traffic Congestion Prediction

Grand Traffic Auto (GTA) Dataset: a Collection of Synthetic Images for...

DBTR - Road (Synthetic route) - (STR_PS_GLI)

syntra-experiment-dataset

DBTR2013 - Road (Synthetic course) - (STR_PS_GLI)

Open synthetic data on travel and charging demand of battery electric cars:...

Traffic weaver: generator półsyntetycznego ruchu sieciowego na podstawie...

Zeus-GameOver-synthetic-dataset

Synset Boulevard: Synthetic image dataset for Vehicle Make and Model...

Analysis of zero-day attacks and ransomware

CURE-TSR: Challenging Unreal and Real Environments for Traffic Sign...

Traffic Flow and Incident Data

Usage:

License: