96 datasets found

Real Time Anomaly Detection in CCTV Surveillance
kaggle.com
Updated Apr 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
webadvisor (2023). Real Time Anomaly Detection in CCTV Surveillance [Dataset]. https://www.kaggle.com/datasets/webadvisor/real-time-anomaly-detection-in-cctv-surveillance
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 28, 2023
Dataset provided by
Kaggle
Authors
webadvisor
Description
UCF Crime Dataset in the most suitable structure. Contains 1900 videos from 13 different categories. To ensure the quality of this dataset, it is trained ten annotators (having different levels of computer vision expertise) to collect the dataset. Using videos search on YouTube and LiveLeak using text search queries (with slight variations e.g. “car crash”, “road accident”) of each anomaly.
Anomaly detection from sound data- Fan
kaggle.com
Updated Sep 22, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vuppala Adithya Sairam (2023). Anomaly detection from sound data- Fan [Dataset]. https://www.kaggle.com/datasets/vuppalaadithyasairam/anomaly-detection-from-sound-data-fan
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 22, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Vuppala Adithya Sairam
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
The dataset is a subset of the Task-2 of DCASE 2020 Challenge. The Challenge is to identify anomaly of a machine using the audio data. There are three different parts of the dataset, namely, training, validation and testing which have been combined into a single dataset.

Training- https://zenodo.org/record/3678171

Validation- https://zenodo.org/record/3727685

Testing- https://zenodo.org/record/3841772
Anomaly-Detection-Dataset-UCF
kaggle.com
Updated May 20, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Minhaj Uddin Meraj (2022). Anomaly-Detection-Dataset-UCF [Dataset]. https://www.kaggle.com/minhajuddinmeraj/anomalydetectiondatasetucf/tasks
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 20, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Minhaj Uddin Meraj
Description
The UCF-Crime dataset is a large-scale dataset of 128 hours of videos. It consists of 1900 long and untrimmed real-world surveillance videos, with 13 realistic anomalies including Abuse, Arrest, Arson, Assault, Road Accident, Burglary, Explosion, Fighting, Robbery, Shooting, Stealing, Shoplifting, and Vandalism. These anomalies are selected because they have a significant impact on public safety.

This dataset can be used for two tasks. First, general anomaly detection considering all anomalies in one group and all normal activities in another group. Second, for recognizing each of 13 anomalous activities.
network-anomaly-dataset
kaggle.com
Updated Sep 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alberto del Rio (2024). network-anomaly-dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/9325531
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/dsv/9325531
Dataset updated
Sep 5, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Alberto del Rio
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
This dataset, titled "Network Anomaly Dataset," is designed for the development and evaluation of machine learning models focused on network anomaly detection. The dataset is available in two versions: a labeled version where each instance is marked as "Anomaly" or "Normal," and an unlabeled version that can be used for unsupervised learning techniques.

Dataset Features: - Throughput: The amount of data successfully transmitted over a network in a given period. - Congestion: The degree of network traffic load, potentially leading to delays or packet loss. - Packet Loss: The percentage of packets that fail to reach their destination, indicative of network issues. - Latency: The time taken for data to travel from the source to the destination, crucial for time-sensitive applications. - Jitter: The variation in packet arrival times, affecting the quality of real-time communications.

Applications: - Supervised Learning: Use the labeled dataset to train and evaluate models such as Random Forest, SVM, and Logistic Regression for anomaly detection. - Unsupervised Learning: Apply techniques like clustering and change point detection on the unlabeled dataset to discover hidden patterns and anomalies.

This dataset is ideal for practitioners and researchers aiming to explore network security, develop robust anomaly detection models, or conduct comparative analysis between supervised and unsupervised learning methods.
Financial Transactions Dataset for Fraud Detection
kaggle.com
Updated May 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aryan Kumar (2025). Financial Transactions Dataset for Fraud Detection [Dataset]. https://www.kaggle.com/datasets/aryan208/financial-transactions-dataset-for-fraud-detection
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 2, 2025
Dataset provided by
Kaggle
Authors
Aryan Kumar
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset contains 5 million synthetically generated financial transactions designed to simulate real-world behavior for fraud detection research and machine learning applications. Each transaction record includes fields such as:

Transaction Details: ID, timestamp, sender/receiver accounts, amount, type (deposit, transfer, etc.)

Behavioral Features: time since last transaction, spending deviation score, velocity score, geo-anomaly score

Metadata: location, device used, payment channel, IP address, device hash

Fraud Indicators: binary fraud label (is_fraud) and type of fraud (e.g., money laundering, account takeover)

The dataset follows realistic fraud patterns and behavioral anomalies, making it suitable for:

Binary and multiclass classification models

Fraud detection systems

Time-series anomaly detection

Feature engineering and model explainability
Anomaly-Detection-Dataset
kaggle.com
Updated Oct 3, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Muhammad Ahmad (2022). Anomaly-Detection-Dataset [Dataset]. https://www.kaggle.com/datasets/muhammadahmad6710/anomalydetectiondataset/data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 3, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Muhammad Ahmad
Description
Dataset

This dataset was created by Muhammad Ahmad

Contents
Synthetic Cybersecurity Logs for Anomaly Detection
kaggle.com
Updated Dec 16, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
fcWebDev (2024). Synthetic Cybersecurity Logs for Anomaly Detection [Dataset]. http://doi.org/10.34740/kaggle/dsv/10211131
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/dsv/10211131
Dataset updated
Dec 16, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
fcWebDev
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
This dataset contains synthetic HTTP log data designed for cybersecurity analysis, particularly for anomaly detection tasks.

Dataset Features Timestamp: Simulated time for each log entry. IP_Address: Randomized IP addresses to simulate network traffic. Request_Type: Common HTTP methods (GET, POST, PUT, DELETE). Status_Code: HTTP response status codes (e.g., 200, 404, 403, 500). Anomaly_Flag: Binary flag indicating anomalies (1 = anomaly, 0 = normal). User_Agent: Simulated user agents for device and browser identification. Session_ID: Random session IDs to simulate user activity. Location: Geographic locations of requests. Applications This dataset can be used for:

Anomaly Detection: Identify suspicious network activity or attacks. Machine Learning: Train models for classification tasks (e.g., detect anomalies). Cybersecurity Analysis: Analyze HTTP traffic patterns and identify threats. Example Challenge Build a machine learning model to predict the Anomaly_Flag based on the features provided.

Packaging Industry Anomaly DEtection Dataset

kaggle.com

Updated Apr 19, 2025

+ more versions

Facebook

Twitter

Click to copy link

Link copied

Cite

Orvile (2025). Packaging Industry Anomaly DEtection Dataset [Dataset]. https://www.kaggle.com/datasets/orvile/packaging-industry-anomaly-detection-dataset

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Apr 19, 2025

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Orvile

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

PIADE dataset contains data from five industrial packaging machines:

Machine s_1: from 2020-01-01 14:00:00 to 2021-12-31 13:00:00
Machine s_2: from 2020-06-17 08:00:00 to 2021-12-31 07:00:00
Machine s_3: from 2020-10-07 12:00:00 to 2022-01-01 23:00:00
Machine s_4: from 2020-01-01 01:00:00 to 2022-01-01 23:00:00
Machine s_5: from 2020-01-20 08:00:00 to 2022-01-01 12:00:00

Raw Data

Each row represents a production interval, with the following schema:

interval_start: start of the production interval  
equipment_ID: equipment identifier  
alarm: alarm code of the active stop reason, if it occurred   
type: idle, production, downtime, performance_loss or scheduled_downtime  
start: start of the production interval  
end: end of the production interval  
elapsed: duration of the production interval  
pi: input packages  
po: output packages  
speed: speed (packages per hour)

There are 133 different types of alerts, and 429394 rows.

Sequences (1h) data

For each piece of equipment, we define sequences of length = 1 hour and we aggregate raw interval data as follows:

'equipment_ID': machine identifier
'#changes': changes in machine state
'%downtime': time spent in 'downtime' state
'%idle': time spent in 'idle' state
'%performance_loss': time spent in 'performance loss' state
'%production': time spent in production
'%scheduled_downtime': time spent in scheduled downtime
'count_sum': sum of all alarm occurrences
'A_

Z
ESA Anomaly Dataset
data.niaid.nih.gov
zenodo.org
+1more
Updated Jun 28, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Haskamp, Christoph (2024). ESA Anomaly Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_12528695
Explore at:
Dataset updated
Jun 28, 2024
Dataset provided by
Kotowski, Krzysztof
De Canio, Gabriele
Haskamp, Christoph
License
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
Description
ESA Anomaly Dataset is the first large-scale, real-life satellite telemetry dataset with curated anomaly annotations originated from three ESA missions. We hope that this unique dataset will allow researchers and scientists from academia, research institutes, national and international space agencies, and industry to benchmark models and approaches on a common baseline as well as research and develop novel, computational-efficient approaches for anomaly detection in satellite telemetry data.

The dataset results from the work of an 18-month project carried by an industry Consortium composed of Airbus Defence and Space, KP Labs and the European Space Agency’s European Space Operations Centre. The project, funded by the European Space Agency (ESA), is part of the Artificial Intelligence for Automation (A²I) Roadmap (De Canio et al., 2023), a large endeavour started in 2021 to automate space operations by leveraging artificial intelligence.

Further details can be found on the arXiv and Github.

ReferencesDe Canio, G. et al. (2023) Development of an actionable AI roadmap for automating mission operations. In, 2023 SpaceOps Conference. American Institute of Aeronautics and Astronautics, Dubai, United Arab Emirates.
UGRansome dataset
kaggle.com
Updated Dec 11, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dr. Mike Wa Nkongolo (2023). UGRansome dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/7172543
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/dsv/7172543
Dataset updated
Dec 11, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Dr. Mike Wa Nkongolo
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
Acknowledgment to supporters: "Thank you to everyone who supported the UGRansome dataset; it has received a Bronze medal on Kaggle!"

The UGRansome dataset is a versatile cybersecurity resource designed for the analysis of ransomware and zero-day cyber-attacks, particularly those exhibiting cyclostationary behavior. This dataset features various essential components, including timestamps for attack time tracking, flags for categorizing attack types, protocol data for understanding attack vectors, network flow details to observe data transfer patterns, and ransomware family classifications.

It also provides insight into the associated malware, numeric clustering for pattern recognition, and quantifies financial damage in both USD and bitcoins (BTC). The dataset employs machine learning to generate attack signatures and offers synthetic signatures for testing and simulating cybersecurity defenses.

Additionally, it can be used to identify and document anomalies, contributing to anomaly detection research and enhancing cybersecurity understanding and preparedness. This dataset offers valuable information for researchers and practitioners interested in leveraging it for various analytical and investigatory purposes such as ransomware and zero-day threats detection and classification. The dataset required deduplication and transformation.

The UGRansome dataset has been previously utilized in studies by Tokmak (2022); Alhashmi et al. (2024); Chaudhary & Adhikari (2024); Sokhonn, Park, & Lee (2024); P. Yan et al. (2024), Sharath Kumar et al. (2024), and Mohamed, A.A., Al-Saleh, A., Sharma, S.K. et al. (2025).

It has been utilized and cited in several master's dissertations and reports, demonstrating its relevance in the field of anomaly intrusion detection. Notable examples include:

S. R. Zahra, 2022. "UGRansome: Optimal Approach for Anomaly Intrusion Detection and Zero-day Threats using Cloud Environment." Master's Research in Cloud Computing, School of Computing, National College of Ireland. https://www.researchgate.net/publication/365172610_UGRansome_Optimal_Approach_for_Anomaly_Intrusion_Detection_and_Zero-day_Threats_using_Cloud_Environment_MSc_Research_Project_Cloud_Computing/citations

B. Torky, 2023. "Ensemble Methods for Anomaly Detection in Enterprise Systems." Thesis, Rochester Institute of Technology, Dubai. Advisor: Sanjay Modak.https://repository.rit.edu/theses/11497/

A. Igugu, 2024. "Evaluating the Effectiveness of AI and Machine Learning Techniques for Zero-Day Attacks Detection in Cloud Environments" Master of Science in Information Security, Luleå University of Technology, Sweden. Department of Computer Science, Electrical and Space Engineering. Supervisor: Dr. Saguna. Examiner: Prof. Christer Ahlund. https://www.diva-portal.org/smash/get/diva2:1890285/FULLTEXT02

Duran, M., duSoft Yazılım, A.Ş. and Kilinc, H., 2024. D2. 1–Academic and Technology SoTA Report. Sierra (Panel), 1, pp.26-11. Edited by: Hakan Kilinc (Orion, Türkiye), Eva Catarina Gomes Maia (ISEP, Portugal), Orhan Yildirim (Beam Teknoloji, Türkiye), Gabriela Sousa (VisionWare, Portugal), Özgü Özkan, Melike Çolak, Nesil Bor (Bites, Türkiye), Daniel Esteban Villamil Sierra (Panel, Spain). https://itea4.org/project/vesta.html

Kaliberda A. A. Development of an anti-virus solution based on neural networks: master's thesis; Ural Federal University, Institute of Radio Electronics and Information Technologies-RTF, Department of Information Technologies and Control Systems. Russia — Yekaterinburg, 2024. — 52 p. http://elar.urfu.ru/handle/10995/140331

These citations underline the impact of the UGRansome in advancing research on intrusion detection and cybersecurity:

• Mohamed, A.A., Al-Saleh, A., Sharma, S.K. et al. Zero-day exploits detection with adaptive WavePCA-Autoencoder (AWPA) adaptive hybrid exploit detection network (AHEDNet). Sci Rep 15, 4036 (2025). https://doi.org/10.1038/s41598-025-87615-2

• P. Yan, T. T. Khoei, R. S. Hyder and R. S. Hyder, "A Dual-Stage Ensemble Approach to Detect and Classify Ransomware Attacks," 2024 IEEE 15th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON), Yorktown Heights, NY, USA, 2024, pp. 781-786, doi: 10.1109/UEMCON62879.2024.10754695.

• Por, L.Y., Dai, Z., Leem, S.J., Chen, Y., Yang, J., Binbeshr, F., Phan, K.Y. and Ku, C.S., 2024. A Systematic Literature Review on the Methods and Challenges in Detecting Zero-Day Attacks: Insights from the Recent CrowdStrike Incident. IEEE Access.

• Torky, B., Karamitsos, I., Najar, T. (2024). Anomaly Detection in Enterprise Payment Systems: An Ensemble Machine Learning Approach. In: Emrouznejad, A., Zervopoulos, P.D., Ozturk, I., Jamali, D., Rice, J. (eds) Business Analytics and Decision Making in Practice. ICBAP 2024. Lecture Notes in Operations Research. Springer, Cham. https://doi.org/10.1007/978-3-...
Anomaly-Detection-Dataset
kaggle.com
Updated Feb 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Le Hung (2025). Anomaly-Detection-Dataset [Dataset]. https://www.kaggle.com/easterharry/anomaly-detection-dataset/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 6, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Le Hung
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Le Hung

Released under Apache 2.0

Contents
Anomaly Detection
kaggle.com
Updated Dec 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
k20 1702 Bilal Mamji (2023). Anomaly Detection [Dataset]. https://www.kaggle.com/datasets/k201702bilalmamji/anomaly-detection
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 8, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
k20 1702 Bilal Mamji
Description
Dataset

This dataset was created by k20 1702 Bilal Mamji

Contents
Anomaly detection dataset for beginners
kaggle.com
zip
Updated Aug 2, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Arun (2021). Anomaly detection dataset for beginners [Dataset]. https://www.kaggle.com/arunkumar1809/anomaly-detection-dataset-for-beginners
Explore at:
zip(258 bytes)Available download formats
Dataset updated
Aug 2, 2021
Authors
Arun
Description
Dataset

This dataset was created by Arun

Contents

It contains the following files:
anomaly detection
kaggle.com
Updated Jul 1, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shweta Dalal (2021). anomaly detection [Dataset]. https://www.kaggle.com/shwetadalal/anomaly-detection/activity
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 1, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Shweta Dalal
Description
Dataset

This dataset was created by Shweta Dalal

Contents
Screw-anomalies detection
kaggle.com
Updated Nov 8, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Thomas (2021). Screw-anomalies detection [Dataset]. https://www.kaggle.com/thomasdubail/screwanomalies-detection/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 8, 2021
Dataset provided by
Kaggle
Authors
Thomas
Description
This is a dataset of picture that were found on MVTec Anomaly Detection : https://www.mvtec.com/

There are three files in this dataset: train, test, and gound_truth.

The training set has 320 (1024x1024) pictures of screws which have no anomalies and the test file have : good, manipulated_front, scratch_head, scratch_neck, thread_side, thread_top for a total of 160 pictures with the matching ground_truth.

Paul Bergmann, Michael Fauser, David Sattlegger, and Carsten Steger, "A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection", IEEE Conference on Computer Vision and Pattern Recognition, 2019
anomaly-detection-diffusion
kaggle.com
Updated Nov 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Đạt Savu (2024). anomaly-detection-diffusion [Dataset]. https://www.kaggle.com/datasets/tsavumoon/anomaly-detection-diffusion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 29, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Đạt Savu
Description
Dataset

This dataset was created by Đạt Savu

Contents
UCF-Crime-Anomaly-Videos-Part-1-no-abuse
kaggle.com
Updated Dec 1, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Roumaissaa (2024). UCF-Crime-Anomaly-Videos-Part-1-no-abuse [Dataset]. https://www.kaggle.com/datasets/roumaissaa/ucf-crime-anomaly-videos-part-1-no-abuse
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 1, 2024
Dataset provided by
Kaggle
Authors
Roumaissaa
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
The UCF-Crime dataset is a large-scale collection of real-world surveillance videos, featuring a diverse range of crime and normal activities. This dataset is ideal for training and evaluating advanced AI models for anomaly detection and video understanding tasks.

Key Features:

Real-world Data: The videos are sourced from real-world surveillance cameras, ensuring a realistic and challenging environment. Diverse Anomalies: The dataset covers a wide range of crime categories, including both common and rare events. Long, Untrimmed Videos: The videos are long and untrimmed, providing a more realistic and challenging scenario for anomaly detection. Detailed Annotations: The videos are meticulously annotated with bounding boxes, timestamps, and labels for each anomaly, enabling precise model training and evaluation.

Real-world Anomaly Detection in Surveillance Videos Link to download the data
anomaly-data
kaggle.com
Updated Dec 25, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
yuanheqiuye (2022). anomaly-data [Dataset]. https://www.kaggle.com/datasets/yuanheqiuye/anomaly-data/data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 25, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
yuanheqiuye
Description
Dataset

This dataset was created by yuanheqiuye

Contents
anomaly-detection-baseline-part2
kaggle.com
Updated Dec 10, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hiep Le (2021). anomaly-detection-baseline-part2 [Dataset]. https://www.kaggle.com/baohiep/anomaly-detection-baseline-part2/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 10, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Hiep Le
Description
Dataset

This dataset was created by Hiep Le

Contents
Satellite telemetry data anomaly prediction
kaggle.com
Updated Apr 17, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Orvile (2025). Satellite telemetry data anomaly prediction [Dataset]. https://www.kaggle.com/datasets/orvile/satellite-telemetry-data-anomaly-prediction
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 17, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Orvile
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
OPSSAT-AD - anomaly detection dataset for satellite telemetry

This is the AI-ready benchmark dataset (OPSSAT-AD) containing the telemetry data acquired on board OPS-SAT---a CubeSat mission that has been operated by the European Space Agency.

It is accompanied by the paper with baseline results obtained using 30 supervised and unsupervised classic and deep machine learning algorithms for anomaly detection. They were trained and validated using the training-test dataset split introduced in this work, and we present a suggested set of quality metrics that should always be calculated to confront the new algorithms for anomaly detection while exploiting OPSSAT-AD. We believe that this work may become an important step toward building a fair, reproducible, and objective validation procedure that can be used to quantify the capabilities of the emerging anomaly detection techniques in an unbiased and fully transparent way.

The included files are:

segments.csv with the acquired telemetry signals from ESA OPS-SAT aircraft, dataset.csv with the extracted, synthetic features are computed for each manually split and labeled telemetry segment. code files for data processing and example modeliing (dataset_generator.ipynb for data processing, modeling_examples.ipynb with simple examples, requirements.txt- with details on Python configuration, and the LICENSE file)

Citation Bogdan, R. (2024). OPSSAT-AD - anomaly detection dataset for satellite telemetry [Data set]. Ruszczak. https://doi.org/10.5281/zenodo.15108715

Facebook

Twitter

Click to copy link

Link copied

Cite

webadvisor (2023). Real Time Anomaly Detection in CCTV Surveillance [Dataset]. https://www.kaggle.com/datasets/webadvisor/real-time-anomaly-detection-in-cctv-surveillance

Real Time Anomaly Detection in CCTV Surveillance

Contains Videos for 13 different Class of Anomalies and Normal Events.

Explore at:

2 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Apr 28, 2023

Dataset provided by

Kaggle

Authors

webadvisor

Description

UCF Crime Dataset in the most suitable structure. Contains 1900 videos from 13 different categories. To ensure the quality of this dataset, it is trained ten annotators (having different levels of computer vision expertise) to collect the dataset. Using videos search on YouTube and LiveLeak using text search queries (with slight variations e.g. “car crash”, “road accident”) of each anomaly.

Clear search

Close search

Google apps

Main menu

Real Time Anomaly Detection in CCTV Surveillance

Anomaly detection from sound data- Fan

Anomaly-Detection-Dataset-UCF

network-anomaly-dataset

Financial Transactions Dataset for Fraud Detection

Anomaly-Detection-Dataset

Dataset

Contents

Synthetic Cybersecurity Logs for Anomaly Detection

Packaging Industry Anomaly DEtection Dataset

Raw Data

Sequences (1h) data

ESA Anomaly Dataset

UGRansome dataset

Anomaly-Detection-Dataset

Dataset

Contents

Anomaly Detection

Dataset

Contents

Anomaly detection dataset for beginners

Dataset

Contents

anomaly detection

Dataset

Contents

Screw-anomalies detection

anomaly-detection-diffusion

Dataset

Contents

UCF-Crime-Anomaly-Videos-Part-1-no-abuse

Key Features:

anomaly-data

Dataset

Contents

anomaly-detection-baseline-part2

Dataset

Contents

Satellite telemetry data anomaly prediction

The included files are:

Real Time Anomaly Detection in CCTV Surveillance

Contains Videos for 13 different Class of Anomalies and Normal Events.