MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains images of various radio frequency (RF) signals captured on waterfall plots using a spectrum analyzer. It is designed to aid in the classification and identification of different types of RF signals commonly encountered in wireless communications and radio technologies.
The dataset comprises waterfall plot images of RF signals across 21 different classes, representing a wide range of communication protocols, technologies, and signal types. Each image in the dataset is a visual representation of a specific RF signal's frequency and time characteristics.
This dataset is primarily suited for the classification and identification of RF signal types. Each example has two fields:
image: A waterfall plot image of the RF signal
class: The label identifying the type of RF signal
The dataset includes the following 21 classes of RF signals:
This dataset contains images of radio frequency (RF) signals captured as waterfall plots using a spectrum analyzer. The dataset is organized as follows:
- datasets
  - signal class
    - image
For example:
- datasets
  - bluetooth
    - c17afe0fe5cc3cc1308605cf390ecbb5.png
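For illustration, indexing such a class-per-folder layout takes only a few lines of Python (a minimal sketch; the `wifi` folder below is a hypothetical second class, since the 21 class names are not listed here):

```python
import tempfile
from pathlib import Path

def index_dataset(root):
    """Map each image file to its class label, inferred from its parent folder name."""
    samples = []
    for class_dir in sorted(p for p in Path(root).iterdir() if p.is_dir()):
        for image_path in sorted(class_dir.glob("*.png")):
            samples.append((image_path, class_dir.name))
    return samples

# Recreate a tiny synthetic copy of the layout described above.
root = Path(tempfile.mkdtemp()) / "datasets"
(root / "bluetooth").mkdir(parents=True)
(root / "bluetooth" / "c17afe0fe5cc3cc1308605cf390ecbb5.png").touch()
(root / "wifi").mkdir()
(root / "wifi" / "0000.png").touch()

labels = [label for _, label in index_dataset(root)]
```

Frameworks such as torchvision's ImageFolder consume exactly this folder-per-class convention, so the dataset can also be loaded directly with such tools.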
The images in this dataset were captured using a spectrum analyzer, which visualizes RF signals as waterfall plots. These plots show the frequency content of a signal over time, with color representing signal strength.
RF signals were collected across various frequency bands using appropriate antennas and receivers. The spectrum analyzer was used to generate waterfall plots for each captured signal. Care was taken to ensure a diverse representation of signal types and conditions.
[MIT]
We welcome contributions to improve this dataset.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
We provide a preprocessed dataset to enable model development for the detection and classification of drone RF signals. It consists of non-overlapping signal vectors of length 1,048,576 samples, which corresponds to approx. 74.9 ms at 14 MHz. We have also added Labnoise (Bluetooth, Wi-Fi, Amplifier) and Gaussian noise to the dataset.
After normalization, the drone signals were mixed with either Labnoise (50%) or Gaussian noise (50%). The noise class was created by mixing Labnoise and Gaussian noise in all possible combinations (i.e., Labnoise + Labnoise, Labnoise + Gaussian noise, Gaussian noise + Labnoise, and Gaussian noise + Gaussian noise). For the drone signal classes, as for the noise class, the samples are distributed evenly over the SNR interval [-20, 30] dB in steps of 2 dB, i.e., 679 to 685 samples per SNR level. The resulting number of samples per class is shown in the table below.
| DJI | FutabaT14 | FutabaT7 | Graupner | Taranis | Turnigy | Noise |
|---|---|---|---|---|---|---|
| 1280 | 3472 | 801 | 801 | 1663 | 855 | 8872 |
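The mixing step described above can be sketched as follows (an illustrative simplification, not the authors' exact preprocessing code: the noise is scaled so that the mix hits a target SNR):

```python
import math
import random

def mix_at_snr(signal, noise, snr_db):
    """Scale `noise` so the mix signal/(scaled noise) power ratio equals `snr_db`."""
    p_signal = sum(x * x for x in signal) / len(signal)
    p_noise = sum(x * x for x in noise) / len(noise)
    target_p_noise = p_signal / (10 ** (snr_db / 10))
    scale = math.sqrt(target_p_noise / p_noise)
    return [s + scale * n for s, n in zip(signal, noise)]

# Toy example: a sinusoid mixed with Gaussian noise at 0 dB SNR.
random.seed(0)
signal = [math.sin(0.1 * i) for i in range(1000)]
noise = [random.gauss(0.0, 1.0) for _ in range(1000)]
mixed = mix_at_snr(signal, noise, snr_db=0.0)
```

At 0 dB the added noise power equals the signal power by construction; changing `snr_db` in 2 dB steps over [-20, 30] reproduces the SNR grid described above.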
See https://github.com/sgluege/Robust-Drone-Detection-and-Classification for a script to load and inspect the dataset. There you will also find code to train and evaluate a model.
Further information about the data, and how to build a classifier, can be found in our related manuscript. Please cite it if you find it useful.
S. Glüge, M. Nyfeler, A. Aghaebrahimian, N. Ramagnano and C. Schüpbach, "Robust Low-Cost Drone Detection and Classification Using Convolutional Neural Networks in Low SNR Environments," in IEEE Journal of Radio Frequency Identification, vol. 8, pp. 821-830, 2024, doi: 10.1109/JRFID.2024.3487303
Bibtex:
@ARTICLE{10737118,
author={Glüge, Stefan and Nyfeler, Matthias and Aghaebrahimian, Ahmad and Ramagnano, Nicola and Schüpbach, Christof},
journal={IEEE Journal of Radio Frequency Identification},
title={Robust Low-Cost Drone Detection and Classification Using Convolutional Neural Networks in Low SNR Environments},
year={2024},
volume={8},
number={},
pages={821-830},
doi={10.1109/JRFID.2024.3487303}
}
This software tool generates simulated radar signals and creates RF datasets. The datasets can be used to develop and test detection algorithms by utilizing machine learning/deep learning techniques for the 3.5 GHz Citizens Broadband Radio Service (CBRS) or similar bands. In these bands, the primary users of the band are federal incumbent radar systems. The software tool generates radar waveforms and randomizes the radar waveform parameters. The pulse modulation types for the radar signals and their parameters are selected based on NTIA testing procedures for ESC certification, available at http://www.its.bldrdoc.gov/publications/3184.aspx. Furthermore, the tool mixes the waveforms with interference and packages them into one RF dataset file. The tool utilizes a graphical user interface (GUI) to simplify the selection of parameters and the mixing process. A reference RF dataset was generated using this software. The RF dataset is published at https://doi.org/10.18434/M32116.
License: https://www.nist.gov/director/licensing
The RF dataset can be used to develop and test detection algorithms for the 3.5 GHz CBRS or similar bands where the primary users of the band are federal incumbent radar systems. The dataset consists of synthetically generated radar waveforms with added white Gaussian noise. The RF dataset is suitable for development and testing of machine/deep learning detection algorithms. A large number of parameters of the waveforms are randomized across the dataset. Due to its large size, the dataset is divided into groups, and each group consists of multiple files. For more information about the dataset, refer to: R. Caromi, M. Souryal, and T. Hall, "RF Dataset of Incumbent Radar Systems in the 3.5 GHz CBRS Band," Journal of Research of the National Institute of Standards and Technology. (in press). In addition, the metadata of the dataset is summarized in "Data Dictionary of 3.5 GHz Radar Waveforms" [pdf] accompanying the data. For more information about the motivation behind this RF dataset, refer to: T. Hall, R. Caromi, M. Souryal, and A. Wunderlich, "Reference Datasets for Training and Evaluating RF Signal Detection and Classification Models," to appear in Proc. IEEE GLOBECOM Workshop on Advancements in Spectrum Sharing, Dec. 2019.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
Deepsig Inc. has created a small corpus of standard datasets which can be used for original and reproducible research, experimentation, measurement and comparison by fellow scientists and engineers. These datasets allow machine learning researchers with new ideas to dive directly into an important technical area without the need to collect or generate new datasets, and allow for direct comparison with the efficacy of prior work.
This dataset includes both synthetic simulated channel effects and over-the-air recordings of 24 digital and analog modulation types, and has been heavily validated.
This dataset was used for "Over-the-Air Deep Learning Based Radio Signal Classification", published in 2018 in the IEEE Journal of Selected Topics in Signal Processing, which provides additional details and a description of the dataset. Data are stored in HDF5 format as complex floating-point values, with roughly 2.5 million examples, each 1024 samples long.
The original content is that provided in the compressed archive accessible on the Deepsig Inc. website.
The dataset is provided in the "GOLD_XYZ_OSC.0001_1024.hdf5" file. The HDF5 format is designed to store and organize large amounts of data. See this list of libraries and interfaces for HDF5 manipulation.
The dataset exhibits the following structure:
- 24 modulations: OOK, ASK4, ASK8, BPSK, QPSK, PSK8, PSK16, PSK32, APSK16, APSK32, APSK64, APSK128, QAM16, QAM32, QAM64, QAM128, QAM256, AM_SSB_WC, AM_SSB_SC, AM_DSB_WC, AM_DSB_SC, FM, GMSK and OQPSK.
- 26 SNRs per modulation (-20 dB to +30 dB in steps of 2 dB).
- 4096 frames per modulation-SNR combination.
- 1024 complex time-series samples per frame.
- Samples as floating point in-phase and quadrature (I/Q) components, resulting in a (1024, 2) frame shape.
- 2,555,904 frames in total.
Each frame can be retrieved by accessing the HDF5 groups:
- X: I/Q components of the frame;
- Y: modulation of the frame (one-hot encoded);
- Z: SNR of the frame.
Data consist of 24 modulations --> 26 SNRs --> 4096 frames --> (1024, 2) I/Q samples. Below is a structural example of the dataset:
```python
Modulation 0: {
    SNR -20: [
        Frame 0: [            # sample 0 (e.g. hdf5_file['X'][0])
            [I0, Q0],         # sample 0.0 (e.g. hdf5_file['X'][0][0])
            [I1, Q1],         # sample 0.1 (e.g. hdf5_file['X'][0][1])
            ...,
            [I1023, Q1023]    # sample 0.1023 (e.g. hdf5_file['X'][0][1023])
        ],
        Frame 1: [ ... ],     # sample 1 (e.g. hdf5_file['X'][1])
        ...,
        Frame 4094: [ ... ],  # sample 4094 (e.g. hdf5_file['X'][4094])
        Frame 4095: [ ... ]   # sample 4095 (e.g. hdf5_file['X'][4095])
    ],
    SNR -18: [
        Frame 0: [ ... ],     # sample 4096 (e.g. hdf5_file['X'][4096])
        ...,
        Frame 4095: [ ... ]   # sample 8191 (e.g. hdf5_file['X'][8191])
    ],
    ...
    SNR 30: [
        Frame 0: [ ... ],     # sample 102400 (e.g. hdf5_file['X'][102400])
        ...,
        Frame 4095: [ ... ]   # sample 106495 (e.g. hdf5_file['X'][106495])
    ]
}
...
Modulation 23: { ... }
```
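Because the ordering is fixed (modulation-major, then SNR, then frame), the flat index of any frame in hdf5_file['X'] can be computed directly. A small sketch (`frame_index` is a hypothetical helper; it assumes the 26 SNR levels run from -20 to 30 dB in steps of 2 dB, as described above):

```python
FRAMES_PER_SNR = 4096
SNRS = list(range(-20, 32, 2))   # the 26 SNR levels: -20, -18, ..., 30 dB
NUM_SNRS = len(SNRS)

def frame_index(modulation, snr_db, frame):
    """Flat index into hdf5_file['X'] for a modulation index, SNR level and frame."""
    snr_idx = SNRS.index(snr_db)
    return (modulation * NUM_SNRS + snr_idx) * FRAMES_PER_SNR + frame

# Examples matching the annotated structure above:
i0 = frame_index(0, -20, 0)      # first frame of the dataset
i1 = frame_index(0, -18, 0)      # first frame at the next SNR level
i2 = frame_index(0, 30, 4095)    # last frame of modulation 0
```

With h5py, `hdf5_file['X'][frame_index(m, snr, f)]` then returns the corresponding (1024, 2) I/Q frame.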
The original file provided with the dataset. The order of the modulation classes in this file is incorrect. This is a known issue; please use the order of the classes provided in the paper "Over-the-Air Deep Learning Based Radio Signal Classification". A text file and a JSON file with the corrected classes are provided alongside this dataset.
The original license provided with the dataset. Dataset provided by Deepsig Inc. and licensed under Creative Commons Attribution - NonCommercial - ShareAlike 4.0 License (CC BY-NC-SA 4.0)
The additional content is that which is not provided in the compressed archive accessible on the Deepsig Inc. website.
A text file with the classes in the correct order.
A JSON file with the classes in the correct order.
A shortcut to Deepsig Inc. datasets.
This dataset is provided by Deepsig Inc.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
The dataset contains measurements of radio-frequency electromagnetic emissions from a home-built sender module for BB84 quantum key distribution. The goal of these measurements was to evaluate information leakage through this side channel. This dataset supplements our publication and allows our results to be reproduced together with the source code hosted on GitHub (and also on Zenodo via integration with GitHub). The measurements were performed using a magnetic near-field probe, an amplifier and an oscilloscope. The dataset contains raw measured data in the file format output by the oscilloscope; use our source code to make use of it. Detailed descriptions of the measurement procedure can be found in our paper and in the metadata JSON files found within the dataset.
Commented list of datasets
This file lists the datasets that were analyzed and reported on in the paper. The datasets in the list refer to directories here. Note that most of the datasets contain additional files with metadata, which detail where and how the measurements were performed. The mentioned Jupyter notebooks refer to the source code repository https://github.com/XQP-Munich/EmissionSecurityQKD (not included in this dataset). Most of those notebooks output JSON files storing results. The processed JSON files are also included in the source code repository.
In the naming of datasets:
Antenna refers to the log-periodic dipole antenna. All datasets that do not contain Antenna in their name are recorded with the magnetic near-field probe.
Rev1 refers to the initial electronics design, while Rev2 refers to the revised electronics design, which contains countermeasures aiming to reduce emissions.
Shielding refers to measurements where the device is enclosed in a metallic shielding and the measurement takes place outside the shielding.
Rotation refers to the orientation of the magnetic near-field probe at the same spatial location.
Datasets collected with near-field probe for Rev1 electronics
Rev1Distance: contains measurements at different distances from the Rev1 electronics performed above the FPGA. The deep learning attack is analyzed in TEMPEST_ATTACK.ipynb. The amplitude is analyzed in get_raw_data_RMS_amplitude.ipynb.
Rev12D: different locations on a 2d grid at a constant distance from the electronics. The deep learning attack is analyzed in TEMPEST_ATTACK.ipynb.
Rev130meas2.5cm: 30 measurements above the FPGA at a height of 2.5 cm. Used to evaluate how the amount of training data affects neural network performance. The deep learning attack is analyzed in the notebooks TEMPEST_ATTACK*.ipynb; in particular, TEMPEST_ATTACK_VARY_TRAINING_DATA.ipynb is used on this dataset.
Rev1Rotation10deg contains a measurement for varying orientation of the probe at the same location. This is not mentioned in the paper and is only included for completeness. The deep learning attack is analyzed in notebooks TEMPEST_ATTACK*.ipynb.
Rev1TEMPESTShieldingFPGA: measurements with and without shielding at 4 cm above the FPGA. The deep learning attack is analyzed in the notebooks TEMPEST_ATTACK*.ipynb.
Datasets collected with near-field probe for Rev2 electronics
Rev2Distance contains measurements at different distances from the Rev2 electronics performed above the FPGA.
Rev22D and Rev22Dstart_7_0 contain measurements on a 2d grid performed on the revised electronics. The dataset is split in two directories because the measurement procedure crashed in the middle. This split structure was kept in order to maintain consistency with the automatic metadata.
Rev230meas2.5cm: 30 measurements above the FPGA at a height of 2.5 cm. Used to evaluate how the amount of training data affects neural network performance. The deep learning attack is analyzed in the notebooks TEMPEST_ATTACK*.ipynb; in particular, TEMPEST_ATTACK_VARY_TRAINING_DATA.ipynb is used on this dataset.
Other datasets
BackgroundTuesday: background measurement (QKD device not powered at all) performed with the near-field probe on June 21st, 2022.
BackgroundSaturday: background measurement (QKD device not powered at all) performed with the near-field probe on June 11th, 2022.
AntennaSpectra: dataset of spectra recorded directly by the oscilloscope. Used to demonstrate the ability to distinguish, at a distance, between the device sending a QKD key (standard operation) and the device being turned on but not sending any key. Analyzed in the notebook Comparing_KeyNokey_Measurements.ipynb.
Rev2ShieldingAntenna: raw amplitude measurements with the log-periodic dipole antenna on the Rev2 electronics including the shielding enclosure, collected at various distances. None of our attacks against this scenario were successful. The dataset represents a challenge for testing more advanced attacks using improved data processing.
License: https://www.nist.gov/open/license
This project aims to create a comprehensive framework for generating radio frequency (RF) datasets, designing deep learning (DL) detectors, and evaluating their detection performance using both simulated and experimental test data. The proposed tools and techniques are developed in the context of dynamic spectrum use for the 3.5 GHz Citizens Broadband Radio Service (CBRS), but they can be utilized and expanded for standardization of machine learned spectrum awareness technologies and methods. This dataset consists of pre-trained DL models for radar detection in the CBRS band using simulated waveforms. The code for creating and using these models is available at https://github.com/usnistgov/BaselineDeepLearningRadarDetectors.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
The data contains two datasets: one for pKb and the other for LogP machine learning prediction. The datasets contain several descriptors generated using RDKit and density functional theory (DFT).
MIT License: https://opensource.org/licenses/MIT
The dataset consists of labeled RF signal feature vectors, each row representing a single signal sample. Features may include signal strength, bandwidth, center frequency, modulation index, mean amplitude, FFT-based values, and other time/frequency domain attributes. Labels correspond to predefined signal types or modulation classes such as AM, FM, QPSK, etc.
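A minimal baseline for classifying such labeled feature vectors is a nearest-centroid model. The sketch below uses synthetic rows with hypothetical feature columns (signal strength, bandwidth, modulation index) and two of the label classes mentioned above; it is an illustration, not a model tied to this dataset:

```python
import math
import random

def nearest_centroid_fit(X, y):
    """Compute the per-class mean feature vector (a minimal baseline classifier)."""
    centroids = {}
    for label in set(y):
        rows = [x for x, lbl in zip(X, y) if lbl == label]
        centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
    return centroids

def nearest_centroid_predict(centroids, x):
    """Assign the class whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda lbl: math.dist(centroids[lbl], x))

# Synthetic rows: [signal_strength_dbm, bandwidth_khz, modulation_index]
random.seed(1)
X = [[-60 + random.gauss(0, 1), 10, 0.3] for _ in range(20)] \
  + [[-30 + random.gauss(0, 1), 200, 0.9] for _ in range(20)]
y = ["AM"] * 20 + ["QPSK"] * 20

model = nearest_centroid_fit(X, y)
pred = nearest_centroid_predict(model, [-31, 195, 0.85])
```

In practice one would reach for scikit-learn or a neural network, but this shows the row-per-sample, label-per-row shape the dataset is designed for.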
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
The labels 2, 3, 4, 5, 6 and 7 respectively represent the UAVs Parrot, Fimi, Phantom 20, Mavic Air, Mavic Mini and Phantom 10. Outdoors, the UAVs were piloted at 50 m from the USRP. Please also cite the following paper: P. Podder, M. Zawodniok and S. Madria, "Deep Learning for UAV Detection and Classification via Radio Frequency Signal Analysis," 2024 25th IEEE International Conference on Mobile Data Management (MDM), Brussels, Belgium, 2024, pp. 165-174, doi: 10.1109/MDM61037.2024.00040.
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
The generated dataset contains radio frequency (RF) signal data for a period of approximately one month, from May 5, 2023 to June 11, 2023, collected via SDR hardware interfaced to DragonOS Focal. Each row of the dataset represents a single RF signal observation, with various features that describe the signal and its environment.
The dataset can be used for tasks such as machine learning, statistical analysis, and signal processing.
The generated dataset can be used for various types of analysis and predictive analysis, which can help machine learning scientists in developing and testing models for RF signal processing, interference detection and mitigation, and device performance optimization. Some of the possible analysis and predictive analysis that can be performed using this data are:
Signal Classification: The dataset can be used to classify RF signals based on their modulation type, frequency, bandwidth, and other features. This can help in identifying specific types of signals, such as voice or data transmissions, and can aid in tasks such as signal detection, interception, and decoding.
Interference Detection: The dataset contains information about the type and level of interference present in the environment. This can be used to develop models for detecting and mitigating interference, which can improve the overall quality of the RF signal.
Device Performance Optimization: The dataset includes information about the type of RF device used to generate the signal, as well as its CPU usage, memory usage, and battery level. This can be used to develop models for optimizing the performance of RF devices, such as reducing power consumption or improving signal quality.
Weather Condition Analysis: The dataset provides information about the weather conditions at the time of signal observation, including temperature, humidity, wind speed, precipitation, and weather condition. This ...
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
Dataset for training and evaluating RFI detection schemes representing MeerKAT instrumentation and predominantly satellite-based contamination. These datasets are produced using Tabascal and output in HDF5 format. The choice of format is to allow easy use with machine-learning workflows rather than other astronomy pipelines (for example, measurement sets). These datasets are prepared for immediate loading with TensorFlow. The attached config.json files describe the parameters used to generate these datasets.
Dataset parameters

| Name | Num Satellite Sources | Num Ground RFI Sources |
|---|---|---|
| obs_100AST_0SAT_0GRD_512BSL_64A_512T-0440-1462_016I_512F-1.227e+09-1.334e+09 | 0 | 0 |
| obs_100AST_1SAT_0GRD_512BSL_64A_512T-0440-1462_016I_512F-1.227e+09-1.334e+09 | 1 | 0 |
| obs_100AST_1SAT_3GRD_512BSL_64A_512T-0440-1462_016I_512F-1.227e+09-1.334e+09 | 1 | 3 |
| obs_100AST_2SAT_0GRD_512BSL_64A_512T-0440-1462_016I_512F-1.227e+09-1.334e+09 | 2 | 0 |
| obs_100AST_2SAT_3GRD_512BSL_64A_512T-0440-1462_016I_512F-1.227e+09-1.334e+09 | 2 | 3 |
Using simulated data allows for access to ground truth for noise contamination. As such, these datasets contain the observation visibility amplitudes (without noise), noise visibilities and boolean pixel-wise masks at several thresholds on the noise visibilities. We outline the dimensions of all datasets below:
Dataset Dimensions

| Field | vis | masks_orig | masks_0 | masks_1 | masks_2 | masks_4 | masks_8 | masks_16 |
|---|---|---|---|---|---|---|---|---|
| Datatype | float32 | float32 | bool | bool | bool | bool | bool | bool |
Of course, one can produce masks at arbitrary thresholds, but for convenience, we include several pre-computed options.
All datasets and all fields have the dimensions (512, 512, 512, 1), corresponding to (baseline, time, frequency, amplitude/mask).
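Deriving a mask at an arbitrary threshold from the noise visibilities amounts to a pixel-wise comparison. A minimal sketch of the idea (plain nested lists stand in for the (baseline, time, frequency) arrays; the actual files are HDF5):

```python
def mask_at_threshold(noise_vis, threshold):
    """Pixel-wise boolean mask: flag pixels whose noise-visibility amplitude exceeds threshold."""
    return [[abs(v) > threshold for v in row] for row in noise_vis]

# Tiny 2x3 example standing in for one baseline-time slice.
noise_vis = [
    [0.1, 2.5, 0.3],
    [4.0, 0.0, 1.1],
]
mask = mask_at_threshold(noise_vis, threshold=1.0)
```

The pre-computed masks_0 through masks_16 fields simply save recomputing this for several commonly used thresholds.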
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
These data were used in the First RF Spectrum-Sharing Challenge (RFSSC), held in 2024-2025 and culminating in the announcement of the winner, AiRANACULUS, on January 30, 2025. The first RFSSC was co-sponsored by the United States Air Force Research Laboratory (AFRL) and the National Security Innovation Network (NSIN). It was a data-centric contest open to US academic and industrial organizations, intended to spur innovation in machine learning and signal processing for automatic radio-frequency scene analysis (RFSA).
The long-term monitoring of gross primary production (GPP) is crucial to the assessment of the carbon cycle of terrestrial ecosystems. In this study, a well-known machine learning model (Random Forest, RF) is established to reconstruct the global GPP dataset named ECGC_GPP. The model distinguished nine functional plant types, including C3 and C4 crops, using eddy fluxes, meteorological variables, and leaf area index as training data for the RF model. Based on ERA5_Land and the corrected GEOV2 data, the global monthly GPP dataset at a 0.05-degree resolution from 1999 to 2019 was estimated. The results showed that the RF model could explain 74.81% of the monthly variation of GPP in the testing dataset, of which the average contribution of leaf area index (LAI) reached 41.73%. The average annual GPP and its standard deviation during 1999-2019 were 117.14 ± 1.51 Pg C yr-1, with an upward trend of 0.21 Pg C yr-2 (p < 0.01). By using the plant functional type classification, the underestimat...

We unified the ERA5_Land and the corrected GEOV2 datasets to 0.05-degree and monthly scales. The meteorological and remote sensing datasets were classified by the eight PFTs to estimate the GPP of each PFT. In particular, we established site-level PFT training models for CRO_C3 and CRO_C4 separately, due to their significant differences. The CRO cells were a mixture of CRO_C3 and CRO_C4; therefore, the trained CRO_C3 and CRO_C4 models were both applied to the CRO cells and multiplied by their respective proportions to generate the final GPP estimation of CRO. This was designed to improve the current situation of GPP underestimation over CRO_C4-dominated regions. In this way, we generated a 0.05-degree, monthly-scale global GPP dataset (ECGC_GPP) from 1999 to 2019.

The ECGC_GPP dataset is stored in .nc file format and can be opened using Matlab or Python.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
Given the importance of datasets for sensing-communication integration research, a novel simulation platform for constructing communication and multi-modal sensory datasets is developed. The developed platform integrates three high-precision software tools, i.e., AirSim, WaveFarer, and Wireless InSite, and further achieves their in-depth integration and precise alignment. Based on the developed platform, a new synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM), named SynthSoM, is proposed. The SynthSoM dataset contains various air-ground multi-link cooperative scenarios with comprehensive conditions, including multiple weather conditions, times of day, intelligent agent densities, frequency bands, and antenna types. The SynthSoM dataset encompasses multiple data modalities, including radio-frequency (RF) channel large-scale and small-scale fading data, RF millimeter-wave (mmWave) radar sensory data, and non-RF sensory data, e.g., RGB images, depth maps, and light detection and ranging (LiDAR) point clouds. The quality of the SynthSoM dataset is validated via statistics-based qualitative inspection and via machine learning (ML) evaluation metrics against real-world measurements. The SynthSoM dataset is open-sourced and provides consistent data for cross-comparing SoM-related algorithms.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
These are the datasets used for the study titled "A Comparison Framework for Deep Learning RFI Detection Algorithms in Radio Astronomy". These files are made publicly available as an additional resource to the submission of the author's Master's degree at Stellenbosch University. The detection is done in the field of radio astronomy. Each dataset consists of images/spectrograms/waterfall plots for baselines, and the corresponding binary mask for each image. The datasets can be used to train machine learning models or, as in this study, supervised fully convolutional neural networks.
The LOFAR datasets consist of real observations and were slightly modified from https://zenodo.org/record/6724065. See this resource regarding the observational parameters used to retrieve the data from the LOFAR Long Term Archive. The HERA dataset consists of simulated observations generated with hera_sim (https://readthedocs.org/projects/hera-sim/). The 28 March dataset contains accurate pixel-perfect binary masks for each image. The 20 July dataset is identical to the first, except that the binary masks are generated with AOFlagger. All three datasets have a test set stored with pixel-perfect simulation masks (HERA) or expert hand-labeled masks (LOFAR).
The CSV file contains the results of all trained models and has fields for: model class, #filters, #FLOPS, #weights, preprocessing methods, train, validation and test accuracy scores, as well as lists of (threshold, FPR, TPR) values to generate receiver operating characteristic curves. See https://github.com/CharlDuToit/RFI-NLN to visualize the results or to train new models.
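From such a list of (threshold, FPR, TPR) triples, an ROC curve and its area under the curve can be recovered with a trapezoidal sum. A generic sketch (the triples below are made up for illustration, not taken from the CSV):

```python
def roc_auc(points):
    """Trapezoidal AUC from (threshold, FPR, TPR) triples; curve anchored at (0,0) and (1,1)."""
    pts = sorted((fpr, tpr) for _, fpr, tpr in points)
    pts = [(0.0, 0.0)] + pts + [(1.0, 1.0)]
    auc = 0.0
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        auc += (x1 - x0) * (y0 + y1) / 2.0
    return auc

# Hypothetical operating points at three thresholds.
points = [(0.9, 0.05, 0.60), (0.5, 0.20, 0.85), (0.1, 0.60, 0.98)]
auc = roc_auc(points)
```

Plotting FPR on the x-axis against TPR on the y-axis for the same sorted points gives the ROC curve itself.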
Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
Machine learning classifiers trained on class-imbalanced data are prone to overpredict the majority class. This leads to a larger misclassification rate for the minority class, which in many real-world applications is the class of interest. For binary data, the classification threshold is set by default to 0.5, which, however, is often not ideal for imbalanced data. Adjusting the decision threshold is a good strategy to deal with the class imbalance problem. In this work, we present two different automated procedures for the selection of the optimal decision threshold for imbalanced classification. A major advantage of our procedures is that they do not require retraining of the machine learning models or resampling of the training data. The first approach is specific to random forest (RF), while the second approach, named GHOST, can potentially be applied to any machine learning classifier. We tested these procedures on 138 public drug discovery data sets containing structure-activity data for a variety of pharmaceutical targets. We show that both thresholding methods significantly improve the performance of RF. We tested the use of GHOST with four different classifiers in combination with two molecular descriptors, and we found that most classifiers benefit from threshold optimization. GHOST also outperformed other strategies, including random undersampling and conformal prediction. Finally, we show that our thresholding procedures can be effectively applied to real-world drug discovery projects, where the imbalance and characteristics of the data vary greatly between the training and test sets.
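The core idea of decision-threshold adjustment can be sketched generically as a scan over candidate thresholds (an illustration using balanced accuracy as the selection metric; this is not the paper's exact RF out-of-bag or GHOST procedure, and the toy scores below are invented):

```python
def best_threshold(probs, labels, thresholds=None):
    """Scan candidate decision thresholds; keep the one maximizing balanced accuracy."""
    if thresholds is None:
        thresholds = [i / 20 for i in range(1, 20)]  # 0.05, 0.10, ..., 0.95
    pos = sum(labels)
    neg = len(labels) - pos
    def balanced_accuracy(t):
        tp = sum(1 for p, y in zip(probs, labels) if y == 1 and p >= t)
        tn = sum(1 for p, y in zip(probs, labels) if y == 0 and p < t)
        return 0.5 * (tp / pos + tn / neg)
    return max(thresholds, key=balanced_accuracy)

# Imbalanced toy scores: the model is under-confident and assigns
# the minority positives probabilities well below the default 0.5.
probs  = [0.05, 0.10, 0.15, 0.20, 0.25, 0.12, 0.08, 0.18, 0.30, 0.35, 0.40, 0.45]
labels = [0,    0,    0,    0,    0,    0,    0,    0,    1,    1,    1,    1]
t_star = best_threshold(probs, labels)
```

Here the default 0.5 threshold would reject every positive, whereas the scan settles on a lower threshold that separates the classes; no retraining or resampling is involved, which is the property the paper emphasizes.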
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
Estimation of fruit quality parameters is usually based on destructive techniques, which are tedious, costly and unreliable when dealing with huge amounts of fruit. Alternatively, non-destructive techniques such as image processing and spectral reflectance would be useful in rapid detection of fruit quality parameters. This research study aimed to assess the potential of image processing, spectral reflectance indices (SRIs), and machine learning models such as decision tree (DT) and random forest (RF) to qualitatively estimate characteristics of mandarin and tomato fruits at different ripening stages. Quality parameters such as chlorophyll a (Chl a), chlorophyll b (Chl b), total soluble solids (TSS), titratable acidity (TA), TSS/TA, carotenoids (car), lycopene and firmness were measured. The results showed that Red-Green-Blue (RGB) indices and newly developed SRIs demonstrated high efficiency for quantifying different fruit properties. For example, the R2 of the relationships between all RGB indices (RGBI) and measured parameters varied between 0.62 and 0.96 for mandarin and between 0.29 and 0.90 for tomato. RGBI such as the visible atmospheric resistant index (VARI) and normalized red (Rn) presented the highest R2 (0.96) with car of mandarin fruits, while the excess red vegetation index (ExR) presented the highest R2 (0.84) with car of tomato fruits. SRIs such as RSI 710,600 and R730,650 showed the greatest R2 values with respect to Chl a (R2 = 0.80) for mandarin fruits, while the GI had the greatest R2 with Chl a (R2 = 0.68) for tomato fruits. Combining RGB indices and SRIs with DT and RF models would be a robust strategy for estimating the eight observed variables with reasonable accuracy. Regarding mandarin fruits, in the task of predicting Chl a, the DT-2HV model delivered exceptional results, registering an R2 of 0.993 with an RMSE of 0.149 for the training set, and an R2 of 0.991 with an RMSE of 0.114 for the validation set.
Likewise, for tomato fruits, the DT-5HV model demonstrated exemplary performance in Chl a prediction, achieving an R2 of 0.905 and an RMSE of 0.077 for the training dataset, and an R2 of 0.785 with an RMSE of 0.077 for the validation dataset. The overall outcomes showed that the RGB indices, the newly developed SRIs, and DT and RF models based on RGBI and SRIs could be used to evaluate the measured parameters of mandarin and tomato fruits.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
These datasets represent predictions and associated probabilities using four machine learning methods, associated with the collection: Mapping Wetlands with High Resolution Planet SuperDove Satellite Imagery: An Assessment of Machine Learning Models Across the Diverse Waterscapes of New Zealand (10.26021/canterburynz.c.7848596). The following datasets are available:
- HGB prediction
- HGB probability
- MLPC prediction
- MLPC probability
- Random forest prediction
- Random forest probability [this dataset]
- XGBoost prediction
- XGBoost probability
For details of the models developed, please see the collection and associated paper. The following files are available in each dataset, each representing an area within New Zealand:
- xxxxx_mmm_prediction.tif: model prediction, encoded as 8-bit integers where 1 is predicted as wetland (>50% probability), and NA (no data) is non-wetland.
- xxxxx_mmm_probability.tif: model wetland probability, encoded as 16-bit integers, with probability values from 0 to 1 rescaled from 0 to 10,000. Divide the values by 10,000 to obtain probabilities to four decimal places.
In the tile filenames, xxxxx refers to the UUID of the grid area, which can be found in the file nzgrid_uuid.gpkg, and mmm is a code which refers to the model used:
- hgb: histogram gradient boost
- mlpc: multi-layer perceptron classification
- rf: random forest
- xgb: extreme gradient boosting
In addition to the tif images, two virtual raster tile files are included to enable mapping at the national scale: _mmm_prediction.vrt and _mmm_probability.vrt. All tif images are saved as cloud optimised GeoTIFF (COG), which makes them fast to display even at a national level, although it increases the data size. Total size is around 700 MB for the prediction datasets, and ~75 GB for the probability datasets. Metadata for the Planet SuperDove imagery used for each pixel of the predictions is available here: https://doi.org/10.26021/canterburynz.29231837.v
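Undoing the 16-bit probability rescaling is a single division. A minimal sketch (plain lists stand in for the raster values; in practice one would read the COG tiles with a library such as rasterio or GDAL):

```python
def to_probability(raster_values, scale=10_000):
    """Convert 16-bit integer probability values (0..10000) back to floats in [0, 1]."""
    return [v / scale for v in raster_values]

# Example integer pixel values as stored in a xxxxx_mmm_probability.tif tile.
probs = to_probability([0, 5000, 9937, 10000])
```

Each recovered value is a wetland probability to four decimal places, as described above.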
The dataset used in this paper for an RF-based UAV detection and identification system using machine learning.