100+ datasets found
  1. Radio Frequency (RF) Signal Image Classification

    • kaggle.com
    zip
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Halcy0nic (2025). Radio Frequency (RF) Signal Image Classification [Dataset]. https://www.kaggle.com/datasets/halcy0nic/radio-frequecy-rf-signal-image-classification
    Explore at:
    zip(2599544168 bytes)Available download formats
    Dataset updated
    May 15, 2025
    Authors
    Halcy0nic
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    RF Signals Classification Dataset

    This dataset contains images of various radio frequency (RF) signals captured on waterfall plots using a spectrum analyzer. It is designed to aid in the classification and identification of different types of RF signals commonly encountered in wireless communications and radio technologies.

    Dataset Summary

    The dataset comprises waterfall plot images of RF signals across 21 different classes, representing a wide range of communication protocols, technologies, and signal types. Each image in the dataset is a visual representation of a specific RF signal's frequency and time characteristics.

    Supported Tasks

    This dataset is primarily suited for:

    • Image Classification
    • Signal Identification
    • RF Signal Analysis
    • Machine Learning in Wireless Communications

    Dataset Structure

    Data Fields

    • image: A waterfall plot image of the RF signal
    • class: The label identifying the type of RF signal

    Classes

    The dataset includes the following 21 classes of RF signals:

    1. RS41-Radiosonde
    2. Radioteletype
    3. ADS-B
    4. AIS
    5. Automatic Picture Transmission
    6. Bluetooth
    7. Cellular
    8. Digital Audio Broadcasting
    9. Digital Speech Decoder
    10. FM
    11. LoRa
    12. Morse
    13. On-Off Keying
    14. Packet
    15. POCSAG
    16. Remote Keyless Entry
    17. SSTV
    18. WiFi
    19. Airband
    20. VOR
    21. Unknown

    File Description

    This dataset contains images of radio frequency (RF) signals captured as waterfall plots using a spectrum analyzer. The dataset is organized as follows:

    - datasets
    --- signal class
    ----- image
    

    For example:

    - datasets
    --- bluetooth
    ----- c17afe0fe5cc3cc1308605cf390ecbb5.png
    

    Dataset Creation

    Source

    The images in this dataset were captured using a spectrum analyzer, which visualizes RF signals as waterfall plots. These plots show the frequency content of a signal over time, with color representing signal strength.

    Collection Process

    RF signals were collected across various frequency bands using appropriate antennas and receivers. The spectrum analyzer was used to generate waterfall plots for each captured signal. Care was taken to ensure a diverse representation of signal types and conditions.

    Known Limitations

    • Some classes may have more samples than others, potentially leading to class imbalance.
    • The dataset does not include raw signal data, only visual representations.
    • Most of the images were captured using SDRAngel, therefore any classifier will likely have a heavy bias towards other images captured using that platform.

    Dataset Curators

    Halcy0nic

    Licensing Information

    [MIT]

    Contributions

    We welcome contributions to improve this dataset.

  2. Noisy Drone RF Signal Classification v2

    • kaggle.com
    zip
    Updated Jun 26, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sgluege (2024). Noisy Drone RF Signal Classification v2 [Dataset]. https://www.kaggle.com/datasets/sgluege/noisy-drone-rf-signal-classification-v2
    Explore at:
    zip(124950870773 bytes)Available download formats
    Dataset updated
    Jun 26, 2024
    Authors
    sgluege
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We provide a preprocessed dataset to enable model development for the detection/classification of drone RF signals. It consists of the non-overlapping signal vectors of length of 1048576 samples, which corresponds to approx. 74.9ms at 14MHz. We have also added Labnoise (Bluetooth, Wi-Fi, Amplifier) and Gaussian noise to the dataset.

    After normalization, the drone signals were mixed with either Labnoise (50%) or Gaussian noise (50%). The noise class was created by mixing Labnoise and Gaussian noise in all possible combinations (i.e., Labnoise + Labnoise, Labnoise + Gaussian noise, Gaussian noise + Labnoise, and Gaussian noise + Gaussian noise). For the drone signal classes, as for the noise class, the number of samples for each level of SNR is equally distributed over the interval of SNR in [-20, 30]dB in steps of 2dB, i.e., 679 - 685 samples per SNR. The resulting number of samples per class is shown in the Table below.

    DJIFutabaT14FutabaT7GraupnerTaranisTurnigyNoise
    1280347280180116638558872

    Code:

    See https://github.com/sgluege/Robust-Drone-Detection-and-Classification for a script to load and inspect the dataset. Further you'll find code to train and evaluate a model.

    Related Literature:

    Further information about the data, and how to build a classifier, can be found in our related manuscript. Please cite it if you find it useful.

    S. Glüge, M. Nyfeler, A. Aghaebrahimian, N. Ramagnano and C. Schüpbach, "Robust Low-Cost Drone Detection and Classification Using Convolutional Neural Networks in Low SNR Environments," in IEEE Journal of Radio Frequency Identification, vol. 8, pp. 821-830, 2024, doi: 10.1109/JRFID.2024.3487303

    Bibtex: @ARTICLE{10737118, author={Glüge, Stefan and Nyfeler, Matthias and Aghaebrahimian, Ahmad and Ramagnano, Nicola and Schüpbach, Christof}, journal={IEEE Journal of Radio Frequency Identification}, title={Robust Low-Cost Drone Detection and Classification Using Convolutional Neural Networks in Low SNR Environments}, year={2024}, volume={8}, number={}, pages={821-830}, doi={10.1109/JRFID.2024.3487303} }

  3. d

    Data from: Simulated Radar Waveform and RF Dataset Generator for Incumbent...

    • catalog.data.gov
    • data.nist.gov
    • +2more
    Updated Sep 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2025). Simulated Radar Waveform and RF Dataset Generator for Incumbent Signals in the 3.5 GHz CBRS Band [Dataset]. https://catalog.data.gov/dataset/simulated-radar-waveform-and-rf-dataset-generator-for-incumbent-signals-in-the-3-5-ghz-cbr
    Explore at:
    Dataset updated
    Sep 30, 2025
    Dataset provided by
    National Institute of Standards and Technology
    Description

    This software tool generates simulated radar signals and creates RF datasets. The datasets can be used to develop and test detection algorithms by utilizing machine learning/deep learning techniques for the 3.5 GHz Citizens Broadband Radio Service (CBRS) or similar bands. In these bands, the primary users of the band are federal incumbent radar systems. The software tool generates radar waveforms and randomizes the radar waveform parameters. The pulse modulation types for the radar signals and their parameters are selected based on NTIA testing procedures for ESC certification, available at http://www.its.bldrdoc.gov/publications/3184.aspx. Furthermore, the tool mixes the waveforms with interference and packages them into one RF dataset file. The tool utilizes a graphical user interface (GUI) to simplify the selection of parameters and the mixing process. A reference RF dataset was generated using this software. The RF dataset is published at https://doi.org/10.18434/M32116.

  4. RF Dataset of Incumbent Radar Systems in the 3.5 GHz CBRS Band

    • search.datacite.org
    • data.nist.gov
    Updated Aug 27, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Raied Caromi (2019). RF Dataset of Incumbent Radar Systems in the 3.5 GHz CBRS Band [Dataset]. http://doi.org/10.18434/m32116
    Explore at:
    Dataset updated
    Aug 27, 2019
    Dataset provided by
    DataCitehttps://www.datacite.org/
    National Institute of Standards and Technology
    Authors
    Raied Caromi
    License

    https://www.nist.gov/director/licensinghttps://www.nist.gov/director/licensing

    Description

    The RF dataset can be used to develop and test detection algorithms for the 3.5 GHz CBRS or similar bands where the primary users of the band are federal incumbent radar systems. The dataset consists of synthetically generated radar waveforms with added white Gaussian noise. The RF dataset is suitable for development and testing of machine/deep learning detection algorithms. A large number of parameters of the waveforms are randomized across the dataset. Due to its large size, the dataset is divided into groups, and each group consists of multiple files. For more information about the dataset, refer to: R. Caromi, M. Souryal, and T. Hall, "RF Dataset of Incumbent Radar Systems in the 3.5 GHz CBRS Band," Journal of Research of the National Institute of Standards and Technology. (in press). In addition, the metadata of the dataset is summarized in "Data Dictionary of 3.5 GHz Radar Waveforms" [pdf] accompanying the data. For more information about the motivation behind this RF dataset, refer to: T. Hall, R. Caromi, M. Souryal, and A. Wunderlich, "Reference Datasets for Training and Evaluating RF Signal Detection and Classification Models," to appear in Proc. IEEE GLOBECOM Workshop on Advancements in Spectrum Sharing, Dec. 2019.

  5. DeepSig Dataset: RadioML 2018.01A

    • kaggle.com
    zip
    Updated Jul 27, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    pinxau1000 (2021). DeepSig Dataset: RadioML 2018.01A [Dataset]. https://www.kaggle.com/pinxau1000/radioml2018
    Explore at:
    zip(19347212995 bytes)Available download formats
    Dataset updated
    Jul 27, 2021
    Authors
    pinxau1000
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    1. Context

    Deepsig Inc. has created a small corpus of standard datasets which can be used for original and reproducible research, experimentation, measurement and comparison by fellow scientists and engineers. These datasets allow machine learning researchers with new ideas to dive directly into an important technical area without the need for collecting or generating new datasets, and allows for direct comparison to efficacy of prior work.

    This dataset includes both synthetic simulated channel effects and over-the-air recordings of 24 digital and analog modulation types which has been heavily validated.

    This dataset was used for Over-the-air deep learning based radio signal classification published 2017 in IEEE Journal of Selected Topics in Signal Processing, which provides additional details and description of the dataset. Data are stored in hdf5 format as complex floating point values, with 2 million examples, each 1024 samples long.

    2. Content

    2.1. Original content

    The original content is those that is provided in the compressed archive that is accessible at the Deepsig Inc. website.

    2.1.1. GOLD_XYZ_OSC.0001_1024.hdf5

    The dataset is provided in the "GOLD_XYZ_OSC.0001_1024.hdf5" file. The HDF5 format is designed to store and organize large amounts of data. See this list of libraries and interfaces for HDF5 manipulation.

    The dataset exhibits the following structure: - 24 modulations: OOK, ASK4, ASK8, BPSK, QPSK, PSK8, PSK16, PSK32, APSK16, APSK32, APSK64, APSK128, QAM16, QAM32, QAM64, QAM128, QAM256, AM_SSB_WC, AM_SSB_SC, AM_DSB_WC, AM_DSB_SC, FM, GMSK and OQPS. - 26 SNRs per modulation (-20 dB to +30 dB in steps of 2dB). - 4096 frames per modulation-SNR combination. - 1024 complex time-series samples per frame. - Samples as floating point in-phase and quadrature (I/Q) components, resulting in a (1024,2) frame shape. - 2.555.904 frames in total.

    Each frame can be retrieved by accessing the HDF5 groups: - X: I/Q components of the frame; - Y: Modulation of the frame (one-hot encoded) - Z: SNR of the frame

    Data consist of 24 Modulations --> 26 SNR --> 4096 Frames --> (1024, 2) I/Q Samples. Below is a structural example of the dataset: ``` python

    Modulation 0: { SNR -20: [ Frame 0: # sample 0 (e.g. hdf5_file['X'][0]) [ [I0, Q0], # sample 0.0 (e.g. hdf5_file['X'][0][0]) [I1, Q1], # sample 0.1 (e.g. hdf5_file['X'][0][1]) ..., [I1023, Q1023] # sample 0.1023 (e.g. hdf5_file['X'][0][1023]) ], Frame 1: [ ... ] # sample 1 (e.g. hdf5_file['X'][1]) ..., Frame 4094: [ ... ] # sample 4094 (e.g. hdf5_file['X'][4094]) Frame 4095: [ ... ] # sample 4095 (e.g. hdf5_file['X'][4095]) ] SNR -18: [ Frame 0: [ ... ] # sample 4096 (e.g. hdf5_file['X'][4096]) ..., Frame 4095: [ ... ] # sample 8191 (e.g. hdf5_file['X'][8191]) ] ... SNR 30: [ Frame 0: [ ... ] # sample 102400 (e.g. hdf5_file['X'][102400]) ..., Frame 4095: [ ... ] # sample 106495 (e.g. hdf5_file['X'][106495]) ] } ... Modulation 23: { ... } ```

    2.1.2. classes.txt

    The original file provided with the dataset. The order of the modulation classes in this file is incorrect. This is a known issue, please consider the order of the classes provided in the paper Over-the-Air Deep Learning Based Radio Signal Classification. A text file and a json file with the fixed classes are provided along this dataset.

    2.1.3. LICENSE.txt

    The original license provided with the dataset. Dataset provided by Deepsig Inc. and licensed under Creative Commons Attribution - NonCommercial - ShareAlike 4.0 License (CC BY-NC-SA 4.0)

    2.2. Additional content

    The additional content is those that is not provided in the compressed archive that is accessible at the Deepsig Inc. website.

    2.2.1. classes-fixed.txt

    A text file with the classes in the correct order.

    2.2.2. classes-fixed.json

    A JSON file with the classes in the correct order.

    2.2.3. datasets.desktop

    A shortcut to Deepsig Inc. datasets.

    3. Acknowledgements

    This dataset is provided by Deepsig Inc..

  6. Z

    Datasets for Deep Learning Based Radio Frequency Side-Channel Attack on...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jan 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Baliuka, Adomas; Stöcker, Markus; Auer, Michael; Freiwang, Peter; Weinfurter, Harald; Knips, Lukas (2024). Datasets for Deep Learning Based Radio Frequency Side-Channel Attack on Quantum Key Distribution [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7956054
    Explore at:
    Dataset updated
    Jan 19, 2024
    Dataset provided by
    LMU Munich
    Authors
    Baliuka, Adomas; Stöcker, Markus; Auer, Michael; Freiwang, Peter; Weinfurter, Harald; Knips, Lukas
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset contains measurements of radio-frequency electromagnetic emissions from a home-built sender module for BB84 quantum key distribution. The goal of these measurements was to evaluate information leakage through this side-channel. This dataset supplements our publication and allows to reproduce our results together with the source code hosted at GitHub (and also on Zenodo via integration with GitHub).The measurements are performed using a magnetic near-field probe, an amplifier and an oscilloscope. The dataset contains raw measured data in the file format output by the oscilloscope. Use our source code to make use of it. Detailed descriptions of measurement procedure can be found in our paper and in the metadata JSON files found within the dataset.

    Commented list of datasets

    This file lists the datasets that were analyzed and reported on in the paper. The datasets in the list refer to directories here. Note that most of the datasets contain additional files with metadata, which detail where and how the measurements were performed. The mentioned Jupyter notebooks refer to the source code repository https://github.com/XQP-Munich/EmissionSecurityQKD (not included in this dataset). Most of those notebooks output JSON files storing results. The processed JSON files are also included in the source code repository.

    In naming of datasets,

    Antenna refers to the log-periodic dipole antenna. All datasets that do not contain Antenna in their name are recorded with the magnetic near-field probe.

    Rev1 refers to the initial electronics design, while rev2 refers to the revised electronics design which contains countermeasures aiming to reduce emissions.

    Shielding refers to measurements where the device is enclosed in a metallic shielding and the measurement takes place outside the shielding.

    Rotation refers to orientation of the magnetic near-field probe at the same spacial location

    Datasets collected with near-field probe for Rev1 electronics

    Rev1Distance: contains measurements at different distances from the Rev1 electronics performed above the FPGA. The deep learning attack is analyzed in TEMPEST_ATTACK.ipynb. The amplitude is analyzed in get_raw_data_RMS_amplitude.ipynb.

    Rev12D: different locations on a 2d grid at a constant distance from the electronics. The deep learning attack is analyzed in TEMPEST_ATTACK.ipynb.

    Rev130meas2.5cm: 30 measurements above the FPGA at a hight of 2.5cm. Used to evaluate how much amount of training data affects neural network performance. The deep learning attack is analyzed in notebooks TEMPEST_ATTACK*.ipynb. In particular, TEMPEST_ATTACK_VARY_TRAINING_DATA.ipynb is used on this dataset.

    Rev1Rotation10deg contains a measurement for varying orientation of the probe at the same location. This is not mentioned in the paper and is only included for completeness. The deep learning attack is analyzed in notebooks TEMPEST_ATTACK*.ipynb.

    Rev1TEMPESTShieldingFPGA Measurements with and without shielding at 4cm above the FPGA.

    • Rev1TEMPESTShieldingUSBHole Measurements with shielding in front of a hole of size about 2cm x 2cm. The deep learning attack is analyzed in TEMPEST_ATTACK*.ipynb.

    Datasets collected with near-field probe for Rev2 electronics

    Rev2Distance contains measurements at different distances from the Rev2 electronics performed above the FPGA.

    Rev22D and Rev22Dstart_7_0 contain measurements on a 2d grid performed on the revised electronics. The dataset is split in two directories because the measurement procedure crashed in the middle. This split structure was kept in order to maintain consistency with the automatic metadata.

    Rev230meas2.5cm 30 measurements above the FPGA at a hight of 2.5cm. Used to evaluate how much amount of training data affects neural network performance. The deep learning attack is analyzed in notebooks TEMPEST_ATTACK*.ipynb. In particular, TEMPEST_ATTACK_VARY_TRAINING_DATA.ipynb is used on this dataset.

    Other datasets

    BackgroundTuesday background measurement (QKD device is not powered at all) performed with near-field probe on 2022 June 21st.

    BackgroundSaturday background measurement (QKD device is not powered at all) performed with near-field probe on 2022 June 11th.

    AntennaSpectra Dataset of spectra directly recorded by the oscilloscope. Used to demonstrate ability of telling apart the situation of sending QKD key (standard operation) and having the device turned on but not sending any key at a distance. Analyzed in notebook Comparing_KeyNokey_Measurements.ipynb.

    Rev2ShieldingAntenna Raw amplitude measurements with log-periodic dipole antenna on Rev2 electronics including shielding enclosure, collected at various distances. None of our attacks against this scenario were successful. The dataset represents a challenge to test more advanced attacks using improved data processing.

  7. Baseline Deep Learning Detectors for Radar Detection in the 3.5 GHz CBRS...

    • data.nist.gov
    • datasets.ai
    • +2more
    Updated Mar 1, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Raied Caromi (2021). Baseline Deep Learning Detectors for Radar Detection in the 3.5 GHz CBRS Band [Dataset]. http://doi.org/10.18434/mds2-2380
    Explore at:
    Dataset updated
    Mar 1, 2021
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Authors
    Raied Caromi
    License

    https://www.nist.gov/open/licensehttps://www.nist.gov/open/license

    Description

    This project aims to create a comprehensive framework for generating radio frequency (RF) datasets, designing deep learning (DL) detectors, and evaluating their detection performance using both simulated and experimental test data. The proposed tools and techniques are developed in the context of dynamic spectrum use for the 3.5 GHz Citizens Broadband Radio Service (CBRS), but they can be utilized and expanded for standardization of machine learned spectrum awareness technologies and methods. This dataset consists of pre-trained DL models for radar detection in the CBRS band using simulated waveforms. The code for creating and using these models is available at https://github.com/usnistgov/BaselineDeepLearningRadarDetectors.

  8. f

    Dataset of LogP and pKb for Machine Learning Predictions

    • ufs.figshare.com
    zip
    Updated Oct 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Juda Baikété; Alhadji Malloum; Jeanet Conradie (2025). Dataset of LogP and pKb for Machine Learning Predictions [Dataset]. http://doi.org/10.38140/ufs.30438257.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 28, 2025
    Dataset provided by
    University of the Free State
    Authors
    Juda Baikété; Alhadji Malloum; Jeanet Conradie
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data contains two sets of datasets. One for pKb, and the other for LogP machine learning prediction. The datasets contain several descriptors generated using RDKit and density functional theory (DFT).

  9. RF Signal Classification Using Logged Data

    • kaggle.com
    Updated Jul 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chidinma Idonor (2025). RF Signal Classification Using Logged Data [Dataset]. https://www.kaggle.com/datasets/chidinmaidonor/rf-signal-classification-using-logged-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 30, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Chidinma Idonor
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    The dataset consists of labeled RF signal feature vectors, each row representing a single signal sample. Features may include signal strength, bandwidth, center frequency, modulation index, mean amplitude, FFT-based values, and other time/frequency domain attributes. Labels correspond to predefined signal types or modulation classes such as AM, FM, QPSK, etc.

  10. RF SPECTROGRAMS OF UAV_OUTDOOR

    • figshare.com
    zip
    Updated Dec 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prajoy Podder (2024). RF SPECTROGRAMS OF UAV_OUTDOOR [Dataset]. http://doi.org/10.6084/m9.figshare.28063535.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 19, 2024
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Prajoy Podder
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    2, 3, 4, 5, 6, 7 respectively represent UAVs--Parrot, Fimi, Phantom 20, Mavic air, Mavic mini, and Phantom 10. Outdoors, pilot the UAVs at 50 m from the USRP.Also Cite the following Paper: P. Podder, M. Zawodniok and S. Madria, "Deep Learning for UAV Detection and Classification via Radio Frequency Signal Analysis," 2024 25th IEEE International Conference on Mobile Data Management (MDM), Brussels, Belgium, 2024, pp. 165-174, doi: 10.1109/MDM61037.2024.00040.

  11. RF Signal Data

    • kaggle.com
    zip
    Updated Jun 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Suraj (2023). RF Signal Data [Dataset]. https://www.kaggle.com/datasets/suraj520/RF-signal-data
    Explore at:
    zip(214073576 bytes)Available download formats
    Dataset updated
    Jun 14, 2023
    Authors
    Suraj
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The generated dataset contains radio frequency (RF) signal data for a period of one month, from May 5, 2023, to June 11, 2023 collected via SDR hardware interfaced to DragonOS Focal. Each row of the dataset represents a single RF signal observation, with various features that describe the signal and its environment.

    The dataset can be used for tasks such as machine learning, statistical analysis, and signal processing.

    The following is a detailed description of each feature in the dataset:
    • Timestamp: The date and time of the signal observation.
    • Frequency: The frequency of the RF signal in Hertz (Hz).
    • Signal Strength: The strength of the RF signal in decibels relative to one milliwatt (dBm).
    • Modulation: The modulation type used for the RF signal. Possible options include Amplitude Modulation (AM), - ---
    • Frequency Modulation (FM), Quadrature Amplitude Modulation (QAM), Binary Phase Shift Keying (BPSK), Quadrature - Phase Shift Keying (QPSK), and 8 Phase Shift Keying (8PSK).
    • Bandwidth: The bandwidth of the RF signal in Hertz (Hz).
    • Location: The location where the signal was observed. The location is a string that includes the name of the city and the state/province.
    • Device Type: The type of RF device used to generate the signal. Possible options include HackRF, Halow-U, and - - SteamDeck.
    • Antenna Type: The type of antenna used to transmit the signal. Possible options include Omnidirectional, Directional, - Dipole, and Yagi.
    • Temperature: The temperature at the location of the signal observation in degrees Celsius.
    • Humidity: The relative humidity at the location of the signal observation as a percentage.
    • Wind Speed: The speed of the wind at the location of the signal observation in kilometers per hour (km/hr).
    • Precipitation: The amount of precipitation at the location of the signal observation in millimeters (mm).
    • Weather Condition: The weather condition at the location of the signal observation. Possible options include Sunny, Rainy, and Cloudy.
    • Interference Type: The type of interference present in the environment. Possible options include None, Co-channel, Adjacent-channel, and Intermodulation.
    • Battery Level: The remaining battery level of the device used to generate the signal as a percentage.
    • Power Source: Whether the device used to generate the signal is currently plugged into a power source or not.
    • CPU Usage: The percentage of the CPU usage of the device used to generate the signal.
    • Memory Usage: The percentage of the memory usage of the device used to generate the signal.
    • WiFi Strength: The strength of the WiFi signal at the location of the signal observation in dBm.
    • Disk Usage: The percentage of the disk usage of the device used to generate the signal.
    • System Load: The system load of the device used to generate the signal.
    • Latitude: The latitude of the location of the signal observation.
    • Longitude: The longitude of the location of the signal observation.
    • Altitude(m): The altitude of the location of the signal observation in meters.
    • Air Pressure: The air pressure at the location of the signal observation in hectopascals (hPa).
    • Device Status: The current status of the device used to generate the signal. Possible options include Streaming I/Q data, Transmitting beacon signal, and Running game.
    • I/Q Data: The in-phase and quadrature components of the signal as a complex valued array.

    The generated dataset can be used for various types of analysis and predictive analysis, which can help machine learning scientists in developing and testing models for RF signal processing, interference detection and mitigation, and device performance optimization. Some of the possible analysis and predictive analysis that can be performed using this data are:

    • Signal Classification: The dataset can be used to classify RF signals based on their modulation type, frequency, bandwidth, and other features. This can help in identifying specific types of signals, such as voice or data transmissions, and can aid in tasks such as signal detection, interception, and decoding.

    • Interference Detection: The dataset contains information about the type and level of interference present in the environment. This can be used to develop models for detecting and mitigating interference, which can improve the overall quality of the RF signal.

    • Device Performance Optimization: The dataset includes information about the type of RF device used to generate the signal, as well as its CPU usage, memory usage, and battery level. This can be used to develop models for optimizing the performance of RF devices, such as reducing power consumption or improving signal quality.

    • Weather Condition Analysis: The dataset provides information about the weather conditions at the time of signal observation, including temperature, humidity, wind speed, precipitation, and weather condition. This ...

  12. Tabascal SNN-NLN Dataset

    • zenodo.org
    • researchdata.edu.au
    • +2more
    zip
    Updated Oct 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nicholas James Pritchard; Nicholas James Pritchard (2023). Tabascal SNN-NLN Dataset [Dataset]. http://doi.org/10.5281/zenodo.8401763
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 31, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Nicholas James Pritchard; Nicholas James Pritchard
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset for training and evaluating RFI detection schemes representing MeerKat instrumentation and predominantly satellite-based contamination. These datasets are produced using Tabascal and output in hdf5 format. The choice of format is to allow for easy use with machine-learning workflows, not other astronomy pipelines (for example, measurement sets). These datasets are prepared for immediate loading with Tensorflow. The attached config.json files describe the parameters used to generate these datasets.

    Dataset parameters
    
    
        Name
        Num Satellite Sources
        Num Ground RFI Sources
    
    
    
    
        obs_100AST_0SAT_0GRD_512BSL_64A_512T-0440-1462_016I_512F-1.227e+09-1.334e+09
        0
        0
    
    
        obs_100AST_1SAT_0GRD_512BSL_64A_512T-0440-1462_016I_512F-1.227e+09-1.334e+09
        1
        0
    
    
        obs_100AST_1SAT_3GRD_512BSL_64A_512T-0440-1462_016I_512F-1.227e+09-1.334e+09
        1
        3
    
    
        obs_100AST_2SAT_0GRD_512BSL_64A_512T-0440-1462_016I_512F-1.227e+09-1.334e+09
        2
        0
    
    
        obs_100AST_2SAT_3GRD_512BSL_64A_512T-0440-1462_016I_512F-1.227e+09-1.334e+09
        2
        3
    

    Using simulated data allows for access to ground truth for noise contamination. As such, these datasets contain the observation visibility amplitudes (without noise), noise visibilities and boolean pixel-wise masks at several thresholds on the noise visibilities. We outline the dimensions of all datasets below:

    Dataset Dimensions
    
    
        Field
        vis
        masks_orig
        masks_0
        masks_1
        masks_2
        masks_4
        masks_8
        masks_16
    
    
        Datatype
        float32
        float32
        bool
        bool
        bool
        bool
        bool
        bool
    

    Of course, one can produce masks at arbitrary thresholds, but for convenience, we include several pre-computed options.

    All datasets and all fields have the dimensions 512, 512, 512, 1 (baseline, time, frequency, amplitude/mask)

  13. T

    Dataset for the First Radio-Frequency Spectrum-Sharing Challenge (RFSSC)

    • dataverse.tdl.org
    bin, zip
    Updated Aug 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chad Spooner; Chad Spooner (2025). Dataset for the First Radio-Frequency Spectrum-Sharing Challenge (RFSSC) [Dataset]. http://doi.org/10.18738/T8/VQZPCN
    Explore at:
    bin(4000000000), zip(196881), zip(3904886), zip(1111818458), zip(1533639906)Available download formats
    Dataset updated
    Aug 22, 2025
    Dataset provided by
    Texas Data Repository
    Authors
    Chad Spooner; Chad Spooner
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    These data were used in the First RF Spectrum-Sharing Challenge which was held in 2024-2025, culminating in announcing the winner AiRANACULUS on January 30, 2025. The first RFSSC was co-sponsored by the United States Air Force Research Laboratory (AFRL) and the National Security Innovation Network (NSIN). The first RFSSC was a data-centric contest open to US academic and industrial organizations intended to spur innovation in machine learning and signal processing for automatic radio-frequency scene analysis (RFSA).

  14. d

    Data from: Estimating global GPP from the plant functional type perspective...

    • search.dataone.org
    • data.niaid.nih.gov
    • +1more
    Updated Jul 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Renjie Guo; Tiexi Chen; Xin Chen; Wenping Yuan; Shuci Liu; Bin He; Lin Li; Shengzhen Wang; Ting Hu; Qingyun Yan; Xueqiong Wei; Jie Dai (2025). Estimating global GPP from the plant functional type perspective using a machine learning approach [Dataset]. http://doi.org/10.5061/dryad.dncjsxm2v
    Explore at:
    Dataset updated
    Jul 16, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Renjie Guo; Tiexi Chen; Xin Chen; Wenping Yuan; Shuci Liu; Bin He; Lin Li; Shengzhen Wang; Ting Hu; Qingyun Yan; Xueqiong Wei; Jie Dai
    Time period covered
    Mar 28, 2023
    Description

    The long-term monitoring of gross primary production (GPP) is crucial to the assessment of the carbon cycle of terrestrial ecosystems. In this study, a well-known machine learning model (Random Forest, RF) is established to reconstruct the global GPP dataset named ECGC_GPP. The model distinguished nine functional plant types, including C3 and C4 crops, using eddy fluxes, meteorological variables, and leaf area index as training data of the RF model. Based on ERA5_Land and the corrected GEOV2 data, the global monthly GPP dataset at a 0.05-degree resolution from 1999 to 2019 was estimated. The results showed that the RF model could explain 74.81% of the monthly variation of GPP in the testing dataset, of which the average contribution of Leaf Area Index (LAI) reached 41.73%. The average annual and standard deviation of GPP during 1999–2019 were 117.14 ± 1.51 Pg C yr-1, with an upward trend of 0.21 Pg C yr-2 (p < 0.01). By using the plant functional type classification, the underestimat..., We unified the ERA5_Land and the corrected GEOV2 datasets to 0.05 degree and monthly scales. The meteorological and remote sensing datasets were classified by the eight PFTs to estimate the GPP of different PFT. Particularly, we established site-level PFT training models for CRO_C3 and CRO_C4, respectively, due to their significant differences. The CRO cells were a mixture of CRO_C3 and CRO_C4. Therefore, trained CRO_C3 and CRO_C4 models were both applied to the CRO cells and multiplied by their respective proportions to generate the final GPP estimation of CRO. This is what we designed to improve the current situation of GPP underestimation over CRO_C4 dominated regions. In this way, we generated a 0.05 degree and monthly scales global GPP dataset (ECGC_GPP) from 1999 to 2019., The ECGC_GPP dataset is stored in .nc file format and can be opened using Matlab or Python.

  15. Data from: SynthSoM: A synthetic intelligent multi-modal...

    • springernature.figshare.com
    bin
    Updated May 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiang Cheng; Ziwei Huang; Yong Yu; Lu Bai; Mingran Sun; Zengrui Han; Ruide Zhang; Sijiang Li (2025). SynthSoM: A synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM) [Dataset]. http://doi.org/10.6084/m9.figshare.28123646.v1
    Explore at:
    binAvailable download formats
    Dataset updated
    May 20, 2025
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Xiang Cheng; Ziwei Huang; Yong Yu; Lu Bai; Mingran Sun; Zengrui Han; Ruide Zhang; Sijiang Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Given the importance of datasets for sensing-communication integration research, a novel simulation platform for constructing communication and multi-modal sensory dataset is developed. The developed platform integrates three high-precision software, i.e., AirSim, WaveFarer, and Wireless InSite, and further achieves in-depth integration and precise alignment of them. Based on the developed platform, a new synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM), named SynthSoM, is proposed. The SynthSoM dataset contains various air-ground multi-link cooperative scenarios with comprehensive conditions, including multiple weather conditions, times of the day, intelligent agent densities, frequency bands, and antenna types. The SynthSoM dataset encompasses multiple data modalities, including radio-frequency (RF) channel large-scale and small-scale fading data, RF millimeter wave (mmWave) radar sensory data, and non-RF sensory data, e.g., RGB images, depth maps, and light detection and ranging (LiDAR) point clouds. The quality of SynthSoM dataset is validated via statistics-based qualitative inspection and evaluation metrics through machine learning (ML) via real-world measurements. The SynthSoM dataset is open-sourced and provides consistent data for cross-comparing SoM-related algorithms.

  16. Z

    A Comparison framework for deep learning RFI detection algorithms in radio...

    • data.niaid.nih.gov
    Updated Feb 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Charl du Toit; Trienko Grobler; Danie Ludick (2024). A Comparison framework for deep learning RFI detection algorithms in radio astronomy [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_8275060
    Explore at:
    Dataset updated
    Feb 18, 2024
    Dataset provided by
    Student
    Co-supervisor
    Authors
    Charl du Toit; Trienko Grobler; Danie Ludick
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    These are the datasets used for the study titled: A Comparison framework for deep learning RFI detection algorithms in radio astronomy. These files are made publicly available as an additional resource to the submission of the author's Masters degree at Stellenbosch University. The detection is done in the field of radio astronomy. Each dataset consists of images/spectrograms/waterfall plots for baselines, and the corresponding binary mask for each image. The datasets can be used to train machine learning models, or for the case of this study, supervised fully convolutional neural networks.

    The LOFAR datasets consists of real observations and was slightly modified from https://zenodo.org/record/6724065. See this resource regarding the observational parameters used to retrieve the data from the LOFAR Long Term Archive.The HERA dataset consists of simulated observations generated with hera_sim (https://readthedocs.org/projects/hera-sim/). The 28 March dataset consists of accurate pixel-perfect binary masks for each image. The 20 July dataset is identical to the first, except the binary masks are generated with AOFlagger. All three datasets have a test set stored with pixel-perfected simulation masks (HERA) or expert hand labeled masks (LOFAR).

    The csv file contains the results of all trained models and and has fields for: model class, #filters, #FLOPS, #weights, preprocessing methods, train, validation and test accuracy scores as well as list of (threshold, FPR, TPR) values to generate receiver operating characteristic curves. See https://github.com/CharlDuToit/RFI-NLN to visualize the results, to train new models.

  17. f

    Data from: GHOST: Adjusting the Decision Threshold to Handle Imbalanced Data...

    • acs.figshare.com
    zip
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carmen Esposito; Gregory A. Landrum; Nadine Schneider; Nikolaus Stiefl; Sereina Riniker (2023). GHOST: Adjusting the Decision Threshold to Handle Imbalanced Data in Machine Learning [Dataset]. http://doi.org/10.1021/acs.jcim.1c00160.s002
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    ACS Publications
    Authors
    Carmen Esposito; Gregory A. Landrum; Nadine Schneider; Nikolaus Stiefl; Sereina Riniker
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Machine learning classifiers trained on class imbalanced data are prone to overpredict the majority class. This leads to a larger misclassification rate for the minority class, which in many real-world applications is the class of interest. For binary data, the classification threshold is set by default to 0.5 which, however, is often not ideal for imbalanced data. Adjusting the decision threshold is a good strategy to deal with the class imbalance problem. In this work, we present two different automated procedures for the selection of the optimal decision threshold for imbalanced classification. A major advantage of our procedures is that they do not require retraining of the machine learning models or resampling of the training data. The first approach is specific for random forest (RF), while the second approach, named GHOST, can be potentially applied to any machine learning classifier. We tested these procedures on 138 public drug discovery data sets containing structure–activity data for a variety of pharmaceutical targets. We show that both thresholding methods improve significantly the performance of RF. We tested the use of GHOST with four different classifiers in combination with two molecular descriptors, and we found that most classifiers benefit from threshold optimization. GHOST also outperformed other strategies, including random undersampling and conformal prediction. Finally, we show that our thresholding procedures can be effectively applied to real-world drug discovery projects, where the imbalance and characteristics of the data vary greatly between the training and test sets.

  18. Data from: S1 Dataset -

    • plos.figshare.com
    rar
    Updated Aug 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adel H. Elmetwalli; Asaad Derbala; Ibtisam Mohammed Alsudays; Eman A. Al-Shahari; Mahmoud Elhosary; Salah Elsayed; Laila A. Al-Shuraym; Farahat S. Moghanm; Osama Elsherbiny (2024). S1 Dataset - [Dataset]. http://doi.org/10.1371/journal.pone.0308826.s001
    Explore at:
    rarAvailable download formats
    Dataset updated
    Aug 26, 2024
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Adel H. Elmetwalli; Asaad Derbala; Ibtisam Mohammed Alsudays; Eman A. Al-Shahari; Mahmoud Elhosary; Salah Elsayed; Laila A. Al-Shuraym; Farahat S. Moghanm; Osama Elsherbiny
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Estimation of fruit quality parameters are usually based on destructive techniques which are tedious, costly and unreliable when dealing with huge amounts of fruits. Alternatively, non–destructive techniques such as image processing and spectral reflectance would be useful in rapid detection of fruit quality parameters. This research study aimed to assess the potential of image processing, spectral reflectance indices (SRIs), and machine learning models such as decision tree (DT) and random forest (RF) to qualitatively estimate characteristics of mandarin and tomato fruits at different ripening stages. Quality parameters such as chlorophyll a (Chl a), chlorophyll b (Chl b), total soluble solids (TSS), titratable acidity (TA), TSS/TA, carotenoids (car), lycopene and firmness were measured. The results showed that Red-Blue-Green (RGB) indices and newly developed SRIs demonstrated high efficiency for quantifying different fruit properties. For example, the R2 of the relationships between all RGB indices (RGBI) and measured parameters varied between 0.62 and 0.96 for mandarin and varied between 0.29 and 0.90 for tomato. The RGBI such as visible atmospheric resistant index (VARI) and normalized red (Rn) presented the highest R2 = 0.96 with car of mandarin fruits. While excess red vegetation index (ExR) presented the highest R2 = 0.84 with car of tomato fruits. The SRIs such as RSI 710,600, and R730,650 showed the greatest R2 values with respect to Chl a (R2 = 0.80) for mandarin fruits while the GI had the greatest R2 with Chl a (R2 = 0.68) for tomato fruits. Combining RGB and SRIs with DT and RF models would be a robust strategy for estimating eight observed variables associated with reasonable accuracy. Regarding mandarin fruits, in the task of predicting Chl a, the DT-2HV model delivered exceptional results, registering an R2 of 0.993 with an RMSE of 0.149 for the training set, and an R2 of 0.991 with an RMSE of 0.114 for the validation set. As well as for tomato fruits, the DT-5HV model demonstrated exemplary performance in the Chl a prediction, achieving an R2 of 0.905 and an RMSE of 0.077 for the training dataset, and an R2 of 0.785 with an RMSE of 0.077 for the validation dataset. The overall outcomes showed that the RGB, newly SRIs as well as DT and RF based RGBI, and SRIs could be used to evaluate the measured parameters of mandarin and tomato fruits.

  19. c

    RF Probability (version 0.1): Mapping Wetlands with High Resolution Planet...

    • figshare.canterbury.ac.nz
    tiff
    Updated Jul 28, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthew Wilson; Saif Khan (2025). RF Probability (version 0.1): Mapping Wetlands with High Resolution Planet SuperDove Satellite Imagery [Dataset]. http://doi.org/10.26021/canterburynz.29310392.v1
    Explore at:
    tiffAvailable download formats
    Dataset updated
    Jul 28, 2025
    Dataset provided by
    University of Canterbury Data Repository
    Authors
    Matthew Wilson; Saif Khan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    These datasets represent predictions and associated probabilities using four machine learning methods, associated with collection: Mapping Wetlands with High Resolution Planet SuperDove Satellite Imagery: An Assessment of Machine Learning Models Across the Diverse Waterscapes of New Zealand (10.26021/canterburynz.c.7848596).The following datasets are available:HGB predictionHGB probabilityMLPC predictionMLPC probabilityRandom forest predictionRandom forest probability [this dataset]XGBoost predictionXGBoost probabilityFor details of the models developed, please see the collection and associated paper. The following files are available in each dataset, each representing an area within New Zealand:xxxxx_mmm_prediction.tif: model prediction, encoded as 8-bit integers where 1 is predicted as wetland (>50% probability), and NA (no data) is non-wetland.xxxxx_mmm_probability.tif: model wetland probability, encoded as 16-bit integers, with probability values from 0 to 1 rescaled from 0 to 10,000. Divide the values by 10,000 to obtain probabilities to four decimal places.In the tile filenames, xxxxx refers to the UUID of the grid area, which can be found in the file nzgrid_uuid.gpkg, and mmm is a code which refers to the model used:hgb: histogram gradient boostmlpc: multi-layer perceptron classificationrf: random forestxgb: extreme gradient boostingIn addition to the tif images, two virtual raster tile files are included to enable mapping at the national scale:_mmm_prediction.vrt_mmm_probability.vrtAll tif images are saved using cloud optimised geotiff (COG), which makes them fast to display even at a national level, although increases the data size. Total size is around 700 MB for the prediction datasets, and ~75 GB for the probability datasets.Metadata for the Planet SuperDove imagery used for each pixel of the predictions is available here: https://doi.org/10.26021/canterburynz.29231837.v

  20. r

    RF-based UAV Detection and Identifification System

    • resodate.org
    • service.tib.eu
    Updated Dec 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Olusiji Medaiyese; Martins Ezuma; Adrian Lauf; Ismail Guvenc (2024). RF-based UAV Detection and Identifification System [Dataset]. https://resodate.org/resources/aHR0cHM6Ly9zZXJ2aWNlLnRpYi5ldS9sZG1zZXJ2aWNlL2RhdGFzZXQvcmYtYmFzZWQtdWF2LWRldGVjdGlvbi1hbmQtaWRlbnRpZmktY2F0aW9uLXN5c3RlbQ==
    Explore at:
    Dataset updated
    Dec 2, 2024
    Dataset provided by
    Leibniz Data Manager
    Authors
    Olusiji Medaiyese; Martins Ezuma; Adrian Lauf; Ismail Guvenc
    Description

    The dataset used in this paper for RF-based UAV detection and identification system using machine learning.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Halcy0nic (2025). Radio Frequency (RF) Signal Image Classification [Dataset]. https://www.kaggle.com/datasets/halcy0nic/radio-frequecy-rf-signal-image-classification
Organization logo

Radio Frequency (RF) Signal Image Classification

RF Signals Classification Dataset

Explore at:
zip(2599544168 bytes)Available download formats
Dataset updated
May 15, 2025
Authors
Halcy0nic
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

RF Signals Classification Dataset

This dataset contains images of various radio frequency (RF) signals captured on waterfall plots using a spectrum analyzer. It is designed to aid in the classification and identification of different types of RF signals commonly encountered in wireless communications and radio technologies.

Dataset Summary

The dataset comprises waterfall plot images of RF signals across 21 different classes, representing a wide range of communication protocols, technologies, and signal types. Each image in the dataset is a visual representation of a specific RF signal's frequency and time characteristics.

Supported Tasks

This dataset is primarily suited for:

  • Image Classification
  • Signal Identification
  • RF Signal Analysis
  • Machine Learning in Wireless Communications

Dataset Structure

Data Fields

  • image: A waterfall plot image of the RF signal
  • class: The label identifying the type of RF signal

Classes

The dataset includes the following 21 classes of RF signals:

  1. RS41-Radiosonde
  2. Radioteletype
  3. ADS-B
  4. AIS
  5. Automatic Picture Transmission
  6. Bluetooth
  7. Cellular
  8. Digital Audio Broadcasting
  9. Digital Speech Decoder
  10. FM
  11. LoRa
  12. Morse
  13. On-Off Keying
  14. Packet
  15. POCSAG
  16. Remote Keyless Entry
  17. SSTV
  18. WiFi
  19. Airband
  20. VOR
  21. Unknown

File Description

This dataset contains images of radio frequency (RF) signals captured as waterfall plots using a spectrum analyzer. The dataset is organized as follows:

- datasets
--- signal class
----- image

For example:

- datasets
--- bluetooth
----- c17afe0fe5cc3cc1308605cf390ecbb5.png

Dataset Creation

Source

The images in this dataset were captured using a spectrum analyzer, which visualizes RF signals as waterfall plots. These plots show the frequency content of a signal over time, with color representing signal strength.

Collection Process

RF signals were collected across various frequency bands using appropriate antennas and receivers. The spectrum analyzer was used to generate waterfall plots for each captured signal. Care was taken to ensure a diverse representation of signal types and conditions.

Known Limitations

  • Some classes may have more samples than others, potentially leading to class imbalance.
  • The dataset does not include raw signal data, only visual representations.
  • Most of the images were captured using SDRAngel, therefore any classifier will likely have a heavy bias towards other images captured using that platform.

Dataset Curators

Halcy0nic

Licensing Information

[MIT]

Contributions

We welcome contributions to improve this dataset.

Search
Clear search
Close search
Google apps
Main menu