27 datasets found
  1. m

    Data from: Large-Scale Curated Multivariate Time Series Anomaly Detection...

    • data.mendeley.com
    Updated Jul 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Veena More (2025). Large-Scale Curated Multivariate Time Series Anomaly Detection Dataset for Laptop Performance Metrics [Dataset]. http://doi.org/10.17632/97jn6xrs84.1
    Explore at:
    Dataset updated
    Jul 7, 2025
    Authors
    Veena More
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    High-quality multivariate time-series datasets are significantly less accessible compared to more common data types such as images or text, due to the resource-intensive process of continuous monitoring, precise annotation, and long-term observation. This paper introduces a cost-effective solution in the form of a large-scale, curated dataset specifically designed for anomaly detection in computing systems’ performance metrics. The dataset encompasses 45 GB of multivariate time-series data collected from 66 systems, capturing key performance indicators such as CPU usage, memory consumption, disk I/O, system load, and power consumption across diverse hardware configurations and real-world usage scenarios. Annotated anomalies, including performance degradation and resource inefficiencies, provide a reliable benchmark and ground truth for evaluating anomaly detection models. By addressing the accessibility challenges associated with time-series data, this resource facilitates advancements in machine learning applications, including anomaly detection, predictive maintenance, and system optimisation. Its comprehensive and practical design makes it a foundational asset for researchers and practitioners dedicated to developing reliable and efficient computing systems.

  2. u

    NCAR S-Pol radar time series data

    • data.ucar.edu
    archive
    Updated Aug 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NCAR/EOL S-Pol Team (2025). NCAR S-Pol radar time series data [Dataset]. http://doi.org/10.5065/D6FF3QP7
    Explore at:
    archiveAvailable download formats
    Dataset updated
    Aug 1, 2025
    Authors
    NCAR/EOL S-Pol Team
    Time period covered
    Feb 4, 2008 - Jun 28, 2008
    Area covered
    Description

    S-band Polarimetric (S-Pol) Radar radar processor time series data from the Terrain-Influenced Monsoon Rainfall Experiment (TIMREX). The files for this data set are large (up to 4 GB each) so please note the listed file sizes when ordering. There are four different file types: scanning, vertical, stationary, and solar.

  3. f

    BigDataAD Benchmark Dataset

    • figshare.com
    zip
    Updated Sep 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kingsley Pattinson (2023). BigDataAD Benchmark Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.24040563.v8
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 29, 2023
    Dataset provided by
    figshare
    Authors
    Kingsley Pattinson
    License

    https://www.apache.org/licenses/LICENSE-2.0.htmlhttps://www.apache.org/licenses/LICENSE-2.0.html

    Description

    The largest real-world dataset for multivariate time series anomaly detection (MTSAD) from the AIOps system of a Real-Time Data Warehouse (RTDW) from a top cloud computing company. All the metrics and labels in our dataset are derived from real-world scenarios. All metrics were obtained from the RTDW instance monitoring system and cover a rich variety of metric types, including CPU usage, queries per second (QPS) and latency, which are related to many important modules within RTDW AIOps Dataset. We obtain labels from the ticket system, which integrates three main sources of instance anomalies: user service requests, instance unavailability and fault simulations . User service requests refer to tickets that are submitted directly by users, whereas instance unavailability is typically detected through existing monitoring tools or discovered by Site Reliability Engineers (SREs). Since the system is usually very stable, we augment the anomaly samples by conducting fault simulations. Fault simulation refers to a special type of anomaly, planned beforehand, which is introduced to the system to test its performance under extreme conditions. All records in the ticket system are subject to follow-up processing by engineers, who meticulously mark the start and end times of each ticket. This rigorous approach ensures the accuracy of the labels in our dataset.

  4. w

    Global Soc Deep Learning Chip Market Research Report: By Deployment Model...

    • wiseguyreports.com
    Updated Aug 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    wWiseguy Research Consultants Pvt Ltd (2024). Global Soc Deep Learning Chip Market Research Report: By Deployment Model (Cloud-based, On-premises, Hybrid), By Application (Image and Video Analytics, Speech and Natural Language Processing, Computer Vision, Time Series Analysis), By Architecture (Central Processing Unit (CPU), Graphics Processing Unit (GPU), Field Programmable Gate Array (FPGA), Application Specific Integrated Circuit (ASIC), By Memory (High Bandwidth Memory (HBM), Graphics Double Data Rate (GDDR, Dynamic Random Access Memory (DRAM) and By Regional (North America, Europe, South America, Asia Pacific, Middle East and Africa) - Forecast to 2032. [Dataset]. https://www.wiseguyreports.com/reports/soc-deep-learning-chip-market
    Explore at:
    Dataset updated
    Aug 6, 2024
    Dataset authored and provided by
    wWiseguy Research Consultants Pvt Ltd
    License

    https://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy

    Time period covered
    Jan 8, 2024
    Area covered
    Global
    Description
    BASE YEAR2024
    HISTORICAL DATA2019 - 2024
    REPORT COVERAGERevenue Forecast, Competitive Landscape, Growth Factors, and Trends
    MARKET SIZE 202318.53(USD Billion)
    MARKET SIZE 202423.47(USD Billion)
    MARKET SIZE 2032155.5(USD Billion)
    SEGMENTS COVEREDDeployment Model ,Application ,Architecture ,Memory ,Regional
    COUNTRIES COVEREDNorth America, Europe, APAC, South America, MEA
    KEY MARKET DYNAMICSRising Artificial Intelligence AI Adoption Growing Demand for HighPerformance Computing Advancements in Machine Learning Algorithms Increasing Adoption of Cloud Computing Government Support for AI Research and Development
    MARKET FORECAST UNITSUSD Billion
    KEY COMPANIES PROFILEDMovidius ,Imagination Technologies ,Tensilica ,NVIDIA ,Xilinx ,Cadence Design Systems ,Synopsys ,NXP ,Google ,Analog Devices ,ARM ,Qualcomm ,CEVA ,Intel
    MARKET FORECAST PERIOD2025 - 2032
    KEY MARKET OPPORTUNITIESCloud and edge computing Artificial intelligence Automotive applications Healthcare and medical imaging Industrial automation
    COMPOUND ANNUAL GROWTH RATE (CAGR) 26.66% (2025 - 2032)
  5. Laptop Motherboard Health Monitoring Dataset

    • kaggle.com
    Updated Jun 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Md Zia (2024). Laptop Motherboard Health Monitoring Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/8717402
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 18, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Md Zia
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    The Laptop Motherboard Health Monitoring Dataset is a synthetically generated dataset designed to aid in the development and testing of machine learning models for predictive maintenance and health monitoring of laptop motherboards. The dataset includes various health metrics such as CPU usage, RAM usage, temperature, voltage, disk usage, and fan speed, along with a label indicating whether a problem was detected and the type of problem.

    Dataset Columns ModelName: The name and model of the laptop (e.g., Dell Inspiron 1234, HP Pavilion 5678). This column includes realistic combinations of popular laptop brands and model series, making the dataset relatable and practical.

    CPUUsage: The CPU usage percentage, ranging from 0 to 100%. This metric indicates how much of the CPU's capacity is being utilized.

    RAMUsage: The RAM usage percentage, ranging from 0 to 100%. This metric shows the proportion of RAM being used out of the total available.

    Temperature: The temperature of the motherboard in degrees Celsius, ranging from 20 to 100°C. This metric is crucial for detecting overheating issues.

    Voltage: The operating voltage in volts, ranging from 10 to 20V. Voltage measurements help in identifying power-related problems.

    DiskUsage: The disk usage percentage, ranging from 0 to 100%. This metric indicates how much of the disk's capacity is being used.

    FanSpeed: The speed of the cooling fan in revolutions per minute (RPM), ranging from 1000 to 5000 RPM. Fan speed is an important indicator of cooling performance.

    ProblemDetected: The type of problem detected, if any. Possible values are:

    No Problem Overheating Power Issue Memory Leak Disk Failure Usage This dataset can be used to train and evaluate machine learning models for the purpose of predictive maintenance. Researchers and practitioners can use the data to classify the type of problem based on the health metrics provided. The dataset is ideal for experimenting with various classification algorithms and techniques in the field of hardware health monitoring.

    File Laptop_Motherboard_Health_Monitoring_Dataset.csv: The main dataset file containing 2000 rows of synthetic data. Acknowledgements This dataset is synthetically generated and does not represent real-world data. It is intended for educational and research purposes only.

  6. f

    Average CPU time (s) of all the referenced algorithm on benchmark function.

    • figshare.com
    xls
    Updated Jun 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aihua Guo (2023). Average CPU time (s) of all the referenced algorithm on benchmark function. [Dataset]. http://doi.org/10.1371/journal.pone.0272624.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 16, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Aihua Guo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Average CPU time (s) of all the referenced algorithm on benchmark function.

  7. d

    Data from: Approaches in highly parameterized inversion: TSPROC, a general...

    • datadiscoverystudio.org
    html, pdf
    Updated Jan 16, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2017). Approaches in highly parameterized inversion: TSPROC, a general time-series processor to assist in model calibration and result summarization [Dataset]. http://datadiscoverystudio.org/geoportal/rest/metadata/item/ba629376d4e04ab3901200a97be16b60/html
    Explore at:
    pdf, htmlAvailable download formats
    Dataset updated
    Jan 16, 2017
    Description

    Link to the ScienceBase Item Summary page for the item described by this metadata record. Service Protocol: Link to the ScienceBase Item Summary page for the item described by this metadata record. Application Profile: Web Browser. Link Function: information

  8. U

    Replication data for "Lightweight Behavior-Based Malware Detection"

    • dataverse.unimi.it
    Updated Nov 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nicola Bena; Nicola Bena; Marco Anisetti; Marco Anisetti; Claudio A. Ardagna; Claudio A. Ardagna; Gabriele Gianini; Gabriele Gianini; Vincenzo Giandomenico; Vincenzo Giandomenico (2024). Replication data for "Lightweight Behavior-Based Malware Detection" [Dataset]. http://doi.org/10.13130/RD_UNIMI/LJ6Z8V
    Explore at:
    tsv(118946), zip(52781371), bin(27523), tsv(10289), tsv(119218), text/x-python(1436), text/x-python(1147), tsv(112040), bin(156998), text/markdown(13542), txt(240000), application/x-ipynb+json(11533), txt(2018), application/x-ipynb+json(137862), zip(4469422), bin(1228541), tsv(112144), tsv(119217), application/x-ipynb+json(8672), bin(36712), application/x-ipynb+json(121867), bin(55), application/x-ipynb+json(1736968), bin(4245), txt(1694), text/x-python(1126), zip(3251335), tsv(10111), text/x-python(1339), tsv(119113)Available download formats
    Dataset updated
    Nov 3, 2024
    Dataset provided by
    UNIMI Dataverse
    Authors
    Nicola Bena; Nicola Bena; Marco Anisetti; Marco Anisetti; Claudio A. Ardagna; Claudio A. Ardagna; Gabriele Gianini; Gabriele Gianini; Vincenzo Giandomenico; Vincenzo Giandomenico
    License

    https://dataverse.unimi.it/api/datasets/:persistentId/versions/2.1/customlicense?persistentId=doi:10.13130/RD_UNIMI/LJ6Z8Vhttps://dataverse.unimi.it/api/datasets/:persistentId/versions/2.1/customlicense?persistentId=doi:10.13130/RD_UNIMI/LJ6Z8V

    Description

    Dataset containing real-world and synthetic samples on legit and malware samples in the form of time series. The samples consider machine-level performance metrics: CPU usage, RAM usage, number of bytes read and written from and to disk and network. Synthetic samples are generated using a GAN.

  9. h

    anomaly_detection_metrics_data

    • huggingface.co
    Updated Jul 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shreyas Patil (2023). anomaly_detection_metrics_data [Dataset]. https://huggingface.co/datasets/ShreyasP123/anomaly_detection_metrics_data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 20, 2023
    Authors
    Shreyas Patil
    Description

    Dataset Card: Anomaly Detection Metrics Data

      Dataset Summary
    

    This dataset contains system performance metrics collected over time for anomaly detection in time series data. It includes multiple system metrics such as CPU load, memory usage, and other resource utilization statistics, along with timestamps and additional attributes.

      Dataset Details
    

    Size: ~7.3 MB (raw JSON), 345 kB (auto-converted Parquet) Rows: 46,669 Format: JSON Libraries: datasets, pandas… See the full description on the dataset page: https://huggingface.co/datasets/ShreyasP123/anomaly_detection_metrics_data.

  10. Z

    Data for "Thermal transport of glasses via machine learning driven...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Feb 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Grasselli, Federico (2024). Data for "Thermal transport of glasses via machine learning driven simulations" [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10225315
    Explore at:
    Dataset updated
    Feb 9, 2024
    Dataset provided by
    Pegolo, Paolo
    Grasselli, Federico
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains input and analysis scripts supporting the findings of Thermal transport of glasses via machine learning driven simulations, by P. Pegolo and F. Grasselli. Content:

    README.md: this file, information about the repository SiO2: vitreous silica parent folder

    NEP: folder with datasets and input scripts for NEP training

    train.xyz: training dataset test.xyz: validation dataset nep.in: NEP input script nep.txt: NEP model nep.restart: NEP restart file DP: folder with datasets and input scripts for DP training

    input.json: DeePMD training input dataset: DeePMD training dataset validation: DeePMD validation dataset frozen_model.pb: DP model GKMD: scripts for the GKMD simulations Tersoff: Tersoff reference simulation

    model.xyz: initial configuration run.in: GPUMD script SiO2.gpumd.tersoff88: Tersoff model parameters convert_movie_to_dump.py: script to convert GPUMD XYZ trajectory to LAMMPS format for re-running the trajectory with the MLPs DP: DP simulation

    init.data: LAMMPS initial configuration in.lmp: LAMMPS input to re-run the Tersoff trajectory with the DP NEP: NEP simulation

    init.data: LAMMPS initial configuration in.lmp: LAMMPS input to re-run the Tersoff trajectory with the NEP. Note that this needs the NEP-CPU user package installed in LAMMPS. At the moment it is not possible to re-run a trajectory with GPUMD. QHGK: scripts for the QHGK simulations

    DP: DP data

    second.npy: second-order interatomic force constants third.npy: third-order interatomic force constants replicated_atoms.xyz: configuration dynmat: scripts to compute interatomic force constants with the DP model. Analogous scripts were used also to compute IFCs with the other potentials.

    initial.data: non optimized configuration in.dynmat.lmp: LAMMPS script to minimize the structure and compute second-order interatomic force constants in.third.lmp: LAMMPS script to compute third-order interatomic force constants Tersoff: Tersoff data

    second.npy: second-order interatomic force constants third.npy: third-order interatomic force constants replicated_atoms.xyz: configuration NEP: NEP data

    second.npy: second-order interatomic force constants third.npy: third-order interatomic force constants replicated_atoms.xyz: configuration qhgk.py: script to compute QHGK lifetimes and thermal conductivity Si: vitreous silicon parent folder

    QHGK: scripts for the QHGK simulations

    qhgk.py: script to compute QHGK lifetimes [N]: folder with the calculations on a N-atoms system

    second.npy: second-order interatomic force constants third.npy: third-order interatomic force constants replicated_atoms.xyz: configuration LiSi: vitreous litihum-intercalated silicon parent folder

    NEP: folder with datasets and input scripts for NEP training

    train.xyz: training dataset test.xyz: validation dataset nep.in: NEP input script nep.txt: NEP model nep.restart: NEP restart file EMD: folder with data on the equilibrium molecular dynamics simulations

    70k: data of the simulations with ~70k atoms

    1-45: folder with input scripts for the simulations at different Li concentration

    fraction.dat: Li fraction, y, as in Li_{y}Si quench: scripts for the melt-quench-anneal sample preparation

    model.xyz: initial configuration restart.xyz: final configuration run.in: GPUMD input gk: scripts for the GKMD simulation

    model.xyz: initial configuration restart.xyz: final configuration run.in: GPUMD input cepstral: folder for cepstral analysis

    analyze.py: python script for cepstral analysis of the fluxes' time-series generated by the GKMD runs

  11. Data from: Transition and Drivers of Elastic to Inelastic Deformation in the...

    • zenodo.org
    pdf, zip
    Updated Jul 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sayyed Mohammad Javad Mirzadeh; Sayyed Mohammad Javad Mirzadeh; Shuanggen Jin; Shuanggen Jin; Estelle Chaussard; Estelle Chaussard; Roland Bürgmann; Roland Bürgmann; Abolfazl Rezaei; Abolfazl Rezaei; Saba Ghotbi; Saba Ghotbi; Andreas Braun; Andreas Braun (2024). Transition and Drivers of Elastic to Inelastic Deformation in the Abarkuh Plain from InSAR Multi-Sensor Time Series and Hydrogeological Data [Dataset]. http://doi.org/10.5281/zenodo.7786511
    Explore at:
    pdf, zipAvailable download formats
    Dataset updated
    Jul 12, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Sayyed Mohammad Javad Mirzadeh; Sayyed Mohammad Javad Mirzadeh; Shuanggen Jin; Shuanggen Jin; Estelle Chaussard; Estelle Chaussard; Roland Bürgmann; Roland Bürgmann; Abolfazl Rezaei; Abolfazl Rezaei; Saba Ghotbi; Saba Ghotbi; Andreas Braun; Andreas Braun
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Abarkuh
    Description

    This repository contains the datasets used in Mirzadeh et al., 2023. It includes three InSAR time-series datasets from the Envisat descending orbit, ALOS-1 ascending orbit, and Sentinel-1A in ascending and descending orbits, acquired over the Abarkuh Plain, Iran, as well as the geological map of the study area and the GNSS and hydrogeological data used in this research.

    Dataset 1: Envisat descending track 292

    • Date: 06 Oct 2003 - 05 Sep 2005 (12 acquisitions)
    • Processor: ISCE/stripmapStack + MintPy
    • Displacement time-series (in HDF-EOS5 format): timeseries_LOD_tropHgt_ramp_demErr.h5
    • Mean LOS Velocity (in HDF-EOS5 format): velocity.h5
    • Mask Temporal Coherence (in HDF-EOS5 format): maskTempCoh.h5
    • Geometry (in HDF-EOS5 format): geometryRadar.h5

    Dataset 2: ALOS-1 ascending track 569

    • Date: 06 Dec 2006 - 17 Dec 2010 (14 acquisitions)
    • Processor: ISCE/stripmapStack + MintPy
    • Displacement time-series (in HDF-EOS5 format): timeseries_ERA5_ramp_demErr.h5
    • Mean LOS Velocity (in HDF-EOS5 format): velocity.h5
    • Mask Temporal Coherence (in HDF-EOS5 format): maskTempCoh.h5
    • Geometry (in HDF-EOS5 format): geometryRadar.h5

    Dataset 2: Sentinel-1 ascending track 130 and descending track 137

    • Date: 14 Oct 2014 - 28 Mar 2020 (129 ascending acquisitions) + 27 Oct 2014 - 29 Mar 2020 (114 descending acquisitions)
    • Processor: ISCE/topsStack + MintPy
    • Displacement time-series (in HDF-EOS5 format): timeseries_ERA5_ramp_demErr.h5
    • Mean LOS Velocity (in HDF-EOS5 format): velocity.h5
    • Mask Temporal Coherence (in HDF-EOS5 format): maskTempCoh.h5
    • Geometry (in HDF-EOS5 format): geometryRadar.h5

    The time series and Mean LOS Velocity (MVL) products can be georeferenced and resampled using the makTempCoh and geometryRadar products and the MintPy commands/functions.

  12. VM Stats for Machine Learning/Data Science

    • kaggle.com
    Updated Jun 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rahadyan Rizqy A. (2025). VM Stats for Machine Learning/Data Science [Dataset]. https://www.kaggle.com/datasets/rahadyanrizqy/4-vms-stats-for-machine-learningdata-science/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 24, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Rahadyan Rizqy A.
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains system metrics collected from 4 virtual machines (VMs) with identical hardware specifications, tested under a load-balanced Proxmox Virtual Environment using HAProxy.

    ⚙️ System Configuration: - Number of VMs: 4 (laos-1, laos-2, laos-3, laos-4) - VM Specifications: Identical CPU, Memory, and Network configurations - Application Stack: Each VM runs the same Laravel-based web application served via NGINX - Load Balancer: HAProxy, using the Least Connection algorithm - Test Scenario: 10-50 Virtual Users (VU) running continuously for 30 minutes, 9 GET Request, 3 POST Request (k6)

    📈 Data Contents: Each row represents a real-time snapshot of the VM usage during the test, with data collected once per second. - fetch: The n-th fetch cycle (1 fetch = 1 data collection from all 4 VMs) - update: Indicates whether there was a new score computed between fetch cycles (1 = updated, 0 = no change) - vm_id: Internal identifier of the VM - vm_name: Hostname of the VM - cpu_usage: CPU usage in percent (0–100) - max_cpu: Maximum CPU capacity (set to 1.0 for normalization in this test environment) - mem_usage: Memory usage in percent (0–100) - max_mem: Maximum memory set at 1GB - cum_netin: Cumulative incoming network traffic (RX) in bytes - cum_netout: Cumulative outgoing network traffic (TX) in bytes - rate_netin: Rate of incoming traffic (difference between current and previous RX) - rate_netout: Rate of outgoing traffic (difference between current and previous TX) - bw_usage: Total bandwidth usage, computed as rate_netin + rate_netout - max_bw: Maximum network interface bandwidth, set to 12,500,000 bytes per second - score: Composite score calculated as CPU% + Memory% + Bandwidth%, where a lower score indicates a better (less loaded) VM - priority: Ranking based on ascending score (1 = best VM for routing) - unix_timestamp: Unix timestamp equivalent of timestamp - timestamp: Human-readable timestamp

    🧪 This dataset may be suitable for: - Analyzing the effectiveness of load balancing strategies - Visualization of load distribution among VMs - Time series forecasting of VM workload - Building ML models for anomaly detection or auto-scaling policies

  13. f

    Project and allocation data from XDCDB.

    • plos.figshare.com
    xls
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Richard Knepper; Katy Börner (2023). Project and allocation data from XDCDB. [Dataset]. http://doi.org/10.1371/journal.pone.0157628.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Richard Knepper; Katy Börner
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Project and allocation data from XDCDB.

  14. d

    Autonomous Underwater Vehicle Monterey Bay Time Series - CTD from AUV Makai...

    • search.dataone.org
    • bco-dmo.org
    • +1more
    Updated Mar 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr Chris Scholin (2025). Autonomous Underwater Vehicle Monterey Bay Time Series - CTD from AUV Makai on 2016-02-03 [Dataset]. http://doi.org/10.26008/1912/bco-dmo.644012.1
    Explore at:
    Dataset updated
    Mar 9, 2025
    Dataset provided by
    Biological and Chemical Oceanography Data Management Office (BCO-DMO)
    Authors
    Dr Chris Scholin
    Time period covered
    Feb 3, 2016
    Area covered
    Description

    No description is available. Visit https://dataone.org/datasets/sha256%3A8b9b600f61bbd7d944013b78645ce2bb2494d735129ab86e43ba55f51657d613 for complete metadata about this dataset.

  15. c

    Research data supporting "21st century progress in computing".

    • repository.cam.ac.uk
    zip
    Updated Mar 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Coyle, Diane; Hampton, Lucy (2025). Research data supporting "21st century progress in computing". [Dataset]. http://doi.org/10.17863/CAM.113404
    Explore at:
    zip(638529 bytes)Available download formats
    Dataset updated
    Mar 31, 2025
    Dataset provided by
    Apollo
    University of Cambridge
    Authors
    Coyle, Diane; Hampton, Lucy
    Description

    CPU and GPU time series of cost of computing, also time series of cost of cloud computing in UK. The detailed descriptions of the series are available in the associated paper. AI models miss disease in Black and female patients

  16. f

    Main dataset content.

    • plos.figshare.com
    xls
    Updated Dec 19, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohammed Hlayel; Hairulnizam Mahdin; Mohammad Hayajneh; Saleh H. AlDaajeh; Siti Salwani Yaacob; Mazidah Mat Rejab (2024). Main dataset content. [Dataset]. http://doi.org/10.1371/journal.pone.0314691.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Dec 19, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Mohammed Hlayel; Hairulnizam Mahdin; Mohammad Hayajneh; Saleh H. AlDaajeh; Siti Salwani Yaacob; Mazidah Mat Rejab
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The rapid development of Digital Twin (DT) technology has underlined challenges in resource-constrained mobile devices, especially in the application of extended realities (XR), which includes Augmented Reality (AR) and Virtual Reality (VR). These challenges lead to computational inefficiencies that negatively impact user experience when dealing with sizeable 3D model assets. This article applies multiple lossless compression algorithms to improve the efficiency of digital twin asset delivery in Unity’s AssetBundle and Addressable asset management frameworks. In this study, an optimal model will be obtained that reduces both bundle size and time required in visualization, simultaneously reducing CPU and RAM usage on mobile devices. This study has assessed compression methods, such as LZ4, LZMA, Brotli, Fast LZ, and 7-Zip, among others, for their influence on AR performance. This study also creates mathematical models for predicting resource utilization, like RAM and CPU time, required by AR mobile applications. Experimental results show a detailed comparison among these compression algorithms, which can give insights and help choose the best method according to the compression ratio, decompression speed, and resource usage. It finally leads to more efficient implementations of AR digital twins on resource-constrained mobile platforms with greater flexibility in development and a better end-user experience. Our results show that LZ4 and Fast LZ perform best in speed and resource efficiency, especially with RAM caching. At the same time, 7-Zip/LZMA achieves the highest compression ratios at the cost of slower loading. Brotli emerged as a strong option for web-based AR/VR content, striking a balance between compression efficiency and decompression speed, outperforming Gzip in WebGL contexts. The Addressable Asset system with LZ4 offers the most efficient balance for real-time AR applications. This study will deliver practical guidance on optimal compression method selection to improve user experience and scalability for AR digital twin implementations.

  17. f

    CPU hours, institutions, and PI's by year.

    • plos.figshare.com
    xls
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Richard Knepper; Katy Börner (2023). CPU hours, institutions, and PI's by year. [Dataset]. http://doi.org/10.1371/journal.pone.0157628.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Richard Knepper; Katy Börner
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    CPU hours, institutions, and PI's by year.

  18. f

    Summary of mono behaviour of loading methods.

    • plos.figshare.com
    xls
    Updated Dec 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohammed Hlayel; Hairulnizam Mahdin; Mohammad Hayajneh; Saleh H. AlDaajeh; Siti Salwani Yaacob; Mazidah Mat Rejab (2024). Summary of mono behaviour of loading methods. [Dataset]. http://doi.org/10.1371/journal.pone.0314691.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Dec 19, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Mohammed Hlayel; Hairulnizam Mahdin; Mohammad Hayajneh; Saleh H. AlDaajeh; Siti Salwani Yaacob; Mazidah Mat Rejab
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The rapid development of Digital Twin (DT) technology has underlined challenges in resource-constrained mobile devices, especially in the application of extended realities (XR), which includes Augmented Reality (AR) and Virtual Reality (VR). These challenges lead to computational inefficiencies that negatively impact user experience when dealing with sizeable 3D model assets. This article applies multiple lossless compression algorithms to improve the efficiency of digital twin asset delivery in Unity’s AssetBundle and Addressable asset management frameworks. In this study, an optimal model will be obtained that reduces both bundle size and time required in visualization, simultaneously reducing CPU and RAM usage on mobile devices. This study has assessed compression methods, such as LZ4, LZMA, Brotli, Fast LZ, and 7-Zip, among others, for their influence on AR performance. This study also creates mathematical models for predicting resource utilization, like RAM and CPU time, required by AR mobile applications. Experimental results show a detailed comparison among these compression algorithms, which can give insights and help choose the best method according to the compression ratio, decompression speed, and resource usage. It finally leads to more efficient implementations of AR digital twins on resource-constrained mobile platforms with greater flexibility in development and a better end-user experience. Our results show that LZ4 and Fast LZ perform best in speed and resource efficiency, especially with RAM caching. At the same time, 7-Zip/LZMA achieves the highest compression ratios at the cost of slower loading. Brotli emerged as a strong option for web-based AR/VR content, striking a balance between compression efficiency and decompression speed, outperforming Gzip in WebGL contexts. The Addressable Asset system with LZ4 offers the most efficient balance for real-time AR applications. This study will deliver practical guidance on optimal compression method selection to improve user experience and scalability for AR digital twin implementations.

  19. f

    Summary of multiple regression for decompression time prediction: Effects of...

    • plos.figshare.com
    xls
    Updated Dec 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohammed Hlayel; Hairulnizam Mahdin; Mohammad Hayajneh; Saleh H. AlDaajeh; Siti Salwani Yaacob; Mazidah Mat Rejab (2024). Summary of multiple regression for decompression time prediction: Effects of vertex count and video size. [Dataset]. http://doi.org/10.1371/journal.pone.0314691.t008
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Dec 19, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Mohammed Hlayel; Hairulnizam Mahdin; Mohammad Hayajneh; Saleh H. AlDaajeh; Siti Salwani Yaacob; Mazidah Mat Rejab
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Summary of multiple regression for decompression time prediction: Effects of vertex count and video size.

  20. f

    Allocation plan of CPU.

    • plos.figshare.com
    xls
    Updated Jun 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hailan Ran (2023). Allocation plan of CPU. [Dataset]. http://doi.org/10.1371/journal.pone.0259284.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 8, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Hailan Ran
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Allocation plan of CPU.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Veena More (2025). Large-Scale Curated Multivariate Time Series Anomaly Detection Dataset for Laptop Performance Metrics [Dataset]. http://doi.org/10.17632/97jn6xrs84.1

Data from: Large-Scale Curated Multivariate Time Series Anomaly Detection Dataset for Laptop Performance Metrics

Related Article
Explore at:
Dataset updated
Jul 7, 2025
Authors
Veena More
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

High-quality multivariate time-series datasets are significantly less accessible compared to more common data types such as images or text, due to the resource-intensive process of continuous monitoring, precise annotation, and long-term observation. This paper introduces a cost-effective solution in the form of a large-scale, curated dataset specifically designed for anomaly detection in computing systems’ performance metrics. The dataset encompasses 45 GB of multivariate time-series data collected from 66 systems, capturing key performance indicators such as CPU usage, memory consumption, disk I/O, system load, and power consumption across diverse hardware configurations and real-world usage scenarios. Annotated anomalies, including performance degradation and resource inefficiencies, provide a reliable benchmark and ground truth for evaluating anomaly detection models. By addressing the accessibility challenges associated with time-series data, this resource facilitates advancements in machine learning applications, including anomaly detection, predictive maintenance, and system optimisation. Its comprehensive and practical design makes it a foundational asset for researchers and practitioners dedicated to developing reliable and efficient computing systems.

Search
Clear search
Close search
Google apps
Main menu