5 datasets found
  1. o

    Nominal and adversarial synthetic PMU data for standard IEEE test systems

    • osti.gov
    Updated Jun 15, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pacific Northwest National Laboratory 2 (2021). Nominal and adversarial synthetic PMU data for standard IEEE test systems [Dataset]. http://doi.org/10.25584/DataHub/1788186
    Explore at:
    Dataset updated
    Jun 15, 2021
    Dataset provided by
    Pacific Northwest National Laboratory 2
    US
    PNNL
    Description

    GridSTAGE (Spatio-Temporal Adversarial scenario GEneration) is a framework for the simulation of adversarial scenarios and the generation of multivariate spatio-temporal data in cyber-physical systems. GridSTAGE is developed based on Matlab and leverages Power System Toolbox (PST) where the evolution of the power network is governed by nonlinear differential equations. Using GridSTAGE, one can create several event scenarios that correspond to several operating states of the power network by enabling or disabling any of the following: faults, AGC control, PSS control, exciter control, load changes, generation changes, and different types of cyber-attacks. Standard IEEE bus system data is used to define the power system environment. GridSTAGE emulates the data from PMU and SCADA sensors. The rate of frequency and location of the sensors can be adjusted as well. Detailed instructions on generating data scenarios with different system topologies, attack characteristics, load characteristics, sensor configuration, control parameters are available in the Github repository - https://github.com/pnnl/GridSTAGE. There is no existing adversarial data-generation framework that can incorporate several attack characteristics and yield adversarial PMU data. The GridSTAGE framework currently supports simulation of False Data Injection attacks (such as a ramp, step, random, trapezoidal, multiplicative, replay, freezing) and Denial of Service attacks (such as time-delay, packet-loss) on PMU data. Furthermore, it supports generating spatio-temporal time-series data corresponding to several random load changes across the network or corresponding to several generation changes. A Koopman mode decomposition (KMD) based algorithm to detect and identify the false data attacks in real-time is proposed in https://ieeexplore.ieee.org/document/9303022. Machine learning-based predictive models are developed to capture the dynamics of the underlying power system with a high level of accuracy under various operating conditions for IEEE 68 bus system. The corresponding machine learning models are available at https://github.com/pnnl/grid_prediction.

  2. Z

    Peatland Decomposition Database (1.1.0)

    • data.niaid.nih.gov
    Updated Mar 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Teickner, Henning; Knorr, Klaus-Holger (2025). Peatland Decomposition Database (1.1.0) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_11276064
    Explore at:
    Dataset updated
    Mar 5, 2025
    Dataset provided by
    University of Münster
    Authors
    Teickner, Henning; Knorr, Klaus-Holger
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    1 Introduction

    The Peatland Decomposition Database (PDD) stores data from published litterbag experiments related to peatlands. Currently, the database focuses on northern peatlands and Sphagnum litter and peat, but it also contains data from some vascular plant litterbag experiments. Currently, the database contains entries from 34 studies, 2,160 litterbag experiments, and 7,297 individual samples with 117,841 measurements for various attributes (e.g. relative mass remaining, N content, holocellulose content, mesh size). The aim is to provide a harmonized data source that can be useful to re-analyse existing data and to plan future litterbag experiments.

    The Peatland Productivity and Decomposition Parameter Database (PPDPD) (Bona et al. 2018) is similar to the Peatland Decomposition Database (PDD) in that both contain data from peatland litterbag experiments. The differences are that both databases partly contain different data, that PPDPD additionally contains information on vegetation productivity, which PDD does not, and that PDD provides more information and metadata on litterbag experiments, and also measurement errors.

    2 Updates

    Compared to version 1.0.0, this version has a new structure for table experimental_design_format, contains additional metadata on the experimental design (these were omitted in version 1.0.0), and contains the scripts that were used to import the data into the database.

    3 Methods

    3.1 Data collection

    Data for the database was collected from published litterbag studies, by extracting published data from figures, tables, or other data sources, and by contacting the authors of the studies to obtain raw data. All data processing was done with R (R version 4.2.0 (2022-04-22)) (R Core Team 2022).

    Studies were identified via a Scopus search with search string (TITLE-ABS-KEY ( peat* AND ( "litter bag" OR "decomposition rate" OR "decay rate" OR "mass loss")) AND NOT ("tropic*")) (2022-12-17). These studies were further screened to exclude those which do not contain litterbag data or which recycle data from other studies that have already been considered. Additional studies with litterbag experiments in northern peatlands we were aware of, but which were not identified in the literature search were added to the list of publications. For studies not older than 10 years, authors were contacted to obtain raw data, however this was successful only in few cases. To date, the database focuses on Sphagnum litterbag experiments and not from all studies that were identified by the literature search data have been included yet in the database.

    Data from figures were extracted using the package ‘metaDigitise’ (1.0.1) (Pick, Nakagawa, and Noble 2018). Data from tables were extracted manually.

    Data from the following studies are currently included: Farrish and Grigal (1985), Bartsch and Moore (1985), Farrish and Grigal (1988), Vitt (1990), Hogg, Lieffers, and Wein (1992), Sanger, Billett, and Cresser (1994), Hiroki and Watanabe (1996), Szumigalski and Bayley (1996), Prevost, Belleau, and Plamondon (1997), Arp, Cooper, and Stednick (1999), Robbert A. Scheffer and Aerts (2000), R. A. Scheffer, Van Logtestijn, and Verhoeven (2001), Limpens and Berendse (2003), Waddington, Rochefort, and Campeau (2003), Asada, Warner, and Banner (2004), Thormann, Bayley, and Currah (2001), Trinder, Johnson, and Artz (2008), Breeuwer et al. (2008), Trinder, Johnson, and Artz (2009), Bragazza and Iacumin (2009), Hoorens, Stroetenga, and Aerts (2010), Straková et al. (2010), Straková et al. (2012), Orwin and Ostle (2012), Lieffers (1988), Manninen et al. (2016), Johnson and Damman (1991), Bengtsson, Rydin, and Hájek (2018a), Bengtsson, Rydin, and Hájek (2018b), Asada and Warner (2005), Bengtsson, Granath, and Rydin (2017), Bengtsson, Granath, and Rydin (2016), Hagemann and Moroni (2015), Hagemann and Moroni (2016), B. Piatkowski et al. (2021), B. T. Piatkowski et al. (2021), Mäkilä et al. (2018), Golovatskaya and Nikonova (2017), Golovatskaya and Nikonova (2017).

    4 Database records

    The database is a ‘MariaDB’ database and the database schema was designed to store data and metadata following the Ecological Metadata Language (EML) (Jones et al. 2019). Descriptions of the tables are shown in Tab. 1.

    The database contains general metadata relevant for litterbag experiments (e.g., geographical, temporal, and taxonomic coverage, mesh sizes, experimental design). However, it does not contain a detailed description of sample handling, sample preprocessing methods, site descriptions, because there currently are no discipline-specific metadata and reporting standards. Table 1: Description of the individual tables in the database.

    Name Description

    attributes Defines the attributes of the database and the values in column attribute_name in table data.

    citations Stores bibtex entries for references and data sources.

    citations_to_datasets Links entries in table citations with entries in table datasets.

    custom_units Stores custom units.

    data Stores measured values for samples, for example remaining masses.

    datasets Lists the individual datasets.

    experimental_design_format Stores information on the experimental design of litterbag experiments.

    measurement_scales, measurement_scales_date_time, measurement_scales_interval, measurement_scales_nominal, measurement_scales_ordinal, measurement_scales_ratio Defines data value types.

    missing_value_codes Defines how missing values are encoded.

    samples Stores information on individual samples.

    samples_to_samples Links samples to other samples, for example litter samples collected in the field to litter samples collected during the incubation of the litterbags.

    units, unit_types Stores information on measurement units.

    5 Attributes Table 2: Definition of attributes in the Peatland Decomposition Database and entries in the column attribute_name in table data.

    Name Definition Example value Unit Measurement scale Number type Minimum value Maximum value String format

    4_hydroxyacetophenone_mass_absolute A numeric value representing the content of 4-hydroxyacetophenone, as described in Straková et al. (2010). 0.26 g ratio real 0 Inf NA

    4_hydroxyacetophenone_mass_relative_mass A numeric value representing the content of 4-hydroxyacetophenone, as described in Straková et al. (2010). 0.26 g/g ratio real 0 1 NA

    4_hydroxybenzaldehyde_mass_absolute A numeric value representing the content of 4-hydroxybenzaldehyde, as described in Straková et al. (2010). 0.26 g ratio real 0 Inf NA

    4_hydroxybenzaldehyde_mass_relative_mass A numeric value representing the content of 4-hydroxybenzaldehyde, as described in Straková et al. (2010). 0.26 g/g ratio real 0 1 NA

    4_hydroxybenzoic_acid_mass_absolute A numeric value representing the content of 4-hydroxybenzoic acid, as described in Straková et al. (2010). 0.26 g ratio real 0 Inf NA

    4_hydroxybenzoic_acid_mass_relative_mass A numeric value representing the content of 4-hydroxybenzoic acid, as described in Straková et al. (2010). 0.26 g/g ratio real 0 1 NA

    abbreviation In table custom_units: A string representing an abbreviation for the custom unit. gC NA nominal NA NA NA NA

    acetone_extractives_mass_absolute A numeric value representing the content of acetone extractives, as described in Straková et al. (2010). 0.26 g ratio real 0 Inf NA

    acetone_extractives_mass_relative_mass A numeric value representing the content of acetone extractives, as described in Straková et al. (2010). 0.26 g/g ratio real 0 1 NA

    acetosyringone_mass_absolute A numeric value representing the content of acetosyringone, as described in Straková et al. (2010). 0.26 g ratio real 0 Inf NA

    acetosyringone_mass_relative_mass A numeric value representing the content of acetosyringone, as described in Straková et al. (2010). 0.26 g/g ratio real 0 1 NA

    acetovanillone_mass_absolute A numeric value representing the content of acetovanillone, as described in Straková et al. (2010). 0.26 g ratio real 0 Inf NA

    acetovanillone_mass_relative_mass A numeric value representing the content of acetovanillone, as described in Straková et al. (2010). 0.26 g/g ratio real 0 1 NA

    arabinose_mass_absolute A numeric value representing the content of arabinose, as described in Straková et al. (2010). 0.26 g ratio real 0 Inf NA

    arabinose_mass_relative_mass A numeric value representing the content of arabinose, as described in Straková et al. (2010). 0.26 g/g ratio real 0 1 NA

    ash_mass_absolute A numeric value representing the content of ash (after burning at 550°C). 4 g ratio real 0 Inf NA

    ash_mass_relative_mass A numeric value representing the content of ash (after burning at 550°C). 0.05 g/g ratio real 0 Inf NA

    attribute_definition A free text field with a textual description of the meaning of attributes in the dpeatdecomposition database. NA NA nominal NA NA NA NA

    attribute_name A string describing the names of the attributes in all tables of the dpeatdecomposition database. attribute_name NA nominal NA NA NA NA

    bibtex A string representing the bibtex code used for a literature reference throughout the dpeatdecomposition database. Galka.2021 NA nominal NA NA NA NA

    bounds_maximum A numeric value representing the minimum possible value for a numeric attribute. 0 NA interval real Inf Inf NA

    bounds_minimum A numeric value representing the maximum possible value for a numeric attribute. INF NA interval real Inf Inf NA

    bulk_density A numeric value representing the bulk density of the sample [g cm-3]. 0,2 g/cm^3 ratio real 0 Inf NA

    C_absolute The absolute mass of C in the sample. 1 g ratio real 0 Inf NA

    C_relative_mass The absolute mass of C in the sample. 1 g/g ratio real 0 Inf NA

    C_to_N A numeric value representing the C to N ratio of the sample. 35 g/g ratio real 0 Inf NA

    C_to_P A numeric value representing the C to P ratio of the sample. 35 g/g ratio real 0 Inf NA

    Ca_absolute The

  3. Statistical Area 2 2025

    • datafinder.stats.govt.nz
    csv, dwg, geodatabase +6
    Updated Aug 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stats NZ (2025). Statistical Area 2 2025 [Dataset]. https://datafinder.stats.govt.nz/layer/120978-statistical-area-2-2025/
    Explore at:
    pdf, csv, kml, mapinfo tab, shapefile, geopackage / sqlite, geodatabase, dwg, mapinfo mifAvailable download formats
    Dataset updated
    Aug 8, 2025
    Dataset provided by
    Statistics New Zealandhttp://www.stats.govt.nz/
    Authors
    Stats NZ
    License

    https://datafinder.stats.govt.nz/license/attribution-4-0-international/https://datafinder.stats.govt.nz/license/attribution-4-0-international/

    Area covered
    Description

    Refer to the 'Current Geographic Boundaries Table' layer for a list of all current geographies and recent updates.

    This dataset is the definitive version of the annually released statistical area 2 (SA2) boundaries as at 1 January 2025 as defined by Stats NZ. This version contains 2,395 SA2s (2,379 digitised and 16 with empty or null geometries (non-digitised)).

    SA2 is an output geography that provides higher aggregations of population data than can be provided at the statistical area 1 (SA1) level. The SA2 geography aims to reflect communities that interact together socially and economically. In populated areas, SA2s generally contain similar sized populations.

    The SA2 should:

    form a contiguous cluster of one or more SA1s,

    excluding exceptions below, allow the release of multivariate statistics with minimal data suppression,

    capture a similar type of area, such as a high-density urban area, farmland, wilderness area, and water area,

    be socially homogeneous and capture a community of interest. It may have, for example:

    • a shared road network,
    • shared community facilities,
    • shared historical or social links, or
    • socio-economic similarity,

    form a nested hierarchy with statistical output geographies and administrative boundaries. It must:

    • be built from SA1s,
    • either define or aggregate to define SA3s, urban areas, territorial authorities, and regional councils.

    SA2s in city council areas generally have a population of 2,000–4,000 residents while SA2s in district council areas generally have a population of 1,000–3,000 residents.

    In major urban areas, an SA2 or a group of SA2s often approximates a single suburb. In rural areas, rural settlements are included in their respective SA2 with the surrounding rural area.

    SA2s in urban areas where there is significant business and industrial activity, for example ports, airports, industrial, commercial, and retail areas, often have fewer than 1,000 residents. These SA2s are useful for analysing business demographics, labour markets, and commuting patterns.

    In rural areas, some SA2s have fewer than 1,000 residents because they are in conservation areas or contain sparse populations that cover a large area.

    To minimise suppression of population data, small islands with zero or low populations close to the mainland, and marinas are generally included in their adjacent land-based SA2.

    Zero or nominal population SA2s

    To ensure that the SA2 geography covers all of New Zealand and aligns with New Zealand’s topography and local government boundaries, some SA2s have zero or nominal populations. These include:

    • SA2s where territorial authority boundaries straddle regional council boundaries. These SA2s each have fewer than 200 residents and are: Arahiwi, Tiroa, Rangataiki, Kaimanawa, Taharua, Te More, Ngamatea, Whangamomona, and Mara.
    • SA2s created for single islands or groups of islands that are some distance from the mainland or to separate large unpopulated islands from urban areas
    • SA2s that represent inland water, inlets or oceanic areas including: inland lakes larger than 50 square kilometres, harbours larger than 40 square kilometres, major ports, other non-contiguous inlets and harbours defined by territorial authority, and contiguous oceanic areas defined by regional council.
    • SA2s for non-digitised oceanic areas, offshore oil rigs, islands, and the Ross Dependency. Each SA2 is represented by a single meshblock. The following 16 SA2s are held in non-digitised form (SA2 code; SA2 name):

    400001; New Zealand Economic Zone, 400002; Oceanic Kermadec Islands, 400003; Kermadec Islands, 400004; Oceanic Oil Rig Taranaki, 400005; Oceanic Campbell Island, 400006; Campbell Island, 400007; Oceanic Oil Rig Southland, 400008; Oceanic Auckland Islands, 400009; Auckland Islands, 400010 ; Oceanic Bounty Islands, 400011; Bounty Islands, 400012; Oceanic Snares Islands, 400013; Snares Islands, 400014; Oceanic Antipodes Islands, 400015; Antipodes Islands, 400016; Ross Dependency.

    SA2 numbering and naming

    Each SA2 is a single geographic entity with a name and a numeric code. The name refers to a geographic feature or a recognised place name or suburb. In some instances where place names are the same or very similar, the SA2s are differentiated by their territorial authority name, for example, Gladstone (Carterton District) and Gladstone (Invercargill City).

    SA2 codes have six digits. North Island SA2 codes start with a 1 or 2, South Island SA2 codes start with a 3 and non-digitised SA2 codes start with a 4. They are numbered approximately north to south within their respective territorial authorities. To ensure the north–south code pattern is maintained, the SA2 codes were given 00 for the last two digits when the geography was created in 2018. When SA2 names or boundaries change only the last two digits of the code will change.

    High-definition version

    This high definition (HD) version is the most detailed geometry, suitable for use in GIS for geometric analysis operations and for the computation of areas, centroids and other metrics. The HD version is aligned to the LINZ cadastre.

    Macrons

    Names are provided with and without tohutō/macrons. The column name for those without macrons is suffixed ‘ascii’.

    Digital data

    Digital boundary data became freely available on 1 July 2007.

    Further information

    To download geographic classifications in table formats such as CSV please use Ariā

    For more information please refer to the Statistical standard for geographic areas 2023.

    Contact: geography@stats.govt.nz

  4. Data from: Degrees of Freedom: Search Cost and Self-Consistency

    • tandf.figshare.com
    txt
    Updated Sep 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lijun Wang; Hongyu Zhao; Xiaodan Fan (2024). Degrees of Freedom: Search Cost and Self-Consistency [Dataset]. http://doi.org/10.6084/m9.figshare.26521195.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Sep 24, 2024
    Dataset provided by
    Taylor & Francishttps://taylorandfrancis.com/
    Authors
    Lijun Wang; Hongyu Zhao; Xiaodan Fan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Model degrees of freedom (df) is a fundamental concept in statistics because it quantifies the flexibility of a fitting procedure and is indispensable in model selection. To investigate the gap between df and the number of independent variables in the fitting procedure, Tibshirani introduced the search degrees of freedom (sdf) concept to account for the search cost during model selection. However, this definition has two limitations: it does not consider fitting procedures in augmented spaces and does not use the same fitting procedure for sdf and df. We propose a modified search degrees of freedom (msdf) to directly account for the cost of searching in either original or augmented spaces. We check this definition for various fitting procedures, including classical linear regressions, spline methods, adaptive regressions (the best subset and the lasso), regression trees, and multivariate adaptive regression splines (MARS). In many scenarios when sdf is applicable, msdf reduces to sdf. However, for certain procedures like the lasso, msdf offers a fresh perspective on search costs. For some complex procedures like MARS, the df has been pre-determined during model fitting, but the df of the final fitted procedure might differ from the pre-determined one. To investigate this discrepancy, we introduce the concepts of nominal df and actual df, and define the property of self-consistency, which occurs when there is no gap between these two df’s. We propose a correction procedure for MARS to align these two df’s, demonstrating improved fitting performance through extensive simulations and two real data applications. Supplementary materials for this article are available online.

  5. Statistical Area 3 2023 (generalised)

    • datafinder.stats.govt.nz
    csv, dwg, geodatabase +6
    Updated Dec 1, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stats NZ (2022). Statistical Area 3 2023 (generalised) [Dataset]. https://datafinder.stats.govt.nz/layer/111202-statistical-area-3-2023-generalised/
    Explore at:
    kml, shapefile, dwg, pdf, geopackage / sqlite, mapinfo tab, geodatabase, mapinfo mif, csvAvailable download formats
    Dataset updated
    Dec 1, 2022
    Dataset provided by
    Statistics New Zealandhttp://www.stats.govt.nz/
    Authors
    Stats NZ
    License

    https://datafinder.stats.govt.nz/license/attribution-4-0-international/https://datafinder.stats.govt.nz/license/attribution-4-0-international/

    Area covered
    Description

    Statistical area 3 (SA3) is a new output geography, introduced in 2023, that allows aggregations of population data between the SA2 geography and territorial authority geography.

    This dataset is the definitive version of the annually released statistical area 3 (SA3) boundaries as at 1 January 2023 as defined by Stats NZ. This version contains 929 SA3s, including 4 non-digitised SA3s.

    The SA3 geography aims to meet three purposes:

    1. approximate suburbs in major, large, and medium urban areas,

    2. in predominantly rural areas, provide geographical areas that are larger in area and population size than SA2s but smaller than territorial authorities,

    3. minimise data suppression.

    SA3s in major, large, and medium urban areas were created by combining SA2s to approximate suburbs as delineated in the Fire and Emergency NZ (FENZ) Localities dataset. Some of the resulting SA3s have very large populations.

    Outside of major, large, and medium urban areas, SA3s generally have populations of 5,000–10,000. These SA3s may represent either a single small urban area, a combination of small urban areas and their surrounding rural SA2s, or a combination of rural SA2s.

    Zero or nominal population SA3s

    To minimise the amount of unsuppressed data that can be provided in multivariate statistical tables, SA2s with fewer than 1,000 residents are combined with other SA2s wherever possible to reach the 1,000 SA3 population target. However, there are still a number of SA3s with zero or nominal populations.

    Small population SA2s designed to maintain alignment between territorial authority and regional council geographies are merged with other SA2s to reach the 5,000–10,000 SA3 population target. These merges mean that some SA3s do not align with regional council boundaries but are aligned to territorial authority.

    Small population island SA2s are included in their adjacent land-based SA3.

    Island SA2s outside territorial authority or region are the same in the SA3 geography.

    Inland water SA2s are aggregated and named by territorial authority, as in the urban rural classification.

    Inlet SA2s are aggregated and named by territorial authority or regional council where the water area is outside the territorial authority.

    Oceanic SA2s translate directly to SA3s as they are already aggregated to regional council.

    The 16 non-digitised SA2s are aggregated to the following 4 non-digitised SA3s (SA3 code; SA3 name):

    70001; Oceanic outside region, 70002; Oceanic oil rigs, 70003; Islands outside region, 70004; Ross Dependency outside region.

    SA3 numbering and naming

    Each SA3 is a single geographic entity with a name and a numeric code. The name refers to a suburb,recognised place name, or portion of a territorial authority. In some instances where place names are the same or very similar, the SA3s are differentiated by their territorial authority, for example, Hillcrest (Hamilton City) and Hillcrest (Rotorua District).

    SA3 codes have five digits. North Island SA3 codes start with a 5, South Island SA3 codes start with a 6 and non-digitised SA3 codes start with a 7. They are numbered approximately north to south within their respective territorial authorities. When first created in 2023, the last digit of each code was 0. When SA3 boundaries change in future, only the last digit of the code will change to ensure the north-south pattern is maintained.

    For more information please refer to the Statistical standard for geographic areas 2023.

    Generalised version

    This generalised version has been simplified for rapid drawing and is designed for thematic or web mapping purposes.

    Macrons

    Names are provided with and without tohutō/macrons. The column name for those without macrons is suffixed ‘ascii’.

    Digital data

    Digital boundary data became freely available on 1 July 2007.

    To download geographic classifications in table formats such as CSV please use Ariā

  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Pacific Northwest National Laboratory 2 (2021). Nominal and adversarial synthetic PMU data for standard IEEE test systems [Dataset]. http://doi.org/10.25584/DataHub/1788186

Nominal and adversarial synthetic PMU data for standard IEEE test systems

Explore at:
Dataset updated
Jun 15, 2021
Dataset provided by
Pacific Northwest National Laboratory 2
US
PNNL
Description

GridSTAGE (Spatio-Temporal Adversarial scenario GEneration) is a framework for the simulation of adversarial scenarios and the generation of multivariate spatio-temporal data in cyber-physical systems. GridSTAGE is developed based on Matlab and leverages Power System Toolbox (PST) where the evolution of the power network is governed by nonlinear differential equations. Using GridSTAGE, one can create several event scenarios that correspond to several operating states of the power network by enabling or disabling any of the following: faults, AGC control, PSS control, exciter control, load changes, generation changes, and different types of cyber-attacks. Standard IEEE bus system data is used to define the power system environment. GridSTAGE emulates the data from PMU and SCADA sensors. The rate of frequency and location of the sensors can be adjusted as well. Detailed instructions on generating data scenarios with different system topologies, attack characteristics, load characteristics, sensor configuration, control parameters are available in the Github repository - https://github.com/pnnl/GridSTAGE. There is no existing adversarial data-generation framework that can incorporate several attack characteristics and yield adversarial PMU data. The GridSTAGE framework currently supports simulation of False Data Injection attacks (such as a ramp, step, random, trapezoidal, multiplicative, replay, freezing) and Denial of Service attacks (such as time-delay, packet-loss) on PMU data. Furthermore, it supports generating spatio-temporal time-series data corresponding to several random load changes across the network or corresponding to several generation changes. A Koopman mode decomposition (KMD) based algorithm to detect and identify the false data attacks in real-time is proposed in https://ieeexplore.ieee.org/document/9303022. Machine learning-based predictive models are developed to capture the dynamics of the underlying power system with a high level of accuracy under various operating conditions for IEEE 68 bus system. The corresponding machine learning models are available at https://github.com/pnnl/grid_prediction.

Search
Clear search
Close search
Google apps
Main menu