Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
Data released under the Department of Energy's (DOE) Open Energy Data Initiative (OEDI). The Open Energy Data Initiative aims to improve and automate access of high-value energy data sets across the U.S. Department of Energy’s programs, offices, and national laboratories. OEDI aims to make data actionable and discoverable by researchers and industry to accelerate analysis and advance innovation.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The United States is embarking on an ambitious transition to a 100% clean energy economy by 2050, which will require improving the flexibility of electric grids. One way to achieve grid flexibility is to shed or shift demand to align with changing grid needs. To facilitate this, it is critical to understand how and when energy is used. High quality end-use load profiles (EULPs) provide this information, and can help cities, states, and utilities understand the time-sensitive value of energy efficiency, demand response, and distributed energy resources. Publicly available EULPs have traditionally had limited application because of age and incomplete geographic representation. To help fill this gap, the U.S. Department of Energy (DOE) funded a three-year project, End-Use Load Profiles for the U.S. Building Stock, that culminated in this publicly available dataset of calibrated and validated 15-minute resolution load profiles for all major residential and commercial building types and end uses, across all climate regions in the United States. These EULPs were created by calibrating the ResStock and ComStock physics-based building stock models using many different measured datasets, as described in the "Technical Report Documenting Methodology" linked in the submission.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The BUTTER Empirical Deep Learning Dataset represents an empirical study of the deep learning phenomena on dense fully connected networks, scanning across thirteen datasets, eight network shapes, fourteen depths, twenty-three network sizes (number of trainable parameters), four learning rates, six minibatch sizes, four levels of label noise, and fourteen levels of L1 and L2 regularization each. Multiple repetitions (typically 30, sometimes 10) of each combination of hyperparameters were preformed, and statistics including training and test loss (using a 80% / 20% shuffled train-test split) are recorded at the end of each training epoch. In total, this dataset covers 178 thousand distinct hyperparameter settings ("experiments"), 3.55 million individual training runs (an average of 20 repetitions of each experiments), and a total of 13.3 billion training epochs (three thousand epochs were covered by most runs). Accumulating this dataset consumed 5,448.4 CPU core-years, 17.8 GPU-years, and 111.2 node-years.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
From website:
Open Energy Info is a platform to connect the world?s energy data. It is a linked open data platform bringing together energy information to provide improved analyses, unique visualizations, and real-time access to data. OpenEI follows guidelines set by the White House?s Open Government Initiative , which is focused on transparency, collaboration, and participation. OpenEI strives to provide open access to this energy information, which will spur creativity and drive innovation in the energy sector.
Project representatives confirmed via email:
The default license for all information in OpenEI I Creative Commons Zero (http://wiki.creativecommons.org/CC0_FAQ). You can see this documented in a notice shown in the site's editing interface for any page (e.g. http://en.openei.org/w/index.php?title=Colorado&action=edit).
Project representatives confirmed via email:
As for bulk downloads of data... We currently offer the following:
- "Download CSV" links are present on certain pages (e.g. http://en.openei.org/wiki/Map_of_Clean_Energy_Companies)
- A SPARQL endpoint for querying data (http://en.openei.org/sparql), plus a few examples of its use (http://en.openei.org/resources/)
- RDF Exports of any given page via the "browse properties" link (e.g. http://en.openei.org/wiki/Special:ExportRDF/Colorado)
We realize that we could be doing more in this area, though. A few things we're planning to do:
- Provide a bulk download of our complete RDF (similar to how one can get complete dbpedia exports)
- Provide a better way to find various datasets (and download them in various formats, such as RDF and CSV)
- Provide a mechanism for submitting bulk datasets which is separate from, but complementary to, our Wiki)
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is a synthetic building operation dataset which includes HVAC, lighting, miscellaneous electric loads (MELs) system operating conditions, occupant counts, environmental parameters, end-use and whole-building energy consumptions at 10-minute intervals. The data is created with 1395 annual simulations using the U.S. DOE detailed medium-sized reference office building, and 30 years' historical weather data in three typical climates including Miami, San Francisco, and Chicago. Three energy efficiency levels of the building and systems are considered. Assumptions regarding occupant movements, occupants' diverse temperature preferences, lighting, and MELs are adopted to reflect realistic building operations. A semantic building metadata schema - BRICK, is used to store the building metadata. The dataset is saved in a 1.2 TB of compressed HDF5 file. This dataset can be used in various applications, including building energy and load shape benchmarking, energy model calibration, evaluation of occupant and weather variability and their influences on building performance, algorithm development and testing for thermal and energy load prediction, model predictive control, policy development for reinforcement learning based building controls.
This dataset includes links to the PoroTomo DAS data in both SEG-Y and hdf5 (via h5py and HSDS with h5pyd) formats with tutorial notebooks for use. Data are hosted on Amazon Web Services (AWS) Simple Storage Service (S3) through the Open Energy Data Initiative (OEDI). Also included are links to the documentation for the dataset, Jupyter Notebook tutorials for working with the data as it is stored in AWS S3, and links to data viewers in OEDI for the horizontal (DASH) and vertical (DASV) DAS datasets. Horizontal DAS (DASH) data collection began 3/8/16, paused, and then started again on 3/11/2016 and ended 3/26/2016 using zigzag trenched fiber optic cabels. Vertical DAS (DASV) data collection began 3/17/2016 and ended 3/28/16 using a fiber optic cable through the first 363 m of a vertical well. These are raw data files from the DAS deployment at (DASH) and below (DASV) the surface during testing at the PoroTomo Natural Laboratory at Brady Hot Spring in Nevada. SEG-Y and hdf5 files are stored in 30 second files organized into directories by day. The hdf5 files available via HSDS are stored in daily files. Metadata includes information on the timing of recording gaps and a file count is included that lists the number of files from each day of recording. These data are available for download without login credentials through the free and publicly accessible Open Energy Data Initiative (OEDI) data viewer which allows users to browse and download individual or groups of files.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Lawrence Berkeley National Laboratory (Berkeley Lab) estimates hourly project-level generation data for utility-scale solar projects and hourly county-level generation data for residential and non-residential distributed photovoltaic (PV) systems in the seven organized wholesale markets and 10 additional Balancing Areas. To encourage its broader use, Berkeley Lab has made this data file public here at OEDI. The public project-level dataset is updated annually with data from the previous calendar year. For more information about the research project, including a technical report, briefing material, visualizations, and additional data, please visit the project homepage linked in this submission.
A newer version of the data exists and can be found linked in the resources of this submission under "Solar-to-Grid Public Data File Updated 2021".
Alternative fueling stations are located throughout the United States and Canada, and their availability continues to grow. The Alternative Fuels Data Center (AFDC) maintains a website where you can find alternative fueling stations near you or on a route, obtain counts of alternative fueling stations by state, view maps, and more. The most recent dataset available for download here provides a "snapshot" of the alternative fueling station information for compressed natural gas (CNG), ethanol (E85), propane/liquefied petroleum gas (LPG), biodiesel (B20 and above), electric vehicle charging, hydrogen, and liquefied natural gas (LNG), as of July 29, 2021.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Newberry Volcano, a voluminous (500 km3) basaltic/andesitic/rhyolitic shield volcano located near the intersection of the Cascade volcanic arc, the Oregon High Lava Plains and Brothers Fault Zone, and the northern Basin and Range Province, has been the site of geothermal exploration for more than 40 years. This has resulted in a unique resource: an extensive set of surficial and subsurface information appropriate to constrain the baseline structure of, and conditions within a high heat capacity magmatically hosted geothermal system.
In 2012 and 2014 AltaRock Energy conducted repeated stimulation of an enhanced geothermal systems (EGS) prospect along the western flank of the Newberry Volcano. A surface based monitoring effort was conducted independent of these stimulation attempts in both 2012 and 2014 through a collaboration between NETL, Oregon State University and Zonge International. This program included utilization of 3-D and 4-D magnetotelluric, InSAR, ground-based interferometric radar, and microgravity observations within and surrounding the planned EGS stimulation zone. These observations as well as borehole and microseismic stress field and location solutions provided by AltaRock and its collaborators, in combination with well logs, petrologic and geochemical data sets, LIDAR mapping of fault traces and extrusive volcanics, surficial geologic mapping and seismic tomography, have resulted in development of a framework, subsurface geologic model for Newberry Volcano.
The Newberry subsurface geologic model is a three-dimensional digital model constructed in EarthVision that enables lithology, directly and remotely measured material properties, and derived properties such as permeability, porosity and temperature, to be coregistered. This provides a powerful tool for characterizing and evaluating the sustainability of the site for EGS production and testing, particularly within the data-dense western portion of the volcano. The model has implications for understanding the previous EGS stimulations at Newberry as well as supporting future research and resource characterization opportunities. A portion of the Newberry area has been selected as a candidate site for the DOE FORGE (Frontier Observatory for Research in Geothermal Energy) Program through a collaboration between Pacific Northwest National Laboratory, Oregon State University, AltaRock Energy and additional partners. Thus, the conceptual geologic model presented here will support and benefit from future enhancements associated with that effort. --Mark-Moser et al. 2016
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Commercial reference buildings provide complete descriptions for whole building energy analysis using EnergyPlus (see "About EnergyPlus" resource link) simulation software. Included here is data pertaining to the reference building type "Small Hotel" for each of the 16 climate zones described on the Wiki page (see "OpenEI Wiki Page for Commercial Reference Buildings" resource link), and each of three construction categories: new (2004) construction, post-1980 construction existing buildings, and pre-1980 construction existing buildings.
The dataset includes four key components: building summary, zone summary, location summary and a picture. Building summary includes details about: form, fabric, and HVAC. Zone summary includes details such as: area, volume, lighting, and occupants for all types of zones in the building. Location summary includes key building information as it pertains to each climate zone, including: fabric and HVAC details, utility costs, energy end use, and peak energy demand.
In total, DOE developed 16 reference building types that represent approximately 70% of commercial buildings in the U.S.; for each type, building models are available for each of the three construction categories. The commercial reference buildings (formerly known as commercial building benchmark models) were developed by the U.S. Department of Energy (DOE), in conjunction with three of its national laboratories.
Additional data is available directly from DOE's Energy Efficiency & Renewable Energy (EERE) website (see "About Commercial Buildings" resource link), including EnergyPlus software input files (.idf) and results of the EnergyPlus simulations (.html).
Note: There have been many changes and improvements since this dataset was released. Several revisions have been made to the models and moved to a different approach to representing typical building energy consumption. For current data on building energy consumption please see the ComStock resource below.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This spreadsheet contains per-site metadata for the WIND Toolkit sites and serves as an index for the raw data hosted on Globus connect (nrel#globus:/globusro/met_data). Aside from the metadata, per site average power and capacity factor are given. This data was prepared by 3TIER under contract by NREL and is public domain.
Authoritative documentation on the creation of the underlying dataset is at:
Final Report on the Creation of the Wind Integration National Dataset (WIND) Toolkit and API: http://www.nrel.gov/docs/fy16osti/66189.pdf
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Super-Resolution for Renewable Energy Resource Data with Climate Change Impacts (Sup3rCC) data is a collection of 4km hourly wind, solar, temperature, humidity, and pressure fields for the contiguous United States under various climate change scenarios.
Sup3rCC is downscaled Global Climate Model (GCM) data. The downscaling process was performed using a generative machine learning approach called sup3r: Super-Resolution for Renewable Energy Resource Data (linked below as "Sup3r GitHub Repo"). The data includes both historical and future weather years, although the historical years represent the historical climate, not the actual historical weather that we experienced. You cannot use Sup3rCC data to study historical weather events, although other sup3r datasets may be intended for this.
The Sup3rCC data is intended to help researchers study the impact of climate change on energy systems with high levels of wind and solar capacity. Please note that all climate change data is only a representation of the possible future climate and contains significant uncertainty. Analysis of multiple climate change scenarios and multiple climate models can help quantify this uncertainty.
Note: v0.2.2 was released on May 7th 2025 for preliminary use while under peer review. Files and folders with a preliminary tag are subject to change in response to review feedback. The preliminary tag and this note will be removed upon final release.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Screening Tool for Equitable Adoption and DeploYment of Solar (STEADy Solar) is a database and mapping tool designed to promoting clean energy investments for low-income communities across the United States. The tool indicates locations that may be eligible for the Investment Tax Credit bonus adders defined in the 2022 Inflation Reduction Act (IRA) and combines this information with demographics, social vulnerability, solar technical potential, solar economics (modeled net present value), and building counts by use-type. It can be used by states, municipalities, community-based organizations, developers, and researchers to identify sites where solar projects may be economical and where federal incentives may be available to support equitable adoption of solar.Specific values include: Areas eligible for the Energy Communities Tax Credit Bonus Program (including brownfield site counts)Areas eligible for the Low Income Communities Bonus Credit Program (including Tribal Lands, and covered affordable housing project counts)Areas categorized as disadvantaged by Justice40Commercial and Residential Solar economics characterized by the Net Present Value and Simple Payback PeriodTotal Population, Race, and EthnicityMedian Household Income, Poverty rate, Household TenureSocial VulnerabilityCount of buildings, developable rooftop solar capacity (in kWdc) and estimated annual generation potential (in kWh) on four building types: Government General Services, Government Emergency Response, Grade Schools, and Colleges/Universities. The linked report describes the STEADy dataset metadata and presents high level insights from the data. The downloadable and formatted excel dataset makes it easy for users to gain insights for their locations. Supporting .csv and shapefiles provide users with the full data to run their own analyses on equitable solar siting.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This submission contains an open-source library of transient events in distributed system with high solar PV. The library includes the collected data, related documents and scripts for loading the data. The data library is built for transient event detection and machine learning based analysis algorithm development. The data was collected via both field test and software simulation. The units for the data are included in the data file headers for each data series. A text editor or spreadsheet software, such as Excel, and Matlab is required to view the data.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The NREL PVDAQ is a large-scale time-series database containing system metadata and performance data from a variety of experimental PV sites and commercial public PV sites. The datasets are used to perform on-going performance and degradation analysis. Some of the sets can exhibit common elements that effect PV performance (e.g. soiling). The dataset consists of a series of files devoted to each of the systems and an associated set of metadata information that explains details about the system hardware and the site geo-location. Some system datasets also include environmental sensors that cover irradiance, temperatures, wind speeds, and precipitation at the site.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time-coincident load, wind, and solar data including actual and probabilistic forecast datasets at 5-min resolution for ERCOT, MISO, NYISO, and SPP. Wind and solar profiles are supplied for existing sites as well as planned sites based on interconnection queue projects as of 2021. For ERCOT actuals are provided for 2017 and 2018 and forecasts for 2018, and for the remaining ISOs actuals are provided for 2018 and 2019 and forecasts for 2019.
There datasets were produced by NREL as part of the ARPA-E PERFORM project, an ARPA-E funded program that aim to use time-coincident power and load seeks to develop innovative management systems that represent the relative delivery risk of each asset and balance the collective risk of all assets across the grid. For more information on the datasets and methods used to generate them see https://github.com/PERFORM-Forecasts/documentation.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This repository documents the Open Water Rate Specification (OWRS), a machine-readable format for specifying and sharing water rate information. OWRS is designed for analysts, economists, and software developers interested in analyzing water rates. OWRS attempts to fully encode a water utility's rate structure and pricing schedules in a form that is easy to store, share, modify and apply programmatically.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Procurement Analysis Tool (PAT) was developed at NREL to help organizations explore renewable energy options that align with their goals. Users input facility data and answer goal-oriented questions. PAT analyzes this information to identify potential wind, solar, or storage resources and suitable procurement options (PPA, Green Tariffs) that align with their budget, location, and sustainability goals. For more information see the "Procurement Analysis Tool" resource below.
The Renewable Electricity Procurement Options Data (RE-POD) was an aggregated dataset meant to help local jurisdictions and utility customers within those jurisdictions understand the options that may be available to them to procure renewable electricity or renewable energy credits to meet energy goals. RE-POD has been discontinued and replaced with the PAT.
This data is part of a suite of state and local energy profile data available at the "State and Local Energy Profile Data Suite" link below and builds on Cities-LEAP energy modeling, available at the "EERE Cities-LEAP Page" link below. Examples of how to use the data to inform energy planning can be found at the "Example Uses" link below.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This research is to meant to demonstrate the OEDI SI use case for distributed optimal power flow (DOPF). The goal was to formulate the optimal power flow problem in the distribution system for active and reactive power setpoints of PV systems using topology information and voltage measurements. The co-simulation runs every 15 minutes as outlined within the scenario file for the given feeder configuration.
The linked GitHub repository includes five federates to achieve DOPF for the small, medium, large, and IEEE 123 feeder scenarios. We are using the OEDI SI framework, as well as the example feeder, sensor, recorder, and estimator federates provided in the example repository for OEDI SI. We also provide a runner script for switching between scenarios.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This project provides a national unified database of residential building retrofit measures and associated retail prices and end-user might experience. These data are accessible to software programs that evaluate most cost-effective retrofit measures to improve the energy efficiency of residential buildings and are used in the consumer-facing website https://remdb.nrel.gov/
This publicly accessible, centralized database of retrofit measures offers the following benefits:
This database provides full price estimates for many different retrofit measures. For each measure, the database provides a range of prices, as the data for a measure can vary widely across regions, houses, and contractors. Climate, construction, home features, local economy, maturity of a market, and geographic location are some of the factors that may affect the actual price of these measures.
This database is not intended to provide specific cost estimates for a specific project. The cost estimates do not include any rebates or tax incentives that may be available for the measures. Rather, it is meant to help determine which measures may be more cost-effective. The National Renewable Energy Laboratory (NREL) makes every effort to ensure accuracy of the data; however, NREL does not assume any legal liability or responsibility for the accuracy or completeness of the information.
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
Data released under the Department of Energy's (DOE) Open Energy Data Initiative (OEDI). The Open Energy Data Initiative aims to improve and automate access of high-value energy data sets across the U.S. Department of Energy’s programs, offices, and national laboratories. OEDI aims to make data actionable and discoverable by researchers and industry to accelerate analysis and advance innovation.