Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Electric utilities report a huge amount of information to the US government and other public agencies. This includes yearly, monthly, and even hourly data about fuel burned, electricity generated, operating expenses, power plant usage patterns and emissions. Unfortunately, much of this data is not released in well documented, ready-to-use, machine readable formats. Data from different agencies tends not to be standardized or easily used in tandem. Several commercial data services clean, package, and re-sell this this data, but at prices which are too high to be accessible to many smaller stakeholders.
The Public Utility Data Liberation (PUDL) project takes information that’s already publicly available, and makes it publicly usable, by cleaning, standardizing, and cross-linking utility data from different sources in a single database. Thus far our primary focus has been on fuel use, generation, operating costs, and operation history. It currently includes data from:
We archive snapshots of the raw inputs on Zenodo and all our data processing uses those snapshots as a starting place for reproducibility.
You can find the source code that generates this database in the PUDL repository on GitHub. The PUDL project is coordinated by Catalyst Cooperative.
The data is updated nightly by our automated nightly builds. When they are successful, new data is uploaded to the AWS Open Data Registry
We publish PUDL Data Dictionaries on Read the Docs which provide more descriptive information about the data.
Dataset header image courtesy of Gerry Machen via Flickr under a CC-BY-ND license
Facebook
TwitterThe Utility Energy Registry (UER) is a database platform that provides streamlined public access to aggregated community-scale utility-reported energy data. The UER is intended to promote and facilitate community-based energy planning and energy use awareness and engagement. On April 19, 2018, the New York State Public Service Commission (PSC) issued the Order Adopting the Utility Energy Registry under regulatory CASE 17-M-0315. The order requires utilities under its regulation to develop and report community energy use data to the UER.This dataset includes electricity and natural gas usage data reported at the city, town, and village level collected under a data protocol in effect between 2016 and 2021. Other UER datasets include energy use data reported at the county and ZIP code level. Data collected after 2021 were collected according to a modified protocol. Those data may be found at https://data.ny.gov/Energy-Environment/Utility-Energy-Registry-Monthly-Community-Energy-U/4txm-py4p.Data in the UER can be used for several important purposes such as planning community energy programs, developing community greenhouse gas emissions inventories, and relating how certain energy projects and policies may affect a particular community. It is important to note that the data are subject to privacy screening and fields that fail the privacy screen are withheld.
Facebook
Twitter***1. - Names and descriptions of variables in Utilities dataset. Variable Description month month (coded as a number) day day of the month on which bill was calculated year year of bill temp average temperature (degrees Celsius) for billing period kwh electricity usage (kwh) ccf gas usage (ccf) thermsPerDay a numeric vector billingDays number of billing days in billing period totalbill total monthly bill (in dollars) gasbill monthly gas bill (in dollars) elecbill monthly electric bill (in dollars) notes notes about the billing period donate (yes or no) did the person add money to their bill to be donated to Operation Fuel, a charity providing heat to families/small business in need
Facebook
TwitterODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
Monthly utility data for all City of Boston accounts. This data comes from Boston’s Enterprise Energy Management System. This software tool serves as the system of record for all municipal utility expenditures and energy/water use.
Facebook
TwitterSummary of the NYS gas and electric utilities’ customer service performance indicators for each of the last three calendar years. Reported by the utilities on a uniform basis, these performance indicators allow comparative analysis of customer service, identification of overall trends in customer service, and identification of service deficiencies.
Facebook
Twitterhttps://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy
According to our latest research, the global health data utility platforms market size reached USD 4.87 billion in 2024, driven by the accelerating adoption of digital health solutions and the need for integrated healthcare data management. The market is experiencing robust expansion, registering a CAGR of 18.2% from 2025 to 2033. By the end of 2033, the market is projected to achieve a value of USD 21.7 billion. This substantial growth is primarily fueled by the increasing demand for interoperable data platforms that can support clinical, operational, and research needs across diverse healthcare ecosystems.
One of the primary growth factors for the health data utility platforms market is the rapid digital transformation occurring within the global healthcare sector. Healthcare organizations are increasingly recognizing the value of harnessing data-driven insights to improve patient outcomes, optimize resource allocation, and enhance operational efficiency. The proliferation of electronic health records (EHRs), wearable devices, and telemedicine has resulted in an exponential increase in healthcare data volume and complexity. As a result, there is a pressing need for advanced platforms that can aggregate, normalize, and analyze disparate data sources, enabling stakeholders to make informed decisions and drive value-based care initiatives. Furthermore, the rising prevalence of chronic diseases and the growing emphasis on population health management are compelling healthcare providers and payers to invest in robust health data utility platforms.
Another significant driver propelling the growth of the health data utility platforms market is the evolving regulatory landscape and the increasing focus on data privacy and security. Governments and regulatory bodies worldwide are implementing stringent data protection frameworks, such as HIPAA in the United States and GDPR in Europe, to safeguard sensitive health information. These regulations necessitate the deployment of secure, compliant platforms capable of ensuring data integrity, traceability, and controlled access. Consequently, healthcare organizations are seeking solutions that not only facilitate seamless data exchange but also adhere to regulatory requirements, thus fostering trust among patients and stakeholders. The integration of advanced technologies such as artificial intelligence (AI), machine learning (ML), and blockchain further enhances the capabilities of health data utility platforms, enabling predictive analytics, real-time monitoring, and secure data sharing.
Additionally, the growing trend towards value-based care and outcome-driven reimbursement models is stimulating the adoption of health data utility platforms. Payers, providers, and pharmaceutical companies are increasingly collaborating to harness real-world evidence, improve care coordination, and streamline clinical workflows. Health data utility platforms play a pivotal role in bridging data silos, facilitating interoperability, and supporting comprehensive analytics across the healthcare continuum. As stakeholders strive to deliver personalized care, reduce costs, and improve patient satisfaction, the demand for scalable, flexible, and interoperable data utility solutions is expected to surge, further accelerating market growth.
From a regional perspective, North America currently dominates the health data utility platforms market, accounting for the largest revenue share in 2024. This leadership is attributed to the region’s advanced healthcare infrastructure, high adoption of digital health technologies, and supportive regulatory environment. Europe follows closely, driven by increasing investments in healthcare IT and a strong focus on data privacy. The Asia Pacific region is emerging as a key growth market, fueled by expanding healthcare access, rising digitalization efforts, and supportive government initiatives. Latin America and the Middle East & Africa are also witnessing steady growth, albeit at a relatively slower pace, as healthcare systems in these regions continue to modernize and adopt data-driven approaches.
The health data utility platforms market is segmented by component into software and services. The software segment encompasses a wide array of solutions designed to facilitate the aggregation, integration, and analysis of healthcare data from multiple sources. These platforms typically offer modules for data ingestion, normalization, interoperability, and a
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset, compiled by NREL using data from ABB, the Velocity Suite and the U.S. Energy Information Administration dataset 861, provides average residential, commercial and industrial electricity rates with likely zip codes for both investor owned utilities (IOU) and non-investor owned utilities. Note: the files include average rates for each utility (not average rates per zip code), but not the detailed rate structure data found in the OpenEI U.S. Utility Rate Database.
Facebook
TwitterThe Utility Energy Registry (UER) is a database platform that provides streamlined public access to aggregated community-scale energy data. The UER is intended to promote and facilitate community-based energy planning and energy use awareness and engagement. On April 19, 2018, the New York State Public Service Commission (PSC) issued the Order Adopting the Utility Energy Registry under regulatory CASE 17-M-0315, and updated the protocol in a modification order on August 12, 2021. The order requires utilities and CCA administrators under its regulation to develop and report community energy use data to the UER. This dataset includes electricity and natural gas usage data reported at the city, town, and village level. Other UER datasets include energy use data reported at the county and ZIP code level.
Data in the UER can be used for several important purposes such as planning community energy programs, developing community greenhouse gas emissions inventories, and relating how certain energy projects and policies may affect a particular community. It is important to note that the data are subject to privacy screening and fields that fail the privacy screen are withheld.
The New York State Energy Research and Development Authority (NYSERDA) offers objective information and analysis, innovative programs, technical expertise, and support to help New Yorkers increase energy efficiency, save money, use renewable energy, and reduce reliance on fossil fuels. To learn more about NYSERDA’s programs, visit nyserda.ny.gov or follow us on X, Facebook, YouTube, or Instagram.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
PUDL v2025.2.0 Data Release
This is our regular quarterly release for 2025Q1. It includes updates to all the datasets that are published with quarterly or higher frequency, plus initial verisons of a few new data sources that have been in the works for a while.
One major change this quarter is that we are now publishing all processed PUDL data as Apache Parquet files, alongside our existing SQLite databases. See Data Access for more on how to access these outputs.
Some potentially breaking changes to be aware of:
In the EIA Form 930 – Hourly and Daily Balancing Authority Operations Report a number of new energy sources have been added, and some old energy sources have been split into more granular categories. See Changes in energy source granularity over time.
We are now running the EPA’s CAMD to EIA unit crosswalk code for each individual year starting from 2018, rather than just 2018 and 2021, resulting in more connections between these two datasets and changes to some sub-plant IDs. See the note below for more details.
Many thanks to the organizations who make these regular updates possible! Especially GridLab, RMI, and the ZERO Lab at Princeton University. If you rely on PUDL and would like to help ensure that the data keeps flowing, please consider joining them as a PUDL Sustainer, as we are still fundraising for 2025.
New Data
EIA 176
Add a couple of semi-transformed interim EIA-176 (natural gas sources and dispositions) tables. They aren’t yet being written to the database, but are one step closer. See #3555 and PRs #3590, #3978. Thanks to @davidmudrauskas for moving this dataset forward.
Extracted these interim tables up through the latest 2023 data release. See #4002 and #4004.
EIA 860
Added EIA 860 Multifuel table. See #3438 and #3946.
FERC 1
Added three new output tables containing granular utility accounting data. See #4057, #3642 and the table descriptions in the data dictionary:
out_ferc1_yearly_detailed_income_statements
out_ferc1_yearly_detailed_balance_sheet_assets
out_ferc1_yearly_detailed_balance_sheet_liabilities
SEC Form 10-K Parent-Subsidiary Ownership
We have added some new tables describing the parent-subsidiary company ownership relationships reported in the SEC’s Form 10-K, Exhibit 21 “Subsidiaries of the Registrant”. Where possible these tables link the SEC filers or their subsidiary companies to the corresponding EIA utilities. This work was funded by a grant from the Mozilla Foundation. Most of the ML models and data preparation took place in the mozilla-sec-eia repository separate from the main PUDL ETL, as it requires processing hundreds of thousands of PDFs and the deployment of some ML experiment tracking infrastructure. The new tables are handed off as nearly finished products to the PUDL ETL pipeline. Note that these are preliminary, experimental data products and are known to be incomplete and to contain errors. Extracting data tables from unstructured PDFs and the SEC to EIA record linkage are necessarily probabalistic processes.
See PRs #4026, #4031, #4035, #4046, #4048, #4050 and check out the table descriptions in the PUDL data dictionary:
out_sec10k_parents_and_subsidiaries
core_sec10k_quarterly_filings
core_sec10k_quarterly_exhibit_21_company_ownership
core_sec10k_quarterly_company_information
Expanded Data Coverage
EPA CEMS
Added 2024 Q4 of CEMS data. See #4041 and #4052.
EPA CAMD EIA Crosswalk
In the past, the crosswalk in PUDL has used the EPA’s published crosswalk (run with 2018 data), and an additional crosswalk we ran with 2021 EIA 860 data. To ensure that the crosswalk reflects updates in both EIA and EPA data, we re-ran the EPA R code which generates the EPA CAMD EIA crosswalk with 4 new years of data: 2019, 2020, 2022 and 2023. Re-running the crosswalk pulls the latest data from the CAMD FACT API, which results in some changes to the generator and unit IDs reported on the EPA side of the crosswalk, which feeds into the creation of core_epa_assn_eia_epacamd.
The changes only result in the addition of new units and generators in the EPA data, with no changes to matches at the plant level. However, the updates to generator and unit IDs have resulted in changes to the subplant IDs - some EIA boilers and generators which previously had no matches to EPA data have now been matched to EPA unit data, resulting in an overall reduction in the number of rows in the core_epa_assn_eia_epacamd_subplant_ids table. See issues #4039 and PR #4056 for a discussion of the changes observed in the course of this update.
EIA 860M
Added EIA 860m through December 2024. See #4038 and #4047.
EIA 923
Added EIA 923 monthly data through September 2024. See #4038 and #4047.
EIA Bulk Electricity Data
Updated the EIA Bulk Electricity data to include data published up through 2024-11-01. See #4042 and PR #4051.
EIA 930
Updated the EIA 930 data to include data published up through the beginning of February 2025. See #4040 and PR #4054. 10 new energy sources were added and 3 were retired; see Changes in energy source granularity over time for more information.
Bug Fixes
Fix an accidentally swapped set of starting balance / ending balance column rename parameters in the pre-2021 DBF derived data that feeds into core_ferc1_yearly_other_regulatory_liabilities_sched278. See issue #3952 and PRs #3969, #3979. Thanks to @yolandazzz13 for making this fix.
Added preliminary data validation checks for several FERC 1 tables that were missing it #3860.
Fix spelling of Lake Huron and Lake Saint Clair in out_vcerare_hourly_available_capacity_factor and related tables. See issue #4007 and PR #4029.
Quality of Life Improvements
We added a sources parameter to pudl.metadata.classes.DataSource.from_id() in order to make it possible to use the pudl-archiver repository to archive datasets that won’t necessarily be ingested into PUDL. See this PUDL archiver issue and PRs #4003 and #4013.
Other PUDL v2025.2.0 Resources
PUDL v2025.2.0 Data Dictionary
PUDL v2025.2.0 Documentation
PUDL in the AWS Open Data Registry
PUDL v2025.2.0 in a free, public AWS S3 bucket: s3://pudl.catalyst.coop/v2025.2.0/
PUDL v2025.2.0 in a requester-pays GCS bucket: gs://pudl.catalyst.coop/v2025.2.0/
Zenodo archive of the PUDL GitHub repo for this release
PUDL v2025.2.0 release on GitHub
PUDL v2025.2.0 package in the Python Package Index (PyPI)
Contact Us
If you're using PUDL, we would love to hear from you! Even if it's just a note to let us know that you exist, and how you're using the software or data. Here's a bunch of different ways to get in touch:
Follow us on GitHub
Use the PUDL Github issue tracker to let us know about any bugs or data issues you encounter
GitHub Discussions is where we provide user support.
Watch our GitHub Project to see what we're working on.
Email us at hello@catalyst.coop for private communications.
On Mastodon: @CatalystCoop@mastodon.energy
On BlueSky: @catalyst.coop
On Twitter: @CatalystCoop
Connect with us on LinkedIn
Play with our data and notebooks on Kaggle
Combine our data with ML models on HuggingFace
Learn more about us on our website: https://catalyst.coop
Subscribe to our announcements list for email updates.
Facebook
TwitterThe table IOU Zipcodes 2019 is part of the dataset Open Energy Data Initiative: U.S. Electric Utility Consumption and Rates ***, available at https://redivis.com/datasets/w5hb-cs453cj2k. It contains 52246 rows across 9 variables.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Data about power company service areas and their announcements about outages are critical for the effective coordination of resources after disasters, and also for building community and neighborhood resilience. As part of the 2015 White House Mapathon, the Department of Energy's Office of Electricity created a national geospatial database of power company service areas with pointers to public outage information (eg, through Twitter, web sites, and toll-free telephone numbers).
Mapathon participants researched public outage information state by state, and populated a lookup table so that disaster-impacted residents, tourists, first responders and relief volunteers can easily get to the information they need on scope and estimated restore times for power outages. This project benefited from participation of private and public sector folks who need this data for their work, and of third party app developers such as Red Cross and The Weather Channel who will incorporate this data into the information services they offer their users.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This spreadsheet contains information reported by over 200 investor-owned utilities to the Federal Energy Regulatory Commission in the annual filing FERC Form 1 for the years 1994-2019. It contains 1) annual capital costs for new transmission, distribution, and administrative infrastructure; 2) annual operation and maintenance costs for transmission, distribution, and utility business administration; 3) total annual MWh sales and sales by customer class; 4) annual peak demand in MW; and 5) total customer count and the number of customers by class.
Annual spending on new capital infrastructure is read from pages 204 to 207 of FERC Form 1, titled Electric Plant in Service. Annual transmission capital additions are recorded from Line 58, Column C - Total Transmission Plant Additions. Likewise, annual distribution capital additions are recorded from Line 75, Column C - Total Distribution Plant Additions. Administrative capital additions are recorded from Line 5, Column C - Total Intangible Plant Additions, and Line 99, Column C - Total General Plant Additions.
Operation and maintenance costs associated with transmission, distribution, and utility administration are read from pages 320 to 323 of FERC Form 1, titled Electric Operation and Maintenance Expenses. Annual transmission operation and maintenance are recorded from Line 99, Column B - Total Transmission Operation Expenses for Current Year, and Line 111, Column B - Total Transmission Maintenance Expenses for Current Year. Likewise, annual distribution operation and maintenance costs are recorded from Line 144, Column B - Total Distribution Operation Expenses, and Line 155, Column B - Total Distribution Maintenance Expenses. Administrative operation and maintenance costs are recorded from: Line 164, Column B - Total Customers Accounts Expenses; Line 171, Column B - Total Customer Service and Information Expenses; Line 178, Column B - Total Sales Expenses; and Line 197, Column B - Total Administrative and General Expenses.
The annual peak demand in MW over the year is read from page 401, titled Monthly Peaks and Output. The monthly peak demand is listed in Lines 29 to 40, Column D. The maximum of these monthly reports during each year is taken as the annual peak demand in MW. The annual energy sales and customer count data come from page 300, Electric Operating Revenues. The values are provided in Line 2 - Residential Sales, Line 4 - Commercial Sales, Line 5 - Industrial Sales, and Line 10 - Total Sales to Ultimate Consumers.
More information about the database is available in an associated report published by the University of Texas at Austin Energy Institute: https://live-energy-institute.pantheonsite.io/sites/default/files/UTAustin_FCe_TDA_2016.pdf
Also see an associated paper published in the journal Energy Policy:
Fares, Robert L., and Carey W. King. "Trends in transmission, distribution, and administration costs for US investor-owned electric utilities." Energy Policy 105 (2017): 354-362. https://doi.org/10.1016/j.enpol.2017.02.036
All data come from the Federal Energy Regulatory Commission FERC Form 1 Database available in Microsoft Visual FoxPro Format: https://www.ferc.gov/docs-filing/forms/form-1/data.asp
Facebook
Twitterhttps://www.mordorintelligence.com/privacy-policyhttps://www.mordorintelligence.com/privacy-policy
Utility and Energy Analytics Market Segmented by Deployment (On-Premises, Cloud and Hybrid), Component (Software, Services and More), Application (Meter Operations and Data Management, Load and Generation Forecasting and More), End-User (Generation Utilities, Transmission and Distribution Operators and More) Utility Type (Electric, Gas and More), and Geography. The Market Forecasts are Provided in Terms of Value (USD).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Map of the electric utility service areas in California.
Facebook
TwitterMap of the six electric investor owned utility (IOU) areas in California:- Bear Valley Electric Service- Liberty Utilities- PacifiCorp- PG&E: Pacific Gas & Electric Company- SDG&E: San Diego Gas & Electric Company- SCE: Southern California Edison
Facebook
TwitterOpen Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
This data set shows utility lines that provide services for: * power * water * communications * heating fuel They include: * communication lines/submerged communication lines * hydro lines/submerged hydro lines * natural gas pipelines/submerged natural gas pipelines * water pipelines/submerged water pipelines * unknown pipelines * unknown transmission lines This product requires the use of geographic information system (GIS) software.
Facebook
TwitterEIA previously collected sales and revenue data in a category called "Other." This category was defined as including activities such as public street highway lighting, other sales to public authorities, sales to railroads and railways, and interdepartmental sales. EIA has revised its survey to separate the transportation sales and reassign the other activities to the commercial and industrial sectors as appropriate.
<p class="Bodypara">This is an electric utility data file that includes
utility level retail sales of electricity and associated revenue by end-use sector, State, and reporting month. The data source is the survey: Form EIA-826, "Monthly Electric Utility Sales and Revenue Report with State Distributions." The Form EIA-826 is used to collect retail sales of electricity and associated revenue, each month, from a statistically chosen sample of electric utilities in the United States. The respondents to the Form EIA-826 are chosen from the Form EIA-861, "Annual Electric Utility Report." The data also include, for each State, a record (UTILITYID "000000") containing data values which represent the arithmetic differences between the "estimated" State totals and the sum of the retail sales and associated revenue data reported by the respondents to the Form EIA-826.
The data are compressed into a self-extracting (f826yyyy.exe) zip file. This self-extracting zip file expands into one DBF file (f826utilyyyy.dbf) that contains the yearly data and an ASCII text file (f826layoutyyyy.txt) that contains the file description and record layout for the data base structure. The
current year's file will be a year-to-date file and is maintained in
this monthly format until the data for the final month is finalized.
To expand the self-extracting zip file, type f826yyyy.exe
from a DOS window, or double click on the file name from File Manager
in Windows 3x or Windows Explorer in either Windows 95, Windows 98,
Windows 2000, XP, or ME. Or, click Start, then Run, then select name of
.EXE file to open, then "OK." (Requires approx. 600K space). Usually,
the current year's file will be a "year-to-date" file until the data for
the final month is finalized.
*Note: Substitute the applicable year for "yyyy" in the file name.
File Size: 200 k
Methodology is based on the "Model-Based Sampling, Inference and Imputation."
Contact:
Charlene Harris-Russell
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The databases contain all the technical, financial, and tariff data collected through the study "Making power affordable in Africa and viable for its utilities." The final study and background papers are available at http://www.worldbank.org/affordableviablepowerforafrica. The objective of making the database public is to make data collected through the study available to utility companies, regulators, and practitioners to provide benchmarks and help inform analysis. The databases will be updated from time to time to make corrections or updates for latest data available and therefore may differ from data that appears in the reports. This database is a publication of the African Renewable Energy Access Program (AFREA), a World Bank Trust Fund Grant Program funded by the Kingdom of the Netherlands through ESMAP. It was prepared by staff of the International Bank for Reconstruction and Development / The World Bank.
Facebook
TwitterIncrease the quality and quantity of electric and gas data that is made available to the public. More open data is needed to understand current trends and effectively provide input to our utility companies.
Facebook
Twitter
According to our latest research, the global Utility Data Mesh Platform market size reached USD 1.12 billion in 2024, driven by the increasing adoption of digital transformation initiatives across utility sectors. The market is expected to grow at a robust CAGR of 18.9% from 2025 to 2033, reaching a forecasted value of USD 6.32 billion by 2033. The primary growth factors fueling this expansion include the rising need for real-time data analytics, enhanced grid management, and the integration of distributed energy resources, all of which are propelling utilities to adopt advanced data mesh platforms for improved operational efficiency and decision-making.
One of the most significant drivers of the Utility Data Mesh Platform market is the escalating volume and complexity of data generated by modern utility operations. Utilities are increasingly deploying smart meters, IoT sensors, and distributed energy resources, all of which contribute to the exponential growth of data. Traditional data management architectures are struggling to keep pace with the need for real-time access, analytics, and actionable insights. Data mesh platforms offer a decentralized approach, enabling domain-oriented teams to own, manage, and serve data as a product. This paradigm shift is helping utilities break data silos, accelerate digital transformation, and improve agility in responding to dynamic market changes and regulatory requirements.
Another key growth factor is the urgent need for enhanced grid management and reliability in the face of evolving energy landscapes. The proliferation of renewable energy sources, electric vehicles, and distributed generation is increasing grid complexity, making it essential for utilities to have advanced analytics and data sharing capabilities. Utility Data Mesh Platforms facilitate seamless data integration across disparate systems and departments, supporting predictive maintenance, outage management, and load forecasting. These capabilities are crucial for maintaining grid stability, reducing operational costs, and meeting sustainability goals, which are top priorities for utilities worldwide.
Furthermore, regulatory pressures and the demand for customer-centric services are pushing utilities to modernize their IT infrastructure. Compliance with evolving energy regulations, data privacy laws, and environmental standards requires utilities to have transparent, auditable, and easily accessible data. Data mesh platforms provide the necessary tools for robust data governance, lineage, and security, enabling utilities to ensure compliance while delivering personalized services to customers. The ability to rapidly adapt to regulatory changes and customer expectations is becoming a competitive differentiator in the utility industry, further boosting the adoption of Utility Data Mesh Platforms.
From a regional perspective, North America and Europe are leading the adoption of Utility Data Mesh Platforms, driven by advanced digital infrastructure, stringent regulatory frameworks, and a high concentration of innovative utility companies. Asia Pacific is emerging as a high-growth market, fueled by rapid urbanization, increasing investments in smart grids, and government initiatives to modernize utility operations. Latin America and the Middle East & Africa are also witnessing gradual adoption, supported by expanding utility networks and the need for improved operational efficiency. Each region presents unique challenges and opportunities, shaping the overall trajectory of the global market.
The Component segment of the Utility Data Mesh Platform Market is bifurcated into Software and Services, each playing a critical role in shaping the market landscape. Software solutions form the backbone of data mesh platforms, providing the necessary infrastructure for data discovery, cataloging, security, and analytics. These platforms are designed to be highly scalable and interoperable, enabling utilities t
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Electric utilities report a huge amount of information to the US government and other public agencies. This includes yearly, monthly, and even hourly data about fuel burned, electricity generated, operating expenses, power plant usage patterns and emissions. Unfortunately, much of this data is not released in well documented, ready-to-use, machine readable formats. Data from different agencies tends not to be standardized or easily used in tandem. Several commercial data services clean, package, and re-sell this this data, but at prices which are too high to be accessible to many smaller stakeholders.
The Public Utility Data Liberation (PUDL) project takes information that’s already publicly available, and makes it publicly usable, by cleaning, standardizing, and cross-linking utility data from different sources in a single database. Thus far our primary focus has been on fuel use, generation, operating costs, and operation history. It currently includes data from:
We archive snapshots of the raw inputs on Zenodo and all our data processing uses those snapshots as a starting place for reproducibility.
You can find the source code that generates this database in the PUDL repository on GitHub. The PUDL project is coordinated by Catalyst Cooperative.
The data is updated nightly by our automated nightly builds. When they are successful, new data is uploaded to the AWS Open Data Registry
We publish PUDL Data Dictionaries on Read the Docs which provide more descriptive information about the data.
Dataset header image courtesy of Gerry Machen via Flickr under a CC-BY-ND license