Hourly Precipitation Data (HPD) is digital data set DSI-3240, archived at the National Climatic Data Center (NCDC). The primary source of data for this file is approximately 5,500 US National Weather Service (NWS), Federal Aviation Administration (FAA), and cooperative observer stations in the United States of America, Puerto Rico, the US Virgin Islands, and various Pacific Islands. The earliest data dates vary considerably by state and region: Maine, Pennsylvania, and Texas have data since 1900. The western Pacific region that includes Guam, American Samoa, Marshall Islands, Micronesia, and Palau have data since 1978. Other states and regions have earliest dates between those extremes. The latest data in all states and regions is from the present day. The major parameter in DSI-3240 is precipitation amounts, which are measurements of hourly or daily precipitation accumulation. Accumulation was for longer periods of time if for any reason the rain gauge was out of service or no observer was present. DSI 3240_01 contains data grouped by state; DSI 3240_02 contains data grouped by year.
U.S. 15 Minute Precipitation Data is digital data set DSI-3260, archived at the National Climatic Data Center (NCDC). This is precipitation data. The primary source of data for this file is approximately 2,000 mostly U.S. weather stations operated or managed by the U.S. National Weather Service. Stations are primary, secondary, or cooperative observer sites that have the capability to measure precipitation at 15 minute intervals. This dataset contains 15-minute precipitation data (reported 4 times per hour, if precip occurs) for U.S. stations along with selected non-U.S. stations in U.S. territories and associated nations. It includes major city locations and many small town locations. Daily total precipitation is also included as part of the data record. NCDC has in archive data from most states as far back as 1970 or 1971, and continuing to the present day. The major parameter is precipitation amounts at 15 minute intervals, when precipitation actually occurs.
The GPCP Daily analysis is a companion to the GPCP Monthly analysis, and provides globally complete precipitation estimates at a spatial resolution of one degree latitude-longitude and daily time scale from October 1996 to the present. Although derived using both some of the same, but also different, data sets and methods than used in the GPCP Monthly analysis, the GPCP Daily "adds up" to the GPCP Monthly. The GPCP Daily V1.3 analysis is currently computed by the University of Maryland and submitted to NCEI. The routine update of the product takes place two months after the end of the month, once all input data sets become available. The data set is part of World Climate Research Program (WCRP) and GEWEX activities, being part of the array of data sets describing the water and energy cycles of the planet under the auspices of the GEWEX Data and Assessment Panel (GDAP). Details of input data sets and methods can be found in: Huffman, G.J., R.F. Adler, M. Morrissey, D.T. Bolvin, S. Curtis, R. Joyce, B McGavock, J. Susskind, 2001: Global Precipitation at One-Degree Daily Resolution from Multi-Satellite Observations. J. Hydrometeor., 2(1), 36-50.
This dataset replaces the previous Time Bias Corrected Divisional Temperature-Precipitation Drought Index. The new divisional data set (NClimDiv) is based on the Global Historical Climatological Network-Daily (GHCN-D) and makes use of several improvements to the previous data set. For the input data, improvements include additional station networks, quality assurance reviews and temperature bias adjustments. Perhaps the most extensive improvement is to the computational approach, which now employs climatologically aided interpolation. This 5km grid based calculation nCLIMGRID helps to address topographic and network variability. This data set is primarily used by the National Oceanic and Atmospheric Administration (NOAA) National Climatic Data Center (NCDC) to issue State of the Climate Reports on a monthly basis. These reports summarize recent temperature and precipitation conditions and long-term trends at a variety of spatial scales, the smallest being the climate division level. Data at the climate division level are aggregated to compute statewide, regional and national snapshots of climate conditions. For CONUS, the period of record is from 1895-present. Derived quantities such as Standardized precipitation Index (SPI), Palmer Drought Indices (PDSI, PHDI, PMDI, and ZNDX) and degree days are also available for the CONUS sites. In March 2015, data for thirteen Alaskan climate divisions were added to the NClimDiv data set. Data for the new Alaskan climate divisions begin in 1925 through the present and are included in all monthly updates. Alaskan climate data include the following elements for divisional and statewide coverage: average temperature, maximum temperature (highs), minimum temperature (lows), and precipitation. The Alaska NClimDiv data were created and updated using similar methodology as that for the CONUS, but with a different approach to establishing the underlying climatology. The Alaska data are built upon the 1971-2000 PRISM averages whereas the CONUS values utilize a base climatology derived from the NClimGrid data set. As of November 2018, NClimDiv includes county data and additional inventory files.
Note that 2013 and 2014 datasets are available for download in the attachment tab below.The journal article describing GHCN-Daily is: Menne, M.J., I. Durre, R.S. Vose, B.E. Gleason, and T.G. Houston, 2012: An overview of the Global Historical Climatology Network-Daily Database. Journal of Atmospheric and Oceanic Technology, 29, 897-910, doi:10.1175/JTECH-D-11-00103.1.Menne, M.J., I. Durre, B. Korzeniewski, S. McNeal, K. Thomas, X. Yin, S. Anthony, R. Ray, R.S. Vose, B.E.Gleason, and T.G. Houston, 2012: Global Historical Climatology Network - Daily (GHCN-Daily), Version 3. [indicate subset used following decimal, e.g. Version 3.12]. NOAA National Climatic Data Center. http://doi.org/10.7289/V5D21VHZ
The Cooperative Observer Program (COOP) Hourly Precipitation Data (HPD) consists of quality controlled precipitation amounts, which are measurements of hourly accumulation of precipitation, including rain and snow for approximately 2,000 observing stations around the country, and several U.S. territories in the Caribbean and Pacific from the National Weather Service (NWS) Fischer-Porter Network. This new version of COOP HPD with faster automations due updated stations will result in faster access for the public. The data are from 1940 to present, depending upon when each station was installed. These stations, nearly all of which were part of HPD version 1, also known as DSI-3240, were gradually upgraded from paper punch tape data recording systems to a more modern electronic data logger system from 2004-2013.
The 15-min gauge depth time series are processed at NCEI via automated quality control and filtering algorithms to identify and remove spurious observations from noise and malfunctioning equipment, and also those due to natural phenomena such as evaporation and the necessary occasional emptying of the gauge. Hourly precipitation totals are then computed from the 15-min data and are quality controlled by a suite of automated algorithms that combine checks on the daily and hourly time scale. Data and metadata are ingested on a daily basis and combined in a single integrated dataset.
As with the legacy punch paper instrumentation, the electronic loggers record rain gauge depth every 15 minutes. Monthly site visits to each station are still performed, but instead of collecting punched paper (that would subsequently need conversion to a digital record via a MITRON reader), data are downloaded from the station's datalogger to a memory stick and centrally collected at the local Weather Forecast Office (WFO) for all stations in the WFO area. The WFO subsequently combines all data into a single tar file and transfers the data to NCEI via ftp upload nominally each month.
This updated HPD includes the historical data from the punch paper era and the recent digital era in order to provide the full period of record for each location. These data are formatted consistent with practices for NCEI Global In-situ datasets.
Global Surface Summary of the Day is derived from The Integrated Surface Hourly (ISH) dataset. The ISH dataset includes global data obtained from the USAF Climatology Center, located in the Federal Climate Complex with NCDC. The latest daily summary data are normally available 1-2 days after the date-time of the observations used in the daily summaries. The online data files begin with 1929 and are at the time of this writing at the Version 8 software level. Over 9000 stations' data are typically available. The daily elements included in the dataset (as available from each station) are: Mean temperature (.1 Fahrenheit) Mean dew point (.1 Fahrenheit) Mean sea level pressure (.1 mb) Mean station pressure (.1 mb) Mean visibility (.1 miles) Mean wind speed (.1 knots) Maximum sustained wind speed (.1 knots) Maximum wind gust (.1 knots) Maximum temperature (.1 Fahrenheit) Minimum temperature (.1 Fahrenheit) Precipitation amount (.01 inches) Snow depth (.1 inches) Indicator for occurrence of: Fog, Rain or Drizzle, Snow or Ice Pellets, Hail, Thunder, Tornado/Funnel Cloud Global summary of day data for 18 surface meteorological elements are derived from the synoptic/hourly observations contained in USAF DATSAV3 Surface data and Federal Climate Complex Integrated Surface Hourly (ISH). Historical data are generally available for 1929 to the present, with data from 1973 to the present being the most complete. For some periods, one or more countries' data may not be available due to data restrictions or communications problems. In deriving the summary of day data, a minimum of 4 observations for the day must be present (allows for stations which report 4 synoptic observations/day). Since the data are converted to constant units (e.g, knots), slight rounding error from the originally reported values may occur (e.g, 9.9 instead of 10.0). The mean daily values described below are based on the hours of operation for the station. For some stations/countries, the visibility will sometimes 'cluster' around a value (such as 10 miles) due to the practice of not reporting visibilities greater than certain distances. The daily extremes and totals--maximum wind gust, precipitation amount, and snow depth--will only appear if the station reports the data sufficiently to provide a valid value. Therefore, these three elements will appear less frequently than other values. Also, these elements are derived from the stations' reports during the day, and may comprise a 24-hour period which includes a portion of the previous day. The data are reported and summarized based on Greenwich Mean Time (GMT, 0000Z - 2359Z) since the original synoptic/hourly data are reported and based on GMT.
The U.S. Climate Normals are a large suite of data products that provide information about typical climate conditions for thousands of locations across the United States. Normals act both as a ruler to compare today’s weather and tomorrow’s forecast, and as a predictor of conditions in the near future. The official normals are calculated for a uniform 30 year period, and consist of annual/seasonal, monthly, daily, and hourly averages and statistics of temperature, precipitation, and other climatological variables from almost 15,000 U.S. weather stations.
NCEI generates the official U.S. normals every 10 years in keeping with the needs of our user community and the requirements of the World Meteorological Organization (WMO) and National Weather Service (NWS). The 1991–2020 U.S. Climate Normals are the latest in a series of decadal normals first produced in the 1950s. These data allow travelers to pack the right clothes, farmers to plant the best crop varieties, and utilities to plan for seasonal energy usage. Many other important economic decisions that are made beyond the predictive range of standard weather forecasts are either based on or influenced by climate normals.
Daily precipitation at five stations in or near the LEF have been compiled by the National Climate Data Center. Here, WE present monthly sums of precipitation (P, in cm). Data are summarized through July 1992. Station details are as follows:LocationLongitudeLatitudeElevation (in meters)Period CoveredType of record (P= precipitation, T= Temperature)Pico del Este 18 15' 65 45'10511969-1992P,TRio Grande18 21'65 49'1071956-1984PRio Blanco Upper18 16'65 47'4391955-1974PRio Blanco Lower18 14'65 47'351955-1992PFajardo18 20'65 39'121931- 1992P,T Note: Max and Min temperature can be found at: http://luq.lternet.edu/data/luqmetadata71 and http://luq.lternet.edu/data/luqmetadata72, respectively
The NOAA Cooperative Observer Program (COOP) 15-Minute Precipitation Data consists of quality controlled precipitation amounts, which are measurements of 15 minute accumulation of precipitation, including rain and snow for approximately 2,000 observing stations around the country, and several U.S. territories in the Caribbean and Pacific operated or managed by the NOAA National Weather Service (NWS). Stations are primary, secondary, or cooperative observer sites that have the capability to measure precipitation at 15 minute intervals. This dataset contains 15-minute precipitation data (reported 4 times per hour, if precipitation occurred) for U.S. stations along with selected non-U.S. stations in U.S. territories and associated nations. It includes major city locations and many small town locations. Daily total precipitation is also included as part of the data record. The dataset period of record is from May 1970 to December 2013. The dataset is archived by the NOAA National Centers for Environmental Information (NCEI).
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
These daily weather records were compiled from a subset of stations in the Global Historical Climatological Network (GHCN)-Daily dataset. A weather record is considered broken if the value exceeds the maximum (or minimum) value recorded for an eligible station. A weather record is considered tied if the value is the same as the maximum (or minimum) value recorded for an eligible station. Daily weather parameters include Highest Min/Max Temperature, Lowest Min/Max Temperature, Highest Precipitation, Highest Snowfall and Highest Snow Depth. All stations meet defined eligibility criteria. For this application, a station is defined as the complete daily weather records at a particular location, having a unique identifier in the GHCN-Daily dataset. For a station to be considered for any weather parameter, it must have a minimum of 30 years of data with more than 182 days complete in each year. This is effectively a 30-year record of service requirement, but allows for inclusion of some stations which routinely shut down during certain seasons. Small station moves, such as a move from one property to an adjacent property, may occur within a station history. However, larger moves, such as a station moving from downtown to the city airport, generally result in the commissioning of a new station identifier. This tool treats each of these histories as a different station. In this way, it does not thread the separate histories into one record for a city. Records Timescales are characterized in three ways. In order of increasing noteworthiness, they are Daily Records, Monthly Records and All Time Records. For a given station, Daily Records refers to the specific calendar day: (e.g., the value recorded on March 7th compared to every other March 7th). Monthly Records exceed all values observed within the specified month (e.g., the value recorded on March 7th compared to all values recorded in every March). All-Time Records exceed the record of all observations, for any date, in a station's period of record. The Date Range and Location features are used to define the time and location ranges which are of interest to the user. For example, selecting a date range of March 1, 2012 through March 15, 2012 will return a list of records broken or tied on those 15 days. The Location Category and Country menus allow the user to define the geographic extent of the records of interest. For example, selecting Oklahoma will narrow the returned list of records to those that occurred in the state of Oklahoma, USA. The number of records broken for several recent periods is summarized in the table and updated daily. Due to late-arriving data, the number of recent records is likely underrepresented in all categories, but the ratio of records (warm to cold, for example) should be a fairly strong estimate of a final outcome. There are many more precipitation stations than temperature stations, so the raw number of precipitation records will likely exceed the number of temperature records in most climatic situations.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Historical changes of annual temperature and precipitation indices at selected 210 U.S. cities
This dataset provide:
Annual average temperature, total precipitation, and temperature and precipitation extremes calculations for 210 U.S. cities.
Historical rates of changes in annual temperature, precipitation, and the selected temperature and precipitation extreme indices in the 210 U.S. cities.
Estimated thresholds (reference levels) for the calculations of annual extreme indices including warm and cold days, warm and cold nights, and precipitation amount from very wet days in the 210 cities.
Annual average of daily mean temperature, Tmax, and Tmin are included for annual average temperature calculations. Calculations were based on the compiled daily temperature and precipitation records at individual cities.
Temperature and precipitation extreme indices include: warmest daily Tmax and Tmin, coldest daily Tmax and Tmin , warm days and nights, cold days and nights, maximum 1-day precipitation, maximum consecutive 5-day precipitation, precipitation amounts from very wet days.
Number of missing daily Tmax, Tmin, and precipitation values are included for each city.
Rates of change were calculated using linear regression, with some climate indices applied with the Box-Cox transformation prior to the linear regression.
The historical observations from ACIS belong to Global Historical Climatological Network - daily (GHCN-D) datasets. The included stations were based on NRCC’s “ThreadEx” project, which combined daily temperature and precipitation extremes at 255 NOAA Local Climatological Locations, representing all large and medium size cities in U.S. (See Owen et al. (2006) Accessing NOAA Daily Temperature and Precipitation Extremes Based on Combined/Threaded Station Records).
Resources:
See included README file for more information.
Additional technical details and analyses can be found in: Lai, Y., & Dzombak, D. A. (2019). Use of historical data to assess regional climate change. Journal of climate, 32(14), 4299-4320. https://doi.org/10.1175/JCLI-D-18-0630.1
Other datasets from the same project can be accessed at: https://kilthub.cmu.edu/projects/Use_of_historical_data_to_assess_regional_climate_change/61538
ACIS database for historical observations: http://scacis.rcc-acis.org/
GHCN-D datasets can also be accessed at: https://www.ncei.noaa.gov/data/global-historical-climatology-network-daily/
Station information for each city can be accessed at: http://threadex.rcc-acis.org/
2024 August updated -
Annual calculations for 2022 and 2023 were added.
Linear regression results and thresholds for extremes were updated because of the addition of 2022 and 2023 data.
Note that future updates may be infrequent.
2022 January updated -
Annual calculations for 2021 were added.
Linear regression results and thresholds for extremes were updated because of the addition of 2021 data.
2021 January updated -
Annual calculations for 2020 were added.
Linear regression results and thresholds for extremes were updated because of the addition of 2020 data.
2020 January updated -
Annual calculations for 2019 were added.
Linear regression results and thresholds for extremes were updated because of the addition of 2019 data.
Thresholds for all 210 cities were combined into one single file – Thresholds.csv.
2019 June updated -
Baltimore was updated with the 2018 data (previously version shows NA for 2018) and new ID to reflect the GCHN ID of Baltimore-Washington International AP. city_info file was updated accordingly.
README file was updated to reflect the use of "wet days" index in this study. The 95% thresholds for calculation of wet days utilized all daily precipitation data from the reference period and can be different from the same index from some other studies, where only days with at least 1 mm of precipitation were utilized to calculate the thresholds. Thus the thresholds in this study can be lower than the ones that would've be calculated from the 95% percentiles from wet days (i.e., with at least 1 mm of precipitation).
The CPC Unified Gauge-Based Analysis of Global Daily Precipitation dataset offers daily precipitation estimates over land from 1979 to the present. Developed by NOAA's Climate Prediction Center (CPC), it leverages an optimal interpolation technique to combine data from a global network of rain gauges, with over 30,000 gauges contributing to …
This data package contains locally verified daily meteorological observations from a NOAA National Weather Service station located at the USDA Jornada Experimental Range headquarters in southern New Mexico, USA. Daily data has been collected there by USDA staff since 1914 for minimum and maximum air temperature and daily accumulated precipitation using standard U.S. climatological service instrumentation and procedures. The included data were verified and transcribed directly from the original paper data sheets and have undergone quality control and assurance procedures different than those in place at NOAA. These data therefore differ from those directly downloadable from NOAA servers. Local verification and transcription of observations from the data sheets ceased in 2006 and data are now directly entered to the NOAA system. Therefore, this dataset is complete and will no longer be added to.All observations from this weather station have also undergone NOAA QA/QC procedures and those data are available by accessing the Jornada Experimental Range, NM US GHCN station through the National Climatic Data Center portal (https://www.ncdc.noaa.gov/cdo-web/datasets/GHCND/stations/GHCND:USC00294426/detail - daily and monthly data are available).
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
SUMMARY
This dataset contains historical hourly precipitation observations for selected U.S. stations.
SOURCE
A script was used to scrape this dataset from the following URL on January 31, 2025:
https://www.ncei.noaa.gov/metadata/geoportal/rest/metadata/item/gov.noaa.ncdc:C00313/html
The dataset's original distributor was the United States National Oceanic and Atmospheric Association (NOAA) National Centers for Environmental Information (NCEI). The description below was copied from a NOAA NCEI webpage on January 31, 2025.
(Note that the header and thumbnail image were downloaded from Unsplash. They were not present in the original dataset.)
DESCRIPTION
Hourly Precipitation Data (HPD) is digital data set DSI-3240, archived at the National Climatic Data Center (NCDC). The primary source of data for this file is approximately 5,500 US National Weather Service (NWS), Federal Aviation Administration (FAA), and cooperative observer stations in the United States of America, Puerto Rico, the US Virgin Islands, and various Pacific Islands. The earliest data dates vary considerably by state and region: Maine, Pennsylvania, and Texas have data since 1900. The western Pacific region that includes Guam, American Samoa, Marshall Islands, Micronesia, and Palau have data since 1978. Other states and regions have earliest dates between those extremes. The latest data in all states and regions is from the present day. The major parameter in DSI-3240 is precipitation amounts, which are measurements of hourly or daily precipitation accumulation. Accumulation was for longer periods of time if for any reason the rain gauge was out of service or no observer was present. DSI 3240_01 contains data grouped by state; DSI 3240_02 contains data grouped by year.
This dataset was originally established as a subset of relevant NOAA daily precipitation data. This has been replaced with links to NOAA station websites which contain this data, please visit these links in the dataset file here. Previously, daily precipitation for 5 stations in or near the LEF were compiled from the NOAA National Climate Data Center and posted here.
Support for this work was provided by grants BSR-8811902, DEB-9411973, DEB-9705814 , DEB-0080538, DEB-0218039 , DEB-0620910 , DEB-1239764, DEB-1546686, and DEB-1831952 from the National Science Foundation to the University of Puerto Rico as part of the Luquillo Long-Term Ecological Research Program. Additional support provided by the University of Puerto Rico and the International Institute of Tropical Forestry, USDA Forest Service.
The U.S. Daily Climate Normals for 1981 to 2010 are 30-year averages of meteorological parameters for thousands of U.S. stations located across the 50 states, as well as U.S. territories, commonwealths, the Compact of Free Association nations, and one station in Canada. NOAA Climate Normals are a large suite of data products that provide users with many tools to understand typical climate conditions for thousands of locations across the United States. As many NWS stations as possible are used, including those from the NWS Cooperative Observer Program (COOP) Network as well as some additional stations that have a Weather Bureau Army-Navy (WBAN) station identification number, including stations from the Climate Reference Network (CRN). The comprehensive U.S. Climate Normals dataset includes various derived products including daily air temperature normals (including maximum and minimum temperature normal, heating and cooling degree day normal, and others), precipitation normals (including snowfall and snow depth, percentiles, frequencies and other), and hourly normals (all normal derived from hourly data including temperature, dew point, heat index, wind chill, wind, cloudiness, heating and cooling degree hours, pressure normals). Users can access the data either by product or by station. Included in the dataset is extensive documentation to describe station metadata, filename descriptions, and methodology of producing the data. All data utilized in the computation of the 1981-2010 Climate Normals were taken from the ISD Lite (a subset of derived Integrated Surface Data), the Global Historical Climatology Network-Daily dataset, and standardized monthly temperature data (COOP). These source datasets (including intermediate datasets used in the computation of products) are also archived at the NOAA NCDC.
The NOAA Monthly U.S. Climate Gridded Dataset (NClimGrid) consists of four climate variables derived from the GHCN-D dataset: maximum temperature, minimum temperature, average temperature and precipitation. Each file provides monthly values in a 5x5 lat/lon grid for the Continental United States. Data is available from 1895 to the present. On an annual basis, approximately one year of "final" nClimGrid will be submitted to replace the initially supplied "preliminary" data for the same time period. Users should be sure to ascertain which level of data is required for their research.
EpiNOAA is an analysis ready dataset that consists of a daily time-series of nClimGrid measures (maximum temperature, minimum temperature, average temperature, and precipitation) at the county scale. Each file provides daily values for the Continental United States. Data are available from 1951 to the present. Daily data are updated every 3 days with a preliminary data file and replaced with the scaled (i.e., quality controlled) data file every three months. This derivative data product is an enhancement from the original daily nClimGrid dataset in that all four weather parameters are now packaged into one file and assembled in a daily time-series format. In addition to a direct download option, an R package and web interface has been developed to streamline access to the final data product. These options allow end users three separate access modes to arrive at a customized dataset unique to each end user’s application. Users should be sure to review the data documentation to inform which level of data is required for their research.
Daily precipitation, maximum, and minimum air temperature at five stations in or near the LEF have been compiled by the National Climate Data Center. Here I present monthly sums of precipitation (P, in cm) and monthly averages (T, in oF) of the maximum and minimum air temperatures. Data are summarized through July 1992 and additional data will be added when it becomes available.
The Global Historical Climatology Network daily (GHCNd) is an integrated database of daily climate summaries from land surface stations across the globe. GHCNd is made up of daily climate records from numerous sources that have been integrated and subjected to a common suite of quality assurance reviews.
GHCNd contains records from more than 100,000 stations in 180 countries and territories. NCEI provides numerous daily variables, including maximum and minimum temperature, total daily precipitation, snowfall, and snow depth. About half the stations only report precipitation. Both record length and period of record vary by station and cover intervals ranging from less than a year to more than 175 years.
The process of integrating data from multiple sources into GHCNd takes place in three steps:
%3C!-- --%3E
The process performs the first two of these steps whenever a new source dataset or additional stations become available, while the mingling of data is part of the automated processing that creates GHCNd on a regular basis.
A station within a source dataset is considered for inclusion in GHCNd if it meets all of the following conditions:
%3C!-- --%3E
The next step is to determine for each station in the source dataset if data for the same location are already contained in GHCNd, or if the station represents a new site. Whenever possible, stations are matched on the basis of network affiliation and station identification number. If no match exists, then there is consultation from different networks for existing cross-referenced lists that identify the correspondence of station identification numbers.
For example, data for Alabaster Shelby County Airport, Alabama, USA, is stored under Cooperative station ID 010116 in NCEI's datasets 3200 and 3206 as well as in the data stream from the High Plains Regional Climate Center; they are combined into one GHCNd record based on the ID. In data set 3210 and the various sources for ASOS stations, however, the data for this location are stored under WBAN ID 53864 and must be matched with the corresponding Cooperative station ID using NCEI's Master Station History Record.
A third approach is to match stations on the basis of their names and location. This strategy is more difficult to automate than the other two approaches because identification of multiple stations within the same city or town, with the same name and small differences in coordinates, can be the result of either differences in accuracy or the existence of multiple stations in close proximity to each other. As a result, the employment of the third approach is used only when stations cannot be matched on the basis of station identification numbers or cross-reference information. This is the case, for example, when there is a need for matching stations outside the U.S. whose data originate from the Global Summary of the Day dataset and from the International Collection.
The implementation of the above classification strategies yields a list of GHCNd stations and an inventory of the source datasets for integration of each station. This list forms the basis for integrating, or mingling, the data from the various sources to create GHCNd. Mingling takes place according to a hierarchy of data sources and in a manner that attempts to maximize the amount of data included while also minimizing the degree to which data from sources with different characteristics are mixed. While the mingling of precipitation, snowfall, and snow depth are separate, consideration of maximum and minimum temperatures is performed together in order to ensure the temperatures for a particular station and day always originate from the same source. Data from the Global Summary of the Day dataset are used only if no observations are available from any other source for that station, month, and element. Among the other sources, consideration of each day is made individually; if an observation for a particular station and day is available from more than one source, GHCNd uses the observation from the most preferred source available.
Several criteria are used for the hierarchy of data sources used in cases of overlap. In gener
Hourly Precipitation Data (HPD) is digital data set DSI-3240, archived at the National Climatic Data Center (NCDC). The primary source of data for this file is approximately 5,500 US National Weather Service (NWS), Federal Aviation Administration (FAA), and cooperative observer stations in the United States of America, Puerto Rico, the US Virgin Islands, and various Pacific Islands. The earliest data dates vary considerably by state and region: Maine, Pennsylvania, and Texas have data since 1900. The western Pacific region that includes Guam, American Samoa, Marshall Islands, Micronesia, and Palau have data since 1978. Other states and regions have earliest dates between those extremes. The latest data in all states and regions is from the present day. The major parameter in DSI-3240 is precipitation amounts, which are measurements of hourly or daily precipitation accumulation. Accumulation was for longer periods of time if for any reason the rain gauge was out of service or no observer was present. DSI 3240_01 contains data grouped by state; DSI 3240_02 contains data grouped by year.