Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Some say climate change is the biggest threat of our age while others say it’s a myth based on dodgy science. We are turning some of the data over to you so you can form your own view.
Even more than with other data sets that Kaggle has featured, there’s a huge amount of data cleaning and preparation that goes into putting together a long-time study of climate trends. Early data was collected by technicians using mercury thermometers, where any variation in the visit time impacted measurements. In the 1940s, the construction of airports caused many weather stations to be moved. In the 1980s, there was a move to electronic thermometers that are said to have a cooling bias.
Given this complexity, there are a range of organizations that collate climate trends data. The three most cited land and ocean temperature data sets are NOAA’s MLOST, NASA’s GISTEMP and the UK’s HadCrut.
We have repackaged the data from a newer compilation put together by the Berkeley Earth, which is affiliated with Lawrence Berkeley National Laboratory. The Berkeley Earth Surface Temperature Study combines 1.6 billion temperature reports from 16 pre-existing archives. It is nicely packaged and allows for slicing into interesting subsets (for example by country). They publish the source data and the code for the transformations they applied. They also use methods that allow weather observations from shorter time series to be included, meaning fewer observations need to be thrown away.
In this dataset, we have include several files:
Global Land and Ocean-and-Land Temperatures (GlobalTemperatures.csv):
Other files include:
The raw data comes from the Berkeley Earth data page.
Facebook
TwitterThis dataset replaces the previous Time Bias Corrected Divisional Temperature-Precipitation Drought Index. The new divisional data set (NClimDiv) is based on the Global Historical Climatological Network-Daily (GHCN-D) and makes use of several improvements to the previous data set. For the input data, improvements include additional station networks, quality assurance reviews and temperature bias adjustments. Perhaps the most extensive improvement is to the computational approach, which now employs climatologically aided interpolation. This 5km grid based calculation nCLIMGRID helps to address topographic and network variability. This data set is primarily used by the National Oceanic and Atmospheric Administration (NOAA) National Climatic Data Center (NCDC) to issue State of the Climate Reports on a monthly basis. These reports summarize recent temperature and precipitation conditions and long-term trends at a variety of spatial scales, the smallest being the climate division level. Data at the climate division level are aggregated to compute statewide, regional and national snapshots of climate conditions. For CONUS, the period of record is from 1895-present. Derived quantities such as Standardized precipitation Index (SPI), Palmer Drought Indices (PDSI, PHDI, PMDI, and ZNDX) and degree days are also available for the CONUS sites. In March 2015, data for thirteen Alaskan climate divisions were added to the NClimDiv data set. Data for the new Alaskan climate divisions begin in 1925 through the present and are included in all monthly updates. Alaskan climate data include the following elements for divisional and statewide coverage: average temperature, maximum temperature (highs), minimum temperature (lows), and precipitation. The Alaska NClimDiv data were created and updated using similar methodology as that for the CONUS, but with a different approach to establishing the underlying climatology. The Alaska data are built upon the 1971-2000 PRISM averages whereas the CONUS values utilize a base climatology derived from the NClimGrid data set. As of November 2018, NClimDiv includes county data and additional inventory files.
Facebook
TwitterThis dataset was created by Muthu Priya
Facebook
TwitterNOAA's Climate Data Records (CDRs) are robust, sustainable, and scientifically sound climate records that provide trustworthy information on how, where, and to what extent the land, oceans, atmosphere and ice sheets are changing. These datasets are thoroughly vetted time series measurements with the longevity, consistency, and continuity to assess and measure climate variability and change. NOAA CDRs are vetted using standards established by the National Research Council (NRC).
Climate Data Records are created by merging data from surface, atmosphere, and space-based systems across decades. NOAA’s Climate Data Records provides authoritative and traceable long-term climate records. NOAA developed CDRs by applying modern data analysis methods to historical global satellite data. This process can clarify the underlying climate trends within the data and allows researchers and other users to identify economic and scientific value in these records. NCEI maintains and extends CDRs by applying the same methods to present-day and future satellite measurements.
Terrestrial CDRs are composed of sensor data that have been improved and quality controlled over time, together with ancillary calibration data.
Facebook
TwitterThis dataset includes processed climate change datasets related to climatology, hydrology, and water operations. The climatological data provided are change factors for precipitation and reference evapotranspiration gridded over the entire State. The hydrological data provided are projected stream inflows for major streams in the Central Valley, and streamflow change factors for areas outside of the Central Valley and smaller ungaged watersheds within the Central Valley. The water operations data provided are Central Valley reservoir outflows, diversions, and State Water Project (SWP) and Central Valley Project (CVP) water deliveries and select streamflow data. Most of the Central Valley inflows and all of the water operations data were simulated using the CalSim II model and produced for all projections.
These data were originally developed for the California Water Commission’s Water Storage Investment Program (WSIP). The WSIP data used as the basis for these climate change resources along with the technical reference document are located here: https://data.cnra.ca.gov/dataset/climate-change-projections-wsip-2030-2070. Additional processing steps were performed to improve user experience, ease of use for GSP development, and for Sustainable Groundwater Management Act (SGMA) implementation. Furthermore, the data, tools, and guidance may be useful for purposes other than sustainable groundwater management under SGMA.
Data are provided for projected climate conditions centered around 2030 and 2070. The climate projections are provided for these two future climate periods, and include one scenario for 2030 and three scenarios for 2070: a 2030 central tendency, a 2070 central tendency, and two 2070 extreme scenarios (i.e., one drier with extreme warming and one wetter with moderate warming). The climate scenario development process represents a climate period analysis where historical interannual variability from January 1915 through December 2011 is preserved while the magnitude of events may be increased or decreased based on projected changes in precipitation and air temperature from general circulation models.
DWR has collaborated with Lawrence Berkeley National Laboratory to improve the quality of the 2070 extreme scenarios. The 2070 extreme scenario update utilizes an improved climate period analysis method known as "quantile delta mapping" to better capture the GCM-projected change in temperature and precipitation. A technical note on the background and results of this process is provided here: https://data.cnra.ca.gov/dataset/extreme-climate-change-scenarios-for-water-supply-planning/resource/f2e1c61a-4946-4863-825f-e6d516b433ed.
Note: the original version of the 2070 extreme scenarios can be accessed in the archive posted here: https://data.cnra.ca.gov/dataset/sgma-climate-change-resources/resource/51b6ee27-4f78-4226-8429-86c3a85046f4
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
We present a continuous land climate reconstruction dataset extending from 60 kyr before present to the pre-industrial period at 0.5deg resolution on a monthly timestep for 0degN to 90degN. It has been generated from 42 discrete snapshot simulations using the HadCM3B-M2.1 coupled general circulation model. We incorporate Dansgaard-Oeschger (DO) and Heinrich events to represent millennial scale variability, based on a temperature reconstruction from Greenland ice-cores, with a spatial fingerprint based on a freshwater hosing simulation with HadCM3B-M2.1. Interannual variability is also added and derived from the initial snapshot simulations. Model output has been downscaled to 0.5deg resolution (using simple bilinear interpolation) and bias corrected using either the University of East Anglia, Climate Research Unit observational data (for temperature, precipitation, windchill, and minimum monthly temperature), or the EWEMBI dataset (for incoming shortwave energy). Here we provide datasets for; surface air temperature, precipitation, incoming shortwave energy, wind-chill, snow depth (as snow water equivalent), number of rainy days per month, minimum monthly temperature, and the land-sea mask and ice fractions used in the simulations. The datasets are in the form of NetCDF files. The variables are represented by a set of 24 files that have been compressed into nine folders: temp, precip, down_sw, wind_chill, snow, rainy_days, tempmonmin, landmask and icefrac. Each file represents 2500 years. The landmask and ice fraction are provided annually, whereas the climate variables are given as monthly files equivalent to 30000 months, between the latitudes 0deg to 90degN at 0.5deg resolution. Each of the climate files therefore have the dimensions 180 (lat) x 720 (lon) x 30000 (month). We also provide an example subset of the temperature dataset, which gives decadal averages for each month for 0-2500 years.
Facebook
TwitterThe NOAA Monthly U.S. Climate Gridded Dataset (NClimGrid) consists of four climate variables derived from the GHCN-D dataset: maximum temperature, minimum temperature, average temperature and precipitation. Each file provides monthly values in a 5x5 lat/lon grid for the Continental United States. Data is available from 1895 to the present. On an annual basis, approximately one year of "final" nClimGrid will be submitted to replace the initially supplied "preliminary" data for the same time period. Users should be sure to ascertain which level of data is required for their research.
Facebook
Twitterhttps://object-store.os-api.cci2.ecmwf.int:443/cci2-prod-catalogue/licences/insitu-gridded-observations-global-and-regional/insitu-gridded-observations-global-and-regional_15437b363f02bf5e6f41fc2995e3d19a590eb4daff5a7ce67d1ef6c269d81d68.pdfhttps://object-store.os-api.cci2.ecmwf.int:443/cci2-prod-catalogue/licences/insitu-gridded-observations-global-and-regional/insitu-gridded-observations-global-and-regional_15437b363f02bf5e6f41fc2995e3d19a590eb4daff5a7ce67d1ef6c269d81d68.pdf
This dataset provides high-resolution gridded temperature and precipitation observations from a selection of sources. Additionally the dataset contains daily global average near-surface temperature anomalies. All fields are defined on either daily or monthly frequency. The datasets are regularly updated to incorporate recent observations. The included data sources are commonly known as GISTEMP, Berkeley Earth, CPC and CPC-CONUS, CHIRPS, IMERG, CMORPH, GPCC and CRU, where the abbreviations are explained below. These data have been constructed from high-quality analyses of meteorological station series and rain gauges around the world, and as such provide a reliable source for the analysis of weather extremes and climate trends. The regular update cycle makes these data suitable for a rapid study of recently occurred phenomena or events. The NASA Goddard Institute for Space Studies temperature analysis dataset (GISTEMP-v4) combines station data of the Global Historical Climatology Network (GHCN) with the Extended Reconstructed Sea Surface Temperature (ERSST) to construct a global temperature change estimate. The Berkeley Earth Foundation dataset (BERKEARTH) merges temperature records from 16 archives into a single coherent dataset. The NOAA Climate Prediction Center datasets (CPC and CPC-CONUS) define a suite of unified precipitation products with consistent quantity and improved quality by combining all information sources available at CPC and by taking advantage of the optimal interpolation (OI) objective analysis technique. The Climate Hazards Group InfraRed Precipitation with Station dataset (CHIRPS-v2) incorporates 0.05° resolution satellite imagery and in-situ station data to create gridded rainfall time series over the African continent, suitable for trend analysis and seasonal drought monitoring. The Integrated Multi-satellitE Retrievals dataset (IMERG) by NASA uses an algorithm to intercalibrate, merge, and interpolate “all'' satellite microwave precipitation estimates, together with microwave-calibrated infrared (IR) satellite estimates, precipitation gauge analyses, and potentially other precipitation estimators over the entire globe at fine time and space scales for the Tropical Rainfall Measuring Mission (TRMM) and its successor, Global Precipitation Measurement (GPM) satellite-based precipitation products. The Climate Prediction Center morphing technique dataset (CMORPH) by NOAA has been created using precipitation estimates that have been derived from low orbiter satellite microwave observations exclusively. Then, geostationary IR data are used as a means to transport the microwave-derived precipitation features during periods when microwave data are not available at a location. The Global Precipitation Climatology Centre dataset (GPCC) is a centennial product of monthly global land-surface precipitation based on the ~80,000 stations world-wide that feature record durations of 10 years or longer. The data coverage per month varies from ~6,000 (before 1900) to more than 50,000 stations. The Climatic Research Unit dataset (CRU v4) features an improved interpolation process, which delivers full traceability back to station measurements. The station measurements of temperature and precipitation are public, as well as the gridded dataset and national averages for each country. Cross-validation was performed at a station level, and the results have been published as a guide to the accuracy of the interpolation. This catalogue entry complements the E-OBS record in many aspects, as it intends to provide high-resolution gridded meteorological observations at a global rather than continental scale. These data may be suitable as a baseline for model comparisons or extreme event analysis in the CMIP5 and CMIP6 dataset.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
SILO is a Queensland Government database containing continuous daily climate data for Australia from 1889 to present, in a number of ready-to-use formats, suitable for modelling and research applications. The SILO database contains two major classes of data: point (station) time series and spatial grids, both based on observed data from the Bureau of Meteorology ADAM (Australian Data Archive for Meteorology) database. For point data, interpolated or derived values are used where observations are missing. Gridded data are spatially interpolated from observations.
Facebook
TwitterThis dataset replaces the previous Time Bias Corrected Divisional Temperature-Precipitation Drought Index. The new divisional data set (nCLIMDIV) is based on the Global Historical Climatological Network-Daily (GHCN-D) and makes use of several improvements to the previous data set. For the input data, improvements include additional station networks, quality assurance reviews and temperature bias adjustments. Perhaps the most extensive improvement is to the computational approach, which now employs climatologically aided interpolation. This 5km grid based calculation nCLIMGRID helps to address topographic and network variability. This data set is primarily used by the National Oceanic and Atmospheric Administration (NOAA) National Climatic Data Center (NCDC) to issue State of the Climate Reports on a monthly basis. These reports summarize recent temperature and precipitation conditions and long-term trends at a variety of spatial scales, the smallest being the climate division level. Data at the climate division level are aggregated to compute statewide, regional and national snapshots of climate conditions. For CONUS, the period of record is from 1895-present. Derived quantities such as Standardized precipitation Index (SPI), Palmer Drought Indices (PDSI, PHDI, PMDI, and ZNDX) and degree days are also available for the CONUS sites.
In March 2015, data for thirteen Alaskan climate divisions were added to the nCLIMDIV data set. Data for the new Alaskan climate divisions begin in 1925 through the present and are included in all nCLIMDIV monthly updates. Alaskan climate data include the following elements for divisional and statewide coverage: average temperature, maximum temperature (highs), minimum temperature (lows), and precipitation. The Alaska nCLIMDIV data were created and updated using similar methodology as that for the CONUS, but with a different approach to establishing the underlying climatology. The Alaska data are built upon the 1971-2000 PRISM averages whereas the CONUS values utilize a base climatology derived from the nCLIMGRID data set.
As of November 2018, nClimDiv includes county data and additional inventory files.
Facebook
TwitterClimate Change Analysis
This dataset falls under the category Environmental Data Air Quality Data.
It contains the following data: Average temperature
This dataset was scouted on 2022-02-10 as part of a data sourcing project conducted by TUMI. License information might be outdated: Check original source for current licensing.
The data can be accessed using the following URL / API Endpoint: https://www.kaggle.com/bimal1990/climate-change-analysis/dataSee URL for data access and license information.
Facebook
Twitterhttps://datacatalog.worldbank.org/public-licenses?fragment=cchttps://datacatalog.worldbank.org/public-licenses?fragment=cc
The Climate Change Knowledge Portal (CCKP) is the World Bank's designated climate data service. CCKP offers a comprehensive suite of climate data and products that are derived from the latest generation of climate data archives. CCKP implements a systematic way of pre-processing the raw observed and model-based projection data to enable inter-comparable use across a broad range of applications. Data is available across an expansive range of climate variables and can be extracted per individual spatial units, variables, select timeframes, climate projection scenarios, across ensembles or individual models. Data is available as global gridded or spatially aggregated to national, subnational, watershed, and Exclusive Economic Zone scaled.
The Observed Climate Data, CRU ts4.07 0.5-degree dataset, CRU TS (Climatic Research Unit gridded Time Series) is the most widely used observational climate dataset. Data is presented on a 0.5° latitude by 0.5° longitude grid over all land domains except Antarctica. It is derived by the interpolation of monthly climate anomalies from extensive networks of weather station observations. The CRU TS version 4.07 gridded dataset is derived from observational data and provides quality-controlled temperature and rainfall values from thousands of weather stations worldwide, as well as derivative products including monthly climatologies and long term historical climatologies. Data products are derived from the raw data produced by the Climatic Research Unit (CRU) of the University of East Anglia (UEA).
Global gridded NetCDF files can be accessed via https://registry.opendata.aws/wbg-cckp/
Pre-computed statistics for spatially aggregated data is available as API or xls via
Facebook
TwitterThis data has been superseded by a newer version of the dataset. Please refer to NOAA's Climate Divisional Database for more information. The U.S. Climate Divisional Dataset provides data access to current U.S. temperature, precipitation and drought indeces. Divisional indices included are: Precipitation Index, Palmer Drought Severity Index, Palmer Hydrological Drought Index, Modified Palmer Drought Severity Index, Temperature, Palmer Z Index, Cooling Degree Days, Heating Degree Days, 1-Month Standardized Precipitation Index (SPI), 2-Month (SPI), 3-Month (SPI), 6-Month (SPI),12-Month (SPI) and the 24-Month (SPI). All of these Indices, except for the SPI, are available for Regional, State and National views as well. There are 344 climate divisions in the CONUS. For each climate division, monthly station temperature and precipitation values are computed from the daily observations. The divisional values are weighted by area to compute statewide values and the statewide values are weighted by area to compute regional values. The indices were computed using daily station data from 1895 to present.
Facebook
TwitterThe Florida Flood Hub for Applied Research and Innovation and the U.S. Geological Survey have developed projected future change factors for precipitation depth-duration-frequency (DDF) curves at 242 National Oceanic and Atmospheric Administration (NOAA) Atlas 14 stations in Florida. The change factors were computed as the ratio of projected future to historical extreme-precipitation depths fitted to extreme-precipitation data from downscaled climate datasets using a constrained maximum likelihood (CML) approach as described in https://doi.org/10.3133/sir20225093. The change factors correspond to the periods 2020-59 (centered in the year 2040) and 2050-89 (centered in the year 2070) as compared to the 1966-2005 historical period. Geospatial data provided in an ArcGIS shapefile are described herein. The shapefile contains polygons of climate regions including their acreages. As part of this project, best models from each downscaled climate dataset are determined for each climate region based on how well they reproduce historical climate extreme indices. Table 1 of Datasets_station_information.xlsx lists the NOAA Atlas 14 stations located within each climate region. See Best_model_lists.xlsx for a list of best models for each climate region in each downscaled climate dataset and when all downscaled climate datasets are considered together.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Nordic Gridded Climate Dataset (NGCD) is a high resolution, observational, gridded dataset of daily minimum, maximum and mean temperatures and daily precipitation totals, covering Finland, Sweden and Norway. The time period covered begins in January 1961 and continues to the present. The dataset is regularly updated every 6 months, in March and in September. In addition, there are daily, provisional updates. Spatial interpolation methods are applied to observational datasets to create gridded datasets. In general, there are three types of such methods: deterministic (type 1), stochastic (type 2) and pure mathematical (type 3). NGCD applies both a deterministic kriging (type 1) interpolation approach and a stochastic Bayesian (type 2) interpolation approach to the same in-situ observational dataset collected by weather stations. For more details on the algorithms, users are advised to read the product user guide. The input data is provided by the National Meteorological and Hydrological Services of Finland, Norway and Sweden. The time-series used for Finland and Sweden are the non-blended time-series from the station network of the European Climate Assessment & Dataset (ECA&D) project. For Norway, time-series are extracted from the climate database of the Norwegian Meteorological Institute.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains temperature exposure statistics for Europe (e.g. percentiles) derived from the daily 2 metre mean, minimum and maximum air temperature for the entire year, winter (DJF: December-January-February) and summer (JJA: June-July-August). These statistics were derived within the C3S European Health service and are available for different future time periods and using different climate change scenarios. Temperature percentiles are typically used in epidemiology and public health when defining health risk estimates and when looking at current and future health impacts, and they allow to identify a common threshold and comparison between different cities/areas. The temperature statistics are calculated, either for the season winter and summer or for the whole year, based on a bias-adjusted EURO-CORDEX dataset. The statistics are averaged for 30 years as a smoothed average from 1971 to 2100. This results in a timeseries covering the period from 1986 to 2085. Finally, the timeseries are averaged for the model ensemble and the standard deviation to this ensemble mean is provided.
Facebook
TwitterThe Florida Flood Hub for Applied Research and Innovation and the U.S. Geological Survey have developed projected future change factors for precipitation depth-duration-frequency (DDF) curves at 242 National Oceanic and Atmospheric Administration (NOAA) Atlas 14 stations in Florida. The change factors were computed as the ratio of projected future to historical extreme-precipitation depths fitted to extreme-precipitation data from downscaled climate datasets using a constrained maximum likelihood (CML) approach as described in https://doi.org/10.3133/sir20225093. The change factors correspond to the period 2020-59 (centered in 2040) or to the period 2050-89 (centered in the year 2070) as compared to the 1966-2005 historical period. A Microsoft Excel workbook is provided that tabulates best models for each downscaled climate dataset and for all downscaled climate datasets considered together. Best models were identified based on how well the models capture the climatology and interannual variability of four climate extreme indices using the Model Climatology Index (MCI) and the Model Variability Index (MVI) of Srivastava and others (2020). The four indices consist of annual maxima consecutive precipitation for durations of 1, 3, 5, and 7 days compared against the same indices computed based on the PRISM and SFWMD gridded precipitation datasets for five climate regions: climate region 1 in Northwest Florida, 2 in North Florida, 3 in North Central Florida, 4 in South Central Florida, and climate region 5 in South Florida. The PRISM dataset is based on the Parameter-elevation Relationships on Independent Slopes Model interpolation method of Daly and others (2008). The South Florida Water Management District’s (SFWMD) precipitation super-grid is a gridded precipitation dataset developed by modelers at the agency for use in hydrologic modeling (SFWMD, 2005). This dataset is considered by the SFWMD as the best available gridded rainfall dataset for south Florida and was used in addition to PRISM to identify best models in the South Central and South Florida climate regions. Best models were selected based on MCI and MVI evaluated within each individual downscaled dataset. In addition, best models were selected by comparison across datasets and referred to as "ALL DATASETS" hereafter. Due to the small sample size, all models in the using the Weather Research and Forecasting Model (JupiterWRF) dataset were considered as best models.
Facebook
TwitterDiaries and Journals containing weather information in a non-tabular format. Records date from 1735 through the early 20th century. Much of the weather and climate data recorded by the founding fathers of this country (Washington, Jefferson and Franklin to name a few) were archived in original manuscripts, then microfilmed and stored at the National Archive and Records Administration (NARA). Those records available from NARA on microfilm have been imaged and placed on the EV2 system. To date, there are more than 42 million of those images on-line. These colonial diaries and data are a treasure trove to the climatologist seeking data on climate of the 19th century.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
The CRU JRA V2.5 dataset is a 6-hourly, land surface, gridded time series of ten meteorological variables produced by the Climatic Research Unit (CRU) at the University of East Anglia (UEA), and is intended to be used to drive models. The variables are provided on a 0.5 degree latitude x 0.5 degree longitude grid, the grid is near global but excludes Antarctica (this is the same as the CRU TS grid, though the set of variables is different). The data are available at a 6 hourly time-step from January 1901 to December 2023.
The dataset is constructed by regridding data from the Japanese Reanalysis data (JRA) produced by the Japanese Meteorological Agency (JMA), adjusting where possible to align with the CRU TS 4.08 data (see the Process section and the ReadMe file for full details).
The CRU JRA data consists of the following ten meteorological variables: 2-metre temperature, 2-metre maximum and minimum temperature, total precipitation, specific humidity, downward solar radiation flux, downward long wave radiation flux, pressure and the zonal and meridional components of wind speed (see the ReadMe file for further details).
The CRU JRA dataset is intended to be a replacement of the CRU NCEP forcing dataset. The CRU JRA dataset follows the style of Nicolas Viovy's original CRU NCEP dataset rather than that which is available from UCAR. A link to the CRU NCEP documentation for comparison is provided in the documentation section. This version of CRUJRA, v2.5 (1901-2023) is, where possible, adjusted to align with CRU TS monthly means or totals. A consequence of this is that, if CRU TS changes, then CRUJRA changes.
For this version, and version 4.07 of CRU TS, the CLD (cloud cover, %) variable is now actualised (converted from gridded anomalies) using the original CLD climatology and not the revised climatology introduced last year. This change/reversion is summarised here: https://crudata.uea.ac.uk/cru/data/hrg/cru_cl_1.1/Read_Me_CRU_CL_CLD_Reversion.txt
Since CLD is used to align DSWRF, CRUJRA Downward Short Wave Radiation Flux (DSWRF) will now be 'closer to' version 2.2 and earlier and should be used in preference to v2.3.
If this dataset is used in addition to citing the dataset as per the data citation string users must also cite the following:
Harris, I., Osborn, T.J., Jones, P. et al. Version 4 of the CRU TS monthly high-resolution gridded multivariate climate dataset. Sci Data 7, 109 (2020). https://doi.org/10.1038/s41597-020-0453-3
Harris, I., Jones, P.D., Osborn, T.J. and Lister, D.H. (2014), Updated high-resolution grids of monthly climatic observations - the CRU TS3.10 Dataset. International Journal of Climatology 34, 623-642.
Kobayashi, S., et. al., The JRA-55 Reanalysis: General Specifications and Basic Characteristics. J. Met. Soc. Jap., 93(1), 5-48 https://dx.doi.org/10.2151/jmsj.2015-001
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
ERA5 is the fifth generation ECMWF atmospheric reanalysis of the global climate covering the period from January 1940 to present. It is produced by the Copernicus Climate Change Service (C3S) at ECMWF and provides hourly estimates of a large number of atmospheric, land and oceanic climate variables. The data cover the Earth on a 31km grid and resolve the atmosphere using 137 levels from the surface up to a height of 80km. ERA5 includes an ensemble component at half the resolution to provide information on synoptic uncertainty of its products. ERA5.1 is a dedicated product with the same horizontal and vertical resolution that was produced for the years 2000 to 2006 inclusive to significantly improve a discontinuity in global-mean temperature in the stratosphere and uppermost troposphere that ERA5 suffers from during that period. Users that are interested in this part of the atmosphere in this era are advised to access ERA5.1 rather than ERA5. ERA5 and ERA5.1 use a state-of-the-art numerical weather prediction model to assimilate a variety of observations, including satellite and ground-based measurements, and produces a comprehensive and consistent view of the Earth's atmosphere. These products are widely used by researchers and practitioners in various fields, including climate science, weather forecasting, energy production and machine learning among others, to understand and analyse past and current weather and climate conditions.
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Some say climate change is the biggest threat of our age while others say it’s a myth based on dodgy science. We are turning some of the data over to you so you can form your own view.
Even more than with other data sets that Kaggle has featured, there’s a huge amount of data cleaning and preparation that goes into putting together a long-time study of climate trends. Early data was collected by technicians using mercury thermometers, where any variation in the visit time impacted measurements. In the 1940s, the construction of airports caused many weather stations to be moved. In the 1980s, there was a move to electronic thermometers that are said to have a cooling bias.
Given this complexity, there are a range of organizations that collate climate trends data. The three most cited land and ocean temperature data sets are NOAA’s MLOST, NASA’s GISTEMP and the UK’s HadCrut.
We have repackaged the data from a newer compilation put together by the Berkeley Earth, which is affiliated with Lawrence Berkeley National Laboratory. The Berkeley Earth Surface Temperature Study combines 1.6 billion temperature reports from 16 pre-existing archives. It is nicely packaged and allows for slicing into interesting subsets (for example by country). They publish the source data and the code for the transformations they applied. They also use methods that allow weather observations from shorter time series to be included, meaning fewer observations need to be thrown away.
In this dataset, we have include several files:
Global Land and Ocean-and-Land Temperatures (GlobalTemperatures.csv):
Other files include:
The raw data comes from the Berkeley Earth data page.