Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Some say climate change is the biggest threat of our age while others say it’s a myth based on dodgy science. We are turning some of the data over to you so you can form your own view.
Even more than with other data sets that Kaggle has featured, there’s a huge amount of data cleaning and preparation that goes into putting together a long-term study of climate trends. Early data was collected by technicians using mercury thermometers, where any variation in the visit time impacted measurements. In the 1940s, the construction of airports caused many weather stations to be moved. In the 1980s, there was a move to electronic thermometers that are said to have a cooling bias.
Given this complexity, there is a range of organizations that collate climate trends data. The three most cited land and ocean temperature data sets are NOAA’s MLOST, NASA’s GISTEMP and the UK’s HadCRUT.
We have repackaged the data from a newer compilation put together by Berkeley Earth, which is affiliated with Lawrence Berkeley National Laboratory. The Berkeley Earth Surface Temperature Study combines 1.6 billion temperature reports from 16 pre-existing archives. It is nicely packaged and allows for slicing into interesting subsets (for example by country). They publish the source data and the code for the transformations they applied. They also use methods that allow weather observations from shorter time series to be included, meaning fewer observations need to be thrown away.
In this dataset, we have included several files:
Global Land and Ocean-and-Land Temperatures (GlobalTemperatures.csv):
Other files include:
The raw data comes from the Berkeley Earth data page.
This version has been superseded by a newer version. Users are strongly encouraged to access the current version and should use this superseded version only for special cases, such as reproducing studies. If necessary, this version can be accessed by contacting NCEI. The NOAA Global Surface Temperature Dataset (NOAAGlobalTemp) is a blended product from two independent analysis products: the Extended Reconstructed Sea Surface Temperature (ERSST) analysis and the land surface temperature (LST) analysis using the Global Historical Climatology Network (GHCN) temperature database. The data are merged into a monthly global surface temperature dataset spanning 1880 to the present. The monthly product output is in gridded (5 degree x 5 degree) and time series formats. The product is used in climate monitoring assessments of near-surface temperatures on a global scale. The changes from version 4 to version 5 include an update to the primary input datasets: ERSST version 5 (updated from v4), and GHCN-M version 4 (updated from v3.3.3). Version 5 updates also include a new netCDF file format with CF conventions. This dataset was formerly known as Merged Land-Ocean Surface Temperature (MLOST).
The table Global Temperatures by Major City is part of the dataset Climate Change: Earth Surface Temperature Data, available at https://columbia.redivis.com/datasets/1e0a-f4931vvyg. It contains 239177 rows across 7 variables.
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Historical changes of annual temperature and precipitation indices at 210 selected U.S. cities
This dataset provides:
Annual average temperature, total precipitation, and temperature and precipitation extremes calculations for 210 U.S. cities.
Historical rates of changes in annual temperature, precipitation, and the selected temperature and precipitation extreme indices in the 210 U.S. cities.
Estimated thresholds (reference levels) for the calculations of annual extreme indices including warm and cold days, warm and cold nights, and precipitation amount from very wet days in the 210 cities.
Annual averages of daily mean temperature, Tmax, and Tmin are included for annual average temperature calculations. Calculations were based on the compiled daily temperature and precipitation records at individual cities.
Temperature and precipitation extreme indices include: warmest daily Tmax and Tmin, coldest daily Tmax and Tmin, warm days and nights, cold days and nights, maximum 1-day precipitation, maximum consecutive 5-day precipitation, and precipitation amounts from very wet days.
The number of missing daily Tmax, Tmin, and precipitation values is included for each city.
Rates of change were calculated using linear regression, with some climate indices applied with the Box-Cox transformation prior to the linear regression.
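The calculation described above can be sketched as follows. This is a minimal illustration, not the authors' code: the yearly values are made up, the choice of which index gets transformed is arbitrary, and the Box-Cox parameter `lam` is supplied by hand here (how the study selected it is not stated in this description).

```python
import math

def box_cox(values, lam):
    """Box-Cox transform of strictly positive values for a caller-chosen lam."""
    if any(v <= 0 for v in values):
        raise ValueError("Box-Cox requires strictly positive values")
    if lam == 0:
        return [math.log(v) for v in values]
    return [(v ** lam - 1.0) / lam for v in values]

def ols_slope(xs, ys):
    """Ordinary least-squares slope and intercept of ys regressed on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Hypothetical annual series (illustrative numbers, not taken from the dataset).
years = [2000, 2001, 2002, 2003, 2004]
annual_mean_temp = [12.0, 12.1, 12.2, 12.3, 12.4]   # degrees C
max_1day_precip = [40.0, 55.0, 38.0, 70.0, 62.0]    # mm

# Rate of change of an untransformed index, in degrees C per year.
temp_rate, _ = ols_slope(years, annual_mean_temp)

# Rate of change of a skewed index after a Box-Cox transform (lam=0 is the log).
precip_rate, _ = ols_slope(years, box_cox(max_1day_precip, lam=0))
```

Note that a slope fitted to transformed values is expressed in transformed units, so it is not directly comparable with a slope fitted to the raw index.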
The historical observations from ACIS belong to the Global Historical Climatology Network - Daily (GHCN-D) datasets. The included stations were based on NRCC’s “ThreadEx” project, which combined daily temperature and precipitation extremes at 255 NOAA Local Climatological Locations, representing all large and medium-sized cities in the U.S. (see Owen et al. (2006), Accessing NOAA Daily Temperature and Precipitation Extremes Based on Combined/Threaded Station Records).
Resources:
See included README file for more information.
Additional technical details and analyses can be found in: Lai, Y., & Dzombak, D. A. (2019). Use of historical data to assess regional climate change. Journal of Climate, 32(14), 4299-4320. https://doi.org/10.1175/JCLI-D-18-0630.1
Other datasets from the same project can be accessed at: https://kilthub.cmu.edu/projects/Use_of_historical_data_to_assess_regional_climate_change/61538
ACIS database for historical observations: http://scacis.rcc-acis.org/
GHCN-D datasets can also be accessed at: https://www.ncei.noaa.gov/data/global-historical-climatology-network-daily/
Station information for each city can be accessed at: http://threadex.rcc-acis.org/
2024 August updated -
Annual calculations for 2022 and 2023 were added.
Linear regression results and thresholds for extremes were updated because of the addition of 2022 and 2023 data.
Note that future updates may be infrequent.
2022 January updated -
Annual calculations for 2021 were added.
Linear regression results and thresholds for extremes were updated because of the addition of 2021 data.
2021 January updated -
Annual calculations for 2020 were added.
Linear regression results and thresholds for extremes were updated because of the addition of 2020 data.
2020 January updated -
Annual calculations for 2019 were added.
Linear regression results and thresholds for extremes were updated because of the addition of 2019 data.
Thresholds for all 210 cities were combined into one single file – Thresholds.csv.
2019 June updated -
Baltimore was updated with the 2018 data (the previous version showed NA for 2018) and a new ID to reflect the GHCN ID of Baltimore-Washington International AP. The city_info file was updated accordingly.
README file was updated to reflect the use of the "wet days" index in this study. The 95% thresholds for the calculation of wet days used all daily precipitation data from the reference period and can differ from the same index in some other studies, where only days with at least 1 mm of precipitation were used to calculate the thresholds. Thus the thresholds in this study can be lower than the ones that would have been calculated from the 95th percentiles of wet days (i.e., days with at least 1 mm of precipitation).
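The effect of the two threshold definitions can be illustrated with a short sketch. The daily values are made up, and the `percentile` helper uses linear interpolation between ranks, which is an assumption; the README may specify a different percentile method.

```python
def percentile(values, q):
    """Percentile (q in 0..100) with linear interpolation between closest ranks."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("no values")
    pos = (len(ordered) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)

# Hypothetical reference period: 80 dry days and 20 days with precipitation (mm).
daily_precip = [0.0] * 80 + [
    1.0, 1.5, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0,
    10.0, 11.0, 12.0, 14.0, 15.0, 18.0, 20.0, 25.0, 30.0, 40.0,
]

# Definition described above: all daily values from the reference period.
threshold_all_days = percentile(daily_precip, 95)

# Alternative used in some other studies: only days with at least 1 mm.
wet_days = [p for p in daily_precip if p >= 1.0]
threshold_wet_days = percentile(wet_days, 95)
```

Because dry days pull the distribution toward zero, `threshold_all_days` comes out lower than `threshold_wet_days` for this sample, which is the difference the README note warns about.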
https://object-store.os-api.cci2.ecmwf.int:443/cci2-prod-catalogue/licences/insitu-gridded-observations-global-and-regional/insitu-gridded-observations-global-and-regional_15437b363f02bf5e6f41fc2995e3d19a590eb4daff5a7ce67d1ef6c269d81d68.pdf
This dataset provides high-resolution gridded temperature and precipitation observations from a selection of sources. Additionally, the dataset contains daily global average near-surface temperature anomalies. All fields are defined at either daily or monthly frequency. The datasets are regularly updated to incorporate recent observations. The included data sources are commonly known as GISTEMP, Berkeley Earth, CPC and CPC-CONUS, CHIRPS, IMERG, CMORPH, GPCC and CRU, where the abbreviations are explained below. These data have been constructed from high-quality analyses of meteorological station series and rain gauges around the world, and as such provide a reliable source for the analysis of weather extremes and climate trends. The regular update cycle makes these data suitable for a rapid study of recent phenomena or events.
The NASA Goddard Institute for Space Studies temperature analysis dataset (GISTEMP-v4) combines station data of the Global Historical Climatology Network (GHCN) with the Extended Reconstructed Sea Surface Temperature (ERSST) to construct a global temperature change estimate.
The Berkeley Earth Foundation dataset (BERKEARTH) merges temperature records from 16 archives into a single coherent dataset.
The NOAA Climate Prediction Center datasets (CPC and CPC-CONUS) define a suite of unified precipitation products with consistent quantity and improved quality by combining all information sources available at CPC and by taking advantage of the optimal interpolation (OI) objective analysis technique.
The Climate Hazards Group InfraRed Precipitation with Station dataset (CHIRPS-v2) incorporates 0.05° resolution satellite imagery and in-situ station data to create gridded rainfall time series over the African continent, suitable for trend analysis and seasonal drought monitoring.
The Integrated Multi-satellitE Retrievals dataset (IMERG) by NASA uses an algorithm to intercalibrate, merge, and interpolate "all" satellite microwave precipitation estimates, together with microwave-calibrated infrared (IR) satellite estimates, precipitation gauge analyses, and potentially other precipitation estimators over the entire globe at fine time and space scales for the Tropical Rainfall Measuring Mission (TRMM) and its successor, Global Precipitation Measurement (GPM) satellite-based precipitation products.
The Climate Prediction Center morphing technique dataset (CMORPH) by NOAA has been created using precipitation estimates derived exclusively from low-orbiter satellite microwave observations. Geostationary IR data are then used to transport the microwave-derived precipitation features during periods when microwave data are not available at a location.
The Global Precipitation Climatology Centre dataset (GPCC) is a centennial product of monthly global land-surface precipitation based on the ~80,000 stations worldwide that feature record durations of 10 years or longer. The data coverage per month varies from ~6,000 (before 1900) to more than 50,000 stations.
The Climatic Research Unit dataset (CRU v4) features an improved interpolation process, which delivers full traceability back to station measurements. The station measurements of temperature and precipitation are public, as well as the gridded dataset and national averages for each country. Cross-validation was performed at a station level, and the results have been published as a guide to the accuracy of the interpolation.
This catalogue entry complements the E-OBS record in many aspects, as it intends to provide high-resolution gridded meteorological observations at a global rather than continental scale. These data may be suitable as a baseline for model comparisons or extreme event analysis in the CMIP5 and CMIP6 datasets.
https://cdla.io/sharing-1-0/
The synthetic sensor dataset contains more than 3000 samples, each representing a set of sensor readings. It consists of six columns: Temperature, Sensor1, Sensor2, Sensor3, Sensor4, and Sensor5.
The dataset is designed to mimic a scenario where temperature readings are influenced by multiple independent sensor measurements. The values of the independent variables and the added noise introduce variability in the temperature readings.
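A dataset with this shape could be generated along the following lines. This is only a sketch under stated assumptions: the description does not say how Temperature depends on the sensors, so the linear relationship, the weights, the uniform sensor ranges, the Gaussian noise, and the sample count of exactly 3000 are all hypothetical.

```python
import random

COLUMNS = ["Temperature", "Sensor1", "Sensor2", "Sensor3", "Sensor4", "Sensor5"]

# Hypothetical coefficients; the real dataset's generating process is not documented here.
HYPOTHETICAL_WEIGHTS = [2.0, -1.0, 0.5, 3.0, 1.5]

def make_samples(n, weights, noise_sd, seed=0):
    """Return n rows of [temperature, sensor1, ..., sensorK] with additive noise."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        # Independent sensor readings drawn from an assumed uniform range.
        sensors = [rng.uniform(0.0, 1.0) for _ in weights]
        # Temperature is an assumed linear combination of the sensors plus noise.
        signal = sum(w * s for w, s in zip(weights, sensors))
        temperature = signal + rng.gauss(0.0, noise_sd)
        rows.append([temperature] + sensors)
    return rows

rows = make_samples(3000, HYPOTHETICAL_WEIGHTS, noise_sd=0.1)
```

Fixing the seed keeps the sample reproducible, and setting `noise_sd` to zero recovers the noise-free relationship, which is convenient for checking a fitted model.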
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains temperature exposure statistics for Europe (e.g. percentiles) derived from the daily 2 metre mean, minimum and maximum air temperature for the entire year, winter (DJF: December-January-February) and summer (JJA: June-July-August). These statistics were derived within the C3S European Health service and are available for different future time periods and different climate change scenarios. Temperature percentiles are typically used in epidemiology and public health when defining health risk estimates and when looking at current and future health impacts, and they make it possible to identify a common threshold and to compare different cities or areas. The temperature statistics are calculated either for the winter and summer seasons or for the whole year, based on a bias-adjusted EURO-CORDEX dataset. The statistics are averaged over 30 years as a smoothed average from 1971 to 2100. This results in a time series covering the period from 1986 to 2085. Finally, the time series are averaged over the model ensemble, and the standard deviation around this ensemble mean is provided.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Average temperature in the United States increased to 10.73 degrees Celsius in 2024 from 10.25 degrees Celsius in 2023. This dataset includes a chart with historical data for the United States Average Temperature.
This Climate Data Record (CDR) provides Land Surface Temperature (LST) derived from the Meteosat Visible and InfraRed Imager (MVIRI) on board the Meteosat First Generation (MFG) and the Spinning Enhanced Visible and InfraRed Imager (SEVIRI) on board the Meteosat Second Generation (MSG) satellites. The covered time period ranges from January 1983 to December 2020. Original thermal radiances were inter-calibrated by the European Organisation for the Exploitation of Meteorological Satellites (EUMETSAT). The LST is derived from Meteosat by use of single-channel LST retrieval based on radiative transfer calculations. The LST is presented as hourly data and as monthly averaged diurnal cycle composites on a 0.05°x0.05° grid covering the Meteosat disk (Africa and Europe). A summary of the retrieval algorithms is provided by Duguay–Tetzlaff et al. 2015. This is a Thematic Climate Data Record (TCDR).
The NOAA Global Surface Temperature Dataset (NOAAGlobalTemp) is a merged land and ocean surface temperature analysis (formerly known as MLOST). It is a spatially gridded (5° × 5°) global surface temperature dataset, with monthly resolution from January 1880 to present. We combine a global sea surface (water) temperature (SST) dataset with a global land surface air temperature dataset into this merged dataset of both the Earth's land and ocean surface temperatures. The SST dataset is the Extended Reconstructed Sea Surface Temperature (ERSST) version 5. The land surface air temperature dataset is similar to ERSST but uses data from the Global Historical Climatology Network Monthly (GHCN-M) database, version 4.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This data set includes daily, population-weighted mean values of various heat metrics for every county in the contiguous United States from 2000-2020. The dataset methodology, usage notes, and additional citations are published in Scientific Data (see reference below for Spangler et al. [2022]). Minimum, maximum, and mean ambient temperature, dew-point temperature, humidex, heat index, net effective temperature, wet-bulb globe temperature, and Universal Thermal Climate Index are included. Note that Monroe County, Florida (FIPS: 12087) and Nantucket County, Massachusetts (FIPS 25019) are missing due to unavailability of ERA5-Land data for Key West, Florida and Nantucket, MA. To use these data, assign the data from the .Rds file to a new data frame in R using the readRDS() function. Please cite the use of this data set with the following reference. Note that additional citations for specific variables can be found in Table 2.
K.R. Spangler, S. Liang, and G.A. Wellenius. "Wet-Bulb Globe Temperature, Universal Thermal Climate Index, and Other Heat Metrics for US Counties, 2000-2020." Scientific Data (2022). doi: 10.1038/s41597-022-01405-3
This data set contains modified Copernicus Climate Change Service information (2022), as described and cited in the manuscript referenced above. Neither the European Commission nor ECMWF is responsible for any use that may be made of the Copernicus information or data it contains. This data set is provided “as is” with no warranty of any kind.
Measurements of surface air and ocean temperature are compiled from around the world each month by NOAA’s National Centers for Environmental Information and are analyzed and compared to the 1971-2000 average temperature for each location. The resulting temperature anomaly (or difference from the average) is shown in this feature service, which includes an archive going back to 1880. The mean of the 12 months each year is displayed here. Each annual update is available around the 15th of the following January (e.g., 2020 is available Jan 15th, 2021). The NOAAGlobalTemp dataset is the official U.S. long-term record of global temperature data and is often used to show trends in temperature change around the world. It combines thousands of land-based station measurements from the Global Historical Climatology Network (GHCN) along with surface ocean temperature from the Extended Reconstructed Sea Surface Temperature (ERSST) analysis. These two datasets are merged into a 5-degree resolution product. A summary report by NOAA NCEI is available here. GHCN monthly mean station averages for temperature and precipitation for the 1981-2010 period are also available in Living Atlas here.
What can you do with this layer?
Visualization: This layer can be used to plot areas where temperature was higher or lower than the historical average for each year since 1880. Be sure to configure the time settings in your web map to view the time series correctly.
Analysis: This layer can be used as an input to a variety of geoprocessing tools, such as Space Time Cubes and other trend analyses. For a more detailed temporal analysis, a monthly mean is available here.
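The anomaly calculation described above (difference from the 1971-2000 average, then the mean of the 12 months) can be sketched for a single location as follows. This is an illustration under assumptions, not NOAA's processing code: the data layout, the function names, and the made-up temperatures are hypothetical, and the baseline is taken per calendar month.

```python
def monthly_climatology(records, base_start, base_end):
    """Mean temperature per calendar month over the baseline years (inclusive)."""
    sums, counts = {}, {}
    for (year, month), temp in records.items():
        if base_start <= year <= base_end:
            sums[month] = sums.get(month, 0.0) + temp
            counts[month] = counts.get(month, 0) + 1
    return {month: sums[month] / counts[month] for month in sums}

def monthly_anomalies(records, climatology):
    """Observed temperature minus the baseline mean for the same calendar month."""
    return {
        (year, month): temp - climatology[month]
        for (year, month), temp in records.items()
        if month in climatology
    }

def annual_mean_anomaly(anomalies, year):
    """Mean of the monthly anomalies available for one year."""
    values = [a for (y, _), a in anomalies.items() if y == year]
    return sum(values) / len(values)

# Made-up record for one location: identical seasonal cycle in every baseline
# year, and a 2020 that is uniformly 1 degree warmer.
records = {(y, m): float(m) for y in range(1971, 2001) for m in range(1, 13)}
records.update({(2020, m): float(m) + 1.0 for m in range(1, 13)})

climatology = monthly_climatology(records, 1971, 2000)
anomalies = monthly_anomalies(records, climatology)
anomaly_2020 = annual_mean_anomaly(anomalies, 2020)
```

Subtracting a per-month baseline removes the seasonal cycle, so the annual mean reflects only the departure from normal conditions.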
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The NOAA Global Surface Temperature Dataset (NOAAGlobalTemp) is a monthly global merged land-ocean surface temperature analysis product that is derived from two independent analyses. The first is the Extended Reconstructed Sea Surface Temperature (ERSST) analysis and the second is a land surface air temperature (LSAT) analysis that uses the Global Historical Climatology Network - Monthly (GHCN-M) temperature database. The NOAAGlobalTemp data set contains global surface temperatures in gridded (5° × 5°) and monthly resolution time series (from 1850 to present time) data files. The product is used in climate monitoring assessments of near-surface temperatures on a global scale. This version, v6.0, updates the current operational release, v5.1, and uses an Artificial Neural Network method to improve the surface temperature reconstruction over land.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Q: Where was the monthly temperature warmer or cooler than usual?
A: Colors show where average monthly temperature was above or below its 1991-2020 average. Blue areas experienced cooler-than-usual temperatures while areas shown in red were warmer than usual. The darker the color, the larger the difference from the long-term average temperature.
Q: Where do these measurements come from?
A: Weather stations on every continent record temperatures over land, and ocean surface temperatures come from measurements made by ships and buoys. NOAA scientists merge the readings from land and ocean into a single dataset. To calculate difference-from-average temperatures (also called temperature anomalies), scientists calculate the average monthly temperature across hundreds of small regions, and then subtract each region’s 1991-2020 average for the same month. If the result is a positive number, the region was warmer than the long-term average. A negative result from the subtraction means the region was cooler than usual. To generate the source images, visualizers apply a mathematical filter to the results to produce a map that has smooth color transitions and no gaps.
Q: What do the colors mean?
A: Shades of red show where average monthly temperature was warmer than the 1991-2020 average for the same month. Shades of blue show where the monthly average was cooler than the long-term average. The darker the color, the larger the difference from average temperature. White and very light areas were close to their long-term average temperature. Gray areas near the North and South Poles show where no data are available.
Q: Why do these data matter?
A: Over time, these data give us a planet-wide picture of how climate varies over months and years and changes over decades. Each month, some areas are cooler than the long-term average and some areas are warmer. Though we don’t see an increase in temperature at every location every month, the long-term trend shows a growing portion of Earth’s surface is warmer than it was during the base period.
Q: How did you produce these snapshots?
A: Data Snapshots are derivatives of existing data products: to meet the needs of a broad audience, we present the source data in a simplified visual style. NOAA's Environmental Visualization Laboratory (NNVL) produces the source images for the Difference from Average Temperature – Monthly maps. To produce our images, we run a set of scripts that access the source images, re-project them into desired projections at various sizes, and output them with a custom color bar.
Additional information
Source images available through NOAA's Environmental Visualization Lab (NNVL) are interpolated from data originally provided by the National Center for Environmental Information (NCEI) - Weather and Climate. NNVL images are based on NOAA Merged Land Ocean Global Surface Temperature Analysis data (NOAAGlobalTemp, formerly known as MLOST).
References
NCEI Monthly Global Analysis
NOAA View Temperature Anomaly
Merged Land Ocean Global Surface Temperature Analysis
Global Surface Temperature Anomalies
Climate at a Glance - Data Information
Source: https://www.climate.gov/maps-data/data-snapshots/data-source/temperature-global-monthly-difference-a...
This upload includes two additional files:
* Temperature - Global Monthly, Difference from Average _NOAA Climate.gov.pdf is a screenshot of the main Climate.gov site for these snapshots (https://www.climate.gov/maps-data/data-snapshots/data-source/temperature-global-monthly-difference-a...)
* Cimate_gov_ Data Snapshots.pdf is a screenshot of the data download page for the full-resolution files.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Average temperature in Iran decreased to 19.18 degrees Celsius in 2024 from 19.61 degrees Celsius in 2023. This dataset includes a chart with historical data for Iran Average Temperature.
Based on current monthly figures, the German climate has, on average, become slightly warmer. The average temperature for January 2025 was recorded at around 2 degrees Celsius, compared to 1.5 degrees a year before. In the broader context of climate change, average monthly temperatures are indicative of where the national climate is headed and whether attempts to control global warming are successful.
Summer and winter
Average summer temperature in Germany fluctuated in recent years, generally between 18 and 19 degrees Celsius. The season remains generally warm, and while there may not be as many hot and sunny days as in other parts of Europe, heat waves have occurred. In fact, 2023 saw 11.5 days with a temperature of at least 30 degrees, though this was a decrease compared to the year before. Meanwhile, average winter temperatures also fluctuated, but were higher in recent years, rising over four degrees on average in 2024. Figures have remained above zero since 2011. Numbers therefore suggest that German winters are becoming warmer, even if individual regions experiencing colder sub-zero snaps or even more snowfall may disagree.
Rain, rain, go away
Average monthly precipitation varied depending on the season, though sometimes figures from different times of the year were comparable. In 2024, the average monthly precipitation was highest in May and September, although rainfall might increase in October and November with the beginning of the cold season. In the past, torrential rains have led to catastrophic flooding in Germany, with one of the most devastating being the flood of July 2021. Germany is not immune to the weather changing between two extremes, e.g. very warm spring months mostly without rain, when rain might be wished for, and then increased precipitation in other months where dry weather might be better, for example during planting and harvest seasons. Climate change remains on the agenda in all its far-reaching ways.
This data set contains in-situ soil moisture profile and soil temperature data collected at 20-minute intervals at SoilSCAPE (Soil moisture Sensing Controller and oPtimal Estimator) project sites in four states (California, Arizona, Oklahoma, and Michigan) in the United States. SoilSCAPE used wireless sensor technology to acquire high temporal resolution soil moisture and temperature data at up to 12 sites over varying durations since August 2011. At its maximum, the network consisted of over 200 wireless sensor installations (nodes), with a range of 6 to 27 nodes per site. The soil moisture sensors (EC-5 and 5-TM from Decagon Devices) were installed at three to four depths, nominally at 5, 20, and 50 cm below the surface. Soil conditions (e.g., hard soil or rocks) may have limited sensor placement. Temperature sensors were installed at 5 cm depth at six of the sites. Data collection started in August 2011 and continues at eight sites through the present. The data enables estimation of local-scale soil moisture at high temporal resolution and validation of remote sensing estimates of soil moisture at regional (airborne, e.g. NASA's Airborne Microwave Observation of Subcanopy and Subsurface Mission - AirMOSS) and national (spaceborne, e.g. NASA's Soil Moisture Active Passive - SMAP) scales.
The average temperature in South Korea in 2024 was **** degrees Celsius. The average temperature in South Korea has risen steadily over the years, which is shown in the graph.
Extreme weather
South Korea has a distinct four-season climate. Generally, summer in South Korea is humid and hot, while winter is dry and cold. However, the summer climate, which usually lasts from June to August, is getting longer and can last from May through to September. Especially in summer, extreme weather such as tropical nights, typhoons, and heatwaves occur. Recently, there was an increase in the consecutive days in which heatwaves reached temperatures above ** degrees.
Greenhouse gas emissions
South Korea is suffering from air pollution problems, such as yellow dust and fine dust, that have increased rapidly over recent years. In addition, as the carbon dioxide concentration has continued to rise, the average annual temperature has also risen steadily, resulting in abnormal climates, such as heatwaves in summer or extreme cold in winter. South Korea is one of the countries that produces a lot of greenhouse gases. Due to the manufacturing-oriented industrial structure, greenhouse gas emissions from energy use account for a large portion.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is a gridded, 1 km resolution, global (50°S to 79°N) dataset of daily maximum and minimum near-surface air temperature for 2013. It was generated with a spatially varying coefficient model with sign preservation (SVCM-SP) algorithm from a seamless 1 km resolution land surface temperature dataset, a 30-arc-second (~1 km) resolution digital elevation model (DEM), and air temperature observations at weather stations. The gridded air temperature dataset is of great use in global urban, climate, and hydrological studies. Documentation for this dataset can be found with the 2003 data. Data from other years are available at: https://doi.org/10.25380/iastate.c.6005185
Attribution-NonCommercial 4.0 (CC BY-NC 4.0) https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This data set includes daily, monthly, and yearly mean surface air temperatures for four interior West Antarctic sites between 1978 and 1997. Data include air surface temperatures measured at the Byrd, Lettau, Lynn, and Siple Station automatic weather stations. In addition, because weather stations in Antarctica are difficult to maintain, and resulting multi-decade records are often incomplete, the investigators also calculated surface temperatures from satellite passive microwave brightness temperatures. Calibration of 37-GHz vertically polarized brightness temperature data during periods of known air temperature, using emissivity modeling, allowed the investigators to replace data gaps with calibrated brightness temperatures.
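The gap-filling idea can be sketched with a deliberately simplified model. The assumptions are significant: brightness temperature is taken to be a constant effective emissivity times the physical temperature (both in kelvin), and that emissivity is estimated as the mean ratio over days with both measurements. The investigators' emissivity modelling is likely more detailed than this, and the numbers below are made up.

```python
def effective_emissivity(air_temp_k, brightness_temp_k):
    """Mean ratio Tb / Ta over days where both values are known (simplified model)."""
    ratios = [
        tb / ta
        for ta, tb in zip(air_temp_k, brightness_temp_k)
        if ta is not None and tb is not None
    ]
    if not ratios:
        raise ValueError("no overlapping observations to calibrate against")
    return sum(ratios) / len(ratios)

def fill_gaps(air_temp_k, brightness_temp_k, emissivity):
    """Keep measured air temperatures; replace gaps with Tb / emissivity when possible."""
    filled = []
    for ta, tb in zip(air_temp_k, brightness_temp_k):
        if ta is not None:
            filled.append(ta)
        elif tb is not None:
            filled.append(tb / emissivity)
        else:
            filled.append(None)  # no station value and no satellite value
    return filled

# Hypothetical daily values in kelvin; None marks a gap in the station record.
station_air_temp = [250.0, None, 240.0, None]
satellite_brightness_temp = [200.0, 196.0, 192.0, None]

emissivity = effective_emissivity(station_air_temp, satellite_brightness_temp)
filled_air_temp = fill_gaps(station_air_temp, satellite_brightness_temp, emissivity)
```

Measured station values are never overwritten, and days that lack both station and satellite data remain gaps.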
MS Excel data files and GIF images derived from the data are available via ftp from the National Snow and Ice Data Center.