Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Some say climate change is the biggest threat of our age while others say it’s a myth based on dodgy science. We are turning some of the data over to you so you can form your own view.
Even more than with other data sets that Kaggle has featured, there’s a huge amount of data cleaning and preparation that goes into putting together a long-time study of climate trends. Early data was collected by technicians using mercury thermometers, where any variation in the visit time impacted measurements. In the 1940s, the construction of airports caused many weather stations to be moved. In the 1980s, there was a move to electronic thermometers that are said to have a cooling bias.
Given this complexity, there are a range of organizations that collate climate trends data. The three most cited land and ocean temperature data sets are NOAA’s MLOST, NASA’s GISTEMP and the UK’s HadCrut.
We have repackaged the data from a newer compilation put together by the Berkeley Earth, which is affiliated with Lawrence Berkeley National Laboratory. The Berkeley Earth Surface Temperature Study combines 1.6 billion temperature reports from 16 pre-existing archives. It is nicely packaged and allows for slicing into interesting subsets (for example by country). They publish the source data and the code for the transformations they applied. They also use methods that allow weather observations from shorter time series to be included, meaning fewer observations need to be thrown away.
In this dataset, we have include several files:
Global Land and Ocean-and-Land Temperatures (GlobalTemperatures.csv):
Other files include:
The raw data comes from the Berkeley Earth data page.
According to an April 2024 survey on climate change conducted in the United States, some ** percent of the respondents claimed they believed that global warming was happening. A much smaller share, ** percent, believed global warming was not happening.
The United States contributed roughly 17 percent of global warming from 1851 to 2023. By contrast, India contributed five percent of warming during this period, despite the country having a far larger population than the United States. In total, G20 countries have contributed approximately three-quarters of global warming to date, while the least developed countries are responsible for just six percent.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This work combines global warming data from various publications and datasets, creating a new dataset covering a very long period - from the year 1 to 2100.
The dataset created in this work separates the actual records for the 1-2024 period from the forecast for the 2020-2100 period.
The work includes separate sets for land+ocean (GW), land only (GWL), and ocean only (GWO).
The online dataset is available on the site nowagreen.com.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The costs of climate change are often estimated in monetary terms but this raises ethical issues. Here we express them in terms of numbers of people left outside the ‘human climate niche’ – defined as the historically highly-conserved distribution of relative human population density with respect to mean annual temperature (MAT). We show that climate change has already put ~9% of people (>600 million) outside this niche. By end-of-century (2080-2100), current policies leading to around 2.7 °C global warming could leave one 22 third (22-39%) of people outside the niche. Reducing global warming from 2.7 to 1.5 °C results in a ~5-fold decrease in the population exposed to unprecedented heat (MAT ≥29 °C). The lifetime emissions of ~3.5 global average citizens today (or ~1.2 average US citizens) expose 1 future person to unprecedented heat by end-of-century. That person comes from a place where emissions today are around half of the global average. These results highlight the need for more decisive policy action to limit the human costs and inequities of climate change.
This dataset contains modeled temperature, ozone, and PM2.5 data for the United States over the 21st century, using two global climate model scenarios and two emissions datasets.
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
Data description
The FAOSTAT Temperature Change domain disseminates statistics of mean surface temperature change by country, with annual updates. The current dissemination covers the period 1961–2023. Statistics are available for monthly, seasonal and annual mean temperature anomalies, i.e., temperature change with respect to a baseline climatology, corresponding to the period 1951–1980. The standard deviation of the temperature change of the baseline methodology is also available. Data are based on the publicly available GISTEMP data, the Global Surface Temperature Change data distributed by the National Aeronautics and Space Administration Goddard Institute for Space Studies (NASA-GISS).
Statistical concepts and definitions
Statistical standards: Data in the Temperature Change domain are not an explicit SEEA variable. Nonetheless, country and regional calculations employ a definition of “Land area” consistent with SEEA Land Use definitions, specifically SEEA CF Table 5.11 “Land Use Classification” and SEEA AFF Table 4.8, “Physical asset account for land use.” The Temperature Change domain of the FAOSTAT Agri-Environmental Indicators section is compliant with the Framework for the Development of Environmental Statistics (FDES 2013), contributing to FDES Component 1: Environmental Conditions and Quality, Sub-component 1.1: Physical Conditions, Topic 1.1.1: Atmosphere, climate and weather, Core set/ Tier 1 statistics a.1.
Statistical unit: Countries and Territories.
Statistical population: Countries and Territories.
Reference area: Area of all the Countries and Territories of the world. In 2019: 190 countries and 37 other territorial entities.
Code - reference area: FAOSTAT, M49, ISO2 and ISO3 (http://www.fao.org/faostat/en/#definitions). FAO Global Administrative Unit Layer (GAUL National level – reference year 2014. FAO Geospatial data repository GeoNetwork. Permanent address: http://www.fao.org:80/geonetwork?uuid=f7e7adb0-88fd-11da-a88f-000d939bc5d8.
Code - Number of countries/areas covered: In 2019: 190 countries and 37 other territorial entities.
Time coverage: 1961-2023
Periodicity: Monthly, Seasonal, Yearly
Base period: 1951-1980
Unit of Measure: Celsius degrees °C
Reference period: Months, Seasons, Meteorological year
Documentation on methodology: Details on the methodology can be accessed at the Related Documents section of the Temperature Change (ET) domain in the Agri-Environmental Indicators section of FAOSTAT.
Quality documentation: For more information on the methods, coverage, accuracy and limitations of the Temperature Change dataset please refer to the NASA GISTEMP website: https://data.giss.nasa.gov/gistemp/
Source: http://www.fao.org/faostat/en/#data/ET/metadata
Climate change is one of the important issues that face the world in this technological era. The best proof of this situation is the historical temperature change. You can investigate if any hope there is for stopping global warming :)
Can you find any correlation between temperature change and any other variable? (Using ISO3 codes for merging any other countries' data sets possible.)
Prediction of temperature change: there is also an overall world temperature change in the country list as 'World'.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A complete description of the dataset is given by Jones et al. (2023). Key information is provided below.
Background
A dataset describing the global warming response to national emissions CO2, CH4 and N2O from fossil and land use sources during 1851-2021.
National CO2 emissions data are collated from the Global Carbon Project (Andrew and Peters, 2024; Friedlingstein et al., 2024).
National CH4 and N2O emissions data are collated from PRIMAP-hist (HISTTP) (Gütschow et al., 2024).
We construct a time series of cumulative CO2-equivalent emissions for each country, gas, and emissions source (fossil or land use). Emissions of CH4 and N2O emissions are related to cumulative CO2-equivalent emissions using the Global Warming Potential (GWP*) approach, with best-estimates of the coefficients taken from the IPCC AR6 (Forster et al., 2021).
Warming in response to cumulative CO2-equivalent emissions is estimated using the transient climate response to cumulative carbon emissions (TCRE) approach, with best-estimate value of TCRE taken from the IPCC AR6 (Forster et al., 2021, Canadell et al., 2021). 'Warming' is specifically the change in global mean surface temperature (GMST).
The data files provide emissions, cumulative emissions and the GMST response by country, gas (CO2, CH4, N2O or 3-GHG total) and source (fossil emissions, land use emissions or the total).
Data records: overview
The data records include three comma separated values (.csv) files as described below.
All files are in ‘long’ format with one value provided in the Data column for each combination of the categorical variables Year, Country Name, Country ISO3 code, Gas, and Component columns.
Component specifies fossil emissions, LULUCF emissions or total emissions of the gas.
Gas specifies CO2, CH4, N2O or the three-gas total (labelled 3-GHG).
Country ISO3 codes are specifically the unique ISO 3166-1 alpha-3 codes of each country.
Data records: specifics
Data are provided relative to 2 reference years (denoted ref_year below): 1850 and 1991. 1850 is a mutual first year of data spanning all input datasets. 1991 is relevant because the United Nations Framework Convention on Climate Change was operationalised in 1992.
EMISSIONS_ANNUAL_{ref_year-20}-2023.csv: Data includes annual emissions of CO2 (Pg CO2 year-1), CH4 (Tg CH4 year-1) and N2O (Tg N2O year-1) during the period ref_year-20 to 2023. The Data column provides values for every combination of the categorical variables. Data are provided from ref_year-20 because these data are required to calculate GWP* for CH4.
EMISSIONS_CUMULATIVE_CO2e100_{ref_year+1}-2023.csv: Data includes the cumulative CO2 equivalent emissions in units Pg CO2-e100 during the period ref_year+1 to 2023 (i.e. since the reference year). The Data column provides values for every combination of the categorical variables.
GMST_response_{ref_year+1}-2023.csv: Data includes the change in global mean surface temperature (GMST) due to emissions of the three gases in units °C during the period ref_year+1 to 2023 (i.e. since the reference year). The Data column provides values for every combination of the categorical variables.
Accompanying Code
Code is available at: https://github.com/jonesmattw/National_Warming_Contributions .
The code requires Input.zip to run (see README at the GitHub link).
Further info: Country Groupings
We also provide estimates of the contributions of various country groupings as defined by the UNFCCC:
And other country groupings:
See COUNTRY_GROUPINGS.xlsx for the lists of countries in each group.
No further editions of this report will be published as it has been replaced by the Agri-climate report 2021.
This annual publication brings together existing statistics on English agriculture in order to help inform the understanding of agriculture and greenhouse gas emissions. The publication summarises available statistics that relate directly and indirectly to emissions and includes statistics on farmer attitudes to climate change mitigation and uptake of mitigation measures. It also incorporates statistics emerging from developing research and provides some international comparisons. It is updated when sufficient new information is available.
Next update: see the statistics release calendar
For further information please contact:
Agri.EnvironmentStatistics@defra.gov.uk
https://www.twitter.com/@defrastats" class="govuk-link">Twitter: @DefraStats
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Many people use these two terms interchangeably, but we think it’s important to acknowledge their differences. Global warming is an increase in the Earth’s average surface temperature from human-made greenhouse gas emissions. On the other hand, climate change refers to the long-term changes in the Earth’s climate, or a region on Earth, and includes more than just the average surface temperature. For example, variations in the amount of snow, sea levels, and sea ice can all be consequences of climate change.
Worldwide Climate Change & Global Warming keyword / topic search in Google Search Engine from 2004 - present
Google Trends Lab
As part of this national strategy, the Ministry of Education, Culture, Sports, Science and Technology (MEXT) had launched a 5-year (FY2007 - 2011) initiative called the Innovative Program of Climate Change Projection for the 21st Century (KAKUSHIN Program), using the Earth Simulator (ES) to address emerging research challenges, such as those derived from the outcomes of the MEXT's Kyosei Project (FY2002 - 2006), that had made substantial contributions to the Fourth Assessment Report (AR4) of the Intergovernmental Panel on Climate Change (IPCC). The KAKUSHIN Program was expected to further contribute to the Fifth Assessment Report (AR5).
The research items include the advancement and forecasting of global warming models, the quantification and reduction of model uncertainty, and the evaluation of the impacts of natural disasters based on forecast information. Much of the data submitted to CMIP5 from Japan were generated under this KAKUSHIN program using the global climate models and the Earth system models developed in Japan. This dataset is the result of using the Global Climate Model MIROC4h.
All CMIP5 data are collected, managed, and published by the Earth System Grid Federation (ESGF), and DIAS serves as an ESGF node. All public datasets, including this dataset, are available from ESGF. For information on how to use these datasets, including this dataset, see "CMIP5 Data - Getting Started" (URL is available in the online information below). Please note that an ESGF account is required to download the CMIP5 data.
Because the terms of use for CMIP5 data are different from CMIP6 in many respects, please check the following Terms of Use carefully: https://pcmdi.llnl.gov/mips/cmip5/terms-of-use.html Currently, all CMIP5 data, including this dataset, is classified as "unrestricted" within it.
By 2099, climate change could be one of the leading causes of death in the world. With an increase of *** degrees Celsius in mean surface temperature compared to a pre-industrial average, it was estimated that around ** people per 100,000 population could die in that year due to effects caused by climate change. Only death rates from heart disease and strokes would surpass that value.
This survey shows the concerns of U.S. Americans about the environmental threat of global warming from 1989 to 2021. As of March 2021, 43 percent of the respondents were worried "a great deal" about global warming.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Comprehensive dataset tracking climate change indicators including global warming, species extinction rates, deforestation, pollution levels, and environmental destruction across 196 countries.
This dataset includes processed climate change datasets related to climatology, hydrology, and water operations. The climatological data provided are change factors for precipitation and reference evapotranspiration gridded over the entire State. The hydrological data provided are projected stream inflows for major streams in the Central Valley, and streamflow change factors for areas outside of the Central Valley and smaller ungaged watersheds within the Central Valley. The water operations data provided are Central Valley reservoir outflows, diversions, and State Water Project (SWP) and Central Valley Project (CVP) water deliveries and select streamflow data. Most of the Central Valley inflows and all of the water operations data were simulated using the CalSim II model and produced for all projections.
These data were originally developed for the California Water Commission’s Water Storage Investment Program (WSIP). The WSIP data used as the basis for these climate change resources along with the technical reference document are located here: https://data.cnra.ca.gov/dataset/climate-change-projections-wsip-2030-2070. Additional processing steps were performed to improve user experience, ease of use for GSP development, and for Sustainable Groundwater Management Act (SGMA) implementation. Furthermore, the data, tools, and guidance may be useful for purposes other than sustainable groundwater management under SGMA.
Data are provided for projected climate conditions centered around 2030 and 2070. The climate projections are provided for these two future climate periods, and include one scenario for 2030 and three scenarios for 2070: a 2030 central tendency, a 2070 central tendency, and two 2070 extreme scenarios (i.e., one drier with extreme warming and one wetter with moderate warming). The climate scenario development process represents a climate period analysis where historical interannual variability from January 1915 through December 2011 is preserved while the magnitude of events may be increased or decreased based on projected changes in precipitation and air temperature from general circulation models.
DWR has collaborated with Lawrence Berkeley National Laboratory to improve the quality of the 2070 extreme scenarios. The 2070 extreme scenario update utilizes an improved climate period analysis method known as "quantile delta mapping" to better capture the GCM-projected change in temperature and precipitation. A technical note on the background and results of this process is provided here: https://data.cnra.ca.gov/dataset/extreme-climate-change-scenarios-for-water-supply-planning/resource/f2e1c61a-4946-4863-825f-e6d516b433ed.
Note: the original version of the 2070 extreme scenarios can be accessed in the archive posted here: https://data.cnra.ca.gov/dataset/sgma-climate-change-resources/resource/51b6ee27-4f78-4226-8429-86c3a85046f4
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Derived climate model projections data produced as part of the UK Climate Projections 2018 (UKCP18) project. The data produced by the UK Met Office Hadley Centre provides information on changes in 21st century climate for the UK helping to inform adaptation to a changing climate.
The derived climate model projections are estimated using a methodology based on time shift and other statistical approaches applied to a set of 28 projections comprising of 15 coupled simulations produced by the Met Office Hadley Centre, and 13 coupled simulations from CMIP5. The derived climate model projections exist for the RCP2.6 emissions scenario and for 2°C and 4°C global warming above pre-industrial levels.
The derived climate model projections are provided on a 60km spatial grid for the UK region and the projections consist of time series for the RCP2.6 emissions scenario that cover 1900-2100 and a 50 year time series for each of the global warming levels.
This dataset contains realisations scenario with global warming stabilised at 4°C
This data-set was collected to replicate the findings of Schuldt et al. (2011). It contains data from the UK, USA, and Australia collected between 2nd of January 2018 and the 29th of April 2019. It measures individuals political party, and belief in environmental phenomena.
This dataset includes processed climate change datasets related to climatology, hydrology, and water operations. The climatological data provided are change factors for precipitation and reference evapotranspiration gridded over the entire State. The hydrological data provided are projected stream inflows for major streams in the Central Valley, and streamflow change factors for areas outside of the Central Valley and smaller ungaged watersheds within the Central Valley. The water operations data provided are Central Valley reservoir outflows, diversions, and State Water Project (SWP) and Central Valley Project (CVP) water deliveries and select streamflow data. Most of the Central Valley inflows and all of the water operations data were simulated using the CalSim II model and produced for all projections.
These data were originally developed for the California Water Commission’s Water Storage Investment Program (WSIP). The WSIP data used as the basis for these climate change resources along with the technical reference document are located here: https://data.cnra.ca.gov/dataset/climate-change-projections-wsip-2030-2070. Additional processing steps were performed to improve user experience, ease of use for GSP development, and for Sustainable Groundwater Management Act (SGMA) implementation. Furthermore, the data, tools, and guidance may be useful for purposes other than sustainable groundwater management under SGMA.
Data are provided for projected climate conditions centered around 2030 and 2070. The climate projections are provided for these two future climate periods, and include one scenario for 2030 and three scenarios for 2070: a 2030 central tendency, a 2070 central tendency, and two 2070 extreme scenarios (i.e., one drier with extreme warming and one wetter with moderate warming). The climate scenario development process represents a climate period analysis where historical interannual variability from January 1915 through December 2011 is preserved while the magnitude of events may be increased or decreased based on projected changes in precipitation and air temperature from general circulation models.
DWR has collaborated with Lawrence Berkeley National Laboratory to improve the quality of the 2070 extreme scenarios. The 2070 extreme scenario update utilizes an improved climate period analysis method known as "quantile delta mapping" to better capture the GCM-projected change in temperature and precipitation. A technical note on the background and results of this process is provided here: https://data.cnra.ca.gov/dataset/extreme-climate-change-scenarios-for-water-supply-planning/resource/f2e1c61a-4946-4863-825f-e6d516b433ed.
Note: the original version of the 2070 extreme scenarios can be accessed in the archive posted here: https://data.cnra.ca.gov/dataset/sgma-climate-change-resources/resource/51b6ee27-4f78-4226-8429-86c3a85046f4
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Introduction:
Carbon dioxide (CO₂) is the most abundant greenhouse gas emitted by human activities. The burning of fossil fuels, such as coal, oil, and natural gas, is the primary source of CO₂ emissions. Other human activities, such as deforestation and industrial processes, also contribute to CO₂ emissions.
CO₂ emissions are a major driver of climate change. Climate change is the long-term alteration of temperature and typical weather patterns in a place. Climate change could refer to a particular location or the planet as a whole. Climate change may cause weather patterns to be less predictable. A region might experience lower or higher-than-average temperatures. Climate change may cause more frequent and severe weather events, such as storms, floods, and droughts.
Hurricane Beryl: Caribbean leader calls out rich countries for climate failures as ‘horrendous’ storm makes landfall. link
The effects of climate change are already being felt around the world, and they are expected to become more severe in the future. These effects include rising sea levels, more extreme weather events, changes in precipitation patterns, and loss of biodiversity.
Carl Sagan testified before Congress in 1985 on climate change.link
Unveiling the Composition of Air - A Glimpse into Earth's Breath: - Introduction: - Air, an essential and seemingly ubiquitous part of our daily lives, has a composition that remained a mystery for centuries. Only through groundbreaking discoveries and experiments did scientists unravel the complex tapestry of gases that make up Earth's atmosphere. - The Elemental Journey: - Ancient Greeks, over 2,500 years ago, identified air, along with earth, fire, and water, as fundamental elements of creation. However, it took until the late 18th century for this perspective to be challenged by the daring English chemist Joseph Priestley. His experiments shattered the belief that air was an elementary substance, revealing it as a composite mixture of gases. - Priestley's Revelation: - In 1774, Priestley's experiments led to the identification of "dephlogisticated air," later named oxygen by Antoine Lavoisier. This groundbreaking discovery marked the end of a 23-century-old notion that air was indestructible and unalterable, opening the door to a more sophisticated understanding of the gases enveloping our planet. - The Mid-18th Century and Elemental Evolution: - During the mid-18th century, the concept of elements was evolving, with researchers grappling to understand the nature of air and its role in chemical processes. The era saw a surge of interest in gases, driven by the transformative impact of the steam engine on civilisation. - British Chemists and Gas Exploration: - British chemists, including Joseph Black and Henry Cavendish, played a pivotal role in exploring gases. Black identified "fixed air" (now carbon dioxide) in 1754, and Cavendish produced hydrogen in 1766. - Daniel Rutherford's 1772 experiments led to the discovery of nitrogen, as he burned material in a bell jar, leaving behind a gas termed "noxious air." - Conclusion: - The composition of air, once thought to be a simple and unalterable substance, has been unveiled through the efforts of pioneering scientists. Priestley's revelation of oxygen, along with contributions from fellow chemists, has reshaped our understanding of the gases that constitute Earth's atmosphere. In this delicate balance of elements, air reveals itself as a dynamic and essential force, sustaining life on our planet.
The composition of air.
The air we breathe is primarily composed of a mixture of gases. The approximate composition of dry air, excluding water vapour, is as follows:
Additionally, trace amounts of other gases, such as ozone (O₃), nitrogen oxides (NOx), and sulphur dioxide (SO₂), are present in the atmosphere.
It's important to note that these percentages can vary slightly based on factors such as location, altitude, and environmental conditions. Water vapour is also a variable component of the atmosphere, with its concentration ranging from near zero to about 4% in humid conditions.
Greenhouse Effect:
The study of the greenhouse effect and global warming has a fascinating history, and it's amazing to see how early scientists like Arrhenius and Callendar laid the ground...
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
If you use the dataset, cite the paper: https://doi.org/10.1016/j.eswa.2022.117541
The most comprehensive dataset to date regarding climate change and human opinions via Twitter. It has the heftiest temporal coverage, spanning over 13 years, includes over 15 million tweets spatially distributed across the world, and provides the geolocation of most tweets. Seven dimensions of information are tied to each tweet, namely geolocation, user gender, climate change stance and sentiment, aggressiveness, deviations from historic temperature, and topic modeling, while accompanied by environmental disaster events information. These dimensions were produced by testing and evaluating a plethora of state-of-the-art machine learning algorithms and methods, both supervised and unsupervised, including BERT, RNN, LSTM, CNN, SVM, Naive Bayes, VADER, Textblob, Flair, and LDA.
The following columns are in the dataset:
➡ created_at: The timestamp of the tweet. ➡ id: The unique id of the tweet. ➡ lng: The longitude the tweet was written. ➡ lat: The latitude the tweet was written. ➡ topic: Categorization of the tweet in one of ten topics namely, seriousness of gas emissions, importance of human intervention, global stance, significance of pollution awareness events, weather extremes, impact of resource overconsumption, Donald Trump versus science, ideological positions on global warming, politics, and undefined. ➡ sentiment: A score on a continuous scale. This scale ranges from -1 to 1 with values closer to 1 being translated to positive sentiment, values closer to -1 representing a negative sentiment while values close to 0 depicting no sentiment or being neutral. ➡ stance: That is if the tweet supports the belief of man-made climate change (believer), if the tweet does not believe in man-made climate change (denier), and if the tweet neither supports nor refuses the belief of man-made climate change (neutral). ➡ gender: Whether the user that made the tweet is male, female, or undefined. ➡ temperature_avg: The temperature deviation in Celsius and relative to the January 1951-December 1980 average at the time and place the tweet was written. ➡ aggressiveness: That is if the tweet contains aggressive language or not.
Since Twitter forbids making public the text of the tweets, in order to retrieve it you need to do a process called hydrating. Tools such as Twarc or Hydrator can be used to hydrate tweets.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Some say climate change is the biggest threat of our age while others say it’s a myth based on dodgy science. We are turning some of the data over to you so you can form your own view.
Even more than with other data sets that Kaggle has featured, there’s a huge amount of data cleaning and preparation that goes into putting together a long-time study of climate trends. Early data was collected by technicians using mercury thermometers, where any variation in the visit time impacted measurements. In the 1940s, the construction of airports caused many weather stations to be moved. In the 1980s, there was a move to electronic thermometers that are said to have a cooling bias.
Given this complexity, there are a range of organizations that collate climate trends data. The three most cited land and ocean temperature data sets are NOAA’s MLOST, NASA’s GISTEMP and the UK’s HadCrut.
We have repackaged the data from a newer compilation put together by the Berkeley Earth, which is affiliated with Lawrence Berkeley National Laboratory. The Berkeley Earth Surface Temperature Study combines 1.6 billion temperature reports from 16 pre-existing archives. It is nicely packaged and allows for slicing into interesting subsets (for example by country). They publish the source data and the code for the transformations they applied. They also use methods that allow weather observations from shorter time series to be included, meaning fewer observations need to be thrown away.
In this dataset, we have include several files:
Global Land and Ocean-and-Land Temperatures (GlobalTemperatures.csv):
Other files include:
The raw data comes from the Berkeley Earth data page.