100+ datasets found
  1. v

    Open data change log

    • opendata.vancouver.ca
    csv, excel, json
    Updated Nov 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Open data change log [Dataset]. https://opendata.vancouver.ca/explore/dataset/open-data-change-log/
    Explore at:
    csv, json, excelAvailable download formats
    Dataset updated
    Nov 18, 2025
    License

    https://opendata.vancouver.ca/pages/licence/https://opendata.vancouver.ca/pages/licence/

    Description

    ​Significant changes to the open data catalogue, including new datasets added, datasets renamed or retired, quarterly or annual updates to high-impact datasets, changes to data structure or definition. Smaller changes, such as adding or editing records or renaming a field in an existing dataset are not included. NoteThis log is published in the interest of transparency into the work of the open data program. You can subscribe to updates for a specific dataset by creating an account on the portal then clicking on the Follow button on the Information tab of any dataset. You can get updates by subscribing to our email newsletter. Data currency​New records will be added whenever a significant change is made to the open data catalogue.

  2. d

    2018 mean high water shoreline of the coast of MA used in shoreline change...

    • catalog.data.gov
    • data.usgs.gov
    • +2more
    Updated Nov 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). 2018 mean high water shoreline of the coast of MA used in shoreline change analysis [Dataset]. https://catalog.data.gov/dataset/2018-mean-high-water-shoreline-of-the-coast-of-ma-used-in-shoreline-change-analysis
    Explore at:
    Dataset updated
    Nov 21, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Description

    The Massachusetts Office of Coastal Zone Management launched the Shoreline Change Project in 1989 to identify erosion-prone areas of the coast by compiling a database of historical (mid 1800's-1989) shoreline positions. Trends of shoreline position over long and short-term timescales provide information to landowners, managers, and potential buyers about possible future impacts to coastal resources and infrastructure. In 2001, a 1994 shoreline was added to calculate both long- and short-term shoreline change rates at 40-meter intervals along ocean-facing sections of the Massachusetts coast. In the 2013 update, two oceanfront shorelines for Massachusetts were added using 2008-2009 color aerial orthoimagery and 2007 topographic lidar datasets obtained from NOAA's Ocean Service, Coastal Services Center. In the 2018 update, two new mean high water (MHW) shorelines for the Massachusetts coast extracted from lidar data between 2010-2014 were added to the dataset. This 2021 update includes one new shoreline extracted from lidar data collected in 2018 by the U.S. Army Corps of Engineers (USACE) Joint Airborne Lidar Bathymetry Technical Center of Expertise (JALBTCX). This new shoreline was extracted for the North Shore, South Shore, Cape Cod Bay, Outer Cape, South Cape, Nantucket, Martha’s Vineyard, and a small portion Buzzard’s Bay. 2018 Lidar data was not available for Boston or the Elizabeth Islands region. This new shoreline was extracted from the lidar survey using either a profile method or contour method, depending on the location of the shoreline. This data release also includes a compilation of previously published historical shoreline positions spanning 170 years (1844 to 2014), intended to be used as an authoritative shoreline database for the state. This data is an update to the Massachusetts Office of Coastal Zone Management Shoreline Change Project.

  3. m

    Dataset of development of business during the COVID-19 crisis

    • data.mendeley.com
    • narcis.nl
    Updated Nov 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tatiana N. Litvinova (2020). Dataset of development of business during the COVID-19 crisis [Dataset]. http://doi.org/10.17632/9vvrd34f8t.1
    Explore at:
    Dataset updated
    Nov 9, 2020
    Authors
    Tatiana N. Litvinova
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    To create the dataset, the top 10 countries leading in the incidence of COVID-19 in the world were selected as of October 22, 2020 (on the eve of the second full of pandemics), which are presented in the Global 500 ranking for 2020: USA, India, Brazil, Russia, Spain, France and Mexico. For each of these countries, no more than 10 of the largest transnational corporations included in the Global 500 rating for 2020 and 2019 were selected separately. The arithmetic averages were calculated and the change (increase) in indicators such as profitability and profitability of enterprises, their ranking position (competitiveness), asset value and number of employees. The arithmetic mean values of these indicators for all countries of the sample were found, characterizing the situation in international entrepreneurship as a whole in the context of the COVID-19 crisis in 2020 on the eve of the second wave of the pandemic. The data is collected in a general Microsoft Excel table. Dataset is a unique database that combines COVID-19 statistics and entrepreneurship statistics. The dataset is flexible data that can be supplemented with data from other countries and newer statistics on the COVID-19 pandemic. Due to the fact that the data in the dataset are not ready-made numbers, but formulas, when adding and / or changing the values in the original table at the beginning of the dataset, most of the subsequent tables will be automatically recalculated and the graphs will be updated. This allows the dataset to be used not just as an array of data, but as an analytical tool for automating scientific research on the impact of the COVID-19 pandemic and crisis on international entrepreneurship. The dataset includes not only tabular data, but also charts that provide data visualization. The dataset contains not only actual, but also forecast data on morbidity and mortality from COVID-19 for the period of the second wave of the pandemic in 2020. The forecasts are presented in the form of a normal distribution of predicted values and the probability of their occurrence in practice. This allows for a broad scenario analysis of the impact of the COVID-19 pandemic and crisis on international entrepreneurship, substituting various predicted morbidity and mortality rates in risk assessment tables and obtaining automatically calculated consequences (changes) on the characteristics of international entrepreneurship. It is also possible to substitute the actual values identified in the process and following the results of the second wave of the pandemic to check the reliability of pre-made forecasts and conduct a plan-fact analysis. The dataset contains not only the numerical values of the initial and predicted values of the set of studied indicators, but also their qualitative interpretation, reflecting the presence and level of risks of a pandemic and COVID-19 crisis for international entrepreneurship.

  4. S

    The global industrial value-added dataset under different global change...

    • scidb.cn
    Updated Aug 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Song Wei; li huan huan; Duan Jianping; Li Han; Xue Qian; Zhang Xuyang (2024). The global industrial value-added dataset under different global change scenarios (2010, 2030, and 2050) [Dataset]. http://doi.org/10.57760/sciencedb.11406
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 6, 2024
    Dataset provided by
    Science Data Bank
    Authors
    Song Wei; li huan huan; Duan Jianping; Li Han; Xue Qian; Zhang Xuyang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description
    1. Temporal Coverage of Data: The data collection periods are 2010, 2030, and 2050.2. Spatial Coverage and Projection:Spatial Coverage: GlobalLongitude: -180° - 180°Latitude: -90° - 90°Projection: GCS_WGS_19843. Disciplinary Scope: The data pertains to the fields of Earth Sciences and Geography.4. Data Volume: The total data volume is approximately 31.5 MB.5. Data Type: Raster (GeoTIFF)6. Thumbnail (illustrating dataset content or observation process/scene): · 7. Field (Feature) Name Explanation:a. Name Explanation: IND: Industrial Value Addedb. Unit of Measurement: Unit: US Dollars (USD)8. Data Source Description:a. Remote Sensing Data:2010 Global Vegetation Index data (Enhanced Vegetation Index, EVI, from MODIS monthly average data) and 2010 Nighttime Light Remote Sensing data (DMSP/OLS)b. Meteorological Data:From the CMCC-CM model in the Fifth International Coupled Model Intercomparison Project (CMIP5) published by the United Nations Intergovernmental Panel on Climate Change (IPCC)c. Statistical Data:From the World Development Indicators dataset of the World Bank and various national statistical agenciesd. Gross Domestic Product Data:Sourced from the project "Study on the Harmful Processes of Population and Economic Systems under Global Change" under the National Key R&D Program "Mechanisms and Assessment of Risks in Population and Economic Systems under Global Change," led by Researcher Sun Fubao at the Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciencese. Other Data:Rivers, roads, settlements, and DEM, sourced from the National Oceanic and Atmospheric Administration (NOAA), Global Risk Data Platform, and Natural Earth9. Data Processing Methods(1) Spatialization of Baseline Industrial Value Added: Using 2010 global EVI vegetation index data and nighttime light remote sensing data, we addressed the oversaturation issue in nighttime light data by constructing an adjusted nighttime light index to obtain the optimal global light data. The EANTIL model was developed using NTL, NTLn, and EVI data, with the following formula:Here, EANTLI represents the adjusted nighttime light index, NTL represents the original nighttime light intensity value, and NTLn represents the normalized nighttime light intensity value. Based on the optimal light index EANTLI and the industrial value-added data from the World Bank, we constructed a regression allocation model to derive industrial value added (I), generating the global 2010 industrial value-added data with the formula:Here, I represents the industrial value added for each grid cell, and Ii represents the industrial value added for each country, EANTLi derived from ArcGIS statistical analysis and the regression allocation model.(2) Spatial Boundaries for Future Industrial Value Added: Using the Logistic-CA-Markov simulation principle and global land use data from 2010 and 2015 (from the European Space Agency), we simulated national land use changes for 2030 and 2050 and extracted urban land data as the spatial boundaries for future industrial value added. To comprehensively characterize the influence of different factors on land use and considering the research scale, we selected elevation, slope, population, GDP, distance to rivers, and distance to roads as land use driving factors. Accuracy validation using global 2015 land use data showed an average accuracy of 91.89%.(3) Estimation of Future Industrial Value Added: Based on machine learning and using the random forest model, we constructed spatialization models for industrial value added under different climate change scenarios: Here, tem represents temperature, prep represents precipitation, GDP represents national economic output, L represents urban land, D represents slope, and P represents population. The random forest model was constructed using factors such as 2010 industrial value added, urban land distribution, elevation, slope, distances to rivers, roads, railways (considering transportation), and settlements (considering noise and environmental pollution from industrial buildings), along with temperature and precipitation as climate scenario data. Except for varying temperature and precipitation values across scenarios, other variables remained constant. The model comprised 100 decision trees, with each iteration randomly selecting 90% of the samples for model construction and using the remaining 10% as test data, achieving a training sample accuracy of 0.94 and a test sample accuracy of 0.81.By analyzing the proportion of industrial value added to GDP (average from 2000 to 2020, data from the World Bank) and projected GDP under future Shared Socioeconomic Pathways (SSPs), we derived future industrial value added for each country under different SSP scenarios. Using these projections, we constructed regression models to allocate future industrial value added proportionally, resulting in spatial distribution data for 2030 and 2050 under different SSP scenarios.10. Applications and Achievements of the Dataseta. Primary Application Areas: This dataset is mainly applied in environmental protection, ecological construction, pollution prevention and control, and the prevention and forecasting of natural disasters.b. Achievements in Application (Awards, Published Reports and Articles):Achievements: Developed a method for downscaling national-scale industrial value-added data by integrating DMSP/OLS nighttime light data, vegetation distribution, and other data. Published the global industrial value-added dataset.
  5. I

    Cline Center Coup d’État Project Dataset

    • databank.illinois.edu
    • aws-databank-alb.library.illinois.edu
    Updated Jan 31, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Buddy Peyton; Joseph Bajjalieh; Michael Martin; Sam Alahi; Norah Fadell; Maddie Jeralds (2025). Cline Center Coup d’État Project Dataset [Dataset]. http://doi.org/10.13012/B2IDB-9651987_V8
    Explore at:
    Dataset updated
    Jan 31, 2025
    Authors
    Buddy Peyton; Joseph Bajjalieh; Michael Martin; Sam Alahi; Norah Fadell; Maddie Jeralds
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Coups d'Ètat are important events in the life of a country. They constitute an important subset of irregular transfers of political power that can have significant and enduring consequences for national well-being. There are only a limited number of datasets available to study these events (Powell and Thyne 2011, Marshall and Marshall 2019). Seeking to facilitate research on post-WWII coups by compiling a more comprehensive list and categorization of these events, the Cline Center for Advanced Social Research (previously the Cline Center for Democracy) initiated the Coup d’État Project as part of its Societal Infrastructures and Development (SID) project. More specifically, this dataset identifies the outcomes of coup events (i.e., realized, unrealized, or conspiracy) the type of actor(s) who initiated the coup (i.e., military, rebels, etc.), as well as the fate of the deposed leader. Version 2.2.0 adds 94 additional coup events. 66 of these came from examining Powell and Thyne’s “discarded” events and 28 of these events were added to the data set in the normal annual review of potential new coup events. This version also updates the coding to events in Brazil in 1945 and the Congo in 1968. Version 2.1.3 adds 19 additional coup events to the data set, corrects the date of a coup in Tunisia, and reclassifies an attempted coup in Brazil in December 2022 as a conspiracy. Version 2.1.2 added 6 additional coup events that occurred in 2022 and updated the coding of an attempted coup event in Kazakhstan in January 2022. Version 2.1.1 corrected a mistake in version 2.1.0, where the designation of “dissident coup” had been dropped in error for coup_id: 00201062021. Version 2.1.1 fixed this omission by marking the case as both a dissident coup and an auto-coup. Version 2.1.0 added 36 cases to the data set and removed two cases from the v2.0.0 data. This update also added actor coding for 46 coup events and added executive outcomes to 18 events from version 2.0.0. A few other changes were made to correct inconsistencies in the coup ID variable and the date of the event. Version 2.0.0 improved several aspects of the previous version (v1.0.0) and incorporated additional source material to include: • Reconciling missing event data • Removing events with irreconcilable event dates • Removing events with insufficient sourcing (each event needs at least two sources) • Removing events that were inaccurately coded as coup events • Removing variables that fell below the threshold of inter-coder reliability required by the project • Removing the spreadsheet ‘CoupInventory.xls’ because of inadequate attribution and citations in the event summaries • Extending the period covered from 1945-2005 to 1945-2019 • Adding events from Powell and Thyne’s Coup Data (Powell and Thyne, 2011) Version 1.0.0 was released in 2013. This version consolidated coup data taken from the following sources: • The Center for Systemic Peace (Marshall and Marshall, 2007) • The World Handbook of Political and Social Indicators (Taylor and Jodice, 1983) • Coup d’Ètat: A Practical Handbook (Luttwak, 1979) • The Cline Center’s Social, Political and Economic Event Database (SPEED) Project (Nardulli, Althaus and Hayes, 2015) • Government Change in Authoritarian Regimes – 2010 Update (Svolik and Akcinaroglu, 2006)
    Items in this Dataset 1. Cline Center Coup d'État Codebook v.2.2.0 Codebook.pdf - This 17-page document describes the Cline Center Coup d’État Project dataset. The first section of this codebook provides a summary of the different versions of the data. The second section provides a succinct definition of a coup d’état used by the Coup d'État Project and an overview of the categories used to differentiate the wide array of events that meet the project's definition. It also defines coup outcomes. The third section describes the methodology used to produce the data. Revised January 2025 2. Coup Data v2.2.0.csv - This CSV (Comma Separated Values) file contains all of the coup event data from the Cline Center Coup d’État Project. It contains 29 variables and 1094 observations. Revised January 2025 3. Source Document v2.2.0.pdf - This 347-page document provides the sources used for each of the coup events identified in this dataset. Please use the value in the coup_id variable to identify the sources used to identify that particular event. Revised January 2025 4. README.md - This file contains useful information for the user about the dataset. It is a text file written in markdown language. Revised January 2025
    Citation Guidelines 1. To cite the codebook (or any other documentation associated with the Cline Center Coup d’État Project Dataset) please use the following citation: Peyton, Buddy, Joseph Bajjalieh, Dan Shalmon, Michael Martin, Jonathan Bonaguro, and Scott Althaus. 2025. “Cline Center Coup d’État Project Dataset Codebook”. Cline Center Coup d’État Project Dataset. Cline Center for Advanced Social Research. V.2.2.0. Janurary 30. University of Illinois Urbana-Champaign. doi: 10.13012/B2IDB-9651987_V8 2. To cite data from the Cline Center Coup d’État Project Dataset please use the following citation (filling in the correct date of access): Peyton, Buddy, Joseph Bajjalieh, Michael Martin, Sam Alahi, Norah Fadell, and Maddie Jeralds. 2025. Cline Center Coup d’État Project Dataset. Cline Center for Advanced Social Research. V.2.2.0. Janurary 30. University of Illinois Urbana-Champaign. doi: 10.13012/B2IDB-9651987_V8

  6. d

    Intersects for the Outer Cape Cod, Massachusetts, generated to calculate...

    • catalog.data.gov
    Updated Nov 13, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Intersects for the Outer Cape Cod, Massachusetts, generated to calculate shoreline change rates using the Digital Shoreline Analysis System version 5.1 [Dataset]. https://catalog.data.gov/dataset/intersects-for-the-outer-cape-cod-massachusetts-generated-to-calculate-shoreline-change-ra
    Explore at:
    Dataset updated
    Nov 13, 2025
    Dataset provided by
    U.S. Geological Survey
    Area covered
    Cape Cod, Massachusetts
    Description

    The Massachusetts Office of Coastal Zone Management launched the Shoreline Change Project in 1989 to identify erosion-prone areas of the coast and support local land-use decisions. Trends of shoreline position over long and short-term timescales provide information to landowners, managers, and potential buyers about possible future impacts to coastal resources and infrastructure. In 2001, a 1994 shoreline was added to calculate both long- and short-term shoreline change rates along ocean-facing sections of the Massachusetts coast. In 2013 two oceanfront shorelines for Massachusetts were added using 2008-2009 color aerial orthoimagery and 2007 topographic lidar datasets obtained from NOAA's Ocean Service, Coastal Services Center. In 2018, two new mean high water (MHW) shorelines for the Massachusetts coast extracted from lidar data between 2010-2014 were added to the dataset. This 2021 data release includes rates that incorporate one new shoreline from lidar data extracted in 2018 by the U.S. Army Corps of Engineers (USACE) Joint Airborne Lidar Bathymetry Technical Center of Expertise (JALBTCX), added to the existing database of all historical shorelines (1844-2014), for the North Shore, South Shore, Cape Cod Bay, Outer Cape, Buzzard’s Bay, South Cape, Nantucket, and Martha’s Vineyard. 2018 lidar data did not cover the Boston or Elizabeth Islands regions. Included in this data release is a proxy-datum bias reference line that accounts for the positional difference in a proxy shoreline (the high water Line shoreline) and a datum shoreline (the mean high water shoreline. This issue is explained further in Ruggiero and List (2009) and in the process steps of the metadata associated with the rates. This release includes both long-term (~150+ years) and short term (~30 years) rates. Files associated with the long-term rates have “LT” in their names, files associated with short-term rates have "ST” in their names.

  7. d

    Baselines for the Outer Cape Cod, Massachusetts, generated to calculate...

    • catalog.data.gov
    • s.cnmilf.com
    Updated Nov 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Baselines for the Outer Cape Cod, Massachusetts, generated to calculate shoreline change rates using the Digital Shoreline Analysis System version 5.1 [Dataset]. https://catalog.data.gov/dataset/baselines-for-the-outer-cape-cod-massachusetts-generated-to-calculate-shoreline-change-rat
    Explore at:
    Dataset updated
    Nov 26, 2025
    Dataset provided by
    U.S. Geological Survey
    Area covered
    Cape Cod, Massachusetts
    Description

    The Massachusetts Office of Coastal Zone Management launched the Shoreline Change Project in 1989 to identify erosion-prone areas of the coast by compiling a database of historical (mid 1800's-1989) shoreline positions. Trends of shoreline position over long and short-term timescales provide information to landowners, managers, and potential buyers about possible future impacts to coastal resources and infrastructure. In 2001, a 1994 shoreline was added to calculate both long- and short-term shoreline change rates along ocean-facing sections of the Massachusetts coast. In 2013, two oceanfront shorelines for Massachusetts were added using 2008-2009 color aerial orthoimagery and 2007 topographic lidar datasets obtained from the National Oceanic and Atmospheric Administration's Ocean Service (NOAA), Coastal Services Center. In 2018, two new mean high water (MHW) shorelines for the Massachusetts coast extracted from lidar data between 2010-2014 were added to the dataset. This 2021 data release includes rates that incorporate one new shoreline from lidar data extracted in 2018 by the U.S. Army Corps of Engineers (USACE) Joint Airborne Lidar Bathymetry Technical Center of Expertise (JALBTCX), added to the existing database of all historical shorelines (1844-2014), for the North Shore, South Shore, Cape Cod Bay, Outer Cape, Buzzard’s Bay, South Cape, Nantucket, and Martha’s Vineyard. 2018 lidar data did not cover the Boston or Elizabeth Islands regions. Included in this data release is a proxy-datum bias reference line that accounts for the positional difference in a proxy shoreline (the High Water Line shoreline) and a datum shoreline (the Mean High Water shoreline). This issue is explained further in Ruggiero and List (2009) and in the process steps of the metadata associated with the rates. This release includes both long-term (~150+ years) and short term (~30 years) rates. Files associated with the long-term rates have “LT” in their names, files associated with short-term rates have "ST” in their names.

  8. Data from: A dataset to model Levantine landcover and land-use change...

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    zip
    Updated Dec 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael Kempf; Michael Kempf (2023). A dataset to model Levantine landcover and land-use change connected to climate change, the Arab Spring and COVID-19 [Dataset]. http://doi.org/10.5281/zenodo.10396148
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 16, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Michael Kempf; Michael Kempf
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 16, 2023
    Area covered
    Levant
    Description

    Overview

    This dataset is the repository for the following paper submitted to Data in Brief:

    Kempf, M. A dataset to model Levantine landcover and land-use change connected to climate change, the Arab Spring and COVID-19. Data in Brief (submitted: December 2023).

    The Data in Brief article contains the supplement information and is the related data paper to:

    Kempf, M. Climate change, the Arab Spring, and COVID-19 - Impacts on landcover transformations in the Levant. Journal of Arid Environments (revision submitted: December 2023).

    Description/abstract

    The Levant region is highly vulnerable to climate change, experiencing prolonged heat waves that have led to societal crises and population displacement. Since 2010, the area has been marked by socio-political turmoil, including the Syrian civil war and currently the escalation of the so-called Israeli-Palestinian Conflict, which strained neighbouring countries like Jordan due to the influx of Syrian refugees and increases population vulnerability to governmental decision-making. Jordan, in particular, has seen rapid population growth and significant changes in land-use and infrastructure, leading to over-exploitation of the landscape through irrigation and construction. This dataset uses climate data, satellite imagery, and land cover information to illustrate the substantial increase in construction activity and highlights the intricate relationship between climate change predictions and current socio-political developments in the Levant.

    Folder structure

    The main folder after download contains all data, in which the following subfolders are stored are stored as zipped files:

    “code” stores the above described 9 code chunks to read, extract, process, analyse, and visualize the data.

    “MODIS_merged” contains the 16-days, 250 m resolution NDVI imagery merged from three tiles (h20v05, h21v05, h21v06) and cropped to the study area, n=510, covering January 2001 to December 2022 and including January and February 2023.

    “mask” contains a single shapefile, which is the merged product of administrative boundaries, including Jordan, Lebanon, Israel, Syria, and Palestine (“MERGED_LEVANT.shp”).

    “yield_productivity” contains .csv files of yield information for all countries listed above.

    “population” contains two files with the same name but different format. The .csv file is for processing and plotting in R. The .ods file is for enhanced visualization of population dynamics in the Levant (Socio_cultural_political_development_database_FAO2023.ods).

    “GLDAS” stores the raw data of the NASA Global Land Data Assimilation System datasets that can be read, extracted (variable name), and processed using code “8_GLDAS_read_extract_trend” from the respective folder. One folder contains data from 1975-2022 and a second the additional January and February 2023 data.

    “built_up” contains the landcover and built-up change data from 1975 to 2022. This folder is subdivided into two subfolder which contain the raw data and the already processed data. “raw_data” contains the unprocessed datasets and “derived_data” stores the cropped built_up datasets at 5 year intervals, e.g., “Levant_built_up_1975.tif”.

    Code structure

    1_MODIS_NDVI_hdf_file_extraction.R


    This is the first code chunk that refers to the extraction of MODIS data from .hdf file format. The following packages must be installed and the raw data must be downloaded using a simple mass downloader, e.g., from google chrome. Packages: terra. Download MODIS data from after registration from: https://lpdaac.usgs.gov/products/mod13q1v061/ or https://search.earthdata.nasa.gov/search (MODIS/Terra Vegetation Indices 16-Day L3 Global 250m SIN Grid V061, last accessed, 09th of October 2023). The code reads a list of files, extracts the NDVI, and saves each file to a single .tif-file with the indication “NDVI”. Because the study area is quite large, we have to load three different (spatially) time series and merge them later. Note that the time series are temporally consistent.


    2_MERGE_MODIS_tiles.R


    In this code, we load and merge the three different stacks to produce large and consistent time series of NDVI imagery across the study area. We further use the package gtools to load the files in (1, 2, 3, 4, 5, 6, etc.). Here, we have three stacks from which we merge the first two (stack 1, stack 2) and store them. We then merge this stack with stack 3. We produce single files named NDVI_final_*consecutivenumber*.tif. Before saving the final output of single merged files, create a folder called “merged” and set the working directory to this folder, e.g., setwd("your directory_MODIS/merged").


    3_CROP_MODIS_merged_tiles.R


    Now we want to crop the derived MODIS tiles to our study area. We are using a mask, which is provided as .shp file in the repository, named "MERGED_LEVANT.shp". We load the merged .tif files and crop the stack with the vector. Saving to individual files, we name them “NDVI_merged_clip_*consecutivenumber*.tif. We now produced single cropped NDVI time series data from MODIS.
    The repository provides the already clipped and merged NDVI datasets.


    4_TREND_analysis_NDVI.R


    Now, we want to perform trend analysis from the derived data. The data we load is tricky as it contains 16-days return period across a year for the period of 22 years. Growing season sums contain MAM (March-May), JJA (June-August), and SON (September-November). December is represented as a single file, which means that the period DJF (December-February) is represented by 5 images instead of 6. For the last DJF period (December 2022), the data from January and February 2023 can be added. The code selects the respective images from the stack, depending on which period is under consideration. From these stacks, individual annually resolved growing season sums are generated and the slope is calculated. We can then extract the p-values of the trend and characterize all values with high confidence level (0.05). Using the ggplot2 package and the melt function from reshape2 package, we can create a plot of the reclassified NDVI trends together with a local smoother (LOESS) of value 0.3.
    To increase comparability and understand the amplitude of the trends, z-scores were calculated and plotted, which show the deviation of the values from the mean. This has been done for the NDVI values as well as the GLDAS climate variables as a normalization technique.


    5_BUILT_UP_change_raster.R


    Let us look at the landcover changes now. We are working with the terra package and get raster data from here: https://ghsl.jrc.ec.europa.eu/download.php?ds=bu (last accessed 03. March 2023, 100 m resolution, global coverage). Here, one can download the temporal coverage that is aimed for and reclassify it using the code after cropping to the individual study area. Here, I summed up different raster to characterize the built-up change in continuous values between 1975 and 2022.


    6_POPULATION_numbers_plot.R


    For this plot, one needs to load the .csv-file “Socio_cultural_political_development_database_FAO2023.csv” from the repository. The ggplot script provided produces the desired plot with all countries under consideration.


    7_YIELD_plot.R


    In this section, we are using the country productivity from the supplement in the repository “yield_productivity” (e.g., "Jordan_yield.csv". Each of the single country yield datasets is plotted in a ggplot and combined using the patchwork package in R.


    8_GLDAS_read_extract_trend


    The last code provides the basis for the trend analysis of the climate variables used in the paper. The raw data can be accessed https://disc.gsfc.nasa.gov/datasets?keywords=GLDAS%20Noah%20Land%20Surface%20Model%20L4%20monthly&page=1 (last accessed 9th of October 2023). The raw data comes in .nc file format and various variables can be extracted using the [“^a variable name”] command from the spatraster collection. Each time you run the code, this variable name must be adjusted to meet the requirements for the variables (see this link for abbreviations: https://disc.gsfc.nasa.gov/datasets/GLDAS_CLSM025_D_2.0/summary, last accessed 09th of October 2023; or the respective code chunk when reading a .nc file with the ncdf4 package in R) or run print(nc) from the code or use names(the spatraster collection).
    Choosing one variable, the code uses the MERGED_LEVANT.shp mask from the repository to crop and mask the data to the outline of the study area.
    From the processed data, trend analysis are conducted and z-scores were calculated following the code described above. However, annual trends require the frequency of the time series analysis to be set to value = 12. Regarding, e.g., rainfall, which is measured as annual sums and not means, the chunk r.sum=r.sum/12 has to be removed or set to r.sum=r.sum/1 to avoid calculating annual mean values (see other variables). Seasonal subset can be calculated as described in the code. Here, 3-month subsets were chosen for growing seasons, e.g. March-May (MAM), June-July (JJA), September-November (SON), and DJF (December-February, including Jan/Feb of the consecutive year).
    From the data, mean values of 48 consecutive years are calculated and trend analysis are performed as describe above. In the same way, p-values are extracted and 95 % confidence level values are marked with dots on the raster plot. This analysis can be performed with a much longer time series, other variables, ad different spatial extent across the globe due to the availability of the GLDAS variables.

  9. Analysing Second Hand Car Sales Data

    • kaggle.com
    zip
    Updated May 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Anas Sarwar (2024). Analysing Second Hand Car Sales Data [Dataset]. https://www.kaggle.com/datasets/devantltd/analysing-second-hand-car-sales-data
    Explore at:
    zip(1024089 bytes)Available download formats
    Dataset updated
    May 9, 2024
    Authors
    Muhammad Anas Sarwar
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Analysing Second Hand Car Sales Data with Supervised and Unsupervised Learning Models

    The second-hand cars market is a dynamic and very complex sphere which is impacted by different criteria among them - manufacturer, model, engine specification, and fuel consumption, year of production, mileage, and price. In this exercise, we will look through mock data that contains facts on sale of second-or-used cars in UK. The data is made up of 50,000 different records that describe a transaction of a car sale singularly. Through the utilization of supervised learning and unsupervised learning, we plan to perform an analysis of the dataset. This analysis will facilitate car price prediction via a regression model, as well as cluster pattern identification.

    Single Numerical Input Feature Regression Models We started our work by using the regression model predicting the car price for each numerical input factor like the mileage, a size of the vehicle etc. This is followed by analyzing the associations over variables such as the car's price and numerical factors like the engine size, the vehicle model year, and mileage. The engine size was found to be the variable having the strongest relation to the auto price, which provided evidence that it is the most powerful driver. While a linear model was appropriate for the year of manufacture, other features that were more complicated like engine size needed a non-linear model in order for their interactions and price fluctuations to be accurately detected.

    Multiple Numerical Input Feature Regression Models The analysis was further expanded by incorporating several numeric input parameters while estimating the accuracy of the price predictions. What we reasonably benefited from the usage of extra usages like year of making a car and a number of its kilometers achievement was an improvement of predictive performance in comparison with single-input features models. This holistic approach of studying the many variables that influence car's prices has brought the importance to a limelight of using predictive models by considering many factors simultaneously.

    ** Regression Model with Categorical Variables** To expand our prediction models, we took categorical variables into account and added attributes of manufacturer and model into the regression. This increased the effectiveness of the algorithm theories more roads less traffic intersections construction of roads should take road traffic distribution between roads as well as traffic intersections into account busier streets less traffic less intersections

    ** Artificial Neural Network (ANN) Model**

    To achieve that, we have implemented the Artificial Neural Network (ANN) model. The ANN showed competitive performance in respect to other supervised learning models which can be attributed to its ability to learn even very complex relationships from the dataset. The architecture and hyper parameters of ANN were thoroughly tweaked for the best results in order to demonstrate its flexibility and effectiveness in dealing with complex datasets.

    Model Comparison and Conclusion After comprehensive assessment the Random Forest Regress or model was found to be the most efficient model for forecasting car prices. It’s incorporating both numerical and categorical variables and showing a strong predicting power made it a preferred one. Evaluation metrics and visualizations were given which gave the full picture of the model performance and helped us to arrive at our conclusion that the Random Forests regress or was better.

    k-Means Clustering Algorithm Coming to unsupervised learning, we employed the k-Means clustering algorithm to detect clusters in the car sales dataset. Changing input feature variables space in batches, we determined the number of clusters (k) using evaluation metrics by silhouette score. The variables like engine size, year of manufacture and mileage appeared to be critical in getting the most ideal clusters which emphasized their significance in segmenting the data set. Comparison with Other Clustering Algorithms Lastly, we observed the outcomes of the k-Means clustering technique adding the success of the other clustering techniques, for example, DBSCAN, and hierarchical clustering. Evaluation with metrics of rigorous title of the each method worked we assessed the performance to the dataset effective approach in cluster was identified. Just like k-Means achieved promising results, DBSCAN provided us with a base to be further extended by comparing with other algorithms like DBSCAN and emphasizing that several algorithms should be considered for clustering. Conclusion Finally, our extensive discussion on the sales data for used cars has demonstrated favorable results of supervised as well as unsupervised learning techniques towards understanding the information through regression models and so...

  10. d

    Baselines for the coast of Nantucket, Massachusetts, generated to calculate...

    • catalog.data.gov
    • s.cnmilf.com
    Updated Nov 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Baselines for the coast of Nantucket, Massachusetts, generated to calculate shoreline change rates using the Digital Shoreline Analysis System version 5.1 [Dataset]. https://catalog.data.gov/dataset/baselines-for-the-coast-of-nantucket-massachusetts-generated-to-calculate-shoreline-change
    Explore at:
    Dataset updated
    Nov 26, 2025
    Dataset provided by
    U.S. Geological Survey
    Area covered
    Nantucket, Massachusetts
    Description

    The Massachusetts Office of Coastal Zone Management launched the Shoreline Change Project in 1989 to identify erosion-prone areas of the coast by compiling a database of historical (mid 1800's-1989) shoreline positions. Trends of shoreline position over long and short-term timescales provide information to landowners, managers, and potential buyers about possible future impacts to coastal resources and infrastructure. In 2001, a 1994 shoreline was added to calculate both long- and short-term shoreline change rates along ocean-facing sections of the Massachusetts coast. In 2013, two oceanfront shorelines for Massachusetts were added using 2008-2009 color aerial orthoimagery and 2007 topographic lidar datasets obtained from the National Oceanic and Atmospheric Administration's Ocean Service (NOAA), Coastal Services Center. In 2018, two new mean high water (MHW) shorelines for the Massachusetts coast extracted from lidar data between 2010-2014 were added to the dataset. This 2021 data release includes rates that incorporate one new shoreline from lidar data extracted in 2018 by the U.S. Army Corps of Engineers (USACE) Joint Airborne Lidar Bathymetry Technical Center of Expertise (JALBTCX), added to the existing database of all historical shorelines (1844-2014), for the North Shore, South Shore, Cape Cod Bay, Outer Cape, Buzzard’s Bay, South Cape, Nantucket, and Martha’s Vineyard. 2018 lidar data did not cover the Boston or Elizabeth Islands regions. Included in this data release is a proxy-datum bias reference line that accounts for the positional difference in a proxy shoreline (the High Water Line shoreline) and a datum shoreline (the Mean High Water shoreline). This issue is explained further in Ruggiero and List (2009) and in the process steps of the metadata associated with the rates. This release includes both long-term (~150+ years) and short term (~30 years) rates. Files associated with the long-term rates have “LT” in their names, files associated with short-term rates have "ST” in their names.

  11. m

    Manufacturing, value added (current US$) - United States

    • macro-rankings.com
    csv, excel
    Updated Dec 31, 1997
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    macro-rankings (1997). Manufacturing, value added (current US$) - United States [Dataset]. https://www.macro-rankings.com/united-states/manufacturing-value-added-(current-us$)
    Explore at:
    csv, excelAvailable download formats
    Dataset updated
    Dec 31, 1997
    Dataset authored and provided by
    macro-rankings
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United States
    Description

    Time series data for the statistic Manufacturing, value added (current US$) and country United States. Indicator Definition:Manufacturing refers to industries belonging to ISIC divisions 15-37. Value added is the net output of a sector after adding up all outputs and subtracting intermediate inputs. It is calculated without making deductions for depreciation of fabricated assets or depletion and degradation of natural resources. The origin of value added is determined by the International Standard Industrial Classification (ISIC), revision 3. Data are in current U.S. dollars.The indicator "Manufacturing, value added (current US$)" stands at 2.91 Trillion usd as of 12/31/2024, the highest value at least since 12/31/1998, the period currently displayed. Regarding the One-Year-Change of the series, the current value constitutes an increase of 2.56 percent compared to the value the year prior.The 1 year change in percent is 2.56.The 3 year change in percent is 20.94.The 5 year change in percent is 28.40.The 10 year change in percent is 44.95.The Serie's long term average value is 1.93 Trillion usd. It's latest available value, on 12/31/2024, is 50.91 percent higher, compared to it's long term average value.The Serie's change in percent from it's minimum value, on 12/31/1997, to it's latest available value, on 12/31/2024, is +110.65%.The Serie's change in percent from it's maximum value, on 12/31/2024, to it's latest available value, on 12/31/2024, is 0.0%.

  12. Datasets accompanying "GNSS Geodesy Quantifies Water-Storage Gains and...

    • figshare.com
    zip
    Updated May 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hilary R. Martens; Nicholas Lau; Matthew Swarr; Donald F Argus; Qian Cao; Zachary Young; Adrian Borsa; Ming Pan; Anna Wilson; Ellen Knappe; F Martin Ralph; Payton Gardner (2024). Datasets accompanying "GNSS Geodesy Quantifies Water-Storage Gains and Drought Improvements in California Spurred by Atmospheric Rivers" [Martens et al., 2024, GRL] [Dataset]. http://doi.org/10.6084/m9.figshare.24763929.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 16, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Hilary R. Martens; Nicholas Lau; Matthew Swarr; Donald F Argus; Qian Cao; Zachary Young; Adrian Borsa; Ming Pan; Anna Wilson; Ellen Knappe; F Martin Ralph; Payton Gardner
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    California
    Description

    These datasets accompany a publication in Geophysical Research Letters by Martens et al. (2024), entitled: "GNSS Geodesy Quantifies Water-Storage Gains and Drought Improvements in California Spurred by Atmospheric Rivers." Please refer to the manuscript and supporting information for additional details.Dataset 1: Seasonal Changes in TWS based on the Mean and Median of the Solution SetWe estimate net gains in water storage during the fall and winter of each year (October to March) using the mean TWS solutions from all nine inversion products, subtracting the average storage for October from the average storage for March in the following year. One-sigma standard deviations are computed as the square root of the sum of the variances for October and for March. The variance in each month is computed based on the nine independent estimates of mean monthly storage (see “GNSS Analysis and Inversion” in the Supporting Information).The dataset includes net gains in water storage for both the Sierra Nevada and the SST watersheds (see header lines). For each watershed, results are provided in units of volume (km3) and in units of equivalent water height (mm). Furthermore, for each watershed, we also provide the total storage gains based on non-detrended and linearly detrended time series. In columns four and five, respectively, we provide estimates of snow water equivalent (SWE) from SNODAS (National Operational Hydrologic Remote Sensing Center, 2023) and water-storage changes in surface reservoirs from CDEC (California Data Exchange Center, 2023). In the final column, we provide estimates of net gains in subsurface storage (soil moisture plus groundwater), which are computed by subtracting SWE and reservoir storage from total storage.For each data block, the columns are: (1) time period (October of the starting year to March of the following year); (2) average gain in total water storage constrained by nine inversions of GNSS data; (3) one-sigma standard deviation in the average gain in total water storage; (4) gain in snow water equivalent, computed by subtracting the average snow storage in October from the average snow storage in March of the following year; (5) gain in reservoir storage (CDEC database; within the boundaries of each watershed), computed by subtracting the average reservoir storage in October from the average reservoir storage in March of the following year; and (6) average gain in subsurface water storage, estimated as the average gain in total water storage minus the average gain in snow storage minus the gain in reservoir storage.For the period from October 2022 to March 2023, we also compute mean gains in total water storage using daily estimates of TWS. Here, we subtract the average storage for the first week in October 2022 (1-7 October) from the average storage for the last week in March 2023 (26 March – 1 April). The one-sigma standard deviation is computed as the square root of the sum of the variances for the first week in October and the last week in March. The variance in each week is computed based on the nine independent estimates of daily storage over seven days (63 values per week). The storage gains for 2022-2023 computed using these methods are distinguished in the datafile by an asterisk (2022-2023*; final row in each data section).Dataset 1a provides estimates of storage changes based on the mean and standard deviation of the solution set. Dataset 1b provides estimates of storage changes based on the median and inter-quartile range of the solution set.Dataset 2: Estimated Changes in TWS in the Sierra NevadaChanges in TWS (units of volume: km3) in the Sierra Nevada watersheds. The first column represents the date (YYYY-MM-DD). For monthly solutions, the TWS solutions apply to the month leading up to that date. The remaining nine columns represent each of the nine solutions described in the text. “UM” represents the University of Montana, “SIO” represents the Scripps Institution of Oceanography, and “JPL” represents the Jet Propulsion Laboratory. “NGL” refers to the use of GNSS analysis products from the Nevada Geodetic Laboratory, “CWU” refers to Central Washington University, and “MEaSUREs” refers to the Making Earth System Data Records for Use in Research Environments program. The time series have not been detrended.We highlight that we have added changes in reservoir storage (see Dataset 8) back into the JPL solutions, since reservoir storage had been modeled and removed from the GNSS time series prior to inversion in the JPL workflow (see “Detailed Description of Methods” in the Supporting Information). Thus, the storage values presented here for JPL differ slightly from storage values pulled directly from Dataset 6 and integrated over the area of the Sierra Nevada watersheds.Dataset 3: Estimated Changes in TWS in the Sacramento-San Joaquin-Tulare BasinSame as Dataset 2, except that data apply to the Sacramento-San Joaquin-Tulare (SST) Basin.Dataset 4: Inversion Products (SIO)Inversion solutions (NetCDF format) for TWS changes across the western US from January 2006 through March 2023. The products were produced at the Scripps Institution of Oceanography (SIO) using the methods described in the Supporting Information.Dataset 5: Inversion Products (UM)Inversion solutions (NetCDF format) for TWS changes across the western US from January 2006 through March 2023. The products were produced at the University of Montana (UM) using the methods described in the Supporting Information.Dataset 6: Inversion Products (JPL)Inversion solutions (NetCDF format) for TWS changes across the western US from January 2006 through March 2023. The products were produced at the Jet Propulsion Laboratory (JPL) using the methods described in the Supporting Information.Dataset 7: Lists of Excluded StationsStations are excluded from an inversion for TWS change based on a variety of criteria (detailed in the Supporting Information), including poroelastic behavior, high noise levels, and susceptibility to volcanic deformation. This dataset provides lists of excluded stations from each institution generating inversion products (SIO, UM, JPL).Dataset 8: Lists of Reservoirs and LakesLists of reservoirs and lakes from the California Data Exchange Center (CDEC) (California Data Exchange Center, 2023), which are shown in Figures 1 and 2 of the main manuscript. In the interest of figure clarity, Figure 1 depicts only those reservoirs that exhibited volume changes of at least 0.15 km3 during the first half of WY23.Dataset 8a includes all reservoirs and lakes in California that exhibited volume changes of at least 0.15 km3 between October 2022 and March 2023. The threshold of 0.15 km3 represents a natural break in the distribution of volume changes at all reservoirs and lakes in California over that period (169 reservoirs and lakes in total). Most of the 169 reservoirs and lakes exhibited volume changes near zero km3. Datasets 8b and 8c include subsets of reservoirs and lakes (from Dataset 8a) that fall within the boundaries of the Sierra Nevada and SST watersheds.Furthermore, in the JPL data-processing and inversion workflow (see “Detailed Description of Methods” in the Supporting Information), surface displacements induced by volume changes in select lakes and reservoirs are modeled and removed from GNSS time series prior to inversion. The water-storage changes in the lakes and reservoirs are then added back into the solutions for water storage, derived from the inversion of GNSS data. Dataset 8d includes the list of reservoirs used in the JPL workflow.Dataset 9: Interseismic Strain Accumulation along the Cascadia Subduction ZoneJPL and UM remove interseismic strain accumulation associated with locking of the Cascadia subduction zone using an updated version of the Li et al. model (Li et al., 2018); see Supporting Information Section 2d. The dataset lists the east, north, and up velocity corrections (in the 4th, 5th, and 6th columns of the dataset, respectively) at each station; units are mm/year. The station ID, latitude, and longitude are listed in columns one, two, and three, respectively, of the dataset.Dataset 10: Days Impacted by Atmospheric RiversA list of days impacted by atmospheric rivers within (a) the HUC-2 boundary for California from 1 January 2008 until 1 April 2023 [Dataset 10a] and (b) the Sierra Nevada and SST watersheds from 1 October 2022 until 1 April 2023 [Dataset 10b]. File formats: [decimal year; integrated water-vapor transport (IVT) in kg m-1 s-1; AR category; and calendar date as a two-digit year followed by a three-character month followed by a two-digit day]. The AR category reflects the peak intensity anywhere within the watershed. We use the detection and classification methods of (Ralph et al., 2019; Rutz et al., 2014, 2019). See also Supporting Information Section 2i.Dataset 10c provides a list of days and times when ARs made landfall along the California coast between October 1980 and September 2023, based on the MERRA-2 reanalysis using the methods of (Rutz et al., 2014, 2019). Only coastal grid cells are included. File format: [year, month, day, hour, latitude, longitude, and IVT in kg m-1 s-1]. Values are sorted by time (year, month, day, hour) and then by latitude. See also Supporting Information Section 2g.

  13. D

    Data from: Trait-based projections of climate change effects on global biome...

    • lifesciences.datastations.nl
    csv, txt +2
    Updated Sep 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    C.C.F. Boonman; C.C.F. Boonman (2021). Trait-based projections of climate change effects on global biome distributions [Dataset]. http://doi.org/10.17026/DANS-XF5-QDXD
    Explore at:
    type/x-r-syntax(7607), type/x-r-syntax(13178), type/x-r-syntax(7368), zip(16506), csv(6537488), txt(3732), type/x-r-syntax(19866)Available download formats
    Dataset updated
    Sep 7, 2021
    Dataset provided by
    DANS Data Station Life Sciences
    Authors
    C.C.F. Boonman; C.C.F. Boonman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Climate change will likely modify the global distribution of biomes, but the magnitude of change is debated. In the paper belonging to this dataset, we followed a trait-based, statistical approach to model the influence of climate change on the global distribution of biomes.Step 1. We predicted the global distribution of plant community mean specific leaf area (SLA), height, and wood density as a function of climate and soil characteristics using an ensemble of statistical models.Step 2. We predicted the probability of occurrence of biomes as a function of the three traits with a classification model.Step 3. We projected changes in plant community mean traits and corresponding changes in biome distributions to 2070 for low (RCP 2.6; +1.2°C) and extreme (RCP 8.5; +3.5°C) future climate change scenarios.Four R scripts are added to this dataset, starting with step two described in the paper: from traits to vegetation types (Traits.to.VegT.R). Input for this file is the csv file: all.trait.predictions.untransformed.csv which includes the predictions for community mean trait values for Specific Leaf Area (SLA), height and wood density, for the current climate and under the predictions under a climate change scenario with RCP2.6 and RCP8.5.

  14. Behavioral Risk Factors: HRQOL

    • kaggle.com
    zip
    Updated Jan 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). Behavioral Risk Factors: HRQOL [Dataset]. https://www.kaggle.com/datasets/thedevastator/behavioral-risk-factors-hrqol/suggestions
    Explore at:
    zip(2247473 bytes)Available download formats
    Dataset updated
    Jan 21, 2023
    Authors
    The Devastator
    Description

    Behavioral Risk Factors: HRQOL

    Analyzing Health-Related Quality of Life in the United States

    By Health [source]

    About this dataset

    The Behavioral Risk Factor Surveillance System (BRFSS) is an annual state-based, telephone survey of adults in the United States. It collects a variety of health-related data, including Health Related Quality of Life (HRQOL). This dataset contains results from the HRQOL survey within a range of locations across the US for the year indicated.

    This dataset includes 14 columns which summarize and quantify different aspects concerning HRQOL topics. The year, location abbreviation, description and geo-location provide background contextual information which help define each row. The question column indicates the response provided to by respondents, while category classifies it into overarching groupings. Additionally there are columns covering sample size and data value attributes such as standard error, unit and type all evidence chipping away at informative insights into how Americans’ quality of life is changing over time — all cleverly presented in this one concise dataset!

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    In order to analyze this dataset, it is important have a good understanding of the columns included in it. The columns provide various pieces of information about the data such as year collected, location abbreviation, location name and type of data value collected. Furthermore, understanding what each column means is essential for proper interpretation and analysis; for example knowing that ‘Data_Value %’ indicates what percentage responded a certain way or that ‘Sample_Size’ shows how many people were surveyed can help you make better decisions when looking at patterns within the data set.

    Once you understand the general structure behind this dataset one should also familiarize themselves with some basic statistical analysis tools such as mean/median/mode calculations comparative/correlative analysis so they can really gain insights into how health-related quality of life affects different populations across countries or regions.. To get even more meaningful results you might also want to consider adding other variables or datasets into your report that correlate with HRQOL - like poverty rate or average income level - so you can make clearer conclusions about potential contributing factors towards certain insights you uncover while using this dataset alone.

    Research Ideas

    • Identifying trends between geolocation and health-related quality of life indicators to better understand how environmental factors may impact specific communities.
    • Visualizing the correlations between health-related quality of life variables across different locations over time to gain insights on potential driving developmental or environmental issues.
    • Monitoring the effects of public health initiatives dealing with qualitative health data such as those conducted by CDC, Department of Health and Human Services, and other organizations by tracking changes in different aspects of HRQOL measures over time across multiple locations

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    See the dataset description for more information.

    Columns

    File: rows.csv | Column name | Description | |:-------------------------------|:-------------------------------------------------------------------------------------------------------------| | Year | Year when the data was collected. (Integer) | | LocationAbbr | Abbreviations of various locations where data was recorded. (String) | | LocationDesc | Full names of states whose records are included in this survey. (String) | | Category | Particular topic chosen for research such as “Healthy People 2010 Topics” or “Older Adults Issues”. (String) | | Question | Each question corresponds to metrics tracked within each topic. (String) | | DataSource | Source from which survey responses were collected. (String) | | Data_Value_Unit | Units taken for recording survey types...

  15. National Hydrography Dataset Plus Version 2.1

    • resilience.climate.gov
    • geodata.colorado.gov
    • +5more
    Updated Aug 16, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Esri (2022). National Hydrography Dataset Plus Version 2.1 [Dataset]. https://resilience.climate.gov/maps/4bd9b6892530404abfe13645fcb5099a
    Explore at:
    Dataset updated
    Aug 16, 2022
    Dataset authored and provided by
    Esrihttp://esri.com/
    Area covered
    Description

    The National Hydrography Dataset Plus (NHDplus) maps the lakes, ponds, streams, rivers and other surface waters of the United States. Created by the US EPA Office of Water and the US Geological Survey, the NHDPlus provides mean annual and monthly flow estimates for rivers and streams. Additional attributes provide connections between features facilitating complicated analyses. For more information on the NHDPlus dataset see the NHDPlus v2 User Guide.Dataset SummaryPhenomenon Mapped: Surface waters and related features of the United States and associated territories not including Alaska.Geographic Extent: The United States not including Alaska, Puerto Rico, Guam, US Virgin Islands, Marshall Islands, Northern Marianas Islands, Palau, Federated States of Micronesia, and American SamoaProjection: Web Mercator Auxiliary Sphere Visible Scale: Visible at all scales but layer draws best at scales larger than 1:1,000,000Source: EPA and USGSUpdate Frequency: There is new new data since this 2019 version, so no updates planned in the futurePublication Date: March 13, 2019Prior to publication, the NHDPlus network and non-network flowline feature classes were combined into a single flowline layer. Similarly, the NHDPlus Area and Waterbody feature classes were merged under a single schema.Attribute fields were added to the flowline and waterbody layers to simplify symbology and enhance the layer's pop-ups. Fields added include Pop-up Title, Pop-up Subtitle, On or Off Network (flowlines only), Esri Symbology (waterbodies only), and Feature Code Description. All other attributes are from the original NHDPlus dataset. No data values -9999 and -9998 were converted to Null values for many of the flowline fields.What can you do with this layer?Feature layers work throughout the ArcGIS system. Generally your work flow with feature layers will begin in ArcGIS Online or ArcGIS Pro. Below are just a few of the things you can do with a feature service in Online and Pro.ArcGIS OnlineAdd this layer to a map in the map viewer. The layer is limited to scales of approximately 1:1,000,000 or larger but a vector tile layer created from the same data can be used at smaller scales to produce a webmap that displays across the full range of scales. The layer or a map containing it can be used in an application. Change the layer’s transparency and set its visibility rangeOpen the layer’s attribute table and make selections. Selections made in the map or table are reflected in the other. Center on selection allows you to zoom to features selected in the map or table and show selected records allows you to view the selected records in the table.Apply filters. For example you can set a filter to show larger streams and rivers using the mean annual flow attribute or the stream order attribute. Change the layer’s style and symbologyAdd labels and set their propertiesCustomize the pop-upUse as an input to the ArcGIS Online analysis tools. This layer works well as a reference layer with the trace downstream and watershed tools. The buffer tool can be used to draw protective boundaries around streams and the extract data tool can be used to create copies of portions of the data.ArcGIS ProAdd this layer to a 2d or 3d map. Use as an input to geoprocessing. For example, copy features allows you to select then export portions of the data to a new feature class. Change the symbology and the attribute field used to symbolize the dataOpen table and make interactive selections with the mapModify the pop-upsApply Definition Queries to create sub-sets of the layerThis layer is part of the ArcGIS Living Atlas of the World that provides an easy way to explore the landscape layers and many other beautiful and authoritative maps on hundreds of topics.Questions?Please leave a comment below if you have a question about this layer, and we will get back to you as soon as possible.

  16. HadUK-Grid Gridded Climate Observations on a 60km grid over the UK,...

    • catalogue.ceda.ac.uk
    Updated Jun 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dan Hollis; Emily Carlisle; Michael Kendon; Stephen Packman; Amy Doherty (2025). HadUK-Grid Gridded Climate Observations on a 60km grid over the UK, v1.3.1.ceda (1836-2024) [Dataset]. https://catalogue.ceda.ac.uk/uuid/3b010220fe184e209462a01efd00d207
    Explore at:
    Dataset updated
    Jun 27, 2025
    Dataset provided by
    Centre for Environmental Data Analysishttp://www.ceda.ac.uk/
    Authors
    Dan Hollis; Emily Carlisle; Michael Kendon; Stephen Packman; Amy Doherty
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Time period covered
    Jan 1, 1836 - Dec 31, 2024
    Area covered
    Variables measured
    time, latitude, area_type, longitude, wind_speed, air_temperature, relative_humidity, surface_temperature, duration_of_sunshine, projection_x_coordinate, and 7 more
    Description

    HadUK-Grid is a collection of gridded climate variables derived from the network of UK land surface observations. The data have been interpolated from meteorological station data onto a uniform grid to provide complete and consistent coverage across the UK. The dataset at 60 km resolution is derived from the associated 1 km x 1 km resolution to allow for comparison to data from UKCP18 climate projections. The dataset spans the period from 1836 to 2024, but the start time is dependent on climate variable and temporal resolution.

    The gridded data are produced for daily, monthly, seasonal and annual timescales, as well as long term averages for a set of climatological reference periods. Variables include air temperature (maximum, minimum and mean), precipitation, sunshine, mean sea level pressure, wind speed, relative humidity, vapour pressure, days of snow lying, and days of ground frost.

    This data set supersedes the previous versions of this dataset which also superseded UKCP09 gridded observations. Subsequent versions may be released in due course and will follow the version numbering as outlined by Hollis et al. (2019, see linked documentation).

    The changes for v1.3.1.ceda HadUK-Grid datasets are as follows:

    Changes to the dataset * Added data for calendar year 2024 * Extended the daily temperature grids back to 1931

    Changes to the input data * Incorporated additional daily rainfall data for 60 sites in Scotland, 1922-45 * Incorporated additional monthly rainfall data for two sites - Westonbirt (1880-1951) & Ackworth School (1852-53) * Fixed a 1-day offset for sunshine duration values for six stations between 1971 and 1993 * Corrected the daily rainfall data for Macclesfield, 1958-60 (the values had been stored in the wrong units) * Improved the quality control of the most recent three months of rainfall data (Oct-Dec 2024) * Removed Corpach from the wind speed grids (the station is poorly modelled - this only affects 14 months) * Reviewed the quality control flags that had been applied automatically to historical air and grass minimum temperature data. In many cases it was possible to remove the flags and this has allowed us to incorporate additional data into the grids for 1961-1997 for these variables. * Improved the business logic relating to data completeness. This affects monthly wind speed and has allowed us to re-introduce some of the data that were excluded in the previous release.

    • Net changes to the input station data:
      • Total of 131314637 observations
      • 126821432 (96.6%) unchanged
      • 105327 (0.08%) modified for this version
      • 4387878 (3.34%) added in this version
      • 44224 (0.03%) deleted from this version

    The primary purpose of these data are to facilitate monitoring of UK climate and research into climate change, impacts and adaptation. The datasets have been created by the Met Office with financial support from the Department for Business, Energy and Industrial Strategy (BEIS) and Department for Environment, Food and Rural Affairs (DEFRA) in order to support the Public Weather Service Customer Group (PWSCG), the Hadley Centre Climate Programme, and the UK Climate Projections (UKCP18) project. The output from a number of data recovery activities relating to 19th and early 20th Century data have been used in the creation of this dataset, these activities were supported by: the Met Office Hadley Centre Climate Programme; the Natural Environment Research Council project "Analysis of historic drought and water scarcity in the UK"; the UK Research & Innovation (UKRI) Strategic Priorities Fund UK Climate Resilience programme; The UK Natural Environment Research Council (NERC) Public Engagement programme; the National Centre for Atmospheric Science; National Centre for Atmospheric Science and the NERC GloSAT project; and the contribution of many thousands of public volunteers. The dataset is provided under Open Government Licence.

  17. National Hydrography Dataset Plus High Resolution

    • oregonwaterdata.org
    • dangermondpreserve-tnc.hub.arcgis.com
    • +1more
    Updated Mar 16, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Esri (2023). National Hydrography Dataset Plus High Resolution [Dataset]. https://www.oregonwaterdata.org/maps/f1f45a3ba37a4f03a5f48d7454e4b654
    Explore at:
    Dataset updated
    Mar 16, 2023
    Dataset authored and provided by
    Esrihttp://esri.com/
    Area covered
    Description

    The National Hydrography Dataset Plus High Resolution (NHDplus High Resolution) maps the lakes, ponds, streams, rivers and other surface waters of the United States. Created by the US Geological Survey, NHDPlus High Resolution provides mean annual flow and velocity estimates for rivers and streams. Additional attributes provide connections between features facilitating complicated analyses.For more information on the NHDPlus High Resolution dataset see the User’s Guide for the National Hydrography Dataset Plus (NHDPlus) High Resolution.Dataset SummaryPhenomenon Mapped: Surface waters and related features of the United States and associated territoriesGeographic Extent: The Contiguous United States, Hawaii, portions of Alaska, Puerto Rico, Guam, US Virgin Islands, Northern Marianas Islands, and American SamoaProjection: Web Mercator Auxiliary Sphere Visible Scale: Visible at all scales but layer draws best at scales larger than 1:1,000,000Source: USGSUpdate Frequency: AnnualPublication Date: July 2022This layer was symbolized in the ArcGIS Map Viewer and while the features will draw in the Classic Map Viewer the advanced symbology will not. Prior to publication, the network and non-network flowline feature classes were combined into a single flowline layer. Similarly, the Area and Waterbody feature classes were merged under a single schema.Attribute fields were added to the flowline and waterbody layers to simplify symbology and enhance the layer's pop-ups. Fields added include Pop-up Title, Pop-up Subtitle, Esri Symbology (waterbodies only), and Feature Code Description. All other attributes are from the original dataset. No data values -9999 and -9998 were converted to Null values.What can you do with this layer?Feature layers work throughout the ArcGIS system. Generally your work flow with feature layers will begin in ArcGIS Online or ArcGIS Pro. Below are just a few of the things you can do with a feature service in Online and Pro.ArcGIS OnlineAdd this layer to a map in the map viewer. The layer or a map containing it can be used in an application. Change the layer’s transparency and set its visibility rangeOpen the layer’s attribute table and make selections. Selections made in the map or table are reflected in the other. Center on selection allows you to zoom to features selected in the map or table and show selected records allows you to view the selected records in the table.Apply filters. For example you can set a filter to show larger streams and rivers using the mean annual flow attribute or the stream order attribute.Change the layer’s style and symbologyAdd labels and set their propertiesCustomize the pop-upUse as an input to the ArcGIS Online analysis tools. This layer works well as a reference layer with the trace downstream and watershed tools. The buffer tool can be used to draw protective boundaries around streams and the extract data tool can be used to create copies of portions of the data.ArcGIS ProAdd this layer to a 2d or 3d map.Use as an input to geoprocessing. For example, copy features allows you to select then export portions of the data to a new feature class.Change the symbology and the attribute field used to symbolize the dataOpen table and make interactive selections with the mapModify the pop-upsApply Definition Queries to create sub-sets of the layerThis layer is part of the ArcGIS Living Atlas of the World that provides an easy way to explore the landscape layers and many other beautiful and authoritative maps on hundreds of topics.Questions?Please leave a comment below if you have a question about this layer, and we will get back to you as soon as possible.

  18. m

    Services, value added (current US$) - Georgia

    • macro-rankings.com
    csv, excel
    Updated Sep 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    macro-rankings (2025). Services, value added (current US$) - Georgia [Dataset]. https://www.macro-rankings.com/georgia/services-value-added-(current-us$)
    Explore at:
    excel, csvAvailable download formats
    Dataset updated
    Sep 10, 2025
    Dataset authored and provided by
    macro-rankings
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Georgia
    Description

    Time series data for the statistic Services, value added (current US$) and country Georgia. Indicator Definition:Services correspond to ISIC divisions 50-99. They include value added in wholesale and retail trade (including hotels and restaurants), transport, and government, financial, professional, and personal services such as education, health care, and real estate services. Also included are imputed bank service charges and import duties. Value added is the net output of a sector after adding up all outputs and subtracting intermediate inputs. It is calculated without making deductions for depreciation of fabricated assets or depletion and degradation of natural resources. The industrial origin of value added is determined by the International Standard Industrial Classification (ISIC), revision 3. Data are in current U.S. dollars.The indicator "Services, value added (current US$)" stands at 21.21 Billion usd as of 12/31/2024, the highest value at least since 12/31/1988, the period currently displayed. Regarding the One-Year-Change of the series, the current value constitutes an increase of 11.05 percent compared to the value the year prior.The 1 year change in percent is 11.05.The 3 year change in percent is 91.66.The 5 year change in percent is 100.08.The 10 year change in percent is 93.06.The Serie's long term average value is 6.62 Billion usd. It's latest available value, on 12/31/2024, is 220.37 percent higher, compared to it's long term average value.The Serie's change in percent from it's minimum value, on 12/31/1992, to it's latest available value, on 12/31/2024, is +2,407.34%.The Serie's change in percent from it's maximum value, on 12/31/2024, to it's latest available value, on 12/31/2024, is 0.0%.

  19. m

    Services, value added (current US$) - China

    • macro-rankings.com
    csv, excel
    Updated Sep 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    macro-rankings (2025). Services, value added (current US$) - China [Dataset]. https://www.macro-rankings.com/china/services-value-added-(current-us$)
    Explore at:
    excel, csvAvailable download formats
    Dataset updated
    Sep 27, 2025
    Dataset authored and provided by
    macro-rankings
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description

    Time series data for the statistic Services, value added (current US$) and country China. Indicator Definition:Services correspond to ISIC divisions 50-99. They include value added in wholesale and retail trade (including hotels and restaurants), transport, and government, financial, professional, and personal services such as education, health care, and real estate services. Also included are imputed bank service charges and import duties. Value added is the net output of a sector after adding up all outputs and subtracting intermediate inputs. It is calculated without making deductions for depreciation of fabricated assets or depletion and degradation of natural resources. The industrial origin of value added is determined by the International Standard Industrial Classification (ISIC), revision 3. Data are in current U.S. dollars.The indicator "Services, value added (current US$)" stands at 10.64 Trillion usd as of 12/31/2024, the highest value at least since 12/31/1961, the period currently displayed. Regarding the One-Year-Change of the series, the current value constitutes an increase of 3.34 percent compared to the value the year prior.The 1 year change in percent is 3.34.The 3 year change in percent is 6.60.The 5 year change in percent is 32.27.The 10 year change in percent is 102.39.The Serie's long term average value is 1.83 Trillion usd. It's latest available value, on 12/31/2024, is 481.20 percent higher, compared to it's long term average value.The Serie's change in percent from it's minimum value, on 12/31/1963, to it's latest available value, on 12/31/2024, is +76,781.28%.The Serie's change in percent from it's maximum value, on 12/31/2024, to it's latest available value, on 12/31/2024, is 0.0%.

  20. e

    A global database of long-term changes in insect assemblages

    • knb.ecoinformatics.org
    • dataone.org
    • +4more
    Updated Jan 26, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Roel van Klink; Diana E. Bowler; Jonathan M. Chase; Orr Comay; Michael M. Driessen; S.K. Morgan Ernest; Alessandro Gentile; Francis Gilbert; Konstantin Gongalsky; Jennifer Owen; Guy Pe'er; Israel Pe'er; Vincent H. Resh; Ilia Rochlin; Sebastian Schuch; Ann E. Swengel; Scott R. Swengel; Thomas L. Valone; Rikjan Vermeulen; Tyson Wepprich; Jerome Wiedmann (2022). A global database of long-term changes in insect assemblages [Dataset]. http://doi.org/10.5063/F1ZC817H
    Explore at:
    Dataset updated
    Jan 26, 2022
    Dataset provided by
    Knowledge Network for Biocomplexity
    Authors
    Roel van Klink; Diana E. Bowler; Jonathan M. Chase; Orr Comay; Michael M. Driessen; S.K. Morgan Ernest; Alessandro Gentile; Francis Gilbert; Konstantin Gongalsky; Jennifer Owen; Guy Pe'er; Israel Pe'er; Vincent H. Resh; Ilia Rochlin; Sebastian Schuch; Ann E. Swengel; Scott R. Swengel; Thomas L. Valone; Rikjan Vermeulen; Tyson Wepprich; Jerome Wiedmann
    Time period covered
    Jan 1, 1925 - Jan 1, 2018
    Area covered
    Variables measured
    End, Link, Year, Realm, Start, CRUmnC, CRUmnK, Metric, Number, Period, and 63 more
    Description

    UPDATED on October 15 2020 After some mistakes in some of the data were found, we updated this data set. The changes to the data are detailed on Zenodo (http://doi.org/10.5281/zenodo.4061807), and an Erratum has been submitted. This data set under CC-BY license contains time series of total abundance and/or biomass of assemblages of insect, arachnid and Entognatha assemblages (grouped at the family level or higher taxonomic resolution), monitored by standardized means for ten or more years. The data were derived from 165 data sources, representing a total of 1668 sites from 41 countries. The time series for abundance and biomass represent the aggregated number of all individuals of all taxa monitored at each site. The data set consists of four linked tables, representing information on the study level, the plot level, about sampling, and the measured assemblage sizes. all references to the original data sources can be found in the pdf with references, and a Google Earth file (kml) file presents the locations (including metadata) of all datasets. When using (parts of) this data set, please respect the original open access licenses. This data set underlies all analyses performed in the paper 'Meta-analysis reveals declines in terrestrial, but increases in freshwater insect abundances', a meta-analysis of changes in insect assemblage sizes, and is accompanied by a data paper entitled 'InsectChange – a global database of temporal changes in insect and arachnid assemblages'. Consulting the data paper before use is recommended. Tables that can be used to calculate trends of specific taxa and for species richness will be added as they become available. The data set consists of four tables that are linked by the columns 'DataSource_ID'. and 'Plot_ID', and a table with references to original research. In the table 'DataSources', descriptive data is provided at the dataset level: Links are provided to online repositories where the original data can be found, it describes whether the dataset provides data on biomass, abundance or both, the invertebrate group under study, the realm, and describes the location of sampling at different geographic scales (continent to state). This table also contains a reference column. The full reference to the original data is found in the file 'References_to_original_data_sources.pdf'. In the table 'PlotData' more details on each site within each dataset are provided: there is data on the exact location of each plot, whether the plots were experimentally manipulated, and if there was any spatial grouping of sites (column 'Location'). Additionally, this table contains all explanatory variables used for analysis, e.g. climate change variables, land-use variables, protection status. The table 'SampleData' describes the exact source of the data (table X, figure X, etc), the extraction methods, as well as the sampling methods (derived from the original publications). This includes the sampling method, sampling area, sample size, and how the aggregation of samples was done, if reported. Also, any calculations we did on the original data (e.g. reverse log transformations) are detailed here, but more details are provided in the data paper. This table links to the table 'DataSources' by the column 'DataSource_ID'. Note that each datasource may contain multiple entries in the 'SampleData' table if the data were presented in different figures or tables, or if there was any other necessity to split information on sampling details. The table 'InsectAbundanceBiomassData' provides the insect abundance or biomass numbers as analysed in the paper. It contains columns matching to the tables 'DataSources' and 'PlotData', as well as year of sampling, a descriptor of the period within the year of sampling (this was used as a random effect), the unit in which the number is reported (abundance or biomass), and the estimated abundance or biomass. In the column for Number, missing data are included (NA). The years with missing data were added because this was essential for the analysis performed, and retained here because they are easier to remove than to add. Linking the table 'InsectAbundanceBiomassData.csv' with 'PlotData.csv' by column 'Plot_ID', and with 'DataSources.csv' by column 'DataSource_ID' will provide the full dataframe used for all analyses. Detailed explanations of all column headers and terms are available in the ReadMe file, and more details will be available in the forthcoming data paper. WARNING: Because of the disparate sampling methods and various spatial and temporal scales used to collect the original data, this dataset should never be used to test for differences in insect abundance/biomass among locations (i.e. differences in intercept). The data can only be used to study temporal trends, by testing for differences in slopes. The data are standardized within plots to allow the temporal comparison, but not necessarily among plots (even within one dataset).

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
(2025). Open data change log [Dataset]. https://opendata.vancouver.ca/explore/dataset/open-data-change-log/

Open data change log

Explore at:
csv, json, excelAvailable download formats
Dataset updated
Nov 18, 2025
License

https://opendata.vancouver.ca/pages/licence/https://opendata.vancouver.ca/pages/licence/

Description

​Significant changes to the open data catalogue, including new datasets added, datasets renamed or retired, quarterly or annual updates to high-impact datasets, changes to data structure or definition. Smaller changes, such as adding or editing records or renaming a field in an existing dataset are not included. NoteThis log is published in the interest of transparency into the work of the open data program. You can subscribe to updates for a specific dataset by creating an account on the portal then clicking on the Follow button on the Information tab of any dataset. You can get updates by subscribing to our email newsletter. Data currency​New records will be added whenever a significant change is made to the open data catalogue.

Search
Clear search
Close search
Google apps
Main menu