36 datasets found
  1. Geospatial Data Pack for Visualization

    • kaggle.com
    zip
    Updated Oct 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vega Datasets (2025). Geospatial Data Pack for Visualization [Dataset]. https://www.kaggle.com/datasets/vega-datasets/geospatial-data-pack
    Explore at:
    zip(1422109 bytes)Available download formats
    Dataset updated
    Oct 21, 2025
    Dataset authored and provided by
    Vega Datasets
    Description

    Geospatial Data Pack for Visualization 🗺️

    Learn Geographic Mapping with Altair, Vega-Lite and Vega using Curated Datasets

    Complete geographic and geophysical data collection for mapping and visualization. This consolidation includes 18 complementary datasets used by 31+ Vega, Vega-Lite, and Altair examples 📊. Perfect for learning geographic visualization techniques including projections, choropleths, point maps, vector fields, and interactive displays.

    Source data lives on GitHub and can also be accessed via CDN. The vega-datasets project serves as a common repository for example datasets used across these visualization libraries and related projects.

    Why Use This Dataset? 🤔

    • Comprehensive Geospatial Types: Explore a variety of core geospatial data models:
      • Vector Data: Includes points (like airports.csv), lines (like londonTubeLines.json), and polygons (like us-10m.json).
      • Raster-like Data: Work with gridded datasets (like windvectors.csv, annual-precip.json).
    • Diverse Formats: Gain experience with standard and efficient geospatial formats like GeoJSON (see Table 1, 2, 4), compressed TopoJSON (see Table 1), and plain CSV/TSV (see Table 2, 3, 4) for point data and attribute tables ready for joining.
    • Multi-Scale Coverage: Practice visualization across different geographic scales, from global and national (Table 1, 4) down to the city level (Table 1).
    • Rich Thematic Mapping: Includes multiple datasets (Table 3) specifically designed for joining attributes to geographic boundaries (like states or counties from Table 1) to create insightful choropleth maps.
    • Ready-to-Use & Example-Driven: Cleaned datasets tightly integrated with 31+ official examples (see Appendix) from Altair, Vega-Lite, and Vega, allowing you to immediately practice techniques like projections, point maps, network maps, and interactive displays.
    • Python Friendly: Works seamlessly with essential Python libraries like Altair (which can directly read TopoJSON/GeoJSON), Pandas, and GeoPandas, fitting perfectly into the Kaggle notebook environment.

    Table of Contents

    Dataset Inventory 🗂️

    This pack includes 18 datasets covering base maps, reference points, statistical data for choropleths, and geophysical data.

    1. BASE MAP BOUNDARIES (Topological Data)

    DatasetFileSizeFormatLicenseDescriptionKey Fields / Join Info
    US Map (1:10m)us-10m.json627 KBTopoJSONCC-BY-4.0US state and county boundaries. Contains states and counties objects. Ideal for choropleths.id (FIPS code) property on geometries
    World Map (1:110m)world-110m.json117 KBTopoJSONCC-BY-4.0World country boundaries. Contains countries object. Suitable for world-scale viz.id property on geometries
    London BoroughslondonBoroughs.json14 KBTopoJSONCC-BY-4.0London borough boundaries.properties.BOROUGHN (name)
    London CentroidslondonCentroids.json2 KBGeoJSONCC-BY-4.0Center points for London boroughs.properties.id, properties.name
    London Tube LineslondonTubeLines.json78 KBGeoJSONCC-BY-4.0London Underground network lines.properties.name, properties.color

    2. GEOGRAPHIC REFERENCE POINTS (Point Data) 📍

    DatasetFileSizeFormatLicenseDescriptionKey Fields / Join Info
    US Airportsairports.csv205 KBCSVPublic DomainUS airports with codes and coordinates.iata, state, `l...
  2. Data from: How can appropriate hue ranges be selected for sequential color...

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    xlsx
    Updated Apr 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Taisheng Chen; Xi Lv; Kun Hu; Menglin Chen; Lu Cheng; Weixing Jiang (2024). How can appropriate hue ranges be selected for sequential color schemes on choropleth maps? A quantitative evaluation using map-reading experiments [Dataset]. http://doi.org/10.6084/m9.figshare.25572387.v3
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Apr 10, 2024
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Taisheng Chen; Xi Lv; Kun Hu; Menglin Chen; Lu Cheng; Weixing Jiang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We recruited 414 college students to participate in the experiment. Through the experiment, we collected their visual data and arranged them according to different visual indicators. Then we process our data through qualitative and quantitative analysis to get the final result.

  3. USA states GeoJson

    • kaggle.com
    zip
    Updated Aug 18, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kate Gallo (2020). USA states GeoJson [Dataset]. https://www.kaggle.com/pompelmo/usa-states-geojson
    Explore at:
    zip(30298 bytes)Available download formats
    Dataset updated
    Aug 18, 2020
    Authors
    Kate Gallo
    Area covered
    United States
    Description

    Context

    I created a dataset to help people create choropleth maps of United States states.

    Content

    One geojson to plot the countries borders, and one csv from the Census Bureau for the us population per state.

    Inspiration

    I think the best way to use this dataset is in joining it with other data. For example, I used this dataset to plot police killings using the data from https://www.kaggle.com/jpmiller/police-violence-in-the-us

  4. Create your own mapping templates - Excel Add-In

    • ckan.publishing.service.gov.uk
    Updated Jun 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2025). Create your own mapping templates - Excel Add-In [Dataset]. https://ckan.publishing.service.gov.uk/dataset/create-your-own-mapping-templates-excel-add-in
    Explore at:
    Dataset updated
    Jun 9, 2025
    Dataset provided by
    CKANhttps://ckan.org/
    Description

    With this add in it is possible to create map templates from GIS files in KML format, and create choropleths with them. Providing you have access to KML format map boundary files, it is possible to create your own quick and easy choropleth maps in Excel. The KML format files can be converted from 'shape' files. Many shape files are available to download for free from the web, including from Ordnance Survey and the London Datastore. Standard mapping packages such as QGIS (free to download) and ArcGIS can convert the files to KML format. A sample of a KML file (London wards) can be downloaded from this page, so that users can easily test the tool out. Macros must be enabled for the tool to function. When creating the map using the Excel tool, the 'unique ID' should normally be the area code, the 'Name' should be the area name and then if required and there is additional data in the KML file, further 'data' fields can be added. These columns will appear below and to the right of the map. If not, data can be added later on next to the codes and names. In the add-in version of the tool the final control, 'Scale (% window)' should not normally be changed. With the default value 0.5, the height of the map is set to be half the total size of the user's Excel window. To run a choropleth, select the menu option 'Run Choropleth' to get this form. To specify the colour ramp for the choropleth, the user needs to enter the number of boxes into which the range is to be divided, and the colours for the high and low ends of the range, which is done by selecting coloured option boxes as appropriate. If wished, hit the 'Swap' button to change which colours are for the different ends of the range. Then hit the 'Choropleth' button. The default options for the colours of the ends of the choropleth colour range are saved in the add in, but different values can be selected but setting up a column range of up to twelve cells, anywhere in Excel, filled with the option colours wanted. Then use the 'Colour range' control to select this range, and hit apply, having selected high or low values as wished. The button 'Copy' sets up a sheet 'ColourRamp' in the active workbook with the default colours, which can just be extended or deleted with just a few cells, so saving the user time. The add-in was developed entirely within the Excel VBA IDE by Tim Lund. He is kindly distributing the tool for free on the Datastore but suggests that users who find the tool useful make a donation to the Shelter charity. It is not intended to keep the actively maintained, but if any users or developers would like to add more features, email the author. Acknowledgments Calculation of Excel freeform shapes from latitudes and longitudes is done using calculations from the Ordnance Survey.

  5. Natural Earth 1:110m Countries

    • kaggle.com
    zip
    Updated Mar 14, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anton Poznyakovskiy (2020). Natural Earth 1:110m Countries [Dataset]. https://www.kaggle.com/datasets/poznyakovskiy/natural-earth-1110m-countries
    Explore at:
    zip(197544 bytes)Available download formats
    Dataset updated
    Mar 14, 2020
    Authors
    Anton Poznyakovskiy
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset contains geometry data for the countries of the world together with their names and country codes in various formats. The primary use case is choropleths, color-coded maps. The data can be read as a pandas DataFrame with geopandas and plotted with matplotlib. See the starter notebook for an example how to do it.

    The data was created by Natural Earth. It is in public domain and free to use for any purpose at the time of this writing; you might want to check their Terms of Use.

    Photo by KOBU Agency on Unsplash

  6. Mapping 2021 Census Data using the Living Atlas

    • teachwithgis.co.uk
    • lecture-with-gis-esriukeducation.hub.arcgis.com
    Updated Apr 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Esri UK Education (2025). Mapping 2021 Census Data using the Living Atlas [Dataset]. https://teachwithgis.co.uk/datasets/mapping-2021-census-data-using-the-living-atlas
    Explore at:
    Dataset updated
    Apr 30, 2025
    Dataset provided by
    Esrihttp://esri.com/
    Authors
    Esri UK Education
    Description

    Anyone who has taught GIS using Census Data knows it is an invaluable data set for showing students how to take data stored in a table and join it to boundary data to transform this data into something that can be visualised and analysed spatially. Joins are a core GIS skill and need to be learnt, as not every data set is going to come neatly packaged as a shapefile or feature layer with all the data you need stored within. I don't know how many times I taught students to download data as a table from Nomis, load it into a GIS and then join that table data to the appropriate boundary data so they could produce choropleth maps to do some visual analysis, but it was a lot! Once students had gotten the hang of joins using census data they'd often ask why this data doesn't exist as a prepackaged feature layer with all the data they wanted within it. Well good news, now a lot off it is and it's accessible through the Living Atlas! Don't get me wrong I fully understand the importance of teaching students how to perform joins but once you have this understanding if you can access data that already contains all the information you need then you should be taking advantage of it to save you time. So in this exercise I am going to show you how to load English and Welsh Census Data from the 2021 Census into the ArcGIS Map Viewer from the Living Atlas and produce some choropleth maps to use to perform visual analysis without having to perform a single join.

  7. NYC zipcode geodata

    • kaggle.com
    zip
    Updated Sep 23, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Saidakbarp (2019). NYC zipcode geodata [Dataset]. https://www.kaggle.com/saidakbarp/nyc-zipcode-geodata
    Explore at:
    zip(552766 bytes)Available download formats
    Dataset updated
    Sep 23, 2019
    Authors
    Saidakbarp
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    New York
    Description

    Context

    I used this publicly available data for making interactive map visualization of NYC. Zipcode geodata is useful for building interactive maps with each zip code area representing a separate area on the map.

    Content

    NYC zipcode geodata in geojson format

    Acknowledgements

    The rights belong to the original authors.

  8. List of sociodemographic variables used in PCA analysis to create new...

    • plos.figshare.com
    xls
    Updated Jun 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Julia Elizabeth Isaacson; Jinny Jing Ye; Lincoln Luís Silva; Thiago Augusto Hernandes Rocha; Luciano de Andrade; Joao Felipe Hermann Costa Scheidt; Fan Hui Wen; Jacqueline Sachett; Wuelton Marcelo Monteiro; Catherine Ann Staton; Joao Ricardo Nickenig Vissoci; Charles John Gerardo (2023). List of sociodemographic variables used in PCA analysis to create new indicators for spatial analysis. [Dataset]. http://doi.org/10.1371/journal.pntd.0011305.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Julia Elizabeth Isaacson; Jinny Jing Ye; Lincoln Luís Silva; Thiago Augusto Hernandes Rocha; Luciano de Andrade; Joao Felipe Hermann Costa Scheidt; Fan Hui Wen; Jacqueline Sachett; Wuelton Marcelo Monteiro; Catherine Ann Staton; Joao Ricardo Nickenig Vissoci; Charles John Gerardo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    List of sociodemographic variables used in PCA analysis to create new indicators for spatial analysis.

  9. d

    How to select appropriate hue ranges for sequential color schemes on...

    • datadryad.org
    • data.niaid.nih.gov
    • +1more
    zip
    Updated Aug 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tai sheng Chen; Xi Lv; Kun Hu; Meng lin Chen; Lu Cheng; Wei xing Jiang (2023). How to select appropriate hue ranges for sequential color schemes on choropleth maps? A quantitative evaluation using map reading experiments [Dataset]. http://doi.org/10.5061/dryad.c59zw3rdt
    Explore at:
    zipAvailable download formats
    Dataset updated
    Aug 2, 2023
    Dataset provided by
    Dryad
    Authors
    Tai sheng Chen; Xi Lv; Kun Hu; Meng lin Chen; Lu Cheng; Wei xing Jiang
    Time period covered
    Jul 26, 2023
    Description

    These datasets are map cognitive data collected in experiments and processed through programs. The README file provides relevant information.

    Datasets included:

    File

    Description

    Controlled experiments-web-based survey.xlsx Web survey data for a controlled experiment, where the map data is randomly generated simulation data

    Controlled experiments-eye-tracking.xlsx Eye movement data from a controlled experiment, which added an eye tracker to record eye movement data

    Real-map experiments-web-based survey.xlsx Real experiment web survey data, the experimental map data is real data

    Real-map experiments-eye-tracking.xlsx Eye movement data from real experiments

    Correlation and significance test results.xlsx Correlation significance parameters of various experimental data

    Blank spaces in the table indicate no data.All tables in the dataset can be opened using Office 2010 and above versions.

  10. d

    Data from: CrimeMapTutorial Workbooks and Sample Data for ArcView and...

    • catalog.data.gov
    • icpsr.umich.edu
    • +1more
    Updated Nov 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Justice (2025). CrimeMapTutorial Workbooks and Sample Data for ArcView and MapInfo, 2000 [Dataset]. https://catalog.data.gov/dataset/crimemaptutorial-workbooks-and-sample-data-for-arcview-and-mapinfo-2000-3c9be
    Explore at:
    Dataset updated
    Nov 14, 2025
    Dataset provided by
    National Institute of Justice
    Description

    CrimeMapTutorial is a step-by-step tutorial for learning crime mapping using ArcView GIS or MapInfo Professional GIS. It was designed to give users a thorough introduction to most of the knowledge and skills needed to produce daily maps and spatial data queries that uniformed officers and detectives find valuable for crime prevention and enforcement. The tutorials can be used either for self-learning or in a laboratory setting. The geographic information system (GIS) and police data were supplied by the Rochester, New York, Police Department. For each mapping software package, there are three PDF tutorial workbooks and one WinZip archive containing sample data and maps. Workbook 1 was designed for GIS users who want to learn how to use a crime-mapping GIS and how to generate maps and data queries. Workbook 2 was created to assist data preparers in processing police data for use in a GIS. This includes address-matching of police incidents to place them on pin maps and aggregating crime counts by areas (like car beats) to produce area or choropleth maps. Workbook 3 was designed for map makers who want to learn how to construct useful crime maps, given police data that have already been address-matched and preprocessed by data preparers. It is estimated that the three tutorials take approximately six hours to complete in total, including exercises.

  11. FOLIUM_INDIA

    • kaggle.com
    zip
    Updated Jun 15, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    KD007 (2020). FOLIUM_INDIA [Dataset]. https://www.kaggle.com/krishcross/india-shape-map
    Explore at:
    zip(16183750 bytes)Available download formats
    Dataset updated
    Jun 15, 2020
    Authors
    KD007
    Area covered
    India
    Description

    Folium makes it easy to visualize data that’s been manipulated in Python on an interactive leaflet map. It enables both the binding of data to a map for choropleth visualizations as well as passing rich vector/raster/HTML visualizations as markers on the map. These files can be used to mark the state boundaries on the map of INDIA using folium library and the CSV also contains the state data and how to use it in our notebooks. I have used it in one of my kernels which can be viewed.

    The library has a number of built-in tilesets from OpenStreetMap, Mapbox, and Stamen, and supports custom tilesets with Mapbox or Cloudmade API keys. folium supports both Image, Video, GeoJSON, and TopoJSON overlays. Due to extensible functionalities I find folium the best map plotting library in python. Do give it a try and use it in your kernels.

  12. Population of Cities in Ecuador 2022

    • kaggle.com
    zip
    Updated Nov 13, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kei (2022). Population of Cities in Ecuador 2022 [Dataset]. https://www.kaggle.com/datasets/kokitashiro/population-of-cities-in-ecuador-2022
    Explore at:
    zip(1384 bytes)Available download formats
    Dataset updated
    Nov 13, 2022
    Authors
    Kei
    Area covered
    Ecuador
    Description

    This is dataset which you can find population of Ecuadorian cities in 2022 . The data downloaded from this website. In my case, I utilize this data for making choropleth map for analyzing data of "Store Sales - Time Series Forecasting" data and please freely utilize this data for such use. (Thank you very much for "World Population Review"!)

  13. NFIP Community Layer No Overlaps Whole

    • catalog.data.gov
    • gimi9.com
    • +1more
    Updated Jun 7, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FEMA/Resilience/Federal Insurance Directorate (2025). NFIP Community Layer No Overlaps Whole [Dataset]. https://catalog.data.gov/dataset/nfip-community-layer-no-overlaps-whole
    Explore at:
    Dataset updated
    Jun 7, 2025
    Dataset provided by
    Federal Emergency Management Agencyhttp://www.fema.gov/
    Description

    This dataset is flattened and multicounty communities are unsplit by county lines. Flattened means that there are no overlaps; larger shapes like counties are punched out or clipped where smaller communities are contained within them. This allows for choropleth shading and other mapping techniques such as calculating unincorporated county land area. Multicounty cities like Houston are a single feature, undivided by counties. This layer is derived from Census, State of Maine, and National Flood Hazard Layer political boundaries.rnrnThe Community Layer datasets contain geospatial community boundaries associated with Census and NFIP data. The dataset does not contain personal identifiable information (PII). The Community Layer can be used to tie Community ID numbers (CID) to jurisdiction, tribal, and special land use area boundaries.rnrnA geodatabase (GDB) link is Included in the Full Data section below. The compressed file contains a collection of files that can store, query, and manage both spatial and nonspatial data using software that can read such a file. It bcontains all of the community layers/b, not just the layer for which this dataset page describes. rnThis layer can also be accessed from the FEMA ArcGIS viewer online: https://fema.maps.arcgis.com/home/item.html?id=8dcf28fc5b97404bbd9d1bc6d3c9b3cfrnrnrnCitation: FEMA's citation requirements for datasets (API usage or file downloads) can be found on the OpenFEMA Terms and Conditions page, Citing Data section: https://www.fema.gov/about/openfema/terms-conditions.rnrnFor answers to Frequently Asked Questions (FAQs) about the OpenFEMA program, API, and publicly available datasets, please visit: https://www.fema.gov/about/openfema/faq.rnIf you have media inquiries about this dataset, please email the FEMA News Desk at FEMA-News-Desk@fema.dhs.gov or call (202) 646-3272. For inquiries about FEMA's data and Open Government program, please email the OpenFEMA team at OpenFEMA@fema.dhs.gov.

  14. Homicide Rates in Mexico by State (1990-2023)

    • figshare.com
    csv
    Updated Nov 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Montserrat Mora (2025). Homicide Rates in Mexico by State (1990-2023) [Dataset]. http://doi.org/10.6084/m9.figshare.28067651.v4
    Explore at:
    csvAvailable download formats
    Dataset updated
    Nov 20, 2025
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Montserrat Mora
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Mexico
    Description

    This project provides a comprehensive dataset on intentional homicides in Mexico from 1990 to 2023, disaggregated by sex and state. It includes both raw data and tools for visualization, making it a valuable resource for researchers, policymakers, and analysts studying violence trends, gender disparities, and regional patterns.ContentsHomicide Data: Total number of male and female victims per state and year.Population Data: Corresponding male and female population estimates for each state and year.Homicide Rates: Per 100,000 inhabitants, calculated for both sexes.Choropleth Map Script: A Python script that generates homicide rate maps using a GeoJSON file.GeoJSON File: A spatial dataset defining Mexico's state boundaries, used for mapping.Sample Figure: A pre-generated homicide rate map for 2023 as an example.Requirements File: A requirements.txt file listing necessary dependencies for running the script.SourcesHomicide Data: INEGI - Vital Statistics MicrodataPopulation Data: Mexican Population Projections 2020-2070This dataset enables spatial analysis and data visualization, helping users explore homicide trends across Mexico in a structured and reproducible way.

  15. n

    Digital Equity and Inclusion in WPC

    • data.wpcouncils.nsw.gov.au
    • data.theparks.nsw.gov.au
    csv, excel, geojson +1
    Updated Mar 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Digital Equity and Inclusion in WPC [Dataset]. https://data.wpcouncils.nsw.gov.au/explore/dataset/digital-equity-and-inclusion-in-wpc/
    Explore at:
    geojson, excel, json, csvAvailable download formats
    Dataset updated
    Mar 12, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Digital Equity and Inclusion in Western Parkland CityThe Western Parkland City Digital Equity and Inclusion project shows the localised findings of targeted research aimed at benchmarking digital inclusion across the Western Parkland City. The index scores, depicted through a choropleth map, highlight specific gaps and priorities for improving overall digital inclusion and dimensions of Access, Ability and Affordability across each of the eight participating Council areas.AbilityDigital Ability is about our skill levels: what we are able do online, and our confidence in doing it. Having limited digital capability in the types of skills and knowledge needed to get online, operate functions and navigate sites with confidence and safety has been referred to the ‘second level’ digital divide. In measuring Digital Ability, the ADII draws upon the Internet Skills Scale (ISS) to focus on six skills domains. The ADII’s Digital Ability score measures the following skills components:Basic operational (i.e., downloading and opening files, connecting to the internet, and setting passwords)Advanced operational (i.e., saving to the cloud, determining what is safe to download, customising devices and connections, and adjusting privacy settings (e.g. downloading and opening files, connecting to the internet))Information navigation (i.e., searching and navigating, verifying trustworthy information, and managing third party data collection)Social (i.e., deciding what to share, how, and who with, managing and monitoring contacts, and communicating with others)Creative (i.e., editing, producing, and posting content, as well as having a broad understanding of the rules that may apply to these activities)Smart (i.e., connecting, operating, and managing smart devices and IoT technologies)AccessThe Digital Access dimension within the ADII is a measure of several interrelated components of internet usage that include intensity and frequency of use, types of devices, and use of fixed and mobile plans. It is well recognised that the quality of both fixed and mobile connectivity is problematic and underserviced in regional and remote areas – often due to intermittent and unreliable access to the nbn.AffordabilityAs connected technologies have developed and more people move online, some gaps in connectivity access have narrowed. However, for many people, particularly in areas with higher concentrations of low-income individuals and households, affordability can present significant barriers to achieving digital equity across the city. This includes being able to afford quality and reliable mobile and fixed broadband plans and the devices needed to connect online.For additional information click this linkSource: Data is sourced through a collaboration between Smart Places, Cities and Active Transport, Transport for NSW and The Parks, Sydney’s Parkland Councils, an alliance of the eight local government areas that comprise Western Parkland City as part of the Western Parkland City Digital Equity and Inclusion Insights Program. This is currently a one-off release. At this time we do not have plans to update this dataset regularly.

  16. S

    PostGIS data for London and Greater London ward boundaries as of 2018

    • splitgraph.com
    Updated Aug 19, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    splitgraph (2020). PostGIS data for London and Greater London ward boundaries as of 2018 [Dataset]. https://www.splitgraph.com/splitgraph/london_wards/
    Explore at:
    application/openapi+json, application/vnd.splitgraph.image, jsonAvailable download formats
    Dataset updated
    Aug 19, 2020
    Authors
    splitgraph
    Area covered
    Greater London, London
    Description

    PostGIS data for London and Greater London ward boundaries as of 2018.

    This dataset is used in the london_votes sample Splitfile in which the 2017 General Election results and London Ward geodata are joined through the ONS UK Ward-Constituency lookup table to build a dataset of London constituencies and Conservative/Labour votes in each, ready for plotting as a Choropleth map.

    https://data.london.gov.uk/dataset/statistical-gis-boundary-files-london

    Contains National Statistics data © Crown copyright and database right 2012

    Contains Ordnance Survey data © Crown copyright and database right 2012

  17. a

    Multiple Hazard Index for United States Counties

    • hub.arcgis.com
    • gis-fema.hub.arcgis.com
    Updated Jul 29, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    jjs2154_columbia (2016). Multiple Hazard Index for United States Counties [Dataset]. https://hub.arcgis.com/items/800f684ebadf423bae4c669cb0a1d7da
    Explore at:
    Dataset updated
    Jul 29, 2016
    Dataset authored and provided by
    jjs2154_columbia
    Area covered
    Description

    OverviewThe multiple hazard index for the United States Counties was designed to map natural hazard relating to exposure to multiple natural disasters. The index was created to provide communities and public health officials with an overview of the risks that are prominent in their county, and to facilitate the comparison of hazard level between counties. Most existing hazard maps focus on a single disaster type. By creating a measure that aggregates the hazard from individual disasters, the increased hazard that results from exposure to multiple natural disasters can be better understood. The multiple hazard index represents the aggregate of hazard from eleven individual disasters. Layers displaying the hazard from each individual disaster are also included.

    The hazard index is displayed visually as a choropleth map, with the color blue representing areas with less hazard and red representing areas with higher hazard. Users can click on each county to view its hazard index value, and the level of hazard for each individual disaster. Layers describing the relative level of hazard from each individual disaster are also available as choropleth maps with red areas representing high, orange representing medium, and yellow representing low levels of hazard.Methodology and Data CitationsMultiple Hazard Index

    The multiple hazard index was created by coding the individual hazard classifications and summing the coded values for each United States County. Each individual hazard is weighted equally in the multiple hazard index. Alaska and Hawaii were excluded from analysis because one third of individual hazard datasets only describe the coterminous United States.

    Avalanche Hazard

    University of South Carolina Hazards and Vulnerability Research Institute. “Spatial Hazard Events and Losses Database”. United States Counties. “Avalanches United States 2001-2009”. < http://hvri.geog.sc.edu/SHELDUS/

    Downloaded 06/2016.

    Classification

    Avalanche hazard was classified by dividing counties based upon the number of avalanches they experienced over the nine year period in the dataset. Avalanche hazard was not normalized by total county area because it caused an over-emphasis on small counties, and because avalanches are a highly local hazard.

    None = 0 AvalanchesLow = 1 AvalancheMedium = 2-5 AvalanchesHigh = 6-10 Avalanches

    Earthquake Hazard

    United States Geological Survey. “Earthquake Hazard Maps”. 1:2,000,000. “Peak Ground Acceleration 2% in 50 Years”. < http://earthquake.usgs.gov/hazards/products/conterminous/

    . Downloaded 07/2016.

    Classification

    Peak ground acceleration (% gravity) with a 2% likelihood in 50 years was averaged by United States County, and the earthquake hazard of counties was classified based upon this average.

    Low = 0 - 14.25 % gravity peak ground accelerationMedium = 14.26 - 47.5 % gravity peak ground accelerationHigh = 47.5+ % gravity peak ground acceleration

    Flood Hazard

    United States Federal Emergency Management Administration. “National Flood Hazard Layer”. 1:10,000. “0.2 Percent Annual Flood Area”. < https://data.femadata.com/FIMA/Risk_MAP/NFHL/

    . Downloaded 07/2016.

    Classification

    The National Flood Hazard Layer 0.2 Percent Annual Flood Area was spatially intersected with the United States Counties layer, splitting flood areas by county and adding county information to flood areas. Flood area was aggregated by county, expressed as a fraction of the total county land area, and flood hazard was classified based upon percentage of land that is susceptible to flooding. National Flood Hazard Layer does not cover the entire United States; coverage is focused on populated areas. Areas not included in National Flood Hazard Layer were assigned flood risk of Low in order to include these areas in further analysis.

    Low = 0-.001% area susceptibleMedium = .00101 % - .005 % area susceptibleHigh = .00501+ % area susceptible

    Heat Wave Hazard

    United States Center for Disease Control and Prevention. “National Climate Assessment”. Contiguous United States Counties. “Extreme Heat Events: Heat Wave Days in May - September for years 1981-2010”. Downloaded 06/2016.

    Classification

    Heat wave was classified by dividing counties based upon the number of heat wave days they experienced over the 30 year time period described in the dataset.

    Low = 126 - 171 Heat wave DaysMedium = 172 – 187 Heat wave DaysHigh = 188 – 255 Heat wave Days

    Hurricane Hazard

    National Oceanic and Atmospheric Administration. Coastal Services Center. “Historical North Atlantic Tropical Cyclone Tracks, 1851-2004”. 1: 2,000,000. < https://catalog.data.gov/dataset/historical-north-atlantic-tropical-cyclone-tracks-1851-2004-direct-download

    . Downloaded 06/2016.

    National Oceanic and Atmospheric Administration. Coastal Services Center. “Historical North Pacific Tropical Cyclone Tracks, 1851-2004”. 1: 2,000,000. < https://catalog.data.gov/dataset/historical-north-atlantic-tropical-cyclone-tracks-1851-2004-direct-download

    . Downloaded 06/2016.

    Classification

    Atlantic and Pacific datasets were merged. Tropical storm and disturbance tracks were filtered out leaving hurricane tracks. Each hurricane track was assigned the value of the category number that describes that event. Weighting each event by intensity ensures that areas with higher intensity events are characterized as being more hazardous. Values describing each hurricane event were aggregated by United States County, normalized by total county area, and the hurricane hazard of counties was classified based upon the normalized value.

    Landslide Hazard

    United States Geological Survey. “Landslide Overview Map of the United States”. 1:4,000,000. “Landslide Incidence and Susceptibility in the Conterminous United States”. < https://catalog.data.gov/dataset/landslide-incidence-and-susceptibility-in-the-conterminous-united-states-direct-download

    . Downloaded 07/2016.

    Classification

    The classifications of High, Moderate, and Low landslide susceptibility and incidence from the study were numerically coded, the average value was computed for each county, and the landslide hazard was classified based upon the average value.

    Long-Term Drought Hazard

    United States Drought Monitor, Drought Mitigation Center, United States Department of Agriculture, National Oceanic and Atmospheric Administration. “Drought Monitor Summary Map”. “Long-Term Drought Impact”. < http://droughtmonitor.unl.edu/MapsAndData/GISData.aspx >. Downloaded 06/2016.

    Classification

    Short-term drought areas were filtered from the data; leaving only long-term drought areas. United States Counties were assigned the average U.S. Drought Monitor Classification Scheme Drought Severity Classification value that characterizes the county area. County long-term drought hazard was classified based upon average Drought Severity Classification value.

    Low = 1 – 1.75 average Drought Severity Classification valueMedium = 1.76 -3.0 average Drought Severity Classification valueHigh = 3.0+ average Drought Severity Classification value

    Snowfall Hazard

    United States National Oceanic and Atmospheric Administration. “1981-2010 U.S. Climate Normals”. 1: 2,000,000. “Annual Snow Normal”. < http://www1.ncdc.noaa.gov/pub/data/normals/1981-2010/products/precipitation/

    . Downloaded 08/2016.

    Classification

    Average yearly snowfall was joined with point location of weather measurement stations, and stations without valid snowfall measurements were filtered out (leaving 6233 stations). Snowfall was interpolated using least squared distance interpolation to create a .05 degree raster describing an estimate of yearly snowfall for the United States. The average yearly snowfall raster was aggregated by county to yield the average yearly snowfall per United States County. The snowfall risk of counties was classified by average snowfall.

    None = 0 inchesLow = .01- 10 inchesMedium = 10.01- 50 inchesHigh = 50.01+ inches

    Tornado Hazard

    United States National Oceanic and Atmospheric Administration Storm Prediction Center. “Severe Thunderstorm Database and Storm Data Publication”. 1: 2,000,000. “United States Tornado Touchdown Points 1950-2004”. < https://catalog.data.gov/dataset/united-states-tornado-touchdown-points-1950-2004-direct-download

    . Downloaded 07/2016.

    Classification

    Each tornado touchdown point was assigned the value of the Fujita Scale that describes that event. Weighting each event by intensity ensures that areas with higher intensity events are characterized as more hazardous. Values describing each tornado event were aggregated by United States County, normalized by total county area, and the tornado hazard of counties was classified based upon the normalized value.

    Volcano Hazard

    Smithsonian Institution National Volcanism Program. “Volcanoes of the World”. “Holocene Volcanoes”. < http://volcano.si.edu/search_volcano.cfm

    . Downloaded 07/2016.

    Classification

    Volcano coordinate locations from spreadsheet were mapped and aggregated by United States County. Volcano count was normalized by county area, and the volcano hazard of counties was classified based upon the number of volcanoes present per unit area.

    None = 0 volcanoes/100 kilometersLow = 0.000915 - 0.007611 volcanoes / 100 kilometersMedium = 0.007612 - 0.018376 volcanoes / 100 kilometersHigh = 0.018377- 0.150538 volcanoes / 100 kilometers

    Wildfire Hazard

    United States Department of Agriculture, Forest Service, Fire, Fuel, and Smoke Science Program. “Classified 2014 Wildfire Hazard Potential”. 270 meters. < http://www.firelab.org/document/classified-2014-whp-gis-data-and-maps

    . Downloaded 06/2016.

    Classification

    The classifications of Very High, High, Moderate, Low, Very Low, and Non-Burnable/Water wildfire hazard from the study were numerically coded, the average value was computed for each county, and the wildfire hazard was classified based upon the average value.

  18. Geographic data of Japan

    • kaggle.com
    zip
    Updated May 8, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    zhanhao h. (2021). Geographic data of Japan [Dataset]. https://www.kaggle.com/zhanhaoh/geographic-data-of-japan
    Explore at:
    zip(1257833 bytes)Available download formats
    Dataset updated
    May 8, 2021
    Authors
    zhanhao h.
    Area covered
    Japan
    Description

    Context

    The dataset defines the geographic polygon shapes of the prefectures of Japan. You can use it for plotting Mapbox Choropleth maps by the plotly package conveniently. It is a small modification from the dataset at https://github.com/dataofjapan/land/blob/master/japan.geojson.

    Content

    For each prefecture, an id is assigned. The id naming is something like 'Kyoto' which means for the Kyoto prefecture, and 'Okinawa' which means for the Okinawa prefecture.

    Acknowledgements

    It is a small modification from the original dataset at https://github.com/dataofjapan/land/blob/master/japan.geojson. I have added id for each element so that it can be conveniently used for plotting Mapbox Choropleth maps.

  19. Brooklyn Home Sales, 2003 to 2017

    • kaggle.com
    zip
    Updated Feb 15, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tommy Wu (2018). Brooklyn Home Sales, 2003 to 2017 [Dataset]. https://www.kaggle.com/tianhwu/brooklynhomes2003to2017
    Explore at:
    zip(80381469 bytes)Available download formats
    Dataset updated
    Feb 15, 2018
    Authors
    Tommy Wu
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    Brooklyn
    Description

    Context

    I'm trying to make a Choropleth map over time of home sale prices by block in Brooklyn for the last 15 years to visualize gentrification. I have the entire dataset for all 5 boroughs of New York, but am starting with Brooklyn.

    Content and Acknowledgements

    Primary dataset is the NYC Housing Sales Data Found in this Link: http://www1.nyc.gov/site/finance/taxes/property-rolling-sales-data.page

    The data in all the separate excel spreadsheets for 2003-2017 was merged via VBA scripting in Excel and further cleaned & de-duped in R

    Additionally, in my hunt for shapefiles I discovered these wonderful shapefiles from NYCPluto: https://www1.nyc.gov/site/planning/data-maps/open-data/dwn-pluto-mappluto.page

    I left joined it by "Block" & "Lot" onto the primary data frame, but 25% of the block/lot combo's ended up not having a corresponding entry in the Pluto shapefile and are NAs.

    Note that as in other uploaded datasets of NYC housing on Kaggle, many of these transactions have a sale_price of $0 or only a nominal amount far less than market value. These are likely property transfers to relatives and should be excluded from any analysis of market prices.

    Inspiration

    Can you model Brooklyn home prices accurately?

  20. usapolygones4

    • kaggle.com
    zip
    Updated Nov 7, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gaétan Dubuc (2020). usapolygones4 [Dataset]. https://www.kaggle.com/gatandubuc/usapolygones4
    Explore at:
    zip(8560931 bytes)Available download formats
    Dataset updated
    Nov 7, 2020
    Authors
    Gaétan Dubuc
    Description

    Context

    It's difficult to find shapefiles to construct choropleth map. So this dataset is trying to gather a maximum of shapefiles. Be carefull to downloads all the files, otherwise you will have errors.

    Content

    tl_2014_us_state: It gathers 56 american territories. 50 states and 6 others, Guam, Puerto Rico, United States Virgin Islands, Commonwealth of the Northern Mariana Islands, American Samoa, District of Columbia.

    Acknowledgements

    Found here: ftp://ftp2.census.gov/geo/tiger/TIGER2014/STATE/

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Vega Datasets (2025). Geospatial Data Pack for Visualization [Dataset]. https://www.kaggle.com/datasets/vega-datasets/geospatial-data-pack
Organization logo

Geospatial Data Pack for Visualization

Learn Geographic Mapping with Altair, Vega-Lite and Vega using Curated Datasets

Explore at:
zip(1422109 bytes)Available download formats
Dataset updated
Oct 21, 2025
Dataset authored and provided by
Vega Datasets
Description

Geospatial Data Pack for Visualization 🗺️

Learn Geographic Mapping with Altair, Vega-Lite and Vega using Curated Datasets

Complete geographic and geophysical data collection for mapping and visualization. This consolidation includes 18 complementary datasets used by 31+ Vega, Vega-Lite, and Altair examples 📊. Perfect for learning geographic visualization techniques including projections, choropleths, point maps, vector fields, and interactive displays.

Source data lives on GitHub and can also be accessed via CDN. The vega-datasets project serves as a common repository for example datasets used across these visualization libraries and related projects.

Why Use This Dataset? 🤔

  • Comprehensive Geospatial Types: Explore a variety of core geospatial data models:
    • Vector Data: Includes points (like airports.csv), lines (like londonTubeLines.json), and polygons (like us-10m.json).
    • Raster-like Data: Work with gridded datasets (like windvectors.csv, annual-precip.json).
  • Diverse Formats: Gain experience with standard and efficient geospatial formats like GeoJSON (see Table 1, 2, 4), compressed TopoJSON (see Table 1), and plain CSV/TSV (see Table 2, 3, 4) for point data and attribute tables ready for joining.
  • Multi-Scale Coverage: Practice visualization across different geographic scales, from global and national (Table 1, 4) down to the city level (Table 1).
  • Rich Thematic Mapping: Includes multiple datasets (Table 3) specifically designed for joining attributes to geographic boundaries (like states or counties from Table 1) to create insightful choropleth maps.
  • Ready-to-Use & Example-Driven: Cleaned datasets tightly integrated with 31+ official examples (see Appendix) from Altair, Vega-Lite, and Vega, allowing you to immediately practice techniques like projections, point maps, network maps, and interactive displays.
  • Python Friendly: Works seamlessly with essential Python libraries like Altair (which can directly read TopoJSON/GeoJSON), Pandas, and GeoPandas, fitting perfectly into the Kaggle notebook environment.

Table of Contents

Dataset Inventory 🗂️

This pack includes 18 datasets covering base maps, reference points, statistical data for choropleths, and geophysical data.

1. BASE MAP BOUNDARIES (Topological Data)

DatasetFileSizeFormatLicenseDescriptionKey Fields / Join Info
US Map (1:10m)us-10m.json627 KBTopoJSONCC-BY-4.0US state and county boundaries. Contains states and counties objects. Ideal for choropleths.id (FIPS code) property on geometries
World Map (1:110m)world-110m.json117 KBTopoJSONCC-BY-4.0World country boundaries. Contains countries object. Suitable for world-scale viz.id property on geometries
London BoroughslondonBoroughs.json14 KBTopoJSONCC-BY-4.0London borough boundaries.properties.BOROUGHN (name)
London CentroidslondonCentroids.json2 KBGeoJSONCC-BY-4.0Center points for London boroughs.properties.id, properties.name
London Tube LineslondonTubeLines.json78 KBGeoJSONCC-BY-4.0London Underground network lines.properties.name, properties.color

2. GEOGRAPHIC REFERENCE POINTS (Point Data) 📍

DatasetFileSizeFormatLicenseDescriptionKey Fields / Join Info
US Airportsairports.csv205 KBCSVPublic DomainUS airports with codes and coordinates.iata, state, `l...
Search
Clear search
Close search
Google apps
Main menu