Facebook
TwitterLearn Geographic Mapping with Altair, Vega-Lite and Vega using Curated Datasets
Complete geographic and geophysical data collection for mapping and visualization. This consolidation includes 18 complementary datasets used by 31+ Vega, Vega-Lite, and Altair examples đ. Perfect for learning geographic visualization techniques including projections, choropleths, point maps, vector fields, and interactive displays.
Source data lives on GitHub and can also be accessed via CDN. The vega-datasets project serves as a common repository for example datasets used across these visualization libraries and related projects.
airports.csv), lines (like londonTubeLines.json), and polygons (like us-10m.json).windvectors.csv, annual-precip.json).This pack includes 18 datasets covering base maps, reference points, statistical data for choropleths, and geophysical data.
| Dataset | File | Size | Format | License | Description | Key Fields / Join Info |
|---|---|---|---|---|---|---|
| US Map (1:10m) | us-10m.json | 627 KB | TopoJSON | CC-BY-4.0 | US state and county boundaries. Contains states and counties objects. Ideal for choropleths. | id (FIPS code) property on geometries |
| World Map (1:110m) | world-110m.json | 117 KB | TopoJSON | CC-BY-4.0 | World country boundaries. Contains countries object. Suitable for world-scale viz. | id property on geometries |
| London Boroughs | londonBoroughs.json | 14 KB | TopoJSON | CC-BY-4.0 | London borough boundaries. | properties.BOROUGHN (name) |
| London Centroids | londonCentroids.json | 2 KB | GeoJSON | CC-BY-4.0 | Center points for London boroughs. | properties.id, properties.name |
| London Tube Lines | londonTubeLines.json | 78 KB | GeoJSON | CC-BY-4.0 | London Underground network lines. | properties.name, properties.color |
| Dataset | File | Size | Format | License | Description | Key Fields / Join Info |
|---|---|---|---|---|---|---|
| US Airports | airports.csv | 205 KB | CSV | Public Domain | US airports with codes and coordinates. | iata, state, `l... |
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The choropleth map is a device used for the display of socioeconomic data associated with an areal partition of geographic space. Cartographers emphasize the need to standardize any raw count data by an area-based total before displaying the data in a choropleth map. The standardization process converts the raw data from an absolute measure into a relative measure. However, there is recognition that the standardizing process does not enable the map reader to distinguish between lowâlow and highâhigh numerator/denominator differences. This research uses concentration-based classification schemes using Lorenz curves to address some of these issues. A test data set of nonwhite birth rate by county in North Carolina is used to demonstrate how this approach differs from traditional meanâvariance-based systems such as the Jenksâ optimal classification scheme.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains geometry data for the countries of the world together with their names and country codes in various formats. The primary use case is choropleths, color-coded maps. The data can be read as a pandas DataFrame with geopandas and plotted with matplotlib. See the starter notebook for an example how to do it.
The data was created by Natural Earth. It is in public domain and free to use for any purpose at the time of this writing; you might want to check their Terms of Use.
Photo by KOBU Agency on Unsplash
Facebook
TwitterI created a dataset to help people create choropleth maps of United States states.
One geojson to plot the countries borders, and one csv from the Census Bureau for the us population per state.
I think the best way to use this dataset is in joining it with other data. For example, I used this dataset to plot police killings using the data from https://www.kaggle.com/jpmiller/police-violence-in-the-us
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
A comprehensive dataset of 1,513 Pakistani cities, towns, tehsils, districts and places with latitude/longitude, administrative region, population (when available) and Wikidata IDs â ideal for mapping, geospatial analysis, enrichment, and location-based ML.
Why this dataset is valuable:
Highlights (fetched from the data):
Column definitions (short):
Typical & high-value use cases:
Facebook
TwitterAnyone who has taught GIS using Census Data knows it is an invaluable data set for showing students how to take data stored in a table and join it to boundary data to transform this data into something that can be visualised and analysed spatially. Joins are a core GIS skill and need to be learnt, as not every data set is going to come neatly packaged as a shapefile or feature layer with all the data you need stored within. I don't know how many times I taught students to download data as a table from Nomis, load it into a GIS and then join that table data to the appropriate boundary data so they could produce choropleth maps to do some visual analysis, but it was a lot! Once students had gotten the hang of joins using census data they'd often ask why this data doesn't exist as a prepackaged feature layer with all the data they wanted within it. Well good news, now a lot off it is and it's accessible through the Living Atlas! Don't get me wrong I fully understand the importance of teaching students how to perform joins but once you have this understanding if you can access data that already contains all the information you need then you should be taking advantage of it to save you time. So in this exercise I am going to show you how to load English and Welsh Census Data from the 2021 Census into the ArcGIS Map Viewer from the Living Atlas and produce some choropleth maps to use to perform visual analysis without having to perform a single join.
Facebook
TwitterWe propose map reading experiments to quantitatively evaluate the selection of hue ranges for sequential color schemes on choropleth maps. In these experiments, 60 sequential color schemes with six base hues and ten hue ranges were employed as experimental color schemes, and a total of 414 college students were invited to complete identification, comparison, and ranking tasks. Both controlled and real-map experiments were performed, each involving a web-based survey and an eye-tracking experiment. In the controlled experiments, the shapes of the map objects were relatively regular, and attribute data were randomized. In contrast, the shapes were complex in real-map experiments, and real data were employed. Our findings show that widely used color schemes with a hue range of 0ĂÂş yield poor performance in all tasks; 15ĂÂş hue ranges yield good performance in the comparison and ranking tasks but poor performance in the identification task. For large hue ranges of 120-360ĂÂş, participants showed...
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
I used this publicly available data for making interactive map visualization of NYC. Zipcode geodata is useful for building interactive maps with each zip code area representing a separate area on the map.
NYC zipcode geodata in geojson format
The rights belong to the original authors.
Facebook
TwitterThe dataset defines the geographic polygon shapes of the prefectures of Japan. You can use it for plotting Mapbox Choropleth maps by the plotly package conveniently. It is a small modification from the dataset at https://github.com/dataofjapan/land/blob/master/japan.geojson.
For each prefecture, an id is assigned. The id naming is something like 'Kyoto' which means for the Kyoto prefecture, and 'Okinawa' which means for the Okinawa prefecture.
It is a small modification from the original dataset at https://github.com/dataofjapan/land/blob/master/japan.geojson. I have added id for each element so that it can be conveniently used for plotting Mapbox Choropleth maps.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset provides a comprehensive list of countries and dependent territories worldwide, along with their most recent population estimates.îThe data is sourced from the Wikipedia page List of countries and dependencies by population, which compiles figures from national statistical offices and the United Nations Population Divisionîî
Facebook
TwitterCrimeMapTutorial is a step-by-step tutorial for learning crime mapping using ArcView GIS or MapInfo Professional GIS. It was designed to give users a thorough introduction to most of the knowledge and skills needed to produce daily maps and spatial data queries that uniformed officers and detectives find valuable for crime prevention and enforcement. The tutorials can be used either for self-learning or in a laboratory setting. The geographic information system (GIS) and police data were supplied by the Rochester, New York, Police Department. For each mapping software package, there are three PDF tutorial workbooks and one WinZip archive containing sample data and maps. Workbook 1 was designed for GIS users who want to learn how to use a crime-mapping GIS and how to generate maps and data queries. Workbook 2 was created to assist data preparers in processing police data for use in a GIS. This includes address-matching of police incidents to place them on pin maps and aggregating crime counts by areas (like car beats) to produce area or choropleth maps. Workbook 3 was designed for map makers who want to learn how to construct useful crime maps, given police data that have already been address-matched and preprocessed by data preparers. It is estimated that the three tutorials take approximately six hours to complete in total, including exercises.
Facebook
TwitterFolium makes it easy to visualize data thatâs been manipulated in Python on an interactive leaflet map. It enables both the binding of data to a map for choropleth visualizations as well as passing rich vector/raster/HTML visualizations as markers on the map. These files can be used to mark the state boundaries on the map of INDIA using folium library and the CSV also contains the state data and how to use it in our notebooks. I have used it in one of my kernels which can be viewed.
The library has a number of built-in tilesets from OpenStreetMap, Mapbox, and Stamen, and supports custom tilesets with Mapbox or Cloudmade API keys. folium supports both Image, Video, GeoJSON, and TopoJSON overlays. Due to extensible functionalities I find folium the best map plotting library in python. Do give it a try and use it in your kernels.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
GeoJSON file containing India state borders along with their non-spatial attributes (id, name, etc) for use in Plotly Choropleth maps.
Facebook
TwitterIn June 2022, Thailand legalized recreational cannabis. Currently, cannabis is now the most consumed drug. Cannabis usage can increase inflammatory responses in the respiratory tract. Sharing of cannabis waterpipes has been linked to increased tuberculosis risks. Using a national in-patient databank, we aimed to 1) describe the spatiotemporal correlation between cannabis-related and tuberculosis hospital admissions, and 2) compare the rate of subsequent pulmonary tuberculosis admission between those with prior admissions for cannabis-related causes and those without. Both admission types were aggregated to the number of admissions in monthly and provincial units. Temporal and spatial patterns were visualized using line plots and choropleth maps, respectively. A matched cohort analysis was conducted to compare the incidence density rate of subsequent tuberculosis admission and the hazard ratio. Throughout 2017â2022, we observed a gradual decline in tuberculosis admissions, in contrast to the increase in cannabis-related admissions. Both admissions shared a hotspot in Northeastern Thailand. Between matched cohorts of 6,773 in-patients, the incidence density rate per 100,000 personâyears of subsequent tuberculosis admissions was 267.6 and 165.9 in in-patients with and without past cannabis-admission, respectively. After adjusting for covariates, we found that a cannabis-related admission history was associated with a hazard ratio of 1.48 (P = 0.268) for subsequent tuberculosis admission. Our findings failed to support the evidence that cannabis consumption increased pulmonary tuberculosis risk. Other study types are needed to further assess the association between cannabis consumption and pulmonary tuberculosis.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Synthetic populations for regions of the World (SPW) | Jordan
Dataset information
A synthetic population of a region as provided here, captures the people of the region with selected demographic attributes, their organization into households, their assigned activities for a day, the locations where the activities take place and thus where interactions among population members happen (e.g., spread of epidemics).
License
Acknowledgment
This project was supported by the National Science Foundation under the NSF RAPID: COVID-19 Response Support: Building Synthetic Multi-scale Networks (PI: Madhav Marathe, Co-PIs: Henning Mortveit, Srinivasan Venkatramanan; Fund Number: OAC-2027541).
Contact information
Henning.Mortveit@virginia.edu
Identifiers
| Region name | Jordan |
| Region ID | jor |
| Model | coarse |
| Version | 0_9_0 |
Statistics
| Name | Value |
|---|---|
| Population | 5723567.0 |
| Average age | 23.5 |
| Households | 1235755.0 |
| Average household size | 4.6 |
| Residence locations | 1235755.0 |
| Activity locations | 131978.0 |
| Average number of activities | 6.4 |
| Average travel distance | 44.5 |
Sources
| Description | Name | Version | Url |
|---|---|---|---|
| Activity template data | World Bank | 2021 | https://data.worldbank.org |
| Administrative boundaries | ADCW | 7.6 | https://www.adci.com/adc-worldmap |
| Curated POIs based on OSM | SLIPO/OSM POIs | http://slipo.eu/?p=1551 https://www.openstreetmap.org/ | |
| Household data | DHS | https://dhsprogram.com | |
| Population count with demographic attributes | GPW | v4.11 | https://sedac.ciesin.columbia.edu/data/set/gpw-v4-admin-unit-center-points-population-estimates-rev11 |
Files description
Base data files (jor_data_v_0_9.zip)
| Filename | Description |
|---|---|
jor_person_v_0_9.csv | Data for each person including attributes such as age, gender, and household ID. |
jor_household_v_0_9.csv | Data at household level. |
jor_residence_locations_v_0_9.csv | Data about residence locations |
jor_activity_locations_v_0_9.csv | Data about activity locations, including what activity types are supported at these locations |
jor_activity_location_assignment_v_0_9.csv | For each person and for each of their activities, this file specifies the location where the activity takes place |
Derived data files
| Filename | Description |
|---|---|
jor_contact_matrix_v_0_9.csv | A POLYMOD-type contact matrix constructed from a network representation of the location assignment data and a within-location contact model. |
Validation and measures files
| Filename | Description |
|---|---|
jor_household_grouping_validation_v_0_9.pdf | Validation plots for household construction |
jor_activity_durations_{adult,child}_v_0_9.pdf | Comparison of time spent on generated activities with survey data |
jor_activity_patterns_{adult,child}_v_0_9.pdf | Comparison of generated activity patterns by the time of day with survey data |
jor_location_construction_0_9.pdf | Validation plots for location construction |
jor_location_assignement_0_9.pdf | Validation plots for location assignment, including travel distribution plots |
jor_jor_ver_0_9_0_avg_travel_distance.pdf | Choropleth map visualizing average travel distance |
jor_jor_ver_0_9_0_travel_distr_combined.pdf | Travel distance distribution |
jor_jor_ver_0_9_0_num_activity_loc.pdf | Choropleth map visualizing number of activity locations |
jor_jor_ver_0_9_0_avg_age.pdf | Choropleth map visualizing average age |
jor_jor_ver_0_9_0_pop_density_per_sqkm.pdf | Choropleth map visualizing population density |
jor_jor_ver_0_9_0_pop_size.pdf | Choropleth map visualizing population size |
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Synthetic populations for regions of the World (SPW) | Delhi
Dataset information
A synthetic population of a region as provided here, captures the people of the region with selected demographic attributes, their organization into households, their assigned activities for a day, the locations where the activities take place and thus where interactions among population members happen (e.g., spread of epidemics).
License
Acknowledgment
This project was supported by the National Science Foundation under the NSF RAPID: COVID-19 Response Support: Building Synthetic Multi-scale Networks (PI: Madhav Marathe, Co-PIs: Henning Mortveit, Srinivasan Venkatramanan; Fund Number: OAC-2027541).
Contact information
Henning.Mortveit@virginia.edu
Identifiers
| Region name | Delhi |
| Region ID | ind_140001944 |
| Model | coarse |
| Version | 0_9_0 |
Statistics
| Name | Value |
|---|---|
| Population | 15951510 |
| Average age | 28.2 |
| Households | 3625935 |
| Average household size | 4.4 |
| Residence locations | 3625935 |
| Activity locations | 1309377 |
| Average number of activities | 5.5 |
| Average travel distance | 26.6 |
Sources
| Description | Name | Version | Url |
|---|---|---|---|
| Activity template data | World Bank | 2021 | https://data.worldbank.org |
| Administrative boundaries | ADCW | 7.6 | https://www.adci.com/adc-worldmap |
| Curated POIs based on OSM | SLIPO/OSM POIs | http://slipo.eu/?p=1551 https://www.openstreetmap.org/ | |
| Household data | DHS | https://dhsprogram.com | |
| Population count with demographic attributes | GPW | v4.11 | https://sedac.ciesin.columbia.edu/data/set/gpw-v4-admin-unit-center-points-population-estimates-rev11 |
Files description
Base data files (ind_140001944_data_v_0_9.zip)
| Filename | Description |
|---|---|
ind_140001944_person_v_0_9.csv | Data for each person including attributes such as age, gender, and household ID. |
ind_140001944_household_v_0_9.csv | Data at household level. |
ind_140001944_residence_locations_v_0_9.csv | Data about residence locations |
ind_140001944_activity_locations_v_0_9.csv | Data about activity locations, including what activity types are supported at these locations |
ind_140001944_activity_location_assignment_v_0_9.csv | For each person and for each of their activities, this file specifies the location where the activity takes place |
Derived data files
| Filename | Description |
|---|---|
ind_140001944_contact_matrix_v_0_9.csv | A POLYMOD-type contact matrix constructed from a network representation of the location assignment data and a within-location contact model. |
Validation and measures files
| Filename | Description |
|---|---|
ind_140001944_household_grouping_validation_v_0_9.pdf | Validation plots for household construction |
ind_140001944_activity_durations_{adult,child}_v_0_9.pdf | Comparison of time spent on generated activities with survey data |
ind_140001944_activity_patterns_{adult,child}_v_0_9.pdf | Comparison of generated activity patterns by the time of day with survey data |
ind_140001944_location_construction_0_9.pdf | Validation plots for location construction |
ind_140001944_location_assignement_0_9.pdf | Validation plots for location assignment, including travel distribution plots |
ind_140001944_ind_140001944_ver_0_9_0_avg_travel_distance.pdf | Choropleth map visualizing average travel distance |
ind_140001944_ind_140001944_ver_0_9_0_travel_distr_combined.pdf | Travel distance distribution |
ind_140001944_ind_140001944_ver_0_9_0_num_activity_loc.pdf | Choropleth map visualizing number of activity locations |
ind_140001944_ind_140001944_ver_0_9_0_avg_age.pdf | Choropleth map visualizing average age |
ind_140001944_ind_140001944_ver_0_9_0_pop_density_per_sqkm.pdf | Choropleth map visualizing population density |
ind_140001944_ind_140001944_ver_0_9_0_pop_size.pdf | Choropleth map visualizing population size |
Facebook
TwitterODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
This dataset contains a comprehensive collection of geographic shapefiles representing the boundaries of countries and territories worldwide. The shapefiles define the outlines of each nation and are based on the most recent and accurate geographical data available. The dataset includes polygon geometries that accurately represent the territorial extent of each country, making it suitable for various geographical analyses, visualizations, and spatial applications.
Content: The dataset comprises shapefiles in the ESRI shapefile format (.shp) along with associated files (.shx, .dbf, etc.) that contain the attributes of each country, such as country names, ISO codes, and other relevant information. The polygons in the shapefiles correspond to the land boundaries of each nation, enabling precise mapping and spatial analysis.
Use Cases: This dataset can be utilized in a wide range of applications, including but not limited to:
Source: The shapefile data is sourced from reputable and authoritative geographic databases, ensuring its accuracy and reliability for diverse applications.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Synthetic populations for regions of the World (SPW) | Manipur
Dataset information
A synthetic population of a region as provided here, captures the people of the region with selected demographic attributes, their organization into households, their assigned activities for a day, the locations where the activities take place and thus where interactions among population members happen (e.g., spread of epidemics).
License
Acknowledgment
This project was supported by the National Science Foundation under the NSF RAPID: COVID-19 Response Support: Building Synthetic Multi-scale Networks (PI: Madhav Marathe, Co-PIs: Henning Mortveit, Srinivasan Venkatramanan; Fund Number: OAC-2027541).
Contact information
Henning.Mortveit@virginia.edu
Identifiers
| Region name | Manipur |
| Region ID | ind_140001942 |
| Model | coarse |
| Version | 0_9_0 |
Statistics
| Name | Value |
|---|---|
| Population | 2796700 |
| Average age | 27.5 |
| Households | 635806 |
| Average household size | 4.4 |
| Residence locations | 635806 |
| Activity locations | 192709 |
| Average number of activities | 5.5 |
| Average travel distance | 78.3 |
Sources
| Description | Name | Version | Url |
|---|---|---|---|
| Activity template data | World Bank | 2021 | https://data.worldbank.org |
| Administrative boundaries | ADCW | 7.6 | https://www.adci.com/adc-worldmap |
| Curated POIs based on OSM | SLIPO/OSM POIs | http://slipo.eu/?p=1551 https://www.openstreetmap.org/ | |
| Household data | DHS | https://dhsprogram.com | |
| Population count with demographic attributes | GPW | v4.11 | https://sedac.ciesin.columbia.edu/data/set/gpw-v4-admin-unit-center-points-population-estimates-rev11 |
Files description
Base data files (ind_140001942_data_v_0_9.zip)
| Filename | Description |
|---|---|
ind_140001942_person_v_0_9.csv | Data for each person including attributes such as age, gender, and household ID. |
ind_140001942_household_v_0_9.csv | Data at household level. |
ind_140001942_residence_locations_v_0_9.csv | Data about residence locations |
ind_140001942_activity_locations_v_0_9.csv | Data about activity locations, including what activity types are supported at these locations |
ind_140001942_activity_location_assignment_v_0_9.csv | For each person and for each of their activities, this file specifies the location where the activity takes place |
Derived data files
| Filename | Description |
|---|---|
ind_140001942_contact_matrix_v_0_9.csv | A POLYMOD-type contact matrix constructed from a network representation of the location assignment data and a within-location contact model. |
Validation and measures files
| Filename | Description |
|---|---|
ind_140001942_household_grouping_validation_v_0_9.pdf | Validation plots for household construction |
ind_140001942_activity_durations_{adult,child}_v_0_9.pdf | Comparison of time spent on generated activities with survey data |
ind_140001942_activity_patterns_{adult,child}_v_0_9.pdf | Comparison of generated activity patterns by the time of day with survey data |
ind_140001942_location_construction_0_9.pdf | Validation plots for location construction |
ind_140001942_location_assignement_0_9.pdf | Validation plots for location assignment, including travel distribution plots |
ind_140001942_ind_140001942_ver_0_9_0_avg_travel_distance.pdf | Choropleth map visualizing average travel distance |
ind_140001942_ind_140001942_ver_0_9_0_travel_distr_combined.pdf | Travel distance distribution |
ind_140001942_ind_140001942_ver_0_9_0_num_activity_loc.pdf | Choropleth map visualizing number of activity locations |
ind_140001942_ind_140001942_ver_0_9_0_avg_age.pdf | Choropleth map visualizing average age |
ind_140001942_ind_140001942_ver_0_9_0_pop_density_per_sqkm.pdf | Choropleth map visualizing population density |
ind_140001942_ind_140001942_ver_0_9_0_pop_size.pdf | Choropleth map visualizing population size |
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
List of sociodemographic variables used in PCA analysis to create new indicators for spatial analysis.
Facebook
TwitterThis dataset is flattened and multicounty communities are unsplit by county lines. Flattened means that there are no overlaps; larger shapes like counties are punched out or clipped where smaller communities are contained within them. This allows for choropleth shading and other mapping techniques such as calculating unincorporated county land area. Multicounty cities like Houston are a single feature, undivided by counties. This layer is derived from Census, State of Maine, and National Flood Hazard Layer political boundaries.rnrnThe Community Layer datasets contain geospatial community boundaries associated with Census and NFIP data. The dataset does not contain personal identifiable information (PII). The Community Layer can be used to tie Community ID numbers (CID) to jurisdiction, tribal, and special land use area boundaries.rnrnA geodatabase (GDB) link is Included in the Full Data section below. The compressed file contains a collection of files that can store, query, and manage both spatial and nonspatial data using software that can read such a file. It bcontains all of the community layers/b, not just the layer for which this dataset page describes. rnThis layer can also be accessed from the FEMA ArcGIS viewer online: https://fema.maps.arcgis.com/home/item.html?id=8dcf28fc5b97404bbd9d1bc6d3c9b3cfrnrnrnCitation: FEMA's citation requirements for datasets (API usage or file downloads) can be found on the OpenFEMA Terms and Conditions page, Citing Data section: https://www.fema.gov/about/openfema/terms-conditions.rnrnFor answers to Frequently Asked Questions (FAQs) about the OpenFEMA program, API, and publicly available datasets, please visit: https://www.fema.gov/about/openfema/faq.rnIf you have media inquiries about this dataset, please email the FEMA News Desk at FEMA-News-Desk@fema.dhs.gov or call (202) 646-3272. For inquiries about FEMA's data and Open Government program, please email the OpenFEMA team at OpenFEMA@fema.dhs.gov.
Facebook
TwitterLearn Geographic Mapping with Altair, Vega-Lite and Vega using Curated Datasets
Complete geographic and geophysical data collection for mapping and visualization. This consolidation includes 18 complementary datasets used by 31+ Vega, Vega-Lite, and Altair examples đ. Perfect for learning geographic visualization techniques including projections, choropleths, point maps, vector fields, and interactive displays.
Source data lives on GitHub and can also be accessed via CDN. The vega-datasets project serves as a common repository for example datasets used across these visualization libraries and related projects.
airports.csv), lines (like londonTubeLines.json), and polygons (like us-10m.json).windvectors.csv, annual-precip.json).This pack includes 18 datasets covering base maps, reference points, statistical data for choropleths, and geophysical data.
| Dataset | File | Size | Format | License | Description | Key Fields / Join Info |
|---|---|---|---|---|---|---|
| US Map (1:10m) | us-10m.json | 627 KB | TopoJSON | CC-BY-4.0 | US state and county boundaries. Contains states and counties objects. Ideal for choropleths. | id (FIPS code) property on geometries |
| World Map (1:110m) | world-110m.json | 117 KB | TopoJSON | CC-BY-4.0 | World country boundaries. Contains countries object. Suitable for world-scale viz. | id property on geometries |
| London Boroughs | londonBoroughs.json | 14 KB | TopoJSON | CC-BY-4.0 | London borough boundaries. | properties.BOROUGHN (name) |
| London Centroids | londonCentroids.json | 2 KB | GeoJSON | CC-BY-4.0 | Center points for London boroughs. | properties.id, properties.name |
| London Tube Lines | londonTubeLines.json | 78 KB | GeoJSON | CC-BY-4.0 | London Underground network lines. | properties.name, properties.color |
| Dataset | File | Size | Format | License | Description | Key Fields / Join Info |
|---|---|---|---|---|---|---|
| US Airports | airports.csv | 205 KB | CSV | Public Domain | US airports with codes and coordinates. | iata, state, `l... |