Facebook
TwitterLearn Geographic Mapping with Altair, Vega-Lite and Vega using Curated Datasets
Complete geographic and geophysical data collection for mapping and visualization. This consolidation includes 18 complementary datasets used by 31+ Vega, Vega-Lite, and Altair examples đ. Perfect for learning geographic visualization techniques including projections, choropleths, point maps, vector fields, and interactive displays.
Source data lives on GitHub and can also be accessed via CDN. The vega-datasets project serves as a common repository for example datasets used across these visualization libraries and related projects.
airports.csv), lines (like londonTubeLines.json), and polygons (like us-10m.json).windvectors.csv, annual-precip.json).This pack includes 18 datasets covering base maps, reference points, statistical data for choropleths, and geophysical data.
| Dataset | File | Size | Format | License | Description | Key Fields / Join Info |
|---|---|---|---|---|---|---|
| US Map (1:10m) | us-10m.json | 627 KB | TopoJSON | CC-BY-4.0 | US state and county boundaries. Contains states and counties objects. Ideal for choropleths. | id (FIPS code) property on geometries |
| World Map (1:110m) | world-110m.json | 117 KB | TopoJSON | CC-BY-4.0 | World country boundaries. Contains countries object. Suitable for world-scale viz. | id property on geometries |
| London Boroughs | londonBoroughs.json | 14 KB | TopoJSON | CC-BY-4.0 | London borough boundaries. | properties.BOROUGHN (name) |
| London Centroids | londonCentroids.json | 2 KB | GeoJSON | CC-BY-4.0 | Center points for London boroughs. | properties.id, properties.name |
| London Tube Lines | londonTubeLines.json | 78 KB | GeoJSON | CC-BY-4.0 | London Underground network lines. | properties.name, properties.color |
| Dataset | File | Size | Format | License | Description | Key Fields / Join Info |
|---|---|---|---|---|---|---|
| US Airports | airports.csv | 205 KB | CSV | Public Domain | US airports with codes and coordinates. | iata, state, `l... |
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
A comprehensive dataset of 1,513 Pakistani cities, towns, tehsils, districts and places with latitude/longitude, administrative region, population (when available) and Wikidata IDs â ideal for mapping, geospatial analysis, enrichment, and location-based ML.
Why this dataset is valuable:
Highlights (fetched from the data):
Column definitions (short):
Typical & high-value use cases:
Facebook
TwitterWe propose map reading experiments to quantitatively evaluate the selection of hue ranges for sequential color schemes on choropleth maps. In these experiments, 60 sequential color schemes with six base hues and ten hue ranges were employed as experimental color schemes, and a total of 414 college students were invited to complete identification, comparison, and ranking tasks. Both controlled and real-map experiments were performed, each involving a web-based survey and an eye-tracking experiment. In the controlled experiments, the shapes of the map objects were relatively regular, and attribute data were randomized. In contrast, the shapes were complex in real-map experiments, and real data were employed. Our findings show that widely used color schemes with a hue range of 0ĂÂș yield poor performance in all tasks; 15ĂÂș hue ranges yield good performance in the comparison and ranking tasks but poor performance in the identification task. For large hue ranges of 120-360ĂÂș, participants showed...
Facebook
TwitterAnyone who has taught GIS using Census Data knows it is an invaluable data set for showing students how to take data stored in a table and join it to boundary data to transform this data into something that can be visualised and analysed spatially. Joins are a core GIS skill and need to be learnt, as not every data set is going to come neatly packaged as a shapefile or feature layer with all the data you need stored within. I don't know how many times I taught students to download data as a table from Nomis, load it into a GIS and then join that table data to the appropriate boundary data so they could produce choropleth maps to do some visual analysis, but it was a lot! Once students had gotten the hang of joins using census data they'd often ask why this data doesn't exist as a prepackaged feature layer with all the data they wanted within it. Well good news, now a lot off it is and it's accessible through the Living Atlas! Don't get me wrong I fully understand the importance of teaching students how to perform joins but once you have this understanding if you can access data that already contains all the information you need then you should be taking advantage of it to save you time. So in this exercise I am going to show you how to load English and Welsh Census Data from the 2021 Census into the ArcGIS Map Viewer from the Living Atlas and produce some choropleth maps to use to perform visual analysis without having to perform a single join.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
I used this publicly available data for making interactive map visualization of NYC. Zipcode geodata is useful for building interactive maps with each zip code area representing a separate area on the map.
NYC zipcode geodata in geojson format
The rights belong to the original authors.
Facebook
TwitterFolium makes it easy to visualize data thatâs been manipulated in Python on an interactive leaflet map. It enables both the binding of data to a map for choropleth visualizations as well as passing rich vector/raster/HTML visualizations as markers on the map. These files can be used to mark the state boundaries on the map of INDIA using folium library and the CSV also contains the state data and how to use it in our notebooks. I have used it in one of my kernels which can be viewed.
The library has a number of built-in tilesets from OpenStreetMap, Mapbox, and Stamen, and supports custom tilesets with Mapbox or Cloudmade API keys. folium supports both Image, Video, GeoJSON, and TopoJSON overlays. Due to extensible functionalities I find folium the best map plotting library in python. Do give it a try and use it in your kernels.
Facebook
TwitterCrimeMapTutorial is a step-by-step tutorial for learning crime mapping using ArcView GIS or MapInfo Professional GIS. It was designed to give users a thorough introduction to most of the knowledge and skills needed to produce daily maps and spatial data queries that uniformed officers and detectives find valuable for crime prevention and enforcement. The tutorials can be used either for self-learning or in a laboratory setting. The geographic information system (GIS) and police data were supplied by the Rochester, New York, Police Department. For each mapping software package, there are three PDF tutorial workbooks and one WinZip archive containing sample data and maps. Workbook 1 was designed for GIS users who want to learn how to use a crime-mapping GIS and how to generate maps and data queries. Workbook 2 was created to assist data preparers in processing police data for use in a GIS. This includes address-matching of police incidents to place them on pin maps and aggregating crime counts by areas (like car beats) to produce area or choropleth maps. Workbook 3 was designed for map makers who want to learn how to construct useful crime maps, given police data that have already been address-matched and preprocessed by data preparers. It is estimated that the three tutorials take approximately six hours to complete in total, including exercises.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
I wanted to Collect all the Covid-19 cases all over the world and make analysis on it
Data is simple but can bring a lot of insights
Data is classified into 4 columns (Country/Region', 'Confirmed', 'Country Abbr 2', 'Country Abbr 3)
This 2 columns are useful to use in visualization of Choropleth with plotly to make the world map Data is collected from many resources to be accurate
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Public protest represents an important sanction on rulers and institutions. Protest is a quotidian phenomenon in South Africa; perhaps the defining element of post-apartheid political life. Geographic representations of protest abound â typically dot distribution maps â but these merely confirm that more protests occur where there are more people. Visualisations of protest per capita and protestors per capita (or âgeneral propensityâ), which are best rendered as choropleth maps, are well-placed to overcome this limitation. The South African Police Services' database of protest is the largest publicly-available single-country protest event database. Having used machine learning to classify 89,000 protest events, I locate each within one of the country's 234 municipalities, and depict these events using counts, count per capita, and the general propensity. This reveals a proportionally high number of rural protests, and that municipalities hosting major industries, along with provincial seats of government, present the highest propensity for protest.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This project provides a comprehensive dataset on intentional homicides in Mexico from 1990 to 2023, disaggregated by sex and state. It includes both raw data and tools for visualization, making it a valuable resource for researchers, policymakers, and analysts studying violence trends, gender disparities, and regional patterns.ContentsHomicide Data: Total number of male and female victims per state and year.Population Data: Corresponding male and female population estimates for each state and year.Homicide Rates: Per 100,000 inhabitants, calculated for both sexes.Choropleth Map Script: A Python script that generates homicide rate maps using a GeoJSON file.GeoJSON File: A spatial dataset defining Mexico's state boundaries, used for mapping.Sample Figure: A pre-generated homicide rate map for 2023 as an example.Requirements File: A requirements.txt file listing necessary dependencies for running the script.SourcesHomicide Data: INEGI - Vital Statistics MicrodataPopulation Data: Mexican Population Projections 2020-2070This dataset enables spatial analysis and data visualization, helping users explore homicide trends across Mexico in a structured and reproducible way.
Facebook
TwitterMedical Service Study Areas (MSSAs)As defined by California's Office of Statewide Health Planning and Development (OSHPD) in 2013, "MSSAs are sub-city and sub-county geographical units used to organize and display population, demographic and physician data" (Source). Each census tract in CA is assigned to a given MSSA. The most recent MSSA dataset (2014) was used. Spatial data are available via OSHPD at the California Open Data Portal. This information may be useful in studying health equity.Age-Adjusted Incidence Rate (AAIR)Age-adjustment is a statistical method that allows comparisons of incidence rates to be made between populations with different age distributions. This is important since the incidence of most cancers increases with age. An age-adjusted cancer incidence (or death) rate is defined as the number of new cancers (or deaths) per 100,000 population that would occur in a certain period of time if that population had a 'standard' age distribution. In the California Health Maps, incidence rates are age-adjusted using the U.S. 2000 Standard Population.Cancer incidence ratesIncidence rates were calculated using case counts from the California Cancer Registry. Population data from 2010 Census and SEER 2015 census tract estimates by race/origin (controlling to Vintage 2015) were used to estimate population denominators. Yearly SEER 2015 census tract estimates by race/origin (controlling to Vintage 2015) were used to estimate population denominators for 5-year incidence rates (2013-2017)According to California Department of Public Health guidelines, cancer incidence rates cannot be reported if based on <15 cancer cases and/or a population <10,000 to ensure confidentiality and stable statistical rates.Spatial extent: CaliforniaSpatial Unit: MSSACreated: n/aUpdated: n/aSource: California Health MapsContact Email: gbacr@ucsf.eduSource Link: https://www.californiahealthmaps.org/?areatype=mssa&address=&sex=Both&site=AllSite&race=&year=05yr&overlays=none&choropleth=Obesity
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
« BIXI Montréal is a public bicycle sharing system serving Montréal, Quebec, Canada.
Launched in May 2009, it is North America's first large-scale bike sharing system and the original BIXI brand of systems.
The location of a BIXI bike station is determined by several parameters, including population density, points of interest and activities (universities, bike paths, other transportation networks, and data on travel patterns of the general public. In 2009, 5,000 bikes were deployed in Montreal through a network of pay stations located mainly in the boroughs of RosemontâLa Petite-Patrie, the Plateau-Mont-Royal and Ville-Marie, spilling over into parts of Outremont and the South West. As of 2011, the system has spread to Hochelaga-Maisonneuve, VillerayâSaint-MichelâParc-Extension, Ahuntsic, CĂŽte-des-NeigesâNotre-Dame-de-GrĂące, Westmount and Verdun. » [1]
1. BIXI - Movements history
Datasets containing the details of the travels made via the BIXI Montréal self-service bike network. Each year is a .zip file (ex. BixiMontrealRentals2014.zip) containing several .CSV files (ex. OD_2014-04.csv) for each months.
Version 2 : All .csv are merged per year (OD-year.csv).
The data is extracted from the BIXI Montréal station and bike management system. Trips of less than 1 minute or more than 2 hours are excluded. The station identifiers used correspond to those of the station status data set
Data Sructure
2. BIXI - The condition of stations
This dataset presents the list of stations in the BIXI Montréal self-service bicycle network, including the geographic position, the number of bicycles available and the number of terminals available. It is a .json file, which corresponds to a dictionary.
The data is produced by the BIXI Montréal station management system with a refresh rate of 5 minutes. Station locations are generally stable over time, but may be subject to change during the season, particularly when the City of Montreal is carrying out work or as part of special events. Temporary storage stations are not included in station status. The data is automatically generated on the BIXI servers, so the date of last update of this dataset does not represent the actual date of update. Information about bikes and bad terminals is available in the JSON format file.
Data Sructure
3. Geographical boundaries of Montreal (Borough and related city)
Creative Commons Attribution 4.0 International
For more details :
http://donnees.ville.montreal.qc.ca/dataset/bixi-historique-des-deplacements
http://donnees.ville.montreal.qc.ca/dataset/bixi-etat-des-stations
http://donnees.ville.montreal.qc.ca/dataset/polygones-arrondissements
Can you find pattern in the behavior of Bixi users?
Are there any inefficient stations ?
What insights can we use from this data for decision making ?
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterLearn Geographic Mapping with Altair, Vega-Lite and Vega using Curated Datasets
Complete geographic and geophysical data collection for mapping and visualization. This consolidation includes 18 complementary datasets used by 31+ Vega, Vega-Lite, and Altair examples đ. Perfect for learning geographic visualization techniques including projections, choropleths, point maps, vector fields, and interactive displays.
Source data lives on GitHub and can also be accessed via CDN. The vega-datasets project serves as a common repository for example datasets used across these visualization libraries and related projects.
airports.csv), lines (like londonTubeLines.json), and polygons (like us-10m.json).windvectors.csv, annual-precip.json).This pack includes 18 datasets covering base maps, reference points, statistical data for choropleths, and geophysical data.
| Dataset | File | Size | Format | License | Description | Key Fields / Join Info |
|---|---|---|---|---|---|---|
| US Map (1:10m) | us-10m.json | 627 KB | TopoJSON | CC-BY-4.0 | US state and county boundaries. Contains states and counties objects. Ideal for choropleths. | id (FIPS code) property on geometries |
| World Map (1:110m) | world-110m.json | 117 KB | TopoJSON | CC-BY-4.0 | World country boundaries. Contains countries object. Suitable for world-scale viz. | id property on geometries |
| London Boroughs | londonBoroughs.json | 14 KB | TopoJSON | CC-BY-4.0 | London borough boundaries. | properties.BOROUGHN (name) |
| London Centroids | londonCentroids.json | 2 KB | GeoJSON | CC-BY-4.0 | Center points for London boroughs. | properties.id, properties.name |
| London Tube Lines | londonTubeLines.json | 78 KB | GeoJSON | CC-BY-4.0 | London Underground network lines. | properties.name, properties.color |
| Dataset | File | Size | Format | License | Description | Key Fields / Join Info |
|---|---|---|---|---|---|---|
| US Airports | airports.csv | 205 KB | CSV | Public Domain | US airports with codes and coordinates. | iata, state, `l... |