Facebook
TwitterLearn Geographic Mapping with Altair, Vega-Lite and Vega using Curated Datasets
Complete geographic and geophysical data collection for mapping and visualization. This consolidation includes 18 complementary datasets used by 31+ Vega, Vega-Lite, and Altair examples 📊. Perfect for learning geographic visualization techniques including projections, choropleths, point maps, vector fields, and interactive displays.
Source data lives on GitHub and can also be accessed via CDN. The vega-datasets project serves as a common repository for example datasets used across these visualization libraries and related projects.
airports.csv), lines (like londonTubeLines.json), and polygons (like us-10m.json).windvectors.csv, annual-precip.json).This pack includes 18 datasets covering base maps, reference points, statistical data for choropleths, and geophysical data.
| Dataset | File | Size | Format | License | Description | Key Fields / Join Info |
|---|---|---|---|---|---|---|
| US Map (1:10m) | us-10m.json | 627 KB | TopoJSON | CC-BY-4.0 | US state and county boundaries. Contains states and counties objects. Ideal for choropleths. | id (FIPS code) property on geometries |
| World Map (1:110m) | world-110m.json | 117 KB | TopoJSON | CC-BY-4.0 | World country boundaries. Contains countries object. Suitable for world-scale viz. | id property on geometries |
| London Boroughs | londonBoroughs.json | 14 KB | TopoJSON | CC-BY-4.0 | London borough boundaries. | properties.BOROUGHN (name) |
| London Centroids | londonCentroids.json | 2 KB | GeoJSON | CC-BY-4.0 | Center points for London boroughs. | properties.id, properties.name |
| London Tube Lines | londonTubeLines.json | 78 KB | GeoJSON | CC-BY-4.0 | London Underground network lines. | properties.name, properties.color |
| Dataset | File | Size | Format | License | Description | Key Fields / Join Info |
|---|---|---|---|---|---|---|
| US Airports | airports.csv | 205 KB | CSV | Public Domain | US airports with codes and coordinates. | iata, state, `l... |
Facebook
TwitterDataset for the textbook Computational Methods and GIS Applications in Social Science (3rd Edition), 2023 Fahui Wang, Lingbo Liu Main Book Citation: Wang, F., & Liu, L. (2023). Computational Methods and GIS Applications in Social Science (3rd ed.). CRC Press. https://doi.org/10.1201/9781003292302 KNIME Lab Manual Citation: Liu, L., & Wang, F. (2023). Computational Methods and GIS Applications in Social Science - Lab Manual. CRC Press. https://doi.org/10.1201/9781003304357 KNIME Hub Dataset and Workflow for Computational Methods and GIS Applications in Social Science-Lab Manual Update Log If Python package not found in Package Management, use ArcGIS Pro's Python Command Prompt to install them, e.g., conda install -c conda-forge python-igraph leidenalg NetworkCommDetPro in CMGIS-V3-Tools was updated on July 10,2024 Add spatial adjacency table into Florida on June 29,2024 The dataset and tool for ABM Crime Simulation were updated on August 3, 2023, The toolkits in CMGIS-V3-Tools was updated on August 3rd,2023. Report Issues on GitHub https://github.com/UrbanGISer/Computational-Methods-and-GIS-Applications-in-Social-Science Following the website of Fahui Wang : http://faculty.lsu.edu/fahui Contents Chapter 1. Getting Started with ArcGIS: Data Management and Basic Spatial Analysis Tools Case Study 1: Mapping and Analyzing Population Density Pattern in Baton Rouge, Louisiana Chapter 2. Measuring Distance and Travel Time and Analyzing Distance Decay Behavior Case Study 2A: Estimating Drive Time and Transit Time in Baton Rouge, Louisiana Case Study 2B: Analyzing Distance Decay Behavior for Hospitalization in Florida Chapter 3. Spatial Smoothing and Spatial Interpolation Case Study 3A: Mapping Place Names in Guangxi, China Case Study 3B: Area-Based Interpolations of Population in Baton Rouge, Louisiana Case Study 3C: Detecting Spatiotemporal Crime Hotspots in Baton Rouge, Louisiana Chapter 4. Delineating Functional Regions and Applications in Health Geography Case Study 4A: Defining Service Areas of Acute Hospitals in Baton Rouge, Louisiana Case Study 4B: Automated Delineation of Hospital Service Areas in Florida Chapter 5. GIS-Based Measures of Spatial Accessibility and Application in Examining Healthcare Disparity Case Study 5: Measuring Accessibility of Primary Care Physicians in Baton Rouge Chapter 6. Function Fittings by Regressions and Application in Analyzing Urban Density Patterns Case Study 6: Analyzing Population Density Patterns in Chicago Urban Area >Chapter 7. Principal Components, Factor and Cluster Analyses and Application in Social Area Analysis Case Study 7: Social Area Analysis in Beijing Chapter 8. Spatial Statistics and Applications in Cultural and Crime Geography Case Study 8A: Spatial Distribution and Clusters of Place Names in Yunnan, China Case Study 8B: Detecting Colocation Between Crime Incidents and Facilities Case Study 8C: Spatial Cluster and Regression Analyses of Homicide Patterns in Chicago Chapter 9. Regionalization Methods and Application in Analysis of Cancer Data Case Study 9: Constructing Geographical Areas for Mapping Cancer Rates in Louisiana Chapter 10. System of Linear Equations and Application of Garin-Lowry in Simulating Urban Population and Employment Patterns Case Study 10: Simulating Population and Service Employment Distributions in a Hypothetical City Chapter 11. Linear and Quadratic Programming and Applications in Examining Wasteful Commuting and Allocating Healthcare Providers Case Study 11A: Measuring Wasteful Commuting in Columbus, Ohio Case Study 11B: Location-Allocation Analysis of Hospitals in Rural China Chapter 12. Monte Carlo Method and Applications in Urban Population and Traffic Simulations Case Study 12A. Examining Zonal Effect on Urban Population Density Functions in Chicago by Monte Carlo Simulation Case Study 12B: Monte Carlo-Based Traffic Simulation in Baton Rouge, Louisiana Chapter 13. Agent-Based Model and Application in Crime Simulation Case Study 13: Agent-Based Crime Simulation in Baton Rouge, Louisiana Chapter 14. Spatiotemporal Big Data Analytics and Application in Urban Studies Case Study 14A: Exploring Taxi Trajectory in ArcGIS Case Study 14B: Identifying High Traffic Corridors and Destinations in Shanghai Dataset File Structure 1 BatonRouge Census.gdb BR.gdb 2A BatonRouge BR_Road.gdb Hosp_Address.csv TransitNetworkTemplate.xml BR_GTFS Google API Pro.tbx 2B Florida FL_HSA.gdb R_ArcGIS_Tools.tbx (RegressionR) 3A China_GX GX.gdb 3B BatonRouge BR.gdb 3C BatonRouge BRcrime R_ArcGIS_Tools.tbx (STKDE) 4A BatonRouge BRRoad.gdb 4B Florida FL_HSA.gdb HSA Delineation Pro.tbx Huff Model Pro.tbx FLplgnAdjAppend.csv 5 BRMSA BRMSA.gdb Accessibility Pro.tbx 6 Chicago ChiUrArea.gdb R_ArcGIS_Tools.tbx (RegressionR) 7 Beijing BJSA.gdb bjattr.csv R_ArcGIS_Tools.tbx (PCAandFA, BasicClustering) 8A Yunnan YN.gdb R_ArcGIS_Tools.tbx (SaTScanR) 8B Jiangsu JS.gdb 8C Chicago ChiCity.gdb cityattr.csv ...
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Welcome to the Google Places Comprehensive Business Dataset! This dataset has been meticulously scraped from Google Maps and presents extensive information about businesses across several countries. Each entry in the dataset provides detailed insights into business operations, location specifics, customer interactions, and much more, making it an invaluable resource for data analysts and scientists looking to explore business trends, geographic data analysis, or consumer behaviour patterns.
This dataset is ideal for a variety of analytical projects, including: - Market Analysis: Understand business distribution and popularity across different regions. - Customer Sentiment Analysis: Explore relationships between customer ratings and business characteristics. - Temporal Trend Analysis: Analyze patterns of business activity throughout the week. - Geospatial Analysis: Integrate with mapping software to visualise business distribution or cluster businesses based on location.
The dataset contains 46 columns, providing a thorough profile for each listed business. Key columns include:
business_id: A unique Google Places identifier for each business, ensuring distinct entries.phone_number: The contact number associated with the business. It provides a direct means of communication.name: The official name of the business as listed on Google Maps.full_address: The complete postal address of the business, including locality and geographic details.latitude: The geographic latitude coordinate of the business location, useful for mapping and spatial analysis.longitude: The geographic longitude coordinate of the business location.review_count: The total number of reviews the business has received on Google Maps.rating: The average user rating out of 5 for the business, reflecting customer satisfaction.timezone: The world timezone the business is located in, important for temporal analysis.website: The official website URL of the business, providing further information and contact options.category: The category or type of service the business provides, such as restaurant, museum, etc.claim_status: Indicates whether the business listing has been claimed by the owner on Google Maps.plus_code: A sho...
Facebook
TwitterThe files linked to this reference are the geospatial data created as part of the completion of the baseline vegetation inventory project for the NPS park unit. Current format is ArcGIS file geodatabase but older formats may exist as shapefiles. We converted the photointerpreted data into a format usable in a geographic information system (GIS) by employing three fundamental processes: (1) orthorectify, (2) digitize, and (3) develop the geodatabase. All digital map automation was projected in Universal Transverse Mercator (UTM), Zone 16, using the North American Datum of 1983 (NAD83). Orthorectify: We orthorectified the interpreted overlays by using OrthoMapper, a softcopy photogrammetric software for GIS. One function of OrthoMapper is to create orthorectified imagery from scanned and unrectified imagery (Image Processing Software, Inc., 2002). The software features a method of visual orientation involving a point-and-click operation that uses existing orthorectified horizontal and vertical base maps. Of primary importance to us, OrthoMapper also has the capability to orthorectify the photointerpreted overlays of each photograph based on the reference information provided. Digitize: To produce a polygon vector layer for use in ArcGIS (Environmental Systems Research Institute [ESRI], Redlands, California), we converted each raster-based image mosaic of orthorectified overlays containing the photointerpreted data into a grid format by using ArcGIS. In ArcGIS, we used the ArcScan extension to trace the raster data and produce ESRI shapefiles. We digitally assigned map-attribute codes (both map-class codes and physiognomic modifier codes) to the polygons and checked the digital data against the photointerpreted overlays for line and attribute consistency. Ultimately, we merged the individual layers into a seamless layer. Geodatabase: At this stage, the map layer has only map-attribute codes assigned to each polygon. To assign meaningful information to each polygon (e.g., map-class names, physiognomic definitions, links to NVCS types), we produced a feature-class table, along with other supportive tables and subsequently related them together via an ArcGIS Geodatabase. This geodatabase also links the map to other feature-class layers produced from this project, including vegetation sample plots, accuracy assessment (AA) sites, aerial photo locations, and project boundary extent. A geodatabase provides access to a variety of interlocking data sets, is expandable, and equips resource managers and researchers with a powerful GIS tool.
Facebook
TwitterThe files linked to this reference are the geospatial data created as part of the completion of the baseline vegetation inventory project for the NPS park unit. Current format is ArcGIS file geodatabase but older formats may exist as shapefiles. Vegetation map development for KNRI has somewhat different protocols than for other Parks. Normally photointerpretation is preceded by extensive field work which includes plot selection and vegetation sampling using detailed descriptions which are subsequently analyzed using ordination and other statistical techniques. The data are then summarized and association descriptions are assigned to each plot or, if the association is previously unrecognized, then a new association name is assigned. Subsequently, the plots locations are compared to its photographic signature and a photointerpretive key is developed. Given the very small size of KNRI and the extensive historical impact and alteration of the vegetation a simplified technique was used. NatureServe developed a list of potential vegetation types prior to any field work. This list was referenced during the field visit and modified after comparison of site characteristics and vegetation descriptions. Aerial photographs were viewed prior to the field visit and areas of like signature were differentiated. All vegetation and land-use information was then transferred to a GIS database using the latest grayscale USGS digital orthophoto quarter-quads as the base map and using a combination of on-screen digitizing and scanning techniques. Overall thematic map accuracy for the Park is considered 100% as all interpreted polygons received a filed visit for verification.
Facebook
TwitterThe geographic data are built from the Technical Information Management System (TIMS). TIMS consists of two separate databases: an attribute database and a spatial database. The attribute information for offshore activities is stored in the TIMS database. The spatial database is a combination of the ARC/INFO and FINDER databases and contains all the coordinates and topology information for geographic features. The attribute and spatial databases are interconnected through the use of common data elements in both databases, thereby creating the spatial datasets. The data in the mapping files are made up of straight-line segments. If an arc existed in the original data, it has been replaced with a series of straight lines that approximate the arc. The Gulf of America OCS Region stores all its mapping data in longitude and latitude format. All coordinates are in NAD 27. Data can be obtained in three types of digital formats: INTERACTIVE MAP: The ArcGIS web maps are an interactive display of geographic information, containing a basemap, a set of data layers (many of which include interactive pop-up windows with information about the data), an extent, navigation tools to pan and zoom, and additional tools for geospatial analysis. SHP: A Shapefile is a digital vector (non-topological) storage format for storing geometric location and associated attribute information. Shapefiles can support point, line, and area features with attributes held in a dBASE format file. GEODATABASE: An ArcGIS geodatabase is a collection of geographic datasets of various types held in a common file system folder, a Microsoft Access database, or a multiuser relational DBMS (such as Oracle, Microsoft SQL Server, PostgreSQL, Informix, or IBM DB2). The geodatabase is the native data structure for ArcGIS and is the primary data format used for editing and data management.
Facebook
TwitterHigh resolution land cover data set for New York City. This is the 3ft version of the high-resolution land cover dataset for New York City. Seven land cover classes were mapped: (1) tree canopy, (2) grass/shrub, (3) bare earth, (4) water, (5) buildings, (6) roads, and (7) other paved surfaces. The minimum mapping unit for the delineation of features was set at 3 square feet. The primary sources used to derive this land cover layer were the 2010 LiDAR and the 2008 4-band orthoimagery. Ancillary data sources included GIS data (city boundary, building footprints, water, parking lots, roads, railroads, railroad structures, ballfields) provided by New York City (all ancillary datasets except railroads); UVM Spatial Analysis Laboratory manually created railroad polygons from manual interpretation of 2008 4-band orthoimagery. The tree canopy class was considered current as of 2010; the remaining land-cover classes were considered current as of 2008. Object-Based Image Analysis (OBIA) techniques were employed to extract land cover information using the best available remotely sensed and vector GIS datasets. OBIA systems work by grouping pixels into meaningful objects based on their spectral and spatial properties, while taking into account boundaries imposed by existing vector datasets. Within the OBIA environment a rule-based expert system was designed to effectively mimic the process of manual image analysis by incorporating the elements of image interpretation (color/tone, texture, pattern, location, size, and shape) into the classification process. A series of morphological procedures were employed to insure that the end product is both accurate and cartographically pleasing. More than 35,000 corrections were made to the classification. Overall accuracy was 96%. This dataset was developed as part of the Urban Tree Canopy (UTC) Assessment for New York City. As such, it represents a 'top down' mapping perspective in which tree canopy over hanging other features is assigned to the tree canopy class. At the time of its creation this dataset represents the most detailed and accurate land cover dataset for the area. This project was funded by National Urban and Community Forestry Advisory Council (NUCFAC) and the National Science Fundation (NSF), although it is not specifically endorsed by either agency. The methods used were developed by the University of Vermont Spatial Analysis Laboratory, in collaboration with the New York City Urban Field Station, with funding from the USDA Forest Service.
Facebook
Twitter
According to our latest research, the global cartography software market size reached USD 2.15 billion in 2024, driven by increasing demand for advanced mapping solutions across diverse sectors. The market is expected to expand at a CAGR of 9.2% between 2025 and 2033, with the market size forecasted to reach USD 4.79 billion by 2033. This robust growth is primarily attributed to rapid urbanization, the proliferation of geospatial data, and growing integration of GIS technologies in government and commercial applications.
The primary growth factor propelling the cartography software market is the accelerating adoption of geospatial intelligence and geographic information systems (GIS) across various sectors. Governments, urban planners, and commercial enterprises are increasingly leveraging cartography software for enhanced decision-making, spatial data visualization, and resource management. The surge in smart city initiatives and infrastructure development projects worldwide is further boosting demand for sophisticated mapping tools. These tools enable stakeholders to visualize complex datasets, analyze spatial relationships, and optimize planning processes, thereby improving efficiency and reducing operational costs.
Another significant driver is the technological evolution within the cartography software landscape. The integration of artificial intelligence, machine learning, and cloud computing has transformed traditional mapping solutions into dynamic, interactive, and real-time platforms. These advancements have broadened the application scope of cartography software, making it indispensable in fields such as disaster management, environmental monitoring, and business intelligence. The ability to process large volumes of geospatial data quickly and accurately has enhanced the value proposition of cartography solutions, attracting investments from both public and private sectors.
Furthermore, the growing need for disaster risk management and environmental monitoring is catalyzing the adoption of cartography software. Governments and humanitarian organizations are increasingly utilizing these tools to map vulnerable areas, monitor climate change impacts, and plan emergency response strategies. The software’s capability to provide real-time situational awareness and predictive analytics is critical in mitigating risks and enhancing preparedness. As climate-related challenges intensify, the reliance on advanced cartographic solutions is expected to deepen, further fueling market growth.
From a regional perspective, North America currently dominates the cartography software market, supported by substantial investments in geospatial infrastructure and a high concentration of technology-driven enterprises. However, Asia Pacific is poised for the fastest growth, driven by rapid urbanization, expanding infrastructure projects, and increasing government focus on smart city development. Europe also holds a significant share, benefiting from robust regulatory frameworks and widespread adoption of GIS technologies across various sectors. The Middle East & Africa and Latin America are emerging as promising markets, with growing awareness of the benefits of digital mapping in resource management and urban planning.
The cartography software market by component is bifurcated into software and services. The software segment captures the largest market share, accounting for over 65% in 2024, owing to the widespread adoption of advanced mapping solutions across government, commercial, and utility sectors. Modern cartography software platforms offer comprehensive features such as data visualization, spatial analysis, and real-time collaboration, making them indispensable tools for urban planners, environmental agencies, and businesses. The proliferation of open-source platforms and the availability of customizable mapping solutions have further accelerated the adoption of cartography software globally.
<
Facebook
TwitterDataset Summary:
The Monterey, San Benito and San Luis Obispo County Impervious Surfaces map is a 5-class fine-scale polygon vector representation of artificial impervious surfaces in the region. There are 1,253,729 features in the dataset. Non-impervious areas are not mapped and are not covered by polygons. The impervious map represents the state of the landscape in summer, 2022. This technical mapping work for this product was conducted by the impervious mapping team at the University of Vermont Spatial Analysis Lab and EarthDefine. Table 1 lists download locations for the dataset.Table 1. Monterey, San Benito, and San Luis Obispo Counties counties impervious surfaces data
Description
Link
File Geodatabase Feature Class
https://vegmap.press/central_coast_impervious_fgdb
Vector Tile Service
https://vegmap.press/central_coast_impervious_vt
Detailed Dataset Description: The impervious map was created using a combination of AI techniques and “expert systems” rulesets developed in Trimble eCognition. Initial impervious polygons for populated areas were produced using AI techniques by EarthDefine. These were refined and classified into impervious types by the UVM Spatial Analysis Lab using eCognition. Impervious surfaces in less populated areas were produced entirely by the UVM Spatial Analysis Lab in eCognition. ECognition rulesets combine automated image segmentation with-object based image classification techniques. In contrast with machine learning approaches, expert systems rulesets are developed heuristically based on the knowledge of experienced image analysts. Key data sets used in the expert systems rulesets for impervious mapping included: high resolution (0.6 meter) 4-band NAIP imagery (2022), the unified lidar point cloud, which is an amalgamation of the most recent/best available lidar data for the three-county area, and lidar derived rasters from the unified point cloud such as the canopy height model (CHM) and normalized digital surface model (nDSM).
After production in AI and eCognition, the preliminary impervious map product was manually edited by a team of UVM’s photo interpreters. Manual editing corrected errors where the automated methods produced incorrect results.
The impervious map has 5 classes representing different types of impervious features, which are described below:
·
Building – Structures including homes, commercial
buildings, outbuildings, and other human-made structures such as water tanks
and silage silos. Structures fully
occluded by vegetation will not be mapped.
·
Paved Road – Roads that are paved and wide enough for a
vehicle.
· Dirt/Gravel Road – Dirt or gravel roads wide enough for a vehicle. Non-ephemeral fire roads, ranch roads and long driveways. Polygons representing narrow unpaved (single track) trails are not included in this data product.
· Other Dirt/Gravel Surface – Dirt or gravel surfaces that are highly compacted and used by humans and equipment, such as parking lots, road pull-offs, some dirt or gravel paths, and highly compacted areas around commercial activities. This class DOES NOT include natural turf playing fields, very lightly used dirt roads, livestock areas, naturally occurring bare soil or rock, or bare areas around ponds.
· Other Paved Surface – Includes parking lots, sidewalks, paved walking paths, swimming pools, tennis courts.
Miscellaneous quality control and processing notes:·
Zoom level used during manual quality control
was no finer than 1 to 500.·
Vector data was created with no overlapping
polygons.
Data Limitations: This is not a planimetric data product and was created using semi-automated techniques. It provides a reasonable and useful depiction of impervious surfaces for planners and managers but does not have the accuracy or precision to support engineering applications. No formal accuracy assessment was conducted for this dataset. Users should apply caution when using the data for applications requiring high positional or classification accuracy. Appropriate uses of the data product include:· As an input to storm water models· For planners to assess % imperviousness in a parcel/watershed· To help identify areas of human infrastructure for fuels and fire management· As an input to fuel models that are used in fire behavior and fire spread models· For cartography and mapping· Generally for use at scales 1:1,000 and smaller
Inappropriate uses of this product include:· Measuring exact square footage of structures or impervious features for building projects· Using the impervious polygons as geographically precise information for transportation and public works engineering projects· Determining ownership or maintenance responsibility of a particular feature, such as a paved or dirt road· Identifying publicly accessible areas for recreation or other uses· Confirming the suitability of a surface for any use including driving, hiking, bicycling, etc.
Common errors in this dataset are inter-class confusion and errors of commission to impervious. These are discussed in more detail below:
·
Inter-Class
Confusion: The accuracy of the map
for impervious versus pervious is very high (although no quantitative assessment
of accuracy was funded for this product).
However, the accuracy for individual impervious classes will be much
lower. For example, confusion exists
between the ‘Other Paved’ and ‘Other Dirt/Gravel Surface,’ classes, even though
these are both mapped correctly as impervious surfaces.
· Errors of Commission: The most widespread error in this map are areas mapped as impervious that are actual pervious surfaces of dried out herbaceous land cover. Some dried-out herbaceous cover may be mistakenly classified as impervious due to spectral similarity. Manual editing minimized but did not completely eliminate these errors.
Facebook
TwitterThis project evaluated ecosystem damage and recovery by developing a time series of habitat maps for the Rookery Bay National Estuarine Research Reserve. Habitat maps were created based on WorldView-2 and Landsat-8 satellite imagery from 2010-2018 using an automated technique and validated with a field campaign. Landsat images were mapped using the Support Vector Machine machine learning method...
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Spatial management tools, such as marine spatial planning and marine protected areas, are playing an increasingly important role in attempts to improve marine management and accommodate conflicting needs. Robust data are needed to inform decisions among different planning options, and early inclusion of stakeholder involvement is widely regarded as vital for success. One of the biggest stakeholder groups, and the most likely to be adversely impacted by spatial restrictions, is the fishing community. In order to take their priorities into account, planners need to understand spatial variation in their perceived value of the sea. Here a readily accessible, novel method for quantitatively mapping fishers’ spatial access priorities is presented. Spatial access priority mapping, or SAPM, uses only basic functions of standard spreadsheet and GIS software. Unlike the use of remote-sensing data, SAPM actively engages fishers in participatory mapping, documenting rather than inferring their priorities. By so doing, SAPM also facilitates the gathering of other useful data, such as local ecological knowledge. The method was tested and validated in Northern Ireland, where over 100 fishers participated in a semi-structured questionnaire and mapping exercise. The response rate was excellent, 97%, demonstrating fishers’ willingness to be involved. The resultant maps are easily accessible and instantly informative, providing a very clear visual indication of which areas are most important for the fishers. The maps also provide quantitative data, which can be used to analyse the relative impact of different management options on the fishing industry and can be incorporated into planning software, such as MARXAN, to ensure that conservation goals can be met at minimum negative impact to the industry. This research shows how spatial access priority mapping can facilitate the early engagement of fishers and the ready incorporation of their priorities into the decision-making process in a transparent, quantitative way.
Facebook
TwitterOverview
The Office of the Geographer and Global Issues at the U.S. Department of State produces the Large Scale International Boundaries (LSIB) dataset. The current edition is version 11.4 (published 24 February 2025). The 11.4 release contains updated boundary lines and data refinements designed to extend the functionality of the dataset. These data and generalized derivatives are the only international boundary lines approved for U.S. Government use. The contents of this dataset reflect U.S. Government policy on international boundary alignment, political recognition, and dispute status. They do not necessarily reflect de facto limits of control.
National Geospatial Data Asset
This dataset is a National Geospatial Data Asset (NGDAID 194) managed by the Department of State. It is a part of the International Boundaries Theme created by the Federal Geographic Data Committee.
Dataset Source Details
Sources for these data include treaties, relevant maps, and data from boundary commissions, as well as national mapping agencies. Where available and applicable, the dataset incorporates information from courts, tribunals, and international arbitrations. The research and recovery process includes analysis of satellite imagery and elevation data. Due to the limitations of source materials and processing techniques, most lines are within 100 meters of their true position on the ground.
Cartographic Visualization
The LSIB is a geospatial dataset that, when used for cartographic purposes, requires additional styling. The LSIB download package contains example style files for commonly used software applications. The attribute table also contains embedded information to guide the cartographic representation. Additional discussion of these considerations can be found in the Use of Core Attributes in Cartographic Visualization section below.
Additional cartographic information pertaining to the depiction and description of international boundaries or areas of special sovereignty can be found in Guidance Bulletins published by the Office of the Geographer and Global Issues: https://data.geodata.state.gov/guidance/index.html
Contact
Direct inquiries to internationalboundaries@state.gov. Direct download: https://data.geodata.state.gov/LSIB.zip
Attribute Structure
The dataset uses the following attributes divided into two categories: ATTRIBUTE NAME | ATTRIBUTE STATUS CC1 | Core CC1_GENC3 | Extension CC1_WPID | Extension COUNTRY1 | Core CC2 | Core CC2_GENC3 | Extension CC2_WPID | Extension COUNTRY2 | Core RANK | Core LABEL | Core STATUS | Core NOTES | Core LSIB_ID | Extension ANTECIDS | Extension PREVIDS | Extension PARENTID | Extension PARENTSEG | Extension
These attributes have external data sources that update separately from the LSIB: ATTRIBUTE NAME | ATTRIBUTE STATUS CC1 | GENC CC1_GENC3 | GENC CC1_WPID | World Polygons COUNTRY1 | DoS Lists CC2 | GENC CC2_GENC3 | GENC CC2_WPID | World Polygons COUNTRY2 | DoS Lists LSIB_ID | BASE ANTECIDS | BASE PREVIDS | BASE PARENTID | BASE PARENTSEG | BASE
The core attributes listed above describe the boundary lines contained within the LSIB dataset. Removal of core attributes from the dataset will change the meaning of the lines. An attribute status of “Extension” represents a field containing data interoperability information. Other attributes not listed above include “FID”, “Shape_length” and “Shape.” These are components of the shapefile format and do not form an intrinsic part of the LSIB.
Core Attributes
The eight core attributes listed above contain unique information which, when combined with the line geometry, comprise the LSIB dataset. These Core Attributes are further divided into Country Code and Name Fields and Descriptive Fields.
County Code and Country Name Fields
“CC1” and “CC2” fields are machine readable fields that contain political entity codes. These are two-character codes derived from the Geopolitical Entities, Names, and Codes Standard (GENC), Edition 3 Update 18. “CC1_GENC3” and “CC2_GENC3” fields contain the corresponding three-character GENC codes and are extension attributes discussed below. The codes “Q2” or “QX2” denote a line in the LSIB representing a boundary associated with areas not contained within the GENC standard.
The “COUNTRY1” and “COUNTRY2” fields contain the names of corresponding political entities. These fields contain names approved by the U.S. Board on Geographic Names (BGN) as incorporated in the ‘"Independent States in the World" and "Dependencies and Areas of Special Sovereignty" lists maintained by the Department of State. To ensure maximum compatibility, names are presented without diacritics and certain names are rendered using common cartographic abbreviations. Names for lines associated with the code "Q2" are descriptive and not necessarily BGN-approved. Names rendered in all CAPITAL LETTERS denote independent states. Names rendered in normal text represent dependencies, areas of special sovereignty, or are otherwise presented for the convenience of the user.
Descriptive Fields
The following text fields are a part of the core attributes of the LSIB dataset and do not update from external sources. They provide additional information about each of the lines and are as follows: ATTRIBUTE NAME | CONTAINS NULLS RANK | No STATUS | No LABEL | Yes NOTES | Yes
Neither the "RANK" nor "STATUS" fields contain null values; the "LABEL" and "NOTES" fields do. The "RANK" field is a numeric expression of the "STATUS" field. Combined with the line geometry, these fields encode the views of the United States Government on the political status of the boundary line.
ATTRIBUTE NAME | | VALUE | RANK | 1 | 2 | 3 STATUS | International Boundary | Other Line of International Separation | Special Line
A value of “1” in the “RANK” field corresponds to an "International Boundary" value in the “STATUS” field. Values of ”2” and “3” correspond to “Other Line of International Separation” and “Special Line,” respectively.
The “LABEL” field contains required text to describe the line segment on all finished cartographic products, including but not limited to print and interactive maps.
The “NOTES” field contains an explanation of special circumstances modifying the lines. This information can pertain to the origins of the boundary lines, limitations regarding the purpose of the lines, or the original source of the line.
Use of Core Attributes in Cartographic Visualization
Several of the Core Attributes provide information required for the proper cartographic representation of the LSIB dataset. The cartographic usage of the LSIB requires a visual differentiation between the three categories of boundary lines. Specifically, this differentiation must be between:
Rank 1 lines must be the most visually prominent. Rank 2 lines must be less visually prominent than Rank 1 lines. Rank 3 lines must be shown in a manner visually subordinate to Ranks 1 and 2. Where scale permits, Rank 2 and 3 lines must be labeled in accordance with the “Label” field. Data marked with a Rank 2 or 3 designation does not necessarily correspond to a disputed boundary. Please consult the style files in the download package for examples of this depiction.
The requirement to incorporate the contents of the "LABEL" field on cartographic products is scale dependent. If a label is legible at the scale of a given static product, a proper use of this dataset would encourage the application of that label. Using the contents of the "COUNTRY1" and "COUNTRY2" fields in the generation of a line segment label is not required. The "STATUS" field contains the preferred description for the three LSIB line types when they are incorporated into a map legend but is otherwise not to be used for labeling.
Use of
Facebook
TwitterU.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
eXtension Foundation, the University of New Hampshire, and Virginia Tech have developed a mapping and data exploration tool to assist Cooperative Extension staff and administrators in making strategic planning and programming decisions. The tool, called the National Extension Web-mapping Tool (or NEWT), is the key in efforts to make spatial data available within cooperative extension system. NEWT requires no GIS experience to use. NEWT provides access for CES staff and administrators to relevant spatial data at a variety of scales (national, state, county) in useful formats (maps, tables, graphs), all without the need for any experience or technical skills in Geographic Information System (GIS) software. By providing consistent access to relevant spatial data throughout the country in a format useful to CES staff and administrators, NEWT represents a significant advancement for the use of spatial technology in CES. Users of the site will be able to discover the data layers which are of most interest to them by making simple, guided choices about topics related to their work. Once the relevant data layers have been chosen, a mapping interface will allow the exploration of spatial relationships and the creation and export of maps. Extension areas to filter searches include 4-H Youth & Family, Agriculture, Business, Community, Food & Health, and Natural Resources. Users will also be able to explore data by viewing data tables and graphs. This Beta release is open for public use and feedback. Resources in this dataset:Resource Title: Website Pointer to NEWT National Extension Web-mapping Tool Beta. File Name: Web Page, url: https://www.mapasyst.org/newt/ The site leads the user through the process of selecting the data in which they would be most interested, then provides a variety of ways for the user to explore the data (maps, graphs, tables).
Facebook
TwitterBy US Open Data Portal, data.gov [source]
This dataset provides crucial geographic data related to two of the U.S. Health Information Technology for Economic and Clinical Health (HITECH) Act programs: the Health IT Regional Extension Centers (REC) Program and the Beacon Communities Program. As part of the American Recovery and Reinvestment Act (ARRA), these grants were made available to provide citizens with access to health IT infrastructure investments throughout diverse communities across the United States. This crosswalk is an essential resource for anyone looking to link regional, state, county and zip code level program financials with performance metrics for visualization or comparison. With detailed information about region counties, codes, states, FIPS codes and ZIP codes associated with local HITECH grantees, this data presentation helps shed light on a financially impactful initiative from our federal government that can drastically improve healthcare delivery in thousands of cities nationwide!
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset provides geographic data for the service areas of two of the HITECH programs, the Health IT Regional Extension Centers (REC) Program and the Beacon Communities Program. This can be used to map and visualize data related to those programs. Here is a guide on how to use this dataset:
- Get familiar with key columns: Familiarize yourself with the columns included in this dataset, including column names and descriptions for each column such as region, region_code, county_name, state_fips, county_fips and zip.
- Review data formats: If there are any discrepancies between your current format of data presented in this dataset versus what you may have currently in your system or within other sources of information - make sure to review those discrepancies prior exploring more from here onwards.
- Understand regional coverage: Refine the analysis by filtering out different grantee located based on specific regions or states - use necessary filters such as Region code or County FIPs code that will give you an easier view on which region/county grantee has been provided funding through these HHS programs as part of Hitech Act program distribution.
- Map & Visualize grantees: We can visualise geographically where are REC-Program & Beacon Communities Grants distributed across US by making a heatmap while taking desired geolocation coordinates like zip codes; query all available details under columns we need like zip codes along their respective countyp location & state value so that grants can be highlighted after it renders practical Map visuals for us giving an ease if further status / details required about entities who had taken these grants within certain area / regions!
- Creating an interactive map to visualize grant program performance by region and county.
- Using the data to create a color-coded scatterplot graphic to show active grant program sites in the US.
- Generating reports on HITECH Grantee performance over time, grouped by geographic area or region
If you use this dataset in your research, please credit the original authors. Data Source
Unknown License - Please check the dataset description for more information.
File: healthit-dashboard-areatype-crosswalk-csv-1.csv | Column name | Description | |:------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------| | region | The region of the grantee. (String) | | region_code | The code for the region of the grantee. (String) | | county_name | The name of county where the grantee is located. (String) | | state_fips | The Federal Information Processing Standard (FIPS) code for knowledge of which state it is located in. (String) | | county_fips | The Federa...
Facebook
TwitterThe establishment of a BES Multi-User Geodatabase (BES-MUG) allows for the storage, management, and distribution of geospatial data associated with the Baltimore Ecosystem Study. At present, BES data is distributed over the internet via the BES website. While having geospatial data available for download is a vast improvement over having the data housed at individual research institutions, it still suffers from some limitations. BES-MUG overcomes these limitations; improving the quality of the geospatial data available to BES researches, thereby leading to more informed decision-making. BES-MUG builds on Environmental Systems Research Institute's (ESRI) ArcGIS and ArcSDE technology. ESRI was selected because its geospatial software offers robust capabilities. ArcGIS is implemented agency-wide within the USDA and is the predominant geospatial software package used by collaborating institutions. Commercially available enterprise database packages (DB2, Oracle, SQL) provide an efficient means to store, manage, and share large datasets. However, standard database capabilities are limited with respect to geographic datasets because they lack the ability to deal with complex spatial relationships. By using ESRI's ArcSDE (Spatial Database Engine) in conjunction with database software, geospatial data can be handled much more effectively through the implementation of the Geodatabase model. Through ArcSDE and the Geodatabase model the database's capabilities are expanded, allowing for multiuser editing, intelligent feature types, and the establishment of rules and relationships. ArcSDE also allows users to connect to the database using ArcGIS software without being burdened by the intricacies of the database itself. For an example of how BES-MUG will help improve the quality and timeless of BES geospatial data consider a census block group layer that is in need of updating. Rather than the researcher downloading the dataset, editing it, and resubmitting to through ORS, access rules will allow the authorized user to edit the dataset over the network. Established rules will ensure that the attribute and topological integrity is maintained, so that key fields are not left blank and that the block group boundaries stay within tract boundaries. Metadata will automatically be updated showing who edited the dataset and when they did in the event any questions arise. Currently, a functioning prototype Multi-User Database has been developed for BES at the University of Vermont Spatial Analysis Lab, using Arc SDE and IBM's DB2 Enterprise Database as a back end architecture. This database, which is currently only accessible to those on the UVM campus network, will shortly be migrated to a Linux server where it will be accessible for database connections over the Internet. Passwords can then be handed out to all interested researchers on the project, who will be able to make a database connection through the Geographic Information Systems software interface on their desktop computer. This database will include a very large number of thematic layers. Those layers are currently divided into biophysical, socio-economic and imagery categories. Biophysical includes data on topography, soils, forest cover, habitat areas, hydrology and toxics. Socio-economics includes political and administrative boundaries, transportation and infrastructure networks, property data, census data, household survey data, parks, protected areas, land use/land cover, zoning, public health and historic land use change. Imagery includes a variety of aerial and satellite imagery. See the readme: http://96.56.36.108/geodatabase_SAL/readme.txt See the file listing: http://96.56.36.108/geodatabase_SAL/diroutput.txt
Facebook
TwitterThe files linked to this reference are the geospatial data created as part of the completion of the baseline vegetation inventory project for the NPS park unit. Current format is ArcGIS file geodatabase but older formats may exist as shapefiles. Using the National Vegetation Classification System (NVCS) developed by Natureserve, with additional classes and modifiers, overstory vegetation communities for each park were interpreted from stereo color infrared aerial photographs using manual interpretation methods. Using a minimum mapping unit of 0.5 hectares (MMU = 0.5 ha), polygons representing areas of relatively uniform vegetation were delineated and annotated on clear plastic overlays registered to the aerial photographs. Polygons were labeled according to the dominant vegetation community. Where the polygons were not uniform, second and third vegetation classes were added. Further, a number of modifier codes were employed to indicate important aspects of the polygon that could be interpreted from the photograph (for example, burn condition). The polygons on the plastic overlays were then corrected using photogrammetric procedures and converted to vector format for use in creating a geographic information system (GIS) database for each park. In addition, high resolution color orthophotographs were created from the original aerial photographs for use in the GIS. Upon completion of the GIS database (including vegetation, orthophotos and updated roads and hydrology layers), both hardcopy and softcopy maps were produced for delivery. Metadata for each database includes a description of the vegetation classification system used for each park, summary statistics and documentation of the sources, procedures and spatial accuracies of the data. At the time of this writing, an accuracy assessment of the vegetation mapping has not been performed for most of these parks.
Facebook
TwitterLand use data is critically important to the work of the Department of Water Resources (DWR) and other California agencies. Understanding the impacts of land use, crop location, acreage, and management practices on environmental attributes and resource management is an integral step in the ability of Groundwater Sustainability Agencies (GSAs) to produce Groundwater Sustainability Plans (GSPs) and implement projects to attain sustainability. Land IQ was contracted by DWR to develop a comprehensive and accurate spatial land use database for the 2021 water year (WY 2021), covering over 10.7 million acres of agriculture on a field scale and additional areas of urban extent.The primary objective of this effort was to produce a spatial land use database with an accuracy exceeding 95% using remote sensing, statistical, and temporal analysis methods. This project is an extension of the land use mapping which began in the 2014 crop year, which classified over 15 million acres of land into agricultural and urban areas. Unlike the 2014 and 2016 datasets, the annual WY datasets from and including 2018, 2019, 2020, and 2021 include multi-cropping.Land IQ integrated crop production knowledge with detailed ground truth information and multiple satellite and aerial image resources to conduct remote sensing land use analysis at the field scale. Individual fields (boundaries of homogeneous crop types representing true cropped area, rather than legal parcel boundaries) were classified using a crop category legend and a more specific crop type legend. A supervised classification process using a random forest approach was used to classify delineated fields and was carried out county by county where training samples were available. Random forest approaches are currently some of the highest performing methods for data classification and regression. To determine frequency and seasonality of multicropped fields, peak growth dates were determined for each field of annual crops. Fields were attributed with DWR crop categories, which included citrus/subtropical, deciduous fruits and nuts, field crops, grain and hay, idle, pasture, rice, truck crops, urban, vineyards, and young perennials. These categories represent aggregated groups of specific crop types in the Land IQ dataset.Accuracy was calculated for the crop mapping using both DWR and Land IQ crop legends. The overall accuracy result for the crop mapping statewide was 97% using the Land IQ legend (Land IQ Subclass) and 98% using the DWR legend (DWR Class). Accuracy and error results varied among crop types. Some less extensive crops that have very few validation samples may have a skewed accuracy result depending on the number and nature of validation sample points. DWR revised crops and conditions from the Land IQ classification were encoded using standard DWR land use codes added to feature attributes, and each modified classification is indicated by the value 'r' in the ‘DWR_REVISE' data field. Polygons drawn by DWR, not included in Land IQ dataset receive the 'n' code for new. Boundary change (i.e. DWR changed the boundary that LIQ delivered, could be split boundary) indicated by 'b'. Each polygon classification is consistent with DWR attribute standards, however some of DWR's traditional attribute definitions are modified and extended to accommodate unavoidable constraints within remote-sensing classifications, or to make data more specific for DWR's water balance computation needs. The original Land IQ classifications reported for each polygon are preserved for comparison, and are also expressed as DWR standard attributes. Comments, problems, improvements, updates, or suggestions about local conditions or revisions in the final data set should be forwarded to the appropriate Regional Office Senior Land Use Supervisor.Revisions were made if:- DWR corrected the original crop classification based on local knowledge and analysis,-PARTIALLY IRRIGATED CROPS Crops, irrigated for only part of their normal irrigation season were given the special condition of ‘X’,-In certain areas, DWR changed the irrigation status to non-irrigated. Among those areas the special condition may have been changed to 'Partially Irrigated' based on image analysis and local knowledge,- young versus mature stages of perennial orchards and vineyards were identified (DWR added ‘Young’ to Special Condition attributes),- DWR determined that a field originally classified ‘Idle’ or 'Unclassified' were actually cropped one or more times during the year,- the percent of cropped area was changed from the original acres reported by Land IQ (values indicated in DWR ‘Percent’ column),- DWR determined that the field boundary should have been changed to better reflect the cropped area of the polygon and is identified by a 'b' in the DWR_REVISED column,- DWR determined that the field boundary should have been split to better reflect separate crops within the same polygon and identified by a 'b' in the DWR_REVISED column,- The ‘Mixed’ was added to the MULTIUSE column refers to no boundary change, but percent of field is changed where more than one crop is found,- DWR identified a distinct early or late crop on the field before the main season crop (‘Double’ was added to the MULTIUSE column); if the 1st and 2nd sequential crops occupied different portions of the total field acreage, the area percentages were indicated for each crop).This dataset includes multicropped fields. If the field was determined to have more than one crop during the course of the WY (Water Year begins October 1 and ends September 30 of the following year), the order of the crops is sequential, beginning with Class 1. All single cropped fields will be placed in Class 2, so every polygon will have a crop in the Class 2 and CropType2 columns. In the case that a permanent crop was removed during the WY, the Class 2 crop will be the permanent crop followed by ‘X’ – Unclassified fallow in the Class 3 column. In the case of Intercropping, the main crop will be placed in the Class 2 column with the partial crop in the Class 3 column.A new column for the 2019, 2020, and 2021 datasets is called ‘MAIN_CROP’. This column indicates which field Land IQ identified as the main season crop for the WY representing the crop grown during the dominant growing season for each county. The column ‘MAIN_CROP_DATE’, another addition to the 2019, 2020, and 2021 datasets, indicates the Normalized Difference Vegetation Index (NDVI) peak date for this main season crop. The column 'EMRG_CROP' for 2019, 2020, and 2021 indicates an emerging crop at the end of the WY. Crops listed indicate that at the end of the WY, September 2021, crop activity was detected from a crop that reached peak NDVI in the following WY (2022 WY). This attribute is included to account for water use of crops that span multiple WYs and are not exclusive to a single WY. It is indicative of early crop growth and initial water use in the current WY, but a majority of crop development and water use in the following WY. Crops listed in the ‘EMRG_CROP’ attribute will also be captured as the first crop (not necessarily Crop 1) in the following WY (2022 WY). These crops are not included in the 2021 UCF_ATT code as their peak date occurred in the following WY.For the 2021 dataset new columns added are: 'YR_PLANTED' which represent the year orchard / grove was planted. 'SEN_CROP' indicates a senescing crop at the beginning of the WY. Crops listed indicate that at the beginning of the WY, October 2020, crop activity was detected from a crop that reached peak NDVI in the previous WY (2020 WY), thus was a senescing crop. This is included to account for water use of crop growth periods that span multiple WYs and are not exclusive to a WY. Crops listed in the ‘SEN_CROP’ attribute are also captured in the CROPTYP 1 through 4 sequence of the previous WY (2020 WY). These crops are not included in the 2021 UCF_ATT code as their peak NDVI occurred in the previous WY. CTYP#_NOTE: indicates a more specific land use subclassification from the DWR Standard Land Use Legend that is not included in the primary, DWR Remote Sensing Land Use Legend.DWR reviewed and revised the data in some cases. The associated data are considered DWR enterprise GIS data, which meet all appropriate requirements of the DWR Spatial Data Standards, specifically the DWR Spatial Data Standard version 3.6, dated September 27, 2023. This data set was not produced by DWR. Data were originally developed and supplied by Land IQ, LLC, under contract to California Department of Water Resources. DWR makes no warranties or guarantees - either expressed or implied - as to the completeness, accuracy, or correctness of the data. DWR neither accepts nor assumes liability arising from or for any incorrect, incomplete, or misleading subject data. Detailed compilation and reviews of Statewide Crop Mapping and metadata development were performed by DWR Land Use Unit staff, therefore you may forward your questions to Landuse@water.ca.gov.This dataset is current as of 2021.
Facebook
TwitterLand Cover Mapping Analysis And Urban Growth Modelling Using Remote Sensing Techniques In Greater Cairo RegionEgypt
This dataset falls under the category Traffic Generating Parameters Land Cover.
It contains the following data: This study modelled the urban growth in the Greater Cairo Region (GCR), one of the fastest growing mega cities in the world, using remote sensing data and ancillary data. Three land use land cover (LULC) maps (1984, 2003 and 2014) were produced from satellite images by using Support Vector Machines (SVM). Then, land cover changes were detected by applying a high level mapping technique that combines binary maps (change/no-change) and post classification comparison technique. The spatial and temporal urban growth patterns were analyzed using selected statistical metrics developed in the FRAGSTATS software. Major transitions to urban were modelled to predict the future scenarios for year 2025 using Land Change Modeller (LCM) embedded in the IDRISI software. The model results, after validation, indicated that 14% of the vegetation and 4% of the desert in 2014 will be urbanized in 2025. The urban areas within a 5-km buffer around: the Great Pyramids, Islamic Cairo and Al-Baron Palace were calculated, highlighting an intense urbanization especially around the Pyramids; 28% in 2014 up to 40% in 2025. Knowing the current and estimated urbanization situation in GCR will help decision makers to adjust and develop new plans to achieve a sustainable development of urban areas and to protect the historical locations.
This dataset was scouted on 2022-02-03 as part of a data sourcing project conducted by TUMI. License information might be outdated: Check original source for current licensing.
The data can be accessed using the following URL / API Endpoint: https://www.researchgate.net/publication/282321895_Land_Cover_Mapping_Analysis_and_Urban_Growth_Modelling_Using_Remote_Sensing_Techniques_in_Greater_Cairo_Region-Egypt Please note: This link leads to an external resource. If you experience any issues with its availability, please try again later.
Facebook
TwitterThe files linked to this reference are the geospatial data created as part of the completion of the baseline vegetation inventory project for the NPS park unit. Current format is ArcGIS file geodatabase but older formats may exist as shapefiles.
A combination of methods was used to delineate and interpret vegetation polygons for the mapping component of the CEBR project area. RMGSC cartographers acquired or created a number of ancillary spatial datasets: soils, geology, fire data, climate, fauna, invasive species, and CEBR and project boundaries. Digital elevation model (DEM) data were downloaded from the USGS Seamless Data Server (http://seamless.usgs.gov) and processed into a 10-meter ERDAS Imagine file. Slope and aspect information were derived from the DEM coverage. Vegetation information was obtained from Zion National Park from a survey that occurred in 1988 for the CEBR area. Mylar overlays and MOSS data layers were used to create this dataset. After receiving the files, the data were subsequently transferred to current digital orthophotos and output to a shapefile. Two classifications were nested within the original spatial dataset: a habitat type classification with 18 categories (145 polygons) and a community level classification with 29 classes (127 polygons). The polygon delineation was fairly coarse and the shapefile was used only as a reference for the current work. Another vegetation dataset that was acquired from the NPS was a USFS layer. This dataset was clipped to the CEBR area; this subset contained 125 polygons with 16 map classes. The age of these data and any additional metadata is unknown; therefore these general polygons were only used as a guide for the current project.
Facebook
Twitter2022 STATEWIDE CROP MAPPING - PROVISIONALLand use data is critically important to the work of the Department of Water Resources (DWR) and other California agencies. Understanding the impacts of land use, crop location, acreage, and management practices on environmental attributes and resource management is an integral step in the ability of Groundwater Sustainability Agencies (GSAs) to produce Groundwater Sustainability Plans (GSPs) and implement projects to attain sustainability. Land IQ was contracted by DWR to develop a comprehensive and accurate spatial land use database for the 2022 water year (WY 2022). The primary objective of this effort was to produce a spatial land use database with accuracies exceeding 95% using remote sensing, statistical, and temporal analysis methods. This project is an extension of the 2014, 2016, 2018, 2019, 2020 and 2021 land use mapping, which classified over 14 million acres of land into irrigated agriculture and urban area. Unlike the 2014 and 2016 datasets, the WY 2018, 2019, 2020, 2021 and 2022 datasets include multi-cropping and incorporates DWR ground-truth data from Siskiyou, Modoc, Lassen and Shasta counties. Land IQ integrated crop production knowledge with detailed ground truth information and multiple satellite and aerial image resources to conduct remote sensing land use analysis at the field scale. Individual fields (boundaries of homogeneous crop types representing cropped area, rather than legal parcel boundaries) were classified using a crop category legend and a more specific crop type legend. A supervised classification method using a random forest approach was used to classify delineated fields and was carried out county by county where training samples were available. Random forest approaches are currently some of the highest performing methods for data classification and regression. To determine frequency and seasonality of multiple-cropped fields, peak growth dates were determined for annual crops. Fields were attributed with DWR crop categories and included citrus/subtropical, deciduous fruits and nuts, field crops, grain and hay, idle, pasture, rice, truck crops, urban, vineyards, and young perennials. These categories represent aggregated groups of specific crop types in the Land IQ dataset. Accuracy was calculated for the crop mapping using both DWR and Land IQ crop legends. The overall accuracy result for the crop mapping statewide was 98.1% at the DWR Class level and 96.7% at the DWR Subclass level. Accuracy and error results varied among crop types. In particular, some less extensive crops that have very few validation samples may have a skewed accuracy result depending on the number and nature of validation sample points. DWR revised crops and conditions from the Land IQ classification were encoded using standard DWR land use codes added to feature attributes, and each modified classification is indicated by the value 'r' in the ‘DWR_REVISE' data field. Polygons drawn by DWR, not included in Land IQ dataset receive the 'n' code for new. Boundary change (i.e. DWR changed the boundary that LIQ delivered could be split boundary) indicated by 'b'. Each polygon classification is consistent with DWR attribute standards, however some of DWR's traditional attribute definitions are modified and extended to accommodate unavoidable constraints within remote-sensing classifications, or to make data more specific for DWR's water balance computation needs. The original Land IQ classifications reported for each polygon are preserved for comparison, and are also expressed as DWR standard attributes. Comments, problems, improvements, updates, or suggestions about local conditions or revisions in the final data set should be forwarded to the appropriate Regional Office Senior Land Use Supervisor. Revisions were made if: - DWR corrected the original crop classification based on local knowledge and analysis, -PARTIALLY IRRIGATED CROPS Crops irrigated for only part of their normal irrigation season were given the special condition of ‘X’, -In certain areas, DWR changed the irrigation status to irrigated or non-irrigated. Among those areas the special condition may have been changed to 'Partially Irrigated' based on image analysis and local knowledge, - young versus mature stages of perennial orchards and vineyards were identified (DWR added ‘Young’ to Special Condition attributes), - DWR determined that a field originally classified ‘Idle’ was actually cropped one or more times during the year, - the percent of cropped area was changed from the original acres reported by Land IQ (values indicated in DWR ‘Percent’ column), - DWR determined that the field boundary should have been split to better reflect separate crops within the same polygon and identified by a 'b' in the DWR_REVISED column, - The ‘Mixed’ was added to the MULTIUSE column refers to no boundary change, but percent of field is changed where more than one crop is found, - DWR identified a distinct early or late crop on the field before the main season crop (‘Double’ was added to the MULTIUSE column); if the 1st and 2nd sequential crops occupied different portions of the total field acreage, the area percentages were indicated for each crop). This dataset includes multicropped fields. If the field was determined to have more than one crop during the course of the water year, the order of the crops is sequential, beginning with Class 1. All single cropped fields will be placed in Class 2, so every polygon will have a crop in the Class 2 and CropType2 columns. In the case that a permanent crop was removed during the water year, the Class 2 crop will be the permanent crop followed by ‘X’ – Unclassified fallow in the Class 3 column. In the case of Intercropping, the main crop will be placed in the Class 2 column with the partial crop in the Class 3 column. The column 'MAIN_CROP' was added in 2019 and has been continued through the 2022 dataset. This column indicates which field Land IQ identified as the main season crop for the water year representing the crop grown during the dominant growing season for each county. The column ‘MAIN_CROP_DATE’, continued in the 2022 dataset, indicates the NDVI peak date for this main season crop. Asterisks (* or **) in attribute table indicates no data have been collected for that specific attribute.This provisional metadata does not contain the full metadata per the California Department of Water Resources (DWR) Spatial Data Standards. DWR reviewed and revised the data in some cases. The associated data are considered DWR enterprise GIS data, which meet all appropriate requirements of the DWR Spatial Data Standards, specifically the DWR Spatial Data Standard version 3.5, dated March 22, 2023. This dataset was not produced by DWR. Data were originally developed and supplied by Land IQ, LLC, under contract to California Department of Water Resources. Comments, problems, improvements, updates, or suggestions should be forwarded to LandUse@water.ca.gov.Prior to WY 2021 final mapping release, pasture areas that where mechanically harvested during a water year were classified as P6-Miscellaneous Grasses. Starting with the WY 2021 final mapping release and moving forward these harvested pasture areas are classified as P3-Mixed Pasture.
Facebook
TwitterLearn Geographic Mapping with Altair, Vega-Lite and Vega using Curated Datasets
Complete geographic and geophysical data collection for mapping and visualization. This consolidation includes 18 complementary datasets used by 31+ Vega, Vega-Lite, and Altair examples 📊. Perfect for learning geographic visualization techniques including projections, choropleths, point maps, vector fields, and interactive displays.
Source data lives on GitHub and can also be accessed via CDN. The vega-datasets project serves as a common repository for example datasets used across these visualization libraries and related projects.
airports.csv), lines (like londonTubeLines.json), and polygons (like us-10m.json).windvectors.csv, annual-precip.json).This pack includes 18 datasets covering base maps, reference points, statistical data for choropleths, and geophysical data.
| Dataset | File | Size | Format | License | Description | Key Fields / Join Info |
|---|---|---|---|---|---|---|
| US Map (1:10m) | us-10m.json | 627 KB | TopoJSON | CC-BY-4.0 | US state and county boundaries. Contains states and counties objects. Ideal for choropleths. | id (FIPS code) property on geometries |
| World Map (1:110m) | world-110m.json | 117 KB | TopoJSON | CC-BY-4.0 | World country boundaries. Contains countries object. Suitable for world-scale viz. | id property on geometries |
| London Boroughs | londonBoroughs.json | 14 KB | TopoJSON | CC-BY-4.0 | London borough boundaries. | properties.BOROUGHN (name) |
| London Centroids | londonCentroids.json | 2 KB | GeoJSON | CC-BY-4.0 | Center points for London boroughs. | properties.id, properties.name |
| London Tube Lines | londonTubeLines.json | 78 KB | GeoJSON | CC-BY-4.0 | London Underground network lines. | properties.name, properties.color |
| Dataset | File | Size | Format | License | Description | Key Fields / Join Info |
|---|---|---|---|---|---|---|
| US Airports | airports.csv | 205 KB | CSV | Public Domain | US airports with codes and coordinates. | iata, state, `l... |