Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0) https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
MAP-CC
🌐 Homepage | 🤗 MAP-CC | 🤗 CHC-Bench | 🤗 CT-LLM | 📖 arXiv | GitHub An open-source Chinese pretraining dataset with a scale of 800 billion tokens, offering the NLP community high-quality Chinese pretraining data.
Disclaimer
This model, developed for academic purposes, employs rigorously compliance-checked training data to uphold the highest standards of integrity and compliance. Despite our efforts, the inherent complexities of data and the broad spectrum of… See the full description on the dataset page: https://huggingface.co/datasets/m-a-p/MAP-CC.
https://brightdata.com/license
The Google Maps dataset is ideal for getting extensive information on businesses anywhere in the world. Easily filter by location, business type, and other factors to get the exact data you need. The Google Maps dataset includes all major data points: timestamp, name, category, address, description, open website, phone number, open_hours, open_hours_updated, reviews_count, rating, main_image, reviews, url, lat, lon, place_id, country, and more.
The Digital Geologic Map of the U.S. Geological Survey Mapping in the Western Portion of Amistad National Recreation Area, Texas is composed of GIS data layers complete with ArcMap 9.3 layer (.LYR) files, two ancillary GIS tables, a Map PDF document with ancillary map text, figures and tables, a FGDC metadata record and a 9.3 ArcMap (.MXD) Document that displays the digital map in 9.3 ArcGIS. The data were completed as a component of the Geologic Resources Inventory (GRI) program, a National Park Service (NPS) Inventory and Monitoring (I&M) funded program that is administered by the NPS Geologic Resources Division (GRD). Source geologic maps and data used to complete this GRI digital dataset were provided by the following: Eddie Collins, Amanda Masterson and Tom Tremblay (Texas Bureau of Economic Geology); Rick Page (U.S. Geological Survey); Gilbert Anaya (International Boundary and Water Commission). Detailed information concerning the sources used and their contribution to the GRI product is listed in the Source Citation section(s) of this metadata record (wpam_metadata.txt; available at http://nrdata.nps.gov/amis/nrdata/geology/gis/wpam_metadata.xml). All GIS and ancillary tables were produced as per the NPS GRI Geology-GIS Geodatabase Data Model v. 2.1 (available at: http://science.nature.nps.gov/im/inventory/geology/GeologyGISDataModel.cfm). The GIS data are available as a 9.3 personal geodatabase (wpam_geology.mdb), and as shapefile (.SHP) and DBASEIV (.DBF) table files. The GIS data projection is NAD83, UTM Zone 14N. The data are within the area of interest of Amistad National Recreation Area.
The USGS Topo base map service from The National Map is a combination of contours, shaded relief, woodland and urban tint, along with vector layers, such as geographic names, governmental unit boundaries, hydrography, structures, and transportation, to provide a composite topographic base map. Data sources are the National Atlas for small scales, and The National Map for medium to large scales.
Attribution-ShareAlike 2.0 (CC BY-SA 2.0) https://creativecommons.org/licenses/by-sa/2.0/
License information was derived automatically
This web map references the live tiled map service from the OpenStreetMap (OSM) project. OpenStreetMap (OSM) is an open collaborative project to create a free editable map of the world. Volunteers gather location data using GPS, local knowledge, and other free sources of information and upload it. The resulting free map can be viewed and downloaded from the OpenStreetMap server: https://www.OpenStreetMap.org. See that website for additional information about OpenStreetMap. It is made available as a basemap for GIS work in ESRI products under a Creative Commons Attribution-ShareAlike license. Tip: This service is one of the basemaps used in the ArcGIS.com map viewer. Simply click one of those links to launch the interactive application of your choice, and then choose Open Street Map from the Basemap control to start using this service. You'll also find this service in the Basemap gallery in ArcGIS Explorer Desktop and ArcGIS Desktop 10. Tip: Here are some well known locations as they appear in this web map, accessed by launching the web map with a URL that contains location parameters: Athens, Cairo, Jakarta, Moscow, Mumbai, Nairobi, Paris, Rio De Janeiro, Shanghai
In 2007, the California Ocean Protection Council initiated the California Seafloor Mapping Program (CSMP), designed to create a comprehensive seafloor map of high-resolution bathymetry, marine benthic habitats, and geology within California’s State Waters. The program supports a large number of coastal-zone- and ocean-management issues, including the California Marine Life Protection Act (MLPA) (California Department of Fish and Wildlife, 2008), which requires information about the distribution of ecosystems as part of the design and proposal process for the establishment of Marine Protected Areas. A focus of CSMP is to map California’s State Waters with consistent methods at a consistent scale. The CSMP approach is to create highly detailed seafloor maps through collection, integration, interpretation, and visualization of swath sonar data (the undersea equivalent of satellite remote-sensing data in terrestrial mapping), acoustic backscatter, seafloor video, seafloor photography, high-resolution seismic-reflection profiles, and bottom-sediment sampling data. The map products display seafloor morphology and character, identify potential marine benthic habitats, and illustrate both the surficial seafloor geology and shallow (to about 100 m) subsurface geology. It is emphasized that the more interpretive habitat and geology data rely on the integration of multiple, new high-resolution datasets and that mapping at small scales would not be possible without such data. This approach and CSMP planning is based in part on recommendations of the Marine Mapping Planning Workshop (Kvitek and others, 2006), attended by coastal and marine managers and scientists from around the state. That workshop established geographic priorities for a coastal mapping project and identified the need for coverage of “lands” from the shore strand line (defined as Mean Higher High Water; MHHW) out to the 3-nautical-mile (5.6-km) limit of California’s State Waters. 
Unfortunately, surveying the zone from MHHW out to 10-m water depth is not consistently possible using ship-based surveying methods, owing to sea state (for example, waves, wind, or currents), kelp coverage, and shallow rock outcrops. Accordingly, some of the data presented in this series commonly do not cover the zone from the shore out to 10-m depth. This data is part of a series of online U.S. Geological Survey (USGS) publications, each of which includes several map sheets, some explanatory text, and a descriptive pamphlet. Each map sheet is published as a PDF file. Geographic information system (GIS) files that contain both ESRI ArcGIS raster grids (for example, bathymetry, seafloor character) and geotiffs (for example, shaded relief) are also included for each publication. For those who do not own the full suite of ESRI GIS and mapping software, the data can be read using ESRI ArcReader, a free viewer that is available at http://www.esri.com/software/arcgis/arcreader/index.html (last accessed September 20, 2013). The California Seafloor Mapping Program is a collaborative venture between numerous different federal and state agencies, academia, and the private sector. CSMP partners include the California Coastal Conservancy, the California Ocean Protection Council, the California Department of Fish and Wildlife, the California Geological Survey, California State University at Monterey Bay’s Seafloor Mapping Lab, Moss Landing Marine Laboratories Center for Habitat Studies, Fugro Pelagos, Pacific Gas and Electric Company, National Oceanic and Atmospheric Administration (NOAA, including National Ocean Service–Office of Coast Surveys, National Marine Sanctuaries, and National Marine Fisheries Service), U.S. Army Corps of Engineers, the Bureau of Ocean Energy Management, the National Park Service, and the U.S. Geological Survey. 
These web services for the Point Sur to Point Arguello map area include data layers that are associated with GIS and map sheets available from the USGS CSMP web page at https://walrus.wr.usgs.gov/mapping/csmp/index.html. Each published CSMP map area includes a data catalog of geographic information system (GIS) files; map sheets that contain explanatory text; and an associated descriptive pamphlet. This web service represents the available data layers for this map area. Data were combined from different sonar surveys to generate comprehensive high-resolution bathymetry and acoustic-backscatter coverage of the map area. These data reveal a range of physiographic features, including exposed bedrock outcrops and large fields of sand waves, as well as many human impacts on the seafloor. To validate geological and biological interpretations of the sonar data, the U.S. Geological Survey towed a camera sled over specific offshore locations, collecting both video and photographic imagery; these “ground-truth” surveying data are available from the CSMP Video and Photograph Portal at https://doi.org/10.5066/F7J1015K. The “seafloor character” data layer shows classifications of the seafloor on the basis of depth, slope, rugosity (ruggedness), and backscatter intensity, further informed by the ground-truth-survey imagery. The “potential habitats” polygons are delineated on the basis of substrate type, geomorphology, seafloor process, or other attributes that may provide a habitat for a specific species or assemblage of organisms. Representative seismic-reflection profile data from the map area are also included and provide information on the subsurface stratigraphy and structure of the map area. The distribution and thickness of young sediment (deposited over about the past 21,000 years, during the most recent sea-level rise) is interpreted on the basis of the seismic-reflection data.
The geologic polygons merge onshore geologic mapping (compiled from existing maps by the California Geological Survey) and new offshore geologic mapping that is based on integration of high-resolution bathymetry and backscatter imagery, seafloor-sediment and rock samples, digital camera and video imagery, and high-resolution seismic-reflection profiles. The information provided by the map sheets, pamphlet, and data catalog has a broad range of applications. High-resolution bathymetry, acoustic backscatter, ground-truth-surveying imagery, and habitat mapping all contribute to habitat characterization and ecosystem-based management by providing essential data for delineation of marine protected areas and ecosystem restoration. Many of the maps provide high-resolution baselines that will be critical for monitoring environmental change associated with climate change, coastal development, or other forcings. High-resolution bathymetry is a critical component for modeling coastal flooding caused by storms and tsunamis, as well as inundation associated with longer term sea-level rise. Seismic-reflection and bathymetric data help characterize earthquake and tsunami sources, critical for natural-hazard assessments of coastal zones. Information on sediment distribution and thickness is essential to the understanding of local and regional sediment transport, as well as the development of regional sediment-management plans. In addition, siting of any new offshore infrastructure (for example, pipelines, cables, or renewable-energy facilities) will depend on high-resolution mapping. Finally, this mapping will both stimulate and enable new scientific research and also raise public awareness of, and education about, coastal environments and issues. Web services were created using an ArcGIS service definition file. The ArcGIS REST service and OGC WMS service include all Point Sur to Point Arguello map area data layers. Data layers are symbolized as shown on the associated map sheets.
The Nova Map (World Edition) web map provides a detailed world basemap featuring a dark background with glowing blue symbology and colors that are reminiscent of science-fiction shows, where one is looking at a map of the world on a heads-up display or a map projected onto a transparent glass wall. The map is designed with a grid pattern across the ocean and stripes or square stippled patterns for land use features visible at larger scales. Additional graphics in the oceans present a futuristic user interface. The futuristic, less terrestrial theme continues with the geometric patterns, starburst city dot symbols, and cool color scheme. The fonts displayed are clean and squarish (sans serif) with a futuristic, science-fiction, or high-technology appearance.
This basemap, included in the ArcGIS Living Atlas of the World, uses the Nova vector tile layer. The vector tile layer in this web map is built using the same data sources used for other Esri Vector Basemaps. For details on data sources contributed by the GIS community, view the map of Community Maps Basemap Contributors. Esri Vector Basemaps are updated monthly.
Use this Map
This map is designed to be used as a basemap for overlaying other layers of information or as a stand-alone reference map. You can add layers to this web map and save it as your own map. If you like, you can add this web map to a custom basemap gallery for others in your organization to use in creating web maps. If you would like to add this map as a layer in other maps you are creating, you may use the tile layer referenced in this map.
This cached web mapping service provides access to a seamless version of the Kentucky Topographic Map Series, also known as KyTopo. The Kentucky-specific map series has newly generated contours, spot elevations, and hillshade based on the KyFromAbove LiDAR-derived DEM. Quadrangle names were developed utilizing a USGS methodology that focuses on the most prominent map features. Public domain data from a variety of state and federal agencies were leveraged to create the map series. All layers utilized during production are available on the KyGeoNet as downloadable data or web mapping services. Updates to this map service will be made on a periodic basis.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
ICDAR 2021 Competition on Historical Map Segmentation — Dataset
This is the dataset of the ICDAR 2021 Competition on Historical Map Segmentation (“MapSeg”). This competition ran from November 2020 to April 2021. Evaluation tools are freely available but distributed separately.
Official competition website: https://icdar21-mapseg.github.io/
The competition report can be cited as:
Joseph Chazalon, Edwin Carlinet, Yizi Chen, Julien Perret, Bertrand Duménieu, Clément Mallet, Thierry Géraud, Vincent Nguyen, Nam Nguyen, Josef Baloun, Ladislav Lenc, and Pavel Král, "ICDAR 2021 Competition on Historical Map Segmentation", in Proceedings of the 16th International Conference on Document Analysis and Recognition (ICDAR'21), September 5-10, 2021, Lausanne, Switzerland.
BibTeX entry:
@InProceedings{chazalon.21.icdar.mapseg,
  author    = {Joseph Chazalon and Edwin Carlinet and Yizi Chen and Julien Perret and Bertrand Duménieu and Clément Mallet and Thierry Géraud and Vincent Nguyen and Nam Nguyen and Josef Baloun and Ladislav Lenc and Pavel Král},
  title     = {ICDAR 2021 Competition on Historical Map Segmentation},
  booktitle = {Proceedings of the 16th International Conference on Document Analysis and Recognition (ICDAR'21)},
  year      = {2021},
  address   = {Lausanne, Switzerland},
}
We thank the City of Paris for granting us permission to use and reproduce the atlases used in this work.
The images of this dataset are extracted from a series of 9 atlases of the City of Paris produced between 1894 and 1937 by the Map Service (“Service du plan”) of the City of Paris, France, for the purpose of urban management and planning. For each year, a set of approximately 20 sheets forms a tiled view of the city, drawn at 1/5000 scale using trigonometric triangulation.
Sample citation of original documents:
Atlas municipal des vingt arrondissements de Paris. 1894, 1895, 1898, 1905, 1909, 1912, 1925, 1929, and 1937. Bibliothèque de l’Hôtel de Ville. City of Paris. France.
Motivation
This competition aims at encouraging research in the digitization of historical maps. To be usable in historical studies, the information contained in such images needs to be extracted. The general pipeline involves multiple stages; we list some essential ones here:
segment map content: locate the area of the image which contains map content;
extract map objects from different layers: detect objects like roads, buildings, building blocks, rivers, etc. to create geometric data;
georeference the map: by detecting objects at known geographic coordinates, compute the transformation to turn geometric objects into geographic ones (which can be overlaid on current maps).
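The georeferencing stage can be illustrated with a minimal sketch. It assumes an affine transformation is adequate and fits it by least squares from control points; the competition material does not prescribe this model, and the function names (`fit_affine`, `apply_affine`) and the sample coordinates are hypothetical.

```python
def solve3(m, v):
    """Solve a 3x3 linear system m @ p = v by Gauss-Jordan elimination."""
    a = [row[:] + [rhs] for row, rhs in zip(m, v)]
    for col in range(3):
        # Partial pivoting: pick the row with the largest entry in this column.
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            raise ValueError("control points are collinear or too few")
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(3):
            if r != col:
                factor = a[r][col] / a[col][col]
                a[r] = [x - factor * y for x, y in zip(a[r], a[col])]
    return [a[i][3] / a[i][i] for i in range(3)]


def fit_affine(pixel_pts, geo_pts):
    """Least-squares affine transform from pixel (x, y) to geographic (X, Y).

    Returns [[a, b, c], [d, e, f]] with X = a*x + b*y + c and Y = d*x + e*y + f.
    Needs at least three non-collinear control points.
    """
    rows = [(x, y, 1.0) for x, y in pixel_pts]
    # Normal equations (A^T A) p = A^T b, solved once per output coordinate.
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    coeffs = []
    for k in range(2):
        atb = [sum(r[i] * g[k] for r, g in zip(rows, geo_pts)) for i in range(3)]
        coeffs.append(solve3(ata, atb))
    return coeffs


def apply_affine(coeffs, x, y):
    """Map one pixel coordinate to geographic coordinates."""
    (a, b, c), (d, e, f) = coeffs
    return a * x + b * y + c, d * x + e * y + f


# Hypothetical control points: four detected intersections with known coordinates.
pixel_pts = [(0, 0), (100, 0), (0, 100), (100, 100)]
geo_pts = [(500, 1000), (700, 1000), (500, 800), (700, 800)]
coeffs = fit_affine(pixel_pts, geo_pts)
center = apply_affine(coeffs, 50, 50)
```

With more than three control points the fit averages out small detection errors; scanned sheets with paper distortion may need a higher-order model than the affine one assumed here.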
Task overview
Task 1: Detection of building blocks
Task 2: Segmentation of map content within map sheets
Task 3: Localization of graticule lines intersections
Please refer to the enclosed README.md file or to the official website for the description of tasks and file formats.
Evaluation metrics and tools
Evaluation metrics are described in the competition report and tools are available at https://github.com/icdar21-mapseg/icdar21-mapseg-eval and should also be archived using Zenodo.
📊 Dataset Overview
The emoji-map dataset, created by omarkamali, contains text data in parquet format. It consists of 10K-100K entries, specifically 5.03k rows. The dataset is available in the train split.
📁 Data Structure
The dataset includes two main columns: emoji and unicode_description. The emoji column contains various emoji characters, while the unicode_description column provides a textual description of each emoji.
🔍 Sample Data
Examples from the… See the full description on the dataset page: https://huggingface.co/datasets/omarkamali/emoji-map.
Do not delete.
The Economic Development web map is used to author the Economic Development Experience Builder application. It displays the economic development districts, enterprise zones, industrial areas, economic development zones, Baton Rouge Airport property, and Louisiana Opportunity Zones data in East Baton Rouge Parish, Louisiana.
This map provides a colorized representation of slope, generated dynamically using a server-side slope function on the Terrain layer. The degree of slope steepness is depicted by light to dark colors: flat surfaces as gray, shallow slopes as light yellow, moderate slopes as light orange and steep slopes as red-brown. A scaling is applied to slope values to generate appropriate visualization at each map scale. This service should only be used for visualization, such as a base layer in applications or maps. Note: If access to non-scaled slope values is required, use the Slope Degrees or Slope Percent functions, which return values from 0 to 90 degrees, or 0 to 1000%, respectively.
Units: Degrees
Update Frequency: Quarterly
Coverage: World/Global
Data Sources: This layer is compiled from a variety of best available sources from several data providers. To see the coverage and extents of various datasets comprising this service in an interactive map, see World Elevation Coverage Map.
What can you do with this layer?
Use for Visualization: Yes. This colorized slope is appropriate for visualizing the steepness of the terrain at all map scales. This layer can be added to applications or maps to enhance contextual understanding.
Use for Analysis: No. 8 bit color values returned by this service represent scaled slope values. For analysis with non-scaled values, use the Slope Degrees or Slope Percent functions.
For more details such as Data Sources, Mosaic method used in this layer, please see the Terrain layer. This layer allows query, identify, and export image requests. The layer is restricted to a 5,000 x 5,000 pixel limit in a single export image request.
This layer is part of a larger collection of elevation layers that you can use to perform a variety of mapping analysis tasks.
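The two non-scaled slope units mentioned above are related by a tangent. The sketch below assumes the common percent-rise definition (percent = 100 × tan(angle)); it is not taken from the service documentation, and the function names are hypothetical. Under that assumption, the 1000% upper bound corresponds to an angle of roughly 84.3 degrees.

```python
import math


def slope_degrees_to_percent(degrees):
    """Convert a slope angle in degrees to percent rise (assumed definition)."""
    return 100.0 * math.tan(math.radians(degrees))


def slope_percent_to_degrees(percent):
    """Convert percent rise back to a slope angle in degrees."""
    return math.degrees(math.atan(percent / 100.0))


# A 45-degree slope rises one unit per unit of run, i.e. 100 percent.
forty_five = slope_degrees_to_percent(45.0)
# Angle that corresponds to the 1000 percent bound quoted for Slope Percent.
cap_angle = slope_percent_to_degrees(1000.0)
```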
A web map used to visualize available digital parcel data for Organized Towns and Unorganized Territories throughout the state of Maine. Individual towns submit parcel data on a voluntary basis; the data are compiled by the Maine Office of GIS for dissemination by the Maine GeoLibrary, and where available, the web map also includes assessor data contained in the Parcels_ADB related table. This web map is intended for use within the Maine Geoparcel Viewer Application; it is not intended for use as a standalone web map. Within Maine, real property data is maintained by the government organization responsible for assessing and collecting property tax for a given location. Organized towns and townships maintain authoritative data for their communities and may voluntarily submit these data to the Maine GeoLibrary Parcel Project. Maine Parcels Organized Towns and Maine Parcels Organized Towns ADB are the product of these voluntary submissions. Communities provide updates to the Maine GeoLibrary on a non-regular basis, sometimes many years apart, which affects the currency of Maine GeoLibrary parcels data. Another resource for real property transaction data is the County Registry of Deeds, although organized town data should very closely match registry information, except in the case of in-process property conveyance transactions.
This dataset represents the cadastral maps created by the Geomatics branch in support of real property acquisitions within the Department of Water Resources. The geographic extent of each map frame was created by using all the spatial attributes available in each map to georeference it and then deriving the extent from the outer frame of the map. The maps were digitally scanned from the original paper copies, which were archived after the move to the new resources building. As new maps are created by the branch for real property acquisition services, they will be georeferenced, attributed and added to this dataset. The associated data are considered DWR enterprise GIS data, which meet all appropriate requirements of the DWR Spatial Data Standards, specifically the DWR Spatial Data Standard version 3.6, dated September 27, 2023. DWR makes no warranties or guarantees either expressed or implied as to the completeness, accuracy, or correctness of the data. DWR neither accepts nor assumes liability arising from or for any incorrect, incomplete, or misleading subject data. Original internal source projection for this dataset was Teale Albers/NAD83. For copies of data in the original projection, please contact DWR. Comments, problems, improvements, updates, or suggestions should be forwarded to gis@water.ca.gov as available and appropriate.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Leaflet Map is a dataset for object detection tasks - it contains Places Maps annotations for 278 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/by/4.0/).
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This data set provides four land cover and ecosystem classification maps for northern Alaska. The maps were produced for several projects and from different data sources including Landsat imagery and existing maps and models, and cover a range of ecosystem and vegetation classes. The data used to derive the maps covered the period 1976-08-04 to 2014-09-01.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Map is a dataset for object detection tasks - it contains Objects Traffic annotations for 513 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/by/4.0/).
The 3D Visualisation Map (Individualised models) are a set of digital data of 3D models featuring geometry models and texture maps to represent the geometrical shape, appearance and position of different types of ground objects, including building, infrastructure, vegetation, site, waterbody, terrain and generic (others). The dataset covers the whole territory of Hong Kong. You can click the link below to access the 3D Visualisation Map (https://3d.map.gov.hk/).