Facebook
TwitterThe Global Biodiversity Information Facility (GBIF) is an international network and data infrastructure funded by the world's governments providing global data that document the occurrence of species. GBIF currently integrates datasets documenting over 1.6 billion species occurrences, growing daily. The GBIF occurrence dataset combines data from a wide array of sources including specimen-related data from natural history museums, observations from citizen science networks and environment recording schemes. While these data are constantly changing at GBIF.org, periodic snapshots are taken and made available on AWS.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The GBIF Backbone Taxonomy is a single, synthetic management classification with the goal of covering all names GBIF is dealing with. It's the taxonomic backbone that allows GBIF to integrate name based information from different resources, no matter if these are occurrence datasets, species pages, names from nomenclators or external sources like EOL, Genbank or IUCN. This backbone allows taxonomic search, browse and reporting operations across all those resources in a consistent way and to provide means to crosswalk names from one source to another.
It is updated regulary through an automated process in which the Catalogue of Life acts as a starting point also providing the complete higher classification above families. Additional scientific names only found in other authoritative nomenclatural and taxonomic datasets are then merged into the tree, thus extending the original catalogue and broadening the backbones name coverage. The GBIF Backbone taxonomy also includes identifiers for Operational Taxonomic Units (OTUs) drawn from the barcoding resources iBOL and UNITE.
International Barcode of Life project (iBOL), Barcode Index Numbers (BINs). BINs are connected to a taxon name and its classification by taking into account all names applied to the BIN and picking names with at least 80% consensus. If there is no consensus of name at the species level, the selection process is repeated moving up the major Linnaean ranks until consensus is achieved.
UNITE - Unified system for the DNA based fungal species, Species Hypotheses (SHs). SHs are connected to a taxon name and its classification based on the determination of the RefS (reference sequence) if present or the RepS (representative sequence). In the latter case, if there is no match in the UNITE taxonomy, the lowest rank with 100% consensus within the SH will be used.
The GBIF Backbone Taxonomy is available for download at https://hosted-datasets.gbif.org/datasets/backbone/ in different formats together with an archive of all previous versions.
The following 105 sources have been used to assemble the GBIF backbone with number of names given in brackets:
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Xiphosura of the world
Xiphosura
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The Global Biodiversity Information Facility (GBIF) indexes thousands of biodiversity datasets from Natural History Collections, citizen science initiatives (e.g., iNaturalist, eBird), and other sources. As part of the index process, GBIF associates at least two identifiers with indexed records: a record id (aka gbifID) and a dataset id (aka dataset key). These ids are central to do lookup, reference data, and package interpreted data products.
This publication contains an exhaustive list of GBIF IDs and ids associated by their data providers as derived from:
GBIF.org (01 March 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.pk3trq
The resource (size: ~260GB) provided by GBIF had content id hash://sha256/c8bac8acb28c8524c53589b3a40e322dbbbdadf5689fef2e20266fbf6ddf6b97 and was used to generate the resource included in this publication using
preston cat 'zip:hash://sha256/c8bac8acb28c8524c53589b3a40e322dbbbdadf5689fef2e20266fbf6ddf6b97!/0015281-230224095556074.csv'
| cut -f 1,2,3,37,38,39
| gzip\
gbifid.tsv.gz
with the content id of gbifid.tsv.gz (size: ~35GB) being hash://sha256/a339e32e10edaad585f61f2ded06cbb23e0618c65a6360db18d7d729054940a8 .
the first 10 lines of gbifid.tsv.gz as extracted via
preston cat --remote https://zenodo.org/record/7789866/files,https://linker.bio hash://sha256/a339e32e10edaad585f61f2ded06cbb23e0618c65a6360db18d7d729054940a8
| gunzip
| head
are:
gbifID datasetKey occurrenceID institutionCode collectionCode catalogNumber 2997162320 c71c8000-9fc7-422c-804a-ce6abe751771 3399442 CEPEC CEPEC CEPEC00109669 2997162309 c71c8000-9fc7-422c-804a-ce6abe751771 2733085 CEPEC CEPEC CEPEC00000818 2997162317 c71c8000-9fc7-422c-804a-ce6abe751771 2733086 CEPEC CEPEC CEPEC00000888 2997162313 c71c8000-9fc7-422c-804a-ce6abe751771 3399443 CEPEC CEPEC CEPEC00109744 2997162306 c71c8000-9fc7-422c-804a-ce6abe751771 2733087 CEPEC CEPEC CEPEC00000889 2997162316 c71c8000-9fc7-422c-804a-ce6abe751771 3399440 CEPEC CEPEC CEPEC00109605 2997162324 c71c8000-9fc7-422c-804a-ce6abe751771 2733088 CEPEC CEPEC CEPEC00000890 2997162308 c71c8000-9fc7-422c-804a-ce6abe751771 3399441 CEPEC CEPEC CEPEC00109615 2997162303 c71c8000-9fc7-422c-804a-ce6abe751771 2733089 CEPEC CEPEC CEPEC00000891
Note that at time of writing, the html resource associated with the occurrence id 2997162320, and data set key c71c8000-9fc7-422c-804a-ce6abe751771 (extracted from of the first data row example above) are available via:
https://gbif.org/occurrence/2997162320
and
https://gbif.org/dataset/c71c8000-9fc7-422c-804a-ce6abe751771
respectively.
This resource was initially created to help integrate with Bionomia (https://bionomia.net) to help associate people identifiers provided by bionomia to their original records via their GBIF ids. Bionomia re-uses GBIF records ids as a way to define links between records and the people (e.g., curators, collectors, identifiers) that worked on them.
In other words, this resource provides a versioned translation table from the GBIF data universe (as defined by GBIF record ids, and dataset keys) to the data collections that exist (and evolve) independent of it.
Note that the resource identified by hash://sha256/c8bac8acb28c8524c53589b3a40e322dbbbdadf5689fef2e20266fbf6ddf6b97 was not included in this publication it was too big (260GB) to fit. You may be able to retrieve the resource from its original location at https://api.gbif.org/v1/occurrence/download/request/0015281-230224095556074.zip .
Facebook
TwitterPublic Domain Mark 1.0https://creativecommons.org/publicdomain/mark/1.0/
License information was derived automatically
Dataset that provides a direct link to PNG's data hosted on the GBIF website/ records.
Contact emails: info@gbif.org / helpdesk@gbif.org
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
A dataset containing 797279757 species occurrences available in GBIF matching the query: All data. The dataset includes 797279757 records from 17347 constituent datasets: Please see http://www.gbif.org/occurrence/download/0000865-170822121629195 for full list of all constituents.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Mammals of the world
Mammals
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Publication date:
2022-12-06T07:37:19-06:00
A Repackaged Taxonomic Backbone of Global Biodiversity Information Facility (GBIF)
---
Global Biodiversity Information Facility (GBIF) facilitates access to billions of biodiversity data records. These records include detailed accounts of life on earth.
To help records of specific life forms, GBIF provides a taxonomic backbone [1,2]. This backbone contains a long list of names used to describe species and associated hierarchies and taxonomic publications. These lists are sourced from datasets around the world.
At time of writing (6 Dec 2022), GBIF publishes a simplified version of their taxonomic backbone at [https://hosted-datasets.gbif.org/datasets/backbone/](https://hosted-datasets.gbif.org/datasets/backbone/) [1].
This repository provides script to pre-process https://hosted-datasets.gbif.org/datasets/backbone/current/simple.txt.gz to help facilitate access and improve performance of the creation of search indexes.
Pre-process steps currently include:
1. reducing amount of columns
2. reverse sort by id
3. reverse sort by name
Contents
---
README:
this file
repackage-gbif-backbone.sh:
script used to repackage GBIF Simple Backbone.
repackage-gbif-backbone.log:
log of repackaging of GBIF Simple Backbone.
backbone-current-simple.txt.gz:
original GBIF backbone archive
gbif-backbone-by-name.tsv.gz:
two columns, gzipped, tab-separated text file with columns name, and id
reverse sorted by name
gbif-backbone-by-name.tsv.sha256:
sha256 hash of the uncompressed gbif-backbone-by-name.tsv.gz
gbif-backbone-by-id.tsv.gz:
20 columns, gzipped, tab-separated text file with first 20 columns of repackaged GBIF backbone file
reverse sorted by id
gbif-backbone-by-id.tsv.sha256:
sha256 hash of the uncompressed gbif-backbone-by-id.tsv.gz
References
---
[1] Simplied GBIF Backbone Taxonomy. Accessed at https://hosted-datasets.gbif.org/datasets/backbone/ on 2022-12-06.
[2] GBIF Secretariat (2021). GBIF Backbone Taxonomy. Checklist dataset https://doi.org/10.15468/39omei accessed via GBIF.org on 2021-08-18.
Hash URIs
---
This publication includes the following content uris:
hash://sha256/82d5f2153b4533322692d95eeb18b0f103e1b2297e38bd9ea935b07ba86cd7d5
hash://sha256/50c155f66efb2efba0b8b624f8541e81cbe16a701d420a5073791fb993f72919
hash://sha256/9cd7d4c91292d86c726210446cd6fe45602505a7c0ea3b7c4f4f481f85f193ad (uncompressed)
hash://sha256/f950dde25cce9ba9cce67caa1c68ce0c99cb31fe2dc9658fec85a987d9f31654
hash://sha256/f21c6b90f17c6083fcfb4853f3c581dcc2aadd291691fa128392a205321f420b (uncompressed)
hash://sha256/5e0a4d1d2d1cccbdcc6b2c9831fafe61c54eb055f2d13ec40d9ac161889b9f89
hash://sha256/f6e477133d0585706ee5522963b204200cb3cd198f011cbf62be0fa8519763b5 (uncompressed)
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This dataset contains occurrence data of flora and fauna species. From the Netherlands on a 5 x 5 km scale, data from other countries are exact. Observations from Belgium are excluded and can be accessed on GBIF through Natuurpunt and Natagora. It summarizes the observations recorded by >175.000 volunteers.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
A dataset containing 702745831 species occurrences available in GBIF matching the query: All data. The dataset includes 702745831 records from 14508 constituent datasets: Please see http://www.gbif.org/occurrence/download/0042046-160910150852091 for full list of all constituents.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
A dataset containing 976783221 species occurrences available in GBIF matching the query: All data. The dataset includes 976783221 records from 19279 constituent datasets: Please see https://www.gbif.org/occurrence/download/0001762-180412121330197 for full list of all constituents.
Facebook
TwitterTraffic analytics, rankings, and competitive metrics for gbif.org as of November 2025
Facebook
Twitterhttps://semrush.ebundletools.com/company/legal/terms-of-service/https://semrush.ebundletools.com/company/legal/terms-of-service/
gbif.org is ranked #7412 in MX with 614.09K Traffic. Categories: Science. Learn more about website traffic, market share, and more!
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This data publication contains a versioned archive of records used to extract mammalian trait data for the NSF funded Ranges Project. These data document the origin of the derived mammalian trait data, a core product of the Ranges project.
These records were compiled using the download service of the Global Biodiversity Information Facility (GBIF, https://gbif.org, [1]) as shown in the table below. Then, the resulting Darwin Core Archives associated with these compiled GBIF records were archived using Preston [2,3], a biodiversity data tracking tool.
Ranges is an NSF-funded project that seeks to digitize traits from over one million mammal specimens from 19 natural history museums, with a focus on western North America.
The project will allow researchers to build better baselines for biodiversity and improve predictions of how mammals respond to changing environments to address major digitization challenges, expand the utility of specimens and use them to create new scientific knowledge.
| Date | Institution | Download Citation | DOI | Downloaded by | Notes |
| 16Dec2023 | UNR | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.z35kxh | https://doi.org/10.15468/dl.z35kxh | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | UWYMV | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.bv2b4u | https://doi.org/10.15468/dl.bv2b4u | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | HSU Vert Museum | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.f5dvwv | https://doi.org/10.15468/dl.f5dvwv | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | ASU | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.gsaz93 | https://doi.org/10.15468/dl.gsaz93 | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | CSULB | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.5z35e5 | https://doi.org/10.15468/dl.5z35e5 | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | UMZM | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.9jym5b | https://doi.org/10.15468/dl.9jym5b | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | CAS | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.nh67uf | https://doi.org/10.15468/dl.nh67uf | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | DMNS | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.2kv7cs | https://doi.org/10.15468/dl.2kv7cs | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | FMNH | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.m58jzk | https://doi.org/10.15468/dl.m58jzk | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | TCWC | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.ze6gmy | https://doi.org/10.15468/dl.ze6gmy | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | NHMU | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.ycsyqf | https://doi.org/10.15468/dl.ycsyqf | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | UMMZ | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.t3zmw6 | https://doi.org/10.15468/dl.t3zmw6 | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | UWBM | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.g9edtd | https://doi.org/10.15468/dl.g9edtd | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | LACM | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.uj984s | https://doi.org/10.15468/dl.uj984s | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | KU | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.vr3upy | https://doi.org/10.15468/dl.vr3upy | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | TTU | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.cdtfn9 | https://doi.org/10.15468/dl.cdtfn9 | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | UAM | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.xuwz73 | https://doi.org/10.15468/dl.xuwz73 | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | MVZ | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.7aujag | https://doi.org/10.15468/dl.7aujag | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
| 16Dec2023 | MSB | GBIF.org (16 December 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.7p2tty | https://doi.org/10.15468/dl.7p2tty | db | Query for records Mammalia only, countries: US, CA, MX - Not all collectioons have records from all three countries. Must d/l DwC-A. |
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
A dataset containing 705448282 species occurrences available in GBIF matching the query: All data. The dataset includes 705448282 records from 15401 constituent datasets: Please see http://www.gbif.org/occurrence/download/0058761-160910150852091 for full list of all constituents.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Vascular plants collection at SANT Herbarium
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The Fish Collection at the Auburn University Museum of Natural History has fishes from all over the world. The collection has particularly large holdings of fishes from the Southeastern US and South America. Approximately 750,000 preserved specimens in more than 72,000 lots are cataloged in the collection.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The vertebrate paleontology program at the University of Kansas has, for over a century, sustained a national and international reputation. The reputation of the collection has been based more on intensive use than on sheer size. We now hold over 150,000 cataloged specimens (>75,000 are available in digital format) and around 400 publications related to our collections have been published in the last 35 years. Research strengths include: Paleozoic and Mesozoic fishes, Paleozoic tetrapods, Mesozoic marine vertebrates, Cenozoic small mammals and Natural Trap Cave fauna.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This citizen science web site is focused on expanding our understanding of the distribution, biogeography, biodiversity, and identification of Odonata (dragonflies and damselflies) in the Western Hemisphere. Citizen scientists are encouraged to contribute their Odonata observations to the database using a web-based interface for submission. These observations may be human observation only or may be vouchered by photographs or specimens. An expert team of vetters reviews records of the rarer species and either accepts or declines the record based on the documentation. Declined records are not uploaded to GBIF. Odonata Central was started in 2003 but older records are accepted into the database.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A collection of marine biological survey data collated from literature.
Facebook
TwitterThe Global Biodiversity Information Facility (GBIF) is an international network and data infrastructure funded by the world's governments providing global data that document the occurrence of species. GBIF currently integrates datasets documenting over 1.6 billion species occurrences, growing daily. The GBIF occurrence dataset combines data from a wide array of sources including specimen-related data from natural history museums, observations from citizen science networks and environment recording schemes. While these data are constantly changing at GBIF.org, periodic snapshots are taken and made available on AWS.