Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides values for GDP reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.
The United States Census Bureau’s international dataset provides estimates of country populations since 1950 and projections through 2050. Specifically, the dataset includes midyear population figures broken down by age and gender assignment at birth. Additionally, time-series data is provided for attributes including fertility rates, birth rates, death rates, and migration rates.
You can use the BigQuery Python client library to query tables in this dataset in Kernels. Note that methods available in Kernels are limited to querying data. Tables are at bigquery-public-data.census_bureau_international.
What countries have the longest life expectancy? In this query, 2016 census information is retrieved by joining the mortality_life_expectancy and country_names_area tables for countries larger than 25,000 km2. Without the size constraint, Monaco is the top result with an average life expectancy of over 89 years!
SELECT
age.country_name,
age.life_expectancy,
size.country_area
FROM (
SELECT
country_name,
life_expectancy
FROM
bigquery-public-data.census_bureau_international.mortality_life_expectancy
WHERE
year = 2016) age
INNER JOIN (
SELECT
country_name,
country_area
FROM
bigquery-public-data.census_bureau_international.country_names_area
where country_area > 25000) size
ON
age.country_name = size.country_name
ORDER BY
2 DESC
/* Limit removed for Data Studio Visualization */
LIMIT
10
Which countries have the largest proportion of their population under 25? Over 40% of the world’s population is under 25 and greater than 50% of the world’s population is under 30! This query retrieves the countries with the largest proportion of young people by joining the age-specific population table with the midyear (total) population table.
SELECT
age.country_name,
SUM(age.population) AS under_25,
pop.midyear_population AS total,
ROUND((SUM(age.population) / pop.midyear_population) * 100,2) AS pct_under_25
FROM (
SELECT
country_name,
population,
country_code
FROM
bigquery-public-data.census_bureau_international.midyear_population_agespecific
WHERE
year =2017
AND age < 25) age
INNER JOIN (
SELECT
midyear_population,
country_code
FROM
bigquery-public-data.census_bureau_international.midyear_population
WHERE
year = 2017) pop
ON
age.country_code = pop.country_code
GROUP BY
1,
3
ORDER BY
4 DESC /* Remove limit for visualization*/
LIMIT
10
The International Census dataset contains growth information in the form of birth rates, death rates, and migration rates. Net migration is the net number of migrants per 1,000 population, an important component of total population and one that often drives the work of the United Nations Refugee Agency. This query joins the growth rate table with the area table to retrieve 2017 data for countries greater than 500 km2.
SELECT
growth.country_name,
growth.net_migration,
CAST(area.country_area AS INT64) AS country_area
FROM (
SELECT
country_name,
net_migration,
country_code
FROM
bigquery-public-data.census_bureau_international.birth_death_growth_rates
WHERE
year = 2017) growth
INNER JOIN (
SELECT
country_area,
country_code
FROM
bigquery-public-data.census_bureau_international.country_names_area
Historic (none)
United States Census Bureau
Terms of use: This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - http://www.data.gov/privacy-policy#data_policy - and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
See the GCP Marketplace listing for more details and sample queries: https://console.cloud.google.com/marketplace/details/united-states-census-bureau/international-census-data
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset presents median household incomes for various household sizes in Brazos Country, TX, as reported by the U.S. Census Bureau. The dataset highlights the variation in median household income with the size of the family unit, offering valuable insights into economic trends and disparities within different household sizes, aiding in data analysis and decision-making.
Key observations
https://i.neilsberg.com/ch/brazos-country-tx-median-household-income-by-household-size.jpeg" alt="Brazos Country, TX median household income, by household size (in 2022 inflation-adjusted dollars)">
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
Household Sizes:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Brazos Country median household income. You can refer the same here
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides values for GDP reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.
This paper presents the largest globally comparable panel database of education quality. The database includes 163 countries and regions over 1965-2015. The globally comparable achievement outcomes were constructed by linking standardized, psychometrically-robust international and regional achievement tests. The paper contributes to the literature in the following ways: (1) it is the largest and most current globally comparable data set, covering more than 90 percent of the global population; (2) the data set includes 100 developing areas and the most developing countries included in such a data set to date -- the countries that have the most to gain from the potential benefits of a high-quality education; (3) the data set contains credible measures of globally comparable achievement distributions as well as mean scores; (4) the data set uses multiple methods to link assessments, including mean and percentile linking methods, thus enhancing the robustness of the data set; (5) the data set includes the standard errors for the estimates, enabling explicit quantification of the degree of reliability of each estimate; and (6) the data set can be disaggregated across gender, socioeconomic status, rural/urban, language, and immigration status, thus enabling greater precision and equity analysis. A first analysis of the data set reveals a few important trends: learning outcomes in developing countries are often clustered at the bottom of the global scale; although variation in performance is high in developing countries, the top performers still often perform worse than the bottom performers in developed countries; gender gaps are relatively small, with high variation in the direction of the gap; and distributions reveal meaningfully different trends than mean scores, with less than 50 percent of students reaching the global minimum threshold of proficiency in developing countries relative to 86 percent in developed countries. The paper also finds a positive and significant association between educational achievement and economic growth. The data set can be used to benchmark global progress on education quality, as well as to uncover potential drivers of education quality, growth, and development.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset presents median household incomes for various household sizes in Lost Nation, IA, as reported by the U.S. Census Bureau. The dataset highlights the variation in median household income with the size of the family unit, offering valuable insights into economic trends and disparities within different household sizes, aiding in data analysis and decision-making.
Key observations
https://i.neilsberg.com/ch/lost-nation-ia-median-household-income-by-household-size.jpeg" alt="Lost Nation, IA median household income, by household size (in 2022 inflation-adjusted dollars)">
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
Household Sizes:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Lost Nation median household income. You can refer the same here
The QoG Institute is an independent research institute within the Department of Political Science at the University of Gothenburg. Overall 30 researchers conduct and promote research on the causes, consequences and nature of Good Governance and the Quality of Government - that is, trustworthy, reliable, impartial, uncorrupted and competent government institutions.
The main objective of our research is to address the theoretical and empirical problem of how political institutions of high quality can be created and maintained. A second objective is to study the effects of Quality of Government on a number of policy areas, such as health, the environment, social policy, and poverty.
QoG Standard Dataset is the largest dataset consisting of more than 2,000 variables from sources related to the Quality of Government. The data exist in both time-series (year 1946 and onwards) and cross-section (year 2020). Many of the variables are available in both datasets, but some are not. The datasets draws on a number of freely available data sources related to QoG and its correlates.
In the QoG Standard CS dataset, data from and around 2020 is included. Data from 2020 is prioritized; however, if no data is available for a country for 2020, data for 2021 is included. If no data exists for 2021, data for 2019 is included, and so on up to a maximum of +/- 3 years.
In the QoG Standard TS dataset, data from 1946 and onwards is included and the unit of analysis is country-year (e.g., Sweden-1946, Sweden-1947, etc.).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Tanzania is one of the mega-biodiversity rich countries globally. The country located in the tropical region of African and favourable climatic condition supports high biodiversity. It has extensive species diversity with at least 14,500 known species (7,714 plants) and is among 15 countries globally with the highest number of endemic and threatened species. University of Dar es salaam Herbarium (DSM) is in the Dar es Salaam region, Tanzania, and it holds collections of preserved plants and Chromista specimens collected since February 1928. The Herbarium continues with the collection of samples of biota for research and teaching.
This dataset comprises 3500 occurrence records of specimens of the Rubiaceae family preserved at the University of Dar es Salaam Herbarium. The family Rubiaceae consists of about 13,500 species in about 620 genera of terrestrial trees, shrubs, lianas, or herbs, making it the fourth-largest angiosperm globally. The dataset covers essential biodiversity information, including taxonomic, geographic, temporal coverage of 352 species in 39 genera collected from different parts of Tanzania from February 1928 to November 2020. Coffea is one of the genera included in this dataset. The genus is valuable for its commercial value products traded commodity used in food, cosmetic, and pharmaceutical industries due to its caffeine and high polyphenol content. The dataset provides IUCN red list information of the assessed species for research and conservation management. The dataset consists of fifty threatened, three hundred thirty-four near threatened, 85 least concern and 33 species have Data Deficiency.
Information in this dataset was drawn from the herbarium sheet labels and transformed into Darwin Core Standard. Darwin Core quick reference guide aided the development of an Excel sheet of 63 columns and 3004 rows for data digitisation. Each row contains information on a particular Rubiaceae preserved specimen. The information about the data publisher, institution code, herbarium code and Catalogue number used in developing the occurrence. The GEOLocate Web-Based Clients used to translate textual locality into geographic coordinates. GBIF species matching, GBIF validator and Darwin Core Archive Assistant tools aided the mapping and validation of the dataset. Integrated Publishing Toolkit (IPT) enabled open access of the dataset on preserved specimens occurrence records in the Department of Botany, the University of Dar Es Salaam, formerly locked.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides values for GDP reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.
Techsalerator’s Import/Export Trade Data for North America
Techsalerator’s Import/Export Trade Data for North America delivers an exhaustive and nuanced analysis of trade activities across the North American continent. This extensive dataset provides detailed insights into import and export transactions involving companies across various sectors within North America.
Coverage Across All North American Countries
The dataset encompasses all key countries within North America, including:
The dataset provides detailed trade information for the United States, the largest economy in the region. It includes extensive data on trade volumes, product categories, and the key trading partners of the U.S. 2. Canada
Data for Canada covers a wide range of trade activities, including import and export transactions, product classifications, and trade relationships with major global and regional partners. 3. Mexico
Comprehensive data for Mexico includes detailed records on its trade activities, including exports and imports, key sectors, and trade agreements affecting its trade dynamics. 4. Central American Countries:
Belize Costa Rica El Salvador Guatemala Honduras Nicaragua Panama The dataset covers these countries with information on their trade flows, key products, and trade relations with North American and international partners. 5. Caribbean Countries:
Bahamas Barbados Cuba Dominica Dominican Republic Grenada Haiti Jamaica Saint Kitts and Nevis Saint Lucia Saint Vincent and the Grenadines Trinidad and Tobago Trade data for these Caribbean nations includes detailed transaction records, sector-specific trade information, and their interactions with North American trade partners. Comprehensive Data Features
Transaction Details: The dataset includes precise details on each trade transaction, such as product descriptions, quantities, values, and dates. This allows for an accurate understanding of trade flows and patterns across North America.
Company Information: It provides data on companies involved in trade, including names, locations, and industry sectors, enabling targeted business analysis and competitive intelligence.
Categorization: Transactions are categorized by industry sectors, product types, and trade partners, offering insights into market dynamics and sector-specific trends within North America.
Trade Trends: Historical data helps users analyze trends over time, identify emerging markets, and assess the impact of economic or political events on trade flows in the region.
Geographical Insights: The data offers insights into regional trade flows and cross-border dynamics between North American countries and their global trade partners, including significant international trade relationships.
Regulatory and Compliance Data: Information on trade regulations, tariffs, and compliance requirements is included, helping businesses navigate the complex regulatory environments within North America.
Applications and Benefits
Market Research: Companies can leverage the data to discover new market opportunities, analyze competitive landscapes, and understand demand for specific products across North American countries.
Strategic Planning: Insights from the data enable companies to refine trade strategies, optimize supply chains, and manage risks associated with international trade in North America.
Economic Analysis: Analysts and policymakers can monitor economic performance, evaluate trade balances, and make informed decisions on trade policies and economic development strategies.
Investment Decisions: Investors can assess trade trends and market potentials to make informed decisions about investments in North America's diverse economies.
Techsalerator’s Import/Export Trade Data for North America offers a vital resource for organizations involved in international trade, providing a thorough, reliable, and detailed view of trade activities across the continent.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides values for GOLD RESERVES reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset presents the mean household income for each of the five quintiles in Town And Country, MO, as reported by the U.S. Census Bureau. The dataset highlights the variation in mean household income across quintiles, offering valuable insights into income distribution and inequality.
Key observations
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
Income Levels:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Town And Country median household income. You can refer the same here
Census data reveals that population density varies noticeably from area to area. Small area census data do a better job depicting where the crowded neighborhoods are. In this map, the yellow areas of highest density range from 30,000 to 150,000 persons per square kilometer. In those areas, if the people were spread out evenly across the area, there would be just 4 to 9 meters between them. Very high density areas exceed 7,000 persons per square kilometer. High density areas exceed 5,200 persons per square kilometer. The last categories break at 3,330 persons per square kilometer, and 1,500 persons per square kilometer.This dataset is comprised of multiple sources. All of the demographic data are from Michael Bauer Research with the exception of the following countries:Australia: Esri Australia and MapData ServicesCanada: Esri Canada and EnvironicsFrance: Esri FranceGermany: Esri Germany and NexigaIndia: Esri India and IndicusJapan: Esri JapanSouth Korea: Esri Korea and OPENmateSpain: Esri España and AISUnited States: Esri Demographics
https://pacific-data.sprep.org/dataset/data-portal-license-agreements/resource/de2a56f5-a565-481a-8589-406dc40b5588https://pacific-data.sprep.org/dataset/data-portal-license-agreements/resource/de2a56f5-a565-481a-8589-406dc40b5588
The population was compiled from available census reports and validated using other available datasets. For each country, population counts from the finest resolution was trended to 2010 using a country-specific annual growth rate assumptions. Underlying vector geometry comes from regional sources, primarily SPC. Primary Data Source(s): PopGIS, Federated States of Micronesia Division of Statistics Secondary Data Source(s): None Geographical Resolutions Available (with count): 1. State (4) 2. Municipality (421) 3. Electoral District (373) Additional Comments: 1. The Electoral District and Municipality geographical resolutions are not related to each other and are roughly at the same level of granularity but with different defined boundaries. 2. This population database is misaligned due to the source data provided in the SPC?s PopGIS data set. This misalignment is not linear and the largest measured misalignment in a significantly populated region is approximately 500 meters. After the creation of this deliverable we received updated boundary files from SPC. These boundary files have not been integrated into the delivered population database. Complied by AIR Worldwide
Varieties of Democracy (V-Dem) is a new approach to conceptualizing and measuring democracy. It is a collaboration among more than 50 scholars worldwide which is co-hosted by the Department of Political Science at the University of Gothenburg, Sweden; and the Kellogg Institute at the University of Notre Dame, USA.
With four Principal Investigators, two Program managers, fifteen Project Managers, more than thirty Regional Managers, almost 200 Country Coordinators, and approximately 2,800 Country Experts, the V-Dem project is one of the largest social science data collection projects focusing on research.
V-Dem collects data for 350+ indicators across a wide range of democracy aspects. Electoral democracy is in the centre and linked to this concept we find six additional dimensions of democracy: liberal, majoritarian, deliberative, participatory, consensual and egalitarian. In addition to a number of main indices, data is broken down into a number of components that are available to the user along with all indicators. Through the unique character of the database, old and new questions about the nature, growth and survival of democracy can be tested in a way not possible before.
Data is available for 177 countries from 1900 to 2016. Altogether, the database consists of approximately 17 million data points. The database is updated annually and new datasets are launched every year in the spring.
The dataset is available for download here: https://www.v-dem.net/en/data/data-version-7-1/
The data can also be explored online via: https://www.v-dem.net/en/analysis/
Purpose:
The world's largest database on democracy. The database provides 350+ indicators for 177 countries 1900-2016.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Summary
This dataset accompanies the publication "Flock size and structure influence reproductive success in four species of flamingo in 540 captive populations worldwide" published in Zoo Biology. It contains anonymised data from 540 captive flamingo populations, and includes the four species: Phoeniconaias minor, Phoenicopterus chilensis, Phoenicopterus roseus and Phoenicopterus ruber. Data were sourced from the Zoological Information Management System (ZIMS), operated by Species360 (https://www.species360.org/). ZIMS is the largest real-time database of comprehensive and standardized information spanning more than 1,200 zoological collections globally, and provides the number of institutions currently managing each flamingo species and both their current and historic population sizes. These data were used to investigate the relationship between reproductive success and both flock size, and structure, on a global scale.
This dataset also contains climatic data provided by WorldClim, which were used to assess the influence of climatic variables on captive flamingo reproductive success globally. The WorldClim database averages 19 different climatic variables derived from monthly temperature and rainfall values at a 1 km spatial resolution for the period 1970-2000. Using geographic coordinates (latitude and longitude) we calculated several climatic metrics for each institution.
Description of the Dataset
One file is provided for each species (P. minor, P. chilensis, P. roseus and P. ruber) as a csv file. Each file contains the following 15 columns:
Note: Mean Annual Temperature (MAT) is provided by WorldClim as °C multiplied by 10, and similarly mean annual variation in temperature as MAT standard deviation multiplied by 100. In the corresponding publication, both were divided (by 10 and 100 respectively) prior to modelling to avoid confusion in the units used.
Acknowledgements
We acknowledge and thank all Species360 member institutions for their continued support and data input. The research which data refers to was funded by the Irish Research Council Laureate Awards 2017/2018 IRCLA/2017/60 to Y.M.B. Additionally, S.Q.S. received funding from the International Max Planck Research School for Organismal Biology. The Species360 Conservation Science Alliance would like to thank their sponsors: the World Association of Zoos and Aquariums, Wildlife Reserves of Singapore, and Copenhagen Zoo.
Disclaimer
Despite our best efforts at screening the data for errors and inconsistencies, some information could be erroneous. Similarly, data contained within ZIMS are based on submitted records from individual institutions, and are not subject to editorial verification, potentially permitting errors or failure to update species holdings etc. Despite this, ZIMS represents the only global database of zoo collection composition records, and as a result, is used by the IUCN, Convention on International Trade in Endangered Species (CITES), the Wildlife Trade Monitoring Network (TRAFFIC), United States Fish and Wildlife Service (USFWS) and Department for Environment, Food and Rural Affairs (DEFRA).
Credit
If you use this dataset, please cite the corresponding publication:
Mooney, A., Teare, J. A., Staerk, J.,Smeele, S. Q., Rose, P., Edell, R. H., King, C. E., Conrad, L., & Buckley, Y. M. (2023). Flock size and structure influence reproductive success in four species of flamingo in 540 captive populations worldwide. Zoo Biology, 1–14. https://doi.org/10.1002/zoo.21753
Country scientific indicators developed from the information contained in the Scopus® database (Elsevier B.V.). These indicators can be used to assess and analyze scientific domains. Country rankings may be compared or analysed separately. Indicators offered for each country: H Index, Documents, Citations, Citation per Document and Citable Documents.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The average for 2023 based on 193 countries was -0.07 points. The highest value was in Liechtenstein: 1.61 points and the lowest value was in Syria: -2.75 points. The indicator is available from 1996 to 2023. Below is a chart for all countries where data are available.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the Country Club Hills population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Country Club Hills. The dataset can be utilized to understand the population distribution of Country Club Hills by age. For example, using this dataset, we can identify the largest age group in Country Club Hills.
Key observations
The largest age group in Country Club Hills, MO was for the group of age 25 to 29 years years with a population of 109 (10.72%), according to the ACS 2019-2023 5-Year Estimates. At the same time, the smallest age group in Country Club Hills, MO was the 75 to 79 years years with a population of 4 (0.39%). Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates
Age groups:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Country Club Hills Population by Age. You can refer the same here
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We present the GLOBAL ROADKILL DATA, the largest worldwide compilation of roadkill data on terrestrial vertebrates. We outline the workflow (Fig. 1) to illustrate the sequential steps of the study, in which we merged local-scale survey datasets and opportunistic records into a unified roadkill large dataset comprising 208,570 roadkill records. These records include 2283 species and subspecies from 54 countries across six continents, ranging from 1971 to 2024.Large roadkill datasets offer the advantage ofpreventing the collection of redundant data and are valuable resources for both local and macro-scale analyses regarding roadkill rates, road and landscape features associated with roadkill risk, species more vulnerable to road traffic, and populations at risk due to additional mortality. The standardization of data - such as scientific names, projection coordinates, and units - in a user-friendly format, makes themreadily accessible to a broader scientific and non-scientific community, including NGOs, consultants, public administration officials, and road managers. The open-access approach promotes collaboration among researchers and road practitioners, facilitating the replication of studies, validation of findings, and expansion of previous work. Moreover, researchers can utilize suchdatasets to develop new hypotheses, conduct meta-analyses, address pressing challenges more efficiently and strengthen the robustness of road ecology research. Ensuring widespreadaccess to roadkill data fosters a more diverse and inclusive research community. This not only grants researchers in emerging economies with more data for analysis, but also cultivates a diverse array of perspectives and insightspromoting the advance of infrastructure ecology.MethodsInformation sources: A core team from different continents performed a systematic literature search in Web of Science and Google Scholar for published peer-reviewed papers and dissertations. It was searched for the following terms: “roadkill* OR “road-kill” OR “road mortality” AND (country) in English, Portuguese, Spanish, French and/or Mandarin. This initiative was also disseminated to the mailing lists associated with transport infrastructure: The CCSG Transport Working Group (WTG), Infrastructure & Ecology Network Europe (IENE) and Latin American & Caribbean Transport Working Group (LACTWG) (Fig. 1). The core team identified 750 scientific papers and dissertations with information on roadkill and contacted the first authors of the publications to request georeferenced locations of roadkill andofferco-authorship to this data paper. Of the 824 authors contacted, 145agreed to sharegeoreferenced roadkill locations, often involving additional colleagues who contributed to data collection. Since our main goal was to provide open access to data that had never been shared in this format before, data from citizen science projects (e.g., globalroakill.net) that are already available were not included.Data compilation: A total of 423 co-authors compiled the following information: continent, country, latitude and longitude in WGS 84 decimal degrees of the roadkill, coordinates uncertainty, class, order, family, scientific name of the roadkill, vernacular name, IUCN status, number of roadkill, year, month, and day of the record, identification of the road, type of road, survey type, references, and observers that recorded the roadkill (Supplementary Information Table S1 - description of the fields and Table S2 - reference list). When roadkill data were derived from systematic surveys, the dataset included additional information on road length that was surveyed, latitude and longitude of the road (initial and final part of the road segment), survey period, start year of the survey, final year of the survey, 1st month of the year surveyed, last month of the year surveyed, and frequency of the survey. We consolidated 142 valid datasets into a single dataset. We complemented this data with OccurenceID (a UUID generated using Java code), basisOfRecord, countryCode, locality using OpenStreetMap’s API (https://www.openstreetmap.org), geodeticDatum, verbatimScientificName, Kingdom, phylum, genus, specificEpithet, infraspecificEpithet, acceptedNameUsage, scientific name authorship, matchType, taxonRank using Darwin Core Reference Guide (https://dwc.tdwg.org/terms/#dwc:coordinateUncertaintyInMeters) and link of the associatedReference (URL).Data standardization - We conducted a clustering analysis on all text fields to identify similar entries with minor variations, such as typos, and corrected them using OpenRefine (http://openrefine.org). Wealsostandardized all date values using OpenRefine. Coordinate uncertainties listed as 0 m were adjusted to either 30m or 100m, depending on whether they were recorded after or before 2000, respectively, following the recommendation in the Darwin Core Reference Guide (https://dwc.tdwg.org/terms/#dwc:coordinateUncertaintyInMeters).Taxonomy - We cross-referenced all species names with the Global Biodiversity Information Facility (GBIF) Backbone Taxonomy using Java and GBIF’s API (https://doi.org/10.15468/39omei). This process aimed to rectify classification errors, include additional fields such as Kingdom, Phylum, and scientific authorship, and gather comprehensive taxonomic information to address any gap withinthe datasets. For species not automatically matched (matchType - Table S1), we manually searched for correct synonyms when available.Species conservation status - Using the species names, we retrieved their conservation status and also vernacular names by cross-referencing with the database downloaded from the IUCNRed List of Threatened Species (https://www.iucnredlist.org). Species without a match were categorized as "Not Evaluated".Data RecordsGLOBAL ROADKILL DATA is available at Figshare27 https://doi.org/10.6084/m9.figshare.25714233. The dataset incorporates opportunistic (collected incidentally without data collection efforts) and systematic data (collected through planned, structured, and controlled methods designed to ensure consistency and reliability). In total, it comprises 208,570 roadkill records across 177,428 different locations(Fig. 2). Data were collected from the road network of 54 countries from 6 continents: Europe (n = 19), Asia (n = 16), South America (n=7), North America (n = 4), Africa (n = 6) and Oceania (n = 2).(Figure 2 goes here)All data are georeferenced in WGS84 decimals with maximum uncertainty of 5000 m. Approximately 92% of records have a location uncertainty of 30 m or less, with only 1138 records having location uncertainties ranging from 1000 to 5000 m. Mammals have the highest number of roadkill records (61%), followed by amphibians (21%), reptiles (10%) and birds (8%). The species with the highest number of records were roe deer (Capreolus capreolus, n = 44,268), pool frog (Pelophylax lessonae, n = 11,999) and European fallow deer (Dama dama, n = 7,426).We collected information on 126 threatened species with a total of 4570 records. Among the threatened species, the giant anteater (Myrmecophaga tridactyla, VULNERABLE) has the highest number of records n = 1199), followed by the common fire salamander (Salamandra salamandra, VULNERABLE, n=1043), and European rabbit (Oryctolagus cuniculus, ENDANGERED, n = 440). Records ranged from 1971 and 2024, comprising 72% of the roadkill recorded since 2013. Over 46% of the records were obtained from systematic surveys, with road length and survey period averaging, respectively, 66 km (min-max: 0.09-855 km) and 780 days (1-25,720 days).Technical ValidationWe employed the OpenStreetMap API through Java todetect location inaccuracies, andvalidate whether the geographic coordinates aligned with the specified country. We calculated the distance of each occurrence to the nearest road using the GRIP global roads database28, ensuring that all records were within the defined coordinate uncertainty. We verified if the survey duration matched the provided initial and final survey dates. We calculated the distance between the provided initial and final road coordinates and cross-checked it with the given road length. We identified and merged duplicate entries within the same dataset (same location, species, and date), aggregating the number of roadkills for each occurrence.Usage NotesThe GLOBAL ROADKILL DATA is a compilation of roadkill records and was designed to serve as a valuable resource for a wide range of analyses. Nevertheless, to prevent the generation of meaningless results, users should be aware of the followinglimitations:- Geographic representation – There is an evident bias in the distribution of records. Data originatedpredominantly from Europe (60% of records), South America (22%), and North America (12%). Conversely, there is a notable lack of records from Asia (5%), Oceania (1%) and Africa (0.3%). This dataset represents 36% of the initial contacts that provided geo-referenced records, which may not necessarily correspond to locations where high-impact roads are present.- Location accuracy - Insufficient location accuracy was observed for 1% of the data (ranging from 1000 to 5000 m), that was associated with various factors, such as survey methods, recording practices, or timing of the survey.- Sampling effort - This dataset comprised both opportunistic data and records from systematic surveys, with a high variability in survey duration and frequency. As a result, the use of both opportunistic and systematic surveys may affect the relative abundance of roadkill making it hard to make sound comparisons among species or areas.- Detectability and carcass removal bias - Although several studies had a high frequency of road surveys,the duration of carcass persistence on roads may vary with species size and environmental conditions, affecting detectability. Accordingly, several approaches account for survey frequency and target speciesto estimate more
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides values for GDP reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.