Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Related article: Bergroth, C., Järv, O., Tenkanen, H., Manninen, M., Toivonen, T., 2022. A 24-hour population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland. Scientific Data 9, 39.
In this dataset:
We present temporally dynamic population distribution data from the Helsinki Metropolitan Area, Finland, at the level of 250 m by 250 m statistical grid cells. Three hourly population distribution datasets are provided for regular workdays (Mon – Thu), Saturdays and Sundays. The data are based on aggregated mobile phone data collected by the biggest mobile network operator in Finland. Mobile phone data are assigned to statistical grid cells using an advanced dasymetric interpolation method based on ancillary data about land cover, buildings and a time use survey. The data were validated by comparing population register data from Statistics Finland for night-time hours and a daytime workplace registry. The resulting 24-hour population data can be used to reveal the temporal dynamics of the city and examine population variations relevant to for instance spatial accessibility analyses, crisis management and planning.
Please cite this dataset as:
Bergroth, C., Järv, O., Tenkanen, H., Manninen, M., Toivonen, T., 2022. A 24-hour population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland. Scientific Data 9, 39. https://doi.org/10.1038/s41597-021-01113-4
Organization of data
The dataset is packaged into a single Zipfile Helsinki_dynpop_matrix.zip which contains following files:
HMA_Dynamic_population_24H_workdays.csv represents the dynamic population for average workday in the study area.
HMA_Dynamic_population_24H_sat.csv represents the dynamic population for average saturday in the study area.
HMA_Dynamic_population_24H_sun.csv represents the dynamic population for average sunday in the study area.
target_zones_grid250m_EPSG3067.geojson represents the statistical grid in ETRS89/ETRS-TM35FIN projection that can be used to visualize the data on a map using e.g. QGIS.
Column names
YKR_ID : a unique identifier for each statistical grid cell (n=13,231). The identifier is compatible with the statistical YKR grid cell data by Statistics Finland and Finnish Environment Institute.
H0, H1 ... H23 : Each field represents the proportional distribution of the total population in the study area between grid cells during a one-hour period. In total, 24 fields are formatted as “Hx”, where x stands for the hour of the day (values ranging from 0-23). For example, H0 stands for the first hour of the day: 00:00 - 00:59. The sum of all cell values for each field equals to 100 (i.e. 100% of total population for each one-hour period)
In order to visualize the data on a map, the result tables can be joined with the target_zones_grid250m_EPSG3067.geojson data. The data can be joined by using the field YKR_ID as a common key between the datasets.
License Creative Commons Attribution 4.0 International.
Related datasets
Järv, Olle; Tenkanen, Henrikki & Toivonen, Tuuli. (2017). Multi-temporal function-based dasymetric interpolation tool for mobile phone data. Zenodo. https://doi.org/10.5281/zenodo.252612
Tenkanen, Henrikki, & Toivonen, Tuuli. (2019). Helsinki Region Travel Time Matrix [Data set]. Zenodo. http://doi.org/10.5281/zenodo.3247564
Facebook
TwitterREVISED 1/2/2019. SEE UPDATE LINK BELOW. This database contains unit cost information for different components that may be used to integrate distributed photovotaic D-PV systems onto distribution systems. Some of these upgrades and costs may also apply to integration of other distributed energy resources DER. Which components are required and how many of each is system-specific and should be determined by analyzing the effects of distributed PV at a given penetration level on the circuit of interest in combination with engineering assessments on the efficacy of different solutions to increase the ability of the circuit to host additional PV as desired. The current state of the distribution system should always be considered in these types of analysis. The data in this database was collected from a variety of utilities PV developers technology vendors and published research reports. Where possible we have included information on the source of each data point and relevant notes. In some cases where data provided is sensitive or proprietary we were not able to specify the source but provide other information that may be useful to the user e.g. year _location where equipment was installed. NREL has carefully reviewed these sources prior to inclusion in this database. Additional information about the database data sources and assumptions is included in the Unit_cost_database_guide.doc file included in this submission. This guide provides important information on what costs are included in each entry. Please refer to this guide before using the unit cost database for any purpose.
Facebook
TwitterKnowledge of the spatial distribution of populations is fundamental to management plans for any species. When tracking data are used to describe distributions, it is sometimes assumed that the reported locations of individuals delineate the spatial extent of areas used by the target population.
Here, we examine existing approaches to validate this assumption, highlight caveats, and propose a new method for a more informative assessment of the number of tracked animals (i.e. sample size) necessary to identify distribution patterns. We show how this assessment can be achieved by considering the heterogeneous use of habitats by a target species using the probabilistic property of a utilisation distribution. Our methods are compiled in the R package SDLfilter.
We illustrate and compare the protocols underlying existing and new methods using conceptual models and demonstrate an application of our approach using a large satellite tracking data-set of flatback turtles, Natator depre...
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Returns of the closing price of the Colcap, Bovespa and S&P index and Analysis program for MTAR model of the paper "Bayesian estimation of a multivariate TAR model when the noise process follows a Student-t distribution"
Facebook
TwitterThe Global Landslide Mortality Risks and Distribution is a 2.5 minute grid of global landslide mortality risks. Gridded Population of the World, Version 3 (GPWv3) data provide a baseline estimation of population per grid cell from which to estimate potential mortality risks due to landslide hazard. Mortality loss estimates per hazard event are caculated using regional, hazard-specific mortality records of the Emergency Events Database (EM-DAT) that span the 20 years between 1981 and 2000. Data regarding the frequency and distribution of landslide hazard are obtained from the Global Landslide Hazard Distribution data set. In order to more accurately reflect the confidence associated with the data and procedures, the potential mortality estimate range is classified into deciles, 10 classes of increasing risk with an approximately equal number of grid cells per class, producing a relative estimate of landslide-based mortality risks. This data set is the result of collaboration among the Columbia University Center for Hazards and Risk Research (CHRR), International Bank for Reconstruction and Development/The World Bank, and Columbia University Center for International Earth Science Information Network (CIESIN).
Facebook
TwitterThe Global Cyclone Mortality Risks and Distribution is a 2.5 minute grid of global cyclone mortality risks. Gridded Population of the World, Version 3 (GPWv3) data provide a baseline estimation of population per grid cell from which to estimate potential mortality loss. Mortality loss estimates per hazard event are calculated using regional, hazard-specific mortality records of the Emergency Events Database (EM-DAT) that span the 20 years between 1981 and 2000. Data regarding the frequency and distribution of cyclone hazard are obtained from the Global Cyclone Hazard Frequency and Distribution data set. In order to more accurately reflect the confidence associated with the data and procedures, the potential mortality estimate range is classified into deciles, 10 classes of an approximately equal number of grid cells, providing a relative estimate of cyclone-based mortality risks. This data set is the result of collaboration among the Columbia University Center for Hazards and Risk Research (CHRR), International Bank for Reconstruction and Development/The World Bank, and Columbia University Center for International Earth Science Information Network (CIESIN).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
ABSTRACT We build upon an already known but scarcely developed feature of growth theory: the importance of asset distribution in an aggregate production function. We elaborate on a simple model of two individuals, and then generalize its deductions to an extended model of n agents, concluding that perfectly distributed productive capital leads to positive and optimum long-run “endogenous” growth. Recent and classical empirical literature on the topic suggests this interpretation. In addition, we find exploratory panel data evidence that supports our theory of growth and distribution in a set of Latin American countries.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Blockchain data query: Falcon distribution data
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains the optimization results using resilience surrogate measures in four chosen cases (i.e., HAN, FOS, PES, MOD). It also includes mechanical reliability calculation results of optimized network layouts obtained by the surrogate measures.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Summer Steelhead Distribution October 2009 Version This dataset depicts observation-based stream-level geographic distribution of anadromous summer-run steelhead trout, Oncorhynchus mykiss irideus (O. mykiss), in California. It was developed for the express purpose of assisting with steelhead recovery planning efforts. The distributions reported in this dataset were derived from a subset of the data contained in the Aquatic Species Observation Database (ASOD), a Microsoft Access multi-species observation data capture application. ASOD is an ongoing project designed to capture as complete a set of statewide inland aquatic vertebrate species observation information as possible. Please note: A separate distribution is available for winter-run steelhead. Contact information is the same as for the above. ASOD Observation data were used to develop a network of stream segments. These lines are developed by "tracing down" from each observation to the sea using the flow properties of USGS National Hydrography Dataset (NHD) High Resolution hydrography. Lastly these lines, representing stream segments, were assigned a value of either Anad Present (Anadromous present). The end result (i.e., this layer) consists of a set of lines representing the distribution of steelhead based on observations in the Aquatic Species Observation Database. This dataset represents stream reaches that are known or believed to be used by steelhead based on steelhead observations. Thus, it contains only positive steelhead occurrences. The absence of distribution on a stream does not necessarily indicate that steelhead do not utilize that stream. Additionally, steelhead may not be found in all streams or reaches each year. This is due to natural variations in run size, water conditions, and other environmental factors. The information in this data set should be used as an indicator of steelhead presence/suspected presence at the time of the observation as indicated by the 'Late_Yr' (Latest Year) field attribute. The line features in the dataset may not represent the maximum extent of steelhead on a stream; rather it is important to note that this distribution most likely underestimates the actual distribution of steelhead. This distribution is based on observations found in the ASOD database. The individual observations may not have occurred at the upper extent of anadromous occupation. In addition, no attempt was made to capture every observation of O. mykiss and so it should not be assumed that this dataset is complete for each stream. The distribution dataset was built solely from the ASOD observational data. No additional data (habitat mapping, barriers data, gradient modeling, etc.) were utilized to either add to or validate the data. It is very possible that an anadromous observation in this dataset has been recorded above (upstream of) a barrier as identified in the Passage Assessment Database (PAD). In the near future, we hope to perform a comparative analysis between this dataset and the PAD to identify and resolve all such discrepancies. Such an analysis will add rigor to and help validate both datasets. This dataset has recently undergone a review. Data source contributors as well as CDFG fisheries biologists have been provided the opportunity to review and suggest edits or additions during a recent review. Data contributors were notified and invited to review and comment on the handling of the information that they provided. The distribution was then posted to an intranet mapping application and CDFG biologists were provided an opportunity to review and comment on the dataset. During this review, biologists were also encouraged to add new observation data. This resulting final distribution contains their suggestions and additions. Please refer to "Use Constraints" section below.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
http://inspire.ec.europa.eu/metadata-codelist/LimitationsOnPublicAccess/noLimitationshttp://inspire.ec.europa.eu/metadata-codelist/LimitationsOnPublicAccess/noLimitations
INSPIRE Priority Data Set (Compliant) - Alien species distribution
Facebook
TwitterOpen Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Since 2005, Fisheries and Oceans Canada has been collecting monitoring data for aquatic invasive species (e.g. https://open.canada.ca/data/en/dataset/8d87f574-0661-40a0-822f-e9eabc35780d, https://open.canada.ca/data/en/dataset/503a957e-7d6b-11e9-aef3-f48c505b2a29, https://open.canada.ca/data/en/dataset/8661edcf-f525-4758-a051-cb3fc8c74423). This monitoring data, as well additional occurrence information from online databases and the scientific literature, have been paired with high resolution environmental data and oceanographic models in species distribution models that predict the present-day and future potential distributions of 12 moderate to high risk invasive species on Canada’s east and west coasts. Future distributions were predicted for 2075, under Representative Concentration Pathway 8.5 from the Intergovernmental Panel on Climate Change’s fifth Assessment Report. Present-day and future richness of these species (i.e., hotspots) has also been estimated by summing their occurrence probabilities. This data set includes the occurrence locations of each species, the present-day and future species distribution modeling results for each species, and the estimated species richness. This research has been published in the scientific literature(Lyons et al. 2020). Lyons DA, Lowen JB, Therriault TW, Brickman D, Guo L, Moore AM, Peña MA, Wang Z, DiBacco C. (In Press) Identifying Marine Invasion Hotspots Using Stacked Species Distribution Models. Biological Invasions Cite this data as: Lyons DA., Lowen JB, Therriault TW., Brickman D., Guo L., Moore AM., Peña MA., Wang Z., DiBacco C. Data of: Species distribution models and occurrence data for marine invasive species hotspot identification. Published: November 2020. Coastal Ecosystems Science Division, Fisheries and Oceans Canada, Dartmouth, N.S. https://open.canada.ca/data/en/dataset/1bbd5131-8b34-4245-b999-3b4c4259d74f
Facebook
TwitterReptiles make up one-third of tetrapods, however they are often omitted from global conservation analyses. Understanding the determinants of reptile distribution is the foundation for reptile conservation research. We assembled a dataset on the distribution of 231 reptile species (nearly 50% of recorded species in China). We then investigated the association of species range filling (the proportion of observed ranges compared to species potential climate distributions) with climate, range size, topography, and human activity, using three regression methods. At the species level, we found the most primary factors influencing the recent distribution pattern of reptiles across China were the mean annual precipitation (MAP) and the mean annual temperature (MAT). In contrast, human activity came in last. Similarly, at aspatial level, MAP and MAT were still the most important factors. Geographically, the south and east of China support the highest reptile diversity, partially due to high prec..., 614 pieces of literature for reptile occurrence records in China Results of this analysis, , # Data from: Precipitation and temperature primarily determine the reptile distributions in China
https://doi.org/10.5061/dryad.x0k6djhtp
The first excel offer 614 pieces of literature for reptile occurrence records in China, the second excel offer results of analysis. Our data description is in excel files.
'NA' value in the results represent so that there's not not applicable.
The data that support the findings of this study are also available in supplementary material of this paper, refer 10.1111/ecog.07005.
Facebook
TwitterDistribution of diabetic retinopathy data set.
Facebook
TwitterAttribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F8676029%2F094ad6be5c855e931da3721967ec333a%2Fminiature-figures-7129617_1280.jpg?generation=1689328141763429&alt=media" alt="">
The World Bank, the World Inequality Database (WID), and the Luxembourg Income Study (LIS) are all sources of data on poverty and inequality. They differ in terms of the income measure they use, the countries they cover, and the frequency of their data updates.
The World Bank uses a measure of income after taxes and transfers, which is called disposable income. It covers a wide range of countries, but the data is not updated as frequently as the data from the other two sources. The WID uses a measure of net national income after taxes, which is called net national income per adult. It covers a smaller range of countries than the World Bank, but the data is updated more frequently. The LIS uses a measure of disposable household income per capita. It covers a smaller range of countries than the World Bank or the WID, but the data is very detailed and goes back further in time. In general, the LIS data is considered to be the most reliable source of data on poverty and inequality. However, the World Bank and WID data are also useful, especially for countries that are not covered by the LIS.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
An annotated check-list is given of Crinoidea species occurring deeper than 2000 m in the seas bordering Europe.
Facebook
TwitterThe relative importance of the different processes that determine the distribution of species and the assembly of communities is a key question in ecology. The distribution of any individual species is affected by a wide range of environmental variables as well as through interactions with other species; the resulting distributions determine the pool of species available to form local communities at fine spatial scales. A challenge in community ecology is that these interactions (e.g., competition, facilitation, etc.) often are not directly measurable. Here, we used Hierarchical Modelling of Species Communities (HMSC), a recently developed framework for joint species distribution modelling, to estimate the role of biotic effects alongside environmental factors using latent variables. We investigate the role of these factors determining species distributions in communities of Artiodactyla, Perissodactyla and Proboscidea in the Afrotropics, an area of peak species richness for hoofed mamm..., Data were collected as described in the manuscript.  , , # Data from: Climatic Variables Alone do not Determine Ungulate Distributions in the Afrotropics
This repository contains all data used for the paper "Climatic Variables Alone do not Determine Ungulate Distributions in the Afrotropics".
Presab_plus_climate_variables_PROTECTEDAREASONLY_Res10arcminutes.csv, _OpenHabitats.csv, _TropicalForests.csv
These three files contain presence/absence data for all ungulate species included in the analysis for all points in the Afrotropics at 10 arcminute resolution, with data derived from IUCN species range maps processed using the letsR package. Additionally, they contain environmental data for all points in the matrix for all 19 bioclimatic variables (from WorldClim) and land cover data (from the FAO Land and Water Division.)
Trait_Data_AllUngulates.csv
This file contains all trait data used in the calculation of Gower's Distance.
Other data (i...
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This data set is similar to gamma-ray spectroscopy data and is designed for machine-learning data analysis. This dataset is generated by computer.
In gamma-ray spectroscopy, data is generated by capturing the number of emissions within a specific channel range of the radiation emitted by the sample. In scientific data, the sample produces photopeaks exhibiting a Gaussian distribution when statistically examined. A Gaussian distribution (Normal distribution) is a probability distribution dependent on three parameters.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F7989877%2F3854c6aa9a72ee0d2558ee878194a7be%2FGauss_dis%20-%20Kopya.png?generation=1689283862004724&alt=media" alt="">
for more information: https://en.wikipedia.org/wiki/Normal_distribution
In Gamma Ray Spectroscopy
Co-60 Gamma-ray Spectroscopy Example
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F7989877%2F8fad8994bf11dca48657dc3d3e21f628%2Fco60-repc.png?generation=1689324671029928&alt=media" alt="">
cha_ : Number of radiations captured by the channel from 0 to 2000 with 10 intervals
Facebook
TwitterUnderstanding the spatial distribution of individuals is essential for effective species conservation. We investigated the spatio-temporal distribution of Aquila chrysaetos (golden eagle) in Finland using nest surveys, citizen science observations, and environmental data from 1982 to 2021. We extended a popular N-mixture model to estimate population abundance while accounting for the number of recently hatched nestlings, the spatial distribution, and correcting for biases in the observation process, all integrated into a two-stage Bayesian hierarchical model. Results of our model aligned with the previously known north-oriented distribution of A. chrysaetos in Finland, supporting its applicability to A. chrysaetos and other species with similar ecological requirements and life-history traits. Furthermore, we observed the highest densities of successful nests near open landscapes, such as marshes and peat bogs, whereas dense forests, port areas, and areas of high human popula..., , # Multimodal data helps in identifying spatio-temporal patterns and habitat associations of Aquila chrysaetos (Golden Eagle) in Finland
Dataset DOI: 10.5061/dryad.41ns1rnsg
This repository contains all data, code, and results for the two-stage simulation process described in our study. Each stage is organized into a separate folder with standardized file formats to ensure reproducibility and clarity.
All datasets are structured as matrices, where rows represent spatial sites and columns represent time steps (years). The dataset consists of Stage_I_II.zip folder.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
This list ranks the 3 cities in the Humphreys County, TN by Jordanian population, as estimated by the United States Census Bureau. It also highlights population changes in each city over the past five years.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Related article: Bergroth, C., Järv, O., Tenkanen, H., Manninen, M., Toivonen, T., 2022. A 24-hour population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland. Scientific Data 9, 39.
In this dataset:
We present temporally dynamic population distribution data from the Helsinki Metropolitan Area, Finland, at the level of 250 m by 250 m statistical grid cells. Three hourly population distribution datasets are provided for regular workdays (Mon – Thu), Saturdays and Sundays. The data are based on aggregated mobile phone data collected by the biggest mobile network operator in Finland. Mobile phone data are assigned to statistical grid cells using an advanced dasymetric interpolation method based on ancillary data about land cover, buildings and a time use survey. The data were validated by comparing population register data from Statistics Finland for night-time hours and a daytime workplace registry. The resulting 24-hour population data can be used to reveal the temporal dynamics of the city and examine population variations relevant to for instance spatial accessibility analyses, crisis management and planning.
Please cite this dataset as:
Bergroth, C., Järv, O., Tenkanen, H., Manninen, M., Toivonen, T., 2022. A 24-hour population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland. Scientific Data 9, 39. https://doi.org/10.1038/s41597-021-01113-4
Organization of data
The dataset is packaged into a single Zipfile Helsinki_dynpop_matrix.zip which contains following files:
HMA_Dynamic_population_24H_workdays.csv represents the dynamic population for average workday in the study area.
HMA_Dynamic_population_24H_sat.csv represents the dynamic population for average saturday in the study area.
HMA_Dynamic_population_24H_sun.csv represents the dynamic population for average sunday in the study area.
target_zones_grid250m_EPSG3067.geojson represents the statistical grid in ETRS89/ETRS-TM35FIN projection that can be used to visualize the data on a map using e.g. QGIS.
Column names
YKR_ID : a unique identifier for each statistical grid cell (n=13,231). The identifier is compatible with the statistical YKR grid cell data by Statistics Finland and Finnish Environment Institute.
H0, H1 ... H23 : Each field represents the proportional distribution of the total population in the study area between grid cells during a one-hour period. In total, 24 fields are formatted as “Hx”, where x stands for the hour of the day (values ranging from 0-23). For example, H0 stands for the first hour of the day: 00:00 - 00:59. The sum of all cell values for each field equals to 100 (i.e. 100% of total population for each one-hour period)
In order to visualize the data on a map, the result tables can be joined with the target_zones_grid250m_EPSG3067.geojson data. The data can be joined by using the field YKR_ID as a common key between the datasets.
License Creative Commons Attribution 4.0 International.
Related datasets
Järv, Olle; Tenkanen, Henrikki & Toivonen, Tuuli. (2017). Multi-temporal function-based dasymetric interpolation tool for mobile phone data. Zenodo. https://doi.org/10.5281/zenodo.252612
Tenkanen, Henrikki, & Toivonen, Tuuli. (2019). Helsinki Region Travel Time Matrix [Data set]. Zenodo. http://doi.org/10.5281/zenodo.3247564