100+ datasets found
  1. Z

    Data from: A 24-hour dynamic population distribution dataset based on mobile...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Feb 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Claudia Bergroth; Olle Järv; Henrikki Tenkanen; Matti Manninen; Tuuli Toivonen (2022). A 24-hour dynamic population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4724388
    Explore at:
    Dataset updated
    Feb 16, 2022
    Dataset provided by
    Unit of Urban Research and Statistics, City of Helsinki / Digital Geography Lab, Department of Geosciences and Geography, University of Helsinki
    Elisa Corporation
    Digital Geography Lab, Department of Geosciences and Geography, University of Helsinki
    Department of Built Environment, Aalto University / Centre for Advanced Spatial Analysis, University College London
    Authors
    Claudia Bergroth; Olle Järv; Henrikki Tenkanen; Matti Manninen; Tuuli Toivonen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Finland, Helsinki Metropolitan Area
    Description

    Related article: Bergroth, C., Järv, O., Tenkanen, H., Manninen, M., Toivonen, T., 2022. A 24-hour population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland. Scientific Data 9, 39.

    In this dataset:

    We present temporally dynamic population distribution data from the Helsinki Metropolitan Area, Finland, at the level of 250 m by 250 m statistical grid cells. Three hourly population distribution datasets are provided for regular workdays (Mon – Thu), Saturdays and Sundays. The data are based on aggregated mobile phone data collected by the biggest mobile network operator in Finland. Mobile phone data are assigned to statistical grid cells using an advanced dasymetric interpolation method based on ancillary data about land cover, buildings and a time use survey. The data were validated by comparing population register data from Statistics Finland for night-time hours and a daytime workplace registry. The resulting 24-hour population data can be used to reveal the temporal dynamics of the city and examine population variations relevant to for instance spatial accessibility analyses, crisis management and planning.

    Please cite this dataset as:

    Bergroth, C., Järv, O., Tenkanen, H., Manninen, M., Toivonen, T., 2022. A 24-hour population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland. Scientific Data 9, 39. https://doi.org/10.1038/s41597-021-01113-4

    Organization of data

    The dataset is packaged into a single Zipfile Helsinki_dynpop_matrix.zip which contains following files:

    HMA_Dynamic_population_24H_workdays.csv represents the dynamic population for average workday in the study area.

    HMA_Dynamic_population_24H_sat.csv represents the dynamic population for average saturday in the study area.

    HMA_Dynamic_population_24H_sun.csv represents the dynamic population for average sunday in the study area.

    target_zones_grid250m_EPSG3067.geojson represents the statistical grid in ETRS89/ETRS-TM35FIN projection that can be used to visualize the data on a map using e.g. QGIS.

    Column names

    YKR_ID : a unique identifier for each statistical grid cell (n=13,231). The identifier is compatible with the statistical YKR grid cell data by Statistics Finland and Finnish Environment Institute.

    H0, H1 ... H23 : Each field represents the proportional distribution of the total population in the study area between grid cells during a one-hour period. In total, 24 fields are formatted as “Hx”, where x stands for the hour of the day (values ranging from 0-23). For example, H0 stands for the first hour of the day: 00:00 - 00:59. The sum of all cell values for each field equals to 100 (i.e. 100% of total population for each one-hour period)

    In order to visualize the data on a map, the result tables can be joined with the target_zones_grid250m_EPSG3067.geojson data. The data can be joined by using the field YKR_ID as a common key between the datasets.

    License Creative Commons Attribution 4.0 International.

    Related datasets

    Järv, Olle; Tenkanen, Henrikki & Toivonen, Tuuli. (2017). Multi-temporal function-based dasymetric interpolation tool for mobile phone data. Zenodo. https://doi.org/10.5281/zenodo.252612

    Tenkanen, Henrikki, & Toivonen, Tuuli. (2019). Helsinki Region Travel Time Matrix [Data set]. Zenodo. http://doi.org/10.5281/zenodo.3247564

  2. f

    Data from: S1 Data set -

    • figshare.com
    • plos.figshare.com
    xlsx
    Updated Jul 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nassim Dehouche; Sorawit Viravan; Ubolrat Santawat; Nungruethai Torsuwan; Sakuna Taijan; Atthakorn Intharakosum; Yongyut Sirivatanauksorn (2023). S1 Data set - [Dataset]. http://doi.org/10.1371/journal.pone.0288239.s012
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jul 13, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Nassim Dehouche; Sorawit Viravan; Ubolrat Santawat; Nungruethai Torsuwan; Sakuna Taijan; Atthakorn Intharakosum; Yongyut Sirivatanauksorn
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundThe typical hospital Length of Stay (LOS) distribution is known to be right-skewed, to vary considerably across Diagnosis Related Groups (DRGs), and to contain markedly high values, in significant proportions. These very long stays are often considered outliers, and thin-tailed statistical distributions are assumed. However, resource consumption and planning occur at the level of medical specialty departments covering multiple DRGs, and when considered at this decision-making scale, extreme LOS values represent a significant component of the distribution of LOS (the right tail) that determines many of its statistical properties.ObjectiveTo build actionable statistical models of LOS for resource planning at the level of healthcare units.MethodsThrough a study of 46, 364 electronic health records over four medical specialty departments (Pediatrics, Obstetrics/Gynecology, Surgery, and Rehabilitation Medicine) in the largest hospital in Thailand (Siriraj Hospital in Bangkok), we show that the distribution of LOS exhibits a tail behavior that is consistent with a subexponential distribution. We analyze some empirical properties of such a distribution that are of relevance to cost and resource planning, notably the concentration of resource consumption among a minority of admissions/patients, an increasing residual LOS, where the longer a patient has been admitted, the longer they would be expected to remain admitted, and a slow convergence of the Law of Large Numbers, making empirical estimates of moments (e.g. mean, variance) unreliable.ResultsWe propose a novel Beta-Geometric model that shows a good fit with observed data and reproduces these empirical properties of LOS. Finally, we use our findings to make practical recommendations regarding the pricing and management of LOS.

  3. c

    Data from: DEPRECATED. SEE UPDATE LINK BELOW. Distribution System Upgrade...

    • s.cnmilf.com
    • data.openei.org
    • +2more
    Updated Jan 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Renewable Energy Laboratory (2025). DEPRECATED. SEE UPDATE LINK BELOW. Distribution System Upgrade Unit Cost Database [Dataset]. https://s.cnmilf.com/user74170196/https/catalog.data.gov/dataset/deprecated-see-update-link-below-distribution-system-upgrade-unit-cost-database-9deda
    Explore at:
    Dataset updated
    Jan 20, 2025
    Dataset provided by
    National Renewable Energy Laboratory
    Description

    REVISED 1/2/2019. SEE UPDATE LINK BELOW. This database contains unit cost information for different components that may be used to integrate distributed photovotaic D-PV systems onto distribution systems. Some of these upgrades and costs may also apply to integration of other distributed energy resources DER. Which components are required and how many of each is system-specific and should be determined by analyzing the effects of distributed PV at a given penetration level on the circuit of interest in combination with engineering assessments on the efficacy of different solutions to increase the ability of the circuit to host additional PV as desired. The current state of the distribution system should always be considered in these types of analysis. The data in this database was collected from a variety of utilities PV developers technology vendors and published research reports. Where possible we have included information on the source of each data point and relevant notes. In some cases where data provided is sensitive or proprietary we were not able to specify the source but provide other information that may be useful to the user e.g. year _location where equipment was installed. NREL has carefully reviewed these sources prior to inclusion in this database. Additional information about the database data sources and assumptions is included in the Unit_cost_database_guide.doc file included in this submission. This guide provides important information on what costs are included in each entry. Please refer to this guide before using the unit cost database for any purpose.

  4. d

    Global Landslide Mortality Risks and Distribution

    • catalog.data.gov
    • dataverse.harvard.edu
    • +3more
    Updated Aug 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SEDAC (2025). Global Landslide Mortality Risks and Distribution [Dataset]. https://catalog.data.gov/dataset/global-landslide-mortality-risks-and-distribution
    Explore at:
    Dataset updated
    Aug 22, 2025
    Dataset provided by
    SEDAC
    Description

    The Global Landslide Mortality Risks and Distribution is a 2.5 minute grid of global landslide mortality risks. Gridded Population of the World, Version 3 (GPWv3) data provide a baseline estimation of population per grid cell from which to estimate potential mortality risks due to landslide hazard. Mortality loss estimates per hazard event are caculated using regional, hazard-specific mortality records of the Emergency Events Database (EM-DAT) that span the 20 years between 1981 and 2000. Data regarding the frequency and distribution of landslide hazard are obtained from the Global Landslide Hazard Distribution data set. In order to more accurately reflect the confidence associated with the data and procedures, the potential mortality estimate range is classified into deciles, 10 classes of increasing risk with an approximately equal number of grid cells per class, producing a relative estimate of landslide-based mortality risks. This data set is the result of collaboration among the Columbia University Center for Hazards and Risk Research (CHRR), International Bank for Reconstruction and Development/The World Bank, and Columbia University Center for International Earth Science Information Network (CIESIN).

  5. Global Cyclone Mortality Risks and Distribution - Dataset - NASA Open Data...

    • data.nasa.gov
    Updated Apr 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nasa.gov (2025). Global Cyclone Mortality Risks and Distribution - Dataset - NASA Open Data Portal [Dataset]. https://data.nasa.gov/dataset/global-cyclone-mortality-risks-and-distribution
    Explore at:
    Dataset updated
    Apr 23, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Description

    The Global Cyclone Mortality Risks and Distribution is a 2.5 minute grid of global cyclone mortality risks. Gridded Population of the World, Version 3 (GPWv3) data provide a baseline estimation of population per grid cell from which to estimate potential mortality loss. Mortality loss estimates per hazard event are calculated using regional, hazard-specific mortality records of the Emergency Events Database (EM-DAT) that span the 20 years between 1981 and 2000. Data regarding the frequency and distribution of cyclone hazard are obtained from the Global Cyclone Hazard Frequency and Distribution data set. In order to more accurately reflect the confidence associated with the data and procedures, the potential mortality estimate range is classified into deciles, 10 classes of an approximately equal number of grid cells, providing a relative estimate of cyclone-based mortality risks. This data set is the result of collaboration among the Columbia University Center for Hazards and Risk Research (CHRR), International Bank for Reconstruction and Development/The World Bank, and Columbia University Center for International Earth Science Information Network (CIESIN).

  6. Data from: Growth and distribution endogenously determined: a theoretical...

    • scielo.figshare.com
    jpeg
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HERNÁN BORRERO; NESTOR GARZA (2023). Growth and distribution endogenously determined: a theoretical model and empirical evidence [Dataset]. http://doi.org/10.6084/m9.figshare.8091632.v1
    Explore at:
    jpegAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    SciELOhttp://www.scielo.org/
    Authors
    HERNÁN BORRERO; NESTOR GARZA
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ABSTRACT We build upon an already known but scarcely developed feature of growth theory: the importance of asset distribution in an aggregate production function. We elaborate on a simple model of two individuals, and then generalize its deductions to an extended model of n agents, concluding that perfectly distributed productive capital leads to positive and optimum long-run “endogenous” growth. Recent and classical empirical literature on the topic suggests this interpretation. In addition, we find exploratory panel data evidence that supports our theory of growth and distribution in a set of Latin American countries.

  7. i

    INSPIRE Priority Data Set (Compliant) - Alien species distribution

    • inspire-geoportal.lt
    • inspire-geoportal.ec.europa.eu
    Updated Jun 17, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Construction Sector Development Agency (2020). INSPIRE Priority Data Set (Compliant) - Alien species distribution [Dataset]. https://www.inspire-geoportal.lt/geonetwork/srv/api/records/328d8064-b72c-49ef-b764-c88b528d48a0
    Explore at:
    www:link-1.0-http--link, ogc:wms-1.3.0-http-get-capabilities, www:download-1.0-http--downloadAvailable download formats
    Dataset updated
    Jun 17, 2020
    Dataset provided by
    Ministry of Environment
    Construction Sector Development Agency
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    http://inspire.ec.europa.eu/metadata-codelist/LimitationsOnPublicAccess/noLimitationshttp://inspire.ec.europa.eu/metadata-codelist/LimitationsOnPublicAccess/noLimitations

    Area covered
    Description

    INSPIRE Priority Data Set (Compliant) - Alien species distribution

  8. 4

    Financial data set and analysis program

    • data.4tu.nl
    • figshare.com
    zip
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lizet Romero, Financial data set and analysis program [Dataset]. http://doi.org/10.4121/uuid:c55dcc10-e2b5-4a3b-a75d-234be5ae99fc
    Explore at:
    zipAvailable download formats
    Dataset provided by
    4TU.Centre for Research Data
    Authors
    Lizet Romero
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Returns of the closing price of the Colcap, Bovespa and S&P index and Analysis program for MTAR model of the paper "Bayesian estimation of a multivariate TAR model when the noise process follows a Student-t distribution"

  9. Data set for "Novel surrogate measures for improving water distribution...

    • zenodo.org
    zip
    Updated Oct 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qi WANG; Yuehua HUANG; Qi WANG; Yuehua HUANG (2024). Data set for "Novel surrogate measures for improving water distribution systems' resilience via pipe diameter uniformity enhancement" [Dataset]. http://doi.org/10.5281/zenodo.14007921
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 29, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Qi WANG; Yuehua HUANG; Qi WANG; Yuehua HUANG
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains the optimization results using resilience surrogate measures in four chosen cases (i.e., HAN, FOS, PES, MOD). It also includes mechanical reliability calculation results of optimized network layouts obtained by the surrogate measures.

  10. Data from: Summer Steelhead Distribution [ds341]

    • data.ca.gov
    • data.cnra.ca.gov
    • +5more
    Updated Oct 12, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    California Department of Fish and Wildlife (2023). Summer Steelhead Distribution [ds341] [Dataset]. https://data.ca.gov/dataset/summer-steelhead-distribution-ds3411
    Explore at:
    geojson, html, kml, csv, zip, arcgis geoservices rest apiAvailable download formats
    Dataset updated
    Oct 12, 2023
    Dataset authored and provided by
    California Department of Fish and Wildlifehttps://wildlife.ca.gov/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Summer Steelhead Distribution October 2009 Version This dataset depicts observation-based stream-level geographic distribution of anadromous summer-run steelhead trout, Oncorhynchus mykiss irideus (O. mykiss), in California. It was developed for the express purpose of assisting with steelhead recovery planning efforts. The distributions reported in this dataset were derived from a subset of the data contained in the Aquatic Species Observation Database (ASOD), a Microsoft Access multi-species observation data capture application. ASOD is an ongoing project designed to capture as complete a set of statewide inland aquatic vertebrate species observation information as possible. Please note: A separate distribution is available for winter-run steelhead. Contact information is the same as for the above. ASOD Observation data were used to develop a network of stream segments. These lines are developed by "tracing down" from each observation to the sea using the flow properties of USGS National Hydrography Dataset (NHD) High Resolution hydrography. Lastly these lines, representing stream segments, were assigned a value of either Anad Present (Anadromous present). The end result (i.e., this layer) consists of a set of lines representing the distribution of steelhead based on observations in the Aquatic Species Observation Database. This dataset represents stream reaches that are known or believed to be used by steelhead based on steelhead observations. Thus, it contains only positive steelhead occurrences. The absence of distribution on a stream does not necessarily indicate that steelhead do not utilize that stream. Additionally, steelhead may not be found in all streams or reaches each year. This is due to natural variations in run size, water conditions, and other environmental factors. The information in this data set should be used as an indicator of steelhead presence/suspected presence at the time of the observation as indicated by the 'Late_Yr' (Latest Year) field attribute. The line features in the dataset may not represent the maximum extent of steelhead on a stream; rather it is important to note that this distribution most likely underestimates the actual distribution of steelhead. This distribution is based on observations found in the ASOD database. The individual observations may not have occurred at the upper extent of anadromous occupation. In addition, no attempt was made to capture every observation of O. mykiss and so it should not be assumed that this dataset is complete for each stream. The distribution dataset was built solely from the ASOD observational data. No additional data (habitat mapping, barriers data, gradient modeling, etc.) were utilized to either add to or validate the data. It is very possible that an anadromous observation in this dataset has been recorded above (upstream of) a barrier as identified in the Passage Assessment Database (PAD). In the near future, we hope to perform a comparative analysis between this dataset and the PAD to identify and resolve all such discrepancies. Such an analysis will add rigor to and help validate both datasets. This dataset has recently undergone a review. Data source contributors as well as CDFG fisheries biologists have been provided the opportunity to review and suggest edits or additions during a recent review. Data contributors were notified and invited to review and comment on the handling of the information that they provided. The distribution was then posted to an intranet mapping application and CDFG biologists were provided an opportunity to review and comment on the dataset. During this review, biologists were also encouraged to add new observation data. This resulting final distribution contains their suggestions and additions. Please refer to "Use Constraints" section below.

  11. f

    Distribution of diabetic retinopathy data set.

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Nov 23, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wang, Simin; Zhou, Lun; Gao, Song; Zhang, Hengsheng; Liu, Ruochen; Liu, Jiaming (2022). Distribution of diabetic retinopathy data set. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000261680
    Explore at:
    Dataset updated
    Nov 23, 2022
    Authors
    Wang, Simin; Zhou, Lun; Gao, Song; Zhang, Hengsheng; Liu, Ruochen; Liu, Jiaming
    Description

    Distribution of diabetic retinopathy data set.

  12. d

    Data from: Precipitation and temperature primarily determine the reptile...

    • search.dataone.org
    • datadryad.org
    Updated Jul 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chunrong Mi; Baojun Sun (2024). Precipitation and temperature primarily determine the reptile distributions in China [Dataset]. http://doi.org/10.5061/dryad.x0k6djhtp
    Explore at:
    Dataset updated
    Jul 26, 2024
    Dataset provided by
    Dryad Digital Repository
    Authors
    Chunrong Mi; Baojun Sun
    Area covered
    China
    Description

    Reptiles make up one-third of tetrapods, however they are often omitted from global conservation analyses. Understanding the determinants of reptile distribution is the foundation for reptile conservation research. We assembled a dataset on the distribution of 231 reptile species (nearly 50% of recorded species in China). We then investigated the association of species range filling (the proportion of observed ranges compared to species potential climate distributions) with climate, range size, topography, and human activity, using three regression methods. At the species level, we found the most primary factors influencing the recent distribution pattern of reptiles across China were the mean annual precipitation (MAP) and the mean annual temperature (MAT). In contrast, human activity came in last. Similarly, at aspatial level, MAP and MAT were still the most important factors. Geographically, the south and east of China support the highest reptile diversity, partially due to high prec..., 614 pieces of literature for reptile occurrence records in China Results of this analysis, , # Data from: Precipitation and temperature primarily determine the reptile distributions in China

    https://doi.org/10.5061/dryad.x0k6djhtp

    Description of the data and file structure

    The first excel offer 614 pieces of literature for reptile occurrence records in China, the second excel offer results of analysis. Our data description is in excel files.

    'NA' value in the results represent so that there's not not applicable.

    Sharing/Access information

    The data that support the findings of this study are also available in supplementary material of this paper, refer 10.1111/ecog.07005.

  13. d

    Profiles of salinity, temperature, depth, turbidity, and distributions of...

    • catalog.data.gov
    • data.usgs.gov
    Updated Nov 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Profiles of salinity, temperature, depth, turbidity, and distributions of particle size in suspension collected during four days in South San Francisco Bay, California, June 2021 to January 2022 [Dataset]. https://catalog.data.gov/dataset/profiles-of-salinity-temperature-depth-turbidity-and-distributions-of-particle-size-in-sus
    Explore at:
    Dataset updated
    Nov 12, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    South San Francisco, California
    Description

    Profiles of salinity, temperature, turbidity, and particle size distribution were collected by the U.S. Geological Survey (USGS) Pacific Coastal and Marine Science Center in South San Francisco Bay. Data were collected at depth intervals ranging between 0.5 and 2 m (depending on total water depth); sensors remained at each depth for 1-2 minutes. Each profile was collected from surface to bed, and the near-surface region was sampled again at the end of the profile to check steady-state conditions. Profiles were collected on 4 days: June 22, July 21, and December 3 of 2021, and on January 4, 2022 (UTC). Data files are grouped by season (summer or winter) and by instrument (CTD or LISST). No LISST data were collected in the winter. Users are advised to assess data quality carefully.

  14. Interviews.docx data set.docx

    • figshare.com
    docx
    Updated Jan 11, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jacqueline Baxter; Christoper Cornforth (2019). Interviews.docx data set.docx [Dataset]. http://doi.org/10.6084/m9.figshare.7578596.v1
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jan 11, 2019
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Jacqueline Baxter; Christoper Cornforth
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Sample of MATs for research into MAT communication with Communities

  15. Crinoidea distribution data from: Deep-sea fauna of European seas - an...

    • gbif.org
    • obis.org
    • +3more
    Updated Sep 17, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexander Mironov; Nadia Ameziane; Marc Eleaume; Alexander Mironov; Nadia Ameziane; Marc Eleaume (2025). Crinoidea distribution data from: Deep-sea fauna of European seas - an annotated species check-list of benthic invertebrates living deeper than 2000 m in the seas bordering Europe [Dataset]. http://doi.org/10.14284/253
    Explore at:
    Dataset updated
    Sep 17, 2025
    Dataset provided by
    Global Biodiversity Information Facilityhttps://www.gbif.org/
    Flanders Marine Institute
    Authors
    Alexander Mironov; Nadia Ameziane; Marc Eleaume; Alexander Mironov; Nadia Ameziane; Marc Eleaume
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 1872 - Jan 1, 2014
    Area covered
    Description

    An annotated check-list is given of Crinoidea species occurring deeper than 2000 m in the seas bordering Europe.

  16. E

    Data from: The Cost of Stochastic Resetting

    • find.data.gov.scot
    • dtechtive.com
    txt, zip
    Updated Apr 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    University of Edinburgh. School of Physics and Astronomy. Institute for Condensed Matter and Complex Systems (2023). The Cost of Stochastic Resetting [Dataset]. http://doi.org/10.7488/ds/3839
    Explore at:
    txt(0.0011 MB), txt(0.0166 MB), zip(9.699 MB), zip(0.0043 MB)Available download formats
    Dataset updated
    Apr 19, 2023
    Dataset provided by
    University of Edinburgh. School of Physics and Astronomy. Institute for Condensed Matter and Complex Systems
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Resetting a stochastic process has been shown to expedite the completion time of some complex task, such as finding a target for the first time. Here we consider the cost of resetting by associating a cost to each reset, which is a function of the distance travelled during the reset event. We compute the Laplace transform of the joint probability of first passage time $t_f$, number of resets $N$ and resetting cost $C$, and use this to study the statistics of the total cost. We show that in the limit of zero resetting rate the mean cost is finite for a linear cost function, vanishes for a sub-linear cost function and diverges for a super-linear cost function. This result contrasts with the case of no resetting where the cost is always zero. For the case of an exponentially increasing cost function we show that the mean cost diverges at a finite resetting rate. We explain this by showing that the distribution of the cost has a power-law tail with continuously varying exponent that depends on the resetting rate. The dataset is related to the upcoming paper John C. Sunil, Richard A. Blythe, Martin R. Evans and Satya N. Majumdar (in submission), 'The Cost of Stochastic Resetting'.

  17. Gender, Age, and Emotion Detection from Voice

    • kaggle.com
    zip
    Updated May 29, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rohit Zaman (2021). Gender, Age, and Emotion Detection from Voice [Dataset]. https://www.kaggle.com/rohitzaman/gender-age-and-emotion-detection-from-voice
    Explore at:
    zip(967820 bytes)Available download formats
    Dataset updated
    May 29, 2021
    Authors
    Rohit Zaman
    Description

    Context

    Our target was to predict gender, age and emotion from audio. We found audio labeled datasets on Mozilla and RAVDESS. So by using R programming language 20 statistical features were extracted and then after adding the labels these datasets were formed. Audio files were collected from "Mozilla Common Voice" and “Ryerson AudioVisual Database of Emotional Speech and Song (RAVDESS)”.

    Content

    Datasets contains 20 feature columns and 1 column for denoting the label. The 20 statistical features were extracted through the Frequency Spectrum Analysis using R programming Language. They are: 1) meanfreq - The mean frequency (in kHz) is a pitch measure, that assesses the center of the distribution of power across frequencies. 2) sd - The standard deviation of frequency is a statistical measure that describes a dataset’s dispersion relative to its mean and is calculated as the variance’s square root. 3) median - The median frequency (in kHz) is the middle number in the sorted, ascending, or descending list of numbers. 4) Q25 - The first quartile (in kHz), referred to as Q1, is the median of the lower half of the data set. This means that about 25 percent of the data set numbers are below Q1, and about 75 percent are above Q1. 5) Q75 - The third quartile (in kHz), referred to as Q3, is the central point between the median and the highest distributions. 6) IQR - The interquartile range (in kHz) is a measure of statistical dispersion, equal to the difference between 75th and 25th percentiles or between upper and lower quartiles. 7) skew - The skewness is the degree of distortion from the normal distribution. It measures the lack of symmetry in the data distribution. 8) kurt - The kurtosis is a statistical measure that determines how much the tails of distribution vary from the tails of a normal distribution. It is actually the measure of outliers present in the data distribution. 9) sp.ent - The spectral entropy is a measure of signal irregularity that sums up the normalized signal’s spectral power. 10) sfm - The spectral flatness or tonality coefficient, also known as Wiener entropy, is a measure used for digital signal processing to characterize an audio spectrum. Spectral flatness is usually measured in decibels, which, instead of being noise-like, offers a way to calculate how tone-like a sound is. 11) mode - The mode frequency is the most frequently observed value in a data set. 12) centroid - The spectral centroid is a metric used to describe a spectrum in digital signal processing. It means where the spectrum’s center of mass is centered. 13) meanfun - The meanfun is the average of the fundamental frequency measured across the acoustic signal. 14) minfun - The minfun is the minimum fundamental frequency measured across the acoustic signal 15) maxfun - The maxfun is the maximum fundamental frequency measured across the acoustic signal. 16) meandom - The meandom is the average of dominant frequency measured across the acoustic signal. 17) mindom - The mindom is the minimum of dominant frequency measured across the acoustic signal. 18) maxdom - The maxdom is the maximum of dominant frequency measured across the acoustic signal 19) dfrange - The dfrange is the range of dominant frequency measured across the acoustic signal. 20) modindx - the modindx is the modulation index, which calculates the degree of frequency modulation expressed numerically as the ratio of the frequency deviation to the frequency of the modulating signal for a pure tone modulation.

    Acknowledgements

    Gender and Age Audio Data Souce: Link: https://commonvoice.mozilla.org/en Emotion Audio Data Souce: Link : https://smartlaboratory.org/ravdess/

  18. e

    Data from: Distribution, Abundance, and Coexistence of Two Species of Sucker...

    • knb.ecoinformatics.org
    • dataone.org
    • +1more
    Updated Jan 6, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sagehen Creek Field Station; University of California Natural Reserve System; Lynn Marie Decker (2015). Distribution, Abundance, and Coexistence of Two Species of Sucker (Catostomus) in Sagehen Creek, California and Their Status in the Western Lahontan Basin [Dataset]. http://doi.org/10.5063/AA/nrs.692.1
    Explore at:
    Dataset updated
    Jan 6, 2015
    Dataset provided by
    Knowledge Network for Biocomplexity
    Authors
    Sagehen Creek Field Station; University of California Natural Reserve System; Lynn Marie Decker
    Time period covered
    Jul 21, 1982 - Sep 22, 1983
    Area covered
    Description

    MASTERS THESIS: In 1982 (the researcher) initiated a study to examine any differences in the distribution and abundance between mountain and Tahoe sucker and to determine how these two similar species coexist in Sagehen Creek. Early in the study (she) noticed that the relative abundance of these species had shifted dramatically when compared to earlier reports. IN addition to Sagehen Creek, four nearby streams were surveyed in 1982 to locate both species. At all sites mountain sucker were either absent or rare and only 10 individuals were eventually collected. The apparent scarcity of mountain sucker was perplexing; early records had shown mountain sucker were not only present and abundant but were also more numerous than the associated Tahoe sucker at these sites. (She) expanded the study to address the following objectives: 1. To determine if significant shifts in the relative abundance of suckers have occurred in Sagehen Creek since early surveys, 2. To determine if similar shifts have occurred elsewhere, 3. To distinguish differences in the distribution and abundance between the two species of sucker in Sagehen Creek where both species still exist, and 4. To determine how mountain and Tahoe sucker manage to coexist in a relatively small segment of stream.

  19. A

    SAGA: Calculate Percentile

    • data.amerigeoss.org
    esri rest, html
    Updated Oct 1, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States (2018). SAGA: Calculate Percentile [Dataset]. https://data.amerigeoss.org/gl/dataset/saga-calculate-percentile
    Explore at:
    esri rest, htmlAvailable download formats
    Dataset updated
    Oct 1, 2018
    Dataset provided by
    United States
    License

    http://geospatial-usace.opendata.arcgis.com/datasets/9defaa133d434c0a8bb82d5db54e1934/license.jsonhttp://geospatial-usace.opendata.arcgis.com/datasets/9defaa133d434c0a8bb82d5db54e1934/license.json

    Description

    A sieve analysis (or gradation test) is a practice or procedure commonly used in civil engineering to assess the particle size distribution (also called gradation) of a granular material.

    As part of the Sediment Analysis and Geo-App (SAGA) a series of data processing web services are available to assist in computing sediment statistics based on results of sieve analysis. The Calculate Percentile service returns one of the following percentiles: D5, D10, D16, D35, D50, D84, D90, D95.

    Percentiles can also be computed for classification sub-groups: Overall (OVERALL), <62.5 um (DS_FINE), 62.5-250um (DS_MED), and > 250um (DS_COARSE)

    Parameter #1: Input Sieve Size, Percent Passing, Sieve Units.

    • Semi-colon separated. ex: 75000, 100, um; 50000, 100, um; 37500, 100, um; 25000,100,um; 19000,100,um
    • A minimum of 4 sieve sizes must be used. Units supported: um, mm, inches, #, Mesh, phi
    • All sieve sizes must be numeric

    Parameter #2: Percentile

    • Options: D5, D10, D16, D35, D50, D84, D90, D95

    Parameter #3: Subgroup

    • Options: OVERALL, DS_COARSE, DS_MED, DS_FINE
    • The statistics are computed for the overall sample and into Coarse, Medium, and Fine sub-classes
      • Coarse (> 250 um) DS_COARSE
      • Medium (62.5 – 250 um) DS_MED
      • Fine (< 62.5 um) DS_FINE
      • OVERALL (all records)

    Parameter #4: Outunits

    • Options: phi, m, um

    This service is part of the Sediment Analysis and Geo-App (SAGA) Toolkit.

    Looking for a comprehensive user interface to run this tool?
    Go to SAGA Online to view this geoprocessing service with data already stored in the SAGA database.

    This service can be used independently of the SAGA application and user interface, or the tool can be directly accessed through http://navigation.usace.army.mil/SEM/Analysis/GSD

  20. h

    Data publication: Generating structured foam via flowing through a wire...

    • rodare.hzdr.de
    7z
    Updated Feb 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Skrypnik, Artem; Knüpfer, Leon; Trtik, Pavel; Lappan, Tobias; Ziauddin, Muhammad; Heitkam, Sascha (2025). Data publication: Generating structured foam via flowing through a wire array [Dataset]. http://doi.org/10.14278/rodare.3583
    Explore at:
    7zAvailable download formats
    Dataset updated
    Feb 26, 2025
    Dataset provided by
    Helmholtz-Zentrum Dresden-Rossendorf
    Paul Scherrer Institute
    Technische Universität Dresden
    Technische Universität Dresden, Helmholtz-Zentrum Dresden-Rossendorf
    Authors
    Skrypnik, Artem; Knüpfer, Leon; Trtik, Pavel; Lappan, Tobias; Ziauddin, Muhammad; Heitkam, Sascha
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The structure of liquid foam is generally considered random and isotropic. However, when foam flows past a set of wires, an inhomogeneous liquid fraction distribution, or layering, can be observed within the bulk. This dataset presents neutron radiography data of foam flowing past a set of thin metal wires. During the experiments, the gas flow rate and bubble size were varied. Additionally, a dataset for foam flow past a single wire is included for reference.


    The folder includes initial data for the manuscript "Generating structured foam via flowing through a wire array".

    Folder includes:

    01_scripts scripts used for the data processing
    02_rawdata Initial neutron imaging data (.tif images)
    03_evaluation folder with MATLAB scripts used for data analysis

    LABBOOK Experimental labbook explaining the experimental sequence.
    Protocol The Neutron imaging protocol with the data of neutron source and image resolution

    The data processing is shown for the O1 bubble generator. It includes:
    1. MASK_... script used to define the cell walls and determine the mask, used further for the liquid fraction calculation.
    2. N13_INIT... scritps to define normalised image, which further used to determine liquid fraction distribution
    3. POST_BOT... scripts used to postprocess the data: define Liquid fraction distribution and DFT of those distributions.

    Note:

    1. The data were analysed at two positions: bottom (0) and top (100), meaining at the wire grid and 100 mm downstream the grid. To this end, mask should be calculated also for the top part of the nozzle, if needed, as shown in the presented examples.

    2. The data for the empty cell were calculated for the foam flow through the cell with a single thin wire. The data were extracted
    in the ROI before the wire (run 553-557).

    3. Data processing was performed as suggested in https://doi.org/10.1371/journal.pone.0210300

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Claudia Bergroth; Olle Järv; Henrikki Tenkanen; Matti Manninen; Tuuli Toivonen (2022). A 24-hour dynamic population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4724388

Data from: A 24-hour dynamic population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland

Related Article
Explore at:
Dataset updated
Feb 16, 2022
Dataset provided by
Unit of Urban Research and Statistics, City of Helsinki / Digital Geography Lab, Department of Geosciences and Geography, University of Helsinki
Elisa Corporation
Digital Geography Lab, Department of Geosciences and Geography, University of Helsinki
Department of Built Environment, Aalto University / Centre for Advanced Spatial Analysis, University College London
Authors
Claudia Bergroth; Olle Järv; Henrikki Tenkanen; Matti Manninen; Tuuli Toivonen
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Area covered
Finland, Helsinki Metropolitan Area
Description

Related article: Bergroth, C., Järv, O., Tenkanen, H., Manninen, M., Toivonen, T., 2022. A 24-hour population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland. Scientific Data 9, 39.

In this dataset:

We present temporally dynamic population distribution data from the Helsinki Metropolitan Area, Finland, at the level of 250 m by 250 m statistical grid cells. Three hourly population distribution datasets are provided for regular workdays (Mon – Thu), Saturdays and Sundays. The data are based on aggregated mobile phone data collected by the biggest mobile network operator in Finland. Mobile phone data are assigned to statistical grid cells using an advanced dasymetric interpolation method based on ancillary data about land cover, buildings and a time use survey. The data were validated by comparing population register data from Statistics Finland for night-time hours and a daytime workplace registry. The resulting 24-hour population data can be used to reveal the temporal dynamics of the city and examine population variations relevant to for instance spatial accessibility analyses, crisis management and planning.

Please cite this dataset as:

Bergroth, C., Järv, O., Tenkanen, H., Manninen, M., Toivonen, T., 2022. A 24-hour population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland. Scientific Data 9, 39. https://doi.org/10.1038/s41597-021-01113-4

Organization of data

The dataset is packaged into a single Zipfile Helsinki_dynpop_matrix.zip which contains following files:

HMA_Dynamic_population_24H_workdays.csv represents the dynamic population for average workday in the study area.

HMA_Dynamic_population_24H_sat.csv represents the dynamic population for average saturday in the study area.

HMA_Dynamic_population_24H_sun.csv represents the dynamic population for average sunday in the study area.

target_zones_grid250m_EPSG3067.geojson represents the statistical grid in ETRS89/ETRS-TM35FIN projection that can be used to visualize the data on a map using e.g. QGIS.

Column names

YKR_ID : a unique identifier for each statistical grid cell (n=13,231). The identifier is compatible with the statistical YKR grid cell data by Statistics Finland and Finnish Environment Institute.

H0, H1 ... H23 : Each field represents the proportional distribution of the total population in the study area between grid cells during a one-hour period. In total, 24 fields are formatted as “Hx”, where x stands for the hour of the day (values ranging from 0-23). For example, H0 stands for the first hour of the day: 00:00 - 00:59. The sum of all cell values for each field equals to 100 (i.e. 100% of total population for each one-hour period)

In order to visualize the data on a map, the result tables can be joined with the target_zones_grid250m_EPSG3067.geojson data. The data can be joined by using the field YKR_ID as a common key between the datasets.

License Creative Commons Attribution 4.0 International.

Related datasets

Järv, Olle; Tenkanen, Henrikki & Toivonen, Tuuli. (2017). Multi-temporal function-based dasymetric interpolation tool for mobile phone data. Zenodo. https://doi.org/10.5281/zenodo.252612

Tenkanen, Henrikki, & Toivonen, Tuuli. (2019). Helsinki Region Travel Time Matrix [Data set]. Zenodo. http://doi.org/10.5281/zenodo.3247564

Search
Clear search
Close search
Google apps
Main menu