100+ datasets found
  1. f

    The ranges of values of the hyperparameters of the benchmark datasets.

    • datasetcatalog.nlm.nih.gov
    Updated Feb 23, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shin, Jungpil; Bin Aziz, Abu Zahid; Al Mehedi Hasan, (2021). The ranges of values of the hyperparameters of the benchmark datasets. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000782313
    Explore at:
    Dataset updated
    Feb 23, 2021
    Authors
    Shin, Jungpil; Bin Aziz, Abu Zahid; Al Mehedi Hasan,
    Description

    The ranges of values of the hyperparameters of the benchmark datasets.

  2. UKCP09: Time Series of Annual values of Extreme temperature range - Dataset...

    • ckan.publishing.service.gov.uk
    Updated Jan 26, 2011
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2011). UKCP09: Time Series of Annual values of Extreme temperature range - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/ukcp09-time-series-of-annual-values-of-extreme-temperature-range
    Explore at:
    Dataset updated
    Jan 26, 2011
    Dataset provided by
    CKANhttps://ckan.org/
    Description

    UKCP09 Time series of extreme temperatures. Annual maximum temperature minus annual minimum temperature. The datasets have been created with financial support from the Department for Environment, Food and Rural Affairs (Defra) and they are being promoted by the UK Climate Impacts Programme (UKCIP) as part of the UK Climate Projections (UKCP09). http://ukclimateprojections.defra.gov.uk/content/view/12/689/. To view this data you will have to register on the Met Office website, here: http://www.metoffice.gov.uk/research/climate/climate-monitoring/UKCP09/register

  3. d

    Data from: HomeRange: A global database of mammalian home ranges

    • search.dataone.org
    • data.niaid.nih.gov
    • +1more
    Updated Jul 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maarten Broekman; Selwyn Hoeks; Rosa Freriks; Merel Langendoen; Katharina Runge; Ecaterina Savenco; Ruben ter Harmsel; Mark Huijbregts; Marlee Tucker (2025). HomeRange: A global database of mammalian home ranges [Dataset]. http://doi.org/10.5061/dryad.d2547d85x
    Explore at:
    Dataset updated
    Jul 15, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Maarten Broekman; Selwyn Hoeks; Rosa Freriks; Merel Langendoen; Katharina Runge; Ecaterina Savenco; Ruben ter Harmsel; Mark Huijbregts; Marlee Tucker
    Time period covered
    Jan 1, 2022
    Description

    Motivation: Home range is a common measure of animal space use as it provides ecological information that is useful for conservation applications. In macroecological studies, values are typically aggregated to species means to examine general patterns of animal space use. However, this ignores the environmental context in which the home range was estimated and does not account for intraspecific variation in home range size. In addition, the focus of macroecological studies on home ranges has been historically biased toward terrestrial mammals. The use of aggregated numbers and terrestrial focus limits our ability to examine home range patterns across different environments, variation in time and between different levels of organisation. Here we introduce HomeRange, a global database with 75,611 home-range values across 960 different mammal species, including terrestrial, as well as aquatic and aerial species. Main types of variable contained: The dataset contains mammal home-range estim..., Mammalian home range papers were compiled via an extensive literature search. All home range values were extracted from the literature including individual, group and population-level home range values. Associated values were also compiled including species names, methodological information on data collection, home-range estimation method, period of data collection, study coordinates and name of location, as well as species traits derived from the studies, such as body mass, life stage, reproductive status and locomotor habit. Here we include the database, associated metadata and reference list of all sources from which home range data was extracted from. We also provide an R package, which can be installed from https://github.com/SHoeks/HomeRange. The HomeRange R package provides functions for downloading the latest version of the HomeRange database and loading it as a standard dataframe into R, plotting several statistics of the database and finally attaching species traits (e.g. spe..., , # Title of Dataset: HomeRange: A global database of mammalian home ranges

    Mammalian home range papers were compiled via an extensive literature search. All home range values were extracted from the literature including individual, group and population-level home range values. Associated values were also compiled including species names, methodological information on data collection, home-range estimation method, period of data collection, study coordinates and name of location, as well as species traits derived from the studies, such as body mass, life stage, reproductive status and locomotor habit.

    We also provide an R package, which can be installed from https://github.com/SHoeks/HomeRange. The HomeRange R package provides functions for downloading the latest version of the HomeRange database and loading it as a standard dataframe into R, plotting several statistics of the database and finally attaching species traits (e.g. species average body mass, trophic level). from the CO...

  4. w

    UKCP09: Gridded Datasets of Annual values of extreme temperature range

    • data.wu.ac.at
    • ckan.publishing.service.gov.uk
    zip
    Updated Jul 14, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Met Office (2016). UKCP09: Gridded Datasets of Annual values of extreme temperature range [Dataset]. https://data.wu.ac.at/odso/data_gov_uk/MGJiNjVhNzctYzJiNS00ZWM5LWJjN2QtZDVjZDA1OGVmNzNk
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 14, 2016
    Dataset provided by
    Met Office
    Description

    UKCP09: Gridded datasets of annual values. Extreme temperature range. The day-by-day sum of the mean number of degrees by which the air temperature is more than a value of 22 °C Annual maximum temperature minus annual minimum temperature.

    The datasets have been created with financial support from the Department for Environment, Food and Rural Affairs (Defra) and they are being promoted by the UK Climate Impacts Programme (UKCIP) as part of the UK Climate Projections (UKCP09). http://ukclimateprojections.defra.gov.uk/content/view/12/689/.

    To view this data you will have to register on the Met Office website, here: http://www.metoffice.gov.uk/research/climate/climate-monitoring/UKCP09/register

  5. Savings Bonds Value Files

    • catalog.data.gov
    • datasets.ai
    • +1more
    Updated Dec 1, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bureau of the Fiscal Service (2023). Savings Bonds Value Files [Dataset]. https://catalog.data.gov/dataset/savings-bonds-value-files
    Explore at:
    Dataset updated
    Dec 1, 2023
    Dataset provided by
    Bureau of the Fiscal Servicehttps://www.fiscal.treasury.gov/
    Description

    The Savings Bond Value Files dataset is used by developers of bond pricing programs to update their systems with new redemption values for accrual savings bonds (Series E, EE, I & Savings Notes). The core data is the same as the Redemption Tables but there are differences in format, amount of data, and date range. The Savings Bonds Value Files dataset is meant for programmers and developers to read in redemption values without having to first convert PDFs.

  6. 10-m Topographic Wetness Index

    • catalog.data.gov
    • datasets.ai
    • +1more
    Updated Jun 5, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Park Service (2024). 10-m Topographic Wetness Index [Dataset]. https://catalog.data.gov/dataset/10-m-topographic-wetness-index
    Explore at:
    Dataset updated
    Jun 5, 2024
    Dataset provided by
    National Park Servicehttp://www.nps.gov/
    Description

    The lidar Topographic Wetness Index (TWI) is the TWI data product produced and distributed by the National Park Service, Great Smoky Mountains National Park. Concave, low gradient areas will gather water (low TWI values), whereas steep, convex areas will shed water (high TWI values). Values range range from less than 1 (dry cells) to greater than 20 (wet cells).

  7. v

    Appendix 1. Sources, values, and ranges for selected Precipitation-Runoff...

    • res1catalogd-o-tdatad-o-tgov.vcapture.xyz
    • data.usgs.gov
    • +1more
    Updated Jul 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2024). Appendix 1. Sources, values, and ranges for selected Precipitation-Runoff Modeling System parameters for the O'Fallon, Redwater, Little Dry, Middle Musselshell, Judith, Cottonwood Creek, and Belt watersheds in eastern and central Montana. [Dataset]. https://res1catalogd-o-tdatad-o-tgov.vcapture.xyz/dataset/appendix-1-sources-values-and-ranges-for-selected-precipitation-runoff-modeling-systempara
    Explore at:
    Dataset updated
    Jul 6, 2024
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    Musselshell, Montana
    Description

    The Precipitation-Runoff Modeling System (PRMS) was used to produce simulations of streamflow for seven watersheds in eastern and central Montana for a baseline period (water years 1982-1999) and three future periods (water years 2021-2038, 2046-2063, and 2071-2038). The seven areas that were modeled are the O'Fallon, Redwater, Little Dry, Middle Musselshell, Judith, Cottonwood Creek, and Belt watersheds.

  8. f

    Data from: Count-Based Morgan Fingerprint: A More Efficient and...

    • acs.figshare.com
    xlsx
    Updated Jul 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shifa Zhong; Xiaohong Guan (2023). Count-Based Morgan Fingerprint: A More Efficient and Interpretable Molecular Representation in Developing Machine Learning-Based Predictive Regression Models for Water Contaminants’ Activities and Properties [Dataset]. http://doi.org/10.1021/acs.est.3c02198.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jul 5, 2023
    Dataset provided by
    ACS Publications
    Authors
    Shifa Zhong; Xiaohong Guan
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    In this study, we introduce the count-based Morgan fingerprint (C-MF) to represent chemical structures of contaminants and develop machine learning (ML)-based predictive models for their activities and properties. Compared with the binary Morgan fingerprint (B-MF), C-MF not only qualifies the presence or absence of an atom group but also quantifies its counts in a molecule. We employ six different ML algorithms (ridge regression, SVM, KNN, RF, XGBoost, and CatBoost) to develop models on 10 contaminant-related data sets based on C-MF and B-MF to compare them in terms of the model’s predictive performance, interpretation, and applicability domain (AD). Our results show that C-MF outperforms B-MF in nine of 10 data sets in terms of model predictive performance. The advantage of C-MF over B-MF is dependent on the ML algorithm, and the performance enhancements are proportional to the difference in the chemical diversity of data sets calculated by B-MF and C-MF. Model interpretation results show that the C-MF-based model can elucidate the effect of atom group counts on the target and have a wider range of SHAP values. AD analysis shows that C-MF-based models have an AD similar to that of B-MF-based ones. Finally, we developed a “ContaminaNET” platform to deploy these C-MF-based models for free use.

  9. ECMWF Reanalysis v5

    • ecmwf.int
    application/x-grib
    Updated Dec 31, 1969
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    European Centre for Medium-Range Weather Forecasts (1969). ECMWF Reanalysis v5 [Dataset]. https://www.ecmwf.int/en/forecasts/dataset/ecmwf-reanalysis-v5
    Explore at:
    application/x-grib(1 datasets)Available download formats
    Dataset updated
    Dec 31, 1969
    Dataset authored and provided by
    European Centre for Medium-Range Weather Forecastshttp://ecmwf.int/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    land and oceanic climate variables. The data cover the Earth on a 31km grid and resolve the atmosphere using 137 levels from the surface up to a height of 80km. ERA5 includes information about uncertainties for all variables at reduced spatial and temporal resolutions.

  10. f

    Table of parameter descriptions and ranges of values used in the model.

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Nov 9, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dougall, Paul; Bailey, Susan; Greene, James; Darko, Frederick Laud Amoah; Annan, William; Dalton, Mackenzie; Asante-Asamani, Emmanuel; White, Diana (2022). Table of parameter descriptions and ranges of values used in the model. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000282026
    Explore at:
    Dataset updated
    Nov 9, 2022
    Authors
    Dougall, Paul; Bailey, Susan; Greene, James; Darko, Frederick Laud Amoah; Annan, William; Dalton, Mackenzie; Asante-Asamani, Emmanuel; White, Diana
    Description

    All parameters are assumed non-negative. S(0), , , I2(0), and R(0) define the initial population sizes. Dashes are used when values are arbitrarily chosen from some range.

  11. ERA5 post-processed daily statistics on single levels from 1940 to present

    • cds.climate.copernicus.eu
    grib
    Updated Sep 10, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ECMWF (2025). ERA5 post-processed daily statistics on single levels from 1940 to present [Dataset]. http://doi.org/10.24381/cds.4991cf48
    Explore at:
    gribAvailable download formats
    Dataset updated
    Sep 10, 2025
    Dataset provided by
    European Centre for Medium-Range Weather Forecastshttp://ecmwf.int/
    Authors
    ECMWF
    License

    https://object-store.os-api.cci2.ecmwf.int:443/cci2-prod-catalogue/licences/cc-by/cc-by_f24dc630aa52ab8c52a0ac85c03bc35e0abc850b4d7453bdc083535b41d5a5c3.pdfhttps://object-store.os-api.cci2.ecmwf.int:443/cci2-prod-catalogue/licences/cc-by/cc-by_f24dc630aa52ab8c52a0ac85c03bc35e0abc850b4d7453bdc083535b41d5a5c3.pdf

    Time period covered
    Jan 1, 1940 - Sep 4, 2025
    Description

    ERA5 is the fifth generation ECMWF reanalysis for the global climate and weather for the past 8 decades. Data is available from 1940 onwards. ERA5 replaces the ERA-Interim reanalysis. Reanalysis combines model data with observations from across the world into a globally complete and consistent dataset using the laws of physics. This principle, called data assimilation, is based on the method used by numerical weather prediction centres, where every so many hours (12 hours at ECMWF) a previous forecast is combined with newly available observations in an optimal way to produce a new best estimate of the state of the atmosphere, called analysis, from which an updated, improved forecast is issued. Reanalysis works in the same way, but at reduced resolution to allow for the provision of a dataset spanning back several decades. Reanalysis does not have the constraint of issuing timely forecasts, so there is more time to collect observations, and when going further back in time, to allow for the ingestion of improved versions of the original observations, which all benefit the quality of the reanalysis product. This catalogue entry provides post-processed ERA5 hourly single-level data aggregated to daily time steps. In addition to the data selection options found on the hourly page, the following options can be selected for the daily statistic calculation:

    The daily aggregation statistic (daily mean, daily max, daily min, daily sum*) The sub-daily frequency sampling of the original data (1 hour, 3 hours, 6 hours) The option to shift to any local time zone in UTC (no shift means the statistic is computed from UTC+00:00)

    *The daily sum is only available for the accumulated variables (see ERA5 documentation for more details). Users should be aware that the daily aggregation is calculated during the retrieval process and is not part of a permanently archived dataset. For more details on how the daily statistics are calculated, including demonstrative code, please see the documentation. For more details on the hourly data used to calculate the daily statistics, please refer to the ERA5 hourly single-level data catalogue entry and the documentation found therein.

  12. Data from: Predicting Fraction Unbound in Human Plasma from Chemical...

    • acs.figshare.com
    • datasetcatalog.nlm.nih.gov
    • +1more
    xlsx
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Reiko Watanabe; Tsuyoshi Esaki; Hitoshi Kawashima; Yayoi Natsume-Kitatani; Chioko Nagao; Rikiya Ohashi; Kenji Mizuguchi (2023). Predicting Fraction Unbound in Human Plasma from Chemical Structure: Improved Accuracy in the Low Value Ranges [Dataset]. http://doi.org/10.1021/acs.molpharmaceut.8b00785.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    ACS Publications
    Authors
    Reiko Watanabe; Tsuyoshi Esaki; Hitoshi Kawashima; Yayoi Natsume-Kitatani; Chioko Nagao; Rikiya Ohashi; Kenji Mizuguchi
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Predicting the fraction unbound in plasma provides a good understanding of the pharmacokinetic properties of a drug to assist candidate selection in the early stages of drug discovery. It is also an effective tool to mitigate the risk of late-stage attrition and to optimize further screening. In this study, we built in silico prediction models of fraction unbound in human plasma with freely available software, aiming specifically to improve the accuracy in the low value ranges. We employed several machine learning techniques and built prediction models trained on the largest ever data set of 2738 experimental values. The classification model showed a high true positive rate of 0.826 for the low fraction unbound class on the test set. The strongly biased distribution of the fraction unbound in plasma was mitigated by a logarithmic transformation in the regression model, leading to improved accuracy at lower values. Overall, our models showed better performance than those of previously published methods, including commercial software. Our prediction tool can be used on its own or integrated into other pharmacokinetic modeling systems.

  13. Values and ranges for variables and parameters used for generating numerical...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peter Ankomah; Paul J. T. Johnson; Bruce R. Levin (2023). Values and ranges for variables and parameters used for generating numerical solutions. [Dataset]. http://doi.org/10.1371/journal.ppat.1003300.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Peter Ankomah; Paul J. T. Johnson; Bruce R. Levin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    *Values in parentheses are the standard values used for numerical analysis of the model.

  14. g

    European Values Study 2017: Integrated Dataset (EVS 2017)

    • search.gesis.org
    • pollux-fid.de
    Updated May 16, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gedeshi, Ilir; Pachulia, Merab; Poghosyan, Gevorg; Rotman, David; Kritzinger, Sylvia; Fotev, Georgy; Kolenović-Đapo, Jadranka; Baloban, Josip; Baloban, Stjepan; Rabušic, Ladislav; Frederiksen, Morten; Saar, Erki; Ketola, Kimmo; Wolf, Christof; Pachulia, Merab; Bréchon, Pierre; Voas, David; Rosta, Gergely; Jónsdóttir, Guðbjörg A.; Rovati, Giancarlo; Ziliukaite, Ruta; Petkovska, Antoanela; Komar, Olivera; Reeskens, Tim; Jenssen, Anders T.; Soboleva, Natalia; Marody, Mirosława; Voicu, Bogdan; Strapcová, Katarina; Bešić, Miloš; Uhan, Samo; Silvestre Cabrera, María; Wallman-Lundåsen, Susanne; Ernst Stähli, Michèle; Ramos, Alice; Balakireva, Olga; Mieriņa, Inta (2022). European Values Study 2017: Integrated Dataset (EVS 2017) [Dataset]. http://doi.org/10.4232/1.13897
    Explore at:
    (12272043), (9726384)Available download formats
    Dataset updated
    May 16, 2022
    Dataset provided by
    GESIS search
    GESIS
    Authors
    Gedeshi, Ilir; Pachulia, Merab; Poghosyan, Gevorg; Rotman, David; Kritzinger, Sylvia; Fotev, Georgy; Kolenović-Đapo, Jadranka; Baloban, Josip; Baloban, Stjepan; Rabušic, Ladislav; Frederiksen, Morten; Saar, Erki; Ketola, Kimmo; Wolf, Christof; Pachulia, Merab; Bréchon, Pierre; Voas, David; Rosta, Gergely; Jónsdóttir, Guðbjörg A.; Rovati, Giancarlo; Ziliukaite, Ruta; Petkovska, Antoanela; Komar, Olivera; Reeskens, Tim; Jenssen, Anders T.; Soboleva, Natalia; Marody, Mirosława; Voicu, Bogdan; Strapcová, Katarina; Bešić, Miloš; Uhan, Samo; Silvestre Cabrera, María; Wallman-Lundåsen, Susanne; Ernst Stähli, Michèle; Ramos, Alice; Balakireva, Olga; Mieriņa, Inta
    License

    https://www.gesis.org/en/institute/data-usage-termshttps://www.gesis.org/en/institute/data-usage-terms

    Time period covered
    Jun 19, 2017 - Oct 1, 2021
    Variables measured
    year - survey year, dweight - Design Weight, v225 - sex respondent (Q63), studyno - GESIS study number, gweight - Calibration weights, mode - mode of data collection, doi - Digital Object Identifier, v277 - date of interview (Q107), version - GESIS archive version, pweight - Population size weight, and 464 more
    Description

    The European Values Study is a large-scale, cross-national and longitudinal survey research program on how Europeans think about family, work, religion, politics, and society. Repeated every nine years in an increasing number of countries, the survey provides insights into the ideas, beliefs, preferences, attitudes, values, and opinions of citizens all over Europe.

    As previous waves conducted in 1981, 1990, 1999, 2008, the fifth EVS wave maintains a persistent focus on a broad range of values. Questions are highly comparable across waves and regions, making EVS suitable for research aimed at studying trends over time.

    The new wave has seen a strengthening of the methodological standards. The full release of the EVS 2017 includes data and documentation of altogether 37 participating countries. For more information, please go to the EVS website.

    Morale, religious, societal, political, work, and family values of Europeans.

    Topics: 1. Perceptions of life: importance of work, family, friends and acquaintances, leisure time, politics and religion; happiness; self-assessment of own health; memberships in voluntary organisations (religious or church organisations, cultural activities, trade unions, political parties or groups, environment, ecology, animal rights, professional associations, sports, recreation, or other groups, none); active or inactive membership of humanitarian or charitable organisation, consumer organisation, self-help group or mutual aid; voluntary work in the last six months; tolerance towards minorities (people of a different race, heavy drinkers, immigrants, foreign workers, drug addicts, homosexuals, Christians, Muslims, Jews, and gypsies - social distance); trust in people; estimation of people´s fair and helpful behavior; internal or external control; satisfaction with life; importance of educational goals: desirable qualities of children.

    1. Work: attitude towards work (job needed to develop talents, receiving money without working is humiliating, people turn lazy not working, work is a duty towards society, work always comes first); importance of selected aspects of occupational work; give priority to nationals over foreigners as well as men over women in jobs.

    2. Religion and morale: religious denomination; current and former religious denomination; current frequency of church attendance and at the age of 12; self-assessment of religiousness; belief in God, life after death, hell, heaven, and re-incarnation; personal god vs. spirit or life force; importance of God in one´s life (10-point-scale); frequency of prayers; morale attitudes (scale: claiming state benefits without entitlement, cheating on taxes, taking soft drugs, accepting a bribe, homosexuality, abortion, divorce, euthanasia, suicide, paying cash to avoid taxes, casual sex, avoiding fare on public transport, prostitution, in-vitro fertilization, political violence, death penalty).

    3. Family: trust in family; most important criteria for a successful marriage or partnership (faithfulness, adequate income, good housing, sharing household chores, children, time for friends and personal hobbies); marriage is an outdated institution; attitude towards traditional understanding of one´s role of man and woman in occupation and family (gender roles); homosexual couples are as good parents as other couples; duty towards society to have children; responsibility of adult children for their parents when they are in need of long-term care; to make own parents proud is a main goal in life.

    4. Politics and society: political interest; political participation; preference for individual freedom or social equality; self-assessment on a left-right continuum (10-point-scale) (left-right self-placement); individual vs. state responsibility for providing; take any job vs. right to refuse job when unemployed; competition good vs. harmful for people; equal incomes vs. incentives for individual effort; private vs. government ownership of business and industry; postmaterialism (scale); most important aims of the country for the next ten years; willingness to fight for the country; expectation of future development (less importance placed on work and greater respect for authority); trust in institutions; essential characteristics of democracy; importance of democracy for the respondent; rating democracy in own country; satisfaction with the political system in the country; preferred type of political system (strong leader, expert decisions, army should ...

  15. m

    Bridging the Gap in Hypertension Management: Evaluating Blood Pressure...

    • data.mendeley.com
    Updated Jan 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    abu sufian (2025). Bridging the Gap in Hypertension Management: Evaluating Blood Pressure Control and Associated Risk Factors in a Resource-Constrained Setting [Dataset]. http://doi.org/10.17632/56jyjndvcr.1
    Explore at:
    Dataset updated
    Jan 15, 2025
    Authors
    abu sufian
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Description

    This dataset contains a simulated collection of 1,00000 patient records designed to explore hypertension management in resource-constrained settings. It provides comprehensive data for analyzing blood pressure control rates, associated risk factors, and complications. The dataset is ideal for predictive modelling, risk analysis, and treatment optimization, offering insights into demographic, clinical, and treatment-related variables.

    Dataset Structure

    1. Dataset Volume

      • Size: 10,000 records. • Features: 19 variables, categorized into Sociodemographic, Clinical, Complications, and Treatment/Control groups.

    2. Variables and Categories

    A. Sociodemographic Variables

    1. Age:
    •  Continuous variable in years.
    •  Range: 18–80 years.
    •  Mean ± SD: 49.37 ± 12.81.
    2. Sex:
    •  Categorical variable.
    •  Values: Male, Female.
    3. Education:
    •  Categorical variable.
    •  Values: No Education, Primary, Secondary, Higher Secondary, Graduate, Post-Graduate, Madrasa.
    4. Occupation:
    •  Categorical variable.
    •  Values: Service, Business, Agriculture, Retired, Unemployed, Housewife.
    5. Monthly Income:
    •  Categorical variable in Bangladeshi Taka.
    •  Values: <5000, 5001–10000, 10001–15000, >15000.
    6. Residence:
    •  Categorical variable.
    •  Values: Urban, Sub-urban, Rural.
    

    B. Clinical Variables

    7. Systolic BP:
    •  Continuous variable in mmHg.
    •  Range: 100–200 mmHg.
    •  Mean ± SD: 140 ± 15 mmHg.
    8. Diastolic BP:
    •  Continuous variable in mmHg.
    •  Range: 60–120 mmHg.
    •  Mean ± SD: 90 ± 10 mmHg.
    9. Elevated Creatinine:
    •  Binary variable (\geq 1.4 \, \text{mg/dL}).
    •  Values: Yes, No.
    10. Diabetes Mellitus:
    •  Binary variable.
    •  Values: Yes, No.
    11. Family History of CVD:
    •  Binary variable.
    •  Values: Yes, No.
    12. Elevated Cholesterol:
    •  Binary variable (\geq 200 \, \text{mg/dL}).
    •  Values: Yes, No.
    13. Smoking:
    •  Binary variable.
    •  Values: Yes, No.
    

    C. Complications

    14. LVH (Left Ventricular Hypertrophy):
    •  Binary variable (ECG diagnosis).
    •  Values: Yes, No.
    15. IHD (Ischemic Heart Disease):
    •  Binary variable.
    •  Values: Yes, No.
    16. CVD (Cerebrovascular Disease):
    •  Binary variable.
    •  Values: Yes, No.
    17. Retinopathy:
    •  Binary variable.
    •  Values: Yes, No.
    

    D. Treatment and Control

    18. Treatment:
    •  Categorical variable indicating therapy type.
    •  Values: Single Drug, Combination Drugs.
    19. Control Status:
    •  Binary variable.
    •  Values: Controlled, Uncontrolled.
    

    Dataset Applications

    1. Predictive Modeling:
    •  Develop models to predict blood pressure control status using demographic and clinical data.
    2. Risk Analysis:
    •  Identify significant factors influencing hypertension control and complications.
    3. Severity Scoring:
    •  Quantify hypertension severity for patient risk stratification.
    4. Complications Prediction:
    •  Forecast complications like IHD, LVH, and CVD for early intervention.
    5. Treatment Guidance:
    •  Analyze therapy efficacy to recommend optimal treatment strategies.
    
  16. f

    Parameters definition and range of values.

    • datasetcatalog.nlm.nih.gov
    • figshare.com
    • +1more
    Updated May 18, 2012
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Menu, Frédéric; Rascalou, Guilhem; Gourbière, Sébastien; Pontier, Dominique (2012). Parameters definition and range of values. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001141670
    Explore at:
    Dataset updated
    May 18, 2012
    Authors
    Menu, Frédéric; Rascalou, Guilhem; Gourbière, Sébastien; Pontier, Dominique
    Description

    (1)Vector, human and non-human hosts natural death rates were estimated as 1/individual longevity. The range of variation of longevity (i.e. 1/death rate parameter defined in the model), as those are the raw data found in the literature (see sections ‘Vector local growth rate’ and ‘Human and non-human hosts natural death rates’ in Text S1).(2)Death rates were calculated as the sum of the natural death rate of human or non-human hosts and additional mortality imposed by the pathogen to infectious and ‘recovered’ individuals (as calculated in section ‘Human and non-human hosts mortality induced by the pathogen’ in Text S1).

  17. d

    Chemical alteration index values and rare earth element data and expected...

    • catalog.data.gov
    • data.usgs.gov
    • +3more
    Updated Jul 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2024). Chemical alteration index values and rare earth element data and expected ranges for regolith overlying the Stewartsville pluton, Virginia [Dataset]. https://catalog.data.gov/dataset/chemical-alteration-index-values-and-rare-earth-element-data-and-expected-ranges-for-regol
    Explore at:
    Dataset updated
    Jul 6, 2024
    Dataset provided by
    U.S. Geological Survey
    Area covered
    Stewartsville, Virginia
    Description

    This dataset contains whole major element geochemical data used to calculate values of the chemical alteration index (CIA), data for Nd, Sm, Y, and total REE and expected ranges for total REEY for samples of regolith overlying the Stewartsville pluton, Virginia. The southeastern United States was first identified as prospective for regolith-hosted REE deposits based on the recognition that the region has been subjected to a long history of intense differential chemical weathering and saprolitization, comparable to that which formed the REE clay deposits of South China and Southeast Asia since the break-up of Pangea (Foley and Ayuso, 2013). Foley et al. (2014) established that due to their inherent high concentrations of REE, anorogenic (A-type) and highly fractionated igneous (I-type) granitic rocks of southeastern United States were highly prospective source rocks for deposits of this type. More recently, additional studies investigated accumulation processes resulting in high concentrations of REE in granite-derived regolith deposits related to the Stewartsville pluton and other plutons in Virginia. The Stewartsville pluton was emplaced along the flank of the Blue Ridge province during regional crustal extension related to the opening of the Iapetus Ocean and breakup of the supercontinent Rodinia. The studied rock samples consist of medium- to coarse-grained biotite granite and are mineralogically complex. They contain phenocrysts of quartz, sericitized and albitized k-feldspar, sodic plagioclase, and mafic clots and stringers that are composed primarily of biotite and stilpnomelane and, less typically, include magnetite and remnant cores of green and green-brown hornblende. Feldspar contains inclusions of synchysite and fergusonite; other accessory minerals include abundant and diagnostic allanite and fluorite, as well as apatite, epidote, garnet, Nb-rutile, fergusonite, monazite, titanite, xenotime, gadolinite, and zircon (Foley and Ayuso, 2015 and references therein). Granite outcrop exposures in the Piedmont and Blue Ridge areas of Virginia tend to be intensely weathered, with overlying regoliths ranging from thin and discontinuous to meters thick and laterally extensive, and often with overlying B-horizon type soils. Saprolite can extend down to depths of tens of meters below the B-horizon. In the case of the Stewartsville Pluton, regolith is well developed in multiple exposures. The sampled section described in this data release is >20 meters high by >60 meters long. The profile includes nearly fresh rock, partially to highly weathered saprolite, indurated gravels and sands, and poorly delineated layers of subsoil and topsoil. Granite at the base of the profile is iron stained (mostly goethite) and weathered on exposed surfaces and along cracks. Partially weathered sections of the outcrop display a range of rock textures throughout, rather than systematic changes from base to surface. For example, in the lower parts, cobble and boulder-sized relics of spheroidally weathered granite knobs retain distinctive primary textures but are surrounded by nearly disaggregated granite that crumbles to sand and gravel-sized fragments when sampled. Subsoils, mainly B-horizon, comprise the uppermost meter of the section and contain a higher proportion of clay minerals (i.e. kaolinite-nontronite-iron-oxide mixtures) than the underlying saprolite.

  18. m

    Transformed Customer Shopping Dataset with Advanced Feature Engineering and...

    • data.mendeley.com
    Updated Jul 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Md Zinnahtur Rahman Zitu (2025). Transformed Customer Shopping Dataset with Advanced Feature Engineering and Anonymization [Dataset]. http://doi.org/10.17632/fnhyc6drm8.1
    Explore at:
    Dataset updated
    Jul 21, 2025
    Authors
    Md Zinnahtur Rahman Zitu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset represents a thoroughly transformed and enriched version of a publicly available customer shopping dataset. It has undergone comprehensive processing to ensure it is clean, privacy-compliant, and enriched with new features, making it highly suitable for advanced analytics, machine learning, and business research applications.

    The transformation process focused on creating a high-quality dataset that supports robust customer behavior analysis, segmentation, and anomaly detection, while maintaining strict privacy through anonymization and data validation.

    ➡ Data Cleaning and Preprocessing : Duplicates were removed. Missing numerical values (Age, Purchase Amount, Review Rating) were filled with medians; missing categorical values labeled “Unknown.” Text data were cleaned and standardized, and numeric fields were clipped to valid ranges.

    ➡ Feature Engineering : New informative variables were engineered to augment the dataset’s analytical power. These include: • Avg_Amount_Per_Purchase: Average purchase amount calculated by dividing total purchase value by the number of previous purchases, capturing spending behavior per transaction. • Age_Group: Categorical age segmentation into meaningful bins such as Teen, Young Adult, Adult, Senior, and Elder. • Purchase_Frequency_Score: Quantitative mapping of purchase frequency to annualized values to facilitate numerical analysis. • Discount_Impact: Monetary quantification of discount application effects on purchases. • Processing_Date: Timestamp indicating the dataset transformation date for provenance tracking.

    ➡ Data Filtering : Rows with ages outside 0–100 were removed. Only core categories (Clothing, Footwear, Outerwear, Accessories) and the top 25% of high-value customers by purchase amount were retained for focused analysis.

    ➡ Data Transformation : Key numeric features were standardized, and log transformations were applied to skewed data to improve model performance.

    ➡ Advanced Features : Created a category-wise average purchase and a loyalty score combining purchase frequency and volume.

    ➡ Segmentation & Anomaly Detection : Used KMeans to cluster customers into four groups and Isolation Forest to flag anomalies.

    ➡ Text Processing : Cleaned text fields and added a binary indicator for clothing items.

    ➡ Privacy : Hashed Customer ID and removed sensitive columns like Location to ensure privacy.

    ➡ Validation : Automated checks for data integrity, including negative values and valid ranges.

    This transformed dataset supports a wide range of research and practical applications, including customer segmentation, purchase behavior modeling, marketing strategy development, fraud detection, and machine learning education. It serves as a reliable and privacy-aware resource for academics, data scientists, and business analysts.

  19. N

    South Range, MI Population Breakdown by Gender and Age Dataset: Male and...

    • neilsberg.com
    csv, json
    Updated Feb 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). South Range, MI Population Breakdown by Gender and Age Dataset: Male and Female Population Distribution Across 18 Age Groups // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/e200fba9-f25d-11ef-8c1b-3860777c1fe6/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Feb 24, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    South Range, Michigan
    Variables measured
    Male and Female Population Under 5 Years, Male and Female Population over 85 years, Male and Female Population Between 5 and 9 years, Male and Female Population Between 10 and 14 years, Male and Female Population Between 15 and 19 years, Male and Female Population Between 20 and 24 years, Male and Female Population Between 25 and 29 years, Male and Female Population Between 30 and 34 years, Male and Female Population Between 35 and 39 years, Male and Female Population Between 40 and 44 years, and 8 more
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the three variables, namely (a) Population (Male), (b) Population (Female), and (c) Gender Ratio (Males per 100 Females), we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau across 18 age groups, ranging from under 5 years to 85 years and above. These age groups are described above in the variables section. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the population of South Range by gender across 18 age groups. It lists the male and female population in each age group along with the gender ratio for South Range. The dataset can be utilized to understand the population distribution of South Range by gender and age. For example, using this dataset, we can identify the largest age group for both Men and Women in South Range. Additionally, it can be used to see how the gender ratio changes from birth to senior most age group and male to female ratio across each age group for South Range.

    Key observations

    Largest age group (population): Male # 20-24 years (49) | Female # 20-24 years (50). Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Age groups:

    • Under 5 years
    • 5 to 9 years
    • 10 to 14 years
    • 15 to 19 years
    • 20 to 24 years
    • 25 to 29 years
    • 30 to 34 years
    • 35 to 39 years
    • 40 to 44 years
    • 45 to 49 years
    • 50 to 54 years
    • 55 to 59 years
    • 60 to 64 years
    • 65 to 69 years
    • 70 to 74 years
    • 75 to 79 years
    • 80 to 84 years
    • 85 years and over

    Scope of gender :

    Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis.

    Variables / Data Columns

    • Age Group: This column displays the age group for the South Range population analysis. Total expected values are 18 and are define above in the age groups section.
    • Population (Male): The male population in the South Range is shown in the following column.
    • Population (Female): The female population in the South Range is shown in the following column.
    • Gender Ratio: Also known as the sex ratio, this column displays the number of males per 100 females in South Range for each age group.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for South Range Population by Gender. You can refer the same here

  20. Stock Market Dataset for August 2025

    • kaggle.com
    Updated Aug 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kshitij Saini (2025). Stock Market Dataset for August 2025 [Dataset]. https://www.kaggle.com/datasets/kshitijsaini121/stock-market-prediction-for-july-2025-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 7, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Kshitij Saini
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset Overview

    This dataset contains comprehensive stock market data for June 2025, capturing daily trading information across multiple companies and sectors. The dataset represents a substantial collection of market data with detailed financial metrics and trading statistics.

    Basic Dataset Information

    • Time Period: June 1-21, 2025 (21 trading days)
    • Total Records: Approximately 11,600+ entries
    • Companies Covered: 500+ unique stocks
    • Data Type: Daily stock market trading data with fundamental metrics

    Markdown Table Format

    Column NameData TypeDescriptionExample Values
    DateDateTrading date in DD-MM-YYYY format01-06-2025, 02-06-2025
    TickerStringStock ticker symbol (3-4 characters)AAPL, GOOGL, TSLA
    Open PriceFloatOpening price of the stock34.92, 206.5, 125.1

    Dataset Information Table

    Dataset Overview

    AttributeDetails
    Dataset NameStock Market Data - June 2025
    File FormatCSV
    File Size~2.5 MB
    Number of Records11,600+
    Number of Features13
    Time PeriodJune 1-21, 2025

    Data Schema

    Column NameData TypeDescriptionExample Values
    DateDateTrading date in DD-MM-YYYY format01-06-2025, 02-06-2025
    TickerStringStock ticker symbol (3-4 characters)AAPL, GOOGL, TSLA, SLH
    Open PriceFloatOpening price of the stock34.92, 206.5, 125.1
    Close PriceFloatClosing price of the stock34.53, 208.45, 124.03
    High PriceFloatHighest price during the trading day35.22, 210.51, 127.4
    Low PriceFloatLowest price during the trading day34.38, 205.12, 121.77
    Volume TradedIntegerNumber of shares traded2,966,611, 1,658,738
    Market CapFloatMarket capitalization in dollars57,381,363,838.88
    PE RatioFloatPrice-to-Earnings ratio29.63, 13.03, 29.19
    Dividend YieldFloatDividend yield percentage2.85, 2.73, 2.64
    EPSFloatEarnings per Share1.17, 16.0, 4.25
    52 Week HighFloatHighest price in the last 52 weeks39.39, 227.38, 138.35
    52 Week LowFloatLowest price in the last 52 weeks28.44, 136.79, 100.69
    SectorStringIndustry sector classificationIndustrials, Energy, Healthcare

    Market Capitalization Tiers

    • Mega Cap (>$1T): 6 companies (AAPL, MSFT, NVDA, AMZN, GOOGL, META)
    • Large Cap ($200B-$1T): 28 companies
    • Mid Cap ($50B-$200B): 47 companies

    Key Market Characteristics

    Price Volatility by Sector

    • Technology: Higher volatility (±3.5% daily range)
    • Energy: High volatility (±4.0% daily range)
    • Utilities: Lower volatility (±1.5% daily range)
    • Healthcare/Financials: Moderate volatility (±2.5% daily range)

    Trading Volume Patterns

    • Mega Cap: 25M - 90M shares daily
    • Large Cap: 8M - 35M shares daily
    • Mid Cap: 2M - 15M shares daily
    • Small Cap: 500K - 5M shares daily

    Financial Metrics Distribution

    • Average P/E Ratio: 25.9 (market-wide)
    • Average Dividend Yield: 1.25%
    • Price Range: $19 (T) to $3,850 (BKNG)
    • EPS Range: $1.50 to $70.00

    Notable Market Features

    High-Value Stocks

    • BKNG (Booking Holdings): $3,650-$3,850 range
    • AVGO (Broadcom): $1,650-$1,750 range
    • REGN (Regeneron): $1,050-$1,150 range
    • LLY (Eli Lilly): $920-$980 range

    High-Dividend Yielders

    • T (AT&T): 7.1% dividend yield
    • VZ (Verizon): 6.2% dividend yield
    • PFE (Pfizer): 5.8% dividend yield

    Growth & Technology Leaders

    • NOW (ServiceNow): P/E ratio of 85
    • NVDA (NVIDIA): P/E ratio of 45
    • TSLA (Tesla): P/E ratio of 55

    Data Quality & Realism Features

    ✅ Authentic Price Ranges: Based on realistic 2025 market projections ✅ Sector-Appropriate Volatility: Different volatility patterns by industry ✅ Correlated Metrics: P/E ratios, dividend yields, and EPS align with market caps ✅ Realistic Trading Volumes: Volume scaled appropriately to market cap ✅ Temporal Consistency: Logical price progression over 53-day period ✅ Market Cap Accuracy: Daily fluctuations reflect actual price movements

    Intended Use Cases

    • Financial Analysis & Modeling: Portfolio optimization, risk assessment
    • Machine Learning Applications: Predictive modeling, algorithmic trading
    • Educational Purposes: Finance courses, data science training
    • Algorithm Development: Backtesting trading strategies
    • Market Research: Sector analysis, correlation studies
    • Visualization Projects: Interactive dashboards, market trend analysis

    This dataset provides a comprehensive foundation for quantitative finance research, offering both breadth across market sectors and depth in daily trading dynamics while maintaining statistical realism throughout the observation period...

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Shin, Jungpil; Bin Aziz, Abu Zahid; Al Mehedi Hasan, (2021). The ranges of values of the hyperparameters of the benchmark datasets. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000782313

The ranges of values of the hyperparameters of the benchmark datasets.

Explore at:
Dataset updated
Feb 23, 2021
Authors
Shin, Jungpil; Bin Aziz, Abu Zahid; Al Mehedi Hasan,
Description

The ranges of values of the hyperparameters of the benchmark datasets.

Search
Clear search
Close search
Google apps
Main menu