100+ datasets found

f
The ranges of values of the hyperparameters of the benchmark datasets.
datasetcatalog.nlm.nih.gov
Updated Feb 23, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shin, Jungpil; Bin Aziz, Abu Zahid; Al Mehedi Hasan, (2021). The ranges of values of the hyperparameters of the benchmark datasets. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000782313
Explore at:
Dataset updated
Feb 23, 2021
Authors
Shin, Jungpil; Bin Aziz, Abu Zahid; Al Mehedi Hasan,
Description
The ranges of values of the hyperparameters of the benchmark datasets.
UKCP09: Time Series of Annual values of Extreme temperature range - Dataset...
ckan.publishing.service.gov.uk
Updated Jan 26, 2011
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ckan.publishing.service.gov.uk (2011). UKCP09: Time Series of Annual values of Extreme temperature range - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/ukcp09-time-series-of-annual-values-of-extreme-temperature-range
Explore at:
Dataset updated
Jan 26, 2011
Dataset provided by
CKANhttps://ckan.org/
Description
UKCP09 Time series of extreme temperatures. Annual maximum temperature minus annual minimum temperature. The datasets have been created with financial support from the Department for Environment, Food and Rural Affairs (Defra) and they are being promoted by the UK Climate Impacts Programme (UKCIP) as part of the UK Climate Projections (UKCP09). http://ukclimateprojections.defra.gov.uk/content/view/12/689/. To view this data you will have to register on the Met Office website, here: http://www.metoffice.gov.uk/research/climate/climate-monitoring/UKCP09/register
d
Data from: HomeRange: A global database of mammalian home ranges
search.dataone.org
data.niaid.nih.gov
+1more
Updated Jul 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Maarten Broekman; Selwyn Hoeks; Rosa Freriks; Merel Langendoen; Katharina Runge; Ecaterina Savenco; Ruben ter Harmsel; Mark Huijbregts; Marlee Tucker (2025). HomeRange: A global database of mammalian home ranges [Dataset]. http://doi.org/10.5061/dryad.d2547d85x
Explore at:
Unique identifier
https://doi.org/10.5061/dryad.d2547d85x
Dataset updated
Jul 15, 2025
Dataset provided by
Dryad Digital Repository
Authors
Maarten Broekman; Selwyn Hoeks; Rosa Freriks; Merel Langendoen; Katharina Runge; Ecaterina Savenco; Ruben ter Harmsel; Mark Huijbregts; Marlee Tucker
Time period covered
Jan 1, 2022
Description
Motivation: Home range is a common measure of animal space use as it provides ecological information that is useful for conservation applications. In macroecological studies, values are typically aggregated to species means to examine general patterns of animal space use. However, this ignores the environmental context in which the home range was estimated and does not account for intraspecific variation in home range size. In addition, the focus of macroecological studies on home ranges has been historically biased toward terrestrial mammals. The use of aggregated numbers and terrestrial focus limits our ability to examine home range patterns across different environments, variation in time and between different levels of organisation. Here we introduce HomeRange, a global database with 75,611 home-range values across 960 different mammal species, including terrestrial, as well as aquatic and aerial species. Main types of variable contained: The dataset contains mammal home-range estim..., Mammalian home range papers were compiled via an extensive literature search. All home range values were extracted from the literature including individual, group and population-level home range values. Associated values were also compiled including species names, methodological information on data collection, home-range estimation method, period of data collection, study coordinates and name of location, as well as species traits derived from the studies, such as body mass, life stage, reproductive status and locomotor habit. Here we include the database, associated metadata and reference list of all sources from which home range data was extracted from.Â We also provide an R package, which can be installed from https://github.com/SHoeks/HomeRange. The HomeRange R package provides functions for downloading the latest version of the HomeRange database and loading it as a standard dataframe into R, plotting several statistics of the database and finally attaching species traits (e.g. spe..., , # Title of Dataset: HomeRange: A global database of mammalian home ranges

Mammalian home range papers were compiled via an extensive literature search. All home range values were extracted from the literature including individual, group and population-level home range values. Associated values were also compiled including species names, methodological information on data collection, home-range estimation method, period of data collection, study coordinates and name of location, as well as species traits derived from the studies, such as body mass, life stage, reproductive status and locomotor habit.

We also provide an R package, which can be installed from https://github.com/SHoeks/HomeRange. The HomeRange R package provides functions for downloading the latest version of the HomeRange database and loading it as a standard dataframe into R, plotting several statistics of the database and finally attaching species traits (e.g. species average body mass, trophic level). from the CO...
w
UKCP09: Gridded Datasets of Annual values of extreme temperature range
data.wu.ac.at
ckan.publishing.service.gov.uk
zip
Updated Jul 14, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Met Office (2016). UKCP09: Gridded Datasets of Annual values of extreme temperature range [Dataset]. https://data.wu.ac.at/odso/data_gov_uk/MGJiNjVhNzctYzJiNS00ZWM5LWJjN2QtZDVjZDA1OGVmNzNk
Explore at:
zipAvailable download formats
Dataset updated
Jul 14, 2016
Dataset provided by
Met Office
Description
UKCP09: Gridded datasets of annual values. Extreme temperature range. The day-by-day sum of the mean number of degrees by which the air temperature is more than a value of 22 °C Annual maximum temperature minus annual minimum temperature.

The datasets have been created with financial support from the Department for Environment, Food and Rural Affairs (Defra) and they are being promoted by the UK Climate Impacts Programme (UKCIP) as part of the UK Climate Projections (UKCP09). http://ukclimateprojections.defra.gov.uk/content/view/12/689/.

To view this data you will have to register on the Met Office website, here: http://www.metoffice.gov.uk/research/climate/climate-monitoring/UKCP09/register
Savings Bonds Value Files
catalog.data.gov
datasets.ai
+1more
Updated Dec 1, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bureau of the Fiscal Service (2023). Savings Bonds Value Files [Dataset]. https://catalog.data.gov/dataset/savings-bonds-value-files
Explore at:
Dataset updated
Dec 1, 2023
Dataset provided by
Bureau of the Fiscal Servicehttps://www.fiscal.treasury.gov/
Description
The Savings Bond Value Files dataset is used by developers of bond pricing programs to update their systems with new redemption values for accrual savings bonds (Series E, EE, I & Savings Notes). The core data is the same as the Redemption Tables but there are differences in format, amount of data, and date range. The Savings Bonds Value Files dataset is meant for programmers and developers to read in redemption values without having to first convert PDFs.
10-m Topographic Wetness Index
catalog.data.gov
datasets.ai
+1more
Updated Jun 5, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Park Service (2024). 10-m Topographic Wetness Index [Dataset]. https://catalog.data.gov/dataset/10-m-topographic-wetness-index
Explore at:
Dataset updated
Jun 5, 2024
Dataset provided by
National Park Servicehttp://www.nps.gov/
Description
The lidar Topographic Wetness Index (TWI) is the TWI data product produced and distributed by the National Park Service, Great Smoky Mountains National Park. Concave, low gradient areas will gather water (low TWI values), whereas steep, convex areas will shed water (high TWI values). Values range range from less than 1 (dry cells) to greater than 20 (wet cells).
v
Appendix 1. Sources, values, and ranges for selected Precipitation-Runoff...
res1catalogd-o-tdatad-o-tgov.vcapture.xyz
data.usgs.gov
+1more
Updated Jul 6, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2024). Appendix 1. Sources, values, and ranges for selected Precipitation-Runoff Modeling System parameters for the O'Fallon, Redwater, Little Dry, Middle Musselshell, Judith, Cottonwood Creek, and Belt watersheds in eastern and central Montana. [Dataset]. https://res1catalogd-o-tdatad-o-tgov.vcapture.xyz/dataset/appendix-1-sources-values-and-ranges-for-selected-precipitation-runoff-modeling-systempara
Explore at:
Dataset updated
Jul 6, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Area covered
Musselshell, Montana
Description
The Precipitation-Runoff Modeling System (PRMS) was used to produce simulations of streamflow for seven watersheds in eastern and central Montana for a baseline period (water years 1982-1999) and three future periods (water years 2021-2038, 2046-2063, and 2071-2038). The seven areas that were modeled are the O'Fallon, Redwater, Little Dry, Middle Musselshell, Judith, Cottonwood Creek, and Belt watersheds.
f
Data from: Count-Based Morgan Fingerprint: A More Efficient and...
acs.figshare.com
xlsx
Updated Jul 5, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shifa Zhong; Xiaohong Guan (2023). Count-Based Morgan Fingerprint: A More Efficient and Interpretable Molecular Representation in Developing Machine Learning-Based Predictive Regression Models for Water Contaminants’ Activities and Properties [Dataset]. http://doi.org/10.1021/acs.est.3c02198.s002
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1021/acs.est.3c02198.s002
Dataset updated
Jul 5, 2023
Dataset provided by
ACS Publications
Authors
Shifa Zhong; Xiaohong Guan
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
In this study, we introduce the count-based Morgan fingerprint (C-MF) to represent chemical structures of contaminants and develop machine learning (ML)-based predictive models for their activities and properties. Compared with the binary Morgan fingerprint (B-MF), C-MF not only qualifies the presence or absence of an atom group but also quantifies its counts in a molecule. We employ six different ML algorithms (ridge regression, SVM, KNN, RF, XGBoost, and CatBoost) to develop models on 10 contaminant-related data sets based on C-MF and B-MF to compare them in terms of the model’s predictive performance, interpretation, and applicability domain (AD). Our results show that C-MF outperforms B-MF in nine of 10 data sets in terms of model predictive performance. The advantage of C-MF over B-MF is dependent on the ML algorithm, and the performance enhancements are proportional to the difference in the chemical diversity of data sets calculated by B-MF and C-MF. Model interpretation results show that the C-MF-based model can elucidate the effect of atom group counts on the target and have a wider range of SHAP values. AD analysis shows that C-MF-based models have an AD similar to that of B-MF-based ones. Finally, we developed a “ContaminaNET” platform to deploy these C-MF-based models for free use.
ECMWF Reanalysis v5
ecmwf.int
application/x-grib
Updated Dec 31, 1969
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
European Centre for Medium-Range Weather Forecasts (1969). ECMWF Reanalysis v5 [Dataset]. https://www.ecmwf.int/en/forecasts/dataset/ecmwf-reanalysis-v5
Explore at:
application/x-grib(1 datasets)Available download formats
Dataset updated
Dec 31, 1969
Dataset authored and provided by
European Centre for Medium-Range Weather Forecastshttp://ecmwf.int/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
land and oceanic climate variables. The data cover the Earth on a 31km grid and resolve the atmosphere using 137 levels from the surface up to a height of 80km. ERA5 includes information about uncertainties for all variables at reduced spatial and temporal resolutions.
f
Table of parameter descriptions and ranges of values used in the model.
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Nov 9, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dougall, Paul; Bailey, Susan; Greene, James; Darko, Frederick Laud Amoah; Annan, William; Dalton, Mackenzie; Asante-Asamani, Emmanuel; White, Diana (2022). Table of parameter descriptions and ranges of values used in the model. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000282026
Explore at:
Dataset updated
Nov 9, 2022
Authors
Dougall, Paul; Bailey, Susan; Greene, James; Darko, Frederick Laud Amoah; Annan, William; Dalton, Mackenzie; Asante-Asamani, Emmanuel; White, Diana
Description
All parameters are assumed non-negative. S(0), , , I2(0), and R(0) define the initial population sizes. Dashes are used when values are arbitrarily chosen from some range.
ERA5 post-processed daily statistics on single levels from 1940 to present
cds.climate.copernicus.eu
grib
Updated Sep 10, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ECMWF (2025). ERA5 post-processed daily statistics on single levels from 1940 to present [Dataset]. http://doi.org/10.24381/cds.4991cf48
Explore at:
gribAvailable download formats
Unique identifier
https://doi.org/10.24381/cds.4991cf48
Dataset updated
Sep 10, 2025
Dataset provided by
European Centre for Medium-Range Weather Forecastshttp://ecmwf.int/
Authors
ECMWF
License
https://object-store.os-api.cci2.ecmwf.int:443/cci2-prod-catalogue/licences/cc-by/cc-by_f24dc630aa52ab8c52a0ac85c03bc35e0abc850b4d7453bdc083535b41d5a5c3.pdfhttps://object-store.os-api.cci2.ecmwf.int:443/cci2-prod-catalogue/licences/cc-by/cc-by_f24dc630aa52ab8c52a0ac85c03bc35e0abc850b4d7453bdc083535b41d5a5c3.pdf
Time period covered
Jan 1, 1940 - Sep 4, 2025
Description
ERA5 is the fifth generation ECMWF reanalysis for the global climate and weather for the past 8 decades. Data is available from 1940 onwards. ERA5 replaces the ERA-Interim reanalysis. Reanalysis combines model data with observations from across the world into a globally complete and consistent dataset using the laws of physics. This principle, called data assimilation, is based on the method used by numerical weather prediction centres, where every so many hours (12 hours at ECMWF) a previous forecast is combined with newly available observations in an optimal way to produce a new best estimate of the state of the atmosphere, called analysis, from which an updated, improved forecast is issued. Reanalysis works in the same way, but at reduced resolution to allow for the provision of a dataset spanning back several decades. Reanalysis does not have the constraint of issuing timely forecasts, so there is more time to collect observations, and when going further back in time, to allow for the ingestion of improved versions of the original observations, which all benefit the quality of the reanalysis product. This catalogue entry provides post-processed ERA5 hourly single-level data aggregated to daily time steps. In addition to the data selection options found on the hourly page, the following options can be selected for the daily statistic calculation:

The daily aggregation statistic (daily mean, daily max, daily min, daily sum*) The sub-daily frequency sampling of the original data (1 hour, 3 hours, 6 hours) The option to shift to any local time zone in UTC (no shift means the statistic is computed from UTC+00:00)

*The daily sum is only available for the accumulated variables (see ERA5 documentation for more details). Users should be aware that the daily aggregation is calculated during the retrieval process and is not part of a permanently archived dataset. For more details on how the daily statistics are calculated, including demonstrative code, please see the documentation. For more details on the hourly data used to calculate the daily statistics, please refer to the ERA5 hourly single-level data catalogue entry and the documentation found therein.
Data from: Predicting Fraction Unbound in Human Plasma from Chemical...
acs.figshare.com
datasetcatalog.nlm.nih.gov
+1more
xlsx
Updated Jun 2, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Reiko Watanabe; Tsuyoshi Esaki; Hitoshi Kawashima; Yayoi Natsume-Kitatani; Chioko Nagao; Rikiya Ohashi; Kenji Mizuguchi (2023). Predicting Fraction Unbound in Human Plasma from Chemical Structure: Improved Accuracy in the Low Value Ranges [Dataset]. http://doi.org/10.1021/acs.molpharmaceut.8b00785.s002
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1021/acs.molpharmaceut.8b00785.s002
Dataset updated
Jun 2, 2023
Dataset provided by
ACS Publications
Authors
Reiko Watanabe; Tsuyoshi Esaki; Hitoshi Kawashima; Yayoi Natsume-Kitatani; Chioko Nagao; Rikiya Ohashi; Kenji Mizuguchi
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
Predicting the fraction unbound in plasma provides a good understanding of the pharmacokinetic properties of a drug to assist candidate selection in the early stages of drug discovery. It is also an effective tool to mitigate the risk of late-stage attrition and to optimize further screening. In this study, we built in silico prediction models of fraction unbound in human plasma with freely available software, aiming specifically to improve the accuracy in the low value ranges. We employed several machine learning techniques and built prediction models trained on the largest ever data set of 2738 experimental values. The classification model showed a high true positive rate of 0.826 for the low fraction unbound class on the test set. The strongly biased distribution of the fraction unbound in plasma was mitigated by a logarithmic transformation in the regression model, leading to improved accuracy at lower values. Overall, our models showed better performance than those of previously published methods, including commercial software. Our prediction tool can be used on its own or integrated into other pharmacokinetic modeling systems.
Values and ranges for variables and parameters used for generating numerical...
plos.figshare.com
datasetcatalog.nlm.nih.gov
xls
Updated Jun 4, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Peter Ankomah; Paul J. T. Johnson; Bruce R. Levin (2023). Values and ranges for variables and parameters used for generating numerical solutions. [Dataset]. http://doi.org/10.1371/journal.ppat.1003300.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.ppat.1003300.t001
Dataset updated
Jun 4, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Peter Ankomah; Paul J. T. Johnson; Bruce R. Levin
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
*Values in parentheses are the standard values used for numerical analysis of the model.
g
European Values Study 2017: Integrated Dataset (EVS 2017)
search.gesis.org
pollux-fid.de
Updated May 16, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gedeshi, Ilir; Pachulia, Merab; Poghosyan, Gevorg; Rotman, David; Kritzinger, Sylvia; Fotev, Georgy; Kolenović-Đapo, Jadranka; Baloban, Josip; Baloban, Stjepan; Rabušic, Ladislav; Frederiksen, Morten; Saar, Erki; Ketola, Kimmo; Wolf, Christof; Pachulia, Merab; Bréchon, Pierre; Voas, David; Rosta, Gergely; Jónsdóttir, Guðbjörg A.; Rovati, Giancarlo; Ziliukaite, Ruta; Petkovska, Antoanela; Komar, Olivera; Reeskens, Tim; Jenssen, Anders T.; Soboleva, Natalia; Marody, Mirosława; Voicu, Bogdan; Strapcová, Katarina; Bešić, Miloš; Uhan, Samo; Silvestre Cabrera, María; Wallman-Lundåsen, Susanne; Ernst Stähli, Michèle; Ramos, Alice; Balakireva, Olga; Mieriņa, Inta (2022). European Values Study 2017: Integrated Dataset (EVS 2017) [Dataset]. http://doi.org/10.4232/1.13897
Explore at:
(12272043), (9726384)Available download formats
Unique identifier
https://doi.org/10.4232/1.13897
Dataset updated
May 16, 2022
Dataset provided by
GESIS search
GESIS
Authors
Gedeshi, Ilir; Pachulia, Merab; Poghosyan, Gevorg; Rotman, David; Kritzinger, Sylvia; Fotev, Georgy; Kolenović-Đapo, Jadranka; Baloban, Josip; Baloban, Stjepan; Rabušic, Ladislav; Frederiksen, Morten; Saar, Erki; Ketola, Kimmo; Wolf, Christof; Pachulia, Merab; Bréchon, Pierre; Voas, David; Rosta, Gergely; Jónsdóttir, Guðbjörg A.; Rovati, Giancarlo; Ziliukaite, Ruta; Petkovska, Antoanela; Komar, Olivera; Reeskens, Tim; Jenssen, Anders T.; Soboleva, Natalia; Marody, Mirosława; Voicu, Bogdan; Strapcová, Katarina; Bešić, Miloš; Uhan, Samo; Silvestre Cabrera, María; Wallman-Lundåsen, Susanne; Ernst Stähli, Michèle; Ramos, Alice; Balakireva, Olga; Mieriņa, Inta
License
https://www.gesis.org/en/institute/data-usage-termshttps://www.gesis.org/en/institute/data-usage-terms
Time period covered
Jun 19, 2017 - Oct 1, 2021
Variables measured
year - survey year, dweight - Design Weight, v225 - sex respondent (Q63), studyno - GESIS study number, gweight - Calibration weights, mode - mode of data collection, doi - Digital Object Identifier, v277 - date of interview (Q107), version - GESIS archive version, pweight - Population size weight, and 464 more
Description
The European Values Study is a large-scale, cross-national and longitudinal survey research program on how Europeans think about family, work, religion, politics, and society. Repeated every nine years in an increasing number of countries, the survey provides insights into the ideas, beliefs, preferences, attitudes, values, and opinions of citizens all over Europe.

As previous waves conducted in 1981, 1990, 1999, 2008, the fifth EVS wave maintains a persistent focus on a broad range of values. Questions are highly comparable across waves and regions, making EVS suitable for research aimed at studying trends over time.

The new wave has seen a strengthening of the methodological standards. The full release of the EVS 2017 includes data and documentation of altogether 37 participating countries. For more information, please go to the EVS website.

Morale, religious, societal, political, work, and family values of Europeans.

Topics: 1. Perceptions of life: importance of work, family, friends and acquaintances, leisure time, politics and religion; happiness; self-assessment of own health; memberships in voluntary organisations (religious or church organisations, cultural activities, trade unions, political parties or groups, environment, ecology, animal rights, professional associations, sports, recreation, or other groups, none); active or inactive membership of humanitarian or charitable organisation, consumer organisation, self-help group or mutual aid; voluntary work in the last six months; tolerance towards minorities (people of a different race, heavy drinkers, immigrants, foreign workers, drug addicts, homosexuals, Christians, Muslims, Jews, and gypsies - social distance); trust in people; estimation of people´s fair and helpful behavior; internal or external control; satisfaction with life; importance of educational goals: desirable qualities of children.

Work: attitude towards work (job needed to develop talents, receiving money without working is humiliating, people turn lazy not working, work is a duty towards society, work always comes first); importance of selected aspects of occupational work; give priority to nationals over foreigners as well as men over women in jobs.

Religion and morale: religious denomination; current and former religious denomination; current frequency of church attendance and at the age of 12; self-assessment of religiousness; belief in God, life after death, hell, heaven, and re-incarnation; personal god vs. spirit or life force; importance of God in one´s life (10-point-scale); frequency of prayers; morale attitudes (scale: claiming state benefits without entitlement, cheating on taxes, taking soft drugs, accepting a bribe, homosexuality, abortion, divorce, euthanasia, suicide, paying cash to avoid taxes, casual sex, avoiding fare on public transport, prostitution, in-vitro fertilization, political violence, death penalty).

Family: trust in family; most important criteria for a successful marriage or partnership (faithfulness, adequate income, good housing, sharing household chores, children, time for friends and personal hobbies); marriage is an outdated institution; attitude towards traditional understanding of one´s role of man and woman in occupation and family (gender roles); homosexual couples are as good parents as other couples; duty towards society to have children; responsibility of adult children for their parents when they are in need of long-term care; to make own parents proud is a main goal in life.

Politics and society: political interest; political participation; preference for individual freedom or social equality; self-assessment on a left-right continuum (10-point-scale) (left-right self-placement); individual vs. state responsibility for providing; take any job vs. right to refuse job when unemployed; competition good vs. harmful for people; equal incomes vs. incentives for individual effort; private vs. government ownership of business and industry; postmaterialism (scale); most important aims of the country for the next ten years; willingness to fight for the country; expectation of future development (less importance placed on work and greater respect for authority); trust in institutions; essential characteristics of democracy; importance of democracy for the respondent; rating democracy in own country; satisfaction with the political system in the country; preferred type of political system (strong leader, expert decisions, army should ...

Bridging the Gap in Hypertension Management: Evaluating Blood Pressure...

data.mendeley.com

Updated Jan 15, 2025

+ more versions

Facebook

Twitter

Click to copy link

Link copied

Cite

abu sufian (2025). Bridging the Gap in Hypertension Management: Evaluating Blood Pressure Control and Associated Risk Factors in a Resource-Constrained Setting [Dataset]. http://doi.org/10.17632/56jyjndvcr.1

Explore at:

Unique identifier

https://doi.org/10.17632/56jyjndvcr.1

Dataset updated

Jan 15, 2025

Authors

abu sufian

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Dataset Description

This dataset contains a simulated collection of 1,00000 patient records designed to explore hypertension management in resource-constrained settings. It provides comprehensive data for analyzing blood pressure control rates, associated risk factors, and complications. The dataset is ideal for predictive modelling, risk analysis, and treatment optimization, offering insights into demographic, clinical, and treatment-related variables.

Dataset Structure

Dataset Volume

• Size: 10,000 records. • Features: 19 variables, categorized into Sociodemographic, Clinical, Complications, and Treatment/Control groups.
Variables and Categories

A. Sociodemographic Variables

1. Age:
•  Continuous variable in years.
•  Range: 18–80 years.
•  Mean ± SD: 49.37 ± 12.81.
2. Sex:
•  Categorical variable.
•  Values: Male, Female.
3. Education:
•  Categorical variable.
•  Values: No Education, Primary, Secondary, Higher Secondary, Graduate, Post-Graduate, Madrasa.
4. Occupation:
•  Categorical variable.
•  Values: Service, Business, Agriculture, Retired, Unemployed, Housewife.
5. Monthly Income:
•  Categorical variable in Bangladeshi Taka.
•  Values: <5000, 5001–10000, 10001–15000, >15000.
6. Residence:
•  Categorical variable.
•  Values: Urban, Sub-urban, Rural.

B. Clinical Variables

7. Systolic BP:
•  Continuous variable in mmHg.
•  Range: 100–200 mmHg.
•  Mean ± SD: 140 ± 15 mmHg.
8. Diastolic BP:
•  Continuous variable in mmHg.
•  Range: 60–120 mmHg.
•  Mean ± SD: 90 ± 10 mmHg.
9. Elevated Creatinine:
•  Binary variable (\geq 1.4 \, \text{mg/dL}).
•  Values: Yes, No.
10. Diabetes Mellitus:
•  Binary variable.
•  Values: Yes, No.
11. Family History of CVD:
•  Binary variable.
•  Values: Yes, No.
12. Elevated Cholesterol:
•  Binary variable (\geq 200 \, \text{mg/dL}).
•  Values: Yes, No.
13. Smoking:
•  Binary variable.
•  Values: Yes, No.

C. Complications

14. LVH (Left Ventricular Hypertrophy):
•  Binary variable (ECG diagnosis).
•  Values: Yes, No.
15. IHD (Ischemic Heart Disease):
•  Binary variable.
•  Values: Yes, No.
16. CVD (Cerebrovascular Disease):
•  Binary variable.
•  Values: Yes, No.
17. Retinopathy:
•  Binary variable.
•  Values: Yes, No.

D. Treatment and Control

18. Treatment:
•  Categorical variable indicating therapy type.
•  Values: Single Drug, Combination Drugs.
19. Control Status:
•  Binary variable.
•  Values: Controlled, Uncontrolled.

Dataset Applications

1. Predictive Modeling:
•  Develop models to predict blood pressure control status using demographic and clinical data.
2. Risk Analysis:
•  Identify significant factors influencing hypertension control and complications.
3. Severity Scoring:
•  Quantify hypertension severity for patient risk stratification.
4. Complications Prediction:
•  Forecast complications like IHD, LVH, and CVD for early intervention.
5. Treatment Guidance:
•  Analyze therapy efficacy to recommend optimal treatment strategies.

f
Parameters definition and range of values.
datasetcatalog.nlm.nih.gov
figshare.com
+1more
Updated May 18, 2012
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Menu, Frédéric; Rascalou, Guilhem; Gourbière, Sébastien; Pontier, Dominique (2012). Parameters definition and range of values. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001141670
Explore at:
Dataset updated
May 18, 2012
Authors
Menu, Frédéric; Rascalou, Guilhem; Gourbière, Sébastien; Pontier, Dominique
Description
(1)Vector, human and non-human hosts natural death rates were estimated as 1/individual longevity. The range of variation of longevity (i.e. 1/death rate parameter defined in the model), as those are the raw data found in the literature (see sections ‘Vector local growth rate’ and ‘Human and non-human hosts natural death rates’ in Text S1).(2)Death rates were calculated as the sum of the natural death rate of human or non-human hosts and additional mortality imposed by the pathogen to infectious and ‘recovered’ individuals (as calculated in section ‘Human and non-human hosts mortality induced by the pathogen’ in Text S1).
d
Chemical alteration index values and rare earth element data and expected...
catalog.data.gov
data.usgs.gov
+3more
Updated Jul 6, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2024). Chemical alteration index values and rare earth element data and expected ranges for regolith overlying the Stewartsville pluton, Virginia [Dataset]. https://catalog.data.gov/dataset/chemical-alteration-index-values-and-rare-earth-element-data-and-expected-ranges-for-regol
Explore at:
Dataset updated
Jul 6, 2024
Dataset provided by
U.S. Geological Survey
Area covered
Stewartsville, Virginia
Description
This dataset contains whole major element geochemical data used to calculate values of the chemical alteration index (CIA), data for Nd, Sm, Y, and total REE and expected ranges for total REEY for samples of regolith overlying the Stewartsville pluton, Virginia. The southeastern United States was first identified as prospective for regolith-hosted REE deposits based on the recognition that the region has been subjected to a long history of intense differential chemical weathering and saprolitization, comparable to that which formed the REE clay deposits of South China and Southeast Asia since the break-up of Pangea (Foley and Ayuso, 2013). Foley et al. (2014) established that due to their inherent high concentrations of REE, anorogenic (A-type) and highly fractionated igneous (I-type) granitic rocks of southeastern United States were highly prospective source rocks for deposits of this type. More recently, additional studies investigated accumulation processes resulting in high concentrations of REE in granite-derived regolith deposits related to the Stewartsville pluton and other plutons in Virginia. The Stewartsville pluton was emplaced along the flank of the Blue Ridge province during regional crustal extension related to the opening of the Iapetus Ocean and breakup of the supercontinent Rodinia. The studied rock samples consist of medium- to coarse-grained biotite granite and are mineralogically complex. They contain phenocrysts of quartz, sericitized and albitized k-feldspar, sodic plagioclase, and mafic clots and stringers that are composed primarily of biotite and stilpnomelane and, less typically, include magnetite and remnant cores of green and green-brown hornblende. Feldspar contains inclusions of synchysite and fergusonite; other accessory minerals include abundant and diagnostic allanite and fluorite, as well as apatite, epidote, garnet, Nb-rutile, fergusonite, monazite, titanite, xenotime, gadolinite, and zircon (Foley and Ayuso, 2015 and references therein). Granite outcrop exposures in the Piedmont and Blue Ridge areas of Virginia tend to be intensely weathered, with overlying regoliths ranging from thin and discontinuous to meters thick and laterally extensive, and often with overlying B-horizon type soils. Saprolite can extend down to depths of tens of meters below the B-horizon. In the case of the Stewartsville Pluton, regolith is well developed in multiple exposures. The sampled section described in this data release is >20 meters high by >60 meters long. The profile includes nearly fresh rock, partially to highly weathered saprolite, indurated gravels and sands, and poorly delineated layers of subsoil and topsoil. Granite at the base of the profile is iron stained (mostly goethite) and weathered on exposed surfaces and along cracks. Partially weathered sections of the outcrop display a range of rock textures throughout, rather than systematic changes from base to surface. For example, in the lower parts, cobble and boulder-sized relics of spheroidally weathered granite knobs retain distinctive primary textures but are surrounded by nearly disaggregated granite that crumbles to sand and gravel-sized fragments when sampled. Subsoils, mainly B-horizon, comprise the uppermost meter of the section and contain a higher proportion of clay minerals (i.e. kaolinite-nontronite-iron-oxide mixtures) than the underlying saprolite.
m
Transformed Customer Shopping Dataset with Advanced Feature Engineering and...
data.mendeley.com
Updated Jul 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Md Zinnahtur Rahman Zitu (2025). Transformed Customer Shopping Dataset with Advanced Feature Engineering and Anonymization [Dataset]. http://doi.org/10.17632/fnhyc6drm8.1
Explore at:
Unique identifier
https://doi.org/10.17632/fnhyc6drm8.1
Dataset updated
Jul 21, 2025
Authors
Md Zinnahtur Rahman Zitu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset represents a thoroughly transformed and enriched version of a publicly available customer shopping dataset. It has undergone comprehensive processing to ensure it is clean, privacy-compliant, and enriched with new features, making it highly suitable for advanced analytics, machine learning, and business research applications.

The transformation process focused on creating a high-quality dataset that supports robust customer behavior analysis, segmentation, and anomaly detection, while maintaining strict privacy through anonymization and data validation.

➡ Data Cleaning and Preprocessing : Duplicates were removed. Missing numerical values (Age, Purchase Amount, Review Rating) were filled with medians; missing categorical values labeled “Unknown.” Text data were cleaned and standardized, and numeric fields were clipped to valid ranges.

➡ Feature Engineering : New informative variables were engineered to augment the dataset’s analytical power. These include: • Avg_Amount_Per_Purchase: Average purchase amount calculated by dividing total purchase value by the number of previous purchases, capturing spending behavior per transaction. • Age_Group: Categorical age segmentation into meaningful bins such as Teen, Young Adult, Adult, Senior, and Elder. • Purchase_Frequency_Score: Quantitative mapping of purchase frequency to annualized values to facilitate numerical analysis. • Discount_Impact: Monetary quantification of discount application effects on purchases. • Processing_Date: Timestamp indicating the dataset transformation date for provenance tracking.

➡ Data Filtering : Rows with ages outside 0–100 were removed. Only core categories (Clothing, Footwear, Outerwear, Accessories) and the top 25% of high-value customers by purchase amount were retained for focused analysis.

➡ Data Transformation : Key numeric features were standardized, and log transformations were applied to skewed data to improve model performance.

➡ Advanced Features : Created a category-wise average purchase and a loyalty score combining purchase frequency and volume.

➡ Segmentation & Anomaly Detection : Used KMeans to cluster customers into four groups and Isolation Forest to flag anomalies.

➡ Text Processing : Cleaned text fields and added a binary indicator for clothing items.

➡ Privacy : Hashed Customer ID and removed sensitive columns like Location to ensure privacy.

➡ Validation : Automated checks for data integrity, including negative values and valid ranges.

This transformed dataset supports a wide range of research and practical applications, including customer segmentation, purchase behavior modeling, marketing strategy development, fraud detection, and machine learning education. It serves as a reliable and privacy-aware resource for academics, data scientists, and business analysts.
N
South Range, MI Population Breakdown by Gender and Age Dataset: Male and...
neilsberg.com
csv, json
Updated Feb 24, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2025). South Range, MI Population Breakdown by Gender and Age Dataset: Male and Female Population Distribution Across 18 Age Groups // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/e200fba9-f25d-11ef-8c1b-3860777c1fe6/
Explore at:
csv, jsonAvailable download formats
Dataset updated
Feb 24, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
South Range, Michigan
Variables measured
Male and Female Population Under 5 Years, Male and Female Population over 85 years, Male and Female Population Between 5 and 9 years, Male and Female Population Between 10 and 14 years, Male and Female Population Between 15 and 19 years, Male and Female Population Between 20 and 24 years, Male and Female Population Between 25 and 29 years, Male and Female Population Between 30 and 34 years, Male and Female Population Between 35 and 39 years, Male and Female Population Between 40 and 44 years, and 8 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the three variables, namely (a) Population (Male), (b) Population (Female), and (c) Gender Ratio (Males per 100 Females), we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau across 18 age groups, ranging from under 5 years to 85 years and above. These age groups are described above in the variables section. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the population of South Range by gender across 18 age groups. It lists the male and female population in each age group along with the gender ratio for South Range. The dataset can be utilized to understand the population distribution of South Range by gender and age. For example, using this dataset, we can identify the largest age group for both Men and Women in South Range. Additionally, it can be used to see how the gender ratio changes from birth to senior most age group and male to female ratio across each age group for South Range.

Key observations

Largest age group (population): Male # 20-24 years (49) | Female # 20-24 years (50). Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Age groups:

Under 5 years

5 to 9 years

10 to 14 years

15 to 19 years

20 to 24 years

25 to 29 years

30 to 34 years

35 to 39 years

40 to 44 years

45 to 49 years

50 to 54 years

55 to 59 years

60 to 64 years

65 to 69 years

70 to 74 years

75 to 79 years

80 to 84 years

85 years and over

Scope of gender :

Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis.

Variables / Data Columns

Age Group: This column displays the age group for the South Range population analysis. Total expected values are 18 and are define above in the age groups section.

Population (Male): The male population in the South Range is shown in the following column.

Population (Female): The female population in the South Range is shown in the following column.

Gender Ratio: Also known as the sex ratio, this column displays the number of males per 100 females in South Range for each age group.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for South Range Population by Gender. You can refer the same here

Stock Market Dataset for August 2025

kaggle.com

Updated Aug 7, 2025

Facebook

Twitter

Click to copy link

Link copied

Cite

Kshitij Saini (2025). Stock Market Dataset for August 2025 [Dataset]. https://www.kaggle.com/datasets/kshitijsaini121/stock-market-prediction-for-july-2025-dataset

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Aug 7, 2025

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Kshitij Saini

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Dataset Overview

This dataset contains comprehensive stock market data for June 2025, capturing daily trading information across multiple companies and sectors. The dataset represents a substantial collection of market data with detailed financial metrics and trading statistics.

Basic Dataset Information

Time Period: June 1-21, 2025 (21 trading days)
Total Records: Approximately 11,600+ entries
Companies Covered: 500+ unique stocks
Data Type: Daily stock market trading data with fundamental metrics

Markdown Table Format

Column Name	Data Type	Description	Example Values
Date	Date	Trading date in DD-MM-YYYY format	01-06-2025, 02-06-2025
Ticker	String	Stock ticker symbol (3-4 characters)	AAPL, GOOGL, TSLA
Open Price	Float	Opening price of the stock	34.92, 206.5, 125.1

Dataset Information Table

Dataset Overview

Attribute	Details
Dataset Name	Stock Market Data - June 2025
File Format	CSV
File Size	~2.5 MB
Number of Records	11,600+
Number of Features	13
Time Period	June 1-21, 2025

Data Schema

Column Name	Data Type	Description	Example Values
Date	Date	Trading date in DD-MM-YYYY format	01-06-2025, 02-06-2025
Ticker	String	Stock ticker symbol (3-4 characters)	AAPL, GOOGL, TSLA, SLH
Open Price	Float	Opening price of the stock	34.92, 206.5, 125.1
Close Price	Float	Closing price of the stock	34.53, 208.45, 124.03
High Price	Float	Highest price during the trading day	35.22, 210.51, 127.4
Low Price	Float	Lowest price during the trading day	34.38, 205.12, 121.77
Volume Traded	Integer	Number of shares traded	2,966,611, 1,658,738
Market Cap	Float	Market capitalization in dollars	57,381,363,838.88
PE Ratio	Float	Price-to-Earnings ratio	29.63, 13.03, 29.19
Dividend Yield	Float	Dividend yield percentage	2.85, 2.73, 2.64
EPS	Float	Earnings per Share	1.17, 16.0, 4.25
52 Week High	Float	Highest price in the last 52 weeks	39.39, 227.38, 138.35
52 Week Low	Float	Lowest price in the last 52 weeks	28.44, 136.79, 100.69
Sector	String	Industry sector classification	Industrials, Energy, Healthcare

Market Capitalization Tiers

Mega Cap (>$1T): 6 companies (AAPL, MSFT, NVDA, AMZN, GOOGL, META)
Large Cap ($200B-$1T): 28 companies
Mid Cap ($50B-$200B): 47 companies

Key Market Characteristics

Price Volatility by Sector

Technology: Higher volatility (±3.5% daily range)
Energy: High volatility (±4.0% daily range)
Utilities: Lower volatility (±1.5% daily range)
Healthcare/Financials: Moderate volatility (±2.5% daily range)

Trading Volume Patterns

Mega Cap: 25M - 90M shares daily
Large Cap: 8M - 35M shares daily
Mid Cap: 2M - 15M shares daily
Small Cap: 500K - 5M shares daily

Financial Metrics Distribution

Average P/E Ratio: 25.9 (market-wide)
Average Dividend Yield: 1.25%
Price Range: $19 (T) to $3,850 (BKNG)
EPS Range: $1.50 to $70.00

Notable Market Features

High-Value Stocks

BKNG (Booking Holdings): $3,650-$3,850 range
AVGO (Broadcom): $1,650-$1,750 range
REGN (Regeneron): $1,050-$1,150 range
LLY (Eli Lilly): $920-$980 range

High-Dividend Yielders

T (AT&T): 7.1% dividend yield
VZ (Verizon): 6.2% dividend yield
PFE (Pfizer): 5.8% dividend yield

Growth & Technology Leaders

NOW (ServiceNow): P/E ratio of 85
NVDA (NVIDIA): P/E ratio of 45
TSLA (Tesla): P/E ratio of 55

Data Quality & Realism Features

✅ Authentic Price Ranges: Based on realistic 2025 market projections ✅ Sector-Appropriate Volatility: Different volatility patterns by industry ✅ Correlated Metrics: P/E ratios, dividend yields, and EPS align with market caps ✅ Realistic Trading Volumes: Volume scaled appropriately to market cap ✅ Temporal Consistency: Logical price progression over 53-day period ✅ Market Cap Accuracy: Daily fluctuations reflect actual price movements

Intended Use Cases

Financial Analysis & Modeling: Portfolio optimization, risk assessment
Machine Learning Applications: Predictive modeling, algorithmic trading
Educational Purposes: Finance courses, data science training
Algorithm Development: Backtesting trading strategies
Market Research: Sector analysis, correlation studies
Visualization Projects: Interactive dashboards, market trend analysis

This dataset provides a comprehensive foundation for quantitative finance research, offering both breadth across market sectors and depth in daily trading dynamics while maintaining statistical realism throughout the observation period...

Facebook

Twitter

Click to copy link

Link copied

Cite

Shin, Jungpil; Bin Aziz, Abu Zahid; Al Mehedi Hasan, (2021). The ranges of values of the hyperparameters of the benchmark datasets. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000782313

The ranges of values of the hyperparameters of the benchmark datasets.

Explore at:

Dataset updated

Feb 23, 2021

Authors

Shin, Jungpil; Bin Aziz, Abu Zahid; Al Mehedi Hasan,

Description

The ranges of values of the hyperparameters of the benchmark datasets.

Clear search

Close search

Google apps

Main menu

The ranges of values of the hyperparameters of the benchmark datasets.

UKCP09: Time Series of Annual values of Extreme temperature range - Dataset...

Data from: HomeRange: A global database of mammalian home ranges

UKCP09: Gridded Datasets of Annual values of extreme temperature range

Savings Bonds Value Files

10-m Topographic Wetness Index

Appendix 1. Sources, values, and ranges for selected Precipitation-Runoff...

Data from: Count-Based Morgan Fingerprint: A More Efficient and...

ECMWF Reanalysis v5

Table of parameter descriptions and ranges of values used in the model.

ERA5 post-processed daily statistics on single levels from 1940 to present

Data from: Predicting Fraction Unbound in Human Plasma from Chemical...

Values and ranges for variables and parameters used for generating numerical...

European Values Study 2017: Integrated Dataset (EVS 2017)

Bridging the Gap in Hypertension Management: Evaluating Blood Pressure...

Parameters definition and range of values.

Chemical alteration index values and rare earth element data and expected...

Transformed Customer Shopping Dataset with Advanced Feature Engineering and...

South Range, MI Population Breakdown by Gender and Age Dataset: Male and...

About this dataset

Content

Inspiration

Recommended for further research

Stock Market Dataset for August 2025

Dataset Overview

Basic Dataset Information

Markdown Table Format

Dataset Information Table

Dataset Overview

Data Schema

Market Capitalization Tiers

Key Market Characteristics

Price Volatility by Sector

Trading Volume Patterns

Financial Metrics Distribution

Notable Market Features

High-Value Stocks

High-Dividend Yielders

Growth & Technology Leaders

Data Quality & Realism Features

Intended Use Cases

The ranges of values of the hyperparameters of the benchmark datasets.