76 datasets found

Indicators of Anxiety or Depression Based on Reported Frequency of Symptoms...
catalog.data.gov
healthdata.gov
+5more
Updated Apr 23, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Centers for Disease Control and Prevention (2025). Indicators of Anxiety or Depression Based on Reported Frequency of Symptoms During Last 7 Days [Dataset]. https://catalog.data.gov/dataset/indicators-of-anxiety-or-depression-based-on-reported-frequency-of-symptoms-during-last-7-
Explore at:
Dataset updated
Apr 23, 2025
Dataset provided by
Centers for Disease Control and Preventionhttp://www.cdc.gov/
Description
The U.S. Census Bureau, in collaboration with five federal agencies, launched the Household Pulse Survey to produce data on the social and economic impacts of Covid-19 on American households. The Household Pulse Survey was designed to gauge the impact of the pandemic on employment status, consumer spending, food security, housing, education disruptions, and dimensions of physical and mental wellness. The survey was designed to meet the goal of accurate and timely weekly estimates. It was conducted by an internet questionnaire, with invitations to participate sent by email and text message. The sample frame is the Census Bureau Master Address File Data. Housing units linked to one or more email addresses or cell phone numbers were randomly selected to participate, and one respondent from each housing unit was selected to respond for him or herself. Estimates are weighted to adjust for nonresponse and to match Census Bureau estimates of the population by age, sex, race and ethnicity, and educational attainment. All estimates shown meet the NCHS Data Presentation Standards for Proportions,
b
SrTi(1-x)V(x)O3 SX data - Datasets - data.bris
data.bris.ac.uk
Updated Aug 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). SrTi(1-x)V(x)O3 SX data - Datasets - data.bris [Dataset]. https://data.bris.ac.uk/data/dataset/17qsutaz7etrk2ffakyiqo2ppj
Explore at:
Dataset updated
Aug 14, 2025
Description
Soft X-ray (SX) experimental data of SrTi(1-x)V(x)O3 thin films, including X-ray absorption spectroscopy (XAS) data, resonant inelastic X-ray scattering (RIXS) data, and accompanying density functional theory (DFT) + dynamical mean-field theory (DMFT) calculations. Complete download (zip, 299.1 KiB)
Stack Exchange Graphs (SNAP)
kaggle.com
zip
Updated Dec 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Subhajit Sahu (2021). Stack Exchange Graphs (SNAP) [Dataset]. https://www.kaggle.com/wolfram77/graphs-snap-sx
Explore at:
zip(1480133729 bytes)Available download formats
Dataset updated
Dec 16, 2021
Authors
Subhajit Sahu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Ask Ubuntu temporal network

https://snap.stanford.edu/data/sx-askubuntu.html

Dataset information

This is a temporal network of interactions on the stack exchange web site
Ask Ubuntu (http://askubuntu.com/). There are three different types of
interactions represented by a directed edge (u, v, t):

user u answered user v's question at time t (in the graph sx-askubuntu-a2q) user u commented on user v's question at time t (in the graph
sx-askubuntu-c2q) user u commented on user v's answer at time t (in the
graph sx-askubuntu-c2a)

The graph sx-askubuntu contains the union of these graphs. These graphs
were constructed from the Stack Exchange Data Dump. Node ID numbers
correspond to the 'OwnerUserId' tag in that data dump.

Dataset statistics (sx-askubuntu)
Nodes 159,316
Temporal Edges 964,437
Edges in static graph 596,933
Time span 2613 days

Dataset statistics (sx-askubuntu-a2q)
Nodes 137,517
Temporal Edges 280,102
Edges in static graph 262,106
Time span 2613 days

Dataset statistics (sx-askubuntu-c2q)
Nodes 79,155
Temporal Edges 327,513
Edges in static graph 198,852
Time span 2047 days

Dataset statistics (sx-askubuntu-c2a)
Nodes 75,555
Temporal Edges 356,822
Edges in static graph 178,210
Time span 2418 days

Source (citation)
Ashwin Paranjape, Austin R. Benson, and Jure Leskovec. "Motifs in Temporal Networks." In Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017.

Files
File Description
sx-askubuntu.txt.gz All interactions
sx-askubuntu-a2q.txt.gz Answers to questions
sx-askubuntu-c2q.txt.gz Comments to questions
sx-askubuntu-c2a.txt.gz Comments to answers

Data format

SRC DST UNIXTS

where edges are separated by a new line and

SRC: id of the source node (a user) TGT: id of the target node (a user) UNIXTS: Unix timestamp (seconds since the epoch)

Notes on inclusion into the SuiteSparse Matrix Collection, July 2018:

The SNAP graph is 1-based, with nodes in all graphs numbered 1 to
n=515,280.

In the SuiteSparse Matrix Collection, the primary matrix, Problem.A, is
the overall static graph, with 596,993 edges, of size n-by-n with
n=159,316. These edges represent the 964,437 temporal edges. A(i,j) is
the number of times person u=nodeid(i) interacted with person v=nodeid(j), with a temporal edge (u,v,t), with any kind of interaction.
Problem.aux.nodeid is a list of the node id's that appear in the SNAP data set.

A2Q = Problem.aux.Q2A is the static sx-askubuntu-a2q graph.
C2Q = Problem.aux.C2Q is the static sx-askubuntu-c2q graph.
C2A = Problem.aux.C2A is the static sx-askubuntu-c2a graph.
These sum together to give the the overall graph. That is,
A = A2Q + C2Q + C2A.

A2Q(u,v) is the number of times person u answered v's questions.
C2Q(u,v) is the number of times person u commented on v's question.
C2A(u,v) is the number of times person u commented on v's answer.

The temporal edges are held in:
Problem.aux.temporal_edges: [964437x3]
Problem.aux.temporal_edges_a2q: [280102x3]
Problem.aux.temporal_edges_c2q: [327513x3]
Problem.aux.temporal_edges_c2a: [356822x3]

Each row in these matrices is a single temporal edge, (u,v,t). Summing up all entries in A gives 964,437, and likewise the sum of entries in the
other graphs gives the number of temporal edges they represent.

Math Overflow temporal network

https://snap.stanford.edu/data/sx-mathoverflow.html

Dataset information

This is a temporal network of interactions on the stack exchange web site
Math Overflow (http://mathoverflow.net/). There are three different types
of interactions represented by a directed edge (u, v, t):

user u answered user v's question at time t (in the graph
sx-mathoverflow-a2q) user u commented on user v's question at time t (in
the graph sx-mathoverflow-c2q) user u commented on user v's answer at time t (in the graph sx-mathoverflow-c2a)

The graph sx-mathoverflow contains the union of these graphs. These graphs were constructed from the Stack Exchange Data Dump. Node ID numbers
correspond to the 'OwnerUserId' tag in that data dump.

Dataset statistics (sx-mathoverflow)
Nodes 24,818
Temporal Edges 506,550
Edges in static graph 239,978
Time span 2350 days

Dataset statistics (sx-mathoverflow-a2q)
Nodes 21,688
Temporal Edges 107,581
Edges in static graph 90,489
Time span 2350 days

Dataset statistics (sx-mathoverflow-c2q)
Nodes 16,836
Prevalence of long COVID symptoms and COVID-19 complications
ons.gov.uk
cy.ons.gov.uk
xlsx
Updated Dec 16, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Office for National Statistics (2020). Prevalence of long COVID symptoms and COVID-19 complications [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/healthandlifeexpectancies/datasets/prevalenceoflongcovidsymptomsandcovid19complications
Explore at:
xlsxAvailable download formats
Dataset updated
Dec 16, 2020
Dataset provided by
Office for National Statisticshttp://www.ons.gov.uk/
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
Experimental estimates of the prevalence and duration of long COVID symptoms, and rates of adverse events for hospitalised coronavirus (COVID-19) patients compared with those for matched control patients.
Post-COVID Conditions
catalog.data.gov
data.virginia.gov
+4more
Updated Apr 23, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Centers for Disease Control and Prevention (2025). Post-COVID Conditions [Dataset]. https://catalog.data.gov/dataset/post-covid-conditions-89bb3
Explore at:
Dataset updated
Apr 23, 2025
Dataset provided by
Centers for Disease Control and Preventionhttp://www.cdc.gov/
Description
As part of an ongoing partnership with the Census Bureau, the National Center for Health Statistics (NCHS) recently added questions to assess the prevalence of post-COVID-19 conditions (long COVID), on the experimental Household Pulse Survey. This 20-minute online survey was designed to complement the ability of the federal statistical system to rapidly respond and provide relevant information about the impact of the coronavirus pandemic in the U.S. Data collection began on April 23, 2020. Beginning in Phase 3.5 (on June 1, 2022), NCHS included questions about the presence of symptoms of COVID that lasted three months or longer. Phase 3.5 will continue with a two-weeks on, two-weeks off collection and dissemination approach. Estimates on this page are derived from the Household Pulse Survey and show the percentage of adults aged 18 and over who a) as a proportion of the U.S. population, the percentage of adults who EVER experienced post-COVID conditions (long COVID). These adults had COVID and had some symptoms that lasted three months or longer; b) as a proportion of adults who said they ever had COVID, the percentage who EVER experienced post-COVID conditions; c) as a proportion of the U.S. population, the percentage of adults who are CURRENTLY experiencing post-COVID conditions. These adults had COVID, had long-term symptoms, and are still experiencing symptoms; d) as a proportion of adults who said they ever had COVID, the percentage who are CURRENTLY experiencing post-COVID conditions; and e) as a proportion of the U.S. population, the percentage of adults who said they ever had COVID.
ICP Forests Defoliation and Symptoms Data Set
envidat.ch
data.europa.eu
gif, json +3
Updated Jun 3, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yuman Sun; Ross Shackleton; Marco Ferretti; Dominik Haas-Artho (2025). ICP Forests Defoliation and Symptoms Data Set [Dataset]. http://doi.org/10.16904/envidat.576
Explore at:
txt, not available, json, gif, xmlAvailable download formats
Unique identifier
https://doi.org/10.16904/envidat.576
Dataset updated
Jun 3, 2025
Dataset provided by
Swiss Federal Institute for Forest, Snow and Landscape Research
Authors
Yuman Sun; Ross Shackleton; Marco Ferretti; Dominik Haas-Artho
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Dataset funded by
European Commissionhttp://ec.europa.eu/
Description
This data set has been obtained after processing original data from the ICP Forests data infrastructure. It includes "clean" defoliation and symptoms data from 19 countries over the period 1990-2022 after removing dubious cases of e.g. species attribution, dubious coding, long-standing dead trees. It is based on ICP Forests Level I plots. Defoliation is the relative loss (shed or not formed) of tree needles / leaves in relation to a hypothetical fully foliated optimum and is visually assessed using a sliding scale recorded in 5% steps (from 0%= no defoliation to 100%=standing dead tree). Occurrence of s^Symptoms attributable to damaging agents (e.g., insects, fungi, drought, hail, fire, direct action of men…) on each tree and associated to defoliation are also included in this data set. This data set includes a total of 2’688’512 observations from 219’854 trees in 12’104 plots.
Prevalence of symptoms and impact of respiratory infections, UK, dataset: 10...
cy.ons.gov.uk
ons.gov.uk
xlsx
Updated Jul 10, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Office for National Statistics (2023). Prevalence of symptoms and impact of respiratory infections, UK, dataset: 10 July 2023 [Dataset]. https://cy.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/datasets/prevalenceofsymptomsandimpactofrespiratoryinfectionsukdataset10july2023
Explore at:
xlsxAvailable download formats
Dataset updated
Jul 10, 2023
Dataset provided by
Office for National Statisticshttp://www.ons.gov.uk/
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
Results from the COVID-19 and Respiratory Infections Survey.
c
SX Network Price Prediction Data
coinbase.com
Updated Sep 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). SX Network Price Prediction Data [Dataset]. https://www.coinbase.com/en-br/price-prediction/sportx
Explore at:
Dataset updated
Sep 27, 2025
Variables measured
Growth Rate, Predicted Price
Measurement technique
User-defined projections based on compound growth. This is not a formal financial forecast.
Description
This dataset contains the predicted prices of the asset SX Network over the next 16 years. This data is calculated initially using a default 5 percent annual growth rate, and after page load, it features a sliding scale component where the user can then further adjust the growth rate to their own positive or negative projections. The maximum positive adjustable growth rate is 100 percent, and the minimum adjustable growth rate is -100 percent.
p
Health and beauty shops Business Data for SX
poidata.io
csv, json
Updated Sep 20, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Business Data Provider (2025). Health and beauty shops Business Data for SX [Dataset]. https://poidata.io/report/health-and-beauty-shop/sx
Explore at:
json, csvAvailable download formats
Dataset updated
Sep 20, 2025
Dataset authored and provided by
Business Data Provider
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2025
Area covered
SX
Variables measured
Website URL, Phone Number, Review Count, Business Name, Email Address, Business Hours, Customer Rating, Business Address, Business Categories, Geographic Coordinates
Description
Comprehensive dataset containing 11 verified Health and beauty shop businesses in SX with complete contact information, ratings, reviews, and location data.
Prevalence of ongoing symptoms following coronavirus (COVID-19) infection in...
ons.gov.uk
cy.ons.gov.uk
xlsx
Updated Mar 30, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Office for National Statistics (2023). Prevalence of ongoing symptoms following coronavirus (COVID-19) infection in the UK [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/datasets/alldatarelatingtoprevalenceofongoingsymptomsfollowingcoronaviruscovid19infectionintheuk
Explore at:
xlsxAvailable download formats
Dataset updated
Mar 30, 2023
Dataset provided by
Office for National Statisticshttp://www.ons.gov.uk/
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
Estimates of the prevalence of self-reported long COVID and associated activity limitation, using UK Coronavirus (COVID-19) Infection Survey data. Experimental Statistics.
n
Patient-reported outcomes via electronic health record portal vs. telephone:...
data.niaid.nih.gov
search.dataone.org
+1more
zip
Updated Oct 23, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Heidi Munger Clary; Beverly Snively (2022). Patient-reported outcomes via electronic health record portal vs. telephone: process and retention data in a pilot trial of anxiety or depression symptoms in epilepsy [Dataset]. http://doi.org/10.5061/dryad.qz612jmk3
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.qz612jmk3
Dataset updated
Oct 23, 2022
Dataset provided by
Atrium Health Wake Forest Baptist
Authors
Heidi Munger Clary; Beverly Snively
License
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Description
Objective: To close gaps between research and clinical practice, tools are needed for efficient pragmatic trial recruitment and patient-reported outcome(PROM) collection. The objective was to assess feasibility and process measures for patient-reported outcome collection in a randomized trial comparing electronic health record(EHR) patient portal questionnaires to telephone interview among adults with epilepsy and anxiety or depression symptoms. Results: Participants were 60% women, 77% White/non-Hispanic, with mean age 42.5 years. Among 15 individuals randomized to EHR portal, 10(67%, CI 41.7-84.8%) met the 6-month retention endpoint, versus 100%(CI 79.6-100%) in the telephone group(p=0.04). EHR outcome collection at 6 months required 11.8 minutes less research staff time per participant than telephone (5.9, CI 3.3-7.7 vs. 17.7, CI 14.1-20.2). Subsequent telephone contact after unsuccessful EHR attempts enabled near complete data collection and still saved staff time. Discussion: Data from this randomized pilot study of pragmatic outcome collection methods for patients with anxiety or depression symptoms in epilepsy includes baseline participant characteristics, recruitment flow resulting from a novel EHR-based, care-embedded recruitment process, and data on retention along with various process measures at 6-months. Methods The dataset was collected via a combination of the following: 1. manual extraction of EHR-based data followed by entry into REDCap and then analysis and further processing in SAS 9.4; 2. Data pull of Epic EHR-based data from Clarity database using standard programming techniques, followed by processing in SAS 9.4 and merging with data from REDCap; 3. Collection of data directly from participants via telephone with entry into REDCap and further processing in SAS 9.4; 4. Collection of process measures from study team tracking records followed by entry into REDCap and further processing in SAS 9.4. One file in the dataset contains aggregate data generated following merging of Clarity data pull-origin dataset with a REDCap dataset and further manual processing. Recruitment for the randomized trial began at an epilepsy clinic visit, with EHR-embedded validated anxiety and depression instruments, followed by automated EHR-based research screening consent and eligibility assessment. Fully eligible individuals later completed telephone consent, enrollment and randomization. Thirty total participants were randomized 1:1 to EHR portal versus telephone outcome assessment, and patient-reported and process outcomes were collected at 3- and 6-months, with primary outcome 6-month retention in EHR arm(feasibility target: ≥11 participants retained). Variables in this dataset include recruitment flow diagram data, baseline participant sociodemographic and clinical characteristics, retention (successful PROM collection at 6 months), and process measures. The process measures included research staff time to collect outcomes, research staff time to collect outcomes and enter data, time from initial outcome collection reminder to outcome collection, and number of reminders sent to participants for outcome collection. PROMs were collected via the randomized method only at 3 months. At 6 months, if the criteria for retention was not met by the randomized method (failure to return outcomes by 1 week after 5 post-due date reminders for outcome collection), up to 3 additional attempts were made to collect outcomes by the alternative method, and process measures were also collected during this hybrid outcome collection method approach.
n
Malaria disease and grading system dataset from public hospitals reflecting...
data.niaid.nih.gov
datadryad.org
zip
Updated Nov 10, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Temitope Olufunmi Atoyebi; Rashidah Funke Olanrewaju; N. V. Blamah; Emmanuel Chinanu Uwazie (2023). Malaria disease and grading system dataset from public hospitals reflecting complicated and uncomplicated conditions [Dataset]. http://doi.org/10.5061/dryad.4xgxd25gn
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.4xgxd25gn
Dataset updated
Nov 10, 2023
Dataset provided by
Nasarawa State University
Authors
Temitope Olufunmi Atoyebi; Rashidah Funke Olanrewaju; N. V. Blamah; Emmanuel Chinanu Uwazie
License
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Description
Malaria is the leading cause of death in the African region. Data mining can help extract valuable knowledge from available data in the healthcare sector. This makes it possible to train models to predict patient health faster than in clinical trials. Implementations of various machine learning algorithms such as K-Nearest Neighbors, Bayes Theorem, Logistic Regression, Support Vector Machines, and Multinomial Naïve Bayes (MNB), etc., has been applied to malaria datasets in public hospitals, but there are still limitations in modeling using the Naive Bayes multinomial algorithm. This study applies the MNB model to explore the relationship between 15 relevant attributes of public hospitals data. The goal is to examine how the dependency between attributes affects the performance of the classifier. MNB creates transparent and reliable graphical representation between attributes with the ability to predict new situations. The model (MNB) has 97% accuracy. It is concluded that this model outperforms the GNB classifier which has 100% accuracy and the RF which also has 100% accuracy. Methods Prior to collection of data, the researcher was be guided by all ethical training certification on data collection, right to confidentiality and privacy reserved called Institutional Review Board (IRB). Data was be collected from the manual archive of the Hospitals purposively selected using stratified sampling technique, transform the data to electronic form and store in MYSQL database called malaria. Each patient file was extracted and review for signs and symptoms of malaria then check for laboratory confirmation result from diagnosis. The data was be divided into two tables: the first table was called data1 which contain data for use in phase 1 of the classification, while the second table data2 which contains data for use in phase 2 of the classification. Data Source Collection Malaria incidence data set is obtained from Public hospitals from 2017 to 2021. These are the data used for modeling and analysis. Also, putting in mind the geographical location and socio-economic factors inclusive which are available for patients inhabiting those areas. Naive Bayes (Multinomial) is the model used to analyze the collected data for malaria disease prediction and grading accordingly. Data Preprocessing: Data preprocessing shall be done to remove noise and outlier. Transformation: The data shall be transformed from analog to electronic record. Data Partitioning The data which shall be collected will be divided into two portions; one portion of the data shall be extracted as a training set, while the other portion will be used for testing. The training portion shall be taken from a table stored in a database and will be called data which is training set1, while the training portion taking from another table store in a database is shall be called data which is training set2. The dataset was split into two parts: a sample containing 70% of the training data and 30% for the purpose of this research. Then, using MNB classification algorithms implemented in Python, the models were trained on the training sample. On the 30% remaining data, the resulting models were tested, and the results were compared with the other Machine Learning models using the standard metrics. Classification and prediction: Base on the nature of variable in the dataset, this study will use Naïve Bayes (Multinomial) classification techniques; Classification phase 1 and Classification phase 2. The operation of the framework is illustrated as follows: i. Data collection and preprocessing shall be done. ii. Preprocess data shall be stored in a training set 1 and training set 2. These datasets shall be used during classification. iii. Test data set is shall be stored in database test data set. iv. Part of the test data set must be compared for classification using classifier 1 and the remaining part must be classified with classifier 2 as follows: Classifier phase 1: It classify into positive or negative classes. If the patient is having malaria, then the patient is classified as positive (P), while a patient is classified as negative (N) if the patient does not have malaria.
Classifier phase 2: It classify only data set that has been classified as positive by classifier 1, and then further classify them into complicated and uncomplicated class label. The classifier will also capture data on environmental factors, genetics, gender and age, cultural and socio-economic variables. The system will be designed such that the core parameters as a determining factor should supply their value.
m
A Brazilian dataset of symptomatic patients for screening the risk of...
data.mendeley.com
Updated Mar 8, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Íris Viana dos Santos Santana (2021). A Brazilian dataset of symptomatic patients for screening the risk of COVID-19 [Dataset]. http://doi.org/10.17632/b7zcgmmwx4.5
Explore at:
Unique identifier
https://doi.org/10.17632/b7zcgmmwx4.5
Dataset updated
Mar 8, 2021
Authors
Íris Viana dos Santos Santana
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The original COVID-19 dataset included information about tested patients, containing early-stage symptoms, comorbidities, demographics information, and symptoms description. The patients were tested by applying viral or rapid tests. The raw data was collected by the public health agency of the city of Campina Grande, Paraíba state, in Northeast Brazil. Such a public agency is informed by all the COVID-19 exams performed in the city of Campina Grande. The health agency employees removed patient identification, and the data made available were reused to enable this study.

We preprocessed the dataset by selecting only completed tests, being marked as positive or negative, applied string matching algorithms to correct some inconsistencies, and removed rows with duplicated instances and asymptomatic patients. We also focused on the most frequent and relevant demographics information and reported early-stage symptoms to select features, and balanced the data considering positive and negative cases by random undersampling using the NearMiss algorithm. We also use unbalanced datasets.

Using this dataset, we implemented and evaluated supervised machine learning models for COVID-19 detection in Brazil based on early-stage symptoms and basic personal information.

This dataset relates to the study entitled "Machine Learning Classification Models for COVID-19 Test Prioritization in Brazil".
c
Animal Condition Classification Dataset
cubig.ai
Updated Aug 30, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CUBIG (2024). Animal Condition Classification Dataset [Dataset]. https://cubig.ai/store/products/208/animal-condition-classification-dataset
Explore at:
Dataset updated
Aug 30, 2024
Dataset authored and provided by
CUBIG
License
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
Measurement technique
Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
Description
1) Data introduction • Animal-disease dataset aims to develop a predictive model that determines whether an animal's condition is dangerous based on five distinct symptoms.

2) Data utilization (1)Animal-disease data has characteristics that: • This is a data set that indicates the type of animals such as dogs and cats, the degree of symptoms from 1 to 5, and includes whether the condition is dangerous. (2)Animal-disease data can be used to: • Animal health monitoring: Data supports the development of monitoring systems that track the health of animals over time and provide warnings of potentially hazardous situations. • Veterinary diagnostics: Using this data, veterinarians can develop tools to detect critical conditions in animals early and enable timely and effective therapeutic interventions.
c
SX Network Price Prediction Data
coinbase.com
Updated Sep 18, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). SX Network Price Prediction Data [Dataset]. https://www.coinbase.com/en-ar/price-prediction/sportx
Explore at:
Dataset updated
Sep 18, 2025
Variables measured
Growth Rate, Predicted Price
Measurement technique
User-defined projections based on compound growth. This is not a formal financial forecast.
Description
This dataset contains the predicted prices of SX Network for the upcoming years based on user-defined projections.
Lower Back Pain Symptoms Dataset(labelled)
kaggle.com
Updated Dec 5, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ali Hussain (2017). Lower Back Pain Symptoms Dataset(labelled) [Dataset]. https://www.kaggle.com/datasets/alihussain1993/lower-back-pain-symptoms-datasetlabelled
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 5, 2017
Dataset provided by
Kaggle
Authors
Ali Hussain
Description
310 Observations, 13 Attributes (12 Numeric Predictors, 1 Binary Class Attribute - No Demographics)

Lower back pain can be caused by a variety of problems with any parts of the complex, interconnected network of spinal muscles, nerves, bones, discs or tendons in the lumbar spine. Typical sources of low back pain include:

The large nerve roots in the low back that go to the legs may be irritated The smaller nerves that supply the low back may be irritated The large paired lower back muscles (erector spinae) may be strained The bones, ligaments or joints may be damaged An intervertebral disc may be degenerating An irritation or problem with any of these structures can cause lower back pain and/or pain that radiates or is referred to other parts of the body. Many lower back problems also cause back muscle spasms, which don't sound like much but can cause severe pain and disability.

While lower back pain is extremely common, the symptoms and severity of lower back pain vary greatly. A simple lower back muscle strain might be excruciating enough to necessitate an emergency room visit, while a degenerating disc might cause only mild, intermittent discomfort.

This data set is about to identify a person is abnormal or normal using collected physical spine details/data.
f
Descriptive statistics for the characteristics and associations with anxiety...
datasetcatalog.nlm.nih.gov
figshare.com
Updated Jul 22, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gong, Yanhong; Yang, Guoan; Zhuang, Runsen; Han, Tieguang; Chen, Wei; Chen, Yuqi; Yin, Xiaoxv; Dib, Hassan H.; Tong, Xinyue; Lu, Zuxun (2014). Descriptive statistics for the characteristics and associations with anxiety and depressive symptoms of the participants. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001179926
Explore at:
Dataset updated
Jul 22, 2014
Authors
Gong, Yanhong; Yang, Guoan; Zhuang, Runsen; Han, Tieguang; Chen, Wei; Chen, Yuqi; Yin, Xiaoxv; Dib, Hassan H.; Tong, Xinyue; Lu, Zuxun
Description
Note: P values are associated with Chi-square tests.aSD is standard deviation.bP value is associated with analysis of variance.cP value is associated with Cochran-Mantel-Haenszel statistics.
w
.sx.cn TLD Whois Database | Whois Data Center
whoisdatacenter.com
csv
Updated Jul 19, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AllHeart Web Inc (2024). .sx.cn TLD Whois Database | Whois Data Center [Dataset]. https://whoisdatacenter.com/tld/.sx.cn/
Explore at:
csvAvailable download formats
Dataset updated
Jul 19, 2024
Dataset authored and provided by
AllHeart Web Inc
License
https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/
Time period covered
Sep 22, 2025 - Dec 30, 2025
Description
.SX.CN Whois Database, discover comprehensive ownership details, registration dates, and more for .SX.CN TLD with Whois Data Center.
i
Data from: Disease Prediction Dataset
ieee-dataport.org
Updated Feb 20, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ayush Nautiyal (2025). Disease Prediction Dataset [Dataset]. https://ieee-dataport.org/documents/disease-prediction-dataset
Explore at:
Dataset updated
Feb 20, 2025
Authors
Ayush Nautiyal
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains symptoms and disease information. It contains total of 1325 symptoms covered with 391 disease.This dataset is refernced from website MedLinePlus. This dataset have training and testing dataset and can be used to train disease prediction algorithm . It is created on own for project disease prediction and do not involves any funding or promotional terms.
e
Temporal associations between environmental tobacco smoke exposure and...
b2find.eudat.eu
Updated Nov 10, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Temporal associations between environmental tobacco smoke exposure and nicotine dependence symptoms - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/45e6b5ff-2741-5799-9d01-93a5b223c0a8
Explore at:
Dataset updated
Nov 10, 2024
Description
Three datasets described in manuscript are preprocessed data (imputed). One meta-data file for all (variables are the same), and R codes used for analyses can be found here. If you want to replicate main analyses of the findings, you can use the R script in 'Analyses_EMA_Data.html' and two pre-processed datasets: 'dfimp_smokers.csv' and 'dfimp_nonsmokers.csv'. The follow-up analyses were done on dataset dfimp_smokerswith.csv'. Please see 'Follow_up_Analyses_EMA_Data.html'. This should enable replication. R scripts from 'Preparation_descriptives_EMA_Data.html' were preparatory steps with some descriptive statistics on non-anonymized dataset that is not shared, this does provide some insight into construction and presented descriptives. Link to same and other materials is: https://doi.org/10.17605/OSF.IO/J2WQK

Facebook

Twitter

Click to copy link

Link copied

Cite

Centers for Disease Control and Prevention (2025). Indicators of Anxiety or Depression Based on Reported Frequency of Symptoms During Last 7 Days [Dataset]. https://catalog.data.gov/dataset/indicators-of-anxiety-or-depression-based-on-reported-frequency-of-symptoms-during-last-7-

Indicators of Anxiety or Depression Based on Reported Frequency of Symptoms During Last 7 Days

Explore at:

16 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Apr 23, 2025

Dataset provided by

Centers for Disease Control and Preventionhttp://www.cdc.gov/

Description

The U.S. Census Bureau, in collaboration with five federal agencies, launched the Household Pulse Survey to produce data on the social and economic impacts of Covid-19 on American households. The Household Pulse Survey was designed to gauge the impact of the pandemic on employment status, consumer spending, food security, housing, education disruptions, and dimensions of physical and mental wellness. The survey was designed to meet the goal of accurate and timely weekly estimates. It was conducted by an internet questionnaire, with invitations to participate sent by email and text message. The sample frame is the Census Bureau Master Address File Data. Housing units linked to one or more email addresses or cell phone numbers were randomly selected to participate, and one respondent from each housing unit was selected to respond for him or herself. Estimates are weighted to adjust for nonresponse and to match Census Bureau estimates of the population by age, sex, race and ethnicity, and educational attainment. All estimates shown meet the NCHS Data Presentation Standards for Proportions,

Clear search

Close search

Google apps

Main menu

Indicators of Anxiety or Depression Based on Reported Frequency of Symptoms...

SrTi(1-x)V(x)O3 SX data - Datasets - data.bris

Stack Exchange Graphs (SNAP)

Ask Ubuntu temporal network

Notes on inclusion into the SuiteSparse Matrix Collection, July 2018:

Math Overflow temporal network

Prevalence of long COVID symptoms and COVID-19 complications

Post-COVID Conditions

ICP Forests Defoliation and Symptoms Data Set

Prevalence of symptoms and impact of respiratory infections, UK, dataset: 10...

SX Network Price Prediction Data

Health and beauty shops Business Data for SX

Prevalence of ongoing symptoms following coronavirus (COVID-19) infection in...

Patient-reported outcomes via electronic health record portal vs. telephone:...

Malaria disease and grading system dataset from public hospitals reflecting...

A Brazilian dataset of symptomatic patients for screening the risk of...

Animal Condition Classification Dataset

SX Network Price Prediction Data

Lower Back Pain Symptoms Dataset(labelled)

Descriptive statistics for the characteristics and associations with anxiety...

.sx.cn TLD Whois Database | Whois Data Center

Data from: Disease Prediction Dataset

Temporal associations between environmental tobacco smoke exposure and...

Indicators of Anxiety or Depression Based on Reported Frequency of Symptoms During Last 7 DaysSee More Versions

Indicators of Anxiety or Depression Based on Reported Frequency of Symptoms During Last 7 Days