Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
About Dataset Safa S. Abdul-Jabbar, Alaa k. Farhan
Context This is the first Dataset for various ordinary patients in Iraq. The Dataset provides the patients’ Cell Blood Count test information that can be used to create a Hematology diagnosis/prediction system. Also, this Data was collected in 2022 from Al-Zahraa Al-Ahly Hospital. These data can be cleaned & analyzed using any programming language because it is provided in an excel file that can be accessed and manipulated easily. The user just needs to understand how rows and columns are arranged because the data was collected as images(CBC images) from the laboratories and then stored the extracted data in an excel file. Content This Dataset contains 500 rows. For each row (patient information), there are 21 columns containing CBC test features that can be described as follows:
ID: Patients Identifier
WBC: White Blood Cell, Normal Ranges: 4.0 to 10.0, Unit: 10^9/L.
LYMp: Lymphocytes percentage, which is a type of white blood cell, Normal Ranges: 20.0 to 40.0, Unit: %
MIDp: Indicates the percentage combined value of the other types of white blood cells not classified as lymphocytes or granulocytes, Normal Ranges: 1.0 to 15.0, Unit: %
NEUTp: Neutrophils are a type of white blood cell (leukocytes); neutrophils percentage, Normal Ranges: 50.0 to 70.0, Unit: %
LYMn: Lymphocytes number are a type of white blood cell, Normal Ranges: 0.6 to 4.1, Unit: 10^9/L.
MIDn: Indicates the combined number of other white blood cells not classified as lymphocytes or granulocytes, Normal Ranges: 0.1 to 1.8, Unit: 10^9/L.
NEUTn: Neutrophils Number, Normal Ranges: 2.0 to 7.8, Unit: 10^9/L.
RBC: Red Blood Cell, Normal Ranges: 3.50 to 5.50, Unit: 10^12/L
HGB: Hemoglobin, Normal Ranges: 11.0 to 16.0, Unit: g/dL
HCT: Hematocrit is the proportion, by volume, of the Blood that consists of red blood cells, Normal Ranges: 36.0 to 48.0, Unit: %
MCV: Mean Corpuscular Volume, Normal Ranges: 80.0 to 99.0, Unit: fL
MCH: Mean Corpuscular Hemoglobin is the average amount of haemoglobin in the average red cell, Normal Ranges: 26.0 to 32.0, Unit: pg
MCHC: Mean Corpuscular Hemoglobin Concentration, Normal Ranges: 32.0 to 36.0, Unit: g/dL
RDWSD: Red Blood Cell Distribution Width, Normal Ranges: 37.0 to 54.0, Unit: fL
RDWCV: Red blood cell distribution width, Normal Ranges: 11.5 to 14.5, Unit: %
PLT: Platelet Count, Normal Ranges: 100 to 400, Unit: 10^9/L
MPV: Mean Platelet Volume, Normal Ranges: 7.4 to 10.4, Unit: fL
PDW: Red Cell Distribution Width, Normal Ranges: 10.0 to 17.0, Unit: %
PCT: The level of Procalcitonin in the Blood, Normal Ranges: 0.10 to 0.28, Unit: %
PLCR: Platelet Large Cell Ratio, Normal Ranges: 13.0 to 43.0, Unit: %
Acknowledgements We thank the entire Al-Zahraa Al-Ahly Hospital Hospital team, especially the hospital manager, for cooperating with us in collecting this data while maintaining patients' confidentiality.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
** RD DATASET ** RD dataset was created by the images from the melanoma community on the internet (https://reddit.com/r/melanoma). Consecutive images were included using a python library (https://github.com/aliparlakci/bulk-downloader-for-reddit) from Jan 25, 2020, to July 30, 2021. The ground truth was voted by four dermatologists and one plastic surgeon while referring to the chief complaint and brief history. A total of 1,282 images (1,201 cases) were finally included. Because of the deleted cases by users, the links of 860 cases are valid in July 2021.
RD_RAW.xlsx The download links and ground truth of the RD dataset are included in this excel file. In addition, the raw data of the AI (Model Dermatology Build2021 - https://modelderm.com) and 32 laypersons were included.
v1_public.zip "v1_public.zip" includes the 1,282 lesional images (full-size). The 24 images that were excluded from the study are also available.
v1_private.zip is not available here. Wide field images are not available here. If the archive is needed for research purpose, please email to Dr. Han Seung Seog (whria78@gmail.com) or Dr Cristian Navarrete-Dechent (ctnavarr@gmail.com).
References - The Degradation of Performance of a State-of-the-art Skin Image Classifier When Applied to Patient-driven Internet Search - Scientific Report (in-press)
** Background normal test with the ISIC images ** ISIC dataset (https://www.isic-archive.com; Gallery -> 2018 JID Editorial images; 99 images; ISIC_0024262 and ISIC_0024261 are identical images and ISIC_0024262 was skipped) was used for the background normal test. We defined 10% area rectangle crop to “specialist-size crop”, and 5% area rectangle crop to “layperson-size crop” a) S-crops.zip: specialist-size crops Format: CROPNO_AGE(0~99)_GENDER(1=male,0=female)[m]_FILENAME.png b) L-crops.zip: layperson-size crops Format: CROPNO_AGE(0~99)_GENDER(1=male,0=female)[m]_FILENAME.png c) result_S.zip: Background normal test result using the specialist-size crops d) result_L.zip; Background normal test result using the layperson-size crops
Reference - Automated Dermatological Diagnosis: Hype or Reality? - https://doi.org/10.1016/j.jid.2018.04.040 - Multiclass Artificial Intelligence in Dermatology: Progress but Still Room for Improvement - https://doi.org/10.1016/j.jid.2020.06.040
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This folder contains the material of an experiment that compares MDD and CDD with respect to Bug Localization tasks.
The material in this folder* is organized in three subfolders: 01_EXPERIMENT, which contains the information used to run the experiment; 02_RESULTS, which contains the information collected in the experiment; and 03_STATISTICAL ANALYSIS, which contains the data set and the results of the statistical analysis performed.
The documentation contained in each of these folders is described below:
01_EXPERIMENT
11_ CdEUSJ_FavorableReport: pdf file of the Favorable report from the ethical and scientific committee regarding the execution of the experiment.
12_Forms&Tables: This folder contains the Taks Sheet for each group and the scenarios of each task and each group.
13_SupportMaterial: This folder contains the support material that the subjects could use during the performance of the different tasks of the experiment.
14_SessionMaterial: This folder contains the material used by the instructors during the session, including the video tutorial.
15_CorrecctionMaterial: This folder contains the solution to the tasks and the correction template.
02_RESULTS
21_RESULTS: Excel file with the data extracted from the Forms and the pre-processed results before the statistical analysis.
22_SubjectComments&FocusGroup. Pdf file with the analysis of the subjects' comments on the forms, and the transcription of the comments during the sessions and in the focus group.
03_STATISTICAL ANALYSIS
301_DATASET: Data files containing the values of variables and factors necessary for conducting the statistical analysis proposed in the study. They are included in both IBM SPSS Statistics (.sav) and Microsoft Excel (.xlsx) formats.
302_DEMOGRAPHIC: Excel file with the demographic results.
303_DESCRIPTIVES: Files that contain the values of the main descriptive statistics and the results of the normality tests that correspond to all the variables measured in the study: the response variables and the independent variables or factors. They are included in both IBM SPSS Statistics (.spv) and PDF Reader (.pdf) formats.
304_LMM: Files that contain the execution and results of the LMM Type III test of fixed effects with unstructured repeated covariance for all the variables in the study with different statistical models. They are included in both IBM SPSS Statistics (.spv) and PDF Reader (.pdf) formats.
305_DATASET_BugLocalization.xlsx_lmmRes: Files that contain the dataset that includes the residuals of LMM executions. They are included in both IBM SPSS Statistics (.sav) and PDF Reader (.xls) formats.
306_NORMALITYTEST: Files that contain the dataset that includes the normality test of the residuals of LMM executions. They are included in both IBM SPSS Statistics (.spv) and PDF Reader (.pdf) formats.
307_EFFECTSIZE-Cohend: Files that include the computations executed to determine the effect size of the factors in all the dependent variables. They are included in both IBM SPSS Statistics (.spv) and PDF Reader (.pdf) formats.
308_DATASET_BugLocalization.xlsx_lmm_student_Res: Files that contain the dataset that includes the residuals of LMM executions for non-experienced subjects (students) in the study. They are included in both IBM SPSS Statistics (.sav) and PDF Reader (.xls) formats.
309_LMM_Students: Files that contain the execution and results of the LMM tests (Type III test of fixed effects with unstructured repeated covariance) for non-experienced subjects (students) in the study for all the variables in the study with different statistical models, the normality analysis of the residuals and the computations executed to determine the effect size of the factors considered in each dependent variable. They are included in both IBM SPSS Statistics (.spv) and PDF Reader (.pdf) formats.
310_DATASET_BugLocalization.xlsx_lmm_professionals_Res.xlsx: Files that contain the dataset that includes the residuals of LMM executions for experienced subjects (professionals) in the study. They are included in both IBM SPSS Statistics (.sav) and PDF Reader (.xls) formats.
311_LMM_Profesionales: Files that contain the execution and results of the LMM tests (Type III test of fixed effects with unstructured repeated covariance) for the experienced subjects (professionals) in the study for all the variables in the study with different statistical models, the normality analysis of the residuals and the computations executed to determine the effect size of the factors considered in each dependent variable. They are included in both IBM SPSS Statistics (.spv) and PDF Reader (.pdf) formats.
312_Boxplots: Files that contain the execution and results of the boxplots by method (MDD/CDD) and by Experience (students/professionals). They are included in both IBM SPSS Statistics (.spv) and PDF Reader (.pdf) formats.
*Note: The documentation that appears in this folder contains texts in Spanish, since this is the language in which the experiment is executed.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Five files, one of which is a ZIP archive, containing data that support the findings of this study. PDF file "IA screenshots CSU Libraries search config" contains screenshots captured from the Internet Archive's Wayback Machine for all 24 CalState libraries' homepages for years 2017 - 2019. Excel file "CCIHE2018-PublicDataFile" contains Carnegie Classifications data from the Indiana University Center for Postsecondary Research for all of the CalState campuses from 2018. CSV file "2017-2019_RAW" contains the raw data exported from Ex Libris Primo Analytics (OBIEE) for all 24 CalState libraries for calendar years 2017 - 2019. CSV file "clean_data" contains the cleaned data from Primo Analytics which was used for all subsequent analysis such as charting and import into SPSS for statistical testing. ZIP archive file "NonparametricStatisticalTestsFromSPSS" contains 23 SPSS files [.spv format] reporting the results of testing conducted in SPSS. This archive includes things such as normality check, descriptives, and Kruskal-Wallis H-test results.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This replication package contains the following materials:
For any issues, questions, or further assistance, please do not hesitate to contact the authors of the paper. We are here to help!
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This replication package contains the following materials:
For any issues, questions, or further assistance, please do not hesitate to contact the authors of the paper. We are here to help!
Additional file 4: Results of the replicated PCR experiments to test for normality of Ct distribution. An Excel file that lists the 9 GPCRs that were used for replicated PCR experiments on genomic DNA as well as the results of the statistical analysis conducted to determine if Ct distributions displayed on Additional files 5, 6, 7, 8, 9, 10, 11, 12 and 13 are Gaussian (XLS 12 KB)
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Materials and Methods The study was held in the Oral and Maxillofacial Surgery department and Kasturba Hospital, Manipal, from November 2019 to October 2021 after approval from the Institutional Ethics Committee (IEC: 924/2019). The study included patients between 18-70 years. Patients with associated diseases like cysts or tumors of the jaw bones, pregnant women, and those with underlying psychological issues were excluded from the study. The patients were assessed 8-12 weeks after surgical intervention. A data schedule was prepared to document age, sex, and fracture type. The study consisted of 182 subjects divided into two groups of 91 each (Group A: Mild to moderate facial injury and Group B: Severe facial injury) based on the severity of maxillofacial fractures and facial injury. Informed consent was obtained from each of the study participants. We followed Facial Injury Severity Scale (FISS) to determine the severity of facial fractures and injuries. The face is divided horizontally into the mandibular, mid-facial, and upper facial thirds. Fractures in these thirds are given points based on their type (Table 1). Injuries with a total score above 4.4 were considered severe facial injuries (Group A), and those with a total score below 4.4 were considered mild/ moderate facial injuries (Group B). The QOL was compared between the two groups. Meticulous management of hard and soft tissue injuries in our state-of-the-art tertiary care hospital was implemented. All elective cases were surgically treated at least 72 hours after the initial trauma. The facial fractures were adequately reduced and fixed with high–end Titanium miniplates and screws (AO Principles of Fracture Management). Soft tissue injuries were managed by wound debridement, removal of foreign bodies, and layered wound closure. Adequate pain-relieving medication was prescribed to the patients postoperatively for effective pain control. The QOL of the subjects was assessed using the 'Twenty-point Quality of life assessment in facial trauma patients in Indian population' assessment tool. This tool contains 20 questions and uses a five-point Likert response scale. The Twenty – point quality of life assessment tool included two zones: Zone 1 (Psychosocial impact) and Zone 2 (Functional and esthetic impact), with ten questions (domains) each (Table 2). The scores for each question ranged from 1- 5, the higher score denoting better Quality of life. Accordingly, the score in each zone for a patient ranged from 10 -50, and the total scores of both zones were recorded to determine the QOL. The sum of both zones determined the prognosis following surgery (Table 2). The data collected was entered into a Microsoft Excel spreadsheet and analyzed using IBM SPSS Statistics, Version 22(Armonk, NY: IBM Corp). Descriptive data were presented in the form of frequency and percentage for categorical variables and in the form of mean, median, standard deviation, and quartiles for continuous variables. Since the data were not following normal distribution, a non-parametric test was used. QOL scores were compared between the study groups using the Mann-Whitney U test. P value < 0.05 was considered statistically significant.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Subgroup analysis based on gender and major.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
About Dataset Safa S. Abdul-Jabbar, Alaa k. Farhan
Context This is the first Dataset for various ordinary patients in Iraq. The Dataset provides the patients’ Cell Blood Count test information that can be used to create a Hematology diagnosis/prediction system. Also, this Data was collected in 2022 from Al-Zahraa Al-Ahly Hospital. These data can be cleaned & analyzed using any programming language because it is provided in an excel file that can be accessed and manipulated easily. The user just needs to understand how rows and columns are arranged because the data was collected as images(CBC images) from the laboratories and then stored the extracted data in an excel file. Content This Dataset contains 500 rows. For each row (patient information), there are 21 columns containing CBC test features that can be described as follows:
ID: Patients Identifier
WBC: White Blood Cell, Normal Ranges: 4.0 to 10.0, Unit: 10^9/L.
LYMp: Lymphocytes percentage, which is a type of white blood cell, Normal Ranges: 20.0 to 40.0, Unit: %
MIDp: Indicates the percentage combined value of the other types of white blood cells not classified as lymphocytes or granulocytes, Normal Ranges: 1.0 to 15.0, Unit: %
NEUTp: Neutrophils are a type of white blood cell (leukocytes); neutrophils percentage, Normal Ranges: 50.0 to 70.0, Unit: %
LYMn: Lymphocytes number are a type of white blood cell, Normal Ranges: 0.6 to 4.1, Unit: 10^9/L.
MIDn: Indicates the combined number of other white blood cells not classified as lymphocytes or granulocytes, Normal Ranges: 0.1 to 1.8, Unit: 10^9/L.
NEUTn: Neutrophils Number, Normal Ranges: 2.0 to 7.8, Unit: 10^9/L.
RBC: Red Blood Cell, Normal Ranges: 3.50 to 5.50, Unit: 10^12/L
HGB: Hemoglobin, Normal Ranges: 11.0 to 16.0, Unit: g/dL
HCT: Hematocrit is the proportion, by volume, of the Blood that consists of red blood cells, Normal Ranges: 36.0 to 48.0, Unit: %
MCV: Mean Corpuscular Volume, Normal Ranges: 80.0 to 99.0, Unit: fL
MCH: Mean Corpuscular Hemoglobin is the average amount of haemoglobin in the average red cell, Normal Ranges: 26.0 to 32.0, Unit: pg
MCHC: Mean Corpuscular Hemoglobin Concentration, Normal Ranges: 32.0 to 36.0, Unit: g/dL
RDWSD: Red Blood Cell Distribution Width, Normal Ranges: 37.0 to 54.0, Unit: fL
RDWCV: Red blood cell distribution width, Normal Ranges: 11.5 to 14.5, Unit: %
PLT: Platelet Count, Normal Ranges: 100 to 400, Unit: 10^9/L
MPV: Mean Platelet Volume, Normal Ranges: 7.4 to 10.4, Unit: fL
PDW: Red Cell Distribution Width, Normal Ranges: 10.0 to 17.0, Unit: %
PCT: The level of Procalcitonin in the Blood, Normal Ranges: 0.10 to 0.28, Unit: %
PLCR: Platelet Large Cell Ratio, Normal Ranges: 13.0 to 43.0, Unit: %
Acknowledgements We thank the entire Al-Zahraa Al-Ahly Hospital Hospital team, especially the hospital manager, for cooperating with us in collecting this data while maintaining patients' confidentiality.