100+ datasets found

m
Raw data outputs 1-18
bridges.monash.edu
researchdata.edu.au
xlsx
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Abbas Salavaty Hosein Abadi; Sara Alaei; Mirana Ramialison; Peter Currie (2023). Raw data outputs 1-18 [Dataset]. http://doi.org/10.26180/21259491.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.26180/21259491.v1
Dataset updated
May 30, 2023
Dataset provided by
Monash University
Authors
Abbas Salavaty Hosein Abadi; Sara Alaei; Mirana Ramialison; Peter Currie
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Raw data outputs 1-18 Raw data output 1. Differentially expressed genes in AML CSCs compared with GTCs as well as in TCGA AML cancer samples compared with normal ones. This data was generated based on the results of AML microarray and TCGA data analysis. Raw data output 2. Commonly and uniquely differentially expressed genes in AML CSC/GTC microarray and TCGA bulk RNA-seq datasets. This data was generated based on the results of AML microarray and TCGA data analysis. Raw data output 3. Common differentially expressed genes between training and test set samples the microarray dataset. This data was generated based on the results of AML microarray data analysis. Raw data output 4. Detailed information on the samples of the breast cancer microarray dataset (GSE52327) used in this study. Raw data output 5. Differentially expressed genes in breast CSCs compared with GTCs as well as in TCGA BRCA cancer samples compared with normal ones. Raw data output 6. Commonly and uniquely differentially expressed genes in breast cancer CSC/GTC microarray and TCGA BRCA bulk RNA-seq datasets. This data was generated based on the results of breast cancer microarray and TCGA BRCA data analysis. CSC, and GTC are abbreviations of cancer stem cell, and general tumor cell, respectively. Raw data output 7. Differential and common co-expression and protein-protein interaction of genes between CSC and GTC samples. This data was generated based on the results of AML microarray and STRING database-based protein-protein interaction data analysis. CSC, and GTC are abbreviations of cancer stem cell, and general tumor cell, respectively. Raw data output 8. Differentially expressed genes between AML dormant and active CSCs. This data was generated based on the results of AML scRNA-seq data analysis. Raw data output 9. Uniquely expressed genes in dormant or active AML CSCs. This data was generated based on the results of AML scRNA-seq data analysis. Raw data output 10. Intersections between the targeting transcription factors of AML key CSC genes and differentially expressed genes between AML CSCs vs GTCs and between dormant and active AML CSCs or the uniquely expressed genes in either class of CSCs. Raw data output 11. Targeting desirableness score of AML key CSC genes and their targeting transcription factors. These scores were generated based on an in-house scoring function described in the Methods section. Raw data output 12. CSC-specific targeting desirableness score of AML key CSC genes and their targeting transcription factors. These scores were generated based on an in-house scoring function described in the Methods section. Raw data output 13. The protein-protein interactions between AML key CSC genes with themselves and their targeting transcription factors. This data was generated based on the results of AML microarray and STRING database-based protein-protein interaction data analysis. Raw data output 14. The previously confirmed associations of genes having the highest targeting desirableness and CSC-specific targeting desirableness scores with AML or other cancers’ (stem) cells as well as hematopoietic stem cells. These data were generated based on a PubMed database-based literature mining. Raw data output 15. Drug score of available drugs and bioactive small molecules targeting AML key CSC genes and/or their targeting transcription factors. These scores were generated based on an in-house scoring function described in the Methods section. Raw data output 16. CSC-specific drug score of available drugs and bioactive small molecules targeting AML key CSC genes and/or their targeting transcription factors. These scores were generated based on an in-house scoring function described in the Methods section. Raw data output 17. Candidate drugs for experimental validation. These drugs were selected based on their respective (CSC-specific) drug scores. CSC is the abbreviation of cancer stem cell. Raw data output 18. Detailed information on the samples of the AML microarray dataset GSE30375 used in this study.
f
raw data+statistical analysis.xlsx
figshare.com
xlsx
Updated Nov 14, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Guangwei Wang (2022). raw data+statistical analysis.xlsx [Dataset]. http://doi.org/10.6084/m9.figshare.21551916.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.21551916.v1
Dataset updated
Nov 14, 2022
Dataset provided by
figshare
Authors
Guangwei Wang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
sheet1 raw data sheet 2 base line sheet3 subgroup raw data sheet4 results of statistical analysis
Scooter Sales - Excel Project
kaggle.com
Updated Jun 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ann Truong (2023). Scooter Sales - Excel Project [Dataset]. https://www.kaggle.com/datasets/bvanntruong/scooter-sales-excel-project
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 8, 2023
Dataset provided by
Kaggle
Authors
Ann Truong
Description
The link for the Excel project to download can be found on GitHub here. It includes the raw data, Pivot Tables, and an interactive dashboard with Pivot Charts and Slicers. The project also includes business questions and the formulas I used to answer. The image below is included for ease. https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12904052%2F61e460b5f6a1fa73cfaaa33aa8107bd5%2FBusinessQuestions.png?generation=1686190703261971&alt=media" alt=""> The link for the Tableau adjusted dashboard can be found here.

A screenshot of the interactive Excel dashboard is also included below for ease. https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12904052%2Fe581f1fce8afc732f7823904da9e4cce%2FScooter%20Dashboard%20Image.png?generation=1686190815608343&alt=media" alt="">
f
UC_vs_US Statistic Analysis.xlsx
figshare.com
xlsx
Updated Jul 9, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
F. (Fabiano) Dalpiaz (2020). UC_vs_US Statistic Analysis.xlsx [Dataset]. http://doi.org/10.23644/uu.12631628.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.23644/uu.12631628.v1
Dataset updated
Jul 9, 2020
Dataset provided by
Utrecht University
Authors
F. (Fabiano) Dalpiaz
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Sheet 1 (Raw-Data): The raw data of the study is provided, presenting the tagging results for the used measures described in the paper. For each subject, it includes multiple columns: A. a sequential student ID B an ID that defines a random group label and the notation C. the used notation: user Story or use Cases D. the case they were assigned to: IFA, Sim, or Hos E. the subject's exam grade (total points out of 100). Empty cells mean that the subject did not take the first exam F. a categorical representation of the grade L/M/H, where H is greater or equal to 80, M is between 65 included and 80 excluded, L otherwise G. the total number of classes in the student's conceptual model H. the total number of relationships in the student's conceptual model I. the total number of classes in the expert's conceptual model J. the total number of relationships in the expert's conceptual model K-O. the total number of encountered situations of alignment, wrong representation, system-oriented, omitted, missing (see tagging scheme below) P. the researchers' judgement on how well the derivation process explanation was explained by the student: well explained (a systematic mapping that can be easily reproduced), partially explained (vague indication of the mapping ), or not present.

Tagging scheme: Aligned (AL) - A concept is represented as a class in both models, either

with the same name or using synonyms or clearly linkable names; Wrongly represented (WR) - A class in the domain expert model is incorrectly represented in the student model, either (i) via an attribute, method, or relationship rather than class, or (ii) using a generic term (e.g., user'' instead ofurban planner''); System-oriented (SO) - A class in CM-Stud that denotes a technical implementation aspect, e.g., access control. Classes that represent legacy system or the system under design (portal, simulator) are legitimate; Omitted (OM) - A class in CM-Expert that does not appear in any way in CM-Stud; Missing (MI) - A class in CM-Stud that does not appear in any way in CM-Expert.

All the calculations and information provided in the following sheets

originate from that raw data.

Sheet 2 (Descriptive-Stats): Shows a summary of statistics from the data collection,

including the number of subjects per case, per notation, per process derivation rigor category, and per exam grade category.

Sheet 3 (Size-Ratio):

The number of classes within the student model divided by the number of classes within the expert model is calculated (describing the size ratio). We provide box plots to allow a visual comparison of the shape of the distribution, its central value, and its variability for each group (by case, notation, process, and exam grade) . The primary focus in this study is on the number of classes. However, we also provided the size ratio for the number of relationships between student and expert model.

Sheet 4 (Overall):

Provides an overview of all subjects regarding the encountered situations, completeness, and correctness, respectively. Correctness is defined as the ratio of classes in a student model that is fully aligned with the classes in the corresponding expert model. It is calculated by dividing the number of aligned concepts (AL) by the sum of the number of aligned concepts (AL), omitted concepts (OM), system-oriented concepts (SO), and wrong representations (WR). Completeness on the other hand, is defined as the ratio of classes in a student model that are correctly or incorrectly represented over the number of classes in the expert model. Completeness is calculated by dividing the sum of aligned concepts (AL) and wrong representations (WR) by the sum of the number of aligned concepts (AL), wrong representations (WR) and omitted concepts (OM). The overview is complemented with general diverging stacked bar charts that illustrate correctness and completeness.

For sheet 4 as well as for the following four sheets, diverging stacked bar

charts are provided to visualize the effect of each of the independent and mediated variables. The charts are based on the relative numbers of encountered situations for each student. In addition, a "Buffer" is calculated witch solely serves the purpose of constructing the diverging stacked bar charts in Excel. Finally, at the bottom of each sheet, the significance (T-test) and effect size (Hedges' g) for both completeness and correctness are provided. Hedges' g was calculated with an online tool: https://www.psychometrica.de/effect_size.html. The independent and moderating variables can be found as follows:

Sheet 5 (By-Notation):

Model correctness and model completeness is compared by notation - UC, US.

Sheet 6 (By-Case):

Model correctness and model completeness is compared by case - SIM, HOS, IFA.

Sheet 7 (By-Process):

Model correctness and model completeness is compared by how well the derivation process is explained - well explained, partially explained, not present.

Sheet 8 (By-Grade):

Model correctness and model completeness is compared by the exam grades, converted to categorical values High, Low , and Medium.
Z
Quantitative raw data for "Large scale regional citizen surveys report"...
data.niaid.nih.gov
zenodo.org
+1more
Updated Feb 3, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Panori, Anastasia; Bakratsas, Thomas; Chapizanis, Dimitrios; Altsitsiadis, Efthymios; Hauschildt, Christian (2022). Quantitative raw data for "Large scale regional citizen surveys report" (D1.4) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5958017
Explore at:
Dataset updated
Feb 3, 2022
Dataset provided by
White Research SRL
Authors
Panori, Anastasia; Bakratsas, Thomas; Chapizanis, Dimitrios; Altsitsiadis, Efthymios; Hauschildt, Christian
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset presents the quantitative raw data that was collected under the H2020 RRI2SCALE project for the D1.4 - “Large scale regional citizen surveys report”. The dataset includes the answers that were provided by almost 8,000 participants from 4 pilot European regions (Kriti, Vestland, Galicia, and Overijssel) regarding the general public's views, concerns, and moral issues about the current and future trajectories of their RTD&I ecosystem. The original survey questionnaire was created by White Research SRL and disseminated to the regions through supporting pilot partners. Data collection took place from June 2020 to September 2020 through 4 different waves – one for each region. Based on the conclusion of a consortium vote during the kick-off meeting, it was decided that instead of resource-intensive methods that would render data collection unduly expensive, to fill in the quotas responses were collected through online panels by survey companies that were used for each region. For the statistical analysis of the data and the conclusions drawn from the analysis, you can access the "Large scale regional citizen surveys report" (D1.4).
f
Raw data used for statistical analysis.
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated May 7, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hohenauer, Erich; Wellauer, Vanessa; Bianchi, Giannina; Riggi, Emilia; Clijsen, Ron (2025). Raw data used for statistical analysis. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0002095424
Explore at:
Dataset updated
May 7, 2025
Authors
Hohenauer, Erich; Wellauer, Vanessa; Bianchi, Giannina; Riggi, Emilia; Clijsen, Ron
Description
This study compared the effects of cold water immersion (CWI) and hot water immersion (HWI) on muscle recovery following a muscle-damaging exercise protocol in women. Thirty healthy women (23.3 ± 2.9 years) were randomly assigned to either the CWI, HWI, or control (CON) groups. Participants completed a standardised exercise protocol (5 x 20 drop-jumps), followed by a 10 min recovery intervention (CWI, HWI, or CON) immediately and 120 min post-exercise. Physiological responses, including muscle oxygen saturation (SmO2), core and skin temperature, and heart rate, were assessed at baseline, immediately post-exercise, after the first recovery intervention (postInt), and during 30 min follow-up. Recovery was evaluated through maximal voluntary isometric contraction, muscle swelling, muscle soreness ratings, and serum creatine kinase at baseline, 24, 48, and 72 h post-exercise. A mixed-effects model was used to account for repeated measures over time. Results showed lower SmO2 values in the CWI compared to the HWI group at 20 min (Δ-6.76%, CI: −0.27 to −13.25, p = 0.038) and 30 min (Δ-9.86%, CI: −3.37 to −16.35, p = 0.001), and compared to CON at 30 min (Δ-7.28%, CI: −13.77 to −0.79, p = 0.022). Core temperature was significantly higher in the HWI than the CWI group (postInt and 30 min), while it was significantly lower in the CWI group than CON (30 min). CWI caused a substantial decrease in skin temperature compared to HWI and CON between postInt and 30 min follow-up (all p < 0.001). Skin temperature was higher in the HWI group compared to CON at postInt and throughout 30 min follow-up (all p < 0.001). No significant differences in recovery markers were observed between CWI and HWI groups, although HWI led to slightly higher creatine kinase levels (24 h and 72 h) and greater muscle swelling (24 h) compared to CON. Despite distinct acute physiological responses to CWI and HWI, neither improved subjective or objective recovery outcomes during the 72 h follow-up compared to CON in women following a muscle-damaging exercise protocol.Trial registration numberNCT04902924 (ClinicalTrials.gov), SNCTP000004468 (Swiss National Clinical Trial Portal).

Cafe Sales - Dirty Data for Cleaning Training

kaggle.com

zip

Updated Jan 17, 2025

Facebook

Twitter

Click to copy link

Link copied

Cite

Ahmed Mohamed (2025). Cafe Sales - Dirty Data for Cleaning Training [Dataset]. https://www.kaggle.com/datasets/ahmedmohamed2003/cafe-sales-dirty-data-for-cleaning-training

Explore at:

zip(113510 bytes)Available download formats

Dataset updated

Jan 17, 2025

Authors

Ahmed Mohamed

License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

Dirty Cafe Sales Dataset

Overview

The Dirty Cafe Sales dataset contains 10,000 rows of synthetic data representing sales transactions in a cafe. This dataset is intentionally "dirty," with missing values, inconsistent data, and errors introduced to provide a realistic scenario for data cleaning and exploratory data analysis (EDA). It can be used to practice cleaning techniques, data wrangling, and feature engineering.

File Information

File Name: dirty_cafe_sales.csv
Number of Rows: 10,000
Number of Columns: 8

Columns Description

Column Name	Description	Example Values
`Transaction ID`	A unique identifier for each transaction. Always present and unique.	`TXN_1234567`
`Item`	The name of the item purchased. May contain missing or invalid values (e.g., "ERROR").	`Coffee`, `Sandwich`
`Quantity`	The quantity of the item purchased. May contain missing or invalid values.	`1`, `3`, `UNKNOWN`
`Price Per Unit`	The price of a single unit of the item. May contain missing or invalid values.	`2.00`, `4.00`
`Total Spent`	The total amount spent on the transaction. Calculated as `Quantity * Price Per Unit`.	`8.00`, `12.00`
`Payment Method`	The method of payment used. May contain missing or invalid values (e.g., `None`, "UNKNOWN").	`Cash`, `Credit Card`
`Location`	The location where the transaction occurred. May contain missing or invalid values.	`In-store`, `Takeaway`
`Transaction Date`	The date of the transaction. May contain missing or incorrect values.	`2023-01-01`

Data Characteristics

Missing Values:
- Some columns (e.g., Item, Payment Method, Location) may contain missing values represented as None or empty cells.
Invalid Values:
- Some rows contain invalid entries like "ERROR" or "UNKNOWN" to simulate real-world data issues.
Price Consistency:
- Prices for menu items are consistent but may have missing or incorrect values introduced.

Menu Items

The dataset includes the following menu items with their respective price ranges:

Item	Price($)
Coffee	2
Tea	1.5
Sandwich	4
Salad	5
Cake	3
Cookie	1
Smoothie	4
Juice	3

Use Cases

This dataset is suitable for: - Practicing data cleaning techniques such as handling missing values, removing duplicates, and correcting invalid entries. - Exploring EDA techniques like visualizations and summary statistics. - Performing feature engineering for machine learning workflows.

Cleaning Steps Suggestions

To clean this dataset, consider the following steps: 1. Handle Missing Values: - Fill missing numeric values with the median or mean. - Replace missing categorical values with the mode or "Unknown."

Handle Invalid Values:
- Replace invalid entries like "ERROR" and "UNKNOWN" with NaN or appropriate values.
Date Consistency:
- Ensure all dates are in a consistent format.
- Fill missing dates with plausible values based on nearby records.
Feature Engineering:
- Create new columns, such as Day of the Week or Transaction Month, for further analysis.

License

This dataset is released under the CC BY-SA 4.0 License. You are free to use, share, and adapt it, provided you give appropriate credit.

Feedback

If you have any questions or feedback, feel free to reach out through the dataset's discussion board on Kaggle.

Raw data and R code used in meta-analysis
figshare.com
txt
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gabriele Midolo (2023). Raw data and R code used in meta-analysis [Dataset]. http://doi.org/10.6084/m9.figshare.7694453.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7694453.v1
Dataset updated
Jun 1, 2023
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Gabriele Midolo
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Data and code for figures and statistics reported in Midolo, G., De Frenne, P., Hölzel, N. & Wellstein, C. (2019). Global patterns of intraspecific leaf trait responses to elevation.For a complete list reference of studies included in the meta-analysis, see the Supporting Information of our article.See the "csv_file_description" file for detailed information on dataset description and the R code to analyze the data.For any question, please contact me at: gabriele.midolo@natec.unibz.it
Raw data from datasets used in SIMON analysis
data.europa.eu
unknown
Updated Jan 27, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zenodo (2022). Raw data from datasets used in SIMON analysis [Dataset]. https://data.europa.eu/data/datasets/oai-zenodo-org-2580414?locale=hr
Explore at:
unknown(312591)Available download formats
Dataset updated
Jan 27, 2022
Dataset authored and provided by
Zenodohttp://zenodo.org/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Here you can find raw data and information about each of the 34 datasets generated by the mulset algorithm and used for further analysis in SIMON. Each dataset is stored in separate folder which contains 4 files: json_info: This file contains, number of features with their names and number of subjects that are available for the same dataset data_testing: data frame with data used to test trained model data_training: data frame with data used to train models results: direct unfiltered data from database Files are written in feather format. Here is an example of data structure for each file in repository. File was compressed using 7-Zip available at https://www.7-zip.org/.
f
Data from: Raw Data Files
datasetcatalog.nlm.nih.gov
Updated Jun 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eshima, Jarrett (2024). Raw Data Files [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001352154
Explore at:
Dataset updated
Jun 2, 2024
Authors
Eshima, Jarrett
Description
Supplementary raw data files associated with R scripts for data analysis (https://github.com/BSmithLab/Biodome). Files show peaks after filtering known contaminants.
CSV file used in statistical analyses
data.csiro.au
researchdata.edu.au
+1more
Updated Oct 13, 2014
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CSIRO (2014). CSV file used in statistical analyses [Dataset]. http://doi.org/10.4225/08/543B4B4CA92E6
Explore at:
Unique identifier
https://doi.org/10.4225/08/543B4B4CA92E6
Dataset updated
Oct 13, 2014
Dataset authored and provided by
CSIROhttp://www.csiro.au/
License
https://research.csiro.au/dap/licences/csiro-data-licence/https://research.csiro.au/dap/licences/csiro-data-licence/
Time period covered
Mar 14, 2008 - Jun 9, 2009
Dataset funded by
CSIROhttp://www.csiro.au/
Description
A csv file containing the tidal frequencies used for statistical analyses in the paper "Estimating Freshwater Flows From Tidally-Affected Hydrographic Data" by Dan Pagendam and Don Percival.
B
Data Cleaning Sample
borealisdata.ca
dataone.org
Updated Jul 13, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rong Luo (2023). Data Cleaning Sample [Dataset]. http://doi.org/10.5683/SP3/ZCN177
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.5683/SP3/ZCN177
Dataset updated
Jul 13, 2023
Dataset provided by
Borealis
Authors
Rong Luo
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Sample data for exercises in Further Adventures in Data Cleaning.
Data analysis project
kaggle.com
zip
Updated Aug 15, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
SzymonBiaas (2024). Data analysis project [Dataset]. https://www.kaggle.com/datasets/szymonbiaas/data-analysis-project
Explore at:
zip(116043378 bytes)Available download formats
Dataset updated
Aug 15, 2024
Authors
SzymonBiaas
Description
This dashboard was created from data published by Olist Store (a Brazilian e-commerce public dataset). Raw data contains information about 100 000 orders from 2016 to 2018 placed in many regions of Brazil.

The raw datasets were imported into Excel using “Get data” option (formerly known as “Power Query”) and cleaned. An additional table with the names of Brazilian states was also imported from the Wikipedia page.

A Data Table about payment information was created based on imported statistics with the usage of nested formulas. Then, proper pivot charts were used to build an Olist Store Payment Dashboard which allows you to review the data using a connected timeline and slicers.
f
Raw data for data analysis and graph construction at pre-treatment...
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Dec 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sithithaworn, Jiraporn; Worasith, Chanika; Techasen, Anchalee; Eamudomkarn, Chatanun; Tippayawat, Patcharaporn; Noordin, Rahmah; Pitaksakulrat, Opal; Wongphutorn, Phattharaphon; Kopolrat, Kulthida Y.; Sithithaworn, Paiboon; Crellen, Thomas; Hongsrichan, Nuttanan (2024). Raw data for data analysis and graph construction at pre-treatment (baseline) and post-treatment. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001346078
Explore at:
Dataset updated
Dec 3, 2024
Authors
Sithithaworn, Jiraporn; Worasith, Chanika; Techasen, Anchalee; Eamudomkarn, Chatanun; Tippayawat, Patcharaporn; Noordin, Rahmah; Pitaksakulrat, Opal; Wongphutorn, Phattharaphon; Kopolrat, Kulthida Y.; Sithithaworn, Paiboon; Crellen, Thomas; Hongsrichan, Nuttanan
Description
Raw data for data analysis and graph construction at pre-treatment (baseline) and post-treatment.
Retail data analysis project (excel)
kaggle.com
zip
Updated Dec 9, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Soe Yan Naung (2024). Retail data analysis project (excel) [Dataset]. https://www.kaggle.com/datasets/ericyang19/retail-data-analysis-project-excel
Explore at:
zip(4306415 bytes)Available download formats
Dataset updated
Dec 9, 2024
Authors
Soe Yan Naung
License
Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
Description
In this project, I conducted a comprehensive analysis of retail and warehouse sales data to derive actionable insights. The primary objective was to understand sales trends, evaluate performance across channels, and identify key contributors to overall business success.

To achieve this, I transformed raw data into interactive Excel dashboards that highlight sales performance and channel contributions, providing a clear and concise representation of business metrics.

Key Highlights of the Project:

Created two dashboards: Sales Dashboard and Contribution Dashboard. Answered critical business questions, such as monthly trends, channel performance, and top contributors. Presented actionable insights with professional visuals, making it easy for stakeholders to make data-driven decisions.
f
Data to follow the statistical analysis including raw data as CSV files.
datasetcatalog.nlm.nih.gov
figshare.com
+1more
Updated Feb 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fey, Philipp; Mörchel, Philipp; Haddad, Daniel; Jakob, Peter; Hansmann, Jan; Stebani, Jannik; Weber, Daniel Ludwig; Hiller, Karl-Heinz (2023). Data to follow the statistical analysis including raw data as CSV files. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000951591
Explore at:
Dataset updated
Feb 21, 2023
Authors
Fey, Philipp; Mörchel, Philipp; Haddad, Daniel; Jakob, Peter; Hansmann, Jan; Stebani, Jannik; Weber, Daniel Ludwig; Hiller, Karl-Heinz
Description
Data that was used to train the SVM. As the train-test data were assigned randomly for every training iteration, the individual data used for generating the subfigures b–e are not separately listed, as these cannot be manually recreated but depend on the train-test assignment by the algorithm. (ZIP)
f
Raw data analysis code; Regression analysis code from Age and sex influence...
datasetcatalog.nlm.nih.gov
rs.figshare.com
Updated May 26, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Wasser, Samuel K.; Ellis, Samuel; Nielsen, Mia L. K.; Balcomb, Kenneth C.; Weiss, Michael N.; Croft, Darren P.; Grimes, Charli; Ellifrit, David K.; Domenici, Paolo; Youngstrom, Sadie; Giles, Deborah A.; Franks, Daniel W.; Cant, Michael A. (2021). Raw data analysis code; Regression analysis code from Age and sex influence social interactions, but not associations, within a killer whale pod [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000760553
Explore at:
Dataset updated
May 26, 2021
Authors
Wasser, Samuel K.; Ellis, Samuel; Nielsen, Mia L. K.; Balcomb, Kenneth C.; Weiss, Michael N.; Croft, Darren P.; Grimes, Charli; Ellifrit, David K.; Domenici, Paolo; Youngstrom, Sadie; Giles, Deborah A.; Franks, Daniel W.; Cant, Michael A.
Description
R code to reproduce the construction of social networks from the raw data, bout analysis, data checking, and randomization-based null models, ;R code to reproduce dyadic and nodal regression analyses; uses the processed data from the aninet R package
f
Raw data and statistical analysis pertaining to use of alteplase in COVID 19...
datasetcatalog.nlm.nih.gov
figshare.com
Updated Aug 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jacob, Ipe (2021). Raw data and statistical analysis pertaining to use of alteplase in COVID 19 [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000844364
Explore at:
Dataset updated
Aug 16, 2021
Authors
Jacob, Ipe
Description
Raw data and statistical analysis pertaining to use of alteplase in COVID 19
Raw data of the specimens (1)
figshare.com
application/gzip
Updated Sep 25, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shiyu Du (2018). Raw data of the specimens (1) [Dataset]. http://doi.org/10.6084/m9.figshare.7127279.v1
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7127279.v1
Dataset updated
Sep 25, 2018
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Shiyu Du
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Raw data of the specimens in Insect Collection of Central South University of Forestry and Technology.
Raw data in SPSS Software
zenodo.org
Updated Jul 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Esubalew Tesfahun; Esubalew Tesfahun (2023). Raw data in SPSS Software [Dataset]. http://doi.org/10.5281/zenodo.8151987
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.8151987
Dataset updated
Jul 16, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Esubalew Tesfahun; Esubalew Tesfahun
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Raw data used for analysis

Facebook

Twitter

Click to copy link

Link copied

Cite

Abbas Salavaty Hosein Abadi; Sara Alaei; Mirana Ramialison; Peter Currie (2023). Raw data outputs 1-18 [Dataset]. http://doi.org/10.26180/21259491.v1

Raw data outputs 1-18

Explore at:

xlsxAvailable download formats

Unique identifier

https://doi.org/10.26180/21259491.v1

Dataset updated

May 30, 2023

Dataset provided by

Monash University

Authors

Abbas Salavaty Hosein Abadi; Sara Alaei; Mirana Ramialison; Peter Currie

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Raw data outputs 1-18 Raw data output 1. Differentially expressed genes in AML CSCs compared with GTCs as well as in TCGA AML cancer samples compared with normal ones. This data was generated based on the results of AML microarray and TCGA data analysis. Raw data output 2. Commonly and uniquely differentially expressed genes in AML CSC/GTC microarray and TCGA bulk RNA-seq datasets. This data was generated based on the results of AML microarray and TCGA data analysis. Raw data output 3. Common differentially expressed genes between training and test set samples the microarray dataset. This data was generated based on the results of AML microarray data analysis. Raw data output 4. Detailed information on the samples of the breast cancer microarray dataset (GSE52327) used in this study. Raw data output 5. Differentially expressed genes in breast CSCs compared with GTCs as well as in TCGA BRCA cancer samples compared with normal ones. Raw data output 6. Commonly and uniquely differentially expressed genes in breast cancer CSC/GTC microarray and TCGA BRCA bulk RNA-seq datasets. This data was generated based on the results of breast cancer microarray and TCGA BRCA data analysis. CSC, and GTC are abbreviations of cancer stem cell, and general tumor cell, respectively. Raw data output 7. Differential and common co-expression and protein-protein interaction of genes between CSC and GTC samples. This data was generated based on the results of AML microarray and STRING database-based protein-protein interaction data analysis. CSC, and GTC are abbreviations of cancer stem cell, and general tumor cell, respectively. Raw data output 8. Differentially expressed genes between AML dormant and active CSCs. This data was generated based on the results of AML scRNA-seq data analysis. Raw data output 9. Uniquely expressed genes in dormant or active AML CSCs. This data was generated based on the results of AML scRNA-seq data analysis. Raw data output 10. Intersections between the targeting transcription factors of AML key CSC genes and differentially expressed genes between AML CSCs vs GTCs and between dormant and active AML CSCs or the uniquely expressed genes in either class of CSCs. Raw data output 11. Targeting desirableness score of AML key CSC genes and their targeting transcription factors. These scores were generated based on an in-house scoring function described in the Methods section. Raw data output 12. CSC-specific targeting desirableness score of AML key CSC genes and their targeting transcription factors. These scores were generated based on an in-house scoring function described in the Methods section. Raw data output 13. The protein-protein interactions between AML key CSC genes with themselves and their targeting transcription factors. This data was generated based on the results of AML microarray and STRING database-based protein-protein interaction data analysis. Raw data output 14. The previously confirmed associations of genes having the highest targeting desirableness and CSC-specific targeting desirableness scores with AML or other cancers’ (stem) cells as well as hematopoietic stem cells. These data were generated based on a PubMed database-based literature mining. Raw data output 15. Drug score of available drugs and bioactive small molecules targeting AML key CSC genes and/or their targeting transcription factors. These scores were generated based on an in-house scoring function described in the Methods section. Raw data output 16. CSC-specific drug score of available drugs and bioactive small molecules targeting AML key CSC genes and/or their targeting transcription factors. These scores were generated based on an in-house scoring function described in the Methods section. Raw data output 17. Candidate drugs for experimental validation. These drugs were selected based on their respective (CSC-specific) drug scores. CSC is the abbreviation of cancer stem cell. Raw data output 18. Detailed information on the samples of the AML microarray dataset GSE30375 used in this study.

Clear search

Close search

Google apps

Main menu

Raw data outputs 1-18

raw data+statistical analysis.xlsx

Scooter Sales - Excel Project

UC_vs_US Statistic Analysis.xlsx

Quantitative raw data for "Large scale regional citizen surveys report"...

Raw data used for statistical analysis.

Cafe Sales - Dirty Data for Cleaning Training

Dirty Cafe Sales Dataset

Overview

File Information

Columns Description

Data Characteristics

Menu Items

Use Cases

Cleaning Steps Suggestions

License

Feedback

Raw data and R code used in meta-analysis

Raw data from datasets used in SIMON analysis

Data from: Raw Data Files

CSV file used in statistical analyses

Data Cleaning Sample

Data analysis project

Raw data for data analysis and graph construction at pre-treatment...

Retail data analysis project (excel)

Data to follow the statistical analysis including raw data as CSV files.

Raw data analysis code; Regression analysis code from Age and sex influence...

Raw data and statistical analysis pertaining to use of alteplase in COVID 19...

Raw data of the specimens (1)

Raw data in SPSS Software

Raw data outputs 1-18