Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Training, test data and model parameters. The last 3 columns show the MinORG, LT and HT parameters used to create the pathogenicity families and build the model for each of the 10 models. Zthr is a threshold value, calculated for each model at the cross validation phase, which is used, given the final prediction score, to decide if the input organisms will be predicted as pathogenic or non-pathogenic. The parameters for each model are chosen after 5-fold cross-validation tests.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
2063 F and t tests that were automatically extracted from articles published in Journal of Vision. For more information, see http://www.cogsci.nl/.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The data support the thesis for the additional data.
The zip file enclosed contains README_Gamble_DevTox GLR_Sup Data_v1.docx, Gamble_DevTox GLR_Sup Fig_Submission_v1.docx, Gamble_DevTox GLR_Tables_Submission_v1.xlsx, Gamble_DevTox GLR_Sup Tables_Submission_v1.xlsx, ToxCast Pipeline plots for AEID 3093-3098. This dataset is associated with the following publication: Gamble, J., K. Hopperstad, and C. Deisenroth. The DevTox Germ Layer Reporter Platform: An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics. MDPI, Basel, SWITZERLAND, 10(7): 392, (2022).
This dataset includes the results of a simulation study using the source inversion techniques available in the Water Security Toolkit. The data was created to test the different techniques for accuracy, specificity, false positive rate, and false negative rate. The tests examined different parameters including measurement error, modeling error, injection characteristics, time horizon, network size, and sensor placement. The water distribution system network models that were used in the study are also included in the dataset. This dataset is associated with the following publication: Seth, A., K. Klise, J. Siirola, T. Haxton , and C. Laird. Testing Contamination Source Identification Methods for Water Distribution Networks. Journal of Environmental Division, Proceedings of American Society of Civil Engineers. American Society of Civil Engineers (ASCE), Reston, VA, USA, ., (2016).
https://www.gnu.org/licenses/old-licenses/gpl-2.0-standalone.htmlhttps://www.gnu.org/licenses/old-licenses/gpl-2.0-standalone.html
A BAM file containing five MNVs (Multiple Nucleotide Variants) for the purposes of testing bioinformatics pipelines.
Dataset supports: J. Hollebon and F. M. Fazi, “Binaural Rendering Using Higher-Order Stereophony," in Raw data and statistical analysis of listening test for ‘Binaural Rendering Using Higher-Order Stereophony’. The data is presented in excel file: BinauralReneringUsingHigherOrderStereophony_Data.xlsx This work was supported by the Engineering and Physical Sciences Research Council (EPSRC) through the University of Southampton’s Doctoral Training Partnership under Grant 2106106.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Results of t-test and descriptive statistics.
Training and test-related data to accompany "Keyphrase Extraction for Technical Language Processing" by Alden Dima and Aaron Massey (in press). The subdirectories "keyphrase-extraction-jct-train" and "keyphrase-extraction-jct-test" contain a total of 1153 ThermoML files which are each associated with a corresponding Journal of Chemical Thermodynamics (JCT) article. These ThermoML files contain information about these papers in extensible markup language (XML) format including the title, authors, abstract, digital object identifier (DOI) and keywords. They also contain thermophysical property data unrelated to the keyphrase extraction study. These files were obtained from the National Institute of Standard and Technology (NIST) Thermodynamics Research Center (TRC) in Boulder, Colorado (https://trc.nist.gov/). Readers wishing to replicate this work will also need to obtain the original JCT articles which can be obtained from https://www.sciencedirect.com/journal/the-journal-of-chemical-thermodynamics.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Replication data stored in the Journal Data Archive for "Erratum: The likelihood ratio test under nonstandard conditions: Testing the Markov switching model of GNP (replication data)".
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
• If you publish material based on this dataset, please cite the following :
• The Zenodo repository : Laila El-Hamamsy, Barbara Bruno, Jessica Dehler Zufferey, & Francesco Mondada (2023). Extended dataset for the validation of the competent Computational Thinking test in grades 3-6 [Data set]. Zenodo. https://doi.org/10.5281/zenodo.7983525
• The article on the validation of the computational thinking test for grades 3-6 : El-Hamamsy, L., Zapata-Cáceres, M., Martín-Barroso, E., Mondada, F., Dehler Zufferey, J., Bruno, B., and Román-González, M. (2023). The competent computational thinking test (cCTt): a valid, reliable and gender-fair test for longitudinal CT studies in grades 3-6. El-Hamamsy, L., Zapata-Cáceres, M., Martín-Barroso, E., Mondada, F., Zufferey, J. D., Bruno, B., & Román-González, M. arXiv. https://doi.org/10.48550/arXiv.2305.19526
• License : This work is licensed under a Creative Commons Attribution 4.0 International license (CC-BY-4.0)
• Creators : El-Hamamsy, L., Bruno, B., Dehler Zufferey, J., and Mondada, F.
• Date May 30th 2023
• Subject : Computational Thinking (CT), Assessment, Primary education, Psychometric validation
• Dataset format : CSV. The dataset contains four files (one per grade, see detailed description below). Please note that the spreadsheets may contain missing values due to students not being present for a part of the data collection. To have access to the specific cCTt questions please refer to the original publication [1] and Zenodo repository [2] which provide the full set of questions and correct responses.
• Dataset size < 500 kB
• Data collection period : January and November 2021
• Abbreviations : - CT : Computational Thinking - cCTt: competent CT test
• Funding : This work was funded by the the NCCR Robotics, a National Centre of Competence in Research, funded by the Swiss National Science Foundation (grant number 51NF40_185543)
[1] El-Hamamsy, L., Zapata-Cáceres, M., Barroso, E. M., Mondada, F., Zufferey, J. D., & Bruno, B. (2022). The Competent Computational Thinking Test: Development and Validation of an Unplugged Computational Thinking Test for Upper Primary School. Journal of Educational Computing Research, 60(7), 1818–1866. https://doi.org/10.1177/07356331221081753
[2] El-Hamamsy, L., Zapata-Cáceres, M., Marcelino, P., Dehler Zufferey, J., Bruno, B., Martín Barroso, E., & Román-González, M. (2022). Dataset for the comparison of two Computational Thinking (CT) test for upper primary school (grades 3-4) : the Beginners' CT test (BCTt) and the competent CT test (cCTt) (Version 1) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.5885034
[3] El-Hamamsy, L., Zapata-Cáceres, M., Martín-Barroso, E., Mondada, F., Dehler Zufferey, J., Bruno, B., and Román-González, M. (2023). The competent computational thinking test (cCTt): a valid, reliable and gender-fair test for longitudinal CT studies in grades 3-6. El-Hamamsy, L., Zapata-Cáceres, M., Martín-Barroso, E., Mondada, F., Zufferey, J. D., Bruno, B., & Román-González, M. arXiv. https://doi.org/10.48550/arXiv.2305.19526
[4] Brennan, K. and Resnick, M. (2012). New frameworks for studying and assessing the development of computational thinking. page 25
[5] El-Hamamsy, L., Zapata-Cáceres, M., Marcelino, P., Bruno, B., Dehler Zufferey, J., Martín-Barroso, E., & Román-González, M. (2022). Comparing the psychometric properties of two primary school Computational Thinking (CT) assessments for grades 3 and 4: The Beginners’ CT test (BCTt) and the competent CT test (cCTt). Frontiers in Psychology, 13. https://www.frontiersin.org/articles/10.3389/fpsyg.2022.1082659
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This repository provides the data artefacts for the experiments conducted using our tool TCTracer for the journal paper "TCTracer: Establishing Test-to-Code Traceability links Using Dynamic and Static Techniques" as submitted to the Empirical Software Engineering journal in 2021.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Here are the training and testing data sets involved in the numerical experiments in the article that has been submitted to the journal “Journal of Geophysical Research: Solid Earth”, named “Joint Model and Data-Driven Simultaneous Inversion of Velocity and Density”: Marmousi model. Each dataset consists of two parts: a training dataset and a testing dataset. Both training and testing data sets contain three parts: seismic data, velocity model and density model.
VideoEspresso
[Paper] Our code and dataset will be released soon.
News:
[2024/12/16] 🔥 The test set has been released!
Citation:
@article{han2024videoespresso, title={VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection}, author={Han, Songhao and Huang, Wei and Shi, Hairong and Zhuo, Le and Su, Xiu and Zhang, Shifeng and Zhou, Xu and Qi, Xiaojuan and Liao, Yue and Liu, Si}, journal={arXiv… See the full description on the dataset page: https://huggingface.co/datasets/hshjerry0315/VideoEspresso-Test.
Data (2 excel files) consist of the analytical test results on water sample collected from the two adsorption media tanks of the arsenic removal system during the regeneration processes conducted multiply times over a five-year period. Data set also includes the companion bed volumes of water treated by the tank of media at the time the water samples were collected. This dataset is associated with the following publications: Sorg, T., A. Chen, L. Wang, and R. Kolich. Regeneration of a Full-Scale Arsenic Removal Adsorptive Media System,Part 1: The Regeneration Process. JOURNAL OF AMERICAN WATER WORKS ASSOCIATION. American Water Resources Association, Middleburg, VA, USA, 109(5): 13-24, (2017). Sorg, T., R. Kolich, A.S.C. Chen, and L. Wang. Regeneration of a Full-Scale Arsenic Removal Adsorptive Media System,Part 2: The Performance and Cost. JOURNAL OF THE AMERICAN WATER WORKS ASSOCIATION. American Water Works Association, Denver, CO, USA, 109(5): E122-E128, (2017).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Mann-Whitney U test for different document types.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Here are the training and testing data sets involved in the numerical experiments in the article that has been submitted to the journal “Journal of Geophysical Research: Solid Earth”, named “Joint Model and Data-Driven Simultaneous Inversion of Velocity and Density”: SigsbeeA model. Each dataset consists of two parts: a training dataset and a testing dataset. Both training and testing data sets contain three parts: seismic data, velocity model and density model.
This data release includes the detailed results from laboratory testing of colluvium and landslide deposit specimens collected from coastal bluffs near Mukilteo, Washington. The specimens were collected as part of a larger effort to characterize the potential for shallow landslide initiation along the Puget Sound Railway corridor between the cities of Everett and Seattle. The details of the specimen collection and research objectives of the study are provided in: “Assessing Landslide Potential on Coastal Bluffs near Mukilteo, Washington—Geologic Site Characterization for Hydrologic Monitoring” (doi:10.3133/ofr20161082). Laboratory experiments includes test to estimate the following properties: specific gravity, porosity, bulk and grain densities, grain-size distributions, in situ volumetric water content, liquid and plastic limits, saturated hydraulic conductivity, water-retention and unsaturated hydraulic conductivity relations, and soil strength properties. The testing of the specimens was performed by Cooper Testing Labs Inc. in Palo Alto, California, and by York Lewis at Colorado School of Mines in Golden, Colorado. The following citations relate to this data release. Mirus, B.B., Smith, J.B., Stark, Benjamin, Lewis, York, Michel, Abigail, and Baum, R.L., 2016, Assessing landslide potential on coastal bluffs near Mukilteo, Washington—Geologic site characterization for hydrologic monitoring: U.S. Geological Survey Open-File Report 2016–1082, 28 p., http://dx.doi.org/10.3133/ofr20161082. van Genuchten, MTh., 1980. A closed-form equation for predicting the hydraulic conductivity of unsaturated soils: Soil Science Society America Journal, vol. 44, p. 892-898. Wayllace, A., and Lu, N., 2012, A transient water release and imbibitions method for rapidly measuring wetting and drying soil water retention and hydraulic conductivity functions: Geotechnical Testing Journal, vol. 35, doi: 10.1520/GTJ103596.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This paper replicates Cheung and Lai (Journal of Business and Economic Studies 1995; 13(3): 277-280), who use response surface analysis to obtain approximate finite-sample critical values adjusted for lag order and sample size for the augmented Dickey-Fuller test. We obtain results that are quite close to their results. We provide the Ox source code. We also provide a Windows application with a graphical user interface, which makes obtaining custom critical values quite simple.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
In this paper, we propose a unified approach to generating standardized-residuals-based correlation tests for checking GARCH-type models. This approach is valid in the presence of estimation uncertainty, is robust to various standardized error distributions, and is applicable to testing various types of misspecifications. By using this approach, we also propose a class of power-transformed-series (PTS) correlation tests that provides certain robustifications and power extensions to the Box-Pierce, McLeod-Li, Li-Mak, and Berkes-Horváth-Kokoszka tests in diagnosing GARCH-type models. Our simulation and empirical example show that the PTS correlation tests outperform these existing autocorrelation tests in financial time series analysis.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Training, test data and model parameters. The last 3 columns show the MinORG, LT and HT parameters used to create the pathogenicity families and build the model for each of the 10 models. Zthr is a threshold value, calculated for each model at the cross validation phase, which is used, given the final prediction score, to decide if the input organisms will be predicted as pathogenic or non-pathogenic. The parameters for each model are chosen after 5-fold cross-validation tests.