CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
A dataset of 688,112 statistical test results reported according to the standards prescribed by the American Psychological Association (APA), mined from 50,845 articles out of 167,318 published by the APA, Springer, Sage, and Taylor & Francis. Mining from Wiley and Elsevier was actively blocked. Metadata for each article are included. All journals included, scripts, etc. are available at https://github.com/chartgerink/2016statcheck_data and preserved at http://dx.doi.org/10.5281/zenodo.59818
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
*** Please note: this dataset has been replaced by a new version: see relations for the update***A dataset of 686,220 statistical test results reported according to the standards prescribed by the American Psychological Association (APA), mined from 50,740 articles out of 276,669 published by the APA, Springer, Sage, and Taylor & Francis. Mining from Wiley and Elsevier was actively blocked. Metadata for each article are included. Metadata for each article are included. All scripts, etc. are available at github.com/chartgerink/2016statcheck_data
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Correct classification percentage for the 3-Classifier in four attempts during the cross-validation phase: using the marginal distribution for inter-tweet delay (ITD), using the marginal distribution for tweet time (TT), using the joint distribution of both properties as independent variables (JI), and using the joint distribution of both properties as non-independent variables (JNI).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Average SD, minimum and maximum number of days that accounts were active (posting tweets that were collected by our crawler) in each class.
https://www.rioxx.net/licenses/all-rights-reserved/https://www.rioxx.net/licenses/all-rights-reserved/
Triple random ensemble method for multi-label classification
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset contains conversations between users and experienced psychologists related to mental health topics. Carefully collected and anonymized, the data can be used to further the development of Natural Language Processing (NLP) models which focus on providing mental health advice and guidance. It consists of a variety of questions which will help train NLP models to provide users with appropriate advice in response to their queries. Whether you're an AI developer interested in building the next wave of mental health applications or a therapist looking for insights into how technology is helping people connect; this dataset provides invaluable support for advancing our understanding of human relationships through Artificial Intelligence
More Datasets For more datasets, click here.
Featured Notebooks 🚨 Your notebook can be here! 🚨! How to use the dataset This guide will provide you with the necessary knowledge to effectively use this dataset for Natural Language Processing (NLP)-based applications.
Download and install the dataset: To begin using the dataset, download it from Kaggle onto your system. Once downloaded, unzip and extract the .csv file into a directory of your choice.
Familiarize yourself with the columns: Before working with the data, it’s important to familiarize yourself with all of its components. This dataset contains two columns - Context and Response - which are intentionally structured to produce conversations between users and psychologists related to mental health topics for NLP models dedicated to providing mental health advice and guidance.
Analyze data entries: If possible or desired, take time now to analyze what is included in each entry; this may help you better untangle any challenges that come up during subsequent processes yet won't be required for most steps going forward if you prefer not too jump ahead of yourself at this juncture of your work process just yet! Examine questions asked by users as well as answers provided by experts in order glean an overall picture of what types of conversations are taking place within this pool of data that can help guide further work on NLP models for AI-driven mental health guidance purposes later on down the road!
Cleanse any information not applicable to NLP decisioning relevant application goals: It's important that only meaningful items related towards achieving AI-driven results remain within a clean copy of this Dataset going forward; consider removing all extra many verbatim entries or other pieces uneeded while also otherwise making sure all included content adheres closely enough one particular decisions purpose expected from an end goal perspective before proceeding onwards now until an ultimate end result has been successfully achieved eventually afterwards later on next afterward soon afterwards too following conveniently satisfyingly after accordingly shortly near therefore meaningfully likewise conclusively thoroughly properly productively purposely then eventually effectively finally indeed desirably plus concludingly enjoyably popularly splendidly attractively satisfactorally propitiously outstandingly fluently promisingly opportunely in conclusion efficiently hopefully progressively breathtaking deliciousness ideally genius mayhem invented unique impossibility everlastingly intense qualitative cohesiveness behaviorally affectionately fixed voraciously like alive supportively choicest decisively luckily chaotically co-creatively introducing ageless intricacy voicing auspicious promise enterprisingly preferred mathematically godly happening humorous respective achieve ultra favorability fundamentals essentials speciality grandiose selectively perfectly
Research Ideas Creating sentence-matching algorithms for natural language processing to accurately match given questions with appropriate advice and guidance. Analyzing the psychological conversations to gain insights into topics such as stress, anxiety, and depression. Developing personalized natural language processing models tailored to provide users with appropriate advice based on their queries and based on their individual state of mental health Acknowledgements If you use this dataset in your research, please credit the original authors. Data Source
License License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
Columns File: train.csv
Column name Description Context The conversation between the user and the psychologist. (Text) Response The response from the psychologist to the user. (Text) Acknowledgements If you use this dataset in your research, please credit the original authors. If you use this dataset in your research, please credit Huggingface Hub.
Original Data Source: NLP Mental Health Conversations
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Correct classification percentage for the 2-Classifier in four attempts during the cross-validation phase: using the marginal distribution for inter-tweet delay (ITD), using the marginal distribution for tweet time (TT), using the joint distribution of both properties as independent variables (JI), and using the joint distribution of both properties as non-independent variables (JNI).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
To test for independence between the tweet time and inter-tweet delay variables, we performed Pearson's and Kendall's correlation tests using all samples in each account class. All tests resulted in very low values, proving that the two variables are indeed independent.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
ACO parameters setting.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Average SD coefficient of determination () obtained for each class by the two probabilistic prediction models during cross-validation. We compare the performance of our models to the results of a null model, which was created with random samples generated from a uniform distribution over range 1 to 1,000,000.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
MCI converter/non-converter classification comparison with different datasets in terms of accuracy, sensitivity and specificity. Methods applied here include the combinations of wHLFS and different classification methods. The different feature datasets are META (E), MRI (M), and META without baseline cognitive scores (META-22). Parameters are selected by five-fold cross validation on the training dataset. The number in the parenthesis indicates the number of features in the specific dataset. The bolded and underlined entry denotes the best performance for that particular method. The standard deviations are shown in the parentheses along with the accuracy.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dataset includes one master spreadsheet containing literatures searched, catalogued and summarised in the fields of Cultural Studies, Education, History, Media and Communication, Philosophy, Political Science, Psychology and Sociology, which contains 770 selected texts.
The aims of the data collected are to produce an integrated theory that builds on the findings of different disciplines (Cultural Studies, Education, History, Media and Communication, Philosophy, Political Science, Psychology and Sociology) focused on the understanding of factors and processes (from the macro social level to the social and psychological level), within the different life contexts, that promote or hinder youth active citizenship in EU.
It is possible that similar databases of literature around Europe, Young People and Active Citizenship across the fields of Cultural Studies, Education, History, Media and Communication, Philosophy, Political Science, Psychology and Sociology exist in other forms, perhaps collected for studies on one or more of the included disciplines, but we do not currently have access to a similar repository.
With that said, it is highly unlikely that an exact dataset corresponding to the specifics of this study exist in any form elsewhere, thus justifying the creation of new data for this study in the absence of suitable existing data. Data collected here will bridge the gap between global aggregated literatures on youth and citizenship separated by discipline on the one hand, and a new dataset offering an integrated literature analysis of different fields of study.
The data sources are available in bibliographic format and attached via csv document.
The dataset relies on the following information taken from the data sources: specific identifying information about the text itself (title/author/year/publisher); and abstract or summarizing information either taken directly from the text or summarized by the researcher.
Finally, the aggregated literature review spreadsheet constitutes raw data which can be reused by researchers who want to compare our data with similar data collected in different countries, or to perform textual analysis (content analysis and/or data mining) on our data.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains 35 of 39 taxonomies that were the result of a systematic review. The systematic review was conducted with the goal of identifying taxonomies suitable for semantically annotating research data. A special focus was set on research data from the hybrid societies domain.
The following taxonomies were identified as part of the systematic review:
Filename
Taxonomy Title
acm_ccs
ACM Computing Classification System [1]
amec
A Taxonomy of Evaluation Towards Standards [2]
bibo
A BIBO Ontology Extension for Evaluation of Scientific Research Results [3]
cdt
Cross-Device Taxonomy [4]
cso
Computer Science Ontology [5]
ddbm
What Makes a Data-driven Business Model? A Consolidated Taxonomy [6]
ddi_am
DDI Aggregation Method [7]
ddi_moc
DDI Mode of Collection [8]
n/a
DemoVoc [9]
discretization
Building a New Taxonomy for Data Discretization Techniques [10]
dp
Demopaedia [11]
dsg
Data Science Glossary [12]
ease
A Taxonomy of Evaluation Approaches in Software Engineering [13]
eco
Evidence & Conclusion Ontology [14]
edam
EDAM: The Bioscientific Data Analysis Ontology [15]
n/a
European Language Social Science Thesaurus [16]
et
Evaluation Thesaurus [17]
glos_hci
The Glossary of Human Computer Interaction [18]
n/a
Humanities and Social Science Electronic Thesaurus [19]
hcio
A Core Ontology on the Human-Computer Interaction Phenomenon [20]
hft
Human-Factors Taxonomy [21]
hri
A Taxonomy to Structure and Analyze Human–Robot Interaction [22]
iim
A Taxonomy of Interaction for Instructional Multimedia [23]
interrogation
A Taxonomy of Interrogation Methods [24]
iot
Design Vocabulary for Human–IoT Systems Communication [25]
kinect
Understanding Movement and Interaction: An Ontology for Kinect-Based 3D Depth Sensors [26]
maco
Thesaurus Mass Communication [27]
n/a
Thesaurus Cognitive Psychology of Human Memory [28]
mixed_initiative
Mixed-Initiative Human-Robot Interaction: Definition, Taxonomy, and Survey [29]
qos_qoe
A Taxonomy of Quality of Service and Quality of Experience of Multimodal Human-Machine Interaction [30]
ro
The Research Object Ontology [31]
senses_sensors
A Human-Centered Taxonomy of Interaction Modalities and Devices [32]
sipat
A Taxonomy of Spatial Interaction Patterns and Techniques [33]
social_errors
A Taxonomy of Social Errors in Human-Robot Interaction [34]
sosa
Semantic Sensor Network Ontology [35]
swo
The Software Ontology [36]
tadirah
Taxonomy of Digital Research Activities in the Humanities [37]
vrs
Virtual Reality and the CAVE: Taxonomy, Interaction Challenges and Research Directions [38]
xdi
Cross-Device Interaction [39]
We converted the taxonomies into SKOS (Simple Knowledge Organisation System) representation. The following 4 taxonomies were not converted as they were already available in SKOS and were for this reason excluded from this dataset:
1) DemoVoc, cf. http://thesaurus.web.ined.fr/navigateur/ available at https://thesaurus.web.ined.fr/exports/demovoc/demovoc.rdf
2) European Language Social Science Thesaurus, cf. https://thesauri.cessda.eu/elsst/en/ available at https://zenodo.org/record/5506929
3) Humanities and Social Science Electronic Thesaurus, cf. https://hasset.ukdataservice.ac.uk/hasset/en/ available at https://zenodo.org/record/7568355
4) Thesaurus Cognitive Psychology of Human Memory, cf. https://www.loterre.fr/presentation/ available at https://skosmos.loterre.fr/P66/en/
References
[1] “The 2012 ACM Computing Classification System,” ACM Digital Library, 2012. https://dl.acm.org/ccs (accessed May 08, 2023).
[2] AMEC, “A Taxonomy of Evaluation Towards Standards.” Aug. 31, 2016. Accessed: May 08, 2023. [Online]. Available: https://amecorg.com/amecframework/home/supporting-material/taxonomy/
[3] B. Dimić Surla, M. Segedinac, and D. Ivanović, “A BIBO ontology extension for evaluation of scientific research results,” in Proceedings of the Fifth Balkan Conference in Informatics, in BCI ’12. New York, NY, USA: Association for Computing Machinery, Sep. 2012, pp. 275–278. doi: 10.1145/2371316.2371376.
[4] F. Brudy et al., “Cross-Device Taxonomy: Survey, Opportunities and Challenges of Interactions Spanning Across Multiple Devices,” in Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, in CHI ’19. New York, NY, USA: Association for Computing Machinery, Mai 2019, pp. 1–28. doi: 10.1145/3290605.3300792.
[5] A. A. Salatino, T. Thanapalasingam, A. Mannocci, F. Osborne, and E. Motta, “The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas,” in Lecture Notes in Computer Science 1137, D. Vrandečić, K. Bontcheva, M. C. Suárez-Figueroa, V. Presutti, I. Celino, M. Sabou, L.-A. Kaffee, and E. Simperl, Eds., Monterey, California, USA: Springer, Oct. 2018, pp. 187–205. Accessed: May 08, 2023. [Online]. Available: http://oro.open.ac.uk/55484/
[6] M. Dehnert, A. Gleiss, and F. Reiss, “What makes a data-driven business model? A consolidated taxonomy,” presented at the European Conference on Information Systems, 2021.
[7] DDI Alliance, “DDI Controlled Vocabulary for Aggregation Method,” 2014. https://ddialliance.org/Specification/DDI-CV/AggregationMethod_1.0.html (accessed May 08, 2023).
[8] DDI Alliance, “DDI Controlled Vocabulary for Mode Of Collection,” 2015. https://ddialliance.org/Specification/DDI-CV/ModeOfCollection_2.0.html (accessed May 08, 2023).
[9] INED - French Institute for Demographic Studies, “Thésaurus DemoVoc,” Feb. 26, 2020. https://thesaurus.web.ined.fr/navigateur/en/about (accessed May 08, 2023).
[10] A. A. Bakar, Z. A. Othman, and N. L. M. Shuib, “Building a new taxonomy for data discretization techniques,” in 2009 2nd Conference on Data Mining and Optimization, Oct. 2009, pp. 132–140. doi: 10.1109/DMO.2009.5341896.
[11] N. Brouard and C. Giudici, “Unified second edition of the Multilingual Demographic Dictionary (Demopaedia.org project),” presented at the 2017 International Population Conference, IUSSP, Oct. 2017. Accessed: May 08, 2023. [Online]. Available: https://iussp.confex.com/iussp/ipc2017/meetingapp.cgi/Paper/5713
[12] DuCharme, Bob, “Data Science Glossary.” https://www.datascienceglossary.org/ (accessed May 08, 2023).
[13] A. Chatzigeorgiou, T. Chaikalis, G. Paschalidou, N. Vesyropoulos, C. K. Georgiadis, and E. Stiakakis, “A Taxonomy of Evaluation Approaches in Software Engineering,” in Proceedings of the 7th Balkan Conference on Informatics Conference, in BCI ’15. New York, NY, USA: Association for Computing Machinery, Sep. 2015, pp. 1–8. doi: 10.1145/2801081.2801084.
[14] M. C. Chibucos, D. A. Siegele, J. C. Hu, and M. Giglio, “The Evidence and Conclusion Ontology (ECO): Supporting GO Annotations,” in The Gene Ontology Handbook, C. Dessimoz and N. Škunca, Eds., in Methods in Molecular Biology. New York, NY: Springer, 2017, pp. 245–259. doi: 10.1007/978-1-4939-3743-1_18.
[15] M. Black et al., “EDAM: the bioscientific data analysis ontology,” F1000Research, vol. 11, Jan. 2021, doi: 10.7490/f1000research.1118900.1.
[16] Council of European Social Science Data Archives (CESSDA), “European Language Social Science Thesaurus ELSST,” 2021. https://thesauri.cessda.eu/en/ (accessed May 08, 2023).
[17] M. Scriven, Evaluation Thesaurus, 3rd Edition. Edgepress, 1981. Accessed: May 08, 2023. [Online]. Available: https://us.sagepub.com/en-us/nam/evaluation-thesaurus/book3562
[18] Papantoniou, Bill et al., The Glossary of Human Computer Interaction. Interaction Design Foundation. Accessed: May 08, 2023. [Online]. Available: https://www.interaction-design.org/literature/book/the-glossary-of-human-computer-interaction
[19] “UK Data Service Vocabularies: HASSET Thesaurus.” https://hasset.ukdataservice.ac.uk/hasset/en/ (accessed May 08, 2023).
[20] S. D. Costa, M. P. Barcellos, R. de A. Falbo, T. Conte, and K. M. de Oliveira, “A core ontology on the Human–Computer Interaction phenomenon,” Data Knowl. Eng., vol. 138, p. 101977, Mar. 2022, doi: 10.1016/j.datak.2021.101977.
[21] V. J. Gawron et al., “Human Factors Taxonomy,” Proc. Hum. Factors Soc. Annu. Meet., vol. 35, no. 18, pp. 1284–1287, Sep. 1991, doi: 10.1177/154193129103501807.
[22] L. Onnasch and E. Roesler, “A Taxonomy to Structure and Analyze Human–Robot Interaction,” Int. J. Soc. Robot., vol. 13, no. 4, pp. 833–849, Jul. 2021, doi: 10.1007/s12369-020-00666-5.
[23] R. A. Schwier, “A Taxonomy of Interaction for Instructional Multimedia.” Sep. 28, 1992. Accessed: May 09, 2023. [Online]. Available: https://eric.ed.gov/?id=ED352044
[24] C. Kelly, J. Miller, A. Redlich, and S. Kleinman, “A Taxonomy of Interrogation Methods,”
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The symptom attributes used to predict HD in the experiment.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
There are 13 different types of ADAS Sub-Scores and Total Scores and 11 different types of Neuropsychological Battery features. A detailed explanation of each cognitive score and lab test can be found at www.public.asu.edu/~jye02/AD-Progression/.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Experimental results for HD prediction problem. Accuracy percentage values of ACO and Benchmark approaches in the context of Hungarian population, ( is the classifier compared to ).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Experimental results for HD prediction problem. Accuracy percentage values of ACO and Benchmark approaches in the context of Cleveland population, ( is the classifier compared to ).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Experimental results for HD prediction problem. Accuracy percentage values of ACO and benchmark approaches in the context of Long-Beach population, ( is the classifier compared to ).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The CTG attributes used to predict potential fetal pathologies.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This zip file represents the data to support our paper in Nature communication titled "Neuroimaging evidence for a network sampling theory of human intelligence"
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
A dataset of 688,112 statistical test results reported according to the standards prescribed by the American Psychological Association (APA), mined from 50,845 articles out of 167,318 published by the APA, Springer, Sage, and Taylor & Francis. Mining from Wiley and Elsevier was actively blocked. Metadata for each article are included. All journals included, scripts, etc. are available at https://github.com/chartgerink/2016statcheck_data and preserved at http://dx.doi.org/10.5281/zenodo.59818