3 datasets found

f
DataSheet_1_CL-PMI: A Precursor MicroRNA Identification Method Based on...
frontiersin.figshare.com
pdf
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Huiqing Wang; Yue Ma; Chunlin Dong; Chun Li; Jingjing Wang; Dan Liu (2023). DataSheet_1_CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks.pdf [Dataset]. http://doi.org/10.3389/fgene.2019.00967.s001
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.3389/fgene.2019.00967.s001
Dataset updated
May 31, 2023
Dataset provided by
Frontiers
Authors
Huiqing Wang; Yue Ma; Chunlin Dong; Chun Li; Jingjing Wang; Dan Liu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
MicroRNAs (miRNAs) are the major class of gene-regulating molecules that bind mRNAs. They function mainly as translational repressors in mammals. Therefore, how to identify miRNAs is one of the most important problems in medical treatment. Many known pre-miRNAs have a hairpin ring structure containing more structural features, and it is difficult to identify mature miRNAs because of their short length. Therefore, most research focuses on the identification of pre-miRNAs. Most computational models rely on manual feature extraction to identify pre-miRNAs and do not consider the sequential and spatial characteristics of pre-miRNAs, resulting in a loss of information. As the number of unidentified pre-miRNAs is far greater than that of known pre-miRNAs, there is a dataset imbalance problem, which leads to a degradation of the performance of pre-miRNA identification methods. In order to overcome the limitations of existing methods, we propose a pre-miRNA identification algorithm based on a cascaded CNN-LSTM framework, called CL-PMI. We used a convolutional neural network to automatically extract features and obtain pre-miRNA spatial information. We also employed long short-term memory (LSTM) to capture time characteristics of pre-miRNAs and improve attention mechanisms for long-term dependence modeling. Focal loss was used to improve the dataset imbalance. Compared with existing methods, CL-PMI achieved better performance on all datasets. The results demonstrate that this method can effectively identify pre-miRNAs by simultaneously considering their spatial and sequential information, as well as dealing with imbalance in the datasets.
f
Distribution of labels of included datasets.
plos.figshare.com
xls
Updated Jun 2, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Youho Myong; Dan Yoon; Byeong Soo Kim; Young Gyun Kim; Yongsik Sim; Suji Lee; Jiyoung Yoon; Minwoo Cho; Sungwan Kim (2023). Distribution of labels of included datasets. [Dataset]. http://doi.org/10.1371/journal.pone.0279349.t002
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0279349.t002
Dataset updated
Jun 2, 2023
Dataset provided by
PLOS ONE
Authors
Youho Myong; Dan Yoon; Byeong Soo Kim; Young Gyun Kim; Yongsik Sim; Suji Lee; Jiyoung Yoon; Minwoo Cho; Sungwan Kim
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
BackgroundAccurate interpretation of chest radiographs requires years of medical training, and many countries face a shortage of medical professionals to meet such requirements. Recent advancements in artificial intelligence (AI) have aided diagnoses; however, their performance is often limited due to data imbalance. The aim of this study was to augment imbalanced medical data using generative adversarial networks (GANs) and evaluate the clinical quality of the generated images via a multi-center visual Turing test.MethodsUsing six chest radiograph datasets, (MIMIC, CheXPert, CXR8, JSRT, VBD, and OpenI), starGAN v2 generated chest radiographs with specific pathologies. Five board-certified radiologists from three university hospitals, each with at least five years of clinical experience, evaluated the image quality through a visual Turing test. Further evaluations were performed to investigate whether GAN augmentation enhanced the convolutional neural network (CNN) classifier performances.ResultsIn terms of identifying GAN images as artificial, there was no significant difference in the sensitivity between radiologists and random guessing (result of radiologists: 147/275 (53.5%) vs result of random guessing: 137.5/275, (50%); p = .284). GAN augmentation enhanced CNN classifier performance by 11.7%.ConclusionRadiologists effectively classified chest pathologies with synthesized radiographs, suggesting that the images contained adequate clinical information. Furthermore, GAN augmentation enhanced CNN performance, providing a bypass to overcome data imbalance in medical AI training. CNN based methods rely on the amount and quality of training data; the present study showed that GAN augmentation could effectively augment training data for medical AI.
f
Different size convergence times.
plos.figshare.com
xls
Updated Jun 2, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
HuanQing Xu; Xian Shao; Shiji Hui; Li Jin (2023). Different size convergence times. [Dataset]. http://doi.org/10.1371/journal.pone.0282350.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0282350.t003
Dataset updated
Jun 2, 2023
Dataset provided by
PLOS ONE
Authors
HuanQing Xu; Xian Shao; Shiji Hui; Li Jin
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
ObjectivesBreast cancer is a major health problem with high mortality rates. Early detection of breast cancer will promote treatment. A technology that determines whether a tumor is benign desirable. This article introduces a new method in which deep learning is used to classify breast cancer.MethodsA new computer-aided detection (CAD) system is presented to classify benign and malignant masses in breast tumor cell samples. In the CAD system, (1) for the pathological data of unbalanced tumors, the training results are biased towards the side with the larger number of samples. This paper uses a Conditional Deep Convolution Generative Adversarial Network (CDCGAN) method to generate small samples by orientation data set to solve the imbalance problem of collected data. (2) For the high-dimensional data redundancy problem, this paper proposes an integrated dimension reduction convolutional neural network (IDRCNN) model, which solves the high-dimensional data dimension reduction problem of breast cancer and extracts effective features. The subsequent classifier found that by using the IDRCNN model proposed in this paper, the accuracy of the model was improved.ResultsExperimental results show that IDRCNN combined with the model of CDCGAN model has superior classification performance than existing methods, as revealed by sensitivity, area under the curve (AUC), ROC curve and accuracy, recall, sensitivity, specificity, precision,PPV,NPV and f-values analysis.ConclusionThis paper proposes a Conditional Deep Convolution Generative Adversarial Network (CDCGAN) which can solve the imbalance problem of manually collected data by directionally generating small sample data sets. And an integrated dimension reduction convolutional neural network (IDRCNN) model, which solves the high-dimensional data dimension reduction problem of breast cancer and extracts effective features.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Huiqing Wang; Yue Ma; Chunlin Dong; Chun Li; Jingjing Wang; Dan Liu (2023). DataSheet_1_CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks.pdf [Dataset]. http://doi.org/10.3389/fgene.2019.00967.s001

DataSheet_1_CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks.pdf

Explore at:

pdfAvailable download formats

Unique identifier

https://doi.org/10.3389/fgene.2019.00967.s001

Dataset updated

May 31, 2023

Dataset provided by

Frontiers

Authors

Huiqing Wang; Yue Ma; Chunlin Dong; Chun Li; Jingjing Wang; Dan Liu

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

MicroRNAs (miRNAs) are the major class of gene-regulating molecules that bind mRNAs. They function mainly as translational repressors in mammals. Therefore, how to identify miRNAs is one of the most important problems in medical treatment. Many known pre-miRNAs have a hairpin ring structure containing more structural features, and it is difficult to identify mature miRNAs because of their short length. Therefore, most research focuses on the identification of pre-miRNAs. Most computational models rely on manual feature extraction to identify pre-miRNAs and do not consider the sequential and spatial characteristics of pre-miRNAs, resulting in a loss of information. As the number of unidentified pre-miRNAs is far greater than that of known pre-miRNAs, there is a dataset imbalance problem, which leads to a degradation of the performance of pre-miRNA identification methods. In order to overcome the limitations of existing methods, we propose a pre-miRNA identification algorithm based on a cascaded CNN-LSTM framework, called CL-PMI. We used a convolutional neural network to automatically extract features and obtain pre-miRNA spatial information. We also employed long short-term memory (LSTM) to capture time characteristics of pre-miRNAs and improve attention mechanisms for long-term dependence modeling. Focal loss was used to improve the dataset imbalance. Compared with existing methods, CL-PMI achieved better performance on all datasets. The results demonstrate that this method can effectively identify pre-miRNAs by simultaneously considering their spatial and sequential information, as well as dealing with imbalance in the datasets.

Clear search

Close search

Google apps

Main menu

DataSheet_1_CL-PMI: A Precursor MicroRNA Identification Method Based on...

Distribution of labels of included datasets.

Different size convergence times.

DataSheet_1_CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks.pdf