3 datasets found
  1. f

    DataSheet_1_CL-PMI: A Precursor MicroRNA Identification Method Based on...

    • frontiersin.figshare.com
    pdf
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Huiqing Wang; Yue Ma; Chunlin Dong; Chun Li; Jingjing Wang; Dan Liu (2023). DataSheet_1_CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks.pdf [Dataset]. http://doi.org/10.3389/fgene.2019.00967.s001
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Frontiers
    Authors
    Huiqing Wang; Yue Ma; Chunlin Dong; Chun Li; Jingjing Wang; Dan Liu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    MicroRNAs (miRNAs) are the major class of gene-regulating molecules that bind mRNAs. They function mainly as translational repressors in mammals. Therefore, how to identify miRNAs is one of the most important problems in medical treatment. Many known pre-miRNAs have a hairpin ring structure containing more structural features, and it is difficult to identify mature miRNAs because of their short length. Therefore, most research focuses on the identification of pre-miRNAs. Most computational models rely on manual feature extraction to identify pre-miRNAs and do not consider the sequential and spatial characteristics of pre-miRNAs, resulting in a loss of information. As the number of unidentified pre-miRNAs is far greater than that of known pre-miRNAs, there is a dataset imbalance problem, which leads to a degradation of the performance of pre-miRNA identification methods. In order to overcome the limitations of existing methods, we propose a pre-miRNA identification algorithm based on a cascaded CNN-LSTM framework, called CL-PMI. We used a convolutional neural network to automatically extract features and obtain pre-miRNA spatial information. We also employed long short-term memory (LSTM) to capture time characteristics of pre-miRNAs and improve attention mechanisms for long-term dependence modeling. Focal loss was used to improve the dataset imbalance. Compared with existing methods, CL-PMI achieved better performance on all datasets. The results demonstrate that this method can effectively identify pre-miRNAs by simultaneously considering their spatial and sequential information, as well as dealing with imbalance in the datasets.

  2. f

    Distribution of labels of included datasets.

    • plos.figshare.com
    xls
    Updated Jun 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Youho Myong; Dan Yoon; Byeong Soo Kim; Young Gyun Kim; Yongsik Sim; Suji Lee; Jiyoung Yoon; Minwoo Cho; Sungwan Kim (2023). Distribution of labels of included datasets. [Dataset]. http://doi.org/10.1371/journal.pone.0279349.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Youho Myong; Dan Yoon; Byeong Soo Kim; Young Gyun Kim; Yongsik Sim; Suji Lee; Jiyoung Yoon; Minwoo Cho; Sungwan Kim
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundAccurate interpretation of chest radiographs requires years of medical training, and many countries face a shortage of medical professionals to meet such requirements. Recent advancements in artificial intelligence (AI) have aided diagnoses; however, their performance is often limited due to data imbalance. The aim of this study was to augment imbalanced medical data using generative adversarial networks (GANs) and evaluate the clinical quality of the generated images via a multi-center visual Turing test.MethodsUsing six chest radiograph datasets, (MIMIC, CheXPert, CXR8, JSRT, VBD, and OpenI), starGAN v2 generated chest radiographs with specific pathologies. Five board-certified radiologists from three university hospitals, each with at least five years of clinical experience, evaluated the image quality through a visual Turing test. Further evaluations were performed to investigate whether GAN augmentation enhanced the convolutional neural network (CNN) classifier performances.ResultsIn terms of identifying GAN images as artificial, there was no significant difference in the sensitivity between radiologists and random guessing (result of radiologists: 147/275 (53.5%) vs result of random guessing: 137.5/275, (50%); p = .284). GAN augmentation enhanced CNN classifier performance by 11.7%.ConclusionRadiologists effectively classified chest pathologies with synthesized radiographs, suggesting that the images contained adequate clinical information. Furthermore, GAN augmentation enhanced CNN performance, providing a bypass to overcome data imbalance in medical AI training. CNN based methods rely on the amount and quality of training data; the present study showed that GAN augmentation could effectively augment training data for medical AI.

  3. f

    Different size convergence times.

    • plos.figshare.com
    xls
    Updated Jun 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HuanQing Xu; Xian Shao; Shiji Hui; Li Jin (2023). Different size convergence times. [Dataset]. http://doi.org/10.1371/journal.pone.0282350.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    PLOS ONE
    Authors
    HuanQing Xu; Xian Shao; Shiji Hui; Li Jin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ObjectivesBreast cancer is a major health problem with high mortality rates. Early detection of breast cancer will promote treatment. A technology that determines whether a tumor is benign desirable. This article introduces a new method in which deep learning is used to classify breast cancer.MethodsA new computer-aided detection (CAD) system is presented to classify benign and malignant masses in breast tumor cell samples. In the CAD system, (1) for the pathological data of unbalanced tumors, the training results are biased towards the side with the larger number of samples. This paper uses a Conditional Deep Convolution Generative Adversarial Network (CDCGAN) method to generate small samples by orientation data set to solve the imbalance problem of collected data. (2) For the high-dimensional data redundancy problem, this paper proposes an integrated dimension reduction convolutional neural network (IDRCNN) model, which solves the high-dimensional data dimension reduction problem of breast cancer and extracts effective features. The subsequent classifier found that by using the IDRCNN model proposed in this paper, the accuracy of the model was improved.ResultsExperimental results show that IDRCNN combined with the model of CDCGAN model has superior classification performance than existing methods, as revealed by sensitivity, area under the curve (AUC), ROC curve and accuracy, recall, sensitivity, specificity, precision,PPV,NPV and f-values analysis.ConclusionThis paper proposes a Conditional Deep Convolution Generative Adversarial Network (CDCGAN) which can solve the imbalance problem of manually collected data by directionally generating small sample data sets. And an integrated dimension reduction convolutional neural network (IDRCNN) model, which solves the high-dimensional data dimension reduction problem of breast cancer and extracts effective features.

  4. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Huiqing Wang; Yue Ma; Chunlin Dong; Chun Li; Jingjing Wang; Dan Liu (2023). DataSheet_1_CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks.pdf [Dataset]. http://doi.org/10.3389/fgene.2019.00967.s001

DataSheet_1_CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks.pdf

Related Article
Explore at:
pdfAvailable download formats
Dataset updated
May 31, 2023
Dataset provided by
Frontiers
Authors
Huiqing Wang; Yue Ma; Chunlin Dong; Chun Li; Jingjing Wang; Dan Liu
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

MicroRNAs (miRNAs) are the major class of gene-regulating molecules that bind mRNAs. They function mainly as translational repressors in mammals. Therefore, how to identify miRNAs is one of the most important problems in medical treatment. Many known pre-miRNAs have a hairpin ring structure containing more structural features, and it is difficult to identify mature miRNAs because of their short length. Therefore, most research focuses on the identification of pre-miRNAs. Most computational models rely on manual feature extraction to identify pre-miRNAs and do not consider the sequential and spatial characteristics of pre-miRNAs, resulting in a loss of information. As the number of unidentified pre-miRNAs is far greater than that of known pre-miRNAs, there is a dataset imbalance problem, which leads to a degradation of the performance of pre-miRNA identification methods. In order to overcome the limitations of existing methods, we propose a pre-miRNA identification algorithm based on a cascaded CNN-LSTM framework, called CL-PMI. We used a convolutional neural network to automatically extract features and obtain pre-miRNA spatial information. We also employed long short-term memory (LSTM) to capture time characteristics of pre-miRNAs and improve attention mechanisms for long-term dependence modeling. Focal loss was used to improve the dataset imbalance. Compared with existing methods, CL-PMI achieved better performance on all datasets. The results demonstrate that this method can effectively identify pre-miRNAs by simultaneously considering their spatial and sequential information, as well as dealing with imbalance in the datasets.

Search
Clear search
Close search
Google apps
Main menu