11 datasets found
  1. Niphotrichum japonicum genome project

    • figshare.com
    txt
    Updated Sep 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xuping Zhou (2023). Niphotrichum japonicum genome project [Dataset]. http://doi.org/10.6084/m9.figshare.23573514.v5
    Explore at:
    txtAvailable download formats
    Dataset updated
    Sep 16, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Xuping Zhou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset contains the genome and annotation file of Niphotrichum japonicum. The raw data have been deposited in National Genomics Data Center (NGDC; https://ngdc.cncb.ac.cn/bioproject/) under the BioProject accession number PRJCA017860.

  2. Genome assembly and annotation of the allotetraploid wild grass Aegilops...

    • figshare.com
    application/gzip
    Updated Dec 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zehou Liu (2024). Genome assembly and annotation of the allotetraploid wild grass Aegilops ventricosa RM271 [Dataset]. http://doi.org/10.6084/m9.figshare.27966039.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Dec 5, 2024
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Zehou Liu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Genome assembly and annotation of the allotetraploid wild grass Aegilops ventricosa RM271, which has been published in https://doi.org/10.1016/j.xplc.2024.101131. All the raw sequencing data for Ae. ventricosa RM271 are available in the National Genomics Data Center (https://ngdc.cncb.ac.cn/?lang=en) under BioProject numbers PRJCA018801 (https://ngdc.cncb.ac.cn/bioproject/browse/PRJCA018801).

  3. S

    Data from: A near-complete genome assembly for northern wild rice (Zizania...

    • scidb.cn
    Updated Sep 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qing Ma; Lei Gu; Yanning Xie; Wanhong Li; Jie Qiu; Zhongfeng Zhang; Ning Yan (2025). A near-complete genome assembly for northern wild rice (Zizania palustris L.) [Dataset]. http://doi.org/10.57760/sciencedb.j00210.00043
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 12, 2025
    Dataset provided by
    Science Data Bank
    Authors
    Qing Ma; Lei Gu; Yanning Xie; Wanhong Li; Jie Qiu; Zhongfeng Zhang; Ning Yan
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Northern wild rice (NWR; Zizania palustris L.), an annual aquatic plant in the Poaceae family, has high economic importance due to its nutrient-rich grains. However, the existing NWR genome assembly for this species has severe fragmentation and incomplete gene representation. A near-complete genome was assembled in this study to provide a high-quality genomic reference for NWR-associated research. The assembled genome exhibited a total contig length of 1.41 Gb and a contig N50 of 109.22 Mb. Overall, a 73.60% repetitive sequence content was identified and 47,804 genes predicted. Phylogenetic analysis indicated that Z. palustris was most closely related to Zizania latifolia, with an estimated divergence time of 4.57–8.15 Mya. Meanwhile, Z. palustris underwent a recent, species-specific long terminal repeat (LTR) expansion, associated with its larger genome size. We identified two genomic blocks in the Z. palustris and Z. latifolia genomes that exhibit strong synteny with the rice phytocassane biosynthetic gene cluster. The centromeric satellite repeats in Z. palustris identified in this study primarily comprised a 145 bp repetitive unit. The findings also revealed centromere homogenisation and rearrangement accompanied by LTR invasion in NWR. Among the genes missing in the previous NWR genome, we observed LTR insertion events that resulted in expanded gene lengths in our updated NWR genome. The present updated NWR genome provides a valuable resource for crop genetic improvement, functional gene discovery, and research on critical biological processes.The whole-genome sequence data reported in this paper were also deposited in the Genome Warehouse in National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences/China National Center for Bioinformation under accession number GWHFIIQ00000000.1 at https://ngdc.cncb.ac.cn/gwh.

  4. S

    Annotation of the genome for the Banna miniature inbred pig

    • scidb.cn
    Updated Jul 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hong-Man Chen; Hong-Jiang Wei (2024). Annotation of the genome for the Banna miniature inbred pig [Dataset]. http://doi.org/10.57760/sciencedb.10833
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 22, 2024
    Dataset provided by
    Science Data Bank
    Authors
    Hong-Man Chen; Hong-Jiang Wei
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The annotation includes protein-coding genes, ncRNAs, and repeat elements, which are stored in three separate files in gff format. The corresponding genome assembly has been deposited in the GWH database under accession number GWHEUVB00000000.1 and is publicly available at https://ngdc.cncb.ac.cn/gwh.

  5. N

    Single-cell sequencing of tumor ecosystems in papillary thyroid carcinoma

    • data.niaid.nih.gov
    Updated May 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Weilin Pu; Xiao Shi; Pengcheng Yu; Meiying Zhang; Xiaoming Zhang; Yulong Wang (2024). Single-cell sequencing of tumor ecosystems in papillary thyroid carcinoma [Dataset]. https://data.niaid.nih.gov/resources?id=gse184362
    Explore at:
    Dataset updated
    May 1, 2024
    Authors
    Weilin Pu; Xiao Shi; Pengcheng Yu; Meiying Zhang; Xiaoming Zhang; Yulong Wang
    Description

    The tumor ecosystem of papillary thyroid carcinoma (PTC) is poorly characterized. Using single-cell RNA sequencing, we profiled transcriptomes of 158,577 cells from 11 patients’ paratumors, localized/advanced tumors, initially-treated/recurrent lymph nodes and radioactive iodine (RAI)-refractory distant metastases, covering comprehensive clinical courses of PTC.Our study illuminates a comprehensive landscape of PTC ecosystem that suggests potential prognostic and therapeutic implications. Our work not only adds dimensions to the biological underpinnings of PTC heterogeneity, but also provides a benchmark dataset for this malignancy. In total, 23 fresh surgical specimens (7 primary tumors, 6 para-tumors, 8 metastatic LNs and 2 subcutaneous metastatic loci) were collected and subjected to scRNA-seq with 10X Genomics. Submitter declares that the raw data have been deposited in the Genome Sequence Archive (https://ngdc.cncb.ac.cn/gsa/) under submission number HRA001107.

  6. f

    Data from: Gut microbiota impacts bone via Bacteroides vulgatus-valeric...

    • datasetcatalog.nlm.nih.gov
    • figshare.com
    Updated Aug 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Greenbaum, Jonathan; Liu, Hui-Min; Xu, Xue-Juan; Su, Kuan-Jui; Lv, Wan-Qiang; Shen, Jie; Xiao, Hong-Mei; Papasian, Chris J.; Guo, Bo-Yi; Zhou, Rou; Song, Yu-Qian; Ge, Chang-Li; Deng, Hong-Wen; Wang, Xia-Fang; Chen, Yuan-Cheng; Pan, Dao-Yan; Li, Zhang-Fang; Shi, Xing-Ming; Yi, Neng-Jun; Meng, Xiang-He; Zhao, Qi; Lv, Feng-Ye; Zhang, Yin-Hua; Peng, Cheng; Luo, Zhe; Gong, Rui; Shen, Hui; Zhang, Qiang; Lin, Xu; Lu, Jun-Min; Ao, Zeng-Xin; chen, zhi (2023). Gut microbiota impacts bone via Bacteroides vulgatus-valeric acid-related pathways [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000964191
    Explore at:
    Dataset updated
    Aug 29, 2023
    Authors
    Greenbaum, Jonathan; Liu, Hui-Min; Xu, Xue-Juan; Su, Kuan-Jui; Lv, Wan-Qiang; Shen, Jie; Xiao, Hong-Mei; Papasian, Chris J.; Guo, Bo-Yi; Zhou, Rou; Song, Yu-Qian; Ge, Chang-Li; Deng, Hong-Wen; Wang, Xia-Fang; Chen, Yuan-Cheng; Pan, Dao-Yan; Li, Zhang-Fang; Shi, Xing-Ming; Yi, Neng-Jun; Meng, Xiang-He; Zhao, Qi; Lv, Feng-Ye; Zhang, Yin-Hua; Peng, Cheng; Luo, Zhe; Gong, Rui; Shen, Hui; Zhang, Qiang; Lin, Xu; Lu, Jun-Min; Ao, Zeng-Xin; chen, zhi
    Description

    We performed integrative analyses in a Chinese cohort of peri-/post-menopausal women (n = 517) with metagenomics (n = 499), targeted metabolomics (n = 500) and whole-genome sequencing (n = 500) to identify novel microbiome-related biomarkers for bone health. We also performed metagenomics in a US whites cohort for validation (n = 59).The characteristics of the cohorts, genome-wide association study (GWAS) data, concentrations of serum short chain fatty acids (SCFAs), relative abundance of gut microbiota and gut microbiota-associated functional KEGG modules are shown in this dataset.The sequencing data of the Chinese cohort (whole genome sequencing and metagenomics) can be found in “Genome Sequence Archive for Human” (https://ngdc.cncb.ac.cn/gsa-human, accession No. HRA004900). The metagenomic sequencing data can be found in “Sequence Read Archive” (https://www.ncbi.nlm.nih.gov/sra, accession No. PRJNA986283 and PRJNA1011937).

  7. S

    Genome annotation of the assembly for Fulvetta ruficapilla (Fruf_v1)

    • scidb.cn
    Updated Jun 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chen Yan; Ming-Shan Wang (2024). Genome annotation of the assembly for Fulvetta ruficapilla (Fruf_v1) [Dataset]. http://doi.org/10.57760/sciencedb.09502
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 25, 2024
    Dataset provided by
    Science Data Bank
    Authors
    Chen Yan; Ming-Shan Wang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Genome annotation of the assembly for Fulvetta ruficapilla (Fruf_v1).This annotation includes aonnotation for protein-coding genes, ncRNAs, and repetitive elements in three different files in the gff format. The associated genome assembly has been deposited in the GWH database under the accession number GWHETLV00000000.1, publicly available at https://ngdc.cncb.ac.cn/gwh.This version of dataset is provided a natural picture of the species Fulvetta ruficapilla as the cover image, which is taken by authors of the dataset.

  8. Platycodon grandifloras genome annotation

    • figshare.com
    txt
    Updated Nov 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bo Wang (2024). Platycodon grandifloras genome annotation [Dataset]. http://doi.org/10.6084/m9.figshare.19093331.v4
    Explore at:
    txtAvailable download formats
    Dataset updated
    Nov 22, 2024
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Bo Wang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Platycodon grandifloras genome annotationpasa2.longest.filter-update-v2.gff3 corresponds to the reference genome publicly available at https://ngdc.cncb.ac.cn/gwh (GWH: GWHARYT00000000.1).

  9. S

    Raw data of DAP-Seq of GiMYB76

    • scidb.cn
    Updated Sep 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yuping Li; Zhigeng Wu; Jixiang Zhu; Jiangyi Zeng; Yongliang Liu; Jihua Wang; Ling Yuan; Ying Wang; Yongqing Li (2025). Raw data of DAP-Seq of GiMYB76 [Dataset]. http://doi.org/10.57760/sciencedb.j00210.00049
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 28, 2025
    Dataset provided by
    Science Data Bank
    Authors
    Yuping Li; Zhigeng Wu; Jixiang Zhu; Jiangyi Zeng; Yongliang Liu; Jihua Wang; Ling Yuan; Ying Wang; Yongqing Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Genomic DNA from G. inflata was extracted by using phenol/ chloroform method, and purified DNA was quantified using a Qubit (Thermo Fisher Scientific). A total of 5 μg genomic DNA was sheared to ~200 bp fragments; the repaired ends were ligated Illumina-based sequencing adaptors to construct an adaptor-ligated DNA library.The ORF sequence of GiMYB76 was fused with the Halo affinity tag and expressed in vitro using TNT® SP6 Coupled Wheat Germ Extract System (Promega, Fitchburg, WI, USA). The fusion protein was bound to magnetic Halo Tag ligand (chloroalkane) beads, isolated from the expression mixture, and verified by Western blotting with anti-Halo Tag antibody (Promega), with nonspecific proteins were washed away. HaloTag-GiMYB76 fusion proteins were incubated with 500 ng of adaptor-ligated genomic DNA library at 30 ℃ for 2 h. Unbound DNA fragments were removed by washing, and samples were heated at 98 ℃ for 10 min to release GiMYB76 bound DNA The recovered DNA was amplified by PCR using indexed TruSeq primers. Indexed DNA samples were subsequently pooled and size-selected to remove residual adaptor dimers. Purified DNA libraries were subjected to next generation sequencing (PE150).The DAP-Seq data used in this study are also available from the National Genomics Data Center (https://ngdc.cncb.ac.cn/; PRJCA039808).

  10. Data from: Exploring the maize transcriptional regulatory landscape through...

    • zenodo.org
    application/gzip, bin +1
    Updated Oct 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qiang Huo; Qiang Huo (2025). Exploring the maize transcriptional regulatory landscape through large-scale profiling of transcription factor binding sites [Dataset]. http://doi.org/10.5281/zenodo.16027369
    Explore at:
    csv, bin, application/gzipAvailable download formats
    Dataset updated
    Oct 7, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Qiang Huo; Qiang Huo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Description - Version 1

    This dataset contains:

    Network files:

    • mGRN.csv (12.33 MB) - Tissue-specific networks (root, ear, tassel, seed, stem, leaf)

    Transcription factor data:

    • 405_TFs_motif.rar (163.28 KB) - Transcription factor motif data
    • 513_TFs_peak_filter files - Peak files for 513 transcription factors (before and after regulatory region filtering)

    Functional analysis:

    • GO term enrichment results (17.02 MB)
    • Regulatory region analysis (14.24 MB)

    Resources:

    ⚠️ Note: An updated version (v2) with additional ChIPSeeker annotation files is available at: https://doi.org/10.5281/zenodo.16029269

    Citation:
    If you use the data or code, please cite: Huo Q., Zhang Z., Zhang K., Wang Q., Zhang W., Ye X., Lyu Q., Galbraith D.W., Ma Z., and Song R. (2025). Exploring the maize transcriptional regulatory landscape through large-scale profiling of transcription factor binding sites. Mol. Plant. 18, 1777–1798. doi: https://doi.org/10.1016/j.molp.2025.08.009

  11. S

    The complete mitochondrial genomes of Pennisetum glaucum ‘23A’ and ‘23B’

    • scidb.cn
    Updated Sep 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhenzhen Tan; Wentao Xue; Chen Qian; Zhiwei Liu; Juanzi Wu; Xiaoxian Zhong (2025). The complete mitochondrial genomes of Pennisetum glaucum ‘23A’ and ‘23B’ [Dataset]. http://doi.org/10.57760/sciencedb.j00210.00047
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 20, 2025
    Dataset provided by
    Science Data Bank
    Authors
    Zhenzhen Tan; Wentao Xue; Chen Qian; Zhiwei Liu; Juanzi Wu; Xiaoxian Zhong
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Raw sequence data of mitochondrial genomes from Pennisetum glaucum ‘23A’ and ‘23B’ based on Illumina NovaSeq 6000 and PacBio Sequel II platforms. The data are also publicly accessible at https://ngdc.cncb.ac.cn/gsa.

  12. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Xuping Zhou (2023). Niphotrichum japonicum genome project [Dataset]. http://doi.org/10.6084/m9.figshare.23573514.v5
Organization logoOrganization logo

Niphotrichum japonicum genome project

Explore at:
txtAvailable download formats
Dataset updated
Sep 16, 2023
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Xuping Zhou
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The dataset contains the genome and annotation file of Niphotrichum japonicum. The raw data have been deposited in National Genomics Data Center (NGDC; https://ngdc.cncb.ac.cn/bioproject/) under the BioProject accession number PRJCA017860.

Search
Clear search
Close search
Google apps
Main menu