20 datasets found
  1. Data from: GIAB Benchmarking of HG002 Assemblies from HPRC Year 1 Bakeoff

    • catalog.data.gov
    Updated Jul 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2022). GIAB Benchmarking of HG002 Assemblies from HPRC Year 1 Bakeoff [Dataset]. https://catalog.data.gov/dataset/giab-benchmarking-of-hg002-assemblies-from-hprc-year-1-bakeoff
    Explore at:
    Dataset updated
    Jul 29, 2022
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Description

    The Human Pangenome Reference Consortium (HPRC) tested which combination of current genome sequencing and automated assembly approaches yields the most complete, accurate, and cost-effective diploid genome assemblies with minimal manual curation. Assemblies were generated for GIAB HG002. Variant calls from twenty-nine assemblies were evaluated by NIST using dipcall v0.3 (https://github.com/lh3/dipcall) to produce variant calls when aligned to GRCh38. Benchmarking of small variant calls was then performed against GIAB benchmark v4.2.1 using hap.py v3.12 (https://github.com/Illumina/hap.py).

  2. Z

    Data for: Nanopore R10.4.1 LSK114 HG002: subset of 20000 reads in BLOW5...

    • data.niaid.nih.gov
    Updated Mar 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gamaarachchi, Hasindu (2023). Data for: Nanopore R10.4.1 LSK114 HG002: subset of 20000 reads in BLOW5 format [Dataset]. https://data.niaid.nih.gov/resources?id=ZENODO_7695412
    Explore at:
    Dataset updated
    Mar 3, 2023
    Dataset authored and provided by
    Gamaarachchi, Hasindu
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    HG002 (NA24385) is a reference human genome sample used for benchmarking and comparing bioinformatics applications. This dataset contains a subset of 20,000 reads from the HG002 human reference sample, sequenced using an Oxford Nanopore Technologies PromethION sequencer on an R10.4.1 flowcell. Sheared DNA libraries (~17Kb) were prepared using the ONT LSK114 ligation library prep and an R10.4.1 flow cell was used to generate ~30X genome coverage. The original data in the FAST5 format was converted to BLOW5 format using slow5tools v0.8.0. This is a downsampled subset containing 20,000 reads in BLOW5 format.

  3. f

    HG002 Illumina PCR Free

    • figshare.com
    application/gzip
    Updated Jun 21, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nathan Dwarshuis (2023). HG002 Illumina PCR Free [Dataset]. http://doi.org/10.6084/m9.figshare.22637347.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    figshare
    Authors
    Nathan Dwarshuis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    HG002 Illumina PCR Free vcf 40x coverage

  4. A

    Challenging Medically-Relevant Genes Benchmark Set

    • data.amerigeoss.org
    • catalog.data.gov
    bin, gz, text, tsv
    Updated Aug 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States (2022). Challenging Medically-Relevant Genes Benchmark Set [Dataset]. http://identifiers.org/ark:/88434/mds2-2475
    Explore at:
    bin, gz, text, tsvAvailable download formats
    Dataset updated
    Aug 28, 2022
    Dataset provided by
    United States
    License

    https://www.nist.gov/open/licensehttps://www.nist.gov/open/license

    Description

    CMRG v1.00 of a small variant benchmark and structural variant benchmark focused on 273 challenging medically relevant genes for the Genome in a Bottle (GIAB) sample HG002 (aka Ashkenazi son). These benchmarks were generated from a trio-based hifiasm v0.11 (https://doi.org/10.1038/s41592-020-01056-5) diploid assembly of HG002 using PacBio HiFi reads for HG002 for assembly and partitioning into phased haplotypes using Illumina reads for the parents, HG003 and HG004. This benchmark contains vcfs for small and structural variants along with corresponding benchmark bed files indicating regions that are homozygous reference if they do not have a variant in the vcf. We extensively curated the variant calls, excluding any found to be questionable or errors. This benchmark helps measure performance in important challenging regions, including challenging segmental duplications, regions with complex variants, regions with structural variants, and regions affected by false duplications in GRCh37 or GRCh38. This benchmark is described in https://doi.org/10.1101/2021.06.07.444885.

  5. HG002 Ultima (2024)

    • figshare.com
    application/gzip
    Updated Apr 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nathan Dwarshuis (2024). HG002 Ultima (2024) [Dataset]. http://doi.org/10.6084/m9.figshare.25554984.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Apr 5, 2024
    Dataset provided by
    figshare
    Authors
    Nathan Dwarshuis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description
  6. Test data for sv-callers workflow

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Nov 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arnold Kuzniar; Arnold Kuzniar; Luca Santuari; Luca Santuari (2023). Test data for sv-callers workflow [Dataset]. http://doi.org/10.5281/zenodo.4001614
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 19, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Arnold Kuzniar; Arnold Kuzniar; Luca Santuari; Luca Santuari
    Description

    This distribution includes data analyzed by the sv-callers workflow (v1.1.0) in the single-sample (germline) and paired-sample (somatic) modes:

  7. o

    Garvan Institute Long Read Sequencing Benchmark Data

    • registry.opendata.aws
    Updated Jun 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Genomic Technologies Group, Garvan Institute of Medical Research (https://www.garvan.org.au/research/labs-groups/genomic-technologies-lab) (2023). Garvan Institute Long Read Sequencing Benchmark Data [Dataset]. https://registry.opendata.aws/gtgseq/
    Explore at:
    Dataset updated
    Jun 16, 2023
    Dataset provided by
    Genomic Technologies Group, Garvan Institute of Medical Research (<a href="https://www.garvan.org.au/research/labs-groups/genomic-technologies-lab">https://www.garvan.org.au/research/labs-groups/genomic-technologies-lab</a>)
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The dataset contains reference samples that will be useful for benchmarking and comparing bioinformatics tools for genome analysis. Currently, there are two samples, which are NA12878 (HG001) and NA24385 (HG002), sequenced on an Oxford Nanopore Technologies (ONT) PromethION using the latest R10.4.1 flowcells. Raw signal data output by the sequencer is provided for these datasets in BLOW5 format, and can be rebasecalled when basecalling software updates bring accuracy and feature improvements over the years. Raw signal data is not only for rebasecalling, but also can be used for emerging bioinformatics tools that directly analyse raw signal data. We also provide the basecalled data alongside the raw signal data and will continue to provide updated basecalls when there is a major update to the basecalling software. In the future, we plan to extend this open dataset with additional samples, including sequencing runs from vendors other than ONT.

  8. o

    PopDel identifies medium-size deletions jointly in tens of thousands of...

    • explore.openaire.eu
    Updated Aug 20, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sebastian Niehus (2020). PopDel identifies medium-size deletions jointly in tens of thousands of genomes - Variant call sets [Dataset]. http://doi.org/10.5281/zenodo.3992606
    Explore at:
    Dataset updated
    Aug 20, 2020
    Authors
    Sebastian Niehus
    Description

    This data set contains the variant calls sets generated by different tools for the benchmarks in the paper PopDel identifies medium-size deletions simultaneously in tens of thousands of genomes. It includes the VCFs/BCFs for the following test cases: Random deletion simulation on up to 1000 chromosome 21 samples 1000 Genomes Project deletions inserted into simulated chromosomes 17 to 22 of up to 500 samples HG001 (NA12878) Trio of HG002 + HG003 + HG004 Polaris Diversity cohort Polaris Kids cohort Further, the long and short read reference call sets for HG001 are provided. For HG002 the reference call set and the high confidence regions by the Genome in a Bottle consortium are provided. For details on how the files have been created, please refer to the paper and the script repository on GitHub. {"references": ["Auton, A., Abecasis, G., Altshuler, D. et al. A global reference for human genetic variation. Nature 526, 68\u201374 (2015).", "Zook, J.M., Chapman, B., Wang, J. et al. Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls. Nat Biotechnol 32, 246\u2013251 (2014).", "Zook, J.M., Hansen, N.F., Olson, N.D. et al. Author Correction: A robust benchmark for detection of germline large deletions and insertions. Nat Biotechnol (2020)."]} This work was funded by the German Federal Ministry of Education and Research under grant number 031L0180.

  9. HG002

    • figshare.com
    application/gzip
    Updated Dec 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ivan Tolstoganov (2022). HG002 [Dataset]. http://doi.org/10.6084/m9.figshare.21678842.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Dec 6, 2022
    Dataset provided by
    figshare
    Authors
    Ivan Tolstoganov
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Assembly graphs and reference Verkko assembly for HG002 dataset

  10. HG002 Ultima (2022)

    • figshare.com
    application/x-gzip
    Updated Apr 5, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nathan Dwarshuis (2024). HG002 Ultima (2022) [Dataset]. http://doi.org/10.6084/m9.figshare.25554978.v1
    Explore at:
    application/x-gzipAvailable download formats
    Dataset updated
    Apr 5, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Nathan Dwarshuis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description
  11. Minigraph pangenome graphs for HPRC year-1 samples

    • zenodo.org
    application/gzip, bin +1
    Updated Aug 12, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Heng Li; Heng Li (2022). Minigraph pangenome graphs for HPRC year-1 samples [Dataset]. http://doi.org/10.5281/zenodo.6286522
    Explore at:
    application/gzip, txt, binAvailable download formats
    Dataset updated
    Aug 12, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Heng Li; Heng Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Pangenome graphs built with minigraph-0.14 for HPRC year-1 samples, excluding HG002, HG002 and NA19240. See 00README.txt for file description.

  12. Data from: SVXplorer: three-tier approach to identification of structural...

    • commons.datacite.org
    Updated Feb 3, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kunal Kathuria; Aakrosh Ratan (2020). SVXplorer: three-tier approach to identification of structural variants via sequential recombination of discordant cluster signatures [Dataset]. http://doi.org/10.5281/zenodo.3634027
    Explore at:
    Dataset updated
    Feb 3, 2020
    Dataset provided by
    DataCitehttps://www.datacite.org/
    Zenodohttp://zenodo.org/
    Authors
    Kunal Kathuria; Aakrosh Ratan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains the bgzipped variants calls in VCF format for CHM1, NA12878 and AJ trio dataset that are used in the SVXplorer manuscript. The names of the files contain the name of the sample (CHM1/NA12878/HG002/HG003/HG004), the name of the method (SVXplorer/DELLY/LUMPY/TIDDIT/TARDIS/MANTA) used to call the variants. There are three separate files for the DELLY calls which have the deletions, duplications and the inversion calls made by DELLY for each of the samples. For NA12878, there are two sets of calls, one for each of the libraries (ERR194147/SRR505885)

  13. o

    Transposable element pangenome

    • explore.openaire.eu
    Updated Jan 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cristian Groza; Guillaume Bourque; Clement Goubert (2022). Transposable element pangenome [Dataset]. http://doi.org/10.5281/zenodo.5898620
    Explore at:
    Dataset updated
    Jan 24, 2022
    Authors
    Cristian Groza; Guillaume Bourque; Clement Goubert
    Description

    A genome graph and associated GCSA index describing the transposable element pangenome derived from reference annotations and non-reference insertions from the HG00733 and HG002 genomes.

  14. HG002 PacBio Hifi

    • figshare.com
    application/gzip
    Updated Jun 21, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nathan Dwarshuis (2023). HG002 PacBio Hifi [Dataset]. http://doi.org/10.6084/m9.figshare.22637410.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    figshare
    Authors
    Nathan Dwarshuis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    HG002 PacBio Hifi vcf 37x coverage

  15. Performance of deletion calls for HG002.

    • figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kunal Kathuria; Aakrosh Ratan (2023). Performance of deletion calls for HG002. [Dataset]. http://doi.org/10.1371/journal.pcbi.1007737.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Kunal Kathuria; Aakrosh Ratan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Performance of deletion calls for HG002.

  16. f

    Heuristics used to determine HG002 genotypes.

    • figshare.com
    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lesley M. Chapman; Noah Spies; Patrick Pai; Chun Shen Lim; Andrew Carroll; Giuseppe Narzisi; Christopher M. Watson; Christos Proukakis; Wayne E. Clarke; Naoki Nariai; Eric Dawson; Garan Jones; Daniel Blankenberg; Christian Brueffer; Chunlin Xiao; Sree Rohit Raj Kolora; Noah Alexander; Paul Wolujewicz; Azza E. Ahmed; Graeme Smith; Saadlee Shehreen; Aaron M. Wenger; Marc Salit; Justin M. Zook (2023). Heuristics used to determine HG002 genotypes. [Dataset]. http://doi.org/10.1371/journal.pcbi.1007933.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS Computational Biology
    Authors
    Lesley M. Chapman; Noah Spies; Patrick Pai; Chun Shen Lim; Andrew Carroll; Giuseppe Narzisi; Christopher M. Watson; Christos Proukakis; Wayne E. Clarke; Naoki Nariai; Eric Dawson; Garan Jones; Daniel Blankenberg; Christian Brueffer; Chunlin Xiao; Sree Rohit Raj Kolora; Noah Alexander; Paul Wolujewicz; Azza E. Ahmed; Graeme Smith; Saadlee Shehreen; Aaron M. Wenger; Marc Salit; Justin M. Zook
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Heuristics used to determine HG002 genotypes.

  17. f

    Classification of inversions detected in HG002 using PacBio and ONT reads.

    • figshare.com
    xls
    Updated Jun 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jingwen Ren; Mark J. P. Chaisson (2023). Classification of inversions detected in HG002 using PacBio and ONT reads. [Dataset]. http://doi.org/10.1371/journal.pcbi.1009078.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 10, 2023
    Dataset provided by
    PLOS Computational Biology
    Authors
    Jingwen Ren; Mark J. P. Chaisson
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Classification of inversions detected in HG002 using PacBio and ONT reads.

  18. f

    Additional file 2 of cDNA-detector: detection and removal of cDNA...

    • springernature.figshare.com
    • figshare.com
    xlsx
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Meifang Qi; Utthara Nayar; Leif S. Ludwig; Nikhil Wagle; Esther Rheinbay (2023). Additional file 2 of cDNA-detector: detection and removal of cDNA contamination in DNA sequencing libraries [Dataset]. http://doi.org/10.6084/m9.figshare.17598081.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    figshare
    Authors
    Meifang Qi; Utthara Nayar; Leif S. Ludwig; Nikhil Wagle; Esther Rheinbay
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 2. Effect of contaminant cDNA reads and decontamination with cDNA-detector on structural variant calling in HG002 WGS.

  19. f

    Table_1_Profiling genes encoding the adaptive immune receptor repertoire...

    • frontiersin.figshare.com
    xlsx
    Updated Jun 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mao-Jan Lin; Yu-Chun Lin; Nae-Chyun Chen; Allen Chilun Luo; Sheng-Kai Lai; Chia-Lang Hsu; Jacob Shujui Hsu; Chien-Yu Chen; Wei-Shiung Yang; Pei-Lung Chen (2023). Table_1_Profiling genes encoding the adaptive immune receptor repertoire with gAIRR Suite.xlsx [Dataset]. http://doi.org/10.3389/fimmu.2022.922513.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 16, 2023
    Dataset provided by
    Frontiers
    Authors
    Mao-Jan Lin; Yu-Chun Lin; Nae-Chyun Chen; Allen Chilun Luo; Sheng-Kai Lai; Chia-Lang Hsu; Jacob Shujui Hsu; Chien-Yu Chen; Wei-Shiung Yang; Pei-Lung Chen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Adaptive immune receptor repertoire (AIRR) is encoded by T cell receptor (TR) and immunoglobulin (IG) genes. Profiling these germline genes encoding AIRR (abbreviated as gAIRR) is important in understanding adaptive immune responses but is challenging due to the high genetic complexity. Our gAIRR Suite comprises three modules. gAIRR-seq, a probe capture-based targeted sequencing pipeline, profiles gAIRR from individual DNA samples. gAIRR-call and gAIRR-annotate call alleles from gAIRR-seq reads and annotate whole-genome assemblies, respectively. We gAIRR-seqed TRV and TRJ of seven Genome in a Bottle (GIAB) DNA samples with 100% accuracy and discovered novel alleles. We also gAIRR-seqed and gAIRR-called the TR and IG genes of a subject from both the peripheral blood mononuclear cells (PBMC) and oral mucosal cells. The calling results from these two cell types have a high concordance (99% for all known gAIRR alleles). We gAIRR-annotated 36 genomes to unearth 325 novel TRV alleles and 29 novel TRJ alleles. We could further profile the flanking sequences, including the recombination signal sequence (RSS). We validated two structural variants for HG002 and uncovered substantial differences of gAIRR genes in references GRCh37 and GRCh38. gAIRR Suite serves as a resource to sequence, analyze, and validate germline TR and IG genes to study various immune-related phenotypes.

  20. Additional file 1 of Characterization of telomere variant repeats using long...

    • springernature.figshare.com
    xlsx
    Updated May 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zachary Stephens; Jean-Pierre Kocher (2024). Additional file 1 of Characterization of telomere variant repeats using long reads enables allele-specific telomere length estimation [Dataset]. http://doi.org/10.6084/m9.figshare.25844365.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 17, 2024
    Dataset provided by
    figshare
    Authors
    Zachary Stephens; Jean-Pierre Kocher
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 1: Correlation of ATLs computed from PacBio and ONT datasets for HG002, using different methods for choosing a representative ATL from length distributions.

  21. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
National Institute of Standards and Technology (2022). GIAB Benchmarking of HG002 Assemblies from HPRC Year 1 Bakeoff [Dataset]. https://catalog.data.gov/dataset/giab-benchmarking-of-hg002-assemblies-from-hprc-year-1-bakeoff
Organization logo

Data from: GIAB Benchmarking of HG002 Assemblies from HPRC Year 1 Bakeoff

Related Article
Explore at:
Dataset updated
Jul 29, 2022
Dataset provided by
National Institute of Standards and Technologyhttp://www.nist.gov/
Description

The Human Pangenome Reference Consortium (HPRC) tested which combination of current genome sequencing and automated assembly approaches yields the most complete, accurate, and cost-effective diploid genome assemblies with minimal manual curation. Assemblies were generated for GIAB HG002. Variant calls from twenty-nine assemblies were evaluated by NIST using dipcall v0.3 (https://github.com/lh3/dipcall) to produce variant calls when aligned to GRCh38. Benchmarking of small variant calls was then performed against GIAB benchmark v4.2.1 using hap.py v3.12 (https://github.com/Illumina/hap.py).

Search
Clear search
Close search
Google apps
Main menu