100+ datasets found

f
Reference to Whole-Genome Sequencing Data of Helicobacter pylori
datasetcatalog.nlm.nih.gov
Updated Aug 9, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Qiu, Xunan; Zheng, Shuwen; Chen, Jijun; Guo, Rui; Yuan, Yuan; Ni, Chuxuan; Gong, Yuehua; Wang, Yingying; Yin, Honghao (2024). Reference to Whole-Genome Sequencing Data of Helicobacter pylori [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001434219
Explore at:
Dataset updated
Aug 9, 2024
Authors
Qiu, Xunan; Zheng, Shuwen; Chen, Jijun; Guo, Rui; Yuan, Yuan; Ni, Chuxuan; Gong, Yuehua; Wang, Yingying; Yin, Honghao
Description
This dataset comprises whole-genome sequencing data for Helicobacter pylori (H. pylori), collected from publicly available databases, including NCBI (https://www.ncbi.nlm.nih.gov/) and BV-BRC(https://www.bv-brc.org/). The data was originally generated and submitted to these databases as part of various studies focused on understandingantibiotic resistance of H. pylori.For further information on the studies that contributed to this dataset, please refer to the original research publications("Early genetic diagnosis of clarithromycin resistance in Helicobacter pylori", "Helicobacter pylori Infections in the Bronx, New York: Surveying Antibiotic Susceptibility and Strain Lineage by Whole-Genome Sequencing", "Helicobacter pylori Antimicrobial Resistance and Gene Variants in High- and Low-Gastric-Cancer-Risk Populations", "A Survey of Helicobacter pylori Antibiotic-Resistant Genotypes and Strain Lineages by Whole-Genome Sequencing in China", "Multiple Genome Sequences of Helicobacter pylori Strains of Diverse Disease and Antibiotic Resistance Backgrounds from Malaysia", "Long-Read- and Short-Read-Based Whole-Genome SequencingReveals the Antibiotic Resistance Pattern of Helicobacter pylori", "Antimicrobial resistance patterns and genetic elements associated with the antibiotic resistance of Helicobacter pylori strains from Shanghai") and the associated GenBank accessions, which can be found in the file GenBank Accessions.txt. This file provides a detailed list of the accession numbers, allowing you to access the specific genetic sequences and related data used in this study.
F
Low-coverage Whole Genome Sequencing (LCWGS) of DNA Sequences from...
frdr-dfdr.ca
Updated Sep 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Osagie, Patricia B; Enciso-Romero, Juan; Burg, Theresa M (2025). Low-coverage Whole Genome Sequencing (LCWGS) of DNA Sequences from White-Crowned Sparrows (Zonotrichia leucophrys) in Alberta and British Columbia (Canada) and Colorado and Oregon (U.S.) [Dataset]. http://doi.org/10.20383/103.01131
Explore at:
Unique identifier
https://doi.org/10.20383/103.01131
Dataset updated
Sep 8, 2025
Dataset provided by
Federated Research Data Repository / dépôt fédéré de données de recherche
Authors
Osagie, Patricia B; Enciso-Romero, Juan; Burg, Theresa M
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Canada, United States
Description
This dataset contains low-coverage whole genome sequencing (lcWGS) data from multiple subspecies of white-crowned sparrows (Zonotrichia leucophrys) sampled across Alberta and British Columbia (Canada), and Colorado and Oregon (USA), between 2017 and 2021. The samples represent three focal subspecies (Z. l. gambelii, Z. l. oriantha, and Z. l. pugetensis) collected during the breeding season from riparian deciduous habitats to minimize environmental effects on genetic variation.

Genomic DNA was extracted from blood or feather samples using a modified salt extraction method, and shotgun sequencing libraries were prepared without PCR amplification, incorporating 8 bp unique barcodes per sample. Sequencing was performed at Genome Quebec using Illumina NovaSeq 6000 S4 PE 150 chemistry. Each sample was sequenced at a depth of ~6.2x to 8.9x coverage.

The data processing pipeline includes read alignment to the zebra finch (Taeniopygia guttata) reference genome using BWA, duplicate removal with Picard, indel realignment and overlap clipping, genotype likelihood estimation with SAMtools and BCFtools, and SNP calling with quality and frequency filters. ANGSD was used to infer major/minor alleles, allele frequencies, and to generate population-level variant data for downstream analysis of genetic structure and divergence. This dataset supports investigations into mitonuclear coevolution, population structure, and recent divergence patterns in white-crowned sparrows.
d
Data from: Whole-genome sequencing approaches for conservation biology:...
datadryad.org
search.dataone.org
zip
Updated Aug 7, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Angela P. Fuentes-Pardo; Daniel E. Ruzzante (2017). Whole-genome sequencing approaches for conservation biology: advantages, limitations, and practical recommendations [Dataset]. http://doi.org/10.5061/dryad.3k8v9
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.3k8v9
Dataset updated
Aug 7, 2017
Dataset provided by
Dryad
Authors
Angela P. Fuentes-Pardo; Daniel E. Ruzzante
Time period covered
Jul 20, 2017
Description
data_Genbank_june2017Each file contains the metadata of genomes available in GenBank for a given taxonomic group up to June 2017 (https://www.ncbi.nlm.nih.gov/genome/browse/#). Four levels of assembly were considered: contigs, scaffolds, chromosomes, and complete genome (https://www.ncbi.nlm.nih.gov/assembly/help/#definition).
d
Data from: Whole-genome sequence data and analysis of a Staphylococcus...
catalog.data.gov
agdatacommons.nal.usda.gov
Updated Dec 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Agricultural Research Service (2025). Data from: Whole-genome sequence data and analysis of a Staphylococcus aureus strain SJTUF_J27 isolated from seaweed [Dataset]. https://catalog.data.gov/dataset/data-from-whole-genome-sequence-data-and-analysis-of-a-staphylococcus-aureus-strain-sjtuf--5d2cc
Explore at:
Dataset updated
Dec 2, 2025
Dataset provided by
Agricultural Research Service
Description
The complete genome sequence data of S. aureus SJTUF_J27 isolated from seaweed in China is reported here. The size of the genome is 2.8 Mbp with 32.9% G+C content, consisting of 2614 coding sequences and 77 RNAs. A number of virulence factors, including antimicrobial resistance genes (fluoroquinolone, beta-lactams, fosfomycin, mupirocin, trimethoprim, and aminocoumarin) and the egc enterotoxin cluster, were found in the genome. In addition, the genes encoding metal-binding proteins and associated heavy metal resistance were identified. Phylogenetic data analysis, based upon genome-wide single nucleotide polymorphisms (SNPs), and comparative genomic evaluation with BLAST Ring Image Generator (BRIG) were performed for SJTUF_J27 and four S. aureus strains isolated from food. The completed genome data was deposited in NCBI's GenBank under the accession number CP019117, https://www.ncbi.nlm.nih.gov/nuccore/CP019117. Resources in this dataset: Resource Title: NCBI GenBank Accession CP019117.1: Staphylococcus aureus strain SJTUF_J27 chromosome, complete genome. File Name: Web Page, url: https://www.ncbi.nlm.nih.gov/nuccore/CP019117 With an average of 331-fold sequencing coverage, a genome size of 2,804,759 bp constituting 32.9% of G+C content was generated. RAST annotation of the genome revealed a total of 399 subsystems, 2614 coding sequences (80 of them related to virulence, disease and defense), and 77 RNAs. PathogenFinder showed the probability of this strain being a human pathogen was 98%. Bacteria and source DNA available from Xianming Shi, 800 Dongchuan Road, Shanghai, China, 200240. Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013).
M
A collection of Whole-genome sequencing files from the Cancer Genome Atlas...
datacatalog.mskcc.org
Updated Jul 26, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Cancer Genome Atlas (TCGA) (2021). A collection of Whole-genome sequencing files from the Cancer Genome Atlas program on Adenocarcinoma, filtered from the GDC Data Portal [Dataset]. https://datacatalog.mskcc.org/dataset/10777
Explore at:
Dataset updated
Jul 26, 2021
Dataset provided by
The Cancer Genome Atlas (TCGA)
MSK Library
Description
The GDC Data Portal is a robust data-driven platform that allows cancer researchers and bioinformaticians to search and download cancer data for analysis. This dataset is a filtered search result in the GDC Data Portal for TCGA Project, Adenocarcinoma, Whole Genome Sequencing Reads. It consists of 196 BAM files and 99 cases.
Whole genome sequencing, replicate 3
figshare.com
bin
Updated Mar 12, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Marcus Høy Hansen; Charlotte Guldborg Nyvold (2021). Whole genome sequencing, replicate 3 [Dataset]. http://doi.org/10.6084/m9.figshare.14198591.v1
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.14198591.v1
Dataset updated
Mar 12, 2021
Dataset provided by
Figsharehttp://figshare.com/
figshare
Authors
Marcus Høy Hansen; Charlotte Guldborg Nyvold
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Whole-genome sequencing of caucasian male. Replicate 3
f
Whole-genome sequencing data statistics.
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated May 21, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Estivill, Xavier; Escaramís, Geòrgia; Cáceres, Mario; Rabionet, Raquel; Gut, Marta; Martínez-Fundichely, Alexander; Ossowski, Stephan; Bassaganyas, Laia; Tubio, Jose M. C.; Tornador, Cristian (2013). Whole-genome sequencing data statistics. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001646407
Explore at:
Dataset updated
May 21, 2013
Authors
Estivill, Xavier; Escaramís, Geòrgia; Cáceres, Mario; Rabionet, Raquel; Gut, Marta; Martínez-Fundichely, Alexander; Ossowski, Stephan; Bassaganyas, Laia; Tubio, Jose M. C.; Tornador, Cristian
Description
Whole-genome sequencing data statistics.
E
shallow whole genome sequencing dataset
ega-archive.org
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
shallow whole genome sequencing dataset [Dataset]. https://ega-archive.org/datasets/EGAD50000000535
Explore at:
License
https://ega-archive.org/dacs/EGAC50000000227https://ega-archive.org/dacs/EGAC50000000227
Description
This dataset comes from shallow whole genome sequencing data of STIC project
f
Data from: Whole-Genome Sequencing of the World’s Oldest People
datasetcatalog.nlm.nih.gov
figshare.com
+1more
Updated Nov 12, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Roach, Jared C.; Smith, Justin D.; Coles, L. Stephen; Fortney, Kristen; Markov, Glenn J.; Gierman, Hinco J.; Li, Hong; Kim, Stuart K.; Coles, Natalie S.; Hood, Leroy; Glusman, Gustavo (2014). Whole-Genome Sequencing of the World’s Oldest People [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001221461
Explore at:
Dataset updated
Nov 12, 2014
Authors
Roach, Jared C.; Smith, Justin D.; Coles, L. Stephen; Fortney, Kristen; Markov, Glenn J.; Gierman, Hinco J.; Li, Hong; Kim, Stuart K.; Coles, Natalie S.; Hood, Leroy; Glusman, Gustavo
Area covered
World
Description
Supercentenarians (110 years or older) are the world’s oldest people. Seventy four are alive worldwide, with twenty two in the United States. We performed whole-genome sequencing on 17 supercentenarians to explore the genetic basis underlying extreme human longevity. We found no significant evidence of enrichment for a single rare protein-altering variant or for a gene harboring different rare protein altering variants in supercentenarian compared to control genomes. We followed up on the gene most enriched for rare protein-altering variants in our cohort of supercentenarians, TSHZ3, by sequencing it in a second cohort of 99 long-lived individuals but did not find a significant enrichment. The genome of one supercentenarian had a pathogenic mutation in DSC2, known to predispose to arrhythmogenic right ventricular cardiomyopathy, which is recommended to be reported to this individual as an incidental finding according to a recent position statement by the American College of Medical Genetics and Genomics. Even with this pathogenic mutation, the proband lived to over 110 years. The entire list of rare protein-altering variants and DNA sequence of all 17 supercentenarian genomes is available as a resource to assist the discovery of the genetic basis of extreme longevity in future studies.
U
Whole genome sequencing of three North American large-bodied birds
data.usgs.gov
datasets.ai
+2more
Updated Dec 13, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Robert Cornman; Jennifer Fike; Sara Oyler-McCance (2023). Whole genome sequencing of three North American large-bodied birds [Dataset]. http://doi.org/10.5066/P9DK14PM
Explore at:
Unique identifier
https://doi.org/10.5066/P9DK14PM
Dataset updated
Dec 13, 2023
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Authors
Robert Cornman; Jennifer Fike; Sara Oyler-McCance
License
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
Time period covered
Jul 15, 2021
Area covered
United States
Description
The data release details the samples, methods, and raw data used to generate high-quality genome assemblies for greater sage-grouse (Centrocercus urophasianus), white-tailed ptarmigan (Lagopus leucura), and trumpeter swan (Cygnus buccinator). The raw data have been deposited in the Sequence Read Archive (SRA) of the National Center for Biotechnology Information (NCBI), the authoritative repository for public biological sequence data, and are not included in this data release. Instead, the accessions that link to those data via the NCBI portal (www.ncbi.nlm.nih.gov) are provided herein. The release consists of a single file, sample.metadata.txt, which maps NCBI accessions to the samples sequenced and the different types of sequencing performed to generate the assemblies and annotate their gene features.
n
Data from: Clinical Genomic Database
neuinfo.org
scicrunch.org
+2more
Updated Sep 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Clinical Genomic Database [Dataset]. http://identifiers.org/RRID:SCR_006427
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_006427
Dataset updated
Sep 23, 2024
Description
Manually curated database of all conditions with known genetic causes, focusing on medically significant genetic data with available interventions. Includes gene symbol, conditions, allelic conditions, inheritance, age in which interventions are indicated, clinical categorization, and general description of interventions/rationale. Contents are intended to describe types of interventions that might be considered. Includes only single gene alterations and does not include genetic associations or susceptibility factors related to more complex diseases.
E
Whole genome sequencing data of high-grade serous ovarian cancer samples...
ega-archive.org
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Whole genome sequencing data of high-grade serous ovarian cancer samples (set 16) [Dataset]. https://ega-archive.org/datasets/EGAD50000000777
Explore at:
License
https://ega-archive.org/dacs/EGAC00001001760https://ega-archive.org/dacs/EGAC00001001760
Description
The dataset contains whole genome sequencing data of 58 high-grade serous carcinoma (HGSC) patients sequenced with Novoseq 6000. The 144 samples are either fresh frozen tumour samples or blood samples. The files provided are paired fastq files.
G
Genomic Data Analysis Service Report
archivemarketresearch.com
doc, pdf, ppt
Updated Jan 19, 2026
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Archive Market Research (2026). Genomic Data Analysis Service Report [Dataset]. https://www.archivemarketresearch.com/reports/genomic-data-analysis-service-55807
Explore at:
pdf, ppt, docAvailable download formats
Dataset updated
Jan 19, 2026
Dataset authored and provided by
Archive Market Research
License
https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy
Time period covered
2026 - 2034
Area covered
Global
Variables measured
Market Size
Description
The global Genomic Data Analysis Service market is booming, projected to reach $4192.3 million in 2025, with a significant CAGR driving growth. Explore market trends, key players (Illumina, QIAGEN, BGI Genomics), and regional insights in this comprehensive analysis. Discover opportunities in whole genome & exome sequencing.
s
Long-read whole genome sequencing of human T cells
figshare.scilifelab.se
researchdata.se
+1more
Updated Jan 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Joanna Hård; Jakob Michaelsson (2025). Long-read whole genome sequencing of human T cells [Dataset]. http://doi.org/10.17044/scilifelab.22730684.v1
Explore at:
Unique identifier
https://doi.org/10.17044/scilifelab.22730684.v1
Dataset updated
Jan 15, 2025
Dataset provided by
Karolinska Institutet
Authors
Joanna Hård; Jakob Michaelsson
License
https://www.scilifelab.se/data/restricted-access/https://www.scilifelab.se/data/restricted-access/
Description
This dataset represent long read sequencing of single human T cells isolated from a human donor. The data set include Illumina whole genome sequencing of 16 single T cells and PacBio HiFi whole genome sequenicng of 5 single T cells
kraken2 database of marine animal genomes, for host decontamination
zenodo.org
application/gzip
Updated Dec 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Angelina Angelova; Angelina Angelova (2025). kraken2 database of marine animal genomes, for host decontamination [Dataset]. http://doi.org/10.5281/zenodo.17873185
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.17873185
Dataset updated
Dec 11, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Angelina Angelova; Angelina Angelova
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
kraken2 database of common marine animal hosts in marine metagenomic dataset. Used in Nephele pipelines for decontamination of metagenomic datasets from common marine animal host reads (database inclusive of human genome).

Content of assemblies:

Homo sapiens (GRCh38.p13),

Conus ventricosus (ASM1839881v1),

Crassostrea virginica (C_virginica-3.0) ,

Crassostrea gigas (cgigas_uk_roslin_v1),

Mytilus galloprovincialis (MGAL_10),

Octopus sinensis (ASM634580v1),

Paraescarpia echinospica (HKBU_Pec_v1),

Streblospio benedicti (ASM1909598v1),

Hyalella azteca (Hazt_2.0.2),

Amphibalanus amphitrite (NRLGWU_Aamphi_draft),

Paramacrobiotus sp. TYO (Prichtersi_v1.0),

Hypsibius dujardini (ASM157998v1),

Lytechinus pictus (UCSD_Lpic_2.0),

Strongylocentrotus purpuratus (Spur_5.0),

Apostichopus parvimensis (Ppar_1.0),

Hydra vulgaris (Hydra_105_v3),

Hydra viridissima (ASM1470644v1),

Pocillopora damicornis (ASM370409v1),

Amphimedon queenslandica (assembly v1.0)
E
Whole-genome sequencing
ega-archive.org
Updated May 16, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). Whole-genome sequencing [Dataset]. https://ega-archive.org/datasets/EGAD00001008778
Explore at:
Dataset updated
May 16, 2022
License
https://ega-archive.org/dacs/EGAC00001002682https://ega-archive.org/dacs/EGAC00001002682
Description
Whole-genome sequencing (WGS) data.
H
Bacterial whole genome sequencing data
dataverse.harvard.edu
search.dataone.org
Updated Feb 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Matthew Munneke (2025). Bacterial whole genome sequencing data [Dataset]. http://doi.org/10.7910/DVN/2ZMG77
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/2ZMG77
Dataset updated
Feb 2, 2025
Dataset provided by
Harvard Dataverse
Authors
Matthew Munneke
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
4-thiouracil suppressor bacterial whole genome sequencing files
f
Processed Whole Genome Sequencing Data (Goyal et. al)
datasetcatalog.nlm.nih.gov
Updated May 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Goyal, Yogesh (2023). Processed Whole Genome Sequencing Data (Goyal et. al) [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001060614
Explore at:
Dataset updated
May 28, 2023
Authors
Goyal, Yogesh
Description
GeoMx Whole Genome Sequencing processed datasets from Goyal et al.. All raw data from the Whole Genome Sequencing used in this manuscript can be found at BioProject Accession PRJNA972638.
f
Dataset Validation - Concordance Files
plus.figshare.com
zip
Updated May 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Renato Santos; Manuel Corpas (2024). Dataset Validation - Concordance Files [Dataset]. http://doi.org/10.25452/figshare.plus.21673739.v3
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.25452/figshare.plus.21673739.v3
Dataset updated
May 13, 2024
Dataset provided by
Figshare+
Authors
Renato Santos; Manuel Corpas
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This dataset contains concordance metrics between high and low-coverage VCF files of the IBS001 genome, belonging to an IBS (Iberian Populations in Spain) individual. This genome was sequenced at 40X coverage in both Illumina and MGI sequencing platforms, and then respectively downsampled to 1x coverage with samtools. Genotype likelihood calculations and calling was performed using bcftools, and imputation of the low-coverage genotypes was performed using GLIMPSE1.See related materials at: https://doi.org/10.25452/figshare.plus.c.6347534
d
Whole Genome Shotgun Submissions
catalog.data.gov
datadiscovery.nlm.nih.gov
+4more
Updated Jun 19, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Library of Medicine (2025). Whole Genome Shotgun Submissions [Dataset]. https://catalog.data.gov/dataset/whole-genome-shotgun-submissions
Explore at:
Dataset updated
Jun 19, 2025
Dataset provided by
National Library of Medicine
Description
Whole Genome Shotgun (WGS) projects are genome assemblies of incomplete genomes or incomplete chromosomes of prokaryotes or eukaryotes that are generally being sequenced by a whole genome shotgun strategy. WGS projects may be annotated, but annotation is not required. NCBI has a Prokaryotic Genomes Annotation Pipeline that may be requested at the time the genome files are submitted to GenBank. This pipeline generates a submission-ready annotated file that is posted back to the submitter for review and which the submitter could edit prior to data release. The public WGS projects are at the list of WGS projects. https://www.ncbi.nlm.nih.gov/Traces/wgs/

Facebook

Twitter

Click to copy link

Link copied

Cite

Qiu, Xunan; Zheng, Shuwen; Chen, Jijun; Guo, Rui; Yuan, Yuan; Ni, Chuxuan; Gong, Yuehua; Wang, Yingying; Yin, Honghao (2024). Reference to Whole-Genome Sequencing Data of Helicobacter pylori [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001434219

Reference to Whole-Genome Sequencing Data of Helicobacter pylori

Explore at:

Dataset updated

Aug 9, 2024

Authors

Qiu, Xunan; Zheng, Shuwen; Chen, Jijun; Guo, Rui; Yuan, Yuan; Ni, Chuxuan; Gong, Yuehua; Wang, Yingying; Yin, Honghao

Description

This dataset comprises whole-genome sequencing data for Helicobacter pylori (H. pylori), collected from publicly available databases, including NCBI (https://www.ncbi.nlm.nih.gov/) and BV-BRC(https://www.bv-brc.org/). The data was originally generated and submitted to these databases as part of various studies focused on understandingantibiotic resistance of H. pylori.For further information on the studies that contributed to this dataset, please refer to the original research publications("Early genetic diagnosis of clarithromycin resistance in Helicobacter pylori", "Helicobacter pylori Infections in the Bronx, New York: Surveying Antibiotic Susceptibility and Strain Lineage by Whole-Genome Sequencing", "Helicobacter pylori Antimicrobial Resistance and Gene Variants in High- and Low-Gastric-Cancer-Risk Populations", "A Survey of Helicobacter pylori Antibiotic-Resistant Genotypes and Strain Lineages by Whole-Genome Sequencing in China", "Multiple Genome Sequences of Helicobacter pylori Strains of Diverse Disease and Antibiotic Resistance Backgrounds from Malaysia", "Long-Read- and Short-Read-Based Whole-Genome SequencingReveals the Antibiotic Resistance Pattern of Helicobacter pylori", "Antimicrobial resistance patterns and genetic elements associated with the antibiotic resistance of Helicobacter pylori strains from Shanghai") and the associated GenBank accessions, which can be found in the file GenBank Accessions.txt. This file provides a detailed list of the accession numbers, allowing you to access the specific genetic sequences and related data used in this study.

Clear search

Close search

Google apps

Main menu

Reference to Whole-Genome Sequencing Data of Helicobacter pylori

Low-coverage Whole Genome Sequencing (LCWGS) of DNA Sequences from...

Data from: Whole-genome sequencing approaches for conservation biology:...

Data from: Whole-genome sequence data and analysis of a Staphylococcus...

A collection of Whole-genome sequencing files from the Cancer Genome Atlas...

Whole genome sequencing, replicate 3

Whole-genome sequencing data statistics.

shallow whole genome sequencing dataset

Data from: Whole-Genome Sequencing of the World’s Oldest People

Whole genome sequencing of three North American large-bodied birds

Data from: Clinical Genomic Database

Whole genome sequencing data of high-grade serous ovarian cancer samples...

Genomic Data Analysis Service Report

Long-read whole genome sequencing of human T cells

kraken2 database of marine animal genomes, for host decontamination

Whole-genome sequencing

Bacterial whole genome sequencing data

Processed Whole Genome Sequencing Data (Goyal et. al)

Dataset Validation - Concordance Files

Whole Genome Shotgun Submissions

Reference to Whole-Genome Sequencing Data of Helicobacter pylori