100+ datasets found
  1. u

    Data from: CottonGen BLAST

    • agdatacommons.nal.usda.gov
    • s.cnmilf.com
    • +1more
    bin
    Updated Feb 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Taein Lee; Sook Jung; Ksenija Gasic; Todd Campbell; Jing Yu; Jodi Humann; Heidi Hough; Dorrie Main (2024). CottonGen BLAST [Dataset]. https://agdatacommons.nal.usda.gov/articles/dataset/CottonGen_BLAST/24853260
    Explore at:
    binAvailable download formats
    Dataset updated
    Feb 13, 2024
    Dataset provided by
    MainLab, Washington State University
    Authors
    Taein Lee; Sook Jung; Ksenija Gasic; Todd Campbell; Jing Yu; Jodi Humann; Heidi Hough; Dorrie Main
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    CottonGen offers BLAST with genome, transcriptome, peptide and marker sequence databases from Gossypium species. This can be done using nucleotide sequences or peptide sequences. BLAST functionality is similar to that on NCBI. BLAST Programs:

    blastn: Search a nucleotide database using a nucleotide query. blastx: Search protein database using a translated nucleotide query. tblastn: Search translated nucleotide database using a protein query.

    blastp: Search protein database using a protein query. Resources in this dataset:Resource Title: Website Pointer for CottonGen BLAST Search. File Name: Web Page, url: https://www.cottongen.org/blast CottonGen offers BLAST with genome, transcriptome, peptide and marker sequence databases from Gossypium species. This can be done using nucleotide sequences or peptide sequences. BLAST functionality is similar to that on NCBI. Enter or upload FASTA sequence(s) to query and select BLAST database.

    BLAST Programs:

    blastn: Search a nucleotide database using a nucleotide query. blastx: Search protein database using a translated nucleotide query. tblastn: Search translated nucleotide database using a protein query. blastp: Search protein database using a protein query.

  2. FASTA BLAST Databases - ykkd-gk2d - Archive Repository

    • healthdata.gov
    csv, xlsx, xml
    Updated Nov 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). FASTA BLAST Databases - ykkd-gk2d - Archive Repository [Dataset]. https://healthdata.gov/w/k93e-8e88/default?cur=aXzi9rdWmhY
    Explore at:
    xlsx, xml, csvAvailable download formats
    Dataset updated
    Nov 5, 2024
    Description

    This dataset tracks the updates made on the dataset "FASTA BLAST Databases" as a repository for previous versions of the data and metadata.

  3. December 2023 release of the databases for PaperBLAST, Curated BLAST for...

    • figshare.com
    application/gzip
    Updated Feb 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Morgan Price (2024). December 2023 release of the databases for PaperBLAST, Curated BLAST for Genomes, and SitesBLAST [Dataset]. http://doi.org/10.6084/m9.figshare.25254562.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Feb 20, 2024
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Morgan Price
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This includes the December 2023 release of three related tools: PaperBLAST, Curated BLAST for Genomes, and SitesBLAST. PaperBLAST links protein sequences to papers about them and to curated annotations. Curated BLAST for Genomes uses the characterized subset of PaperBLAST's database to find candidates with a function of interest within a genome. SitesBLAST links protein sequences to functional sites, such as catalytic residues or substrate-binding residues.Most of the files are gzipped:litsearch.db -- the sqlite3 database for PaperBLAST and SitesBLAST. Also see the schema.litsearch.faa -- sequences for all of the proteins mentioned in the Gene or CuratedGene tables, in fasta formatuniq.faa -- unique sequences from litsearch.faa, in fasta format. (Also see the SeqToDuplicate table.)stats -- some statistics on the PaperBLAST databasecurated.faa -- the curated subset of PaperBLAST's database, in fasta formathassites.faa -- sequences for all the proteins mentioned in the HasSites table, in fasta format.To run the tools from these downloaded databases, you need to gunzip the files and format the BLAST databases. For more information see here.References:PaperBLAST: Text Mining Papers for Information about Homologs (mSystems, 2017)Curated BLAST for Genomes (mSystems, 2020)Interactive Analysis of Functional Residues in Protein Families (mSystems, 2022)

  4. Custom blast databses

    • zenodo.org
    application/gzip
    Updated Oct 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oliver White; Oliver White (2023). Custom blast databses [Dataset]. http://doi.org/10.5281/zenodo.8424777
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Oct 10, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Oliver White; Oliver White
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Custom blast blast databases for refseq mitochndrial genomes and SILVA ribosomal sequences. Code used to generate these blast databses can be found on github (https://github.com/o-william-white/custom_blastdb).

  5. f

    Annotation of unigenes BLAST against four different databases.

    • datasetcatalog.nlm.nih.gov
    • figshare.com
    Updated Aug 22, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wang, Ling; Yin, Meichen; Lu, Hongzhao; Yang, Likai; Zhang, Tao (2018). Annotation of unigenes BLAST against four different databases. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000660011
    Explore at:
    Dataset updated
    Aug 22, 2018
    Authors
    Wang, Ling; Yin, Meichen; Lu, Hongzhao; Yang, Likai; Zhang, Tao
    Description

    Annotation of unigenes BLAST against four different databases.

  6. NCBI Virus BLAST Database

    • zenodo.org
    bin, txt
    Updated Oct 26, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Geoffrey Zahn; Geoffrey Zahn (2022). NCBI Virus BLAST Database [Dataset]. http://doi.org/10.5281/zenodo.7250500
    Explore at:
    bin, txtAvailable download formats
    Dataset updated
    Oct 26, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Geoffrey Zahn; Geoffrey Zahn
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Curated database of NCBI virus genomes, formatted for BLASTn

  7. f

    BLAST search of protein and nucleotide sequence databases at Pathema and...

    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nishant Singh; Alok Bhattacharya; Sudha Bhattacharya (2023). BLAST search of protein and nucleotide sequence databases at Pathema and AmoebaDB for meiotic- and HR-specific genes in E.invadens (Ei) and E.histolytica (Eh). [Dataset]. http://doi.org/10.1371/journal.pone.0074465.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Nishant Singh; Alok Bhattacharya; Sudha Bhattacharya
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Values shown were obtained with BLASTp. SPO11, DMC1 and MND1 are meiotic specific genes.

  8. Case study I: Fidelity of iBLAST in three consecutive time periods.

    • plos.figshare.com
    xls
    Updated Jun 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sajal Dash; Sarthok Rasique Rahman; Heather M. Hines; Wu-chun Feng (2023). Case study I: Fidelity of iBLAST in three consecutive time periods. [Dataset]. http://doi.org/10.1371/journal.pone.0249410.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 10, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Sajal Dash; Sarthok Rasique Rahman; Heather M. Hines; Wu-chun Feng
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    blastn search was performed on nucleotide sequence databases (nt). At any time instance, the Past database size is the size of the database from the previous time instance. The Present database size is the database size at the present time instance. Delta is the incremental database growth from the previous time instance to the current time instance. NCBI BLAST must be performed on the entire Present database size, while iBLAST only needs to be performed on Delta.

  9. n

    Antibiotic Resistance Genes Database

    • neuinfo.org
    • rrid.site
    • +2more
    Updated Mar 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Antibiotic Resistance Genes Database [Dataset]. http://identifiers.org/RRID:SCR_007040
    Explore at:
    Dataset updated
    Mar 19, 2024
    Description

    The goals of Antibiotic Resistance Genes Database (ARGB) are to provide a centralized compendium of information on antibiotic resistance, to facilitate the consistent annotation of resistance information in newly sequenced organisms, and also to facilitate the identification and characterization of new genes. ARGB contains six types of database groups: - Resistance Type: This database contains information, such as resistance profile, mechanism, requirement, epidemiology for each type. - Resistance Gene: This database contains information, such as resistance profile, resistance type, requirement, protein and DNA sequence for each gene.This database only includes NON-REDUNDANT, NON-VECTOR, COMPLETE genes. - Antibiotic: This database contains information, such as producer, action mechanism, resistance type, for each gene. - Resistance Gene(NonRD): This database contains the same information as Resistance Gene. It does NOT include NON-REDUNDANT, NON-VECTOR genes, but includes INCOMPLETE genes. - Resistance Gene(ALL): This database contains the same information as Resistance Gene. It includes all REDUNDANT, VECTOR AND INCOMPLETE genes. - Resistance Species: This database contains resistance profile and corresponding resistance genes for each species. Furthermore, ARDB also contians three types BLAST database: - Resistance Genes Complete: Contains only NON-REDUNDANT, NON-VECTOR, COMPLETE genes sequences. - Resistance Genes Non-redundant: Contains NON-REDUNDANT, NON-VECTOR, COMPLETE, INCOMPLETE genes sequences. - Resistance Genes All: Contains all REDUNDANT, VECTOR, COMPLETE, INCOMPLETE genes sequences. Lastly, ARDB provides four types of Analytical tools: - Normal BLAST: This function allows an user to input a DNA or protein sequence, and find similar DNA (Nucleotide BLAST) or protein (Protein BLAST) sequences using blastn, blastp, blastx, tblastn, tblastx - RPS BLAST: A web RPSBLAST (RPS BLAST) interface is provided to align a query sequence against the Position Specific Scoring Matrix (PSSM) for each type. Normally, this will give the same annotation information as using regular BLAST mentioned above. - Multiple Sequences BLAST (Genome Annotation): This function allows an user to annotate multiple (less than 5000) query sequences in FASTA format. - Mutation Resistance Identification: This function allows an user to identify mutations that will cause potential antibiotic resistance, for 12 genes (16S rRNA, 23S rRNA, gyrA, gyrB, parC, parE, rpoB, katG, pncA, embB, folP, dfr). ������ :Sponsors: ARDB is funded by Uniformed Services University of the Health Sciences, administered by the Henry Jackson Foundation. :

  10. z

    Vertebrata core nt BLAST Database

    • zenodo.org
    application/gzip
    Updated Jun 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexander Brown; Alexander Brown (2025). Vertebrata core nt BLAST Database [Dataset]. http://doi.org/10.5281/zenodo.15685806
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Jun 17, 2025
    Dataset provided by
    WSU
    Authors
    Alexander Brown; Alexander Brown
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    A BLAST database generated by downloading the NCBI's core nt database on 04/28/2025, extracting all the vertebrata sequences using the taxon id 7742, and then building a new BLAST database from the vertebrata sequences.

  11. PaperBLAST and SitesBLAST database from April 2022

    • figshare.com
    application/x-gzip
    Updated Jun 8, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Morgan Price (2022). PaperBLAST and SitesBLAST database from April 2022 [Dataset]. http://doi.org/10.6084/m9.figshare.20022590.v1
    Explore at:
    application/x-gzipAvailable download formats
    Dataset updated
    Jun 8, 2022
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Morgan Price
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Source code and data files for PaperBLAST, SitesBLAST, and Curated BLAST for Genomes. The database is as of April 2022. The code is as of June 2022.

    Exploding the code tarball will create a PaperBLAST/ directory. The code is in the cgi/ bin/ and lib/ subdirectories.

    To install the data, create a data subdirectory, and explode the PaperBLAST_Apr2022 tarball into that directory. This includes the SQLite database (litsearch.db), and two BLAST databases (uniq.faa for PaperBLAST and hassites.faa for SitesBLAST).

    For up to date code and databases, see https://github.com/morgannprice/PaperBLAST

  12. f

    The GenBank Non-Redundant Protein Sequence Database (NRDB)

    • fungidb.org
    • piroplasmadb.org
    • +1more
    Updated Aug 16, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). The GenBank Non-Redundant Protein Sequence Database (NRDB) [Dataset]. https://fungidb.org/fungidb/app/record/dataset/DS_a7163a9f0d
    Explore at:
    Dataset updated
    Aug 16, 2019
    Description

    The GenBank non-redundant protein sequence database (NRDB) is a component of the NCBI BLAST databases and contains entries from GenPept, Swissprot, PIR, PDF, PDB and NCBI RefSeq.

  13. f

    Blast database sequences

    • datasetcatalog.nlm.nih.gov
    • figshare.com
    Updated Jul 18, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhang, Chi (2020). Blast database sequences [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000547354
    Explore at:
    Dataset updated
    Jul 18, 2020
    Authors
    Zhang, Chi
    Description

    Sequences used as Blast database.

  14. u

    Data from: CottonGen CottonCyc Pathways Database

    • agdatacommons.nal.usda.gov
    • catalog.data.gov
    bin
    Updated Dec 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Taein Lee; Sook Jung; Ksenija Gasic; Todd Campbell; Jing Yu; Jodi Humann; Heidi Hough; Dorrie Main (2023). CottonGen CottonCyc Pathways Database [Dataset]. https://agdatacommons.nal.usda.gov/articles/dataset/CottonGen_CottonCyc_Pathways_Database/24853212
    Explore at:
    binAvailable download formats
    Dataset updated
    Dec 18, 2023
    Dataset provided by
    MainLab, Washington State University
    Authors
    Taein Lee; Sook Jung; Ksenija Gasic; Todd Campbell; Jing Yu; Jodi Humann; Heidi Hough; Dorrie Main
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    The CottonGen CottonCyc Pathways Database, part of CottonGen, supports searching and browsing the following CottonCyc databases:

    Cyc pathways for JGI v2.0 G. raimondii D5 genome assembly

    This Cyc database was constructed using PathwayTools version 20.0 using the gene models from the JGI v2.0 D5 genome assembly of Gossypium raimondii. There has been no manual curation of this Cyc database. Pathway predictions were made using PathwayTools and in-silico v2.1 annotations as provided by JGI.

    Cyc pathways for CGP-BGI v1.0 G. hirsutum AD1 genome assembly

    This Cyc database was constructed using PathwayTools version 20.0 using the gene models from the CGP-BGI v1.0 AD1 genome assembly of Gossypium hirsutum. There has been no manual curation of this Cyc database. Pathway predictions were made using PathwayTools and in-silico v1.0 annotations as provided by CGP-BGI. Search parameters include genes, proteins, RNAs, compounds, reactions, pathways, growth media, and BLAST search. Resources in this dataset:Resource Title: Website Pointer to CottonGen CottonCyc Pathways Database. File Name: Web Page, url: http://ptools.cottongen.org/

  15. f

    Blast hits of the Lymnaea TSA to different protein databases.

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Aug 1, 2012
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sadamoto, Hisayo; Toyota, Masao; Asakawa, Yoshinori; Kenmoku, Hiromichi; Takahashi, Hironobu; Okada, Taketo (2012). Blast hits of the Lymnaea TSA to different protein databases. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001161894
    Explore at:
    Dataset updated
    Aug 1, 2012
    Authors
    Sadamoto, Hisayo; Toyota, Masao; Asakawa, Yoshinori; Kenmoku, Hiromichi; Takahashi, Hironobu; Okada, Taketo
    Description

    Blast hits of the Lymnaea TSA to different protein databases.

  16. u

    Data from: CottonGen: Cotton Database Resources

    • agdatacommons.nal.usda.gov
    • datasetcatalog.nlm.nih.gov
    • +1more
    bin
    Updated Feb 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jing Yu; Sook Jung; Chun-Huai Cheng; Stephen P. Ficklin; Taein Lee; Ping Zheng; Don Jones; Richard G. Percy; Dorrie Main (2024). CottonGen: Cotton Database Resources [Dataset]. https://agdatacommons.nal.usda.gov/articles/dataset/CottonGen_Cotton_Database_Resources/24853203
    Explore at:
    binAvailable download formats
    Dataset updated
    Feb 13, 2024
    Dataset provided by
    MainLab, Washington State University
    Authors
    Jing Yu; Sook Jung; Chun-Huai Cheng; Stephen P. Ficklin; Taein Lee; Ping Zheng; Don Jones; Richard G. Percy; Dorrie Main
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    CottonGen (https://www.cottongen.org) is a curated and integrated web-based relational database providing access to publicly available genomic, genetic and breeding data to enable basic, translational and applied research in cotton. Built using the open-source Tripal database infrastructure, CottonGen supersedes CottonDB and the Cotton Marker Database, which includes sequences, genetic and physical maps, genotypic and phenotypic markers and polymorphisms, quantitative trait loci (QTLs), pathogens, germplasm collections and trait evaluations, pedigrees, and relevant bibliographic citations, with enhanced tools for easier data sharing, mining, visualization, and data retrieval of cotton research data. CottonGen contains annotated whole genome sequences, unigenes from expressed sequence tags (ESTs), markers, trait loci, genetic maps, genes, taxonomy, germplasm, publications and communication resources for the cotton community. Annotated whole genome sequences of Gossypium raimondii are available with aligned genetic markers and transcripts. These whole genome data can be accessed through genome pages, search tools and GBrowse, a popular genome browser. Most of the published cotton genetic maps can be viewed and compared using CMap, a comparative map viewer, and are searchable via map search tools. Search tools also exist for markers, quantitative trait loci (QTLs), germplasm, publications and trait evaluation data. CottonGen also provides online analysis tools such as NCBI BLAST and Batch BLAST. This project is funded/supported by Cotton Incorporated, the USDA-ARS Crop Germplasm Research Unit at College Station, TX, the Southern Association of Agricultural Experiment Station Directors, Bayer CropScience, Corteva/Agriscience, Dow/Phytogen, Monsanto, Washington State University, and NRSP10. Resources in this dataset:Resource Title: Website Pointer for CottonGen. File Name: Web Page, url: https://www.cottongen.org/ Genomic, Genetic and Breeding Resources for Cotton Research Discovery and Crop Improvement organized by :

    Species (Gossypium arboreum, barbadense, herbaceum, hirsutum, raimondii, others), Data (Contributors, Download, Submission, Community Projects, Archives, Cotton Trait Ontology, Nomenclatures, and links to Variety Testing Data and NCBISRA Datasets), Search options (Colleague, Genes and Transcripts, Genotype, Germplasm, Map, Markers, Publications, QTLs, Sequences, Trait Evaluation, MegaSearch), Tools (BIMS, BLAST+, CottonCyc, JBrowse, Map Viewer, Primer3, Sequence Retrieval, Synteny Viewer), International Cotton Genome Initiative (ICGI), and Help sources (User manual, FAQs).

    Also provides Quick Start links for Major Species and Tools.

  17. f

    Gene annotation and BLAST results against seven public databases.

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Dec 20, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhang, Zongwen; Liu, Jing; Wang, Tingting; Liu, Minxuan; Gao, Jia (2017). Gene annotation and BLAST results against seven public databases. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001811876
    Explore at:
    Dataset updated
    Dec 20, 2017
    Authors
    Zhang, Zongwen; Liu, Jing; Wang, Tingting; Liu, Minxuan; Gao, Jia
    Description

    Gene annotation and BLAST results against seven public databases.

  18. D1 - Reference databases used for blast queries

    • figshare.com
    txt
    Updated Aug 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Megan Sorensen (2023). D1 - Reference databases used for blast queries [Dataset]. http://doi.org/10.6084/m9.figshare.21611124.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Aug 2, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Megan Sorensen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The databases with representaive sequences used as the blast queries when searching for plastid 16S, rbcL and psbA, and the host 18S.

  19. DNA Methylase Finder databases

    • zenodo.org
    application/gzip
    Updated Jun 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael J Tisza; Michael J Tisza (2022). DNA Methylase Finder databases [Dataset]. http://doi.org/10.5281/zenodo.6647341
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Jun 16, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Michael J Tisza; Michael J Tisza
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Databases:

    cdd_plus_hmms: CDD HMM database plus custom DNA methylase domain HMMs

    methylase_hmms: custom DNA methylase domain HMMs

    restriction_enzyme_hmms: custom Restriction Enzyme domain HMMs

    specificity_subunit_hmms: custom Specificity Subunit domain HMMs

    subtype_hmms: DNA methylase subtype HMMs from oliveira et al. https://github.com/oliveira-lab/RMS

    motif_protein_blastp: protein blast database of REBASE DNA methylase genes with known motif specificity

  20. Data from: COInr a comprehensive, non-redundant COI database from NCBI-nt...

    • zenodo.org
    • data.niaid.nih.gov
    application/gzip
    Updated May 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Emese Meglecz; Emese Meglecz (2023). COInr a comprehensive, non-redundant COI database from NCBI-nt and BOLD [Dataset]. http://doi.org/10.5281/zenodo.6555985
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    May 5, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Emese Meglecz; Emese Meglecz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    COInr is a non-redundant, comprehensive database of COI sequences extracted from NCBI-nt and BOLD. It is not limited to a taxon, a gene region, or a taxonomic resolution. Sequences are dereplicated between databases and within taxa.

    Each taxon has a unique taxonomic Identifier (taxID), fundamental to avoid ambiguous associations of homonyms and synonyms in the source database. TaxIDs form a coherent hierarchical system fully compatible with the NCBI taxIDs allowing creating their full or ranked linages.

    COInr is a good starting point to create custom databases according to the users’ needs using mkCOInr scripts available at https://github.com/meglecz/mkCOInr
    It is possible to select/eliminate sequences for a list of taxa, select a specific gene region, select for minimum taxonomic resolution, add new custom sequences, and format the database for BLAST, QIIME, RDP classifiers.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Taein Lee; Sook Jung; Ksenija Gasic; Todd Campbell; Jing Yu; Jodi Humann; Heidi Hough; Dorrie Main (2024). CottonGen BLAST [Dataset]. https://agdatacommons.nal.usda.gov/articles/dataset/CottonGen_BLAST/24853260

Data from: CottonGen BLAST

Related Article
Explore at:
binAvailable download formats
Dataset updated
Feb 13, 2024
Dataset provided by
MainLab, Washington State University
Authors
Taein Lee; Sook Jung; Ksenija Gasic; Todd Campbell; Jing Yu; Jodi Humann; Heidi Hough; Dorrie Main
License

U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically

Description

CottonGen offers BLAST with genome, transcriptome, peptide and marker sequence databases from Gossypium species. This can be done using nucleotide sequences or peptide sequences. BLAST functionality is similar to that on NCBI. BLAST Programs:

blastn: Search a nucleotide database using a nucleotide query. blastx: Search protein database using a translated nucleotide query. tblastn: Search translated nucleotide database using a protein query.

blastp: Search protein database using a protein query. Resources in this dataset:Resource Title: Website Pointer for CottonGen BLAST Search. File Name: Web Page, url: https://www.cottongen.org/blast CottonGen offers BLAST with genome, transcriptome, peptide and marker sequence databases from Gossypium species. This can be done using nucleotide sequences or peptide sequences. BLAST functionality is similar to that on NCBI. Enter or upload FASTA sequence(s) to query and select BLAST database.

BLAST Programs:

blastn: Search a nucleotide database using a nucleotide query. blastx: Search protein database using a translated nucleotide query. tblastn: Search translated nucleotide database using a protein query. blastp: Search protein database using a protein query.

Search
Clear search
Close search
Google apps
Main menu