3 datasets found
  1. h

    oas_70million_human

    • huggingface.co
    Updated May 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wiona Glänzer (2024). oas_70million_human [Dataset]. https://huggingface.co/datasets/WionaGlaenzer/oas_70million_human
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 31, 2024
    Authors
    Wiona Glänzer
    Description

    This is a selection of 70M antibody sequences from the OAS database. OAS was first filtered using the criteria described by Leem et al. (Deciphering the language of antibodies using self-supervised learning.) and then 70M sequences were randomly sampled from all repertoires.

      license: cc-by-4.0
    

    tags: - antibodies - biology - protein size_categories: - 10M<n<100M

  2. ABodyBuilder2 predicted structures of paired antibody sequences from...

    • zenodo.org
    application/gzip
    Updated Dec 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexander Greenshields-Watson; Alexander Greenshields-Watson; Brennan Abanades; Brennan Abanades (2023). ABodyBuilder2 predicted structures of paired antibody sequences from Observed Antibody Space. [Dataset]. http://doi.org/10.5281/zenodo.10280181
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Dec 11, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Alexander Greenshields-Watson; Alexander Greenshields-Watson; Brennan Abanades; Brennan Abanades
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We used ABodyBuilder2 (https://doi.org/10.1038/s42003-023-04927-7) to model ~1.5M paired antibody structures from paired antibody sequences in Observed Antibody Space (https://opig.stats.ox.ac.uk/webapps/oas/oas_paired/). We have save the structures in folders and sub folders that correspond to the OAS files they came from. Parent folders are named according to study. Within each parent folder are sub folders names according to the files (named by SRA ID) containing sequences. Each structure is then named with the parent file followed by the row number from this file.

  3. Z

    OASis peptide database

    • data.niaid.nih.gov
    Updated Aug 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Danny A. Bitton (2021). OASis peptide database [Dataset]. https://data.niaid.nih.gov/resources?id=ZENODO_5164684
    Explore at:
    Dataset updated
    Aug 7, 2021
    Dataset provided by
    Veronica Juan
    David Prihoda
    Daniel Svozil
    Danny A. Bitton
    Andrew Waight
    Jad Maamary
    Laurence Fayadat-Dilman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    OASis human 9-mer peptide database, generated from 118 million human antibody sequences from the Observed Antibody Space database.

    Attached is a gzipped SQLite database containing two tables: "peptides" and "subjects".

    Links:

    BioPhi codebase and documentation: https://github.com/Merck/BioPhi

    Public BioPhi server: https://biophi.dichlab.org

    OAS Database: http://opig.stats.ox.ac.uk/webapps/oas/

  4. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Wiona Glänzer (2024). oas_70million_human [Dataset]. https://huggingface.co/datasets/WionaGlaenzer/oas_70million_human

oas_70million_human

WionaGlaenzer/oas_70million_human

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 31, 2024
Authors
Wiona Glänzer
Description

This is a selection of 70M antibody sequences from the OAS database. OAS was first filtered using the criteria described by Leem et al. (Deciphering the language of antibodies using self-supervised learning.) and then 70M sequences were randomly sampled from all repertoires.

  license: cc-by-4.0

tags: - antibodies - biology - protein size_categories: - 10M<n<100M

Search
Clear search
Close search
Google apps
Main menu