This is a selection of 70M antibody sequences from the OAS database. OAS was first filtered using the criteria described by Leem et al. (Deciphering the language of antibodies using self-supervised learning.) and then 70M sequences were randomly sampled from all repertoires.
license: cc-by-4.0
tags: - antibodies - biology - protein size_categories: - 10M<n<100M
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We used ABodyBuilder2 (https://doi.org/10.1038/s42003-023-04927-7) to model ~1.5M paired antibody structures from paired antibody sequences in Observed Antibody Space (https://opig.stats.ox.ac.uk/webapps/oas/oas_paired/). We have save the structures in folders and sub folders that correspond to the OAS files they came from. Parent folders are named according to study. Within each parent folder are sub folders names according to the files (named by SRA ID) containing sequences. Each structure is then named with the parent file followed by the row number from this file.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
OASis human 9-mer peptide database, generated from 118 million human antibody sequences from the Observed Antibody Space database.
Attached is a gzipped SQLite database containing two tables: "peptides" and "subjects".
Links:
BioPhi codebase and documentation: https://github.com/Merck/BioPhi
Public BioPhi server: https://biophi.dichlab.org
OAS Database: http://opig.stats.ox.ac.uk/webapps/oas/
Not seeing a result you expected?
Learn how you can add new datasets to our index.
This is a selection of 70M antibody sequences from the OAS database. OAS was first filtered using the criteria described by Leem et al. (Deciphering the language of antibodies using self-supervised learning.) and then 70M sequences were randomly sampled from all repertoires.
license: cc-by-4.0
tags: - antibodies - biology - protein size_categories: - 10M<n<100M