The AntiBody Sequence Database is a public dataset for antibody sequence data. It provides unique identifiers for antibody sequences, including both immunoglobulin and single-chain variable fragment sequences. These are are critical for immunological studies, and allows users to search and retrieve antibody sequences based on sequence similarity and specificity, and other biological properties.
Tracks all antibody and nanobody related therapeutics recognized by World Health Organisation, and identifies any corresponding structures in Structural Antibody Database with near exact or exact variable domain sequence matches. Synchronized with SAbDab to update weekly, reflecting new Protein Data Bank entries and availability of new sequence data published by WHO.
Database containing all antibody structures available in the PDB, annotated and presented in consistent fashion.Each structure is annotated with number of properties including experimental details, antibody nomenclature (e.g. heavy-light pairings), curated affinity data and sequence annotations. You can use the database to inspect individual structures, create and download datasets for analysis, search the database for structures with similar sequences to your query, monitor the known structural repetoire of antibodies.
The aim of this site is to collect and to share experimental results on antibodies that would otherwise remain in laboratories, thus aiding researchers in selection and validation of antibodies.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The antibody repertoire is a critical component of the adaptive immune system and is believed to reflect an individual’s immune history and current immune status. Delineating the antibody repertoire has advanced our understanding of humoral immunity, facilitated antibody discovery, and showed great potential for improving the diagnosis and treatment of disease. However, no tool to date has effectively integrated big Rep-seq data and prior knowledge of functional antibodies to elucidate the remarkably diverse antibody repertoire. We developed a Rep-seq dataset Analysis Platform with an Integrated antibody Database (RAPID; https://rapid.zzhlab.org/), a free and web-based tool that allows researchers to process and analyse Rep-seq datasets. RAPID consolidates 521 WHO-recognized therapeutic antibodies, 88,059 antigen- or disease-specific antibodies, and 306 million clones extracted from 2,449 human IGH Rep-seq datasets generated from individuals with 29 different health conditions. RAPID also integrates a standardized Rep-seq dataset analysis pipeline to enable users to upload and analyse their datasets. In the process, users can also select set of existing repertoires for comparison. RAPID automatically annotates clones based on integrated therapeutic and known antibodies, and users can easily query antibodies or repertoires based on sequence or optional keywords. With its powerful analysis functions and rich set of antibody and antibody repertoire information, RAPID will benefit researchers in adaptive immune studies.
https://entrepot.recherche.data.gouv.fr/api/datasets/:persistentId/versions/3.1/customlicense?persistentId=doi:10.57745/DDLHWUhttps://entrepot.recherche.data.gouv.fr/api/datasets/:persistentId/versions/3.1/customlicense?persistentId=doi:10.57745/DDLHWU
Reproducibility data for the AntiBody Sequence Database (ABSD) article. This dataset contains the raw data (antibody sequences) extracted on June 20, 2024, from various databases, as well as the several scripts, to ensure the reproducibility of our results. External databases used: ABDB, AbPDB, CoV-AbDab, Genbank, IMGT, PDB, SACS, SAbDab, TheraSAbDab, UniProt, KABAT Scripts usage: each external database has a corresponding script to format all antibody sequences extracted from it. A last script enable merging all extracted antibody sequences while removing redundancy, standardizing and cleaning data.
A database of antibody structure containing sequences from Kabat, IMGT and the Protein Databank (PDB), as well as structure data from the PDB. It provides search of the sequence data on various criteria and display of results in different formats. For data from the PDB, sequence searches can be combined with structural constraints. For example, one can ask for all the antibodies with a 10-residue Kabat CDR-L1 with a serine at H23 and an arginine within 10A of H36. The site also has software for structure analysis and other information on antibody structure available.
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on May 4th,2023. It was integrated with Antibody Registry. The JCN antibody database is a listing of all antibodies used in JCN papers from 2006 onward. The catalog numbers and vendor information is included for all antibodies listed, and with a new collaboration with NIF''''s AntibodyRegistry, a unique identifier is also listed for each antibody. The Journal of Comparative Neurology requires rigorous characterization for all antibodies that are used in JCN papers. The antibodies in the The Journal of Comparative Neurology antibody database have in nearly all cases been described and characterized adequately according to the provided guidelines. This information can be used to identify a particular target immunohistochemically or to design an experiment using the antibody information. If you are looking for an antibody to identify a particular target immunohistochemically, this list is a good place to begin your search. We suggest you then look up the paper in which the antibody was used, to make sure that it will meet your needs and to verify its characterization. (The characterization of antibodies in JCN papers often goes well beyond the material published by the manufacturer, so that examining this information before you order an antibody can be very useful.) While we do not guarantee that these antibodies will identify only the intended target (that is a function of the actual experiment and controls), this is the most carefully verified list of antibodies that we are aware of, and we wanted to share this resource with our readers and authors.
The ZFIN Antibody Database is a database of zebrafish gene expression antibodies.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This repository includes SAAINT-parser-processed structure models and unprocessed PDB mmcifs for SAAINT-DB.
processed_pdb_models.tar.gz: the SAAINT-parser-processed structure models (in the PDB format)
unprocessed_mmcifs.tar.gz: the original, unprocessed PDB structures (in the mmCIF format)
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Bottom-up proteomics approaches rely on database searches that compare experimental values of peptides to theoretical values derived from protein sequences in a database. While the human body can produce millions of distinct antibodies, current databases for human antibodies such as UniProtKB are limited to only 1095 sequences (as of 2024 January). This limitation may hinder the identification of new antibodies using bottom-up proteomics. Therefore, extending the databases is an important task for discovering new antibodies.
Herein, we adopted extensive collection of antibody sequences from Observed Antibody Space for conducting efficient database searches in publicly available proteomics data with a focus on the SARS-CoV-2 disease. Thirty million heavy antibody sequences from 146 SARS-CoV-2 patients in the Observed Antibody Space were in silico digested to obtain 18 million unique peptides. These peptides were then used to create six databases (DB1-DB6) for bottom-up proteomics. We used those databases for searching antibody peptides in publicly available SARS-CoV-2 human plasma samples in the Proteomics Identification Database (PRIDE), and we consistently found new antibody peptides in those samples. The database searching task was done by using Fragpipe softwares.
Table 1. Information of databases. In addition to human SARS-CoV-2 antibody peptides, every database also contains human protein sequences from UniProt database and contaminants from cRAP database.
File | Database | Number of human SARS-CoV-2 antibody peptides | Number of covered antibodies |
DB1.fasta | DB1 | 100 | 1.28E7 |
DB2.fasta | DB2 | 1E3 | 1.93E7 |
DB3.fasta | DB3 | 1E4 | 2.40E7 |
DB4.fasta | DB4 | 1E5 | 2.66E7 |
DB5.fasta | DB5 | 1E6 | 2.83E7 |
DB6.fasta | DB6 | 1E7 | 3.01E7 |
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
OASis human 9-mer peptide database, generated from 118 million human antibody sequences from the Observed Antibody Space database.
Attached is a gzipped SQLite database containing two tables: "peptides" and "subjects".
Links:
BioPhi codebase and documentation: https://github.com/Merck/BioPhi
Public BioPhi server: https://biophi.dichlab.org
OAS Database: http://opig.stats.ox.ac.uk/webapps/oas/
Reproducibility data for the AntiBody Sequence Database (ABSD) article. This dataset contains the raw data (antibody sequences) extracted on June 20, 2024, from various databases, as well as the several scripts, to ensure the reproducibility of our results. External databases used: ABDB, AbPDB, CoV-AbDab, Genbank, IMGT, PDB, SACS, SAbDab, TheraSAbDab, UniProt, KABAT Scripts usage: each external database has a corresponding script to format all antibody sequences extracted from it. A last script enable merging all extracted antibody sequences while removing redundancy, standardizing and cleaning data.
Repository of sequenced antibodies, integrating curated information about antibody and its antigen with cross links to standardized databases of chemical and protein entities. Manually curated repository of sequenced antibodies, developed by Geneva Antibody Facility at University of Geneva, in collaboration with CALIPHO and Swiss Prot groups at SIB Swiss Institute of Bioinformatics. Database provides list of sequenced antibodies with their known targets. Each antibody is assigned unique ID number that can be used in academic publications to increase reproducibility of experiments.
THIS RESOURCE IS NO LONGER IN SERVICE, documented September 13, 2016. A searchable biotechnology database e-books with information on more than 9000 monoclonal antibodies. This database has antibodies produced for the diagnosis and therapy of human cancer, Alzheimer's disease, AIDS, and other diseases as well as for biomarker and proteomics research. Information such as antibody name, species, type, characteristics, antigen characteristics, and developer or distributor of antibody as well as mentions in journals, patents, abstracts and reports up until 2012 are included.
The Kabat Database determines the combining site of antibodies based on the available amino acid sequences. The precise delineation of complementarity determining regions (CDR) of both light and heavy chains provides the first example of how properly aligned sequences can be used to derive structural and functional information of biological macromolecules. The Kabat database now includes nucleotide sequences, sequences of T cell receptors for antigens (TCR), major histocompatibility complex (MHC) class I and II molecules, and other proteins of immunological interest. The Kabat Database searching and analysis tools package is an ASP.NET web-based portal containing lookup tools, sequence matching tools, alignment tools, length distribution tools, positional correlation tools and much more. The searching and analysis tools are custom made for the aligned data sets contained in both the SQL Server and ASCII text flat file formats. The searching and analysis tools may be run on a single PC workstation or in a distributed environment. The analysis tools are written in ASP.NET and C# and are available in Visual Studio .NET 2003/2005/2008 formats. The Kabat Database was initially started in 1970 to determine the combining site of antibodies based on the available amino acid sequences at that time. Bence Jones proteins, mostly from human, were aligned, using the now-known Kabat numbering system, and a quantitative measure, variability, was calculated for every position. Three peaks, at positions 24-34, 50-56 and 89-97, were identified and proposed to form the complementarity determining regions (CDR) of light chains. Subsequently, antibody heavy chain amino acid sequences were also aligned using a different numbering system, since the locations of their CDRs (31-35B, 50-65 and 95-102) are different from those of the light chains. CDRL1 starts right after the first invariant Cys 23 of light chains, while CDRH1 is eight amino acid residues away from the first invariant Cys 22 of heavy chains. During the past 30 years, the Kabat database has grown to include nucleotide sequences, sequences of T cell receptors for antigens (TCR), major histocompatibility complex (MHC) class I and II molecules and other proteins of immunological interest. It has been used extensively by immunologists to derive useful structural and functional information from the primary sequences of these proteins.
This is a database of in silico generated antibody-antigen bindings (159 antigens times 6.9 million CDRH3 murine sequences), as resource for benchmarking machine learning methods.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
AbRank is a large-scale benchmark and evaluation framework that reframes affinity prediction as a pairwise ranking problem. It aggregates over 380,000 binding assays from nine heterogeneous sources, spanning diverse antibodies, antigens, and experimental conditions, and introduces standardized data splits that systematically increase distribution shift, from local perturbations such as point mutations to broad generalization across novel antigens and antibodies. To ensure robust supervision, AbRank defines a 10-confident ranking framework by filtering out comparisons with marginal affinity differences, focusing training on pairs with at least an 10-fold difference in measured binding strength.
Serves as clearinghouse for monoclonal antibodies against SARS-CoV-2. Database will catalog contributed antibodies in searchable resource and provide interactive analysis tools for comparisons among them. Most potent antibodies will guide development of vaccines to stop current outbreak and protect against future pandemics.
Open-access database of antibodies against human proteins developed through collaboration between Antibodypedia AB and the Nature Publishing Group. It aims to provide the scientific community and antibody distributors alike with information on the effectiveness of specific antibodies in specific applications--to help scientists select the right antibody for the right application. Antibodypedia's mission is to promote the functional understanding of the human proteome and expedite analysis of potential biomarkers discovered through clinical efforts. To this end, they have developed an open-access, curated, searchable database containing annotated and scored affinity reagents to aid users in selecting antibodies tailored to specific biological and biomedical assays. They envisage Antibodypedia as a virtual repository of validated antibodies against all human, and ultimately most model-organism, proteins. Such a tool will be exploitable to identify affinity reagents to document protein expression patterns in normal and pathological states and to purify proteins alone and in complex for structural and functional analyses. They hope to promote characterization of the roles and interplay of proteins and complexes in human health and disease. They encourage commercial providers to submit information regarding their inventory of antibodies with links to quality control data. Independent users can submit their own application-specific experimental data using standard validation criteria (supportive or non-supportive) developed with the assistance of an international advisory board recruited from academic research institutions. Users can also comment on specific antibodies without submitting validation data.
The AntiBody Sequence Database is a public dataset for antibody sequence data. It provides unique identifiers for antibody sequences, including both immunoglobulin and single-chain variable fragment sequences. These are are critical for immunological studies, and allows users to search and retrieve antibody sequences based on sequence similarity and specificity, and other biological properties.