abYbank / AbDb

Antibody data resources

AbDb

The AbDb data set, a compilation of antibodies extracted from the PDB with standard numbering schemes applied and redundancy information.

We have been performing a complete rewrite of the AbDb code as updating became impractical using the old code. Undfortunately this has taken a lot longer than we had hoped. Consequently the searchable data and split datasets available here here have not been updated since 26th July 2019!

Everything is now complete with the exception of populating the web interface. We are now supplying a snapshot of data from July 2024. This is provided as a ZIP file. For each Fv, the ZIP file contains files with extensions .kab (Kabat numbered), .cho (Chothia numbered), .mar (Martin numbered), and .faa (a FASTA sequence file). Filenames contain a P after the entry name (e.g. 1yqv_0P) to indicate a protein or peptide antigen, H to indicate a hapten, N to indicate a nucleotide. Some entries contain more than one of these (e.g. 1zea_0PH).

The ZIP file also contains the file AbClusters.txt which lists the redundant clusters. Each line represents a set of the same (redundant) antibody and contains an entry of the form:

      xxxx:yyyy

where xxxx is a comma-separated list of free antibodies and yyyy is a comma-separated list of bound antibodies. Either xxxx or yyyy may be missing if there are only unbound or only bound antibodies.

For example:

      3u36_0,3u36_3,3u36_2,3u36_1:

is a set of unbound antibodies having the same sequence (in this case from the same PDB file).


      :3u2s_1PH,3u4e_0PH,3u4e_1PH,3u2s_0PH

is a set of bound antibodies having the same sequence (in this case from two PDB files).


      1hzh_0,1hzh_1:2ny7_0PH,5vn8_2PH,5vn8_0PH,1n0x_0P,5vn8_1PH,3ru8_0P,8sxi_1P,1n0x_1P,8sxi_0P,8sxi_2P

is a set of two unbound antibodies (from one PDB file) and 10 bound antibodies (from 5 PDB files).

Search

Search by PDB code

PDB code:


Search by keyword

Antibody name:

Antigen name:

Search by Species

Antibody species:

Antigen species:

Download Datasets

Complete Antibodies (Light+Heavy Chains)

Complex with Antigen Type
Numbering Protein Non-protein Free antibodies Complete Dataset
Kabat All   /   NR All   /   NR All   /   NR All   /   NR
Individual Individual Individual Individual
Chothia All   /   NR All   /   NR All   /   NR All   /   NR
Individual Individual Individual Individual
Martin All   /   NR All   /   NR All   /   NR All   /   NR
Individual Individual Individual Individual
Redundant Antibody List Download Download Download Download
Free Antibody/Complex List Free Antibodies/Complexes

Light Chains (Bence Jones Proteins)

Complex with Antigen Type
Numbering Protein Non-protein Free antibodies Complete Dataset
Kabat All   /   NR All   /   NR All   /   NR All   /   NR
Individual Individual Individual Individual
Chothia All   /   NR All   /   NR All   /   NR All   /   NR
Individual Individual Individual Individual
Martin All   /   NR All   /   NR All   /   NR All   /   NR
Individual Individual Individual Individual
Redundant Antibody List Download Download Download Download
Free Antibody/Complex List Free Light/Complexed Light

Heavy Chains (Camelid Proteins)

Complex with Antigen Type
Numbering Protein Non-protein Free antibodies Complete Dataset
Kabat All   /   NR All   /   NR All   /   NR All   /   NR
Individual Individual Individual Individual
Chothia All   /   NR All   /   NR All   /   NR All   /   NR
Individual Individual Individual Individual
Martin All   /   NR All   /   NR All   /   NR All   /   NR
Individual Individual Individual Individual
Redundant Antibody List Download Download Download Download
Free Antibody/Complex List Free Heavy/Complexed Heavy

Database Statistics

Processed data

Database Statistics - Processed Data

Datasets Complex Type Processed PDB Files Resultant Antibodies Non-Redundant Antibodies
Complete Antibody Protein 1694 3000 1205
Non-protein 361 489 259
Free Antibody 915 1536 810
Complete Dataset 2894 5025 1924
Light Chains Protein 9 14 6
Non-protein 1 1 1
Light Only 97 236 61
Complete Dataset 107 251 66
Heavy Chains Protein 246 466 190
Non-protein 13 36 14
Heavy Only 102 198 102
Complete Dataset 347 700 277

Unprocessed data

Database Statistics - Unprocessed Data

The following statistics show the type and number of unprocessed antibody related structures that are not present in AbDb

Fc Fragments Numbering Failed Superseded Single Chain Antibody (scFVs)
239 168 13 102

List of antibodies that failed the numbering

List of antibodies bound to multiple antigens

List of single chain antibodies (scFVs)

List of chain mapping information for all structures in AbDb



Help to access AbDb web services can be found here !!!