Institute of Bioinformatics Münster
Domain annotation
Detection and annotation of Protein domains with Interpro scan and Pfam scan

Summary

Class Number of Sequences
Pfam Domain 9,620
Interpro Domain 9,160
GO annotation 6,020
Total Proteins containing Domains 9,620

Description

Protein domains are highly conserved functional units of proteins. Protein domains occur once or several times or none at all in proteins. Because of their conserved characteristic, they are used to study the evolutionary history of proteins. Changes like the addition or the deletion of domains in proteins give hints of evolutionary events. This could lead to changes in the function and/or the structure of the proteins. Pfam and Insterpro are two big databases which contain protein domain data. Pfam scan and Interpro scan are search programs which detect protein domains in protein sequence data. The detected domains refer to existing proteins which are inside the protein predictions and to their function.

Results

Interproscan output file

Download Interpro scan annotation [Interpro scan output gff]

Classified Proteins with Interproscan prediction

Domain annotation
Figure 1: Distribution of protein domain findings in classified proteins.

Non-classified Proteins with Interproscan prediction

Domain annotation
Figure 2: Distribution of protein domain findings in Non-classified proteins.
2020-11-18 22:08