[CSBG main page]

Resources for the annotation and prediction of protein function at the domain level

Most proteins, specially in eukaryotic organisms, comprise multiple domains, which can be seen as the structural, evolutionary and functional units of proteins. Nevertheless, almost all databases and resources for annotating and predicting protein molecular function assign functions to complete protein chains, without distinguishing the particular domain responsible or associated to a given molecular function. Along the time, we have developed the first automatic annotation of proteins at the structural domain level with GeneOntology “molecular function” (GO-MF) terms. That database of annotations, named SCOP2GO, was used for constructing profiles of domains with the same fold and the same function that can be used for matching regions of sequences against them. This is a way of assigning fold and GO-MF function to the domains of newly sequenced proteins. Additionally, the matching of residues of a query protein against conserved positions of the profiles can give clues on its possible functional sites. This methodology has been recently implemented in a web server, COPRED, where any user can paste a single sequence and inspect those predictions in an interactive interface.

More information and links

  • Daniel Lopez & Florencio Pazos. (2009). Gene ontology functional annotations at the structural domain level. Proteins. 76(3):598-607.
    [PubMed:19241468] [HTML] [PDF]
  • Daniel Lopez and Florencio Pazos (2013). Concomitant prediction of function and fold at the domain level with GO-based profiles. BMC Bioinformatics. 14(S3):S12.
    [PubMed:23514233] [HTML] [PDF]
  • Daniel Lopez and Florencio Pazos (2013). COPRED: prediction of fold, GO molecular function and functional residues at the domain level. Bioinformatics. 29(14):1811-1812.
    [PubMed:23720488] [HTML] [PDF]
  • Scop2go database
  • COPRED web server

  • © 2012, Computational Systems Biology Group. CNB-CSIC