Treffer: A novel method for automatic functional annotation of proteins

Title:
A novel method for automatic functional annotation of proteins
Source:
Selection of papers presented at the German Conference on Bioinformatics (GCB'98, Cologne, Germany, October 1998Bioinformatics (Oxford. Print). 15(3):228-233
Publisher Information:
Oxford: Oxford University Press, 1999.
Publication Year:
1999
Physical Description:
print, 14 ref
Original Material:
INIST-CNRS
Document Type:
Konferenz Conference Paper
File Description:
text
Language:
English
Author Affiliations:
The EMBL Outstation - The European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
ISSN:
1367-4803
Rights:
Copyright 1999 INIST-CNRS
CC BY 4.0
Sauf mention contraire ci-dessus, le contenu de cette notice bibliographique peut être utilisé dans le cadre d’une licence CC BY 4.0 Inist-CNRS / Unless otherwise stated above, the content of this bibliographic record may be used under a CC BY 4.0 licence by Inist-CNRS / A menos que se haya señalado antes, el contenido de este registro bibliográfico puede ser utilizado al amparo de una licencia CC BY 4.0 Inist-CNRS
Notes:
Biological sciences. Generalities. Modelling. Methods

Generalities in biological sciences
Accession Number:
edscal.1831440
Database:
PASCAL Archive

Weitere Informationen

Motivation: To cope with the increasing amount ofsequence data, reliable automatic annotation tools are required. The TrEMBL database contains together with SWISS-PROT nearly all publicly available protein sequences, but in contrast to SWISS-PROT only limited functional annotation. To improve this situation, we had to develop a method of automatic annotation that produces highly reliable functional prediction using the language and the syntax of SWISS-PROT. Results: An algorithm was developed and successfully used for the automatic annotation ofa testset of unknown proteins. The predicted information included description, function, catalytic activity, cofactors, pathway, subcellular location, quaternary structure, similarity to other protein, active sites, and keywords. The algorithm showed a low coverage (10% ), but a high specificity and reliability.