Home Server Help Group
PURE [Prediction of Unassigned REgions] Server :

PURE - Prediction of Unassigned REgions is a bioinformatics protocol to identify putative domains in the Unassigned Regions[UR]. PURE is now available as a web server. Users can submit the sequence of unassigned regions or linker regions to predict domains encoded in them.


Protein domains are the structural and functional units of proteins. The ability to parse protein chains into different domains is important for protein classification and for understanding protein structure, function, and evolution. There are several computational methods are available to identify domains in the sequence. Domain finding algorithms often employ stringent thresholds to recognize sequence domains. The realization of additional domains can be tedious involving manual intervention but can lead to better understanding of overall biological function. In this context, the problem of the identifying new domains in the unassigned regions assumes a crucial importance.

PURE Server is a web server implimentation of the multi-step algorithm based on PURE method for the further examination of unassigned linker regions. Initially submitted sequence undergoes different automated filtering steps like length, coiled coiled, transmembrane and secondary structural content. In the next phase filtered sequences are fed to PSI-BLAST,CD-HIT and hmmpfam. We then integrate all the information and present the user predicted domain in the sequence.

Please click here for details about different input options and usage.


Chandrashekar, C., Manonmani, A., Babu, M. and Sowdhamini, R. (2006) Enhanced structure prediction of gene products containing adenylyl cyclase. In Silico Biol. 6 0033: http://www.bioinfo.de/isb/2006/06/0033/main.html

Ward JJ et al. (2004) Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. Journal of Molecular Biology, 337, 635-645.

Rune Linding et al. (2003) GlobPlot: exploring protein sequences for globularity and disorder. Nucleic Acid Res 31:13

Rice, P. et al. (2000) EMBOSS: The European Molecular Biology Open Software Suite. Trends in Genetics 16:276—277.

McGuffin, L. J., et al. (2000) The PSIPRED protein structure prediction server. Bioinformatics 16:404-405

Edouard de Castro et al. (2000) ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins. Nucleic Acids Res. 34:W362-W365

Altschul, S.F. et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res., 25, 3389-3402.

Brown, N.P. et al. (1998) MView: A Web compatible database search or multiple alignment viewer. Bioinformatics 14:380-381

Weizhong Li & Adam Godzik (2006) Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics, 22:1658-9

Eddy, S. (1998) Profile hidden markov models. Bioinformatics, 14, 755–763.

Park J, Teichmann SA, Hubbard T, Chothia C: (1997) Intermediate sequences increase the detection of homology between sequences. J Mol Biol., 273:349-354.

Stajich JE, et.al., (2002) The Bioperl toolkit: Perl modules for the life sciences. Genome Res, 12. 1611-1618.

Prof. R. Sowdhamini (mini@ncbs.res.in)
Dr. Bernard Offman
Chilamakuri Chandra Sekhar Reddy
Khader Shameer