PURE [Prediction of Unassigned REgions] Server:
PURE - Prediction of Unassigned REgions is a method to identify putative domains in the Unassigned Regions[URs]. PURE is now available as a web server. Users can submit the sequence of unassigned regions / Linker regions to predict domains encoded in them.
FASTA format description
A sequence in FASTA format begins with a single-line description,
followed by lines of sequence data. The description line is
distinguished from the sequence data by a greater-than (">") symbol
in the first column. It is recommended that all lines of text be
shorter than 80 characters in length. An example sequence in FASTA
format is:
>NP_852891.1.1-130
MCITLHEAQFFKRLKKTYMVQTIGRRKTQSFEYLKNKWGSSNKSAIEVIQITMDNIENSL
LFKDLRTLVYTSNIVENYNSVIGTFLPAKKSFNNINQFLLDLYVHFGNNPRYKKLNQKTR
RVRNWYRIYE
Sequences are expected to be represented in the standard
IUB/IUPAC amino acid with these
exceptions: lower-case letters are accepted and are mapped
into upper-case; a single hyphen or dash can be used to represent
a gap of indeterminate length; and in amino acid sequences,
are acceptable letters (see below). Before submitting a request,
any numerical digits in the query sequence should either be
removed or replaced by appropriate letter codes (e.g.,
X for unknown amino acid residue).
For those programs that use amino acid query sequences (BLASTP
,BLASTPGP), the accepted amino acid codes are:
A alanine P proline
B aspartate or asparagine Q glutamine
C cystine R arginine
D aspartate S serine
E glutamate T threonine
F phenylalanine U selenocysteine
G glycine V valine
H histidine W tryptophan
I isoleucine Y tyrosine
K lysine Z glutamate or glutamine
L leucine X any
M methionine
N asparagine
Contact:
Prof. R. Sowdhamini
(mini@ncbs.res.in)
Dr. Bernard Offman
Chilamakuri Chandra Sekhar Reddy
K. Shameer