Download
All sequences:
Total number of hits obtained using HMM, PSI-BLAST and interacting motif based PHI-BLAST.
Sequences having 100% sequence idendity to each other are considered to be redundant
and are filtered using cd-hit
Download sequences in fasta format --- allseqfasta.tar.gz
1.Use right click and save target as option.
2.How to untar : tar -xvzf allseqfasta.tar.gz
Pruned sequences :
No of sequences in a superfamily after subjected to following procedure
Sequences having 100% sequence idendity to each other are considered to be redundant
and are filtered using cd-hit
Sequences having less than 40% of the query length are considerd as false positives
and are purged from the dataset.
Download pruned sequences in fasta format --- sfseq_fasta.tar.gz
1.Use right click and save target as option.
2.How to untar : tar -xvzf sfseq_fasta.tar.gz
Aligned sequences:
Download alignment for all superfamilies ---- sf_ali.tar.gz
Download alignment for all genomes --- genomes_ali.tar.gz