sequenceSimilarity.py
This filter returns entries that pass the sequence similarity search criteria. Searches protein and nucleic acid sequences using the BLAST. PSI-BLAST is used to find more distantly related protein sequences.
The E value, or Expect value, is a parameter that describes the number of hits one can expect to see just by chance when searching a database of a particular size. For example, an E value of one indicates that a result will contain one sequence with similar score simply by chance. The scoring takes chain length into consideration and therefore shorter sequences can have identical matches with high E value.
The Low Complexity filter masks low complexity regions in a sequence to filter out avoid spurious alignments.
Sequence Identity Cutoff (%) filter removes entries of low sequence similarity. The cutoff value is a percentage value between 0 to 100.
Note: sequences must be at least 12 residues long. For shorter sequences try the Sequence Motif Search.