FASTA identity calculation

Cameron c.r.dunn at bath.ac.uk
Thu Jun 24 05:31:25 EST 1999


Which is the more valid way to calculate percentage identity
between two sequences of different lengths?

Should the unmatched ends be included in the calculation?
GCG FASTA doesn't include them; the FASTA at www.ebi.ac.uk does.

In this case a researcher wants to compare the identities
of several similar (around 95% identical) DNA sequences which
have different lengths.
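
To make the difference concrete, here is a minimal Python sketch of
the two conventions as I understand them. It is not taken from either
program: the function name percent_identity, the include_ends flag and
the example sequences are all made up for illustration, and I am
assuming that "unmatched ends" means the overhanging columns at either
end of the alignment where one sequence has only gap characters.

```python
def percent_identity(aligned_a, aligned_b, include_ends):
    """Percent identity of two gapped alignment rows of equal length.

    '-' marks a gap. Hypothetical helper for illustration only; it is
    not the code used by GCG FASTA or by the EBI FASTA server.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("alignment rows must have equal length")

    columns = list(zip(aligned_a, aligned_b))

    if not include_ends:
        # Drop leading and trailing columns that contain a gap, i.e.
        # the overhanging ends where only one sequence is present.
        start = 0
        while start < len(columns) and "-" in columns[start]:
            start += 1
        end = len(columns)
        while end > start and "-" in columns[end - 1]:
            end -= 1
        columns = columns[start:end]

    if not columns:
        return 0.0

    # Internal gaps stay in the denominator but never count as matches.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


# Made-up example: a 20-base sequence aligned to a 16-base sequence
# with one mismatch inside the overlap.
seq_a = "ACGTACGTACGTACGTACGT"
seq_b = "--GTACGTTCGTACGTAC--"

print(percent_identity(seq_a, seq_b, include_ends=False))  # 93.75
print(percent_identity(seq_a, seq_b, include_ends=True))   # 75.0
```

The same alignment scores 15/16 over the overlap but only 15/20 when
the overhanging ends are counted, so the choice matters a lot when the
sequences being compared differ in length.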

Thanks in advance,

