BLAST-OER

prev toc next

Comparing Sequences

There are several applications in biomedical science that need to compare DNA, RNA or protein sequences:

The first step of BLAST in any of these comparisons is to find the best alignment.

This sequence alignment is done by (pairwise) sequence alignment, defined as the process of arranging two sequences to achieve maximal levels of identity and conservation.

For conservation, the group to which an amino acid belongs is important (basic, acidic, uncharged polar, or nonpolar side chain), as well as the size of the amino acid’s side chain. Also, some amino acids have very unique properties, making them essential in certain positions in a protein (e.g. cysteine, to form sulphur bridges).

Homology is defined as similarity attributed to descent from a common ancestor. Sequences are either homologous or not (they cannot be somewhat homologous due to similar protein domain function). Orthologues are homologues between species (e.g. human and mouse haemoglobin alpha chain), paralogues are homologues within a species (e.g. human haemoglobin alpha and beta chain).

Using sequence alignment allows us to assess the degree of similarity and the possibility of homology.

Besides checking identity and conservation of residues, we also should be able to account for the occurrence of (small) insertions and deletions. This is done by the concept of a gap: when aligning two sequences, each of them can be broken apart, and some residues in the other sequence can be matched to empty spaces in that sequence.

Comparing protein or nucleotide sequences?

Alignment of protein sequences can be more informative than alignment of nucleotide sequences. This has several reasons:

So, protein sequences offer a longer “look-back” time.

This means that it can be more useful to use protein sequences than DNA or RNA sequences, especially when the expected divergence between the sequences is high (e.g. large evolutionary distance).

Alignment of nucleotide sequences is still useful, for example to:

prev toc next