displays an aligned set of protein or a nucleic acid sequences in a style suitable for publication.
Here is a sample session with
Displays a multiple sequence alignment
Input sequence set: ~/align.pep
Output file [align.showalign]:
The sequence alignment to be displayed.
If you enter the
of a file, this program
the sequence details into that file.
If you give the number in the alignment or the name of a sequence, it is taken as the reference sequence. The reference sequence is always shown in full and is the one against which all the other sequences are compared. If this is qualifier is set to
, the consensus sequence is used as the reference sequence. By default, the consensus sequence is used as the reference sequence.
If this option is true, the reference sequence is displayed at the top and bottom of the alignment.
What to show.
Output order of the sequences.
If this option is true (when
is set to
and a residue is similar but not identical to the reference sequence residue), the residue case is changed to lowercase. If
is set to
, non-identical, non-similar residues are changed to lowercase. If
, no changes are made to the case of the residues on the basis of their similarity to the reference sequence.
This is the scoring matrix file used when comparing sequences. By default, it is the file
) or the file
(for nucleic sequences). These files are found in the
directory of the EMBOSS installation.
If this option is true, the consensus line is displayed at the bottom.
Regions to put in uppercase. If this is left blank, the sequence case is left alone. A set of
is specified by a set of pairs of positions. The
are integers. They are separated by any non-digit, non-alpha character. Examples of region specifications are:
If this option is true, a line giving the positions in the alignment is displayed every 10
above the alignment.
If this option is true, a ruler line marking every 5th and 10th character in the alignment is displayed.
Width of sequence to display.
This sets the length of the left-hand margin for sequence
. If the margin is set at
, no margin and no names are displayed. If the margin is set to a value less than the length of a sequence name, the sequence name is displayed truncated to the length of the margin. If the margin is set to
, the minimum margin width that allows all the sequence names to be displayed in full (plus a space at the end of the name) is automatically selected.
Use HTML formatting.
Regions to color if formatting for HTML. If this is left blank, the sequence is left alone. A set of regions is specified by a set of pairs of positions. The positions are integers. They are followed by any valid HTML font
. Examples of region specifications are:
24-45 blue 56-78 orange
1-100 green 120-156 red
A file of ranges to color (one range per line) can be specified as
Set a cut-off for the percentage of positive scoring matches below which there is no consensus. The default plurality is taken as 50% of the total weight of all sequences in the alignment.
Sets the threshold for the scores of the positive matches above which the consensus is is uppercase, and below which the consensus is in lowercase.
Provides the ability to set the required number of identities at a position for it to give a consensus. If this is set to 100%, only
of identities contribute to the consensus.