ceqlogo

Usage: ceqlogo -i <filename> [options]

Examples:

Load all motifs within a MEME motif file and write to logo.eps in EPS format:
ceqlogo -i meme.motifs -o logo.eps -f EPS

Load second motif from each of two files and shift the first one:
ceqlogo -i2 meme1.motifs -s 3 -i2 meme2.motifs -o logo.eps -f EPS

Run a self-test:
ceqlogo -test

Description:

The ceqlogo program generates one or multiple, aligned logos in EPS or SVG format, based on a set of nucleotide or amino acid frequency matrices provided in the MEME file format. The code of ceqlogo is based on the Perl code of weblogo but is written in C and supports SVG output. Note that the program strives to use command line options similar to those of weblogo.

The letter stacks are calculated by the following equations found in Schneider and Stephens paper "Sequence Logos: A New Way to Display Consensus Sequences" and adapted from the weblogo documentation. The height of a letter is calculated as:

height(b,l) = f(b,l) * R(l)

where f(b,l) is the frequency of base or amino acid b at position l. The stack height R(l) is the amount of information present at position l and can be quantified as follows:

R(l) for amino acids = log(20) - (H(l) + e(n))

R(l) for nucleic acids = 2 - (H(l) + e(n))

where log is taken base 2, H(l) is the uncertainty at position l, and e(n) is the error correction factor for small sample sizes n. H(l) is computed as follows:

H(l) = - (Sum f(b,l) * log[ f(b,l) ])

where again, log is taken base 2. f(b,l) is the frequency of base b at position l. The sum is taken over all amino acids or bases.

The error correction factor e(n) is approximated by:

e(n) = (s-1) / (2 * ln 2 * n)

Input:

Output:

Options:

Options with arguments (all lower case):

Toggles (all upper case):

Known problems:


Author: Stefan Maetschke (s.maetschke@imb.uq.edu.au)