Command line Training Set First Motif Summary of Motifs Termination Explanation


Search sequence databases for the best combined matches with these motifs using MAST.
Search sequence databases for all matches with these motifs using FIMO.
Find Genome Ontology terms associated with upstream sequences matching these motifs using GOMO.
Submit these motifs to BLOCKS multiple alignment processor.


MEME - Motif discovery tool

MEME version 4.2.0 (Release date: Thu Apr 9 11:15:56 PDT 2009)

For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.sdsc.edu.

This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.sdsc.edu.


REFERENCE

If you use this program in your research, please cite:

Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.



TRAINING SET

DATAFILE= tests/common/crp0.fasta
ALPHABET= ACGT
Sequence name Weight Length Sequence name Weight Length
ce1cg 1.0000 105 ara 1.0000 105
bglr1 1.0000 105 crp 1.0000 105
cya 1.0000 105 deop2 1.0000 105
gale 1.0000 105 ilv 1.0000 105
lac 1.0000 105 male 1.0000 105
malk 1.0000 105 malt 1.0000 105
ompa 1.0000 105 tnaa 1.0000 105
uxu1 1.0000 105 pbr322 1.0000 105
trn9cat 1.0000 105 tdc 1.0000 105

COMMAND LINE SUMMARY

This information can also be useful in the event you wish to report a
problem with the MEME software.

command: meme tests/common/crp0.fasta -dna -nmotifs 3 -revcomp 
model: mod= zoops nmotifs= 3 evt= inf
object function= E-value of product of p-values
width: minw= 8 maxw= 50 minic= 0.00
width: wg= 11 ws= 1 endgaps= yes
nsites: minsites= 2 maxsites= 18 wnsites= 0.8
theta: prob= 1 spmap= uni spfuzz= 0.5
em: prior= dirichlet b= 0.01 maxiter= 50
distance= 1e-05
data: n= 1890 N= 18
strands: + -
sample: seed= 0 seqfrac= 1
Letter frequencies in dataset:
A 0.304 C 0.196 G 0.196 T 0.304
Background letter frequencies (from dataset with add-one prior applied):
A 0.304 C 0.196 G 0.196 T 0.304

P N
MOTIF 1 width = 18 sites = 18 llr = 180 E-value = 1.1e-006

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
14.7 (bits)
Relative Entropy
14.4 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME STRAND START P-VALUE SITES
ara - 58 2.51e-07 TGGCATAGCA AAGTGTGACGCCGTGCAA ATAATCAATG
lac + 8 5.35e-07 AACGCAAT TAATGTGAGTTAGCTCAC TCATTAGGCA
malt + 40 8.61e-07 AAAGATTTGG AATTGTGACACAGTGCAA ATTCAGACAC
ilv - 42 1.69e-06 GCAAAGGGAA AATTGAGGGGTTGATCAC GTTTTGTACT
pbr322 - 56 2.85e-06 CTCCTTACGC ATCTGTGCGGTATTTCAC ACCGCATATG
deop2 + 59 2.85e-06 AGATTTCCTT AATTGTGATGTGTATCGA AGTGTGTTGC
uxu1 + 16 5.17e-06 AGAGTGAAAT TGTTGTGATGTGGTTAAC CCAATTAGAA
trn9cat + 83 5.69e-06 CTTTTGGCGA AAATGAGACGTTGATCGG CACG
ce1cg - 64 7.54e-06 GGACTTCCAT TTTTGTGAAAACGATCAA AAAAACAGTC
ompa + 47 9.04e-06 TTTTTTTCAT ATGCCTGACGGAGTTCAC ACTTGTAAGT
crp - 66 9.89e-06 TACTGCACGG TAATGTGACGTCCTTTGC ATACATGCAG
male + 13 1.29e-05 TTACCGCCAA TTCTGTAACAGAGATCAC ACAAAGCGAC
gale - 45 1.41e-05 AAGATGCGAA AAGTGTGACATGGAATAA ATTAGTGGAA
tdc - 81 1.53e-05 AACAGG ATATGTGCGACCACTCAC AAATTAACTT
malk + 60 1.67e-05 ATGTAAGGAA TTTCGTGATGTTGCTTGC AAAAATCGTG
cya + 49 1.81e-05 TCAATCAGCA AGGTGTTAAATTGATCAC GTTTTAGACC
tnaa + 70 2.73e-05 CTCCCCGAAC GATTGTGATTCGATTCAC ATTTAAACAA
bglr1 + 75 8.30e-05 CAAAGTTAAT AACTGTGAGCATGGTCAT ATTTTTATCA

Motif 1 block diagrams


Name
Lowest
p-value
Motifs
ara 2.51e-07

-1
lac 5.35e-07

+1
malt 8.61e-07

+1
ilv 1.69e-06

-1
pbr322 2.85e-06

-1
deop2 2.85e-06

+1
uxu1 5.17e-06

+1
trn9cat 5.69e-06

+1
ce1cg 7.54e-06

-1
ompa 9.04e-06

+1
crp 9.89e-06

-1
male 1.29e-05

+1
gale 1.41e-05

-1
tdc 1.53e-05

-1
malk 1.67e-05

+1
cya 1.81e-05

+1
tnaa 2.73e-05

+1
bglr1 8.30e-05

+1
SCALE
| | | | |
1 25 50 75 100

Motif 1 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 1 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 1 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 1 regular-expression

[AT][AT][TAG]TGTGA[CGT][GA][TC][ATCGC]G[AT]TC[AG][CA]

Time 1.75 secs.

P N
MOTIF 2 width = 8 sites = 2 llr = 24 E-value = 1.5e+004

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
15.0 (bits)
Relative Entropy
17.5 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME STRAND START P-VALUE SITES
ilv + 4 2.18e-06 GCTC CGGCGGGG TTTTTTGTTA
male + 40 5.56e-06 CACAAAGCGA CGGTGGGG CGTAGGGGCA

Motif 2 block diagrams


Name
Lowest
p-value
Motifs
ilv 2.18e-06

+2
male 5.56e-06

+2
SCALE
| | | | |
1 25 50 75 100

Motif 2 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 2 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 2 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 2 regular-expression

CGG[CT]GGGG

Time 2.94 secs.

P N
MOTIF 3 width = 11 sites = 2 llr = 31 E-value = 2.0e+004

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
20.0 (bits)
Relative Entropy
22.3 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME STRAND START P-VALUE SITES
lac - 32 7.89e-08 AAGTGTAAAG CCTGGGGTGCC TAATGAGTGA
trn9cat + 35 2.01e-07 ATAAATAAAT CCTGGTGTCCC TGTTGATACC

Motif 3 block diagrams


Name
Lowest
p-value
Motifs
lac 7.89e-08

-3
trn9cat 2.01e-07

+3
SCALE
| | | | |
1 25 50 75 100

Motif 3 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 3 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 3 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 3 regular-expression

CCTGG[GT]GT[CG]CC

Time 4.08 secs.

P N
SUMMARY OF MOTIFS


Combined block diagrams: non-overlapping sites with p-value < 0.0001


Name
Combined
p-value

Motifs
ce1cg 7.53e-03

-1
ara 9.02e-04

-1
bglr1 2.01e-01

+1
crp 3.77e-02

-1
cya 6.49e-02

+1
deop2 1.19e-02

+1
gale 6.15e-02

-1
ilv 4.50e-06

+2
-1
lac 1.93e-07

+1
-3
male 1.22e-04

+1
+2
malk 1.39e-02

+1
malt 5.01e-03

+1
ompa 3.94e-02

+1
tnaa 2.80e-02

-1
uxu1 1.70e-02

+1
pbr322 6.62e-03

-1
trn9cat 5.41e-06

+3
+1
tdc 6.53e-02

-1
SCALE
| | | | |
1 25 50 75 100

Motif summary in machine readable format.
Stopped because Stopped because nmotifs = 3 reached..



CPU: Macintosh.local


EXPLANATION OF MEME RESULTS


The MEME results consist of:

MOTIFS

For each motif that it discovers in the training set, MEME prints the following information: