| Call Summary | Training Set | Constraints | Background | Models | PWM | Logo | Sites | Probs | Explanation |
COSMO version 1.0 (Release date: 4/11/06)
For further information on how to interpret this output, please access http://cosmoweb.berkeley.edu.
You can download these results in XML format here.
| Date job was run: | 13:36 Tue 04/11/2006 |
| Web page expires: | 13:36 Tue 04/18/2006 |
| Run time: | 5 minutes 15 seconds |
| Run on server: | biostat-05.berkeley.edu |
If you use this software in your research, please cite
O. Bembom, M.J. van der Laan (2006). Supervised detection of conserved motifs in DNA sequences with cosmo. UC Berkeley Division of Biostatistics Working Paper Series. Working Paper 209. http://www.bepress.com/ucbbiostat/paper209.
cosmo makes use of the donlp2() function by Peter Spellucci. The use of donlp2 must be acknowledged in any publication which contains results obtained with cosmo or parts of it. Citation of the author's name and netlib-source is suitable.
This information can also be useful in the event you wish to report a problem with the cosmo software
command: cosmo cosmo.seqs.13573 -minw 7 -maxw 12 -zoops -tcm -addfree -con cosmo.cons.13573 -xml -status
| Constraints | numConSets: | 2 | nonempty: | TRUE | ||
| criterion: | pwmCV | fold: | 2 | trunc: | 100 | |
| Distribution | OOPS: | FALSE | ZOOPS: | TRUE | TCM: | TRUE |
| approx: | OVER | cutfac: | 5 | |||
| criterion: | eval | fold: | 5 | trunc: | 100 | |
| Width | minw: | 7 | maxw: | 12 | ||
| criterion: | eval | fold: | 5 | trunc: | 100 | |
| NumSites | minsites: | 2 | maxsites: | 50 | ||
| criterion: | eval | fold: | 5 | trunc: | 100 | |
| Starts | type: | eval | number: | 5 | ||
| Background | source: | same | cvorder: | TRUE | fold: | 5 |
| Data | numseqs: | 20 | numnucs: | 4000 | rev: | TRUE |
| Sequence Name | Length | Sequence Name | Length |
|---|---|---|---|
| TestSeq0:138 | 200 | TestSeq1:144 | 200 |
| TestSeq2:95 | 200 | TestSeq3:168 | 200 |
| TestSeq4:8 | 200 | TestSeq5:51 | 200 |
| TestSeq6:44 | 200 | TestSeq7:128 | 200 |
| TestSeq8:73 | 200 | TestSeq9:116 | 200 |
| TestSeq10:3 | 200 | TestSeq11:175 | 200 |
| TestSeq12:161 | 200 | TestSeq13:147 | 200 |
| TestSeq14:88 | 200 | TestSeq15:12 | 200 |
| TestSeq16:6 | 200 | TestSeq17:130 | 200 |
| TestSeq18:28 | 200 | TestSeq19:121 | 200 |
| Original constraint file | Interpreted constraints |
|---|---|
| @ ConstraintSet 1 | @ Constraint set 1 |
| >IntervalSetup | >IntervalSetup |
| Length: 3 bp | Length: 3 bp |
| Length: variable | Length: variable |
| Length: 3 bp | Length: 3 bp |
| >IcBounds | >ICBounds |
| Interval: 2 | Interval: 1 |
| Bounds: 0 to 0.8 | Bounds: 1.000 to 2.000 |
| >IcBounds | >ICBounds |
| Interval: 1 | Interval: 2 |
| Bounds: 1.0 to 2.0 | Bounds: 0.000 to 0.800 |
| >Pal | >Palindrome |
| Intervals: 1 and 3 | Intervals: 1 and 3 |
| ErrorTol: 0.05 | ErrorTol: 0.050 |
| @ Constraint set 2 | |
| >IntervalSetup | |
| Length: variable |
Order of background Markov model chosen by likelihood-based CV: 1
Kullback-Leibler divergences for candidate orders 0 to 6:
| Order | KL divergence |
|---|---|
| 0 | 271.68 |
| 1 | 269.72 |
| 2 | 269.83 |
| 3 | 319.49 |
| 4 | 5783.6 |
| 5 | inf |
| 6 | inf |
Estimated transition matrix for order 0
| Prefix | A | C | G | T |
|---|---|---|---|---|
| - | 0.3043 | 0.1943 | 0.1870 | 0.3145 |
Estimated transition matrix for order 1
| Prefix | A | C | G | T |
|---|---|---|---|---|
| A | 0.2973 | 0.1949 | 0.1775 | 0.3303 |
| C | 0.2826 | 0.2116 | 0.2142 | 0.2916 |
| G | 0.2520 | 0.2278 | 0.1887 | 0.3315 |
| T | 0.3562 | 0.1629 | 0.1789 | 0.3019 |
| Constraints | Distribution | Width | wCrit | distCrit | conCrit |
|---|---|---|---|---|---|
| 1 | ZOOPS | 7 | 1.01e+03 | --- | --- |
| 1 | ZOOPS | 8 | 1.46e-06 | 1.46e-06 | 1.27 |
| 1 | ZOOPS | 9 | 17.6 | --- | --- |
| 1 | ZOOPS | 10 | 0.0404 | --- | --- |
| 1 | ZOOPS | 11 | 23.1 | --- | --- |
| 1 | ZOOPS | 12 | 78.9 | --- | --- |
| 1 | TCM | 7 | 111 | --- | --- |
| 1 | TCM | 8 | 1.62 | 1.62 | --- |
| 1 | TCM | 9 | 652 | --- | --- |
| 1 | TCM | 10 | 46.3 | --- | --- |
| 1 | TCM | 11 | 710 | --- | --- |
| 1 | TCM | 12 | 3.59e+03 | --- | --- |
| 2 | ZOOPS | 7 | 0.00147 | --- | --- |
| 2 | ZOOPS | 8 | 1.54e-06 | 1.54e-06 | 1.36 |
| 2 | ZOOPS | 9 | 1.07e-05 | --- | --- |
| 2 | ZOOPS | 10 | 4.6e-05 | --- | --- |
| 2 | ZOOPS | 11 | 0.000596 | --- | --- |
| 2 | ZOOPS | 12 | 0.0125 | --- | --- |
| 2 | TCM | 7 | 0.944 | --- | --- |
| 2 | TCM | 8 | 0.0223 | 0.0223 | --- |
| 2 | TCM | 9 | 0.324 | --- | --- |
| 2 | TCM | 10 | 0.353 | --- | --- |
| 2 | TCM | 11 | 1.36 | --- | --- |
| 2 | TCM | 12 | 22.6 | --- | --- |
| Parameter | Choice | Criterion | Criterion value |
|---|---|---|---|
| Constraints | 1 | PWM-based CV | 1.27 |
| Distribution | ZOOPS | E-value | 1.46e-06 |
| Width | 8 | E-value | 1.46e-06 |
| NumSites | 19 | E-value | 1.46e-06 |
| Nuc\Pos | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
|---|---|---|---|---|---|---|---|---|
| A | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| C | 0.950 | 0.178 | 0.780 | 0.153 | 0.044 | 0.270 | 0.872 | 0.000 |
| G | 0.050 | 0.822 | 0.220 | 0.577 | 0.484 | 0.730 | 0.128 | 1.000 |
| T | 0.000 | 0.000 | 0.000 | 0.269 | 0.472 | 0.000 | 0.000 | 0.000 |

| Sequence |   |   |   | Strand |   | Start |   | Prob |   |   |   |   | Site |   |   | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| TestSeq8:73 | + | 73 | 0.9988 | atctagacta | C | G | C | G | T | G | C | G | gtggtattga | |||||||||
| TestSeq3:168 | - | 168 | 0.9980 | agccgctaga | C | G | G | C | C | G | C | G | atatgatccc | |||||||||
| TestSeq13:147 | - | 147 | 0.9980 | agtcagccaa | C | G | C | C | A | G | C | G | cttggtattt | |||||||||
| TestSeq0:138 | - | 138 | 0.9977 | accgattccg | C | G | C | C | A | G | C | G | tatcgatact | |||||||||
| TestSeq5:51 | - | 51 | 0.9974 | gatgcttgca | C | G | C | C | C | C | C | G | tgcatatctg | |||||||||
| TestSeq4:8 | - | 8 | 0.9971 | ttccaat | C | G | C | A | C | C | C | G | tttttaacaa | |||||||||
| TestSeq10:3 | - | 3 | 0.9960 | ct | C | G | G | C | A | G | C | G | gttcagagta | |||||||||
| TestSeq7:128 | - | 128 | 0.9950 | atataatatc | C | G | C | A | G | G | C | G | tttaaccggc | |||||||||
| TestSeq12:161 | - | 161 | 0.9941 | ttgcctaact | C | G | G | A | C | G | G | G | gactcataaa | |||||||||
| TestSeq2:95 | + | 95 | 0.9930 | taacgcggta | C | C | C | C | G | G | C | G | atcacaaatt | |||||||||
| TestSeq18:28 | - | 28 | 0.9930 | gtatatacta | C | G | G | A | C | G | G | G | atactgtacc | |||||||||
| TestSeq14:88 | - | 88 | 0.9572 | ttgagaaggt | C | G | C | A | A | G | C | C | tcgtgtagta | |||||||||
| TestSeq17:130 | - | 130 | 0.9393 | ttttcttcga | C | G | C | C | C | G | G | C | aatgaatcat | |||||||||
| TestSeq15:12 | - | 12 | 0.9151 | cactgttttt | C | G | C | A | C | G | G | G | aaagggcctc | |||||||||
| TestSeq9:116 | + | 116 | 0.8329 | ccttgcattc | G | C | C | G | T | G | C | G | acccgcgcca | |||||||||
| TestSeq11:175 | + | 175 | 0.7721 | acgacagcat | G | C | C | C | T | G | C | G | atgtttgcga | |||||||||
| TestSeq16:6 | + | 6 | 0.7677 | ttatt | C | C | C | G | C | C | G | G | atcaggaatc | |||||||||
| TestSeq6:44 | - | 44 | 0.6169 | agttccgctg | C | G | G | G | C | G | C | G | tattagtgtc | |||||||||
| TestSeq1:144 | - | 144 | 0.4328 | atcaattatt | C | C | C | C | C | G | G | C | ccttttttct | |||||||||
@ Constraint set >IntervalSetup Length:variable