Matrix Details
The Matrix block provides the nucleotide frequencies observed in aligned binding sites of the corresponding transcription factor (or, more generally, in aligned sites of the described function). An additional column shows the IUPAC consensus string derived from the matrix according to the following rules, adapted from Cavener, 1987:
- A single nucleotide (A, C, G, T) is shown if its frequency is
at least 50% and at least twice that of the second most frequent
nucleotide.
- A double-degenerate code indicates that the corresponding two
nucleotides together occur in at least 75% of the underlying sequences
and the first rule does not apply. (W = A or T), (S = C or G),
(R = A or G), (Y = C or T), (K = G or T), (M = A or C).
- Usage of triple-degenerate codes is restricted to those
positions where one of the nucleotides does not occur at all in
the sequence set and none of the aforementioned rules applies.
(B = C, G or T), (D = A, G or T), (H = A, C or T), (V = A, C or G).
- All other frequency distributions are represented by the letter "N" (= A, C, G or T).
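
The rules above can be sketched in a few lines of Python for a single matrix position. This is a minimal illustration, not the implementation used to produce the consensus column. The function name `iupac_consensus` and the input format (a dict of counts keyed by nucleotide) are assumptions made for this example. It also assumes that the double-degenerate rule is tested on the two most frequent nucleotides, and that ties are broken alphabetically; the rules as stated do not specify tie handling.

```python
# Hypothetical helper: derives the IUPAC consensus letter for one matrix
# position from its nucleotide counts, following the four rules above.

# Double-degenerate codes, keyed by the pair of nucleotides they represent.
DOUBLE = {
    frozenset("AT"): "W", frozenset("CG"): "S", frozenset("AG"): "R",
    frozenset("CT"): "Y", frozenset("GT"): "K", frozenset("AC"): "M",
}

# Triple-degenerate codes, keyed by the one nucleotide that is absent.
TRIPLE = {"A": "B", "C": "D", "G": "H", "T": "V"}


def iupac_consensus(counts):
    """Return the consensus letter for one position.

    counts: dict mapping "A", "C", "G", "T" to observed frequencies
    (raw counts or percentages) at that position.
    """
    total = sum(counts[n] for n in "ACGT")
    if total == 0:
        return "N"  # assumption: an empty column carries no information

    # Rank nucleotides by frequency; sorted() is stable, so ties keep
    # alphabetical order (an assumption, see the lead-in).
    ranked = sorted("ACGT", key=lambda n: counts[n], reverse=True)
    top, second = ranked[0], ranked[1]

    # Rule 1: at least 50% and at least twice the runner-up.
    if 2 * counts[top] >= total and counts[top] >= 2 * counts[second]:
        return top

    # Rule 2: the two most frequent nucleotides together reach 75%.
    if 4 * (counts[top] + counts[second]) >= 3 * total:
        return DOUBLE[frozenset((top, second))]

    # Rule 3: exactly one nucleotide never occurs at this position.
    absent = [n for n in "ACGT" if counts[n] == 0]
    if len(absent) == 1:
        return TRIPLE[absent[0]]

    # Rule 4: everything else.
    return "N"


def consensus_string(matrix):
    """Join the per-position letters for a list of count dicts."""
    return "".join(iupac_consensus(row) for row in matrix)
```

For example, a position with counts A=4, C=3, G=3, T=0 fails the first two rules (A is below 50%, and A plus C is only 70%), but T is absent, so the third rule yields V.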