Formula used to calculate I
, the Kullback-Leibler distance , where i is the position within the site, p
is the frequency of that base in the genome, and f
is the observed frequency of each base at that position (from the weight matrix). Values for p
were calculated from the percentage G+C content of the genome sequence.