Figure 4From: A simple metric of promoter architecture robustly predicts expression breadth of human genes suggesting that most transcription factors are positive regulatorsA correlogram of eleven variables describing promoter architecture. In the correlogram, there are four measures of GC-content: GC-content in a 1 kb proximal promoter (GC), GC-content in a 20 kbps window around the promoter (GC_big), GC-content in a third codon position (GC3), and the frequency of CpG sites (CpG); there are also four measures describing the number of transcription factor binding sites in promoters: Tfbs1 (Tfbs_length – straight number of Tfbs), Tfbs2 (Tfbs_length_unique – the number of unique Tfbs), Tfbs3 (Tfbs_length_noPol2 – the number of Tfbs excluding PolII), and Tfbs4 (Tfbs_length_unique_noPol2 - the number of unique Tfbs excluding PolII), a measure of methylation (methyl), a signature of digestion by DNASE1 (DNASE1), and the breadth of expression (BoE).Back to article page