Table 1 Input features to train CoreBoost

From: Boosting with stumps for predicting transcription start sites

Feature list             Details
-----------------------  ------------------------------------------------------------
Core promoter elements   Score of each core element
                         Weighted score of pairs between TATA, Inr, and CCAAT- and GC-box
TFBSs                    Weighted maximal scores for weight matrices from TRANSFAC and density of TFBS
Mechanical properties    Weighted energy/flexibility scores around position -25 and +1
                         Average energy/flexibility scores
                         Correlation with the empirical average energy/flexibility profile
Markovian score          Likelihood ratios from homogeneous third order Markov models
k-mer frequency          Frequency of 1- or 2-mers related to nucleotide G or C
  1. TFBS, transcription factor binding site.
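
As an illustration of the last row of the table, the following is a minimal sketch of how G/C-related 1-mer and 2-mer frequencies might be computed for a sequence. It is not CoreBoost's implementation: the function name `gc_kmer_frequencies`, the choice of k-mers (G, C, GG, GC, CG, CC), and the normalization by the number of overlapping windows are all assumptions made here for illustration, since the table does not specify them.

```python
def gc_kmer_frequencies(seq):
    """Return relative frequencies of G/C-only 1-mers and 2-mers in seq.

    Assumption: "related to nucleotide G or C" is read here as k-mers
    composed only of G and C; the source table does not define this.
    """
    seq = seq.upper()
    kmers = ["G", "C", "GG", "GC", "CG", "CC"]  # hypothetical feature set
    freqs = {}
    for kmer in kmers:
        k = len(kmer)
        n_windows = len(seq) - k + 1  # number of overlapping windows of length k
        if n_windows <= 0:
            # Sequence is shorter than the k-mer: define the frequency as 0.
            freqs[kmer] = 0.0
            continue
        count = sum(1 for i in range(n_windows) if seq[i:i + k] == kmer)
        freqs[kmer] = count / n_windows
    return freqs


if __name__ == "__main__":
    # Toy sequence, not real promoter data.
    for kmer, freq in gc_kmer_frequencies("ATGCGCGTTA").items():
        print(kmer, round(freq, 3))
```

Counting overlapping windows keeps the 1-mer and 2-mer frequencies on a comparable 0 to 1 scale; a different normalization would be equally plausible under the table's description.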