Figure 3From: PhyloFacts: an online structural phylogenomic encyclopedia for protein functional and structural classificationPhyloFacts whole-genome library construction pipeline. This figure represents our protocol for building global homology group protein family books. The pipeline starts with clustering a target genome into global homology groups (GHGs; sequences sharing the same overall domain structure), and proceeding through various stages of cluster expansion, multiple sequence alignment, phylogenetic tree construction, retrieval of experimental data, a variety of bioinformatics methods for predicting functional subfamilies, key residues, cellular localization, and so on, and quality control assessment.Back to article page