Skip to main content

Table 5 Proportion of reference taxon genes shared with sister taxon (that is, core gene set)

From: Community transcriptomics reveals universal patterns of protein sequence conservation in natural microbial communities

Taxona

Number of CDSb

Sister taxonc

Percentage of cored

Alpha Proteobacterium HIMB114

1,425

Pelagibacter ubique HTCC1062

63

Ca. Kuenenia stuttgartiensis

4,787

Planctomyces limnophilus DSM 3776

17

Nitrosopumilus maritimus

1,796

Cenarchaeum symbiosium

49

Ca. Pelagibacter sp. HTCC7211

1,447

Pelagibacter ubique HTCC1062

75

Ca. Pelagibacter ubique HTCC1002

1,423

Pelagibacter sp. HTCC7211

80

Ca. Pelagibacter ubique HTCC1062

1,354

Pelagibacter sp. HTCC7211

80

Prochlorococcus marinus AS9601

1,920

All Pro. strains

68

Prochlorococcus marinus CCMP1375

1,883

All Pro. strains

69

Prochlorococcus marinus MIT 9312

1,810

All Pro. strains

72

Prochlorococcus marinus MIT9301

1,906

All Pro. strains

67

Prochlorococcus marinus NATL1A

2,193

All Pro. strains

59

Prochlorococcus marinus NATL2A

2,162

All Pro. strains

59

Uncultured SUP05 cluster bacterium

1,456

Ca. Ruthia magnifica

52

Solibacter usitatus Ellin6076

7,826

Acidobacterium capsulatum ATCC 51196

22

Ca. Koribacter versatilis Ellin345

4,777

Acidobacterium capsulatum ATCC 51196

36

Acidobacterium capsulatum ATCC 51196

3,377

Solibacter usitatus Ellin6076

51

Bradyrhizobium japonicum USDA 110

8,317

Bradyrhizobium sp. BTAi1

49

Bacterium Ellin514

6,510

Verrucomicrobium spinosum DSM 4136

24

  1. aRepresentative taxon at high abundance in each sample. bNumber of CDS is the number of protein-coding genes in the sequenced reference genome of each taxon. cSister taxon used for identification of core genome (see main text). dPercentage of core is the percentage of protein-coding genes in each taxon that are shared with the sister taxon. CDS, coding sequence.