Skip to main content

Table 1 Comparison of ReAnoCDS05 and Ensembl

From: Anopheles gambiae genome reannotation through synthesis of ab initioand comparative gene prediction algorithms

 

ReAnoCDS05

Ensembl

Total CDSs

31,254

16,148

Total exons

155,680

58,579

Average exons per CDS

4.98

3.62

CDS completion rate*

96%

37%

CDSs overlapped by cDNA pair contigs†

1,885

2,257

CDSs perfectly matched to cDNA pair contigs†

752

672

cDNA pair contigs not overlapping any CDSs†

1%

8%

Specificity (nucleotide)‡

0.96

0.99

Sensitivity (overall)§

99%

92%

Sensitivity (perfect)¶

45%

30%

Protein hits yielded by MS peptides

4,737

1,413

MS peptide missing rate¥

12%

62%

  1. *Proportion of CDSs with start and stop codon. †Paired end sequences of full-length cDNAs from [7]. ‡Calculation for ReAnoCDS05 described in Materials and methods, Ensembl value from [18]. §Proportion of cDNA pair contigs overlapping a CDS. ¶Proportion of overlapped CDSs precisely matching cDNA pair contig boundaries. ¥Proportion of mass spectrometry (MS) peptides failing to hit any protein in database.