Fig. 2 | Genome Biology

Fig. 2

From: Widespread false gene gains caused by duplication errors in genome assemblies

The amount of false duplication and the factors that correlate with it. a Total assembly size and the proportion consisting of false duplications in the previous and VGP assemblies. False duplications were classified as heterotype or homotype. b Scheme of false duplications (FD) in the previous and VGP assemblies due to heterozygous alleles. Corrected FD are regions that are false duplications in the previous assembly but not in the VGP assembly. Correctly assembled regions have no false duplication in either the previous or the VGP assembly. Introduced FD are false duplications present in the VGP assembly that were not present in the previous assembly. c Heterozygosity of corrected FD, correctly assembled, and introduced FD regions, according to the VGP assembly haplotype data (***P < 0.001; two-sided t-test). Red dotted line, overall heterozygosity of the genome. d The proportion of erroneous k-mers in false duplications and correct regions of each assembly (***P < 0.001; two-sided t-test).
