Fig. 2

Comparison of Arabidopsis thaliana genome size estimations. Genome sizes of the A. thaliana accessions Col-0 (a) and Nd-1 (b) were predicted by MGSE, GenomeScope2, gce, and findGSE. Different MGSE approaches were evaluated, differing by the set of regions for the average coverage calculation (e.g. all genes) and the methods for the calculation of this value (mean/median). Multiple read data sets (n) were analyzed by each tool/approach to infer an average genome size given as median (m, yellow line) and mean (black triangles), transposable elements = TE, without = wo. The blue region in (a) shows the expected genome size range. It has the near complete assembly size of Col-0 [80] as the lower boundary and one of the largest reported assembly sizes of Arabidopsis thaliana [81] as the upper boundary (Additional File 8); The blue line in (b) represents the highest quality and largest reported assembly size for Nd-1 to date