The first near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum

Common bread wheat, Triticum aestivum, has one of the most complex genomes known to science, with 6 copies of each chromosome, enormous numbers of near-identical sequences scattered throughout, and an overall haploid size of more than 15 billion bases. Multiple past attempts to assemble the genome have produced assemblies that were well short of the estimated genome size. Here we report the first near-complete assembly of T. aestivum, using deep sequencing coverage from a combination of short Illumina reads and very long Pacific Biosciences reads. The final assembly contains 15 344 693 583 bases and has a weighted average (N50) contig size of 232 659 bases. This represents by far the most complete and contiguous assembly of the wheat genome to date, providing a strong foundation for future genetic studies of this important food crop. We also report how we used the recently published genome of Aegilops tauschii, the diploid ancestor of the wheat D genome, to identify 4 179 762 575 bp of T. aestivum that correspond to its D genome components.

Data and Resources

Additional Info

Field Value
Author Zimin, Aleksey V
Last Updated November 20, 2019, 16:47 (UTC)
Created August 1, 2019, 10:30 (UTC)
Article Host Type publisher
Article Is Open Access true
Article License Type cc-by
Article Version Type publishedVersion
Citation Report https://scite.ai/reports/10.1093/gigascience/gix097
DOI 10.1093/gigascience/gix097
Date Last Updated 2019-06-14T17:38:54.416317
Evidence open (via page says license)
Funder code(s) National Institutes of Health (HG006677); US National Science Foundation (IOS-1238231, IOS-1444893)
Journal Is Open Access true
Open Access Status gold
PDF URL https://academic.oup.com/gigascience/article-pdf/6/11/gix097/25512454/gix097.pdf
Publisher URL https://doi.org/10.1093/gigascience/gix097