Repeat associated mechanisms of genome evolution and function revealed by the Mus caroli and Mus pahari genomes

Understanding the mechanisms driving lineage-specific evolution in both primates and rodents has been hindered by the lack of sister clades with a similar phylogenetic structure having high-quality genome assemblies. Here, we have created chromosome-level assemblies of the Mus caroli and Mus pahari genomes. Together with the Mus musculus and Rattus norvegicus genomes, this set of rodent genomes is similar in divergence times to the Hominidae (human-chimpanzee-gorilla-orangutan). By comparing the evolutionary dynamics between the Muridae and Hominidae, we identified punctate events of chromosome reshuffling that shaped the ancestral karyotype of Mus musculus and Mus caroli between 3 and 6 million yr ago, but that are absent in the Hominidae. Hominidae show between four- and sevenfold lower rates of nucleotide change and feature turnover in both neutral and functional sequences, suggesting an underlying coherence to the Muridae acceleration. Our system of matched, high-quality genome assemblies revealed how specific classes of repeats can play lineage-specific roles in related species. Recent LINE activity has remodeled protein-coding loci to a greater extent across the Muridae than the Hominidae, with functional consequences at the species level such as reproductive isolation. Furthermore, we charted a Muridae-specific retrotransposon expansion at unprecedented resolution, revealing how a single nucleotide mutation transformed a specific SINE element into an active CTCF binding site carrier specifically in Mus caroli, which resulted in thousands of novel, species-specific CTCF binding sites. Our results show that the comparison of matched phylogenetic sets of genomes will be an increasingly powerful strategy for understanding mammalian biology.

Data and Resources

Additional Info

Field Value
Author Thybert, David
Last Updated November 20, 2019, 16:49 (UTC)
Created August 1, 2019, 10:29 (UTC)
Article Host Type publisher
Article Is Open Access true
Article License Type cc-by
Article Version Type publishedVersion
Citation Report https://scite.ai/reports/10.1101/gr.234096.117
DOI 10.1101/gr.234096.117
Date Last Updated 2019-06-06T11:50:36.430328
Evidence open (via page says license)
Funder code(s) Wellcome Trust (WT098051, WT202878/B/16/Z, WT202878/Z/16/Z, WT108749/Z/15/Z); National Human Genome Research Institute (U41HG007234); Cancer Research UK (20412); H2020 European Research Council (615584); Biotechnology and Biological Sciences Research Council (BB/N02317X/a); European Molecular Biology Laboratory (); European Community's Seventh Framework Programme (244356, FP7/2010-2014); European Union's Seventh Framework Programme (HEALTH-F4-2010-241504, FP7/2007–2013)
Journal Is Open Access false
Open Access Status hybrid
PDF URL http://genome.cshlp.org/content/28/4/448.full.pdf
Publisher URL https://doi.org/10.1101/gr.234096.117