Triticum turgidum Svevo Assembly & Annotation

Overview

Analysis Name: Triticum turgidum Svevo Assembly & Annotation
Sequencing technology: Illumina, Hi-C
Assembly method: DenovoMAGIC2
Release Date: 2019-02-13
Reference Publication(s):

Maccaferri M, Harris NS, Twardziok SO, Pasam RK, Gundlach H, Spannagl M, Ormanbekova D, Lux T, Prade VM, Milner SG, Himmelbach A, Mascher M, Bagnaresi P, Faccioli P, Cozzi P, Lauria M, Lazzari B, Stella A, Manconi A, Gnocchi M, Moscatelli M, Avni R, Deek J, Biyiklioglu S, Frascaroli E, Corneti S, Salvi S, Sonnante G, Desiderio F, Marè C, Crosatti C, Mica E, Özkan H, Kilian B, De Vita P, Marone D, Joukhadar R, Mazzucotelli E, Nigro D, Gadaleta A, Chao S, Faris JD, Melo ATO, Pumphrey M, Pecchioni N, Milanesi L, Wiebe K, Ens J, MacLachlan RP, Clarke JM, Sharpe AG, Koh CS, Liang KYH, Taylor GJ, Knox R, Budak H, Mastrangelo AM, Xu SS, Stein N, Hale I, Distelfeld A, Hayden MJ, Tuberosa R, Walkowiak S, Mayer KFX, Ceriotti A, Pozniak CJ, Cattivelli L. Durum wheat genome highlights past domestication signatures and future improvement targets. Nat Genet. 2019 May;51(5):885-895. doi: 10.1038/s41588-019-0381-3.

Abstract

The domestication of wild emmer wheat led to the selection of modern durum wheat, grown mainly for pasta production. We describe the 10.45 gigabase (Gb) assembly of the genome of durum wheat cultivar Svevo. The assembly enabled genome-wide genetic diversity analyses revealing the changes imposed by thousands of years of empirical selection and breeding. Regions exhibiting strong signatures of genetic divergence associated with domestication and breeding were widespread in the genome with several major diversity losses in the pericentromeric regions. A locus on chromosome 5B carries a gene encoding a metal transporter (TdHMA3-B1) with a non-functional variant causing high accumulation of cadmium in grain. The high-cadmium allele, widespread among durum cultivars but undetected in wild emmer accessions, increased in frequency from domesticated emmer to modern durum wheat. The rapid cloning of TdHMA3-B1 rescues a wild beneficial allele and demonstrates the practical use of the Svevo genome for wheat improvement.

Assembly statistics

Genome size: 10 Gb
Total ungapped length: 9.8 Gb
Number of chromosomes: 14
Number of scaffolds: 14
Scaffold N50: 723 Mb
Scaffold L50: 7
Number of contigs: 330,244
Contig N50: 57.9 kb
Contig L50: 50,989
GC percent: 46
Genome coverage: 270.0x
Assembly level: Chromosome
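
The N50 and L50 values above follow the usual definitions: sort the sequences from longest to shortest and accumulate their lengths until at least half of the total assembled length is reached. N50 is the length of the sequence that crosses that threshold, and L50 is the number of sequences needed to get there. A minimal Python sketch of that calculation follows; the function name `n50_l50` and the toy lengths are illustrative assumptions, not values from the Svevo release.

```python
from typing import Iterable, Tuple


def n50_l50(lengths: Iterable[int]) -> Tuple[int, int]:
    """Return (N50, L50) for a collection of sequence lengths."""
    ordered = sorted(lengths, reverse=True)  # longest first
    if not ordered:
        raise ValueError("no sequence lengths given")
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            # This sequence is the one that crosses the halfway mark.
            return length, count
    raise AssertionError("unreachable: the running sum always reaches half")


# Toy lengths (hypothetical, for illustration only).
toy_lengths = [80, 70, 50, 40, 30, 20]
n50, l50 = n50_l50(toy_lengths)
print(f"N50={n50} L50={l50}")
```

Applied to scaffold lengths this gives the scaffold N50/L50, and applied to contig lengths it gives the contig N50/L50.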

Assembly

The Triticum turgidum Svevo Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_900231445.1_Svevo.v1_genomic.fna.gz
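
As a rough sketch of how the downloaded FASTA file could be checked against the assembly statistics, the following Python reads a FASTA stream line by line and reports, per sequence, the total length, the ungapped length (bases other than N) and the GC percentage. The function names are hypothetical, and computing GC over non-N bases only is an assumption; the source does not say how its GC figure was derived.

```python
import gzip
import io
from typing import Dict, TextIO, Tuple


def fasta_stats(handle: TextIO) -> Dict[str, Tuple[int, int, float]]:
    """Map sequence name -> (length, ungapped length, GC percent)."""
    counts: Dict[str, list] = {}
    current = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            fields = line[1:].split()
            current = fields[0] if fields else ""
            counts[current] = [0, 0, 0]  # length, G+C count, N count
        elif current is not None:
            seq = line.upper()
            counts[current][0] += len(seq)
            counts[current][1] += seq.count("G") + seq.count("C")
            counts[current][2] += seq.count("N")
    result = {}
    for name, (length, gc, n_count) in counts.items():
        ungapped = length - n_count
        gc_percent = 100.0 * gc / ungapped if ungapped else 0.0
        result[name] = (length, ungapped, gc_percent)
    return result


def fasta_stats_from_gz(path: str) -> Dict[str, Tuple[int, int, float]]:
    """Same as fasta_stats, for a gzip-compressed file on disk."""
    with gzip.open(path, "rt") as handle:
        return fasta_stats(handle)


# Small in-memory example (made-up sequences, not Svevo data).
demo = io.StringIO(">chrA demo\nACGTNN\nGGCC\n>chrB\nATAT\n")
demo_stats = fasta_stats(demo)
for name, (length, ungapped, gc_percent) in demo_stats.items():
    print(name, length, ungapped, round(gc_percent, 1))
```

For the real file, something like `fasta_stats_from_gz("GCA_900231445.1_Svevo.v1_genomic.fna.gz")` would be the entry point. Only running counts are kept in memory, which matters for chromosomes of several hundred megabases.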

Gene Predictions

The Triticum turgidum Svevo genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GCA_900231445.1_Svevo.v1_genomic.gff.gz
CDS sequences (FASTA file) Tt_cds.fa.gz
Protein sequences (FASTA file) Tt_pep.fa.gz
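
To illustrate how the GFF3 gene predictions might be summarised, the sketch below counts features of a given type per sequence. It relies only on the standard nine tab-separated GFF3 columns (seqid, source, type, start, end, score, strand, phase, attributes). That the Svevo file labels gene models with the type `gene` is an assumption, and the function name and demo records are hypothetical.

```python
import gzip
import io
from collections import Counter
from typing import TextIO


def count_features(handle: TextIO, feature_type: str = "gene") -> Counter:
    """Count GFF3 records of one feature type per seqid (column 1)."""
    counts: Counter = Counter()
    for line in handle:
        if line.startswith("##FASTA"):
            break  # an optional embedded FASTA section ends the annotation
        if line.startswith("#") or not line.strip():
            continue  # skip directives, comments and blank lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 9:
            continue  # not a well-formed GFF3 record
        if fields[2] == feature_type:
            counts[fields[0]] += 1
    return counts


def count_features_from_gz(path: str, feature_type: str = "gene") -> Counter:
    """Same as count_features, for a gzip-compressed GFF3 file on disk."""
    with gzip.open(path, "rt") as handle:
        return count_features(handle, feature_type)


# Made-up records for illustration; IDs and coordinates are not from the release.
demo_gff = io.StringIO(
    "##gff-version 3\n"
    "seq1\tdemo\tgene\t100\t900\t.\t+\t.\tID=g1\n"
    "seq1\tdemo\tmRNA\t100\t900\t.\t+\t.\tID=g1.1;Parent=g1\n"
    "seq1\tdemo\tgene\t2000\t2600\t.\t-\t.\tID=g2\n"
    "seq2\tdemo\tgene\t50\t400\t.\t+\t.\tID=g3\n"
)
demo_counts = count_features(demo_gff)
print(dict(demo_counts))
```

For the real file, `count_features_from_gz("GCA_900231445.1_Svevo.v1_genomic.gff.gz")` would be the equivalent call.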

Functional Analysis

Functional annotation for the Triticum turgidum Svevo genome is available for download below. The predicted proteins were analyzed with InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan Triticum_turgidum.Pfam.tsv.gz
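
The layout of the Pfam table is not described here. Assuming it follows the standard InterProScan TSV output (protein accession in column 1, analysis name in column 4, signature accession in column 5), a sketch for collecting the Pfam accessions assigned to each protein could look like this. The function name and the demo identifiers are hypothetical placeholders, and the column assumption should be checked against the actual file before use.

```python
import gzip
import io
from collections import defaultdict
from typing import Dict, Set, TextIO


def domains_by_protein(handle: TextIO, analysis: str = "Pfam") -> Dict[str, Set[str]]:
    """Group signature accessions by protein for one analysis (assumed layout)."""
    hits: Dict[str, Set[str]] = defaultdict(set)
    for line in handle:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 5:
            continue  # too short to carry the columns assumed above
        protein, source, signature = fields[0], fields[3], fields[4]
        if source == analysis:
            hits[protein].add(signature)
    return dict(hits)


def domains_from_gz(path: str, analysis: str = "Pfam") -> Dict[str, Set[str]]:
    """Same as domains_by_protein, for a gzip-compressed TSV on disk."""
    with gzip.open(path, "rt") as handle:
        return domains_by_protein(handle, analysis)


# Placeholder rows (protein names and accessions are invented for the demo).
demo_tsv = io.StringIO(
    "protA\tmd5a\t300\tPfam\tPF_DEMO1\tdemo domain\t10\t120\t1e-5\tT\t01-01-2020\n"
    "protA\tmd5a\t300\tPfam\tPF_DEMO2\tdemo domain\t150\t280\t2e-8\tT\t01-01-2020\n"
    "protB\tmd5b\t210\tOther\tX_DEMO\tdemo\t5\t60\t0.1\tT\t01-01-2020\n"
)
demo_hits = domains_by_protein(demo_tsv)
print({protein: sorted(ids) for protein, ids in demo_hits.items()})
```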

S genes

Summary

Query | Chromosome | Size (bp) | Coordinates | tBLASTn Hit | tBLASTn %ID | Domain
DUF247I-S1 | LT934111.1 | 585266722 | 80061907-80063502 | LpSDUF247-I_chromosome1 | | DUF247
DUF247I-S2 | LT934112.1 | 681112512 | 131222758-131224374 | LpSDUF247-I_chromosome1 | | DUF247
DUF247II-S1 | LT934111.1 | 585266722 | 78697474-78699093 | LpSDUF247-II_chromosome1 | | DUF247
DUF247II-S2Ψ | LT934112.1 | 681112512 | 130984358-130984540 | LpSDUF247-II_chromosome1 | | DUF247
HPS10-S1 | LT934111.1 | 585266722 | 80057717-80057840, 80057946-80058079 | LpsS_contig12948 | | -
HPS10-S2 | LT934112.1 | 681112512 | 130990724-130990827, 130990935-130991061 | LpsS_contig11029 | | -
DUF247I-Z1Ψ | LT934113.1 | 775448786 | 736756013-736756549 | LrDUF247I-Z | | DUF247
DUF247I-Z2 | LT934114.1 | 790338525 | 728194691-728196280 | LpZDUF247-I_chromosome2 | | DUF247
DUF247II-Z1Ψ | LT934113.1 | 775448786 | 736759800-736760549 | LrDUF247II-Z | | DUF247
DUF247II-Z2 | LT934114.1 | 790338525 | 728182201-728183874 | LrDUF247II-Z | | DUF247
HPS10-Z1 | LT934113.1 | 775448786 | 736757916-736758060, 736758184-736758293 | LpsZ_chromosome2 | | -
HPS10-Z2 | LT934114.1 | 790338525 | 728191843-728191969, 728192099-728192199 | AerianthaHPS10-Z | | -
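
The coordinate strings in the S gene summary can be turned into interval lists and used to slice a chromosome sequence. The sketch below assumes the coordinates are 1-based and inclusive and that comma-separated ranges are segments of the same gene; neither is stated in the source, and strand is not handled. The function names are hypothetical.

```python
from typing import List, Tuple

Interval = Tuple[int, int]


def parse_intervals(text: str) -> List[Interval]:
    """Parse 'start-end' ranges separated by commas into (start, end) pairs."""
    intervals = []
    for part in text.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, end_text = part.split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start after end in {part!r}")
        intervals.append((start, end))
    return intervals


def total_length(intervals: List[Interval]) -> int:
    """Summed length, treating both ends as inclusive (assumption)."""
    return sum(end - start + 1 for start, end in intervals)


def extract(sequence: str, intervals: List[Interval]) -> str:
    """Concatenate the listed segments of a sequence (1-based, inclusive)."""
    return "".join(sequence[start - 1:end] for start, end in intervals)


# Coordinates copied from the DUF247I-S1 and HPS10-S1 rows of the summary.
duf247_s1 = parse_intervals("80061907-80063502")
hps10_s1 = parse_intervals("80057717-80057840, 80057946-80058079")
print(total_length(duf247_s1), total_length(hps10_s1))

# Slicing demonstrated on a short made-up sequence.
print(extract("ACGTACGTAC", [(2, 4), (7, 8)]))
```

Under the inclusive assumption both example lengths are multiples of three, which is consistent with the ranges delimiting coding sequence, although that alone does not confirm the convention.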

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences