Analysis Name | Aegilops bicornis ASM2160514v1 Assembly & Annotation |
Sequencing technology | Oxford Nanopore PromethION |
Assembly method | Wtdbg2 v. 2.5 |
Release Date | 2022-01-26 |
Li LF, Zhang ZB, Wang ZH, Li N, Sha Y, Wang XF, Ding N, Li Y, Zhao J, Wu Y, Gong L, Mafessoni F, Levy AA, Liu B. Genome sequences of five Sitopsis species of Aegilops and the origin of polyploid wheat B subgenome. Mol Plant. 2022 Mar 7;15(3):488-503. doi: 10.1016/j.molp.2021.12.019.
AbstractCommon wheat (Triticum aestivum, BBAADD) is a major staple food crop worldwide. The diploid progenitors of the A and D subgenomes have been unequivocally identified; that of B, however, remains ambiguous and controversial but is suspected to be related to species of Aegilops, section Sitopsis. Here, we report the assembly of chromosome-level genome sequences of all five Sitopsis species, namely Aegilops bicornis, Ae. longissima, Ae. searsii, Ae. sharonensis, and Ae. speltoides, as well as the partial assembly of the Amblyopyrum muticum (synonym Aegilops mutica) genome for phylogenetic analysis. Our results reveal that the donor of the common wheat B subgenome is a distinct, and most probably extinct, diploid species that diverged from an ancestral progenitor of the B lineage to which the still extant Ae. speltoides and Am. muticum belong. In addition, we identified interspecific genetic introgressions throughout the evolution of the Triticum/Aegilops species complex. The five Sitopsis species have various assembled genome sizes (4.11–5.89 Gb) with high proportions of repetitive sequences (85.99%–89.81%); nonetheless, they retain high collinearity with other genomes or subgenomes of species in the Triticum/Aegilops complex. Differences in genome size were primarily due to independent post-speciation amplification of transposons. We also identified a set of Sitopsis genes pertinent to important agronomic traits that can be harnessed for wheat breeding. These newly assembled genome resources provide a new roadmap for evolutionary and genetic studies of the Triticum/Aegilops complex, as well as for wheat improvement.
Assembly statistics
Genome size | 5.9 Gb |
Number of chromosomes | 7 |
Number of scaffolds | 172 |
Scaffold N50 | 816.6 Mb |
Scaffold L50 | 4 |
Number of contigs | 3,579 |
Contig N50 | 3.8 Mb |
Contig L50 | 479 |
Assembly level | Chromosome |
The Aegilops bicornis ASM2160514v1 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | GWHBFXS00000000.1.genome.fasta.gz |
The Aegilops bicornis ASM2160514v1 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | GWHBFXS00000000.1.gff.gz |
CDS sequences (FASTA file) | GWHBFXS00000000.1.RNA.fasta.gz |
Protein sequences (FASTA file) | GWHBFXS00000000.1.Protein.faa.gz |
Functional annotation for the Aegilops bicornis ASM2160514v1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).
Downloads
Domain from InterProScan | Aegilops_bicornis_TB01.Pfam.tsv.gz |
Summary
Query | Chromosome | Size(bp) | Coordinates | tBLASTn Hit | tBLASTn %ID | Domain |
DUF247I-S | GWHBFXS00000001.1 | 674339718 | 150866909-150868504 | LpSDUF247-I_chromosome1 | 80 | DUF247 |
DUF247II-S | GWHBFXS00000001.1 | 674339718 | 150875149-150876729 | LpSDUF247-II_chromosome1 | 70 | DUF247 |
HPS10-S | GWHBFXS00000001.1 | 674339718 | 150870033-150870172,150870368-150870533 | LpsS_contig12948 | 51 | - |
DUF247I-ZΨ | GWHBFXS00000002.1 | 873736323 | 810483225-810483644 | LpZDUF247-I_chromosome2 | 75 | DUF247 |
DUF247II-ZΨ | GWHBFXS00000002.1 | 873736323 | 810747387-810747875 | AsativaDUF247II-Z1 | 76 | DUF247 |
HPS10-Z | GWHBFXS00000002.1 | 873736323 | 810476550-810476644,810476742-810476904 | Bhybridum_HPS10-Z | 67 | - |
Nucleotide
Protein