Aegilops bicornis ASM2160514v1 Assembly & Annotation

Overview

Analysis Name Aegilops bicornis ASM2160514v1 Assembly & Annotation
Sequencing technology Oxford Nanopore PromethION
Assembly method Wtdbg2 v. 2.5
Release Date 2022-01-26
Reference Publication(s)

Li LF, Zhang ZB, Wang ZH, Li N, Sha Y, Wang XF, Ding N, Li Y, Zhao J, Wu Y, Gong L, Mafessoni F, Levy AA, Liu B. Genome sequences of five Sitopsis species of Aegilops and the origin of polyploid wheat B subgenome. Mol Plant. 2022 Mar 7;15(3):488-503. doi: 10.1016/j.molp.2021.12.019.

Abstract

Common wheat (Triticum aestivum, BBAADD) is a major staple food crop worldwide. The diploid progenitors of the A and D subgenomes have been unequivocally identified; that of B, however, remains ambiguous and controversial but is suspected to be related to species of Aegilops, section Sitopsis. Here, we report the assembly of chromosome-level genome sequences of all five Sitopsis species, namely Aegilops bicornis, Ae. longissima, Ae. searsii, Ae. sharonensis, and Ae. speltoides, as well as the partial assembly of the Amblyopyrum muticum (synonym Aegilops mutica) genome for phylogenetic analysis. Our results reveal that the donor of the common wheat B subgenome is a distinct, and most probably extinct, diploid species that diverged from an ancestral progenitor of the B lineage to which the still extant Ae. speltoides and Am. muticum belong. In addition, we identified interspecific genetic introgressions throughout the evolution of the Triticum/Aegilops species complex. The five Sitopsis species have various assembled genome sizes (4.11–5.89 Gb) with high proportions of repetitive sequences (85.99%–89.81%); nonetheless, they retain high collinearity with other genomes or subgenomes of species in the Triticum/Aegilops complex. Differences in genome size were primarily due to independent post-speciation amplification of transposons. We also identified a set of Sitopsis genes pertinent to important agronomic traits that can be harnessed for wheat breeding. These newly assembled genome resources provide a new roadmap for evolutionary and genetic studies of the Triticum/Aegilops complex, as well as for wheat improvement.

Assembly statistics

Genome size 5.9 Gb
Number of chromosomes 7
Number of scaffolds 172
Scaffold N50 816.6 Mb
Scaffold L50 4
Number of contigs 3,579
Contig N50 3.8 Mb
Contig L50 479
Assembly level Chromosome

Assembly

The Aegilops bicornis ASM2160514v1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHBFXS00000000.1.genome.fasta.gz

Gene Predictions

The Aegilops bicornis ASM2160514v1 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHBFXS00000000.1.gff.gz
CDS sequences (FASTA file) GWHBFXS00000000.1.RNA.fasta.gz
Protein sequences (FASTA file) GWHBFXS00000000.1.Protein.faa.gz

Functional Analysis

Functional annotation for the Aegilops bicornis ASM2160514v1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Aegilops_bicornis_TB01.Pfam.tsv.gz

S genes

Summary

QueryChromosomeSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
DUF247I-SGWHBFXS00000001.1674339718150866909-150868504LpSDUF247-I_chromosome180DUF247
DUF247II-SGWHBFXS00000001.1674339718150875149-150876729LpSDUF247-II_chromosome170DUF247
HPS10-SGWHBFXS00000001.1674339718150870033-150870172,
150870368-150870533
LpsS_contig1294851-
DUF247I-ZΨGWHBFXS00000002.1873736323810483225-810483644LpZDUF247-I_chromosome275DUF247
DUF247II-ZΨGWHBFXS00000002.1873736323810747387-810747875AsativaDUF247II-Z176DUF247
HPS10-ZGWHBFXS00000002.1873736323810476550-810476644,
810476742-810476904
Bhybridum_HPS10-Z67-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences