Alloteropsis semialata ASEM_AUS1_V1.0 Assembly & Annotation

Overview

Analysis Name Alloteropsis semialata ASEM_AUS1_V1.0 Assembly & Annotation
Sequencing technology Illumina HiSeq; PacBio
Assembly method SOAPdenovo v. 2.04; Dbg2olc v. 11062016
Release Date 2019-02-04
Reference Publication(s)

Dunning LT, Olofsson JK, Parisod C, Choudhury RR, Moreno-Villena JJ, Yang Y, Dionora J, Quick WP, Park M, Bennetzen JL, Besnard G, Nosil P, Osborne CP, Christin PA. Lateral transfers of large DNA fragments spread functional genes among grasses. Proc Natl Acad Sci U S A. 2019 Mar 5;116(10):4416-4425. doi: 10.1073/pnas.1810031116.

Abstract

A fundamental tenet of multicellular eukaryotic evolution is that vertical inheritance is paramount, with natural selection acting on genetic variants transferred from parents to offspring. This lineal process means that an organism’s adaptive potential can be restricted by its evolutionary history, the amount of standing genetic variation, and its mutation rate. Lateral gene transfer (LGT) theoretically provides a mechanism to bypass many of these limitations, but the evolutionary importance and frequency of this process in multicellular eukaryotes, such as plants, remains debated. We address this issue by assembling a chromosome-level genome for the grass Alloteropsis semialata, a species surmised to exhibit two LGTs, and screen it for other grass-to-grass LGTs using genomic data from 146 other grass species. Through stringent phylogenomic analyses, we discovered 57 additional LGTs in the A. semialata nuclear genome, involving at least nine different donor species. The LGTs are clustered in 23 laterally acquired genomic fragments that are up to 170 kb long and have accumulated during the diversification of Alloteropsis. The majority of the 59 LGTs in A. semialata are expressed, and we show that they have added functions to the recipient genome. Functional LGTs were further detected in the genomes of five other grass species, demonstrating that this process is likely widespread in this globally important group of plants. LGT therefore appears to represent a potent evolutionary force capable of spreading functional genes among distantly related grass species.

Assembly statistics

Genome size 747.2 Mb
Number of chromosomes 9
Number of scaffolds 686
Scaffold N50 82.1 Mb
Scaffold L50 5
Number of contigs 28,005
Contig N50 39.2 kb
Contig L50 5,451
Assembly level Chromosome

Assembly

The Alloteropsis semialata ASEM_AUS1_V1.0 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_004135705.1_ASEM_AUS1_V1.0_genomic.fna.gz

Gene Predictions

The Alloteropsis semialata ASEM_AUS1_V1.0 genome gene prediction files are not available.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) -
Protein sequences (FASTA file) -

Functional Analysis

Functional annotation for the Alloteropsis semialata ASEM_AUS1_V1.0 is not available.

Downloads

Domain from InterProScan -

S genes

Summary

QueryChromosomeSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
DUF247I-SCM014273.16758146328892840-28894507Eophiuroides65DUF247
DUF247II-S1CM014273.16758146328671501-28673054Ttriandra61DUF247
DUF247II-S2CM014273.16758146333077172-33078818Ttriandra66DUF247
HPS10-S1CM014273.16758146328675383-28675486,
28675913-28676024
ShybridS158-
HPS10-S2CM014273.16758146329350900-29351041,
29351142-29351245
ShybridS176-
HPS10-S3CM014273.16758146330540879-30540997,
30541101-30541248
ShybridS141-
HPS10-S4CM014273.16758146336286144-36286252,
36286535-36286647
ShybridS161-
HPS10-S5CM014273.16758146344961096-44961199,
44961587-44961626
ShybridS160-
HPS10-S6CM014273.16758146347184871-47184977,
47185146-47185236
ShybridS130-
DUF247I-ZCM014276.16663619962076989-62078614LpZDUF247-I_chromosome260DUF247
DUF247II-ZCM014276.16663619962086866-62088545LpZDUF247-II_chromosome263DUF247

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences