Microstegium vimineum ASM2203655v1 Assembly & Annotation

Overview

Analysis Name Microstegium vimineum ASM2203655v1 Assembly & Annotation
Sequencing technology PacBio Sequel II; Omni-C
Assembly method Wtdbg2 v. 2.5
Release Date 2022-02-10
Reference Publication(s)

Ramachandran D, Huebner CD, Daly M, Haimovitz J, Swale T, Barrett CF. Chromosome Level Genome Assembly and Annotation of Highly Invasive Japanese Stiltgrass (Microstegium vimineum). Genome Biol Evol. 2021 Nov 5;13(11):evab238. doi: 10.1093/gbe/evab238.

Abstract

The invasive Japanese stiltgrass (Microstegium vimineum) affects a wide range of ecosystems and threatens biodiversity across the eastern USA. However, the mechanisms underlying rapid adaptation, plasticity, and epigenetics in the invasive range are largely unknown. We present a chromosome-level assembly for M. vimineum to investigate genome dynamics, evolution, adaptation, and the genomics of phenotypic plasticity. We generated a 1.12-Gb genome with scaffold N50 length of 53.44 Mb respectively, taking a de novo assembly approach that combined PacBio and Dovetail Genomics Omni-C sequencing. The assembly contains 23 pseudochromosomes, representing 99.96% of the genome. BUSCO assessment indicated that 80.3% of Poales gene groups are present in the assembly. The genome is predicted to contain 39,604 protein-coding genes, of which 26,288 are functionally annotated. Furthermore, 66.68% of the genome is repetitive, of which unclassified (35.63%) and long-terminal repeat (LTR) retrotransposons (26.90%) are predominant. Similar to other grasses, Gypsy (41.07%) and Copia (32%) are the most abundant LTR-retrotransposon families. The majority of LTR-retrotransposons are derived from a significant expansion in the past 1–2 Myr, suggesting the presence of relatively young LTR-retrotransposon lineages. We find corroborating evidence from Ks plots for a stiltgrass-specific duplication event, distinct from the more ancient grass-specific duplication event. The assembly and annotation of M. vimineum will serve as an essential genomic resource facilitating studies of the invasion process, the history and consequences of polyploidy in grasses, and provides a crucial tool for natural resource managers.

Assembly statistics

Genome size1.1 Gb
Total ungapped length1.1 Gb
Number of chromosomes23
Number of scaffolds460
Scaffold N5053 Mb
Scaffold L5010
Number of contigs5,308
Contig N50597 kb
Contig L50545
GC percent45
Genome coverage100.0x
Assembly levelChromosome

Assembly

The Microstegium vimineum ASM2203655v1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_022036555.1_ASM2203655v1_genomic.fna.gz

Gene Predictions

The Microstegium vimineum ASM2203655v1 genome gene prediction files are not available.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) -
Protein sequences (FASTA file) -

Functional Analysis

Functional annotation for the Microstegium vimineum ASM2203655v1 is not available.

Downloads

Domain from InterProScan -

S genes

Summary

QueryChromosomeSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
DUF247I-SΨCM039126.15348704824937586-24938389Shybrid68DUF247
DUF247I-ZΨCM039126.15348704858858911-58860038Pvaginatum81DUF247
DUF247II-Z1CM039131.16237820259130582-59132177Shybrid54DUF247
DUF247II-Z2ΨCM039131.16237820259255517-59256944Pvaginatum85DUF247
HPS10-ZCM039131.16237820259129639-59129798,
59129965-59130074
Ocoarctata35-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences