Eleusine indica ASM3037835v1 Assembly & Annotation

Overview

Analysis Name Eleusine indica ASM3037835v1 Assembly & Annotation
Sequencing technology PacBio RSII
Assembly method FALCON v. 01
Release Date 2023-07-06
Reference Publication(s)

Zhang C, Johnson NA, Hall N, Tian X, Yu Q, Patterson EL. Subtelomeric 5-enolpyruvylshikimate-3-phosphate synthase copy number variation confers glyphosate resistance in Eleusine indica. Nat Commun. 2023 Aug 11;14(1):4865. doi: 10.1038/s41467-023-40407-6.

Abstract

Genomic structural variation (SV) has profound effects on organismal evolution; often serving as a source of novel genetic variation. Gene copy number variation (CNV), one type of SV, has repeatedly been associated with adaptive evolution in eukaryotes, especially with environmental stress. Resistance to the widely used herbicide, glyphosate, has evolved through target-site CNV in many weedy plant species, including the economically important grass, Eleusine indica (goosegrass); however, the origin and mechanism of these CNVs remain elusive in many weed species due to limited genetic and genomic resources. To study this CNV in goosegrass, we present high-quality reference genomes for glyphosate-susceptible and -resistant goosegrass lines and fine-assembles of the duplication of glyphosate’s target site gene 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS). We reveal a unique rearrangement of EPSPS involving chromosome subtelomeres. This discovery adds to the limited knowledge of the importance of subtelomeres as genetic variation generators and provides another unique example for herbicide resistance evolution.

Assembly statistics

Genome size522.5 Mb
Total ungapped length522.5 Mb
Number of chromosomes9
Number of scaffolds108
Scaffold N5057.4 Mb
Scaffold L505
Number of contigs170
Contig N5042.1 Mb
Contig L505
GC percent44
Genome coverage223.0x
Assembly levelChromosome

Assembly

The Eleusine indica ASM3037835v1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) Eleusine_indica.faa.gz

Gene Predictions

The Eleusine indica ASM3037835v1 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Eleusine_indica.gff.gz
CDS sequences (FASTA file) Ei_cds.fa.gz
Protein sequences (FASTA file) Ei_pep.fa.gz

Functional Analysis

Functional annotation for the Eleusine indica ASM3037835v1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Eleusine_indica.Pfam.tsv.gz

S genes

Summary

QueryChromosomeSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
DUF247II-SΨ36374251539275110-39276441Ttriandra60DUF247
DUF247I-ZΨ9421098162281031-2281468Pnotatum59DUF247
DUF247II-Z9421098162277238-2278836Onivara67DUF247
HPS10-Z9421098162279251-2279431,
2279528-2279634
DexilisZ232-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences