Analysis Name | Cenchrus purpureus gwh_assembly Elephant grass cv. Purple Assembly & Annotation |
Sequencing technology | Nanopore |
Assembly method | NextDenovo 2020-8-7 |
Release Date | 2020-09-15 |
Yan Q, Wu F, Xu P, Sun Z, Li J, Gao L, Lu L, Chen D, Muktar M, Jones C, Yi X, Zhang J. The elephant grass (Cenchrus purpureus) genome provides insights into anthocyanidin accumulation and fast growth. Mol Ecol Resour. 2021 Feb;21(2):526-542. doi: 10.1111/1755-0998.13271.
Summary
Elephant grass (2n = 4x = 28; Cenchrus purpureus Schumach.), also known as Napier grass, is an important forage grass and potential energy crop in tropical and subtropical regions of Asia, Africa and America. However, no study has yet reported a genome assembly for elephant grass at the chromosome scale. Here, we report a high-quality chromosome-scale genome of elephant grass with a total size of 1.97 Gb and a 1.5% heterozygosity rate, obtained using short-read sequencing, single-molecule long-read sequencing and Hi-C chromosome conformation capture. Evolutionary analysis showed that subgenome A' of elephant grass and pearl millet may have originated from a common ancestor more than 3.22 million years ago (MYA). Further, allotetraploid formation occurred at approximately 6.61 MYA. Syntenic analyses within elephant grass and with other grass species indicated that elephant grass has experienced chromosomal rearrangements. We found that some key enzyme-encoding gene families related to the biosynthesis of anthocyanidins and flavonoids were expanded and highly expressed in leaves, which probably drives the production of these major anthocyanidin compounds and explains why this elephant grass cultivar has a high anthocyanidin content. In addition, we found a high copy number and transcript levels of genes involved in C4 photosynthesis and hormone signal transduction pathways that may contribute to the fast growth of elephant grass. The availability of elephant grass genome data advances our knowledge of the genetic evolution of elephant grass and will contribute to further biological research and breeding as well as for other polyploid plants in the genus Cenchrus.
Assembly statistics
Genome size (bp) | 1,967,774,562 |
GC content | 46.95% |
Genome sequence No. | 252 |
Maximum genome sequence length (bp) | 199,064,672 |
Minimum genome sequence length (bp) | 29,438 |
Average genome sequence length (bp) | 7,808,629 |
Genome sequence N50 (bp) | 150,585,890 |
Genome sequence N90 (bp) | 99,749,506 |
Assembly level | Chromosome |
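The statistics in this table follow the usual definitions: N50 (or N90) is the length of the shortest sequence in the smallest set of longest sequences that together cover at least 50% (or 90%) of the assembly. The sketch below shows one way to compute such figures from a list of sequence lengths. The function names `nx_length` and `assembly_summary` are illustrative, not part of any tool used for this assembly, and the toy lengths are made up. The reported average is consistent with integer division of the genome size by the sequence count (1,967,774,562 // 252 = 7,808,629).

```python
def nx_length(lengths, percent):
    """Return the Nx length for an integer percent (e.g. 50 for N50).

    Sequences are taken from longest to shortest until their combined
    length reaches at least `percent` of the total; the length of the
    last sequence taken is the Nx value.
    """
    if not lengths:
        raise ValueError("no sequence lengths given")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer arithmetic avoids floating-point rounding at the threshold.
        if running * 100 >= percent * total:
            return length
    return min(lengths)


def assembly_summary(lengths):
    """Summarise a list of sequence lengths in the style of the table above."""
    total = sum(lengths)
    return {
        "sequences": len(lengths),
        "total_bp": total,
        "max_bp": max(lengths),
        "min_bp": min(lengths),
        "mean_bp": total // len(lengths),  # floor of the mean
        "n50_bp": nx_length(lengths, 50),
        "n90_bp": nx_length(lengths, 90),
    }


# Toy example with made-up lengths (not the real assembly).
toy_lengths = [100, 80, 60, 40, 20]
toy_summary = assembly_summary(toy_lengths)
```

For the toy list the total is 300 bp, so N50 is reached at the second sequence (80 bp) and N90 at the fourth (40 bp).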
The Cenchrus purpureus gwh_assembly Elephant grass cv. Purple Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | GWHAORA00000000.genome.fasta.gz |
The Cenchrus purpureus gwh_assembly Elephant grass cv. Purple genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | GWHAORA00000000.gff.gz |
CDS sequences (FASTA file) | GWHAORA00000000.CDS.fasta.gz |
Protein sequences (FASTA file) | GWHAORA00000000.Protein.faa.gz |
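The FASTA downloads above are gzip-compressed. As a minimal sketch, assuming a file has been downloaded locally and follows the plain FASTA layout (a `>` header line followed by sequence lines), the record lengths can be read with the Python standard library alone. The helper name `fasta_lengths` is hypothetical, and the layout of the header lines in the actual files is not described on this page.

```python
import gzip


def fasta_lengths(path):
    """Yield (sequence_id, length) for each record in a gzipped FASTA file.

    The sequence ID is taken as the first whitespace-delimited token of
    the header line; that convention is an assumption, not something
    stated by the download page.
    """
    seq_id, length = None, 0
    with gzip.open(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            if line.startswith(">"):
                if seq_id is not None:
                    yield seq_id, length
                tokens = line[1:].split()
                seq_id, length = (tokens[0] if tokens else ""), 0
            elif seq_id is not None:
                length += len(line)
    if seq_id is not None:
        yield seq_id, length
```

A call such as `dict(fasta_lengths("GWHAORA00000000.genome.fasta.gz"))` would then give a mapping from sequence ID to length in bp, provided the file is in the working directory.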
Functional annotation for the Cenchrus purpureus gwh_assembly Elephant grass cv. Purple is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains (Pfam).
Downloads
Domain from InterProScan | Cenchrus_purpureus.Pfam.tsv.gz |
Summary
Query | Chromosome | Size(bp) | Coordinates | tBLASTn Hit | tBLASTn %ID | Domain |
DUF247I-S | GWHAORA00000012 | 106417498 | 67422643-67424331 | Ecrus-galli | 60 | DUF247 |
DUF247II-SΨ | GWHAORA00000012 | 106417498 | 67762161-67762466 | Ttriandra | 64 | DUF247 |
HPS10-S | GWHAORA00000012 | 106417498 | 67464499-67464667,67464777-67464895 | EophiuroidesS1 | 60 | - |
DUF247II-Z | GWHAORA00000014 | 66044712 | 60057475-60059106 | LpZDUF247-II_chromosome2 | 65 | DUF247 |
HPS10-Z1 | GWHAORA00000014 | 66044712 | 60055693-60055858,60055925-60056037 | LmsZ_scaffold1905 | 39 | - |
HPS10-Z2 | GWHAORA00000014 | 66044712 | 60173403-60173568,60173635-60173750 | LmsZ_scaffold1905 | 37 | - |
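The Coordinates column in the table above holds either a single `start-end` range or several comma-separated ranges. A small sketch of how such a field could be split into segments and summed is given below. The function names are illustrative, and treating both ends of each range as inclusive (1-based) is an assumption, since the page does not state the coordinate convention.

```python
def parse_coordinates(field):
    """Split 'start-end,start-end' into a list of (start, end) integer pairs."""
    segments = []
    for part in field.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def spanned_bp(segments):
    """Total bases covered, assuming inclusive start and end positions."""
    return sum(end - start + 1 for start, end in segments)


# Coordinates copied from the HPS10-S and DUF247I-S rows of the table.
hps10_s = parse_coordinates("67464499-67464667,67464777-67464895")
duf247i_s = parse_coordinates("67422643-67424331")
```

Under the inclusive assumption, the two HPS10-S segments cover 169 + 119 = 288 bp and the DUF247I-S range covers 1,689 bp. Both totals are multiples of three.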