Eleusine coracana KNE796-S Assembly & Annotation

Overview

Analysis Name Eleusine coracana KNE796-S Assembly & Annotation
Sequencing technology PacBio CLR
Assembly method MECAT v. 1.4
Release Date 2023-11-02
Reference Publication(s)

Devos KM, Qi P, Bahri BA, Gimode DM, Jenike K, Manthi SJ, Lule D, Lux T, Martinez-Bello L, Pendergast TH 4th, Plott C, Saha D, Sidhu GS, Sreedasyam A, Wang X, Wang H, Wright H, Zhao J, Deshpande S, de Villiers S, Dida MM, Grimwood J, Jenkins J, Lovell J, Mayer KFX, Mneney EE, Ojulong HF, Schatz MC, Schmutz J, Song B, Tesfaye K, Odeny DA. Genome analyses reveal population structure and a purple stigma color gene candidate in finger millet. Nat Commun. 2023 Jun 21;14(1):3694. doi: 10.1038/s41467-023-38915-6.

Abstract

Finger millet is a key food security crop widely grown in eastern Africa, India and Nepal. Long considered a ‘poor man’s crop’, finger millet has regained attention over the past decade for its climate resilience and the nutritional qualities of its grain. To bring finger millet breeding into the 21st century, here we present the assembly and annotation of a chromosome-scale reference genome. We show that this ~1.3 million years old allotetraploid has a high level of homoeologous gene retention and lacks subgenome dominance. Population structure is mainly driven by the differential presence of large wild segments in the pericentromeric regions of several chromosomes. Trait mapping, followed by variant analysis of gene candidates, reveals that loss of purple coloration of anthers and stigma is associated with loss-of-function mutations in the finger millet orthologs of the maize R1/B1 and Arabidopsis GL3/EGL3 anthocyanin regulatory genes. Proanthocyanidin production in seed is not affected by these gene knockouts.

Assembly statistics

Genome size 1.1 Gb
Total ungapped length 1.1 Gb
Gaps between scaffolds 142
Number of chromosomes 18
Number of scaffolds 674
Scaffold N50 15.3 Mb
Scaffold L50 25
Number of contigs 674
Contig N50 15.3 Mb
Contig L50 25
GC percent 44
Genome coverage 343.5x
Assembly level Chromosome
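
For readers unfamiliar with the N50 and L50 statistics listed above, the following is a minimal sketch of how they are conventionally computed from a list of sequence lengths. The function name `n50_l50` and the toy lengths are illustrative assumptions, not part of the published assembly pipeline.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a collection of sequence lengths.

    N50 is the length of the shortest sequence in the smallest set of
    longest sequences that together cover at least half of the total
    assembly length; L50 is the number of sequences in that set.
    """
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, count
    raise ValueError("lengths must be non-empty")


# Toy example (hypothetical lengths, not the real scaffolds).
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
n50, l50 = n50_l50(toy_lengths)
print(n50, l50)
```

Applied to the real scaffold lengths, the same calculation should reproduce the N50 and L50 values reported in the table, up to rounding.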

Assembly

The Eleusine coracana KNE796-S assembly is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_032690845.1_Eleusine_coracana_v1.0_genomic.fna.gz
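
As a rough sketch of how the gzipped FASTA file could be inspected once downloaded, the snippet below tallies the length of every record using only the Python standard library. The helper names `fasta_lengths` and `fasta_lengths_from_gz` are hypothetical, and the local file path is assumed to match the download name above.

```python
import gzip


def fasta_lengths(lines):
    """Map each FASTA record ID (first word of the header) to its length."""
    lengths = {}
    current = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            fields = line[1:].split()
            current = fields[0] if fields else ""
            lengths[current] = 0
        elif current is not None:
            lengths[current] += len(line)
    return lengths


def fasta_lengths_from_gz(path):
    """Read a gzip-compressed FASTA file in text mode and tally lengths."""
    with gzip.open(path, "rt") as handle:
        return fasta_lengths(handle)


# Usage, assuming the file has been downloaded to the working directory:
# lengths = fasta_lengths_from_gz(
#     "GCA_032690845.1_Eleusine_coracana_v1.0_genomic.fna.gz")
# print(len(lengths), sum(lengths.values()))
```

Streaming line by line keeps memory use low, which matters for a genome of roughly 1.1 Gb.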

Gene Predictions

The Eleusine coracana KNE796-S gene prediction files are available in GFF3 and FASTA formats.

Downloads

Genes (GFF3 file) GCA_032690845.1_Eleusine_coracana_v1.0_genomic.gff.gz
CDS sequences (FASTA file) GCA_032690845.1_Eleusine_coracana_v1.0_cds_from_genomic.fna.gz
Protein sequences (FASTA file) GCA_032690845.1_Eleusine_coracana_v1.0_protein.faa.gz
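
A quick sanity check on a GFF3 download is to count features by type. The sketch below relies only on the general GFF3 layout (nine tab-separated columns, with the feature type in the third column and `#` marking comment lines); the function name `count_feature_types` is an illustrative assumption, and the feature types actually present in this file have not been verified here.

```python
import gzip
from collections import Counter


def count_feature_types(lines):
    """Count GFF3 features by the value in column 3 (feature type)."""
    counts = Counter()
    for line in lines:
        if line.startswith("##FASTA"):
            # Anything after this directive is embedded sequence, not features.
            break
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) == 9:
            counts[fields[2]] += 1
    return counts


# Usage, assuming the file has been downloaded to the working directory:
# with gzip.open("GCA_032690845.1_Eleusine_coracana_v1.0_genomic.gff.gz", "rt") as fh:
#     for feature_type, n in count_feature_types(fh).most_common():
#         print(feature_type, n)
```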

Functional Analysis

Functional annotation for the Eleusine coracana KNE796-S genome is available for download below. The predicted proteins were analyzed with InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan Eleusine_coracana.Pfam.tsv.gz

S genes

Summary

Query	Chromosome	Size (bp)	Coordinates	tBLASTn Hit	tBLASTn %ID	Domain
DUF247I-S	BQKI01000085.1	18079453	17201219-17202907	Cfungigraminus	63	DUF247
DUF247II-S1Ψ	BQKI01000007.1	28293131	25146095-25147603	Bdecipiens	59	DUF247
DUF247II-S2Ψ	BQKI01000085.1	18079453	17189097-17189579	Bdecipiens	63	DUF247
HPS10-S	BQKI01000085.1	18079453	17199750-17199859, 17199954-17200116	Shybrid S3	63	-
DUF247I-Z1Ψ	BQKI01000009.1	23026135	2500678-2501115	Aatlantica DUF247I-Z	71	DUF247
DUF247I-Z2Ψ	BQKI01000079.1	25899519	2309413-2310468	Pvirgatum	64	DUF247
DUF247II-ZΨ	BQKI01000009.1	23026135	2496883-2497203	Alongiglumis DUF247II-Z	63	DUF247
HPS10-Z	BQKI01000009.1	23026135	2498875-2499055, 2499151-2499257	Ecrus-galli Z2	63	-
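
The coordinate strings in the summary can be turned into intervals programmatically, for example to extract the corresponding regions from the assembly. The sketch below parses single- and multi-segment entries and sums their lengths. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function names are hypothetical.

```python
def parse_coordinates(text):
    """Parse 'start-end' or 'start-end, start-end' into (start, end) tuples."""
    intervals = []
    for part in text.split(","):
        part = part.strip()
        if not part:
            continue
        start, end = part.split("-")
        intervals.append((int(start), int(end)))
    return intervals


def total_length(intervals):
    """Sum interval lengths, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in intervals)


# Coordinates copied from the summary for DUF247I-S and HPS10-S.
duf247i_s = parse_coordinates("17201219-17202907")
hps10_s = parse_coordinates("17199750-17199859, 17199954-17200116")
print(total_length(duf247i_s), total_length(hps10_s))
```

Under the inclusive assumption, the DUF247I-S span is 1689 bp, a multiple of three, while the two HPS10-S segments sum to 273 bp.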


© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences