Cenchrus americanus 843 Assembly & Annotation

Overview

Analysis Name: Cenchrus americanus 843 Assembly & Annotation
Sequencing technology: PacBio, Bionano
Assembly method: hifiasm
Release Date: 2023-09-04
Reference Publication(s)

Ramu P, Srivastava RK, Sanyal A, Fengler K, Cao J, Zhang Y, Nimkar M, Gerke J, Shreedharan S, Llaca V, May G, Peterson-Burch B, Lin H, King M, Das S, Bhupesh V, Mandaokar A, Maruthachalam K, Krishnamurthy P, Gandhi H, Rathore A, Gupta R, Chitikineni A, Bajaj P, Gupta SK, Satyavathi CT, Pandravada A, Varshney RK, Babu R. Improved pearl millet genomes representing the global heterotic pool offer a framework for molecular breeding applications. Commun Biol. 2023 Sep 4;6(1):902. doi: 10.1038/s42003-023-05258-3.

Abstract

High-quality reference genome assemblies, representative of global heterotic patterns, offer an ideal platform to accurately characterize and utilize genetic variation in the primary gene pool of hybrid crops. Here we report three platinum grade de-novo, near gap-free, chromosome-level reference genome assemblies from the active breeding germplasm in pearl millet with a high degree of contiguity, completeness, and accuracy. An improved Tift genome (Tift23D2B1-P1-P5) assembly has a contig N50 ~ 7,000-fold (126 Mb) compared to the previous version and better alignment in centromeric regions. Comparative genome analyses of these three lines clearly demonstrate a high level of collinearity and multiple structural variations, including inversions greater than 1 Mb. Differential genes in improved Tift genome are enriched for serine O-acetyltransferase and glycerol-3-phosphate metabolic process which play an important role in improving the nutritional quality of seed protein and disease resistance in plants, respectively. Multiple marker-trait associations are identified for a range of agronomic traits, including grain yield through genome-wide association study. Improved genome assemblies and marker resources developed in this study provide a comprehensive framework/platform for future applications such as marker-assisted selection of mono/oligogenic traits as well as whole-genome prediction and haplotype-based breeding of complex traits.

Assembly statistics

Genome size (bp): 1863713383
Number of scaffolds: 8
Scaffold N50 (bp): 273803026
Scaffold L50: 3
Assembly level: Chromosome
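
For reference, scaffold N50 is the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the assembly, and L50 is the number of scaffolds in that set. A minimal Python sketch of that calculation is given here. The function name n50_l50 and the example lengths are illustrative only; they are not the per-scaffold lengths of this assembly, which are not listed on this page.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of sequence lengths in bp."""
    # Walk through the sequences from longest to shortest.
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # The first sequence that pushes the running sum to at least
        # half of the total defines N50; its rank is L50.
        if running >= half_total:
            return length, count
    raise ValueError("lengths must contain at least one value")


# Hypothetical lengths, chosen only to make the arithmetic easy to follow.
example = [10, 8, 6, 4, 2]
print(n50_l50(example))  # (8, 2)
```

Some tools use slightly different tie-breaking or thresholds, so values recomputed this way may not match a published figure exactly.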

Assembly

The Cenchrus americanus 843 assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file): PearlMillet.843B.CHROMOSOMES.fasta.gz
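
Per-sequence lengths can be recomputed from the downloaded chromosome file. The sketch below uses only the Python standard library to read a gzip-compressed FASTA file and return the length of each sequence. The function name fasta_lengths is illustrative, and the path in the usage comment assumes a local copy of the file in the working directory.

```python
import gzip


def fasta_lengths(path):
    """Return {sequence name: length in bp} for a gzip-compressed FASTA file."""
    lengths = {}
    name = None
    with gzip.open(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                # Use the first whitespace-delimited token of the header as the name.
                parts = line[1:].split()
                name = parts[0] if parts else ""
                lengths[name] = 0
            elif name is not None:
                lengths[name] += len(line)
    return lengths


# Usage (assumes the file has been downloaded locally):
# lengths = fasta_lengths("PearlMillet.843B.CHROMOSOMES.fasta.gz")
# print(len(lengths), sum(lengths.values()))
```

Reading line by line keeps memory use low, which matters for chromosome-scale sequences.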

Gene Predictions

The Cenchrus americanus 843 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file): 843B_fil.gff3.gz
CDS sequences (FASTA file): 843B_fil_CDS.fa.gz
Protein sequences (FASTA file): 843B_fil_prot.fa.gz
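
A quick way to check the content of the GFF3 download is to tally its feature types. GFF3 is a nine-column, tab-delimited format whose third column holds the feature type. The sketch below counts those types with the Python standard library. The function name count_feature_types is illustrative, and which feature types the file actually contains (for example gene, mRNA, or CDS) is an assumption to verify against the file itself.

```python
import gzip
from collections import Counter


def count_feature_types(path):
    """Count GFF3 feature types (column 3) in a gzip-compressed file."""
    counts = Counter()
    with gzip.open(path, "rt") as handle:
        for line in handle:
            # Skip directives, comments, and blank lines.
            if line.startswith("#") or not line.strip():
                continue
            columns = line.rstrip("\n").split("\t")
            # A feature line has exactly nine tab-separated columns;
            # anything else (e.g. an embedded FASTA section) is ignored.
            if len(columns) != 9:
                continue
            counts[columns[2]] += 1
    return counts


# Usage (assumes the file has been downloaded locally):
# for feature_type, n in count_feature_types("843B_fil.gff3.gz").most_common():
#     print(feature_type, n)
```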

Functional Analysis

Functional annotation for the Cenchrus americanus 843 genome is available for download below. The proteins were analyzed with InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan: ???

S genes

Summary

Query | Chromosome | Size (bp) | Coordinates | tBLASTn Hit | tBLASTn %ID | Domain
DUF247I-SΨ | Chr5 | 29958434 | 6058829-6059311 | LpSDUF247-I_chromosome1 | 76 | DUF247
DUF247II-S | Chr5 | 29958434 | 6046665-6048326 | LpSDUF247-II_chromosome1 | 68 | DUF247
HPS10-S | Chr5 | 29958434 | 6050205-6050364, 6050467-6050603 | LpsS_contig11029 | 38 | -
DUF247I-ZΨ | Chr4 | 35502694 | 32941990-32942343 | AatlanticaDUF247I-Z | 74 | DUF247
DUF247II-ZΨ | Chr4 | 35502694 | 32949311-32950072 | Psupina Chr4 7 | 72 | DUF247
HPS10-Z | Chr4 | 35502694 | 32946909-32947006, 32947136-32947307 | AerianthaHPS10-Z | 35 | -
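
Each Coordinates entry in the summary is either a single start-end range or a comma-separated list of ranges (as for the two HPS10 genes). The sketch below parses such an entry and sums the covered length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names parse_ranges and total_length are illustrative.

```python
def parse_ranges(text):
    """Parse 'start-end, start-end' into a list of (start, end) integer pairs."""
    ranges = []
    for part in text.split(","):
        part = part.strip()
        if not part:
            continue
        start, end = (int(value) for value in part.split("-"))
        if start > end:
            raise ValueError(f"start exceeds end in range: {part}")
        ranges.append((start, end))
    return ranges


def total_length(ranges):
    """Sum range lengths, ASSUMING 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in ranges)


# Coordinates listed for HPS10-S on Chr5.
segments = parse_ranges("6050205-6050364, 6050467-6050603")
print(segments)                # [(6050205, 6050364), (6050467, 6050603)]
print(total_length(segments))  # 297
```

If the coordinates turn out to be 0-based or half-open, the "+ 1" in total_length would need to be dropped.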


© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences