Analysis Name | Cenchrus americanus 843 Assembly & Annotation |
Sequencing technology | PacBio, Bionano |
Assembly method | hifiasm |
Release Date | 2023-09-04 |
Ramu P, Srivastava RK, Sanyal A, Fengler K, Cao J, Zhang Y, Nimkar M, Gerke J, Shreedharan S, Llaca V, May G, Peterson-Burch B, Lin H, King M, Das S, Bhupesh V, Mandaokar A, Maruthachalam K, Krishnamurthy P, Gandhi H, Rathore A, Gupta R, Chitikineni A, Bajaj P, Gupta SK, Satyavathi CT, Pandravada A, Varshney RK, Babu R. Improved pearl millet genomes representing the global heterotic pool offer a framework for molecular breeding applications. Commun Biol. 2023 Sep 4;6(1):902. doi: 10.1038/s42003-023-05258-3.
Abstract
High-quality reference genome assemblies, representative of global heterotic patterns, offer an ideal platform to accurately characterize and utilize genetic variation in the primary gene pool of hybrid crops. Here we report three platinum grade de-novo, near gap-free, chromosome-level reference genome assemblies from the active breeding germplasm in pearl millet with a high degree of contiguity, completeness, and accuracy. An improved Tift genome (Tift23D2B1-P1-P5) assembly has a contig N50 ~ 7,000-fold (126 Mb) compared to the previous version and better alignment in centromeric regions. Comparative genome analyses of these three lines clearly demonstrate a high level of collinearity and multiple structural variations, including inversions greater than 1 Mb. Differential genes in improved Tift genome are enriched for serine O-acetyltransferase and glycerol-3-phosphate metabolic process which play an important role in improving the nutritional quality of seed protein and disease resistance in plants, respectively. Multiple marker-trait associations are identified for a range of agronomic traits, including grain yield through genome-wide association study. Improved genome assemblies and marker resources developed in this study provide a comprehensive framework/platform for future applications such as marker-assisted selection of mono/oligogenic traits as well as whole-genome prediction and haplotype-based breeding of complex traits.
Assembly statistics
Genome size (bp) | 1863713383 |
Number of scaffolds | 8 |
Scaffold N50 (bp) | 273803026 |
Scaffold L50 | 3 |
Assembly level | Chromosome |
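For readers who want to recompute the scaffold N50 and L50 reported above, the following is a minimal Python sketch, not the pipeline used for this release. The function names `fasta_lengths` and `n50_l50` and the toy lengths are assumptions made for illustration. N50 is taken here as the length of the scaffold at which the running total of lengths, sorted from longest to shortest, first reaches half of the assembly size; L50 is the number of scaffolds needed to get there.

```python
from typing import Iterable, Iterator, TextIO, Tuple


def fasta_lengths(handle: TextIO) -> Iterator[int]:
    """Yield the length of each sequence in a FASTA text stream."""
    length = None
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if length is not None:
                yield length
            length = 0
        elif line and length is not None:
            length += len(line)
    if length is not None:
        yield length


def n50_l50(lengths: Iterable[int]) -> Tuple[int, int]:
    """Return (N50, L50) for a collection of sequence lengths."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("no sequence lengths given")
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, count
    raise AssertionError("unreachable: running total always reaches half")


# Hypothetical toy lengths, NOT the real scaffold lengths of this assembly.
toy_lengths = [100, 80, 60, 40, 20]
print(n50_l50(toy_lengths))  # (80, 2)
```

To apply this to the downloaded chromosome file, one could open it with `gzip.open("PearlMillet.843B.CHROMOSOMES.fasta.gz", "rt")` and pass `fasta_lengths(handle)` to `n50_l50`.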
The Cenchrus americanus 843 assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | PearlMillet.843B.CHROMOSOMES.fasta.gz |
The Cenchrus americanus 843 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | 843B_fil.gff3.gz |
CDS sequences (FASTA file) | 843B_fil_CDS.fa.gz |
Protein sequences (FASTA file) | 843B_fil_prot.fa.gz |
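As a quick sanity check on a downloaded gene prediction file, the sketch below counts features by type. It assumes only the standard GFF3 layout of nine tab-separated columns with the feature type in the third column; the function name `count_feature_types` and the toy records are illustrative assumptions, and the feature types actually present in `843B_fil.gff3.gz` have not been checked here.

```python
import io
from collections import Counter
from typing import TextIO


def count_feature_types(handle: TextIO) -> Counter:
    """Count GFF3 records by the value of their type column (column 3)."""
    counts = Counter()
    for line in handle:
        # An optional ##FASTA directive marks the end of the feature records.
        if line.startswith("##FASTA"):
            break
        # Skip comments, directives, and blank lines.
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 9:
            continue  # not a well-formed GFF3 record
        counts[fields[2]] += 1
    return counts


# Hypothetical toy records, NOT taken from the 843 annotation.
toy = io.StringIO(
    "##gff-version 3\n"
    "chr1\ttoy\tgene\t1\t900\t.\t+\t.\tID=g1\n"
    "chr1\ttoy\tmRNA\t1\t900\t.\t+\t.\tID=g1.t1;Parent=g1\n"
    "chr1\ttoy\tCDS\t1\t300\t.\t+\t0\tID=c1;Parent=g1.t1\n"
    "chr1\ttoy\tCDS\t601\t900\t.\t+\t0\tID=c2;Parent=g1.t1\n"
)
print(count_feature_types(toy))
```

For the real file, the same function can be given a handle from `gzip.open("843B_fil.gff3.gz", "rt")`.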
Functional annotation for the Cenchrus americanus 843 assembly is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains (Pfam).
Downloads
Domain from InterProScan | ??? |
Summary
Query | Chromosome | Size (bp) | Coordinates | tBLASTn Hit | tBLASTn %ID | Domain |
DUF247I-SΨ | Chr5 | 29958434 | 6058829-6059311 | LpSDUF247-I_chromosome1 | 76 | DUF247 |
DUF247II-S | Chr5 | 29958434 | 6046665-6048326 | LpSDUF247-II_chromosome1 | 68 | DUF247 |
HPS10-S | Chr5 | 29958434 | 6050205-6050364,6050467-6050603 | LpsS_contig11029 | 38 | - |
DUF247I-ZΨ | Chr4 | 35502694 | 32941990-32942343 | AatlanticaDUF247I-Z | 74 | DUF247 |
DUF247II-ZΨ | Chr4 | 35502694 | 32949311-32950072 | Psupina Chr4 7 | 72 | DUF247 |
HPS10-Z | Chr4 | 35502694 | 32946909-32947006,32947136-32947307 | AerianthaHPS10-Z | 35 | - |
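Some entries in the Coordinates column list more than one segment, separated by commas. The sketch below shows one way to parse such a value and total the bases it covers. It assumes the coordinates are 1-based and inclusive, which the table does not state, and the function names `parse_coordinates` and `covered_length` are hypothetical.

```python
from typing import List, Tuple


def parse_coordinates(text: str) -> List[Tuple[int, int]]:
    """Parse 'start-end[,start-end...]' into a list of (start, end) pairs."""
    segments = []
    for part in text.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"segment start exceeds end: {part!r}")
        segments.append((start, end))
    return segments


def covered_length(segments: List[Tuple[int, int]]) -> int:
    """Total bases covered, ASSUMING 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


# The HPS10-S row from the table above has two segments.
hps10_s = parse_coordinates("6050205-6050364,6050467-6050603")
print(hps10_s)                  # [(6050205, 6050364), (6050467, 6050603)]
print(covered_length(hps10_s))  # 297
```

If the coordinates turn out to be 0-based or half-open, the `+ 1` in `covered_length` would need to be dropped.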