Tripidium rufipilum Assembly & Annotation

Overview

Analysis Name Tripidium rufipilum Assembly & Annotation
Sequencing technology PacBio HiFi
Assembly method Hifiasm Hifiasm-0.16.1 (r375)
Release Date 2023-02-15
Reference Publication(s)

Wang T, Wang B, Hua X, Tang H, Zhang Z, Gao R, Qi Y, Zhang Q, Wang G, Yu Z, Huang Y, Zhang Z, Mei J, Wang Y, Zhang Y, Li Y, Meng X, Wang Y, Pan H, Chen S, Li Z, Shi H, Liu X, Deng Z, Chen B, Zhang M, Gu L, Wang J, Ming R, Yao W, Zhang J. A complete gap-free diploid genome in Saccharum complex and the genomic footprints of evolution in the highly polyploid Saccharum genus. Nat Plants. 2023 Apr;9(4):554-571. doi: 10.1038/s41477-023-01378-0.

Abstract

A diploid genome in the Saccharum complex facilitates our understanding of evolution in the highly polyploid Saccharum genus. Here we have generated a complete, gap-free genome assembly of Erianthus rufipilus, a diploid species within the Saccharum complex. The complete assembly revealed that centromere satellite homogenization was accompanied by the insertions of Gypsy retrotransposons, which drove centromere diversification. An overall low rate of gene transcription was observed in the palaeo-duplicated chromosome EruChr05 similar to other grasses, which might be regulated by methylation patterns mediated by homologous 24 nt small RNAs, and potentially mediating the functions of many nucleotide-binding site genes. Sequencing data for 211 accessions in the Saccharum complex indicated that Saccharum probably originated in the trans-Himalayan region from a diploid ancestor (x = 10) around 1.9–2.5 million years ago. Our study provides new insights into the origin and evolution of Saccharum and accelerates translational research in cereal genetics and genomics.

Assembly statistics

Genome size (bp)858,700,241
GC content45.45%
Chromosomes sequence No.10
Assembly levelT2T

Assembly

The Tripidium rufipilum Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) Erufi.v20220328.chr.fasta.tar.gz

Gene Predictions

The Tripidium rufipilum genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Erufi.v20220328.chr.gff3.tar.gz
CDS sequences (FASTA file) Erufi.v20220328.cds.fa.tar.gz
Protein sequences (FASTA file) Erufi.v20220328.protein.fa.tar.gz

Functional Analysis

Functional annotation for the Tripidium rufipilum is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Tripidium_rufipilum.Pfam.tsv.gz

S genes

Summary

QueryChromosomeSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
DUF247I-SΨChr107248043839869826-39870854Pvaginatum69DUF247
DUF247II-ZChr067706317270685228-70686853Orufipogon54DUF247
HPS10-ZChr067706317270682230-70682342,70682532-70682661Lperrieri44-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences