Citrus mangshanensis ASM3044989v1 Assembly & Annotation

Overview

Analysis Name Citrus mangshanensis ASM3044989v1 Assembly & Annotation
Sequencing technology Oxford Nanopore
Assembly method SMARTdenovo v. default
Release Date 2023-07-13
Reference Publication(s)

Huang Y, He J, Xu Y, Zheng W, Wang S, Chen P, Zeng B, Yang S, Jiang X, Liu Z, Wang L, Wang X, Liu S, Lu Z, Liu Z, Yu H, Yue J, Gao J, Zhou X, Long C, Zeng X, Guo YJ, Zhang WF, Xie Z, Li C, Ma Z, Jiao W, Zhang F, Larkin RM, Krueger RR, Smith MW, Ming R, Deng X, Xu Q. Pangenome analysis provides insight into the evolution of the orange subfamily and a key gene for citric acid accumulation in citrus fruits. Nat Genet. 2023 Nov;55(11):1964-1975. doi: 10.1038/s41588-023-01516-6.

Abstract

The orange subfamily (Aurantioideae) contains several Citrus species cultivated worldwide, such as sweet orange and lemon. The origin of Citrus species has long been debated and less is known about the Aurantioideae. Here, we compiled the genome sequences of 314 accessions, de novo assembled the genomes of 12 species and constructed a graph-based pangenome for Aurantioideae. Our analysis indicates that the ancient Indian Plate is the ancestral area for Citrus-related genera and that South Central China is the primary center of origin of the Citrus genus. We found substantial variations in the sequence and expression of the PH4 gene in Citrus relative to Citrus-related genera. Gene editing and biochemical experiments demonstrate a central role for PH4 in the accumulation of citric acid in citrus fruits. This study provides insights into the origin and evolution of the orange subfamily and a regulatory mechanism underpinning the evolution of fruit taste.

Assembly statistics

Genome size 371 Mb
Total ungapped length 371 Mb
Number of contigs 376
Contig N50 3.6 Mb
Contig L50 24
GC percent 37.5
Genome coverage 77.0x
Assembly level Contig
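
The contig N50 and L50 listed above follow the usual definitions: contigs are sorted from longest to shortest and their lengths are accumulated until half of the total assembly length is reached. N50 is the length of the contig that crosses that threshold, and L50 is the number of contigs needed to get there. The sketch below illustrates the calculation on made-up contig lengths, not on the real assembly; the function name n50_l50 is illustrative only.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of contig lengths."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # The first contig that brings the running total to at least
        # half of the assembly length defines N50; its rank is L50.
        if running >= half:
            return length, count
    raise ValueError("no contig lengths given")


# Toy lengths (hypothetical, total = 300): 80 + 70 reaches half of the total.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50_l50(toy_lengths))
```

Applied to the real contig lengths, this calculation should correspond to the N50 of 3.6 Mb and L50 of 24 reported here, assuming the same definitions were used.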

Assembly

The Citrus mangshanensis ASM3044989v1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) MSYG.v1.0.genome.fa.gz
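
Once downloaded, the assembly FASTA can be inspected with a few lines of Python. The following is a minimal sketch, assuming a standard FASTA layout (a ">" header line followed by sequence lines); the helper names read_fasta and gc_percent are illustrative, and the toy records stand in for the real contigs.

```python
def read_fasta(lines):
    """Yield (name, sequence) pairs from FASTA-formatted text lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            header = line[1:].split()
            name, chunks = (header[0] if header else ""), []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def gc_percent(records):
    """Percentage of G and C bases over all bases in the given records."""
    gc = total = 0
    for _, seq in records:
        seq = seq.upper()
        gc += seq.count("G") + seq.count("C")
        total += len(seq)
    return 100.0 * gc / total if total else 0.0


# Toy input (hypothetical contig names and sequences).
toy_lines = [">contigA demo", "ACGT", "AAAA", ">contigB", "GGCC"]
records = list(read_fasta(toy_lines))
print(len(records), gc_percent(records))
```

For the real file, the same functions can be given a text handle such as gzip.open("MSYG.v1.0.genome.fa.gz", "rt") from the standard library in place of toy_lines.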

Gene Predictions

The gene prediction files for the Citrus mangshanensis ASM3044989v1 genome are available in GFF3 and FASTA formats.

Downloads

Genes (GFF3 file) MSYG.v1.0.gene.model.gff3.gz
CDS sequences (FASTA file) MSYG.v1.0.CDS.fa.gz
Protein sequences (FASTA file) MSYG.v1.0.protein.fa.gz
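
A quick sanity check on the gene models is to count features by type. In GFF3, each feature line has nine tab-separated columns and the third column holds the feature type. The sketch below assumes the file follows that standard layout; the function name and the example records (including their IDs) are hypothetical.

```python
from collections import Counter


def count_feature_types(lines):
    """Count GFF3 feature types (column 3) from an iterable of text lines."""
    counts = Counter()
    for line in lines:
        # Skip blank lines, comments, and directives such as ##gff-version.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) == 9:
            counts[fields[2]] += 1
    return counts


# Hypothetical records, for illustration only.
example_lines = [
    "##gff-version 3",
    "contigA\tdemo\tgene\t100\t900\t.\t+\t.\tID=geneA",
    "contigA\tdemo\tmRNA\t100\t900\t.\t+\t.\tID=geneA.1;Parent=geneA",
    "contigA\tdemo\tCDS\t100\t400\t.\t+\t0\tParent=geneA.1",
    "contigA\tdemo\tCDS\t500\t900\t.\t+\t2\tParent=geneA.1",
]
print(count_feature_types(example_lines))
```

For the real annotation, a text handle opened with gzip.open("MSYG.v1.0.gene.model.gff3.gz", "rt") can be passed in place of example_lines.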

Functional Analysis

Functional annotation for the Citrus mangshanensis ASM3044989v1 assembly is available for download below. The predicted proteins were analyzed with InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan Citrus_mangshanensis_ASM3044989v1.Pfam.tsv.gz
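
The Pfam assignments can be grouped by protein for downstream filtering. The sketch below assumes the download uses the standard InterProScan TSV layout, in which column 1 is the protein accession, column 4 the analysis name (for example Pfam), and column 5 the signature accession; this has not been checked against the file itself. The protein IDs and rows are hypothetical.

```python
from collections import defaultdict


def pfam_by_protein(lines):
    """Map each protein accession to the set of Pfam accessions assigned to it."""
    domains = defaultdict(set)
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        # Assumed layout: fields[0] = protein, fields[3] = analysis,
        # fields[4] = signature accession.
        if len(fields) < 5 or fields[3] != "Pfam":
            continue
        domains[fields[0]].add(fields[4])
    return dict(domains)


# Hypothetical rows, for illustration only.
example_rows = [
    "\t".join(["protA", "md5a", "380", "Pfam", "PF00646", "F-box domain",
               "10", "50", "1.0E-5", "T", "01-01-2024", "-", "-"]),
    "\t".join(["protB", "md5b", "230", "Pfam", "PF00445", "Ribonuclease T2 family",
               "30", "210", "2.0E-40", "T", "01-01-2024", "-", "-"]),
    "\t".join(["protB", "md5b", "230", "OtherAnalysis", "X0001", "not a Pfam hit",
               "30", "210", "1.0E-3", "T", "01-01-2024", "-", "-"]),
]
print(pfam_by_protein(example_rows))
```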

S genes

Summary

Query      Contig     Size (bp)  Coordinates                       BLASTn Hit             BLASTn %ID  Domain
SLF13ψ     contig135  520690     46320-45244                       ASM2964120v1, SLF13    98.795      -
SLF3       contig135  520690     64230-65339                       PB533_SCSK_HAP2, SLF3  96.216      F-box; F_box_assoc
SLF4       contig135  520690     78761-79876                       PP719843.1, S30-SLF4   94.176      F-box; F_box_assoc
SLF5       contig135  520690     158103-159227                     ASM2964120v1, SLF5     92.851      F-box; F_box_assoc
SLF6       contig135  520690     182860-183999                     ASM2964120v1, SLF6     93.634      F-box; F_box_assoc
SLF7       contig135  520690     222200-221073                     ASM2964120v1, SLF7     91.851      F-box; F_box_assoc
SLF9       contig135  520690     250132-251271                     ASM2964120v1, SLF9     91.373      F-box; F_box_assoc
SLF8       contig135  520690     266274-267413                     ASM2964120v1, SLF8-2   93.684      F-box; F_box_assoc
SLF10ψ     contig135  520690     347044-348079                     ASM2964120v1, SLF10    92.954      -
SLF11      contig135  520690     364830-363676                     ASM2964120v1, SLF11    94.934      F-box; F_box_assoc
SLF12      contig135  520690     374773-373654                     ASM2964120v1, SLF12    90.893      F-box; F_box_assoc
SLF13      contig135  520690     390406-391521                     ASM2964120v1, SLF13    92.242      F-box; F_box_assoc
SLF2       contig135  520690     400446-399322                     PP719841.1, S30-SLF2   96.622      F-box; F_box_assoc
SLF1       contig135  520690     409463-408363                     PP719840.1, S30-SLF1   97.366      F-box; F_box_assoc
SLF1-2     contig285  9003745    7859314-7860414                   Citrus maxima SLF_S2C  98.547      F-box; F_box_assoc
SLF2-2     contig285  9003745    7870101-7871216                   PP719827.1, S2-SLF2    99.104      F-box; F_box_assoc
SLF3-2     contig285  9003745    7892799-7893920                   PP719828.1, S2-SLF3    98.841      F-box; F_box_assoc
SLF4-2     contig285  9003745    7904196-7905299                   PP719843.1, S30-SLF4   91.561      F-box; F_box_assoc
SLF5-2     contig285  9003745    7921796-7920672                   PP719830.1, S2-SLF5    99.2        F-box; F_box_assoc
SLF6-2     contig285  9003745    7961077-7959944                   PP719831.1, S2-SLF6    98.942      F-box; F_box_assoc
SLF7-2     contig285  9003745    7966329-7965199                   PP719832.1, S2-SLF7    99.293      F-box; F_box_assoc
SLF9-2     contig285  9003745    7975089-7976234                   PP719834.1, S2-SLF9    99.302      F-box; F_box_assoc
SLF8-2     contig285  9003745    7980615-7981754                   PP719833.1, S2-SLF8    98.509      F-box; F_box_assoc
SLF10-2    contig285  9003745    8002841-8003965                   ASM2964120v1, SLF10    99.564      F-box; F_box_assoc
SLF11-2    contig285  9003745    8006447-8005272                   ASM2964120v1, SLF11    98.895      F-box; F_box_assoc
SLF12-2    contig285  9003745    8010578-8009457                   ASM2964120v1, SLF12    98.93       F-box; F_box_assoc
SLF13-2    contig285  9003745    8021118-8022173                   ASM2964120v1, SLF13    99.071      F-box; F_box_assoc
S-RNase-1  contig135  520690     131165-131404, 131508-131945      OQ672689.1, S23-RNase  100         Ribonuclease T2
S-RNase-2  contig285  9003745    7938886-7939140, 7939233-7939676  OQ672700.1, S2-RNase   100         Ribonuclease T2
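
Some coordinates in the Summary table are listed with the start greater than the end (for example 46320-45244), and the S-RNase entries list two comma-separated intervals. The sketch below shows one way to pull the corresponding sequence out of a contig. It rests on assumptions the page does not state: that coordinates are 1-based and inclusive, that a descending pair marks a gene on the reverse strand, and that multiple intervals are segments to be joined in the listed order. The function names and the toy contig are illustrative.

```python
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract(contig_seq, coordinates):
    """Extract and join the intervals in a string such as '3-6' or '1-2, 9-10'.

    Assumes 1-based inclusive coordinates; a pair with start > end is
    treated as lying on the reverse strand (an assumption, see above).
    """
    parts = []
    for interval in coordinates.split(","):
        start, end = (int(x) for x in interval.strip().split("-"))
        if start <= end:
            parts.append(contig_seq[start - 1:end])
        else:
            parts.append(reverse_complement(contig_seq[end - 1:start]))
    return "".join(parts)


# Toy contig (hypothetical sequence).
toy = "AACCGTTTAC"
print(extract(toy, "3-6"))        # forward interval
print(extract(toy, "6-3"))        # same interval, reverse strand
print(extract(toy, "1-2, 9-10"))  # two segments joined in listed order
```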

Citrus S genes Nucleotide

Citrus S genes Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences