Analysis Name | Citrus sinensis cv. Valencia DVS v1.0 Assembly & Annotation |
Sequencing technology | PacBio Sequel II |
Assembly method | FALCON-Unzip v. 2.0; CANU v. 2.0 |
Release Date | 2022-02-11 |
Wu B, Yu Q, Deng Z, Duan Y, Luo F, Gmitter F Jr. A chromosome-level phased genome enabling allele-level studies in sweet orange: a case study on citrus Huanglongbing tolerance. Hortic Res. 2022 Nov 3;10(1):uhac247. doi: 10.1093/hr/uhac247.
Abstract
Sweet orange originated from the introgressive hybridizations of pummelo and mandarin resulting in a highly heterozygous genome. How alleles from the two species cooperate in shaping sweet orange phenotypes under distinct circumstances is unknown. Here, we assembled a chromosome-level phased diploid Valencia sweet orange (DVS) genome with over 99.999% base accuracy and 99.2% gene annotation BUSCO completeness. DVS enables allele-level studies for sweet orange and other hybrids between pummelo and mandarin. We first configured an allele-aware transcriptomic profiling pipeline and applied it to 740 sweet orange transcriptomes. On average, 32.5% of genes have a significantly biased allelic expression in the transcriptomes. Different cultivars, transgenic lineages, tissues, development stages, and disease status all impacted allelic expressions and resulted in diversified allelic expression patterns in sweet orange, but particularly citrus Huanglongbing (HLB) shifted the allelic expression of hundreds of genes in leaves and calyx abscission zones. In addition, we detected allelic structural mutations in an HLB-tolerant mutant (T19) and a more sensitive mutant (T78) through long-read sequencing. The irradiation-induced structural mutations mostly involved double-strand breaks, while most spontaneous structural mutations were transposon insertions. In the mutants, most genes with significant allelic expression ratio alterations (≥1.5-fold) were directly affected by those structural mutations. In T19, alleles located at a translocated segment terminal were upregulated, including CsDnaJ, CsHSP17.4B, and CsCEBPZ. Their upregulation is inferred to keep phloem protein homeostasis under the stress from HLB and enable subsequent stress responses observed in T19. DVS will advance allelic level studies in citrus.
Assembly statistics
Statistic | DVS_A1.0 | DVS_B1.0 |
Genome size | 299 Mb | 299.6 Mb |
Total ungapped length | 299 Mb | 299.6 Mb |
Number of chromosomes | 9 | 9 |
Number of scaffolds | 9 | 9 |
Scaffold N50 | 32.9 Mb | 32.3 Mb |
Scaffold L50 | 4 | 4 |
Number of contigs | 9 | 9 |
Contig N50 | 32.9 Mb | 32.3 Mb |
Contig L50 | 4 | 4 |
GC percent | 34 | 34 |
Genome coverage | 117.0x | 117.0x |
Assembly level | Complete Genome | Complete Genome |
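For readers unfamiliar with the N50 and L50 rows above, the following is a minimal sketch of how these two statistics are conventionally derived from a list of sequence lengths. The function name `n50_l50` and the example lengths are hypothetical and are not taken from the DVS assemblies.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of sequence lengths.

    N50 is the length of the shortest sequence in the smallest set of
    longest sequences that together cover at least half of the total
    length; L50 is the number of sequences in that set.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            return length, count


# Hypothetical lengths, chosen only to illustrate the calculation.
example_lengths = [50, 40, 30, 20, 10]
print(n50_l50(example_lengths))  # (40, 2)
```

When every chromosome is a single contig, as reported for both haplotypes here, the contig and scaffold values coincide.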
The Citrus sinensis cv. Valencia DVS v1.0 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | GCA_022201045.1_DVS_A1.0_genomic.fna.gz | GCA_022201065.1_DVS_B1.0_genomic.fna.gz |
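As a rough sanity check after download, the GC percent listed in the assembly statistics can be recomputed from either gzipped FASTA file. The sketch below uses only the Python standard library; the function name `gc_percent` is hypothetical, and it assumes that only unambiguous A, C, G, and T bases should count toward the denominator.

```python
import gzip


def gc_percent(fasta_gz_path):
    """Return the GC percentage of all sequences in a gzipped FASTA file.

    Assumption: ambiguous bases (e.g. N) are excluded from the count.
    """
    gc = 0
    acgt = 0
    with gzip.open(fasta_gz_path, "rt") as handle:
        for line in handle:
            if line.startswith(">"):
                continue  # skip header lines
            seq = line.strip().upper()
            gc += seq.count("G") + seq.count("C")
            acgt += sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("no A/C/G/T bases found")
    return 100.0 * gc / acgt


# Usage, once the file has been downloaded to the working directory:
# print(round(gc_percent("GCA_022201045.1_DVS_A1.0_genomic.fna.gz")))
```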
The Citrus sinensis cv. Valencia DVS v1.0 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | GCA_022201045.1_DVS_A1.0_genomic.gff.gz | GCA_022201065.1_DVS_B1.0_genomic.gff.gz |
CDS sequences (FASTA file) | GCA_022201045.1_DVS_A1.0_cds_from_genomic.fna.gz | GCA_022201065.1_DVS_B1.0_cds_from_genomic.fna.gz |
Protein sequences (FASTA file) | GCA_022201045.1_DVS_A1.0_protein.faa.gz | GCA_022201065.1_DVS_B1.0_protein.faa.gz |
Functional annotation for the Citrus sinensis cv. Valencia DVS v1.0 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains (Pfam).
Downloads
Domain from InterProScan | Citrus_sinensis_DVS_A_v1.0.Pfam.tsv.gz | Citrus_sinensis_DVS_B_v1.0.Pfam.tsv.gz |
Summary
Query | Chr | Size(bp) | Coordinates | BLASTn Hit | BLASTn %ID | Domain |
SLF13 | CM039167.1 | 29493366 | 1099357-1098281 | PP719853.1, S30-SLF13 | 98.607 | F-box; F_box_assoc |
SLF12 | CM039167.1 | 29493366 | 1110704-1111825 | ASM2964120v1, SLF12_cds | 99.198 | F-box; F_box_assoc |
SLF11 | CM039167.1 | 29493366 | 1115084-1116259 | ASM2964120v1, SLF11_cds | 98.81 | F-box; F_box_assoc |
SLF10 | CM039167.1 | 29493366 | 1118733-1117588 | ASM2964120v1, SLF10_cds | 99.476 | F-box; F_box_assoc |
SLF8 | CM039167.1 | 29493366 | 1140261-1139122 | ASM2964120v1, SLF8-2_cds | 98.509 | F-box; F_box_assoc |
SLF9 | CM039167.1 | 29493366 | 1144296-1143154 | ASM2964120v1, SLF9_cds | 93.613 | F-box; F_box_assoc |
SLF7 | CM039167.1 | 29493366 | 1170672-1171802 | PP719832.1, S2-SLF7 | 93.546 | F-box; F_box_assoc |
SLF6 | CM039167.1 | 29493366 | 1180476-1181612 | ASM2964120v1, SLF6_cds | 91.645 | F-box; F_box_assoc |
SLF5 | CM039167.1 | 29493366 | 1232501-1231377 | ASM2964120v1, SLF5_cds | 90.311 | F-box; F_box_assoc |
SLF4 | CM039167.1 | 29493366 | 1299068-1297947 | PP719843.1, S30-SLF4 | 90.446 | F-box; F_box_assoc |
SLF3 | CM039167.1 | 29493366 | 1325939-1324827 | PP719842.1, S30-SLF3 | 90.999 | F-box; F_box_assoc |
SLF2 | CM039167.1 | 29493366 | 1360020-1358926 | PP719841.1, S30-SLF2 | 91.065 | F-box; F_box_assoc |
S-RNaseψ | CM039167.1 | 29493366 | 1250582-1250346,1250224-1249797 | MN652912.1, SmR-RNase | 99.9 | - |
SLF13-2 | CM039176.1 | 31234485 | 932558-931482 | PP719853.1, S30-SLF13 | 98.422 | F-box; F_box_assoc |
SLF12-2 | CM039176.1 | 31234485 | 943595-944716 | PP719852.1, S30-SLF12 | 99.02 | F-box; F_box_assoc |
SLF11-2 | CM039176.1 | 31234485 | 947732-948907 | ASM2964120v1, SLF11_cds | 99.49 | F-box; F_box_assoc |
SLF10-2 | CM039176.1 | 31234485 | 951327-950182 | ASM2964120v1, SLF10_cds | 99.738 | F-box; F_box_assoc |
SLF8-2 | CM039176.1 | 31234485 | 978606-977467 | ASM2964120v1, SLF8-2_cds | 98.509 | F-box; F_box_assoc |
SLF9-2 | CM039176.1 | 31234485 | 982506-981364 | ASM2964120v1, SLF9_cds | 93.613 | F-box; F_box_assoc |
SLF7-2 | CM039176.1 | 31234485 | 985115-986245 | PP719832.1, S2-SLF7 | 92.927 | F-box; F_box_assoc |
SLF8-3 | CM039176.1 | 31234485 | 988433-987294 | ASM2964120v1, SLF8_cds | 96.74 | F-box; F_box_assoc |
SLF2-2 | CM039176.1 | 31234485 | 995049-996161 | PP719841.1, S30-SLF2 | 88.839 | F-box; F_box_assoc |
SLF5-2 | CM039176.1 | 31234485 | 1007849-1008976 | PB533_SCSK_HAP2, SLF5 | 86.833 | F-box; F_box_assoc |
SLF4-2 | CM039176.1 | 31234485 | 1050013-1048898 | PP719843.1, S30-SLF4 | 91.667 | F-box; F_box_assoc |
SLF3-2 | CM039176.1 | 31234485 | 1054087-1052966 | PP719842.1, S30-SLF3 | 95.455 | F-box; F_box_assoc |
SLF2-3 | CM039176.1 | 31234485 | 1060109-1058985 | ASM2964120v1, SLF2_cds | 98.933 | F-box; F_box_assoc |
SLF1 | CM039176.1 | 31234485 | 1063192-1062092 | ASM2964120v1, SLF1_cds | 98.093 | F-box; F_box_assoc |
S-RNase-2 | CM039176.1 | 31234485 | 1045922-1046164,1046258-1046710 | MN652903.1, S7-RNase | 99.7 | Ribonuclease T2 |
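The Coordinates column above mixes single ranges with comma-separated multi-segment entries, and some ranges are written with the larger position first. The sketch below shows one way to parse such a field. It rests on two assumptions that the table does not state: that a range written in descending order denotes the minus strand, and that positions are 1-based and inclusive. The function name `parse_coordinates` is hypothetical.

```python
def parse_coordinates(field):
    """Parse a Coordinates field into (strand, total_length_bp).

    Assumptions (not stated in the table): start > end means the
    minus strand, and positions are 1-based inclusive.
    """
    segments = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        segments.append((int(start_text), int(end_text)))

    # Every segment of one feature is expected to share an orientation.
    strands = {"-" if start > end else "+" for start, end in segments}
    if len(strands) != 1:
        raise ValueError(f"mixed orientations in {field!r}")

    total = sum(abs(start - end) + 1 for start, end in segments)
    return strands.pop(), total


# Values copied from the SLF13 and S-RNase-2 rows of the table.
print(parse_coordinates("1099357-1098281"))                  # ('-', 1077)
print(parse_coordinates("1045922-1046164,1046258-1046710"))  # ('+', 696)
```

Under these assumptions, the summed segment length is the spliced length of the feature, not its genomic span.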
Citrus S genes Nucleotide
Citrus S genes Protein