Citrus australis v1.0 Assembly & Annotation

Overview

Analysis Name Citrus australis v1.0 Assembly & Annotation
Sequencing technology PacBio
Assembly method Hifiasm
Release Date 2023-02-07
Reference Publication(s)

Nakandala U, Masouleh AK, Smith MW, Furtado A, Mason P, Constantin L, Henry RJ. Haplotype resolved chromosome level genome assembly of Citrus australis reveals disease resistance and other citrus specific genes. Hortic Res. 2023 Apr 3;10(5):uhad058. doi: 10.1093/hr/uhad058.

Abstract

Recent advances in genome sequencing and assembly techniques have made it possible to achieve chromosome level reference genomes for citrus. Relatively few genomes have been anchored at the chromosome level and/or are haplotype phased, with the available genomes of varying accuracy and completeness. We now report a phased high-quality chromosome level genome assembly for an Australian native citrus species; Citrus australis (round lime) using highly accurate PacBio HiFi long reads, complemented with Hi-C scaffolding. Hifiasm with Hi-C integrated assembly resulted in a 331 Mb genome of C. australis with two haplotypes of nine pseudochromosomes with an N50 of 36.3 Mb and 98.8% genome assembly completeness (BUSCO). Repeat analysis showed that more than 50% of the genome contained interspersed repeats. Among them, LTR elements were the predominant type (21.0%), of which LTR Gypsy (9.8%) and LTR copia (7.7%) elements were the most abundant repeats. A total of 29 464 genes and 32 009 transcripts were identified in the genome. Of these, 28 222 CDS (25 753 genes) had BLAST hits and 21 401 CDS (75.8%) were annotated with at least one GO term. Citrus specific genes for antimicrobial peptides, defense, volatile compounds and acidity regulation were identified. The synteny analysis showed conserved regions between the two haplotypes with some structural variations in Chromosomes 2, 4, 7 and 8. This chromosome scale, and haplotype resolved C. australis genome will facilitate the study of important genes for citrus breeding and will also allow the enhanced definition of the evolutionary relationships between wild and domesticated citrus species.

Assembly statistics

Assembly

The Citrus australis v1.0 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHBQDX00000000.genome.fasta.gz

Gene Predictions

The Citrus australis v1.0 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHBQDX00000000.gff.gz
CDS sequences (FASTA file) GWHBQDX00000000.CDS.fasta.gz
Protein sequences (FASTA file) GWHBQDX00000000.Protein.faa.gz

Functional Analysis

Functional annotation for the Citrus australis v1.0 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Citrus_australis_v1.0.Pfam.tsv.gz

S genes

Summary

QueryChrSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF1Chr13129647230087325-30088425ASM2964120v1, SLF1-297.548F-box; F_box_assoc
SLF2Chr13129647230105588-30106712PP719841.1, S30-SLF292.192F-box; F_box_assoc
SLF3Chr13129647230116756-30117877PP719842.1, S30-SLF392.87F-box; F_box_assoc
SLF4Chr13129647230135591-30136658PP719843.1, S30-SLF491.927F-box; F_box_assoc
SLF8Chr13129647230223846-30222707PP719833.1, S2-SLF887.642F-box; F_box_assoc
SLF9Chr13129647230266971-30265829ASM2964120v1, SLF991.419F-box; F_box_assoc
SLF7Chr13129647230285781-30286911PP719832.1, S2-SLF793.369F-box; F_box_assoc
SLF6Chr13129647230294550-30295692PP719845.1, S30-SLF690.901F-box; F_box_assoc
SLF5Chr13129647230301116-30299989PP719830.1, S2-SLF589.6F-box; F_box_assoc
SLF10Chr13129647230326439-30327584PP719835.1, S2-SLF1094.318F-box; F_box_assoc
SLF11Chr13129647230340821-30339646ASM2964120v1, SLF1198.639F-box; F_box_assoc
SLF12Chr13129647230344970-30343849PP719837.1, S2-SLF1299.109F-box; F_box_assoc
SLF13ψChr13129647230355546-30356621PP719853.1, S30-SLF1398.329-

Citrus S genes Nucleotide

Citrus S genes Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences