Fragaria nilgerrensis SCBG_Genome_v1.0 Assembly & Annotation

Overview

Analysis Name Fragaria nilgerrensis SCBG_Genome_v1.0 Assembly & Annotation
Sequencing technology PacBio
Assembly method FALCON (v0.3.0)
Release Date 2020-09-20
Reference Publication(s)

Feng C, Wang J, Harris AJ, Folta KM, Zhao M, Kang M. Tracing the Diploid Ancestry of the Cultivated Octoploid Strawberry. Mol Biol Evol. 2021 Jan 23;38(2):478-485. doi: 10.1093/molbev/msaa238.

Abstract

The commercial strawberry, Fragaria × ananassa, is a recent allo-octoploid that is cultivated worldwide. However, other than Fragaria vesca, which is universally accepted as one of its diploid ancestors, its other early diploid progenitors remain unclear. Here, we performed comparative analyses of the genomes of five diploid strawberries, F. iinumae, F. vesca, F. nilgerrensis, F. nubicola, and F. viridis, of which the latter three are newly sequenced. We found that the genomes of these species share highly conserved gene content and gene order. Using an alignment-based approach, we show that F. iinumae and F. vesca are the diploid progenitors to the octoploid F. × ananassa, whereas the other three diploids that we analyzed in this study are not parental species. We generated a fully resolved, dated phylogeny of Fragaria, and determined that the genus arose ∼6.37 Ma. Our results effectively resolve conflicting hypotheses regarding the putative diploid progenitors of the cultivated strawberry, establish a reliable backbone phylogeny for the genus, and provide genetic resources for molecular breeding.

Assembly statistics

Genome-sequencing depth (×) 373
Estimated genome size (Mb) 279
Total length of scaffolds (Mb) 272.0
N50 of scaffolds (Mb) 37.5
Total length of contigs (Mb) 271.9
N50 of contigs (Mb) 4.0
Mapping rate of reads from short-insert libraries 96.3%
CEGMA evaluation 97.2%
BUSCO evaluation 93.7%
LAI evaluation 10.2
EST evaluation 92.5%
RNA-Seq evaluation 88.6–93.4%
Percentage of TE 43.60
Percentage of LTRs 35.29
No. of predicted protein-coding genes 29,068
No. of genes annotated to public database 26,353
Assembly level Chromosome
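
For readers unfamiliar with the N50 values reported above: N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The following is a minimal Python sketch of that calculation. The function name `n50` and the toy lengths are illustrative assumptions and are not taken from this assembly or from the authors' pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length that pushes the running sum to half
        # of the total is the N50.
        if running >= half_total:
            return length
    raise ValueError("no sequence lengths given")


# Toy lengths for illustration only (hypothetical, not from this assembly).
example_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50(example_lengths))  # prints 70
```

Applied to scaffold lengths this gives a scaffold N50, and applied to contig lengths a contig N50.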

Assembly

The Fragaria nilgerrensis SCBG_Genome_v1.0 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) Fnil_assembly.fasta.gz
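
As a rough illustration of how the gzipped FASTA file might be inspected after download, the sketch below tallies the length of each sequence record using only the Python standard library. It assumes the file has been saved locally under the name listed above. The helper names `fasta_lengths` and `fasta_lengths_from_gz` and the toy records are hypothetical.

```python
import gzip


def fasta_lengths(lines):
    """Map each FASTA record ID to its sequence length."""
    lengths = {}
    current = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the ID.
            header = line[1:].split()
            current = header[0] if header else ""
            lengths[current] = 0
        elif current is not None:
            lengths[current] += len(line)
    return lengths


def fasta_lengths_from_gz(path):
    """Read a gzip-compressed FASTA file and return record lengths."""
    with gzip.open(path, "rt") as handle:
        return fasta_lengths(handle)


# Toy records for illustration only (not real sequence data).
demo = [">chr1 toy record", "ACGTAC", "GT", ">chr2", "NNNN"]
print(fasta_lengths(demo))  # prints {'chr1': 8, 'chr2': 4}
```

With the file in the working directory, `fasta_lengths_from_gz("Fnil_assembly.fasta.gz")` would return one entry per sequence in the assembly.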

Gene Predictions

Gene prediction files for the Fragaria nilgerrensis SCBG_Genome_v1.0 genome are available in GFF3 and FASTA formats.

Downloads

Genes (GFF3 file) Fnil.gene.gff.gz
CDS sequences (FASTA file) Fnil.cds.fa.gz
Protein sequences (FASTA file) Fnil.pep.fa.gz
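
A quick sanity check on a downloaded GFF3 file is to count features by type. The sketch below relies on the standard GFF3 layout (nine tab-separated columns, feature type in the third column, comment lines starting with `#`). Which feature types this particular file uses is an assumption that has not been verified here, and the function name `count_feature_types` and the toy records are hypothetical.

```python
import gzip
from collections import Counter


def count_feature_types(lines):
    """Count GFF3 features by the type given in column 3."""
    counts = Counter()
    for line in lines:
        if line.startswith("##FASTA"):
            # Anything after this directive is sequence, not annotation.
            break
        if line.startswith("#") or not line.strip():
            continue
        columns = line.rstrip("\n").split("\t")
        if len(columns) == 9:
            counts[columns[2]] += 1
    return counts


# Toy records for illustration only (IDs and coordinates are made up).
demo = [
    "##gff-version 3",
    "chr1\ttoy\tgene\t1\t900\t.\t+\t.\tID=g1",
    "chr1\ttoy\tmRNA\t1\t900\t.\t+\t.\tID=g1.t1;Parent=g1",
    "chr1\ttoy\tCDS\t1\t300\t.\t+\t0\tParent=g1.t1",
    "chr1\ttoy\tCDS\t600\t900\t.\t+\t0\tParent=g1.t1",
]
print(count_feature_types(demo)["CDS"])  # prints 2
```

For the real file, the same function can be fed a handle from `gzip.open("Fnil.gene.gff.gz", "rt")`. If the file uses a `gene` feature type, its count could be compared against the number of predicted protein-coding genes in the statistics table.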

Functional Analysis

Functional annotation for the Fragaria nilgerrensis SCBG_Genome_v1.0 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan Fragaria_nilgerrensis_SCBG_Genome_v1.0.Pfam.tsv.gz
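
The column layout of the Pfam file is not described on this page. If it follows the standard InterProScan TSV output (protein accession in column 1, analysis name in column 4, signature accession in column 5), Pfam domains could be grouped per protein roughly as follows. This layout is an assumption that should be checked against the downloaded file. The function name `pfam_by_protein` and the toy protein IDs are hypothetical.

```python
from collections import defaultdict


def pfam_by_protein(lines):
    """Group Pfam accessions by protein, assuming InterProScan TSV columns."""
    domains = defaultdict(set)
    for line in lines:
        columns = line.rstrip("\n").split("\t")
        # Assumed layout: column 1 = protein, column 4 = analysis,
        # column 5 = signature accession.
        if len(columns) < 5 or columns[3] != "Pfam":
            continue
        domains[columns[0]].add(columns[4])
    return dict(domains)


# Toy rows for illustration only (protein IDs and checksums are made up).
demo_rows = [
    "toy_protein_1\tmd5a\t350\tPfam\tPF00069\tProtein kinase domain\t10\t260",
    "toy_protein_1\tmd5a\t350\tPfam\tPF00400\tWD domain, G-beta repeat\t270\t310",
    "toy_protein_2\tmd5b\t120\tOther\tX0001\tnot a Pfam hit\t1\t50",
]
print(sorted(pfam_by_protein(demo_rows)["toy_protein_1"]))
# prints ['PF00069', 'PF00400']
```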