Fragaria nubicola SCBG Genome v1.0 Assembly & Annotation

Overview

Analysis Name Fragaria nubicola SCBG Genome v1.0 Assembly & Annotation
Sequencing technology Pacbio
Assembly method FALCON (v0.3.0)
Release Date 2020-09-21
Reference Publication(s)

Feng C, Wang J, Harris AJ, Folta KM, Zhao M, Kang M. Tracing the Diploid Ancestry of the Cultivated Octoploid Strawberry. Mol Biol Evol. 2021 Jan 23;38(2):478-485. doi: 10.1093/molbev/msaa238.

Abstract

The commercial strawberry, Fragaria × ananassa, is a recent allo-octoploid that is cultivated worldwide. However, other than Fragaria vesca, which is universally accepted one of its diploid ancestors, its other early diploid progenitors remain unclear. Here, we performed comparative analyses of the genomes of five diploid strawberries, F. iinumae, F. vesca, F. nilgerrensis, F. nubicola, and F. viridis, of which the latter three are newly sequenced. We found that the genomes of these species share highly conserved gene content and gene order. Using an alignment-based approach, we show that F. iinumae and F. vesca are the diploid progenitors to the octoploid F. × ananassa, whereas the other three diploids that we analyzed in this study are not parental species. We generated a fully resolved, dated phylogeny of Fragaria, and determined that the genus arose ∼6.37 Ma. Our results effectively resolve conflicting hypotheses regarding the putative diploid progenitors of the cultivated strawberry, establish a reliable backbone phylogeny for the genus, and provide genetic resources for molecular breeding.

Assembly statistics

Genome-sequencing depth (×)406
Estimated genome size (Mb)273
Total length of scaffolds (Mb)247.5
N50 of scaffolds (Mb)35.0
Total length of contigs (Mb)247.2
N50 of contigs (Mb)2.6
Mapping rate of reads from short-insert libraries90.0%
CEGMA evaluation92.3%
BUSCO evaluation87.5%
LAI evaluation17.5
EST evaluation92.4%
RNA-Seq evaluation81.0–84.4%
Percentage of TE43.07
Percentage of LTRs32.87
No. of predicted protein-coding genes27,594
No. of genes annotated to public database25,418

Assembly

The Fragaria nubicola SCBG Genome v1.0 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) Fnub_assembly.fasta.gz

Gene Predictions

The Fragaria nubicola SCBG Genome v1.0 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Fnub.gene.gff.gz
CDS sequences (FASTA file) Fnub.cds.fa.gz
Protein sequences (FASTA file) Fnub.pep.fa.gz

Functional Analysis

Functional annotation for the Fragaria nubicola SCBG Genome v1.0 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Fragaria_nubicola_SCBG_Genome_v1.0.Pfam.tsv.gz

S genes

Summary

QueryChrSize(bp)CoordinatesDomain
SLF-1Fnub64103706389658-90860F-box,F-box associated
SLF-2Fnub64103706367571-68256,68541-68886F-box
SLF-3Fnub641037063749896-749949,750066-750579,750836-751197F-box,F-box associated
S-RNaseFnub641037063625279-625204,624982-624813,609999-609550Ribonuclease T2 family
SLF-4Fnub641037063772251-773136,773180-773250,773692-773772F-box associated

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences