Fragaria vesca v6 Assembly & Annotation

Overview

Analysis Name Fragaria vesca v6 Assembly & Annotation
Sequencing technology ONT, PacBio HiFi, Hi-C, Illumina, Full-length RNA-seq (ONT)
Assembly method Hifiasm (version 0.16.1), NextDenovo
Release Date 2023-02-20
Reference Publication(s)

Zhou Y, Xiong J, Shu Z, Dong C, Gu T, Sun P, He S, Jiang M, Xia Z, Xue J, Khan WU, Chen F, Cheng ZM. The telomere-to-telomere genome of Fragaria vesca reveals the genomic evolution of Fragaria and the origin of cultivated octoploid strawberry. Hortic Res. 2023 Feb 20;10(4):uhad027. doi: 10.1093/hr/uhad027.

Abstract

Fragaria vesca, commonly known as wild or woodland strawberry, is the most widely distributed diploid Fragaria species and is native to Europe and Asia. Because of its small plant size, low heterozygosity, and relative ease of genetic transformation, F. vesca has been a model plant for fruit research since the publication of its Illumina-based genome in 2011. However, its genomic contribution to octoploid cultivated strawberry remains a long-standing question. Here, we de novo assembled and annotated a telomere-to-telomere, gap-free genome of F. vesca ‘Hawaii 4’, with all seven chromosomes assembled into single contigs, providing the highest completeness and assembly quality to date. The gap-free genome is 220 785 082 bp in length and encodes 36 173 protein-coding gene models, including 1153 newly annotated genes. All 14 telomeres and seven centromeres were annotated within the seven chromosomes. Among the three previously recognized wild diploid strawberry ancestors, F. vesca, F. iinumae, and F. viridis, phylogenomic analysis showed that F. vesca and F. viridis are the ancestors of the cultivated octoploid strawberry F. × ananassa, and F. vesca is its closest relative. Three subgenomes of F. × ananassa belong to the F. vesca group, and one is sister to F. viridis. We anticipate that this high-quality, telomere-to-telomere, gap-free F. vesca genome, combined with our phylogenomic inference of the origin of cultivated strawberry, will provide insight into the genomic evolution of Fragaria and facilitate strawberry genetics and molecular breeding.

Assembly statistics

Genome size (Mb)220.8
Contig N50 (Mb)34.34
Number of contigs7
Gaps0
Number of telomeres14
Number of centromeres7
GC content (%)38.5
Number of gene models36 173
BUSCOs (%)98.8
Assembly level Telomere-to-telomere

Assembly

The Fragaria vesca v6 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) Fragaria_vesca_v6_genome.fasta.gz

Gene Predictions

The Fragaria vesca v6 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Fragaria_vesca_v6_genome.gff.gz
CDS sequences (FASTA file) Fragaria_vesca_v6_cds.fasta.gz
Protein sequences (FASTA file) Fragaria_vesca_v6_proteins.fasta.gz

Functional Analysis

Functional annotation for the Fragaria vesca v6 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Fragaria_vesca_v6.Pfam.tsv.gz
© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences