Analysis Name | Fragaria vesca v6 Assembly & Annotation |
Sequencing technology | ONT, PacBio HiFi, Hi-C, Illumina, Full-length RNA-seq (ONT) |
Assembly method | Hifiasm (version 0.16.1), NextDenovo |
Release Date | 2023-02-20 |
Zhou Y, Xiong J, Shu Z, Dong C, Gu T, Sun P, He S, Jiang M, Xia Z, Xue J, Khan WU, Chen F, Cheng ZM. The telomere-to-telomere genome of Fragaria vesca reveals the genomic evolution of Fragaria and the origin of cultivated octoploid strawberry. Hortic Res. 2023 Feb 20;10(4):uhad027. doi: 10.1093/hr/uhad027.
AbstractFragaria vesca, commonly known as wild or woodland strawberry, is the most widely distributed diploid Fragaria species and is native to Europe and Asia. Because of its small plant size, low heterozygosity, and relative ease of genetic transformation, F. vesca has been a model plant for fruit research since the publication of its Illumina-based genome in 2011. However, its genomic contribution to octoploid cultivated strawberry remains a long-standing question. Here, we de novo assembled and annotated a telomere-to-telomere, gap-free genome of F. vesca ‘Hawaii 4’, with all seven chromosomes assembled into single contigs, providing the highest completeness and assembly quality to date. The gap-free genome is 220 785 082 bp in length and encodes 36 173 protein-coding gene models, including 1153 newly annotated genes. All 14 telomeres and seven centromeres were annotated within the seven chromosomes. Among the three previously recognized wild diploid strawberry ancestors, F. vesca, F. iinumae, and F. viridis, phylogenomic analysis showed that F. vesca and F. viridis are the ancestors of the cultivated octoploid strawberry F. × ananassa, and F. vesca is its closest relative. Three subgenomes of F. × ananassa belong to the F. vesca group, and one is sister to F. viridis. We anticipate that this high-quality, telomere-to-telomere, gap-free F. vesca genome, combined with our phylogenomic inference of the origin of cultivated strawberry, will provide insight into the genomic evolution of Fragaria and facilitate strawberry genetics and molecular breeding.
Assembly statistics
Genome size (Mb) | 220.8 |
Contig N50 (Mb) | 34.34 |
Number of contigs | 7 |
Gaps | 0 |
Number of telomeres | 14 |
Number of centromeres | 7 |
GC content (%) | 38.5 |
Number of gene models | 36 173 |
BUSCOs (%) | 98.8 |
Assembly level | Telomere-to-telomere |
The Fragaria vesca v6 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | Fragaria_vesca_v6_genome.fasta.gz |
The Fragaria vesca v6 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | Fragaria_vesca_v6_genome.gff.gz |
CDS sequences (FASTA file) | Fragaria_vesca_v6_cds.fasta.gz |
Protein sequences (FASTA file) | Fragaria_vesca_v6_proteins.fasta.gz |
Functional annotation for the Fragaria vesca v6 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).
Downloads
Domain from InterProScan | Fragaria_vesca_v6.Pfam.tsv.gz |
Fragaria S genes Nucleotide
Fragaria S genes Protein