Solanum lycopersicoides 'ZY64 (cultivar)' ASM2770504v1 Assembly & Annotation

Overview

Analysis Name Solanum lycopersicoides 'ZY64 (cultivar)' ASM2770504v1 Assembly & Annotation
Sequencing technology PacBio
Assembly method Canu v. 1.5
Release Date 2023-01-11
Reference Publication(s)

Li N, He Q, Wang J, Wang B, Zhao J, Huang S, Yang T, Tang Y, Yang S, Aisimutuola P, Xu R, Hu J, Jia C, Ma K, Li Z, Jiang F, Gao J, Lan H, Zhou Y, Zhang X, Huang S, Fei Z, Wang H, Li H, Yu Q. Super-pangenome analyses highlight genomic diversity and structural variation across wild and cultivated tomato species. Nat Genet. 2023 May;55(5):852-860. doi: 10.1038/s41588-023-01340-y.

Abstract

Effective utilization of wild relatives is key to overcoming challenges in genetic improvement of cultivated tomato, which has a narrow genetic basis; however, current efforts to decipher high-quality genomes for tomato wild species are insufficient. Here, we report chromosome-scale tomato genomes from nine wild species and two cultivated accessions, representative of Solanum section Lycopersicon, the tomato clade. Together with two previously released genomes, we elucidate the phylogeny of Lycopersicon and construct a section-wide gene repertoire. We reveal the landscape of structural variants and provide entry to the genomic diversity among tomato wild relatives, enabling the discovery of a wild tomato gene with the potential to increase yields of modern cultivated tomatoes. Construction of a graph-based genome enables structural-variant-based genome-wide association studies, identifying numerous signals associated with tomato flavor-related traits and fruit metabolites. The tomato super-pangenome resources will expedite biological studies and breeding of this globally important crop.

Assembly statistics

Genome size 1.2 Gb
Number of scaffolds 1,175
Scaffold N50 91.1 Mb
Scaffold L50 6
Number of contigs 3,965
Contig N50 579 kb
Contig L50 513
Assembly level Scaffold

Assembly

The Solanum lycopersicoides 'ZY64 (cultivar)' ASM2770504v1 assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_027705045.1_ASM2770504v1_genomic.fna.gz

Gene Predictions

The Solanum lycopersicoides 'ZY64 (cultivar)' ASM2770504v1 genome gene prediction files are available in FASTA format.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) S.lycopersicoides.cds.fa.gz
Protein sequences (FASTA file) S.lycopersicoides.pep.fa.gz

Functional Analysis

Functional annotation for the Solanum lycopersicoides 'ZY64 (cultivar)' ASM2770504v1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Solanum_lycopersicoides_ZY64_ASM2770504v1.Pfam.tsv.gz

S genes

Summary

QueryScaffoldSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF5JAKWJB01
0000210.1
11158223278-22109Solanum chilense KJ814884.1, SLF598.1F-box domain
SLF6JAKWJB01
0000210.1
11158243436-42291Solanum lycopersicoides KU987626.1, SLF698.9F-box domain
SLF19JAKWJB01
0000250.1
859998033-6924Solanum lycopersicum SL2.31, SLF1996.2F-box domain
SLF10ΨJAKWJB01
0000285.1
8150618076-19304Solanum chilense KJ814888.1, SLF1094.6-
SLF3ΨJAKWJB01
0000846.1
8199258326-59483Solanum pennellii BK009230.1, SLF397.2-
SLF11JAKWJB01
0000974.1
10070510888-9719Solanum peruvianum KJ814846.1, SLF196.3F-box domain
SLF15JAKWJB01
0001164.1
1359716473419543-3418284Solanum lycopersicum SL2.31, SLF1595.6F-box domain
SLF16JAKWJB01
0001164.1
1359716473967616-3966435Solanum lycopersicum SL2.31, SLF1696.4F-box domain
SLF23ΨJAKWJB01
0001164.1
13597164758860502-58861658Solanum lycopersicoides KU960925.1, SLF2398.9-
SLF5-2JAKWJB01
0001164.1
13597164773849360-73848173Solanum lycopersicoides KU987627.2, SLF597.3F-box domain
SLF4JAKWJB01
0001164.1
13597164773954796-73953630Solanum peruvianum KJ814848.1, SLF497F-box domain
SLF7ΨJAKWJB01
0001164.1
13597164774076071-74074903Solanum peruvianum KJ814851.1, SLF797-
SLF6-2ΨJAKWJB01
0001164.1
13597164774086192-74085087Solanum tuberosum DM8.1, SLF690.2-
SLF21ΨJAKWJB01
0001164.1
13597164774462830-74464140Solanum lycopersicoides KU960923.1, SLF2199.4-
SLF20ΨJAKWJB01
0001164.1
13597164774855994-74857158Solanum lycopersicoides KU960922.1, SLF2099.7-
SLF9ΨJAKWJB01
0001164.1
13597164775656502-75655360Solanum lycopersicoides KU987631.1, SLF999-
SLF10-2ΨJAKWJB01
0001164.1
13597164779658648-79657418Solanum chilense KJ814888.1, SLF1094.3-
SLF11-2ΨJAKWJB01
0001164.1
13597164783210579-83211750Solanum pennellii NM_001323461.1, SLF1197.4-
SLF11-3ΨJAKWJB01
0001164.1
13597164783265432-83264262Solanum pennellii NM_001323461.1, SLF1197.5-
SLF3-2ΨJAKWJB01
0001164.1
13597164785797952-85799113Solanum pennellii BK009230.1, SLF397.8-
SLF13JAKWJB01
0001164.1
13597164786547536-86546334Solanum pennellii NM_001323465.1, SLF1398.2F-box domain
SLF18ΨJAKWJB01
0001164.1
135971647104997980-104999093Solanum lycopersicum SL2.31, SLF1895.9-
SLF19-2ΨJAKWJB01
0001164.1
135971647105037649-105036548Solanum lycopersicum SL2.31, SLF1994-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences