Solanum multiinterruptum PG4060 Assembly & Annotation

Overview

Analysis Name Solanum multiinterruptum PG4060 Assembly & Annotation
Sequencing technology PacBio data and Hi-C data
Assembly method hifiasm (v.0.13)
Release Date 2022-06-08
Reference Publication(s)

Tang D, Jia Y, Zhang J, Li H, Cheng L, Wang P, Bao Z, Liu Z, Feng S, Zhu X, Li D, Zhu G, Wang H, Zhou Y, Zhou Y, Bryan GJ, Buell CR, Zhang C, Huang S. Genome evolution and diversity of wild and cultivated potatoes. Nature. 2022 Jun;606(7914):535-541. doi: 10.1038/s41586-022-04822-x.

Abstract

Potato (Solanum tuberosum L.) is the world’s most important non-cereal food crop, and the vast majority of commercially grown cultivars are highly heterozygous tetraploids. Advances in diploid hybrid breeding based on true seeds have the potential to revolutionize future potato breeding and production. So far, relatively few studies have examined the genome evolution and diversity of wild and cultivated landrace potatoes, which limits the application of their diversity in potato breeding. Here we assemble 44 high-quality diploid potato genomes from 24 wild and 20 cultivated accessions that are representative of Solanum section Petota, the tuber-bearing clade, as well as 2 genomes from the neighbouring section, Etuberosum. Extensive discordance of phylogenomic relationships suggests the complexity of potato evolution. We find that the potato genome substantially expanded its repertoire of disease-resistance genes when compared with closely related seed-propagated solanaceous crops, indicative of the effect of tuber-based propagation strategies on the evolution of the potato genome. We discover a transcription factor that determines tuber identity and interacts with the mobile tuberization inductive signal SP6A. We also identify 561,433 high-confidence structural variants and construct a map of large inversions, which provides insights for improving inbred lines and precluding potential linkage drag, as exemplified by a 5.8-Mb inversion that is associated with carotenoid content in tubers. This study will accelerate hybrid potato breeding and enrich our understanding of the evolution and biology of potato as a global staple food crop.

Assembly statistics

Contig total length 1,550,555,616 bp
Contig number 3,052
Contig N50 4,114,417 bp
Contig L50 48
Contig longest 79,093,328 bp
Assembly level Contig
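
For readers who want to recompute the contig N50 and L50 from a list of contig lengths, the following is a minimal sketch under the usual definitions: N50 is the length of the contig at which the cumulative length, counted from the longest contig down, first reaches half of the total, and L50 is the number of contigs needed to get there. The function name n50_l50 and the toy lengths are illustrative assumptions and are not part of this release.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of contig lengths."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            # 'length' is the N50; 'count' contigs were needed, so it is the L50
            return length, count
    raise ValueError("no contig lengths given")


# Toy lengths for illustration only (not the PG4060 contigs)
result = n50_l50([10, 8, 6, 4, 2])
print(result)
```

Given the real contig lengths, the same function should reproduce the values listed above, provided the portal used the same definitions.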

Assembly

The Solanum multiinterruptum PG4060 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) PG4060.fa.gz
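
As an illustration of how the gzipped FASTA download might be read without extra dependencies, here is a small sketch using only the Python standard library. It assumes PG4060.fa.gz has been saved to the working directory; the helper names read_fasta and fasta_lengths are hypothetical, and the demo records are made up.

```python
import gzip
import io


def read_fasta(handle):
    """Yield (header, sequence) pairs from an open FASTA text handle."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def fasta_lengths(path):
    """Map each record ID (first word of the header) to its sequence length."""
    with gzip.open(path, "rt") as handle:
        return {h.split()[0]: len(s) for h, s in read_fasta(handle)}


# In-memory demo with made-up records
demo = io.StringIO(">seq1 example\nATGC\nATG\n>seq2\nTTAA\n")
records = list(read_fasta(demo))
print(records)

# With the real download (assumed local path):
# lengths = fasta_lengths("PG4060.fa.gz")
```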

Gene Predictions

Gene prediction files for the Solanum multiinterruptum PG4060 genome are available in GFF3 and FASTA formats.

Downloads

Genes (GFF3 file) PG4060.gff.gz
CDS sequences (FASTA file) PG4060.cds.fa.gz
Protein sequences (FASTA file) PG4060.protein.fa.gz
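
A quick sanity check on a GFF3 download is to count features by type. The sketch below assumes the file follows the standard nine-column, tab-separated GFF3 layout, which has not been verified against PG4060.gff.gz; the function name count_feature_types and the demo records are invented for illustration.

```python
import io
from collections import Counter


def count_feature_types(handle):
    """Count GFF3 feature types (column 3), skipping comments and directives."""
    counts = Counter()
    for line in handle:
        if line.startswith("##FASTA"):
            break  # an embedded sequence section follows; stop reading features
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) == 9:
            counts[fields[2]] += 1
    return counts


# Made-up records for demonstration
demo = io.StringIO(
    "##gff-version 3\n"
    "ctg1\tdemo\tgene\t100\t900\t.\t+\t.\tID=g1\n"
    "ctg1\tdemo\tmRNA\t100\t900\t.\t+\t.\tID=g1.t1;Parent=g1\n"
    "ctg1\tdemo\tCDS\t100\t900\t.\t+\t0\tID=g1.t1.cds;Parent=g1.t1\n"
    "ctg1\tdemo\tgene\t1500\t2400\t.\t-\t.\tID=g2\n"
)
counts = count_feature_types(demo)
print(counts["gene"])
```

For the real file, the handle could be opened with gzip.open("PG4060.gff.gz", "rt"), assuming the download sits in the working directory.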

Functional Analysis

Functional annotation for the Solanum multiinterruptum PG4060 genome is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan Solanum_multiinterruptum_PG4060.Pfam.tsv.gz

S genes

Summary

Query | Contig | Size (bp) | Coordinates | BLASTn Hit | BLASTn %ID | Domain
SLF16Ψ | atg0003 | 1531079 | 651511-650331 | Solanum tuberosum DM8.1, SLF16 | 97.9 | -
SLF6 | atg0149 | 6423479 | 2234112-2232970 | Solanum tuberosum DM8.1, SLF6-2 | 95.2 | F-box domain
SLF12Ψ | atg0149 | 6423479 | 2297166-2298330 | Solanum pennellii NM_001323464.1, SLF12 | 94.2 | -
SLF13 | atg0149 | 6423479 | 2792830-2791628 | Solanum tuberosum DM8.1, SLF13 | 98.2 | F-box domain
SLF18 | atg0347 | 2091747 | 1422540-1421431 | Solanum tuberosum DM8.1, SLF18 | 96.8 | F-box domain
SLF1 | atg0758 | 277848 | 132247-133428 | Solanum pennellii KJ814858.1, SLF1 | 91.6 | F-box domain
S-RNase | hptg0013 | 1722284 | 372914-373147, 373228-373638 | Solanum tuberosum MZ561405.1, SRNase-S2 | 96.4 | Ribonuclease T2 family
SLF17 | hptg0022 | 8370660 | 1933122-1934303 | Solanum tuberosum DM8.1, SLF17 | 97.9 | F-box domain
SLF1-2 | hptg0022 | 8370660 | 3682839-3681658 | Solanum pennellii KJ814858.1, SLF1 | 91.8 | F-box domain
SLF1-3 | hptg0066 | 2292031 | 153221-152040 | Solanum pennellii KJ814858.1, SLF1 | 91.6 | F-box domain
SLF23 | hptg0066 | 2292031 | 423603-424769 | Solanum neorickii MG266243.1, SLF23 | 95 | F-box domain
SLF17-2 | hptg0066 | 2292031 | 445719-446900 | Solanum tuberosum DM8.1, SLF17 | 97.8 | F-box domain
SLF21 | hptg0098 | 2438763 | 442162-440942 | Solanum tuberosum DM8.1, SLF21 | 96.3 | F-box domain
SLF6-2Ψ | hptg0104 | 2642315 | 71478-72615 | Solanum tuberosum DM8.1, SLF6 | 92.9 | -
SLF7 | hptg0104 | 2642315 | 78680-79843 | Solanum tuberosum DM8.1, SLF7 | 95.6 | F-box domain
SLF5 | hptg0104 | 2642315 | 282235-283404 | Solanum tuberosum DM8.1, SLF5-2 | 96.6 | F-box domain
SLF4Ψ | hptg0104 | 2642315 | 500538-501703 | Solanum pimpinellifolium KJ814871.1, SLF4 | 95.4 | -
SLF12-2 | hptg0104 | 2642315 | 722368-721202 | Solanum tuberosum DM8.1, SLF12 | 96.3 | F-box domain
S-RNase-2 | hptg0122 | 464752 | 347295-347050, 346938-346513 | Solanum chacoense AF232304.1, ScS14-RNase | 99.3 | Ribonuclease T2 family
SLF15 | hptg0173 | 1075788 | 246572-247831 | Solanum tuberosum DM8.1, SLF15 | 97.6 | F-box domain
SLF15-2 | ptg0018 | 21485827 | 3063867-3065126 | Solanum tuberosum DM8.1, SLF15 | 97.7 | F-box domain
SLF16-2Ψ | ptg0018 | 21485827 | 4303666-4302486 | Solanum tuberosum DM8.1, SLF16 | 98 | F-box domain
SLF5-2 | ptg0028 | 52998247 | 1830953-1829763 | Solanum tuberosum DM8.1, SLF5 | 97.6 | F-box domain
SLF12-3 | ptg0028 | 52998247 | 1858514-1859680 | Solanum tuberosum DM8.1, SLF12 | 98.4 | F-box domain
SLF5-3 | ptg0028 | 52998247 | 2067773-2066604 | Solanum tuberosum DM8.1, SLF5-2 | 98.1 | F-box domain
SLF7-2 | ptg0028 | 52998247 | 2102469-2101306 | Solanum tuberosum DM8.1, SLF7 | 95.4 | F-box domain
SLF6-3Ψ | ptg0028 | 52998247 | 2109654-2108512 | Solanum tuberosum DM8.1, SLF6 | 93.1 | -
SLF20 | ptg0028 | 52998247 | 2173098-2171932 | Solanum tuberosum DM8.1, SLF20 | 96.8 | F-box domain
SLF21-2 | ptg0028 | 52998247 | 2610861-2609641 | Solanum tuberosum DM8.1, SLF21 | 96.6 | F-box domain
SLF6-4 | ptg0028 | 52998247 | 7138408-7137266 | Solanum tuberosum DM8.1, SLF6-2 | 95.2 | F-box domain
SLF12-4Ψ | ptg0028 | 52998247 | 7195468-7196632 | Solanum pennellii NM_001323464.1, SLF12 | 94.2 | -
SLF13-2 | ptg0028 | 52998247 | 7707718-7706516 | Solanum tuberosum DM8.1, SLF13 | 98.2 | F-box domain
SLF18-2 | ptg0028 | 52998247 | 23500167-23501282 | Solanum tuberosum DM8.1, SLF18 | 96.2 | F-box domain
SLF19 | ptg0028 | 52998247 | 23560163-23559093 | Solanum tuberosum DM8.1, SLF19 | 95 | F-box domain
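
The Coordinates values in the summary can be turned into a strand and a total length with a few lines of code. The sketch below rests on two assumptions that the page does not state: that coordinates are 1-based and inclusive, and that a start larger than the end marks a hit on the reverse strand, as in BLAST output. Comma-separated values, as in the S-RNase rows, are treated as separate segments. The function name parse_coordinates is hypothetical.

```python
def parse_coordinates(text):
    """Parse 'start-end' or 'start-end, start-end' into (strand, segments, length)."""
    segments = []
    for part in text.split(","):
        start, end = (int(value) for value in part.strip().split("-"))
        segments.append((start, end))
    # Assumption: start > end means the feature lies on the reverse strand
    strand = "-" if segments[0][0] > segments[0][1] else "+"
    # Assumption: 1-based inclusive coordinates, hence the +1 per segment
    total = sum(abs(end - start) + 1 for start, end in segments)
    return strand, segments, total


single = parse_coordinates("651511-650331")
split = parse_coordinates("372914-373147, 373228-373638")
print(single)
print(split)
```

Under these assumptions the first SLF16Ψ entry spans 1,181 bp on the reverse strand, and the two S-RNase segments on hptg0013 sum to 645 bp on the forward strand.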


© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences