Solanum neorossii PG6243 Assembly & Annotation

Overview

Analysis Name: Solanum neorossii PG6243 Assembly & Annotation
Sequencing technology: PacBio data and Hi-C data
Assembly method: hifiasm (v.0.13)
Release Date: 2022-06-08
Reference Publication(s)

Tang D, Jia Y, Zhang J, Li H, Cheng L, Wang P, Bao Z, Liu Z, Feng S, Zhu X, Li D, Zhu G, Wang H, Zhou Y, Zhou Y, Bryan GJ, Buell CR, Zhang C, Huang S. Genome evolution and diversity of wild and cultivated potatoes. Nature. 2022 Jun;606(7914):535-541. doi: 10.1038/s41586-022-04822-x.

Abstract

Potato (Solanum tuberosum L.) is the world’s most important non-cereal food crop, and the vast majority of commercially grown cultivars are highly heterozygous tetraploids. Advances in diploid hybrid breeding based on true seeds have the potential to revolutionize future potato breeding and production. So far, relatively few studies have examined the genome evolution and diversity of wild and cultivated landrace potatoes, which limits the application of their diversity in potato breeding. Here we assemble 44 high-quality diploid potato genomes from 24 wild and 20 cultivated accessions that are representative of Solanum section Petota, the tuber-bearing clade, as well as 2 genomes from the neighbouring section, Etuberosum. Extensive discordance of phylogenomic relationships suggests the complexity of potato evolution. We find that the potato genome substantially expanded its repertoire of disease-resistance genes when compared with closely related seed-propagated solanaceous crops, indicative of the effect of tuber-based propagation strategies on the evolution of the potato genome. We discover a transcription factor that determines tuber identity and interacts with the mobile tuberization inductive signal SP6A. We also identify 561,433 high-confidence structural variants and construct a map of large inversions, which provides insights for improving inbred lines and precluding potential linkage drag, as exemplified by a 5.8-Mb inversion that is associated with carotenoid content in tubers. This study will accelerate hybrid potato breeding and enrich our understanding of the evolution and biology of potato as a global staple food crop.

Assembly statistics

Contig total length: 1,274,759,420 bp
Contig number: 2,386
Contig N50: 5,254,495 bp
Contig L50: 48
Contig longest: 46,004,193 bp
Assembly level: Contig
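
The N50 and L50 values above follow the usual definitions: with contigs sorted from longest to shortest, N50 is the length of the contig at which the running total first reaches half of the total assembly length, and L50 is the number of contigs needed to get there. The following is a minimal Python sketch of that calculation. The helper name `n50_l50` is hypothetical and the input is a toy list, not the PG6243 contig lengths.

```python
from typing import Iterable, Tuple


def n50_l50(lengths: Iterable[int]) -> Tuple[int, int]:
    """Return (N50, L50) for a collection of contig lengths."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("no contig lengths given")
    half = sum(ordered) / 2
    running = 0
    # Walk from the longest contig down until half the assembly is covered.
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, count
    raise AssertionError("unreachable: the full sum always reaches half")


# Toy example (hypothetical lengths): total 30, half 15.
toy = [10, 8, 6, 4, 2]
print(n50_l50(toy))  # (8, 2)
```

Applied to the real contig lengths, the same function would be expected to reproduce the N50 and L50 reported here, provided the same set of contigs is used.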

Assembly

The Solanum neorossii PG6243 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file): PG6243.fa.gz
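
As a hedged illustration of working with the gzipped FASTA download, the sketch below streams a file and records the length of each sequence. The function name `fasta_lengths` is hypothetical, and the code assumes only the generic FASTA convention (a `>` header line followed by sequence lines), not anything specific to this file.

```python
import gzip
from typing import Dict, Optional


def fasta_lengths(path: str) -> Dict[str, int]:
    """Map each sequence ID in a gzipped FASTA file to its length in bp."""
    lengths: Dict[str, int] = {}
    name: Optional[str] = None
    with gzip.open(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                # Use the first whitespace-delimited token of the header as the ID.
                fields = line[1:].split()
                name = fields[0] if fields else ""
                lengths[name] = 0
            elif name is not None:
                lengths[name] += len(line)
    return lengths
```

Calling `fasta_lengths("PG6243.fa.gz")` on a local copy of the download would return one entry per sequence, which can then be summed or sorted to check the assembly statistics.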

Gene Predictions

The Solanum neorossii PG6243 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file): PG6243.gff.gz
CDS sequences (FASTA file): PG6243.cds.fa.gz
Protein sequences (FASTA file): PG6243.protein.fa.gz
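
A quick way to inspect a GFF3 download is to count the feature types in column 3. The sketch below is an assumption-laden example: it relies only on the generic GFF3 layout (nine tab-separated columns, `#` comment lines, an optional `##FASTA` section), and the helper name `count_feature_types` is hypothetical. Which feature types PG6243.gff.gz actually contains is not stated on this page.

```python
import gzip
from collections import Counter


def count_feature_types(path: str) -> Counter:
    """Count the values of GFF3 column 3 (feature type) in a gzipped file."""
    counts: Counter = Counter()
    with gzip.open(path, "rt") as handle:
        for line in handle:
            if line.startswith("##FASTA"):
                break  # an embedded sequence section may follow the features
            if not line.strip() or line.startswith("#"):
                continue  # skip blank lines, directives, and comments
            columns = line.rstrip("\n").split("\t")
            if len(columns) == 9:  # GFF3 feature lines have nine columns
                counts[columns[2]] += 1
    return counts
```

For example, `count_feature_types("PG6243.gff.gz").most_common()` would list the feature types in a local copy of the file, most frequent first.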

Functional Analysis

Functional annotation for Solanum neorossii PG6243 is available for download below. The proteins were analyzed with InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan: Solanum_neorossii_PG6243.Pfam.tsv.gz

S genes

Summary

Query | Contig | Size (bp) | Coordinates | BLASTn Hit | BLASTn %ID | Domain
SLF19Ψ | PG6243_atg0069 | 3024436 | 13512-12399 | Solanum tuberosum DM8.1, SLF19 | 94.7 | -
SLF13 | PG6243_atg0117 | 3148926 | 2252836-2254038 | Solanum tuberosum DM8.1, SLF13 | 99.1 | F-box domain
SLF6Ψ | PG6243_atg0117 | 3148926 | 2576313-2577475 | Solanum tuberosum DM8.1, SLF6-2 | 95.9 | -
SLF16Ψ | PG6243_atg0130 | 1566287 | 1220451-1219271 | Solanum tuberosum DM8.1, SLF16 | 98.3 | -
SLF15 | PG6243_atg0193 | 843018 | 719139-720398 | Solanum tuberosum DM8.1, SLF15 | 97.5 | F-box domain
SLF23 | PG6243_hptg0007 | 4353470 | 1479853-1481010 | Solanum neorickii MG266233.1, SLF23 | 94.0 | F-box domain
SLF17 | PG6243_hptg0007 | 4353470 | 1566602-1567783 | Solanum tuberosum DM8.1, SLF17 | 97.8 | F-box domain
SLF5 | PG6243_hptg0007 | 4353470 | 1943229-1942039 | Solanum tuberosum DM8.1, SLF5 | 95.7 | F-box domain
SLF12 | PG6243_hptg0007 | 4353470 | 1972988-1974154 | Solanum tuberosum DM8.1, SLF12 | 97.7 | F-box domain
SLF4Ψ | PG6243_hptg0007 | 4353470 | 2127281-2126119 | Solanum pimpinellifolium KJ814871.1, SLF4 | 94.1 | -
SLF5-2 | PG6243_hptg0007 | 4353470 | 2306181-2307350 | Solanum tuberosum DM8.1, SLF5-2 | 97.1 | F-box domain
SLF7 | PG6243_hptg0007 | 4353470 | 2359391-2358228 | Solanum tuberosum DM8.1, SLF7 | 94.4 | F-box domain
SLF9 | PG6243_hptg0036 | 5338255 | 1842878-1844020 | Solanum tuberosum DM8.1, SLF9 | 98.3 | F-box domain
SLF21 | PG6243_hptg0036 | 5338255 | 3362166-3363386 | Solanum tuberosum DM8.1, SLF21 | 98.9 | F-box domain
SLF20 | PG6243_hptg0036 | 5338255 | 3894182-3895348 | Solanum tuberosum DM8.1, SLF20 | 98.5 | F-box domain
SLF6-2Ψ | PG6243_hptg0036 | 5338255 | 3954431-3955572 | Solanum tuberosum DM8.1, SLF6 | 92.3 | -
SLF7-2 | PG6243_hptg0036 | 5338255 | 3976383-3977552 | Solanum tuberosum DM8.1, SLF7 | 97.5 | F-box domain
SLF5-3 | PG6243_hptg0036 | 5338255 | 4048806-4049975 | Solanum tuberosum DM8.1, SLF5-2 | 98.6 | F-box domain
SLF4-2Ψ | PG6243_hptg0036 | 5338255 | 4127445-4128607 | Solanum pennellii NM_001323453.1, SLF4 | 95.7 | -
SLF12-2 | PG6243_hptg0036 | 5338255 | 4332558-4331407 | Solanum tuberosum DM8.1, SLF12 | 97.8 | F-box domain
SLF5-4 | PG6243_hptg0036 | 5338255 | 4373063-4374247 | Solanum tuberosum DM8.1, SLF5 | 95.1 | F-box domain
SLF17-2 | PG6243_hptg0051 | 1987283 | 41759-40578 | Solanum tuberosum DM8.1, SLF17 | 97.4 | F-box domain
SLF23-2 | PG6243_hptg0051 | 1987283 | 69424-68267 | Solanum neorickii MG266239.1, SLF23 | 95.6 | F-box domain
SLF18 | PG6243_hptg0103 | 4223402 | 4204114-4205229 | Solanum tuberosum DM8.1, SLF18 | 97.1 | F-box domain
SLF21-2Ψ | PG6243_hptg0115 | 809055 | 395990-394771 | Solanum tuberosum DM8.1, SLF21 | 97.5 | -
SLF13-2 | PG6243_ptg0030 | 9867509 | 8437000-8438202 | Solanum tuberosum DM8.1, SLF13 | 98.9 | F-box domain
SLF6-3Ψ | PG6243_ptg0030 | 9867509 | 8806296-8807446 | Solanum tuberosum DM8.1, SLF6-2 | 95.4 | -
SLF11Ψ | PG6243_ptg0030 | 9867509 | 9741482-9740332 | Solanum tuberosum DM8.1, SLF11 | 94.6 | -
SLF19-2Ψ | PG6243_ptg0044 | 7033214 | 3231600-3232707 | Solanum tuberosum DM8.1, SLF19 | 97.2 | -
SLF18-2 | PG6243_ptg0044 | 7033214 | 3267552-3266443 | Solanum tuberosum DM8.1, SLF18 | 97.5 | F-box domain
SLF16-2Ψ | PG6243_ptg0045 | 11531901 | 6313847-6315027 | Solanum tuberosum DM8.1, SLF16 | 98.6 | -
SLF15-2 | PG6243_ptg0045 | 11531901 | 8079052-8080311 | Solanum tuberosum DM8.1, SLF15 | 97.9 | F-box domain
SLF9-2 | PG6243_ptg0064 | 1678696 | 708295-707153 | Solanum tuberosum DM8.1, SLF9 | 98.7 | F-box domain
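
Some entries in the Coordinates column list the larger position first (for example 13512-12399). The sketch below normalizes such pairs under two assumptions that this page does not state explicitly: that a descending pair denotes a hit on the reverse strand, and that positions are 1-based and inclusive. The names `Interval` and `parse_interval` are hypothetical.

```python
from typing import NamedTuple


class Interval(NamedTuple):
    start: int   # smaller coordinate
    end: int     # larger coordinate
    strand: str  # "+" or "-"

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_interval(text: str) -> Interval:
    """Parse 'a-b' into an ordered interval plus an inferred strand."""
    first, second = (int(part) for part in text.strip().split("-"))
    if first <= second:
        return Interval(first, second, "+")
    # Assumption: a descending pair marks the reverse strand.
    return Interval(second, first, "-")


print(parse_interval("13512-12399"))             # Interval(start=12399, end=13512, strand='-')
print(parse_interval("2252836-2254038").length)  # 1203
```

Keeping the strand separate from the ordered start and end makes it easier to pass the intervals to tools that expect start to be no greater than end.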

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences