Solanum jamesii PG1008 Assembly & Annotation

Overview

Analysis Name Solanum jamesii PG1008 Assembly & Annotation
Sequencing technology PacBio data and Hi-C data
Assembly method hifiasm (v.0.13)
Release Date 2022-06-08
Reference Publication(s)

Tang D, Jia Y, Zhang J, Li H, Cheng L, Wang P, Bao Z, Liu Z, Feng S, Zhu X, Li D, Zhu G, Wang H, Zhou Y, Zhou Y, Bryan GJ, Buell CR, Zhang C, Huang S. Genome evolution and diversity of wild and cultivated potatoes. Nature. 2022 Jun;606(7914):535-541. doi: 10.1038/s41586-022-04822-x.

Abstract

Potato (Solanum tuberosum L.) is the world’s most important non-cereal food crop, and the vast majority of commercially grown cultivars are highly heterozygous tetraploids. Advances in diploid hybrid breeding based on true seeds have the potential to revolutionize future potato breeding and production. So far, relatively few studies have examined the genome evolution and diversity of wild and cultivated landrace potatoes, which limits the application of their diversity in potato breeding. Here we assemble 44 high-quality diploid potato genomes from 24 wild and 20 cultivated accessions that are representative of Solanum section Petota, the tuber-bearing clade, as well as 2 genomes from the neighbouring section, Etuberosum. Extensive discordance of phylogenomic relationships suggests the complexity of potato evolution. We fnd that the potato genome substantially expanded its repertoire of disease-resistance genes when compared with closely related seed-propagated solanaceous crops, indicative of the efect of tuber-based propagation strategies on the evolution of the potato genome. We discover a transcription factor that determines tuber identity and interacts with the mobile tuberization inductive signal SP6A. We also identify 561,433 high-confdence structural variants and construct a map of large inversions, which provides insights for improving inbred lines and precluding potential linkage drag, as exemplifed by a 5.8-Mb inversion that is associated with carotenoid content in tubers. This study will accelerate hybrid potato breeding and enrich our understanding of the evolution and biology of potato as a global staple food crop.

Assembly statistics

Contig total length 1,203,426,476 bp
Contig number 4074
Contig N50 5,049,162 bp
Contig L50 41
Contig longest 50,133,173 bp
Assembly level Contig

Assembly

The Solanum jamesii PG1008 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) PG1008.fa.gz

Gene Predictions

The Solanum jamesii PG1008 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) PG1008.gff.gz
CDS sequences (FASTA file) PG1008.cds.fa.gz
Protein sequences (FASTA file) PG1008.protein.fa.gz

Functional Analysis

Functional annotation for the Solanum jamesii PG1008 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Solanum_jamesii_PG1008.Pfam.tsv.gz

S genes

Summary

QueryContigSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF6ΨPG1008_atg00414562572912455-913606Solanum tuberosum DM8.1, SLF6-294.6-
SLF19PG1008_atg010424647521447578-1448693Solanum lycopersicum SL2.31, SLF1993.3F-box domain
SLF18PG1008_atg010424647521478221-1477109Solanum tuberosum DM8.1, SLF1895.3F-box domain
SLF16ΨPG1008_atg01642068022345462-346642Solanum tuberosum DM8.1, SLF1697.6-
SLF15ΨPG1008_atg016420680221423561-1424821Solanum tuberosum DM8.1, SLF1595.3-
SLF15-2PG1008_atg016420680221489929-1491188Solanum tuberosum DM8.1, SLF1596.3F-box domain
SLF4ΨPG1008_atg02671230755518903-517738Solanum pimpinellifolium KJ814871.1, SLF492.5-
SLF19-2PG1008_ptg00015013317327136985-27138100Solanum tuberosum DM8.1, SLF1995F-box domain
SLF18-2PG1008_ptg00015013317327167506-27166394Solanum tuberosum DM8.1, SLF1895.2F-box domain
SLF6-2ΨPG1008_ptg00015013317338586208-38587360Solanum tuberosum DM8.1, SLF6-294.6-
SLF4-2ΨPG1008_ptg00015013317343271256-43270091Solanum pimpinellifolium KJ814871.1, SLF492.4-
SLF15-3PG1008_ptg0012158287403172549-3171290Solanum tuberosum DM8.1, SLF1596.3F-box domain
SLF15-4ΨPG1008_ptg0012158287403233972-3232721Solanum tuberosum DM8.1, SLF1596-
SLF16-2ΨPG1008_ptg0012158287404414759-4413579Solanum tuberosum DM8.1, SLF1697.6-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences