Analysis Name | Solanum morelliforme PG1011 Assembly & Annotation |
Sequencing technology | PacBio data and Hi-C data |
Assembly method | hifiasm (v.0.13) |
Release Date | 2022-06-08 |
Tang D, Jia Y, Zhang J, Li H, Cheng L, Wang P, Bao Z, Liu Z, Feng S, Zhu X, Li D, Zhu G, Wang H, Zhou Y, Zhou Y, Bryan GJ, Buell CR, Zhang C, Huang S. Genome evolution and diversity of wild and cultivated potatoes. Nature. 2022 Jun;606(7914):535-541. doi: 10.1038/s41586-022-04822-x.
AbstractPotato (Solanum tuberosum L.) is the world’s most important non-cereal food crop, and the vast majority of commercially grown cultivars are highly heterozygous tetraploids. Advances in diploid hybrid breeding based on true seeds have the potential to revolutionize future potato breeding and production. So far, relatively few studies have examined the genome evolution and diversity of wild and cultivated landrace potatoes, which limits the application of their diversity in potato breeding. Here we assemble 44 high-quality diploid potato genomes from 24 wild and 20 cultivated accessions that are representative of Solanum section Petota, the tuber-bearing clade, as well as 2 genomes from the neighbouring section, Etuberosum. Extensive discordance of phylogenomic relationships suggests the complexity of potato evolution. We fnd that the potato genome substantially expanded its repertoire of disease-resistance genes when compared with closely related seed-propagated solanaceous crops, indicative of the efect of tuber-based propagation strategies on the evolution of the potato genome. We discover a transcription factor that determines tuber identity and interacts with the mobile tuberization inductive signal SP6A. We also identify 561,433 high-confdence structural variants and construct a map of large inversions, which provides insights for improving inbred lines and precluding potential linkage drag, as exemplifed by a 5.8-Mb inversion that is associated with carotenoid content in tubers. This study will accelerate hybrid potato breeding and enrich our understanding of the evolution and biology of potato as a global staple food crop.
Assembly statistics
Contig total length | 1,160,043,494 bp |
Contig number | 5489 |
Contig N50 | 17,564,894 bp |
Contig L50 | 15 |
Contig longest | 59,389,852 bp |
Assembly level | Contig |
The Solanum morelliforme PG1011 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | PG1011.fa.gz |
The Solanum morelliforme PG1011 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | PG1011.gff.gz |
CDS sequences (FASTA file) | PG1011.cds.fa.gz |
Protein sequences (FASTA file) | PG1011.protein.fa.gz |
Functional annotation for the Solanum morelliforme PG1011 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).
Downloads
Domain from InterProScan | Solanum_morelliforme_PG1011.Pfam.tsv.gz |
Summary
Query | Contig | Size(bp) | Coordinates | BLASTn Hit | BLASTn %ID | Domain |
SLF19 | PG1011_ptg0019 | 24517580 | 445191-446306 | Solanum tuberosum DM8.1, SLF19 | 95.2 | F-box domain |
SLF18 | PG1011_ptg0019 | 24517580 | 477687-476578 | Solanum tuberosum DM8.1, SLF18 | 95 | F-box domain |
SLF14Ψ | PG1011_ptg0019 | 24517580 | 11072256-11073400 | Solanum habrochaites KJ814931.1, SLF14 | 88.3 | - |
SLF13Ψ | PG1011_ptg0019 | 24517580 | 13511931-13513130 | Solanum tuberosum DM8.1, SLF13 | 95.6 | - |
SLF11Ψ | PG1011_ptg0019 | 24517580 | 15014837-15013666 | Solanum tuberosum DM8.1, SLF11 | 95.1 | - |
SLF21 | PG1011_ptg0019 | 24517580 | 19847783-19848997 | Solanum tuberosum DM8.1, SLF21 | 95.7 | F-box domain |
SLF20Ψ | PG1011_ptg0019 | 24517580 | 20082629-20083795 | Solanum tuberosum DM8.1, SLF20 | 97.3 | - |
SLF12Ψ | PG1011_ptg0019 | 24517580 | 20700634-20699466 | Solanum tuberosum DM8.1, SLF12 | 96 | - |
SLF16Ψ | PG1011_ptg0042 | 3886020 | 89744-90924 | Solanum tuberosum DM8.1, SLF16 | 97.9 | - |
SLF15 | PG1011_ptg0042 | 3886020 | 784358-783099 | Solanum tuberosum DM8.1, SLF15 | 97 | F-box domain |
Nucleotide
Protein