Solanum americanum 'sp2271 (Isolate)' Assembly & Annotation

Overview

Analysis Name: Solanum americanum 'sp2271 (Isolate)' Assembly & Annotation
Sequencing technology: PacBio CCS
Assembly method: HiFiasm v. 0.13
Release Date: 2023-07-06
Reference Publication(s)

Lin X, Jia Y, Heal R, Prokchorchik M, Sindalovskaya M, Olave-Achury A, Makechemu M, Fairhead S, Noureen A, Heo J, Witek K, Smoker M, Taylor J, Shrestha RK, Lee Y, Zhang C, Park SJ, Sohn KH, Huang S, Jones JDG. Solanum americanum genome-assisted discovery of immune receptors that detect potato late blight pathogen effectors. Nat Genet. 2023 Sep;55(9):1579-1588. doi: 10.1038/s41588-023-01486-9.

Abstract

Potato (Solanum tuberosum) and tomato (Solanum lycopersicon) crops suffer severe losses to late blight caused by the oomycete pathogen Phytophthora infestans. Solanum americanum, a relative of potato and tomato, is globally distributed and most accessions are highly blight resistant. We generated high-quality reference genomes of four S. americanum accessions, resequenced 52 accessions, and defined a pan-NLRome of S. americanum immune receptor genes. We further screened for variation in recognition of 315 P. infestans RXLR effectors in 52 S. americanum accessions. Using these genomic and phenotypic data, we cloned three NLR-encoding genes, Rpi-amr4, R02860 and R04373, that recognize cognate P. infestans RXLR effectors PITG_22825 (AVRamr4), PITG_02860 and PITG_04373. These genomic resources and methodologies will support efforts to engineer potatoes with durable late blight resistance and can be applied to diseases of other crops.

Assembly statistics

Genome size: 1.1 Gb
Number of chromosomes: 12
Number of scaffolds: 450
Scaffold N50: 90.7 Mb
Scaffold L50: 6
Number of contigs: 567
Contig N50: 51.1 Mb
Contig L50: 8
GC percent: 36.5
Assembly level: Chromosome
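
For readers unfamiliar with the N50 and L50 figures above, the following is a minimal sketch of how these statistics are conventionally computed from a list of sequence lengths. The function name `n50_l50` and the toy lengths are illustrative assumptions; they are not taken from this assembly or from the pipeline that produced it.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a collection of sequence lengths.

    N50 is the length of the sequence at which the running total of
    lengths, taken from longest to shortest, first reaches half of the
    total assembly size. L50 is the number of sequences needed to get there.
    """
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, count
    raise ValueError("no sequence lengths given")


# Toy values for illustration only (not from the sp2271 assembly).
toy_lengths = [80, 70, 50, 40, 30, 20]
print(n50_l50(toy_lengths))  # (70, 2)
```

Read this way, a scaffold L50 of 6 means that the six longest scaffolds together cover at least half of the assembled genome.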

Assembly

The Solanum americanum 'sp2271 (Isolate)' Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file): sp2271.v4.fa.gz
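
Once downloaded, the gzipped FASTA file can be inspected without extra dependencies. The sketch below reports the length and GC percentage of each sequence; the helper names `fasta_lengths_and_gc` and `summarize` are hypothetical, and the choice to count every character (including any N bases) toward the length is an assumption that may differ from how the GC figure on this page was derived.

```python
import gzip
import io


def fasta_lengths_and_gc(handle):
    """Yield (name, length, gc_percent) for each record in a FASTA text handle."""
    name, length, gc = None, 0, 0
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, length, (100.0 * gc / length if length else 0.0)
            # Keep only the first word of the header as the sequence name.
            name = (line[1:].split() or [""])[0]
            length, gc = 0, 0
        elif line and name is not None:
            seq = line.upper()
            length += len(seq)
            gc += seq.count("G") + seq.count("C")
    if name is not None:
        yield name, length, (100.0 * gc / length if length else 0.0)


def summarize(path):
    """Read a gzipped FASTA file from disk, e.g. summarize("sp2271.v4.fa.gz")."""
    with gzip.open(path, "rt") as handle:
        return list(fasta_lengths_and_gc(handle))


# In-memory demonstration with made-up sequences.
demo = io.StringIO(">seqA demo\nACGT\nGG\n>seqB\nAATT\n")
for record in fasta_lengths_and_gc(demo):
    print(record)
```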

Gene Predictions

Gene prediction files for the Solanum americanum 'sp2271 (Isolate)' genome are available in GFF3 and FASTA formats.

Downloads

Genes (GFF3 file): sp2271.v4.gff3.gz
CDS sequences (FASTA file): sp2271.v4.cds.fa.gz
Protein sequences (FASTA file): sp2271.v4.pep.fa.gz

Functional Analysis

Functional annotation for the Solanum americanum 'sp2271 (Isolate)' genome is available for download below. The proteins were analyzed with InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan: Solanum_americanum_sp2271.Pfam.tsv.gz
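
The sketch below groups Pfam domains by protein, assuming the download follows the standard InterProScan TSV layout (column 1 protein accession, column 4 analysis name, column 5 signature accession, column 6 signature description). That layout is an assumption about this particular file, which the page does not document, and the function name `pfam_by_protein` and the example rows are hypothetical.

```python
import csv
import gzip
import io
from collections import defaultdict


def pfam_by_protein(handle):
    """Map protein accession -> set of (Pfam accession, description)."""
    domains = defaultdict(set)
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        # Assumed layout: row[3] is the analysis name, row[4] and row[5]
        # the signature accession and description.
        if len(row) < 6 or row[3] != "Pfam":
            continue
        domains[row[0]].add((row[4], row[5]))
    return dict(domains)


# Made-up rows for illustration; real use would be, for example:
#   with gzip.open("Solanum_americanum_sp2271.Pfam.tsv.gz", "rt", newline="") as fh:
#       table = pfam_by_protein(fh)
example = io.StringIO(
    "protA\tmd5a\t390\tPfam\tPF00646\tF-box domain\t12\t55\t1.0E-5\tT\t01-01-2023\n"
    "protB\tmd5b\t880\tPfam\tPF00931\tNB-ARC domain\t160\t400\t2.0E-40\tT\t01-01-2023\n"
)
for protein, hits in pfam_by_protein(example).items():
    print(protein, sorted(hits))
```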

S genes

Summary

Query | Chromosome | Size (bp) | Coordinates | BLASTn Hit | BLASTn %ID | Domain
SLF15 | sp2271chr01 | 113123534 | 3595658-3596917 | Solanum tuberosum DM8.1, SLF15 | 90.2 | F-box domain
SLF16ψ | sp2271chr01 | 113123534 | 4323391-4322208 | Solanum tuberosum DM8.1, SLF16 | 87.9 | -
SLF4ψ | sp2271chr01 | 113123534 | 35743323-35742136 | Petunia hybrida, S9-SLF4 | 77.5 | -
SLF22 | sp2271chr01 | 113123534 | 37174305-37173169 | Solanum tuberosum DM8.1, SLF22 | 87.2 | F-box domain
SLF12 | sp2271chr01 | 113123534 | 38378629-38379780 | Solanum tuberosum DM8.1, SLF12 | 83.5 | F-box domain
SLF6 | sp2271chr01 | 113123534 | 38800764-38801912 | Solanum tuberosum DM8.1, SLF6-2 | 86.1 | F-box domain
SLF5 | sp2271chr01 | 113123534 | 39350885-39349716 | Solanum tuberosum DM8.1, SLF5-2 | 87.6 | F-box domain
SLF17 | sp2271chr01 | 113123534 | 40624287-40623109 | Solanum peruvianum KU987615.1, SLF17 | 78.5 | F-box domain
SLF23 | sp2271chr01 | 113123534 | 40855863-40854706 | Solanum neorickii MG266239.1, SLF23 | 88.5 | F-box domain
SLF20ψ | sp2271chr01 | 113123534 | 50885148-50883982 | Solanum tuberosum DM8.1, SLF20 | 88.9 | -
SLF9 | sp2271chr01 | 113123534 | 54517286-54516115 | Solanum tuberosum DM8.1, SLF9 | 82.7 | F-box domain
SLF20-2ψ | sp2271chr01 | 113123534 | 54761667-54760501 | Solanum tuberosum DM8.1, SLF20 | 87.7 | -
SLF21ψ | sp2271chr01 | 113123534 | 63628010-63629223 | Solanum tuberosum DM8.1, SLF21 | 87.0 | -
SLF11 | sp2271chr01 | 113123534 | 65305181-65306350 | Solanum tuberosum DM8.1, SLF11 | 89.4 | F-box domain
SLF3 | sp2271chr01 | 113123534 | 66409424-66408261 | Solanum pennellii BK009230.1, SLF3 | 84.0 | F-box domain
SLF18 | sp2271chr01 | 113123534 | 82939062-82940177 | Solanum lycopersicum SL2.31, SLF18 | 87.8 | F-box domain
SLF19 | sp2271chr01 | 113123534 | 82995686-82994586 | Solanum tuberosum DM8.1, SLF19 | 90.6 | F-box domain
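
Some coordinate pairs in the summary list the larger position first. The sketch below normalizes a coordinate string into start, end, strand and length, under two assumptions that the page does not state: that a descending pair denotes a feature on the minus strand, and that positions are 1-based and inclusive. The function name `parse_interval` is hypothetical.

```python
def parse_interval(coords):
    """Split 'a-b' into (start, end, strand, length).

    Assumes 1-based inclusive positions, and that a pair written with
    the larger value first lies on the minus strand.
    """
    first, second = (int(part) for part in coords.split("-"))
    strand = "+" if first <= second else "-"
    start, end = min(first, second), max(first, second)
    return start, end, strand, end - start + 1


# Two coordinate strings copied from the summary table.
print(parse_interval("3595658-3596917"))  # SLF15: (3595658, 3596917, '+', 1260)
print(parse_interval("4323391-4322208"))  # SLF16ψ: (4322208, 4323391, '-', 1184)
```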

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences