Solanum retroflexum gwh_assembly PI 688410 01 Assembly & Annotation

Overview

Analysis Name: Solanum retroflexum gwh_assembly PI 688410 01 Assembly & Annotation
Sequencing technology: PacBio HiFi
Assembly method: Hifiasm v0.16.1-r375
Release Date: 2023-03-21
Reference Publication(s)

Wu Y, Li D, Hu Y, Li H, Ramstein GP, Zhou S, Zhang X, Bao Z, Zhang Y, Song B, Zhou Y, Zhou Y, Gagnon E, Särkinen T, Knapp S, Zhang C, Städler T, Buckler ES, Huang S. Phylogenomic discovery of deleterious mutations facilitates hybrid potato breeding. Cell. 2023 May 25;186(11):2313-2328.e15. doi: 10.1016/j.cell.2023.04.008.

Summary

Hybrid potato breeding will transform the crop from a clonally propagated tetraploid to a seed-reproducing diploid. Historical accumulation of deleterious mutations in potato genomes has hindered the development of elite inbred lines and hybrids. Utilizing a whole-genome phylogeny of 92 Solanaceae and its sister clade species, we employ an evolutionary strategy to identify deleterious mutations. The deep phylogeny reveals the genome-wide landscape of highly constrained sites, comprising ∼2.4% of the genome. Based on a diploid potato diversity panel, we infer 367,499 deleterious variants, of which 50% occur at non-coding and 15% at synonymous sites. Counterintuitively, diploid lines with relatively high homozygous deleterious burden can be better starting material for inbred-line development, despite showing less vigorous growth. Inclusion of inferred deleterious mutations increases genomic-prediction accuracy for yield by 24.7%. Our study generates insights into the genome-wide incidence and properties of deleterious mutations and their far-reaching consequences for breeding.

Assembly statistics

Genome size (bp): 2,963,877,796
GC content: 36.39%
Genome sequence No.: 37
Maximum genome sequence length (bp): 101,730,500
Minimum genome sequence length (bp): 48,069,423
Average genome sequence length (bp): 80,104,805
Genome sequence N50 (bp): 83,865,500
Genome sequence N90 (bp): 68,343,000
Assembly level: Chromosome
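
The N50 and N90 values above follow the usual definition: sort the sequences from longest to shortest and report the length at which the running total first reaches 50% (or 90%) of the assembly size. The sketch below illustrates that definition in Python. The function name `nx` and the toy lengths are assumptions made for illustration, and this is not the tool that produced the figures on this page. The last line only re-derives the reported average from the reported genome size and sequence count.

```python
def nx(lengths, percent):
    """Return the Nx length: the shortest sequence in the smallest set of
    longest sequences that together cover at least `percent` % of the total."""
    if not lengths:
        raise ValueError("no sequence lengths given")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer arithmetic avoids floating-point rounding at the threshold.
        if running * 100 >= percent * total:
            return length


# Toy lengths (hypothetical, not taken from this assembly).
toy_lengths = [100, 80, 60, 40, 20]
toy_n50 = nx(toy_lengths, 50)
toy_n90 = nx(toy_lengths, 90)

# Consistency check using only numbers reported on this page:
# genome size divided by the number of sequences, rounded down.
average_length = 2_963_877_796 // 37
```

Running `nx` over the 37 real sequence lengths would be the way to check the reported N50 and N90; those lengths are not listed on this page, so no expected values are given here.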

Assembly

The Solanum retroflexum gwh_assembly PI 688410 01 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file): GWHBKCB00000000.genome.fasta.gz
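
As a rough illustration of how the gzipped FASTA file could be read to recompute per-sequence lengths and GC content, here is a minimal sketch that uses only the Python standard library. The function name `fasta_stats` is hypothetical, and the demo runs on a tiny in-memory record rather than the real download. GC content is computed here over all characters, including any `N`, which may differ from how the value on this page was calculated.

```python
import gzip
import io


def fasta_stats(handle):
    """Return ({sequence_id: length}, gc_fraction) for a FASTA text handle."""
    lengths = {}
    gc = 0
    total = 0
    current = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the ID.
            current = line[1:].split()[0]
            lengths[current] = 0
        elif current is not None:
            seq = line.upper()
            lengths[current] += len(seq)
            gc += seq.count("G") + seq.count("C")
            total += len(seq)
    return lengths, (gc / total if total else 0.0)


# Demo on a small gzip-compressed FASTA held in memory (made-up sequences).
demo_bytes = gzip.compress(b">seq1 demo\nACGT\nGG\n>seq2\nATAT\n")
with gzip.open(io.BytesIO(demo_bytes), "rt") as fh:
    demo_lengths, demo_gc = fasta_stats(fh)
```

For the real file, the same function could be called with `gzip.open("GWHBKCB00000000.genome.fasta.gz", "rt")` once the download is available locally.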

Gene Predictions

The Solanum retroflexum gwh_assembly PI 688410 01 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file): GWHBKCB00000000.gff.gz
CDS sequences (FASTA file): GWHBKCB00000000.CDS.fasta.gz
Protein sequences (FASTA file): GWHBKCB00000000.Protein.faa.gz
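
GFF3 is a tab-delimited format with nine columns, the third of which holds the feature type (for example `gene`, `mRNA`, `CDS`). A quick first look at an annotation file is to tally those types. The sketch below does this for any iterable of lines. The function name `count_feature_types` and the demo records are illustrative assumptions, and the feature types present in the actual file are not stated on this page.

```python
from collections import Counter


def count_feature_types(lines):
    """Count GFF3 feature types (column 3), skipping comments and bad rows."""
    counts = Counter()
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue  # blank line, directive, or comment
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 9:
            continue  # not a nine-column feature row
        counts[fields[2]] += 1
    return counts


# Made-up records, used only to show the expected input shape.
demo_gff = [
    "##gff-version 3\n",
    "chrDemo\tsrc\tgene\t100\t900\t.\t+\t.\tID=g1\n",
    "chrDemo\tsrc\tmRNA\t100\t900\t.\t+\t.\tID=m1;Parent=g1\n",
    "chrDemo\tsrc\tCDS\t100\t400\t.\t+\t0\tID=c1;Parent=m1\n",
    "chrDemo\tsrc\tCDS\t600\t900\t.\t+\t0\tID=c2;Parent=m1\n",
]
demo_counts = count_feature_types(demo_gff)
```

With the downloaded file, the lines could be supplied by `gzip.open("GWHBKCB00000000.gff.gz", "rt")` from the standard library.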

Functional Analysis

Functional annotation for the Solanum retroflexum gwh_assembly PI 688410 01 assembly is available for download below. The proteins were analyzed with InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan: Solanum_retroflexum_gwh_PI_688410_01.Pfam.tsv.gz

S genes

Summary

Query | Chromosome | Size (bp) | Coordinates | BLASTn Hit | BLASTn %ID | Domain
SLF15 | GWHBKCB00000001 | 101730500 | 3381911-3383170 | Solanum tuberosum DM8.1, SLF15 | 90.2 | F-box domain
SLF16Ψ | GWHBKCB00000001 | 101730500 | 4049901-4048718 | Solanum tuberosum DM8.1, SLF16 | 88.2 | -
SLF22Ψ | GWHBKCB00000001 | 101730500 | 37808988-37807853 | Solanum tuberosum DM8.1, SLF22 | 87.2 | -
SLF21Ψ | GWHBKCB00000001 | 101730500 | 47829007-47830220 | Solanum tuberosum DM8.1, SLF21 | 86.7 | -
SLF11 | GWHBKCB00000001 | 101730500 | 49241756-49242922 | Solanum tuberosum DM8.1, SLF11 | 89.3 | F-box domain
SLF18 | GWHBKCB00000001 | 101730500 | 71351651-71352766 | Solanum lycopersicum SL2.31, SLF18 | 88.5 | F-box domain
SLF19 | GWHBKCB00000001 | 101730500 | 71398706-71397609 | Solanum tuberosum DM8.1, SLF19 | 88.5 | F-box domain
SLF19-2 | GWHBKCB00000001 | 101730500 | 71493542-71492445 | Solanum tuberosum DM8.1, SLF19 | 88.4 | F-box domain
SLF15-2Ψ | GWHBKCB00000005 | 90459577 | 3842668-3841413 | Solanum tuberosum DM8.1, SLF15 | 89.6 | -
SLF16-2Ψ | GWHBKCB00000005 | 90459577 | 4570108-4568925 | Solanum tuberosum DM8.1, SLF16 | 88.4 | -
SLF22-2Ψ | GWHBKCB00000005 | 90459577 | 29537317-29536181 | Solanum tuberosum DM8.1, SLF22 | 87.1 | -
SLF20Ψ | GWHBKCB00000005 | 90459577 | 31944497-31943333 | Solanum tuberosum DM8.1, SLF20 | 87.7 | -
SLF7Ψ | GWHBKCB00000005 | 90459577 | 36234792-36233621 | Solanum tuberosum DM8.1, SLF7 | 89.0 | -
S-RNaseΨ | GWHBKCB00000005 | 90459577 | 39150810-39151037, 39151180-39151608 | Solanum chacoense AF176533.1, ScS12-RNase | 86.5 | -
SLF21-2Ψ | GWHBKCB00000005 | 90459577 | 43785799-43784586 | Solanum tuberosum DM8.1, SLF21 | 87.1 | -
SLF11-2 | GWHBKCB00000005 | 90459577 | 47821840-47820671 | Solanum tuberosum DM8.1, SLF11 | 89.1 | F-box domain
SLF18-2Ψ | GWHBKCB00000005 | 90459577 | 63361068-63362186 | Solanum tuberosum DM8.1, SLF18 | 88.1 | -
SLF19-3 | GWHBKCB00000005 | 90459577 | 63398343-63397243 | Solanum tuberosum DM8.1, SLF19 | 90.6 | F-box domain
SLF15-3 | GWHBKCB00000009 | 87999500 | 18978218-18979477 | Solanum tuberosum DM8.1, SLF15 | 89.9 | F-box domain
SLF16-3Ψ | GWHBKCB00000009 | 87999500 | 19666987-19665804 | Solanum tuberosum DM8.1, SLF16 | 87.8 | -
SLF22-3 | GWHBKCB00000009 | 87999500 | 48491490-48490354 | Solanum tuberosum DM8.1, SLF22 | 87.0 | F-box domain
SLF5Ψ | GWHBKCB00000009 | 87999500 | 51486393-51487562 | Solanum tuberosum DM8.1, SLF5-2 | 86.4 | -
SLF11-3Ψ | GWHBKCB00000009 | 87999500 | 59721108-59722277 | Solanum tuberosum DM8.1, SLF11 | 87.3 | -
SLF18-3 | GWHBKCB00000009 | 87999500 | 77542561-77543670 | Solanum tuberosum DM8.1, SLF18 | 88.6 | F-box domain
SLF19-4Ψ | GWHBKCB00000009 | 87999500 | 77592063-77590962 | Solanum tuberosum DM8.1, SLF19 | 90.4 | -
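
Some entries in the Coordinates column run from a larger to a smaller position, and the S-RNase entry has two comma-separated segments. The sketch below parses such a coordinate string into segments, a total length, and an orientation. It assumes that coordinates are 1-based and inclusive and that a descending range marks a hit on the reverse strand, as is conventional for BLAST output. Neither assumption is stated on this page, and the function name `parse_coordinates` is hypothetical.

```python
def parse_coordinates(text):
    """Parse 'a-b' or 'a-b, c-d' into (segments, orientation, total_length).

    Assumes 1-based inclusive positions; a descending range is treated as
    a reverse-strand hit (an assumption, not stated by the source table).
    """
    segments = []
    for part in text.replace("\n", " ").split(","):
        part = part.strip()
        if not part:
            continue
        first, second = (int(value) for value in part.split("-"))
        segments.append((first, second))
    if all(a <= b for a, b in segments):
        orientation = "+"
    elif all(a >= b for a, b in segments):
        orientation = "-"
    else:
        orientation = "mixed"
    total_length = sum(abs(b - a) + 1 for a, b in segments)
    return segments, orientation, total_length


# Coordinate strings copied from the table above.
slf15 = parse_coordinates("3381911-3383170")
slf16_pseudo = parse_coordinates("4049901-4048718")
s_rnase = parse_coordinates("39150810-39151037, 39151180-39151608")
```

Under these assumptions, SLF15 spans 1,260 bp in the forward orientation, SLF16Ψ spans 1,184 bp in the reverse orientation, and the two S-RNaseΨ segments together cover 657 bp.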


© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences