Solanum wrightii gwh_assembly A392 Assembly & Annotation

Overview

Analysis Name Solanum wrightii gwh_assembly A392 Assembly & Annotation
Sequencing technology PacBio HiFi
Assembly method Hifiasm v0.16.1-r375
Release Date 2023-03-21
Reference Publication(s)

Wu Y, Li D, Hu Y, Li H, Ramstein GP, Zhou S, Zhang X, Bao Z, Zhang Y, Song B, Zhou Y, Zhou Y, Gagnon E, Särkinen T, Knapp S, Zhang C, Städler T, Buckler ES, Huang S. Phylogenomic discovery of deleterious mutations facilitates hybrid potato breeding. Cell. 2023 May 25;186(11):2313-2328.e15. doi: 10.1016/j.cell.2023.04.008.

Summary

Hybrid potato breeding will transform the crop from a clonally propagated tetraploid to a seed-reproducing diploid. Historical accumulation of deleterious mutations in potato genomes has hindered the development of elite inbred lines and hybrids. Utilizing a whole-genome phylogeny of 92 Solanaceae and its sister clade species, we employ an evolutionary strategy to identify deleterious mutations. The deep phylogeny reveals the genome-wide landscape of highly constrained sites, comprising ∼2.4% of the genome. Based on a diploid potato diversity panel, we infer 367,499 deleterious variants, of which 50% occur at non-coding and 15% at synonymous sites. Counterintuitively, diploid lines with relatively high homozygous deleterious burden can be better starting material for inbred-line development, despite showing less vigorous growth. Inclusion of inferred deleterious mutations increases genomic-prediction accuracy for yield by 24.7%. Our study generates insights into the genome-wide incidence and properties of deleterious mutations and their far-reaching consequences for breeding.

Assembly statistics

Genome size (bp) 2,509,052,418
GC content 36.91%
Number of genome sequences 13
Maximum genome sequence length (bp) 243,185,762
Minimum genome sequence length (bp) 31,758,166
Average genome sequence length (bp) 193,004,032
Genome sequence N50 (bp) 207,861,452
Genome sequence N90 (bp) 164,600,000
Assembly level Chromosome
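
The statistics above can be recomputed from the sequence lengths in the assembly FASTA file. The following is a minimal sketch, not the method the database used (the page does not state it). It assumes the common definition of N50/N90 (the length of the shortest sequence in the smallest set of longest sequences that covers at least 50% or 90% of the total) and an average rounded down to a whole base pair. The function names `read_fasta_lengths`, `nx`, and `assembly_stats` are illustrative.

```python
import gzip


def read_fasta_lengths(path):
    """Return the length of each sequence in a FASTA file (gzipped or plain)."""
    opener = gzip.open if str(path).endswith(".gz") else open
    lengths = []
    current = None  # length of the record being read, None before the first header
    with opener(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                if current is not None:
                    lengths.append(current)
                current = 0
            elif current is not None:
                current += len(line)
    if current is not None:
        lengths.append(current)
    return lengths


def nx(lengths, percent):
    """Length at which the longest sequences first cover `percent` % of the total."""
    if not lengths:
        raise ValueError("no sequences given")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer arithmetic avoids floating-point rounding at the threshold.
        if running * 100 >= total * percent:
            return length


def assembly_stats(lengths):
    """Summarize sequence lengths in the same terms as the table above."""
    return {
        "genome_size": sum(lengths),
        "n_sequences": len(lengths),
        "max": max(lengths),
        "min": min(lengths),
        "mean": sum(lengths) // len(lengths),  # rounded down (assumption)
        "n50": nx(lengths, 50),
        "n90": nx(lengths, 90),
    }


# Example usage (requires the downloaded assembly file):
# stats = assembly_stats(read_fasta_lengths("GWHBKCK00000000.genome.fasta.gz"))
```

As a quick consistency check on the listed figures, 2,509,052,418 bp divided by 13 sequences and rounded down gives 193,004,032 bp, which matches the reported average.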

Assembly

The Solanum wrightii gwh_assembly A392 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHBKCK00000000.genome.fasta.gz

Gene Predictions

The Solanum wrightii gwh_assembly A392 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHBKCK00000000.gff.gz
CDS sequences (FASTA file) GWHBKCK00000000.CDS.fasta.gz
Protein sequences (FASTA file) GWHBKCK00000000.Protein.faa.gz

Functional Analysis

Functional annotation for the Solanum wrightii gwh_assembly A392 assembly is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan Solanum_wrightii_gwh_A392.Pfam.tsv.gz

S genes

Summary

Query     Chr    Size (bp)   Coordinates                               BLASTn Hit                               BLASTn %ID  Domain
SLF18     Chr_4  223591992   77780668-77779556                         Solanum tuberosum DM8.1, SLF18-2         87.6        F-box domain
SLF13     Chr_4  223591992   100714043-100712880                       Solanum tuberosum DM8.1, SLF13           85.6        F-box domain
SLF11     Chr_4  223591992   129026037-129027206                       Solanum tuberosum DM8.1, SLF11           88.4        F-box domain
SLF7      Chr_4  223591992   132673269-132672100                       Solanum tuberosum DM8.1, SLF7            90.0        F-box domain
SLF13-2Ψ  Chr_4  223591992   134521474-134522652                       Solanum tuberosum DM8.1, SLF13           86.2        -
S-RNase   Chr_4  223591992   144924380-144924610, 144924702-144925115  Solanum tuberosum MZ561411.1, SRNase-S8  78.9        Ribonuclease T2 family
SLF23     Chr_4  223591992   150859800-150858649                       Solanum neorickii MG266233.1, SLF23      88.7        F-box domain
SLF21Ψ    Chr_4  223591992   151347322-151348514                       Solanum tuberosum DM8.1, SLF21           86.1        -
SLF12     Chr_4  223591992   151479558-151480727                       Solanum tuberosum DM8.1, SLF12           84.8        F-box domain
SLF5Ψ     Chr_4  223591992   156860559-156859412                       Solanum tuberosum DM8.1, SLF5-2          88.5        -
SLF16Ψ    Chr_4  223591992   217681319-217682487                       Solanum tuberosum DM8.1, SLF16           87.6        -
SLF15     Chr_4  223591992   219722367-219723626                       Solanum tuberosum DM8.1, SLF15           86.8        F-box domain
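
The coordinates in the S-gene table mix two conventions: some entries list the larger position first, and the S-RNase entry lists two comma-separated segments. The sketch below parses such strings under two assumptions that the page does not confirm: coordinates are 1-based and inclusive, and a start greater than the end indicates the reverse strand. The function names `parse_coordinates` and `span_length` are illustrative.

```python
def parse_coordinates(text):
    """Parse 'a-b' or 'a-b, c-d' into a list of (low, high, strand) tuples."""
    segments = []
    for part in text.split(","):
        start, end = (int(value) for value in part.strip().split("-"))
        # Assumption: start > end marks a feature on the reverse strand.
        strand = "+" if start <= end else "-"
        segments.append((min(start, end), max(start, end), strand))
    return segments


def span_length(segments):
    """Total length in bp, assuming 1-based inclusive coordinates."""
    return sum(high - low + 1 for low, high, _ in segments)


slf18 = parse_coordinates("77780668-77779556")
s_rnase = parse_coordinates("144924380-144924610, 144924702-144925115")

print(slf18, span_length(slf18))
print(s_rnase, span_length(s_rnase))
```

Under these assumptions, both example lengths are multiples of three, as expected for coding sequence, and every interval can also be checked against the listed chromosome size of 223,591,992 bp.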

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences