Solanum ochranthum gwh_assembly PI 230519 01 Assembly & Annotation

Overview

Analysis Name Solanum ochranthum gwh_assembly PI 230519 01 Assembly & Annotation
Sequencing technology PacBio HiFi
Assembly method Hifiasm v0.16.1-r375
Release Date 2023-03-21
Reference Publication(s)

Wu Y, Li D, Hu Y, Li H, Ramstein GP, Zhou S, Zhang X, Bao Z, Zhang Y, Song B, Zhou Y, Zhou Y, Gagnon E, Särkinen T, Knapp S, Zhang C, Städler T, Buckler ES, Huang S. Phylogenomic discovery of deleterious mutations facilitates hybrid potato breeding. Cell. 2023 May 25;186(11):2313-2328.e15. doi: 10.1016/j.cell.2023.04.008.

Summary

Hybrid potato breeding will transform the crop from a clonally propagated tetraploid to a seed-reproducing diploid. Historical accumulation of deleterious mutations in potato genomes has hindered the development of elite inbred lines and hybrids. Utilizing a whole-genome phylogeny of 92 Solanaceae and its sister clade species, we employ an evolutionary strategy to identify deleterious mutations. The deep phylogeny reveals the genome-wide landscape of highly constrained sites, comprising ∼2.4% of the genome. Based on a diploid potato diversity panel, we infer 367,499 deleterious variants, of which 50% occur at non-coding and 15% at synonymous sites. Counterintuitively, diploid lines with relatively high homozygous deleterious burden can be better starting material for inbred-line development, despite showing less vigorous growth. Inclusion of inferred deleterious mutations increases genomic-prediction accuracy for yield by 24.7%. Our study generates insights into the genome-wide incidence and properties of deleterious mutations and their far-reaching consequences for breeding.

Assembly statistics

Genome size (bp) 957,054,186
GC content 35.27%
Genome sequence No. 25
Maximum genome sequence length (bp) 105,267,135
Minimum genome sequence length (bp) 32,409
Average genome sequence length (bp) 38,282,167
Genome sequence N50 (bp) 73,620,363
Genome sequence N90 (bp) 56,739,671
Assembly level Chromosome

Assembly

The Solanum ochranthum gwh_assembly PI 230519 01 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHBKBY00000000.genome.fasta.gz

Gene Predictions

The Solanum ochranthum gwh_assembly PI 230519 01 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHBKBY00000000.gff.gz
CDS sequences (FASTA file) GWHBKBY00000000.CDS.fasta.gz
Protein sequences (FASTA file) GWHBKBY00000000.Protein.faa.gz

Functional Analysis

Functional annotation for the Solanum ochranthum gwh_assembly PI 230519 01 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Solanum_ochranthum_gwh_PI_230519_01.Pfam.tsv.gz

S genes

Summary

QueryChrSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF15Chr_21052671352439445-2438186Solanum lycopersicum SL2.31, SLF1595.4 F-box domain
SLF16Chr_21052671353201498-3200317Solanum lycopersicum SL2.31, SLF1696.1 F-box domain
SLF22Chr_210526713545597654-45596515Solanum lycopersicoides KU960924.1, SLF2297.1 F-box domain
SLF21Chr_210526713551291299-51292606Solanum lycopersicoides KU960923.1, SLF2197.7 F-box domain
SLF20Chr_210526713551869413-51870579Solanum lycopersicoides KU960922.1, SLF2097.5 F-box domain
SLF7Chr_210526713551949635-51950807Solanum peruvianum KJ814851.1, SLF796.4 F-box domain
S-RNaseChr_210526713552920462-52920689,
52921304-52921732
Solanum tuberosum
MZ561417.1, SRNase-S14
91.6Ribonuclease T2 family
SLF12Chr_210526713556444060-56442906Solanum tuberosum DM8.1, SLF1293.2 F-box domain
SLF5Chr_210526713556640609-56641796Solanum tuberosum DM8.1, SLF594.0 F-box domain
SLF5-2Chr_210526713556996085-56997254Solanum chilense KJ814884.1, SLF596.6 F-box domain
SLF4Chr_210526713557010180-57011346Solanum pimpinellifolium KJ814871.1, SLF497.0 F-box domain
SLF12-2ΨChr_210526713557094757-57093600Solanum tuberosum DM8.1, SLF1293.3 -
SLF5-3Chr_210526713557185937-57187124Solanum lycopersicoides KU987627.2, SLF597.3 F-box domain
SLF1Chr_210526713558455162-58453987Solanum pennellii KJ814858.1, SLF195.2 F-box domain
SLF23Chr_210526713558477973-58479145Solanum lycopersicoides KU960925.1, SLF2397.5 F-box domain
SLF17Chr_210526713558533236-58532055Solanum lycopersicoides KU960921.1, SLF1796.6 F-box domain
SLF11ΨChr_210526713561519056-61520218Solanum pennellii NM_001323461.1, SLF1196.3 -
SLF3ΨChr_210526713565414961-65416123Solanum pennellii BK009230.1, SLF397.4 -
SLF13Chr_210526713566096266-66095043Solanum pennellii NM_001323465.1, SLF1398.8 F-box domain
SLF14ΨChr_210526713568487249-68486083Solanum habrochaites KJ814931.1, SLF1490.6 -
SLF18Chr_210526713579403256-79404371Solanum lycopersicum SL2.31, SLF1895.3 F-box domain

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences