Solanum stramoniifolium gwh_assembly PI 487464 01 Assembly & Annotation

Overview

Analysis Name Solanum stramoniifolium gwh_assembly PI 487464 01 Assembly & Annotation
Sequencing technology PacBio HiFi
Assembly method Hifiasm v0.16.1-r375
Release Date 2023-03-21
Reference Publication(s)

Wu Y, Li D, Hu Y, Li H, Ramstein GP, Zhou S, Zhang X, Bao Z, Zhang Y, Song B, Zhou Y, Zhou Y, Gagnon E, Särkinen T, Knapp S, Zhang C, Städler T, Buckler ES, Huang S. Phylogenomic discovery of deleterious mutations facilitates hybrid potato breeding. Cell. 2023 May 25;186(11):2313-2328.e15. doi: 10.1016/j.cell.2023.04.008.

Summary

Hybrid potato breeding will transform the crop from a clonally propagated tetraploid to a seed-reproducing diploid. Historical accumulation of deleterious mutations in potato genomes has hindered the development of elite inbred lines and hybrids. Utilizing a whole-genome phylogeny of 92 Solanaceae and its sister clade species, we employ an evolutionary strategy to identify deleterious mutations. The deep phylogeny reveals the genome-wide landscape of highly constrained sites, comprising ∼2.4% of the genome. Based on a diploid potato diversity panel, we infer 367,499 deleterious variants, of which 50% occur at non-coding and 15% at synonymous sites. Counterintuitively, diploid lines with relatively high homozygous deleterious burden can be better starting material for inbred-line development, despite showing less vigorous growth. Inclusion of inferred deleterious mutations increases genomic-prediction accuracy for yield by 24.7%. Our study generates insights into the genome-wide incidence and properties of deleterious mutations and their far-reaching consequences for breeding.

Assembly statistics

Genome size (bp) 2,551,001,719
GC content 36.07%
Contig sequence No. 554
Maximum contig sequence length (bp) 117,118,335
Minimum contig sequence length (bp) 15,780
Average contig sequence length (bp) 4,604,696
Contig N50 (bp) 49,321,352
Contig N90 (bp) 10,083,157
Assembly level Contig
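
As an illustration of how the N50 and N90 figures above are conventionally defined, the following minimal sketch computes an Nx value from a list of contig lengths. It assumes the usual definition (the length of the shortest contig in the smallest set of longest contigs that together cover at least x% of the assembly); the function name `nx` and the toy lengths are hypothetical and are not taken from this assembly. The last line checks the reported average contig length against the reported genome size and contig count.

```python
def nx(lengths, x):
    """Return the Nx value for a list of contig lengths.

    Contigs are taken from longest to shortest until their cumulative
    length reaches at least x percent of the total; the length of the
    last contig taken is the Nx value.
    """
    if not lengths:
        raise ValueError("no contig lengths given")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer comparison avoids floating-point rounding at the threshold.
        if running * 100 >= total * x:
            return length


# Toy lengths (hypothetical), total = 30 bp.
toy_lengths = [10, 8, 6, 4, 2]
print(nx(toy_lengths, 50))  # 8
print(nx(toy_lengths, 90))  # 4

# Average contig length from the figures reported above (rounded down).
print(2_551_001_719 // 554)  # 4604696
```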

Assembly

The Solanum stramoniifolium gwh_assembly PI 487464 01 Assembly file is available in FASTA format.

Downloads

Contigs (FASTA file) GWHBKCH00000000.genome.fasta.gz
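
Once the FASTA file has been downloaded, per-sequence lengths and overall GC content can be derived with a short script. The sketch below is one possible approach, not the method used to produce the statistics on this page: the function name `fasta_lengths_and_gc` is hypothetical, and it assumes GC content is measured over unambiguous A/C/G/T bases only. It is demonstrated on a toy in-memory record rather than the real file.

```python
import io


def fasta_lengths_and_gc(handle):
    """Return ({sequence name: length}, GC fraction) for a FASTA text stream."""
    lengths = {}
    gc_bases = 0
    acgt_bases = 0
    name = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the name.
            parts = line[1:].split()
            name = parts[0] if parts else ""
            lengths[name] = 0
        elif name is not None:
            seq = line.upper()
            lengths[name] += len(seq)
            gc_bases += seq.count("G") + seq.count("C")
            # Assumption: ambiguous bases such as N are excluded from the denominator.
            acgt_bases += sum(seq.count(base) for base in "ACGT")
    gc_fraction = gc_bases / acgt_bases if acgt_bases else 0.0
    return lengths, gc_fraction


# Toy record (hypothetical sequence names and bases).
toy = io.StringIO(">c1\nACGT\nGGCC\n>c2\nATAT\n")
lengths, gc = fasta_lengths_and_gc(toy)
print(lengths)  # {'c1': 8, 'c2': 4}
print(gc)       # 0.5
```

For the real download, the same function could be given a handle opened with `gzip.open("GWHBKCH00000000.genome.fasta.gz", "rt")`, assuming the file sits in the working directory.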

Gene Predictions

The Solanum stramoniifolium gwh_assembly PI 487464 01 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHBKCH00000000.gff.gz
CDS sequences (FASTA file) GWHBKCH00000000.CDS.fasta.gz
Protein sequences (FASTA file) GWHBKCH00000000.Protein.faa.gz
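
A quick way to inspect a GFF3 file is to tally its feature types (column 3 of the nine tab-separated columns defined by the GFF3 format). The sketch below is illustrative only: the function name `count_feature_types` and the toy records are hypothetical, and the feature types actually present in this annotation are not stated on this page.

```python
from collections import Counter


def count_feature_types(lines):
    """Count GFF3 feature types (column 3), skipping comments and blank lines."""
    counts = Counter()
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        columns = line.split("\t")
        if len(columns) != 9:
            # Not a well-formed GFF3 feature line; ignore it.
            continue
        counts[columns[2]] += 1
    return counts


# Toy records (hypothetical identifiers and coordinates).
toy_gff = [
    "##gff-version 3",
    "ctg1\ttoy\tgene\t1\t100\t.\t+\t.\tID=g1",
    "ctg1\ttoy\tmRNA\t1\t100\t.\t+\t.\tID=t1;Parent=g1",
    "ctg1\ttoy\tCDS\t1\t40\t.\t+\t0\tID=c1;Parent=t1",
    "ctg1\ttoy\tCDS\t60\t100\t.\t+\t2\tID=c2;Parent=t1",
]
print(count_feature_types(toy_gff))
```

For the real download, the function could be given a handle opened with `gzip.open("GWHBKCH00000000.gff.gz", "rt")`, assuming the file sits in the working directory.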

Functional Analysis

Functional annotation for the Solanum stramoniifolium gwh_assembly PI 487464 01 assembly is available for download below. The proteins were analyzed with InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan Solanum_stramoniifolium_gwh_PI_487464_01.Pfam.tsv.gz

S genes

Summary

Query | Contig | Size (bp) | Coordinates | BLASTn Hit | BLASTn %ID | Domain
SLF16Ψ | ptg000014l | 54696289 | 52518498-52519666 | Solanum tuberosum DM8.1, SLF16 | 87.6 | -
SLF15 | ptg000014l | 54696289 | 54315251-54316369 | Solanum lycopersicum SL2.31, SLF15 | 85.8 | F-box domain
S-RNase | ptg000052l | 60840350 | 9578784-9578551, 9578462-9578049 | Solanum tuberosum MZ561414.1, SRNase-S11 | 85.9 | Ribonuclease T2 family
SLF11 | ptg000052l | 60840350 | 14193281-14194450 | Solanum tuberosum DM8.1, SLF11 | 88.4 | F-box domain
SLF7 | ptg000052l | 60840350 | 16581257-16580088 | Solanum tuberosum DM8.1, SLF7 | 89.7 | F-box domain
SLF13 | ptg000052l | 60840350 | 18022093-18020915 | Solanum tuberosum DM8.1, SLF13 | 85.9 | F-box domain
SLF18 | ptg000052l | 60840350 | 58029291-58028179 | Solanum tuberosum DM8.1, SLF18-2 | 88.0 | F-box domain
SLF21 | ptg000060l | 12730208 | 11107550-11108764 | Solanum tuberosum DM8.1, SLF21 | 86.5 | F-box domain
SLF5 | ptg000068l | 10083157 | 5583832-5582672 | Solanum tuberosum DM8.1, SLF5-2 | 87.6 | F-box domain
SLF5-2 | ptg000090l | 7560583 | 1073442-1074602 | Solanum tuberosum DM8.1, SLF5-2 | 87.8 | F-box domain
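
Several coordinate pairs in the S gene summary are listed with the larger position first, and the S-RNase entry has two comma-separated segments. The sketch below shows one way to parse such strings. It rests on two assumptions that this page does not state: that a descending pair denotes a hit on the reverse strand, and that coordinates are 1-based and inclusive. The function names are hypothetical.

```python
def parse_coordinates(text):
    """Parse 'a-b' or 'a-b, c-d' into a list of (start, end, strand) tuples.

    Assumption: a pair written as larger-smaller lies on the reverse strand.
    """
    segments = []
    for part in text.split(","):
        first, second = (int(value) for value in part.strip().split("-"))
        strand = "+" if first <= second else "-"
        segments.append((min(first, second), max(first, second), strand))
    return segments


def total_length(segments):
    """Sum segment lengths, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end, _ in segments)


s_rnase = parse_coordinates("9578784-9578551, 9578462-9578049")
print(s_rnase)                # [(9578551, 9578784, '-'), (9578049, 9578462, '-')]
print(total_length(s_rnase))  # 648

slf11 = parse_coordinates("14193281-14194450")
print(slf11)                  # [(14193281, 14194450, '+')]
print(total_length(slf11))    # 1170
```

Under these assumptions both totals are multiples of three, which is consistent with the segments being coding sequence, although that is an inference and not something the page states.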


© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences