Solanum sisymbriifolium gwh_assembly PI 381291 01 Assembly & Annotation

Overview

Analysis Name Solanum sisymbriifolium gwh_assembly PI 381291 01 Assembly & Annotation
Sequencing technology PacBio HiFi
Assembly method Hifiasm v0.16.1-r375
Release Date 2023-03-21
Reference Publication(s)

Wu Y, Li D, Hu Y, Li H, Ramstein GP, Zhou S, Zhang X, Bao Z, Zhang Y, Song B, Zhou Y, Zhou Y, Gagnon E, Särkinen T, Knapp S, Zhang C, Städler T, Buckler ES, Huang S. Phylogenomic discovery of deleterious mutations facilitates hybrid potato breeding. Cell. 2023 May 25;186(11):2313-2328.e15. doi: 10.1016/j.cell.2023.04.008.

Summary

Hybrid potato breeding will transform the crop from a clonally propagated tetraploid to a seed-reproducing diploid. Historical accumulation of deleterious mutations in potato genomes has hindered the development of elite inbred lines and hybrids. Utilizing a whole-genome phylogeny of 92 Solanaceae and its sister clade species, we employ an evolutionary strategy to identify deleterious mutations. The deep phylogeny reveals the genome-wide landscape of highly constrained sites, comprising ∼2.4% of the genome. Based on a diploid potato diversity panel, we infer 367,499 deleterious variants, of which 50% occur at non-coding and 15% at synonymous sites. Counterintuitively, diploid lines with relatively high homozygous deleterious burden can be better starting material for inbred-line development, despite showing less vigorous growth. Inclusion of inferred deleterious mutations increases genomic-prediction accuracy for yield by 24.7%. Our study generates insights into the genome-wide incidence and properties of deleterious mutations and their far-reaching consequences for breeding.

Assembly statistics

Genome size (bp) 2,110,910,040
GC content 39.46%
Contig sequence No. 1,086
Maximum contig sequence length (bp) 103,172,820
Minimum contig sequence length (bp) 22,389
Average contig sequence length (bp) 1,943,748
Contig sequence N50 (bp) 26,290,576
Contig sequence N90 (bp) 3,741,333
Assembly level Contig

Assembly

The Solanum sisymbriifolium gwh_assembly PI 381291 01 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHBKCF00000000.genome.fasta.gz

Gene Predictions

The Solanum sisymbriifolium gwh_assembly PI 381291 01 genome gene prediction files are not available.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) -
Protein sequences (FASTA file) -

Functional Analysis

Functional annotation for the Solanum sisymbriifolium gwh_assembly PI 381291 01 is not available.

Downloads

Domain from InterProScan -

S genes

Summary

QueryContigSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF16ΨGWHBKCF0000000929352838236969-235801Solanum tuberosum DM8.1, SLF1688.2 -
SLF7GWHBKCF000000181760561110190684-10191853Solanum tuberosum DM8.1, SLF789.8 F-box domain
SLF11GWHBKCF000000181760561116580078-16578909Solanum tuberosum DM8.1, SLF1189.3 F-box domain
SLF23GWHBKCF000000343884239628610522-28609371Solanum lycopersicoides
KU960925.1, SLF23
88.3 F-box domain
SLF5GWHBKCF00000036203818528458441-8457272Solanum tuberosum DM8.1, SLF5-289.4 F-box domain
SLF21GWHBKCF0000004669901676891855-6893069Solanum tuberosum DM8.1, SLF2186.8 F-box domain
SLF11-2GWHBKCF0000004985640974814217-4815386Solanum tuberosum DM8.1, SLF1189.2 F-box domain
SLF22GWHBKCF00000050148586821016633-1015476Solanum tuberosum DM8.1, SLF22-288.9 F-box domain
SLF21-2GWHBKCF00000050148586823806863-3805649Solanum tuberosum DM8.1, SLF2187.0 F-box domain
SLF23-2GWHBKCF000000501485868211317542-11316391Solanum lycopersicoides
KU960925.1, SLF23
88.7 F-box domain
SLF13GWHBKCF000000601063426510544147-10542984Solanum tuberosum DM8.1, SLF1386.8 F-box domain
SLF5-2GWHBKCF00000063116252464949358-4948216Solanum tuberosum DM8.1, SLF5-288.5 F-box domain
SLF7-2GWHBKCF00000063116252467428716-7427547Solanum tuberosum DM8.1, SLF790.7 F-box domain
SLF12GWHBKCF000000631162524610824427-10823258Solanum tuberosum DM8.1, SLF1285.4 F-box domain
SLF19GWHBKCF00000087116388864853140-4854255Solanum tuberosum DM8.1, SLF1992.1 F-box domain
SLF18GWHBKCF00000087116388864945827-4944718Solanum tuberosum DM8.1, SLF1887.3 F-box domain
SLF13-2GWHBKCF0000019052403324329406-4328243Solanum tuberosum DM8.1, SLF1386.6 F-box domain
SLF15GWHBKCF0000042823013071861-70596Solanum tuberosum DM8.1, SLF1585.7 F-box domain

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences