Solanum galapagense gwh_assembly LA0317 Assembly & Annotation

Overview

Analysis Name Solanum galapagense gwh_assembly LA0317 Assembly & Annotation
Sequencing technology PacBio
Assembly method hifiasm v1.0
Release Date 2022-06-21
Reference Publication(s)

Yu X, Qu M, Shi Y, Hao C, Guo S, Fei Z, Gao L. Chromosome-scale genome assemblies of wild tomato relatives Solanum habrochaites and Solanum galapagense reveal structural variants associated with stress tolerance and terpene biosynthesis. Hortic Res. 2022 Jun 20;9:uhac139. doi: 10.1093/hr/uhac139.

Assembly statistics

Genome size (bp) 859,928,289
GC content 34.82%
Genome sequence No. 1,116
Maximum genome sequence length (bp) 96,570,058
Minimum genome sequence length (bp) 18,337
Average genome sequence length (bp) 770,545
Genome sequence N50 (bp) 68,406,545
Genome sequence N90 (bp) 52,401,859
Assembly level Chromosome

Assembly

The Solanum galapagense gwh_assembly LA0317 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHBJTJ00000000.genome.fasta.gz

Gene Predictions

The Solanum galapagense gwh_assembly LA0317 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHBJTJ00000000.gff.gz
CDS sequences (FASTA file) GWHBJTJ00000000.CDS.fasta.gz
Protein sequences (FASTA file) GWHBJTJ00000000.Protein.faa.gz

Functional Analysis

Functional annotation for the Solanum galapagense gwh_assembly LA0317 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Solanum_galapagense_gwh.Pfam.tsv.gz

S genes

Summary

QueryChrSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF15Chr1965700582178022-2176763Solanum lycopersicum SL2.31, SLF1599.7F-box domain
SLF16Chr1965700582739342-2738161Solanum lycopersicum SL2.31, SLF1699.7F-box domain
SLF17ΨChr19657005847210579-47211664Solanum lycopersicum SL2.31, SLF1799.5-
SLF1ΨChr19657005849356798-49357967Solanum lycopersicum NM_001301439.2, SLF199.3-
S-RNaseChr19657005850162676-50162437,
50162339-50161914
Solanum galapagense OK091157.1,
SRNase
96.7Ribonuclease T2 family
SLF2ΨChr19657005851151099-51149918Solanum pimpinellifolium KJ814870.1, SLF299.7-
SLF12ΨChr19657005851207541-51208672Solanum lycopersicum SL2.31, SLF1299.6-
SLF4ΨChr19657005851274842-51273676Solanum lycopersicum KJ814943.1, SLF499.2-
SLF5ΨChr19657005851355607-51354439Solanum pimpinellifolium KJ814872.1, SLF5100-
SLF6Chr19657005851373102-51371957Solanum lycopersicum KJ814944.1, SLF699.8F-box domain
SLF8ΨChr19657005851932363-51931194Solanum lycopersicum SL2.31, SLF899.7-
SLF7ΨChr19657005851957102-51956005Solanum lycopersicum SL2.31, SLF7100-
SLF9Chr19657005854159120-54158056Solanum lycopersicum NM_001329461.2, SLF999.6F-box domain
SLF10ΨChr19657005854595530-54596761Solanum lycopersicum KJ814899.1, SLF1099.8-
SLF11Chr19657005856563798-56564970Solanum pimpinellifolium KJ814877.1, SLF11100F-box domain
SLF12-2Chr19657005858362328-58361165Solanum lycopersicum NM_001301441.1, SLF1299.7F-box domain
SLF13Chr19657005859215935-59214742Solanum lycopersicum NM_001301435.1, SLF1399.9F-box domain
SLF14ΨChr19657005862653581-62652411Solanum lycopersicum KJ814903.1, SLF14100-
SLF18Chr19657005873985914-73987029Solanum lycopersicum SL2.31, SLF1899.8F-box domain
SLF19Chr19657005874004951-74003842Solanum lycopersicum SL2.31, SLF1999.7F-box domain

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences