Solanum habrochaites gwh_assembly LA0407 Assembly & Annotation

Overview

Analysis Name Solanum habrochaites gwh_assembly LA0407 Assembly & Annotation
Sequencing technology PacBio
Assembly method hifiasm v1.0
Release Date 2022-06-21
Reference Publication(s)

Yu X, Qu M, Shi Y, Hao C, Guo S, Fei Z, Gao L. Chromosome-scale genome assemblies of wild tomato relatives Solanum habrochaites and Solanum habrochaites reveal structural variants associated with stress tolerance and terpene biosynthesis. Hortic Res. 2022 Jun 20;9:uhac139. doi: 10.1093/hr/uhac139.

Assembly statistics

Genome size (bp) 950,673,181
GC content 34.97%
Genome sequence No. 1,009
Maximum genome sequence length (bp) 109,540,374
Minimum genome sequence length (bp) 3,000
Average genome sequence length (bp) 942,193
Genome sequence N50 (bp) 72,063,167
Genome sequence N90 (bp) 58,730,508
Assembly level Chromosome

Assembly

The Solanum habrochaites gwh_assembly LA0407 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHBJTH00000000.genome.fasta.gz

Gene Predictions

The Solanum habrochaites gwh_assembly LA0407 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHBJTH00000000.gff.gz
CDS sequences (FASTA file) GWHBJTH00000000.CDS.fasta.gz
Protein sequences (FASTA file) GWHBJTH00000000.Protein.faa.gz

Functional Analysis

Functional annotation for the Solanum habrochaites gwh_assembly LA0407 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Solanum_habrochaites_gwh.Pfam.tsv.gz

S genes

Summary

QueryChrSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF15Chr11095403742541084-2539825Solanum lycopersicum SL2.31, SLF1596.9F-box domain
SLF16Chr11095403743228673-3227492Solanum lycopersicum SL2.31, SLF1699F-box domain
SLF1Chr110954037454780499-54779330Solanum peruvianum KJ814846.1, SLF195.6F-box domain
SLF17Chr110954037455055308-55056474Solanum peruvianum KU960916.1, SLF1799.1F-box domain
SLF14Chr110954037455870429-55869260Solanum pennellii BK009220.1, SLF1498F-box domain
SLF4Chr110954037456052192-56051320Solanum pimpinellifolium KJ814871.1, SLF496.5F-box domain
SLF23Chr110954037456062526-56063683Solanum pennellii BK009221.1, SLF2399.1F-box domain
S-RNaseChr110954037457225365-57225595
57225688-57226101
Solanum habrochaites AB072478.1,
Sh_hgSRN1
85.6Ribonuclease T2 family
SLF2Chr110954037459138023-59136842Solanum habrochaites KJ814908.1, SLF2-S198.1F-box domain
SLF4-2Chr110954037459282492-59281326Solanum pimpinellifolium KJ814871.1, SLF496.7F-box domain
SLF7Chr110954037459718811-59717642Solanum habrochaites KJ814923.1, SLF7-S198.7F-box domain
SLF20Chr110954037459845664-59844498Solanum peruvianum KU960913.1, SLF2098.1F-box domain
SLF10ΨChr110954037460312759-60313988Solanum pimpinellifolium KJ814876.1, SLF1097.7-
SLF9Chr110954037460547814-60548956Solanum pimpinellifolium KJ814875.1, SLF998.3F-box domain
SLF21Chr110954037461601909-61603207Solanum peruvianum KU960914.1, SLF2198.6F-box domain
SLF11Chr110954037463693780-63694952Solanum pimpinellifolium KJ814877.1, SLF1198.6F-box domain
SLF17-2Chr110954037464293419-64292238Solanum lycopersicoides KU960921.1, SLF1796.3F-box domain
SLF14-2ΨChr110954037472512834-72514004Solanum lycopersicum KJ814903.1, SLF1498.5-
SLF13Chr110954037476128658-76129644Solanum habrochaites KJ814930.1, SLF1399.2F-box domain
SLF12Chr110954037476688997-76687834Solanum habrochaites KJ814929.1, SLF1299.5F-box domain
SLF18Chr110954037484792432-84793547Solanum lycopersicum SL2.31, SLF1898F-box domain
SLF19Chr110954037484811087-84809978Solanum tuberosum DM8.1, SLF1992.6F-box domain

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences