Jaltomata sinuosa ASM399621v1 Assembly & Annotation

Overview

Analysis Name Jaltomata sinuosa ASM399621v1 Assembly & Annotation
Sequencing technology PacBio; Illumina
Assembly method MaSuRCA v. 3.2.2
Release Date 2019-01-07
Reference Publication(s)

Wu M, Kostyun JL, Moyle LC. Genome Sequence of Jaltomata Addresses Rapid Reproductive Trait Evolution and Enhances Comparative Genomics in the Hyper-Diverse Solanaceae. Genome Biol Evol. 2019 Feb 1;11(2):335-349. doi: 10.1093/gbe/evy274.

Abstract

Within the economically important plant family Solanaceae, Jaltomata is a rapidly evolving genus that has extensive diversity in flower size and shape, as well as fruit and nectar color, among its ∼80 species. Here, we report the whole-genome sequencing, assembly, and annotation, of one representative species (Jaltomata sinuosa) from this genus. Combining PacBio long reads (25×) and Illumina short reads (148×) achieved an assembly of ∼1.45 Gb, spanning ∼96% of the estimated genome. Ninety-six percent of curated single-copy orthologs in plants were detected in the assembly, supporting a high level of completeness of the genome. Similar to other Solanaceous species, repetitive elements made up a large fraction (∼80%) of the genome, with the most recently active element, Gypsy, expanding across the genome in the last 1–2 Myr. Computational gene prediction, in conjunction with a merged transcriptome data set from 11 tissues, identified 34,725 protein-coding genes. Comparative phylogenetic analyses with six other sequenced Solanaceae species determined that Jaltomata is most likely sister to Solanum, although a large fraction of gene trees supported a conflicting bipartition consistent with substantial introgression between Jaltomata and Capsicum after these species split. We also identified gene family dynamics specific to Jaltomata, including expansion of gene families potentially involved in novel reproductive trait development, and loss of gene families that accompanied the loss of self-incompatibility. This high-quality genome will facilitate studies of phenotypic diversification in this rapidly radiating group and provide a new point of comparison for broader analyses of genomic evolution across the Solanaceae.

Assembly statistics

Genome size 1.4 Gb
Number of scaffolds 7,667
Scaffold N50 397.6 kb
Scaffold L50 965
Number of contigs 8,210
Contig N50 372.5 kb
Contig L50 1,044
Assembly level Scaffold
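
The N50 and L50 values above follow the usual definition: sort the sequences from longest to shortest, accumulate their lengths until the running total reaches half of the assembly size, then report the length of the last sequence added (N50) and how many sequences were needed (L50). The following is a minimal sketch of that calculation, not the method used to produce the figures on this page. The function names `fasta_lengths` and `n50_l50` are hypothetical. Run on the assembly FASTA, it gives scaffold-level values only; contig-level values would additionally require splitting scaffolds at runs of N, which is not shown.

```python
import gzip
from typing import Iterable, List, Tuple


def fasta_lengths(path: str) -> List[int]:
    """Return the length of every sequence in a FASTA file (plain or .gz)."""
    opener = gzip.open if path.endswith(".gz") else open
    lengths: List[int] = []
    current = None  # length of the record being read, None before the first header
    with opener(path, "rt") as handle:
        for line in handle:
            if line.startswith(">"):
                if current is not None:
                    lengths.append(current)
                current = 0
            elif current is not None:
                current += len(line.strip())
    if current is not None:
        lengths.append(current)
    return lengths


def n50_l50(lengths: Iterable[int]) -> Tuple[int, int]:
    """Return (N50, L50) for a collection of sequence lengths."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("no sequence lengths given")
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            return length, count
    raise AssertionError("unreachable: the running total always reaches half")


if __name__ == "__main__":
    # Assumes the file listed under Downloads has been saved locally.
    path = "GCA_003996215.1_ASM399621v1_genomic.fna.gz"
    n50, l50 = n50_l50(fasta_lengths(path))
    print(f"Scaffold N50: {n50} bp, scaffold L50: {l50}")
```
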

Assembly

The Jaltomata sinuosa ASM399621v1 Assembly file is available in FASTA format.

Downloads

Genome assembly, scaffold level (FASTA file) GCA_003996215.1_ASM399621v1_genomic.fna.gz

Gene Predictions

The Jaltomata sinuosa ASM399621v1 genome gene prediction files are not available.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) -
Protein sequences (FASTA file) -

Functional Analysis

Functional annotation for the Jaltomata sinuosa ASM399621v1 genome is not available.

Downloads

Domain from InterProScan -

S genes

Summary

Query  Scaffold        Size (bp)  Coordinates    BLASTn Hit                        BLASTn %ID  Domain
SLF15  QJPP01000904.1  246968     14882-13623    Solanum tuberosum DM8.1, SLF15    88.7        F-box domain
SLF16  QJPP01002257.1  674078     269897-271129  Solanum tuberosum DM8.1, SLF16    84.3        F-box domain
SLF18  QJPP01007549.1  169646     83960-82827    Solanum tuberosum DM8.1, SLF18-2  86.6        F-box domain
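
The coordinates in the summary table can be used to pull each S-gene region out of its scaffold. The sketch below rests on two assumptions that this page does not state: coordinates are 1-based and inclusive, and a start greater than the end (as for SLF15 and SLF18) marks a hit on the reverse strand. The names `S_GENE_HITS`, `parse_coordinates`, `span_length` and `extract_region` are hypothetical.

```python
from typing import Dict, Tuple

# Query -> (scaffold, coordinates), copied from the summary table.
S_GENE_HITS: Dict[str, Tuple[str, str]] = {
    "SLF15": ("QJPP01000904.1", "14882-13623"),
    "SLF16": ("QJPP01002257.1", "269897-271129"),
    "SLF18": ("QJPP01007549.1", "83960-82827"),
}

_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def parse_coordinates(text: str) -> Tuple[int, int, str]:
    """Split 'start-end' into (low, high, strand).

    Assumption: start > end means the hit lies on the reverse strand.
    """
    start, end = (int(part) for part in text.split("-"))
    strand = "+" if start <= end else "-"
    return min(start, end), max(start, end), strand


def span_length(text: str) -> int:
    """Length in bp of a 'start-end' span (assumed 1-based, inclusive)."""
    low, high, _ = parse_coordinates(text)
    return high - low + 1


def extract_region(sequence: str, coordinates: str) -> str:
    """Return the region of `sequence`, reverse-complemented on the minus strand."""
    low, high, strand = parse_coordinates(coordinates)
    region = sequence[low - 1:high]  # convert 1-based inclusive to a Python slice
    if strand == "-":
        region = region.translate(_COMPLEMENT)[::-1]
    return region


if __name__ == "__main__":
    for query, (scaffold, coords) in S_GENE_HITS.items():
        _, _, strand = parse_coordinates(coords)
        print(query, scaffold, coords, strand, span_length(coords), "bp")
```

Under these assumptions the three spans are 1,260 bp, 1,233 bp and 1,134 bp long, each a multiple of three.
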

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences