Iochroma cyaneum v1.0 Assembly & Annotation

Overview

Analysis Name Iochroma cyaneum v1.0 Assembly & Annotation
Sequencing technology Illumina, Hi-C and Nanopore sequencing
Assembly method MaSurca v3.3.2
Release Date 2022-06-06
Reference Publication(s)

Powell AF, Zhang J, Hauser D, Vilela JA, Hu A, Gates DJ, Mueller LA, Li FW, Strickler SR, Smith SD. Genome sequence for the blue-flowered Andean shrub Iochroma cyaneum reveals extensive discordance across the berry clade of Solanaceae. Plant Genome. 2022 Sep;15(3):e20223. doi: 10.1002/tpg2.20223.

Abstract

The tomato (Solanum lycopersicum L.) family, Solanaceae, is a model clade for a wide range of applied and basic research questions. Currently, reference-quality genomes are available for over 30 species from seven genera, and these include numerous crops as well as wild species [e.g., Jaltomata sinuosa (Miers) Mione and Nicotiana attenuata Torr. ex S. Watson]. Here we present the genome of the showy-flowered Andean shrub Iochroma cyaneum (Lindl.) M. L. Green, a woody lineage from the tomatillo (Physalis philadelphica Lam.) subfamily Physalideae. The assembled size of the genome (2.7 Gb) is more similar in size to pepper (Capsicum annuum L.) (2.6 Gb) than to other sequenced diploid members of the berry clade of Solanaceae [e.g., potato (Solanum tuberosum L.), tomato, and Jaltomata]. Our assembly recovers 92% of the conserved orthologous set, suggesting a nearly complete genome for this species. Most of the genomic content is repetitive (69%), with Gypsy elements alone accounting for 52% of the genome. Despite the large amount of repetitive content, most of the 12 I. cyaneum chromosomes are highly syntenic with tomato. Bayesian concordance analysis provides strong support for the berry clade, including I. cyaneum, but reveals extensive discordance along the backbone, with placement of chili pepper and Jaltomata being highly variable across gene trees. The I. cyaneum genome contributes to a growing wealth of genomic resources in Solanaceae and underscores the need for expanded sampling of diverse berry genomes to dissect major morphological transitions.

Assembly statistics

Genome assembly total length, Mb 2,716.02
Percentage of assembly assigned to chromosomes 84.13
No. of contigs 37,881
Contig N50, kb 212.94
Longest contig, kb 3,996.25
No. of N bases, Mb 0.64
No. of gaps 19,176
No. of genes 38,625
Repeat percentage of genome, % 69.35
Assembly level Chromosome

Assembly

The Iochroma cyaneum v1.0 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) IC_v1.0_chromosomes.fa.gz

Gene Predictions

The Iochroma cyaneum v1.0 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) IC_v1.0_gene_models.gff.gz
CDS sequences (FASTA file) IC_v1.0_cds.fa.gz
Protein sequences (FASTA file) IC_v1.0_proteins.fa.gz

Functional Analysis

Functional annotation for the Iochroma cyaneum v1.0 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Iochroma_cyaneum_v1.0.Pfam.tsv.gz

S genes

Summary

QueryChrSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF15IC1.0ch01135778923759159-757894Solanum tuberosum DM8.1, SLF1586.8 F-box domain
SLF16IC1.0ch011357789238574522-8573344Solanum tuberosum DM8.1, SLF1680.8 F-box domain
SLFIC1.0ch0113577892352141785-52142966Nicotiana alata EF420260.1, NaDD10-S682.8 F-box domain
SLF5IC1.0ch0113577892356692857-56694023Petunia hybrida AB933093.1, Sm-SLF583.8 F-box domain
SLF12IC1.0ch0113577892358492005-58490881Solanum tuberosum DM8.1, SLF1285.1 F-box domain
SLF9IC1.0ch0113577892366825002-66826153Nicotiana alata EF420252.1, NaDD2-S186.5 F-box domain
S-RNaseIC1.0ch0113577892367194347-67194105,
67193991-67193566
Solanum tuberosum
MZ561409.1, SRNase-S6
86.4 Ribonuclease T2 family
SLF21IC1.0ch0113577892374902976-74904155Solanum tuberosum DM8.1, SLF2181.2 F-box domain
SLF21-2IC1.0ch0113577892374977333-74976137Solanum tuberosum DM8.1, SLF2181.5 F-box domain
SLF18IC1.0ch01135778923100040234-100041379Solanum tuberosum DM8.1, SLF18-286.5 F-box domain

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences