Solanum melongena V4.1 Assembly & Annotation

Overview

Analysis Name Solanum melongena V4.1 Assembly & Annotation
Sequencing technology Illumina(long-insert mate-pair libraries), BioNano and Hi-C reads
Assembly method SOAPdenovo2
Release Date 2021-10-18
Reference Publication(s)

Barchi L, Rabanus-Wallace MT, Prohens J, Toppino L, Padmarasu S, Portis E, Rotino GL, Stein N, Lanteri S, Giuliano G. Improved genome assembly and pan-genome provide key insights into eggplant domestication and breeding. Plant J. 2021 Jul;107(2):579-596. doi: 10.1111/tpj.15313.

Summary

Eggplant (Solanum melongena L.) is an important horticultural crop and one of the most widely grown vegetables from the Solanaceae family. It was domesticated from a wild, prickly progenitor carrying small, round, non-anthocyanic fruits. We obtained a novel, highly contiguous genome assembly of the eggplant ‘67/3’ reference line, by Hi-C retrofitting of a previously released short read- and optical mapping-based assembly. The sizes of the 12 chromosomes and the fraction of anchored genes in the improved assembly were comparable to those of a chromosome-level assembly. We resequenced 23 accessions of S. melongena representative of the worldwide phenotypic, geographic, and genetic diversity of the species, and one each from the closely related species Solanum insanum and Solanum incanum. The eggplant pan-genome contained approximately 51.5 additional megabases and 816 additional genes compared with the reference genome, while the pan-plastome showed little genetic variation. We identified 53 selective sweeps related to fruit color, prickliness, and fruit shape in the nuclear genome, highlighting selection leading to the emergence of present-day S. melongena cultivars from its wild ancestors. Candidate genes underlying the selective sweeps included a MYBL1 repressor and CHALCONE ISOMERASE (for fruit color), homologs of Arabidopsis GLABRA1 and GLABROUS INFLORESCENCE STEMS2 (for prickliness), and orthologs of tomato FW2.2, OVATE, LOCULE NUMBER/WUSCHEL, SUPPRESSOR OF OVATE, and CELL SIZE REGULATOR (for fruit size/shape), further suggesting that selection for the latter trait relied on a common set of orthologous genes in tomato and eggplant.

Assembly statistics

Genome size (bp) 1,164,419,523
Genome sequence No. 13
Genome sequence N50 (bp) 92,127,910
Genome sequence L50 5
Maximum genome sequence length (bp) 114,254,532
Assembly level Chromosome

Assembly

The Solanum melongena V4.1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) Eggplant_V4.1.fa.gz

Gene Predictions

The Solanum melongena V4.1 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Eggplant_V4.1_function_IPR_final.sorted.gff.gz
CDS sequences (FASTA file) Eggplant_V4.1_transcripts.function.fa.gz
Protein sequences (FASTA file) Eggplant_V4.1_protein.function.fa.gz

Functional Analysis

Functional annotation for the Solanum melongena V4.1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Eggplant_V4.1_Interproscan.tsv.gz

S genes

Summary

QueryChrSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF19Ψ111425453238718308-38719413Solanum tuberosum DM8.1, SLF1990.1-
SLF18111425453239574216-39573116Solanum tuberosum DM8.1, SLF1887F-box domain
SLF23Ψ111425453288871171-88872302Solanum neorickii MG266239.1, SLF2384.6-
SLF17Ψ111425453289100049-89101206Solanum tuberosum DM8.1, SLF1780.7-
SLF16Ψ1114254532108956996-108958164Solanum tuberosum DM8.1, SLF1686.8-
SLF151114254532111814157-111815422Solanum tuberosum DM8.1, SLF1584.3F-box domain

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences