Analysis Name | Solanum melongena V4.1 Assembly & Annotation |
Sequencing technology | Illumina(long-insert mate-pair libraries), BioNano and Hi-C reads |
Assembly method | SOAPdenovo2 |
Release Date | 2021-10-18 |
Barchi L, Rabanus-Wallace MT, Prohens J, Toppino L, Padmarasu S, Portis E, Rotino GL, Stein N, Lanteri S, Giuliano G. Improved genome assembly and pan-genome provide key insights into eggplant domestication and breeding. Plant J. 2021 Jul;107(2):579-596. doi: 10.1111/tpj.15313.
SummaryEggplant (Solanum melongena L.) is an important horticultural crop and one of the most widely grown vegetables from the Solanaceae family. It was domesticated from a wild, prickly progenitor carrying small, round, non-anthocyanic fruits. We obtained a novel, highly contiguous genome assembly of the eggplant ‘67/3’ reference line, by Hi-C retrofitting of a previously released short read- and optical mapping-based assembly. The sizes of the 12 chromosomes and the fraction of anchored genes in the improved assembly were comparable to those of a chromosome-level assembly. We resequenced 23 accessions of S. melongena representative of the worldwide phenotypic, geographic, and genetic diversity of the species, and one each from the closely related species Solanum insanum and Solanum incanum. The eggplant pan-genome contained approximately 51.5 additional megabases and 816 additional genes compared with the reference genome, while the pan-plastome showed little genetic variation. We identified 53 selective sweeps related to fruit color, prickliness, and fruit shape in the nuclear genome, highlighting selection leading to the emergence of present-day S. melongena cultivars from its wild ancestors. Candidate genes underlying the selective sweeps included a MYBL1 repressor and CHALCONE ISOMERASE (for fruit color), homologs of Arabidopsis GLABRA1 and GLABROUS INFLORESCENCE STEMS2 (for prickliness), and orthologs of tomato FW2.2, OVATE, LOCULE NUMBER/WUSCHEL, SUPPRESSOR OF OVATE, and CELL SIZE REGULATOR (for fruit size/shape), further suggesting that selection for the latter trait relied on a common set of orthologous genes in tomato and eggplant.
Assembly statistics
Genome size (bp) | 1,164,419,523 |
Genome sequence No. | 13 |
Genome sequence N50 (bp) | 92,127,910 |
Genome sequence L50 | 5 |
Maximum genome sequence length (bp) | 114,254,532 |
Assembly level | Chromosome |
The Solanum melongena V4.1 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | Eggplant_V4.1.fa.gz |
The Solanum melongena V4.1 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | Eggplant_V4.1_function_IPR_final.sorted.gff.gz |
CDS sequences (FASTA file) | Eggplant_V4.1_transcripts.function.fa.gz |
Protein sequences (FASTA file) | Eggplant_V4.1_protein.function.fa.gz |
Functional annotation for the Solanum melongena V4.1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).
Downloads
Domain from InterProScan | Eggplant_V4.1_Interproscan.tsv.gz |
Summary
Query | Chr | Size(bp) | Coordinates | BLASTn Hit | BLASTn %ID | Domain |
SLF19Ψ | 1 | 114254532 | 38718308-38719413 | Solanum tuberosum DM8.1, SLF19 | 90.1 | - |
SLF18 | 1 | 114254532 | 39574216-39573116 | Solanum tuberosum DM8.1, SLF18 | 87 | F-box domain |
SLF23Ψ | 1 | 114254532 | 88871171-88872302 | Solanum neorickii MG266239.1, SLF23 | 84.6 | - |
SLF17Ψ | 1 | 114254532 | 89100049-89101206 | Solanum tuberosum DM8.1, SLF17 | 80.7 | - |
SLF16Ψ | 1 | 114254532 | 108956996-108958164 | Solanum tuberosum DM8.1, SLF16 | 86.8 | - |
SLF15 | 1 | 114254532 | 111814157-111815422 | Solanum tuberosum DM8.1, SLF15 | 84.3 | F-box domain |
Nucleotide
Protein