Analysis Name | Solanum aethiopicum Assembly & Annotation |
Sequencing technology | Illumina |
Assembly method | SOAPdenovo2 |
Release Date | 2019-10-01 |
Song B, Song Y, Fu Y, Kizito EB, Kamenya SN, Kabod PN, Liu H, Muthemba S, Kariba R, Njuguna J, Maina S, Stomeo F, Djikeng A, Hendre PS, Chen X, Chen W, Li X, Sun W, Wang S, Cheng S, Muchugi A, Jamnadass R, Shapiro HY, Van Deynze A, Yang H, Wang J, Xu X, Odeny DA, Liu X. Draft genome sequence of Solanum aethiopicum provides insights into disease resistance, drought tolerance, and the evolution of the genome. Gigascience. 2019 Oct 1;8(10):giz115. doi: 10.1093/gigascience/giz115.
AbstractBackground: The African eggplant (Solanum aethiopicum) is a nutritious traditional vegetable used in many African countries, including Uganda and Nigeria. It is thought to have been domesticated in Africa from its wild relative, Solanum anguivi. S.aethiopicum has been routinely used as a source of disease resistance genes for several Solanaceae crops, including Solanum melongena. A lack of genomic resources has meant that breeding of S. aethiopicum has lagged behind other vegetable crops.
Results: We assembled a 1.02-Gb draft genome of S. aethiopicum, which contained predominantly repetitive sequences (78.9%). We annotated 37,681 gene models, including 34,906 protein-coding genes. Expansion of disease resistance genes was observed via 2 rounds of amplification of long terminal repeat retrotransposons, which may have occurred ∼1.25 and 3.5 million years ago, respectively. By resequencing 65 S. aethiopicum and S. anguivi genotypes, 18,614,838 single-nucleotide polymorphisms were identified, of which 34,171 were located within disease resistance genes. Analysis of domestication and demographic history revealed active selection for genes involved in drought tolerance in both “Gilo” and “Shum” groups. A pan-genome of S. aethiopicum was assembled, containing 51,351 protein-coding genes; 7,069 of these genes were missing from the reference genome.
Conclusions: The genome sequence of S. aethiopicum enhances our understanding of its biotic and abiotic resistance. The single-nucleotide polymorphisms identified are immediately available for use by breeders. The information provided here will accelerate selection and breeding of the African eggplant, as well as other crops within the Solanaceae family.
Assembly statistics
Scaffold number | 162,187 |
Scaffold total length | 1.02 Gb |
Scaffold N50 | 516.1 kb |
Scaffold longest | 2.94 Mb |
Contig number | 231,821 |
Contig total length | 936 Mb |
Contig N50 | 25.2 kb |
Contig longest | 366.2 kb |
GC content | 33.13% |
Number of genes | 34,906 |
Total length of transposable elements | 805.7 Mb (78.23%) |
Assembly level | Scaffold |
The Solanum aethiopicum Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | Solanum_aethiopicum.genome.fa.gz |
The Solanum aethiopicum genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | Solanum_aethiopicum.gene.gff.gz |
CDS sequences (FASTA file) | Solanum_aethiopicum.gene.cds.fa.gz |
Protein sequences (FASTA file) | Solanum_aethiopicum.gene.pep.fa.gz |
Functional annotation for the Solanum aethiopicum is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).
Downloads
Domain from InterProScan | Solanum_aethiopicum.Pfam.tsv.gz |
Summary
Query | Scaffold | Size(bp) | Coordinates | BLASTn Hit | BLASTn %ID | Domain |
SLF18 | scaffold141504_cov61 | 1637287 | 1394419-1395519 | Solanum tuberosum DM8.1 SLF18 | 86.6 | F-box domain |
SLF19 | scaffold149241_cov62 | 1425273 | 999715-1000830 | Solanum tuberosum DM8.1 SLF19 | 89.9 | F-box domain |
SLF15 | scaffold149356_cov63 | 2300088 | 790925-792190 | Solanum tuberosum DM8.1 SLF15 | 84.5 | F-box domain |
SLF16 | scaffold149356_cov63 | 2300088 | 2077817-2076645 | Solanum tuberosum DM8.1 SLF16 | 87.5 | F-box domain |
SLF23ψ | scaffold150928_cov63 | 230831 | 9914-8757 | Solanum lycopersicoides KU960925.1, SLF23 | 84.4 | - |
SLF11ψ | scaffold150938_cov63 | 702656 | 443047-444222 | Solanum tuberosum DM8.1 SLF11 | 87.4 | - |
Nucleotide
Protein