Capsicum baccatum 'PBC81 (Isolate)' ASM227188v2 Assembly & Annotation

Overview

Analysis Name Capsicum baccatum 'PBC81 (Isolate)' ASM227188v2 Assembly & Annotation
Sequencing technology Illumina HiSeq
Assembly method SOAPdenovo v. 2.04-r240; SSPACE v. 2.0; Platanus v. 1.2.1
Release Date 2017-10-30
Reference Publication(s)

Kim S, Park J, Yeom SI, Kim YM, Seo E, Kim KT, Kim MS, Lee JM, Cheong K, Shin HS, Kim SB, Han K, Lee J, Park M, Lee HA, Lee HY, Lee Y, Oh S, Lee JH, Choi E, Choi E, Lee SE, Jeon J, Kim H, Choi G, Song H, Lee J, Lee SC, Kwon JK, Lee HY, Koo N, Hong Y, Kim RW, Kang WH, Huh JH, Kang BC, Yang TJ, Lee YH, Bennetzen JL, Choi D. New reference genome sequences of hot pepper reveal the massive evolution of plant disease-resistance genes by retroduplication. Genome Biol. 2017 Nov 1;18(1):210. doi: 10.1186/s13059-017-1341-9.

Abstract

Background: Transposable elements are major evolutionary forces which can cause new genome structure and species diversification. The role of transposable elements in the expansion of nucleotide-binding and leucine-rich-repeat proteins (NLRs), the major disease-resistance gene families, has been unexplored in plants. Results: We report two high-quality de novo genomes (Capsicum baccatum and C. chinense) and an improved reference genome (C. annuum) for peppers. Dynamic genome rearrangements involving translocations among chromosomes 3, 5, and 9 were detected in comparison between C. baccatum and the two other peppers. The amplification of athila LTR-retrotransposons, members of the gypsy superfamily, led to genome expansion in C. baccatum. In-depth genome-wide comparison of genes and repeats unveiled that the copy numbers of NLRs were greatly increased by LTR-retrotransposon-mediated retroduplication. Moreover, retroduplicated NLRs are abundant across the angiosperms and, in most cases, are lineage-specific. Conclusions: Our study reveals that retroduplication has played key roles for the massive emergence of NLR genes including functional disease-resistance genes in pepper plants.

Assembly statistics

Genome size 3.2 Gb
Number of chromosomes 12
Number of scaffolds 23,260
Scaffold N50 229.7 Mb
Scaffold L50 7
Number of contigs 177,021
Contig N50 57.9 kb
Contig L50 13,720
Assembly level Chromosome

Assembly

The Capsicum baccatum 'PBC81 (Isolate)' ASM227188v2 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_002271885.2_ASM227188v2_genomic.fna.gz

Gene Predictions

The Capsicum baccatum 'PBC81 (Isolate)' ASM227188v2 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GCA_002271885.2_ASM227188v2_genomic.gff.gz
CDS sequences (FASTA file) GCA_002271885.2_ASM227188v2_cds_from_genomic.fna.gz
baccatumtein sequences (FASTA file) GCA_002271885.2_ASM227188v2_protein.faa.gz

Functional Analysis

Functional annotation for the Capsicum baccatum 'PBC81 (Isolate)' ASM227188v2 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterbaccatumScan Capsicum_baccatum_ASM227188v2.Pfam.tsv.gz

S genes

Summary

QueryChromosomeSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF13CM008443.1259,277,713119081641-119082810Solanum tuberosum DM8.1, SLF1383.8 F-box domain
SLF12CM008443.1259,277,713121092403-121093557Solanum tuberosum DM8.1, SLF1280.2 F-box domain
SLF15CM008443.1259,277,713242465356-242464094Solanum tuberosum DM8.1, SLF1583.7 F-box domain

Nucleotide

baccatumtein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences