Brassica oleracea ASM3463897v1 Assembly & Annotation

Overview

Analysis Name Brassica oleracea ASM3463897v1 Assembly & Annotation
Sequencing technology Oxford Nanopore
Assembly method NextDenovo v. 2.2
Release Date 2023-12-28
Reference Publication(s)

Li X, Wang Y, Cai C, Ji J, Han F, Zhang L, Chen S, Zhang L, Yang Y, Tang Q, Bucher J, Wang X, Yang L, Zhuang M, Zhang K, Lv H, Bonnema G, Zhang Y, Cheng F. Large-scale gene expression alterations introduced by structural variation drive morphotype diversification in Brassica oleracea. Nat Genet. 2024 Mar;56(3):517-529. doi: 10.1038/s41588-024-01655-4.

Summary

Brassica oleracea, globally cultivated for its vegetable crops, consists of very diverse morphotypes, characterized by specialized enlarged organs as harvested products. This makes B. oleracea an ideal model for studying rapid evolution and domestication. We constructed a B. oleracea pan-genome from 27 high-quality genomes representing all morphotypes and their wild relatives. We identified structural variations (SVs) among these genomes and characterized these in 704 B. oleracea accessions using graph-based genome tools. We show that SVs exert bidirectional effects on the expression of numerous genes, either suppressing through DNA methylation or promoting probably by harboring transcription factor-binding elements. The following examples illustrate the role of SVs modulating gene expression: SVs promoting BoPNY and suppressing BoCKX3 in cauliflower/broccoli, suppressing BoKAN1 and BoACS4 in cabbage and promoting BoMYBtf in ornamental kale. These results provide solid evidence for the role of SVs as dosage regulators of gene expression, driving B. oleracea domestication and diversification.

Assembly statistics

Genome size581 Mb
Total ungapped length580.9 Mb
Number of chromosomes9
Number of scaffolds24
Scaffold N5068.7 Mb
Scaffold L504
Number of contigs101
Contig N5030.3 Mb
Contig L508
GC percent37
Genome coverage151.0x
Assembly levelChromosome

Assembly

The Brassica oleracea ASM3463897v1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_034638975.1_ASM3463897v1_genomic.fna.gz

Gene Predictions

The Brassica oleracea ASM3463897v1 genome gene prediction files are not available.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) -
Protein sequences (FASTA file) -

Functional Analysis

Functional annotation for the Brassica oleracea ASM3463897v1 is not available.

Downloads

Domain from InterProScan -

S genes

Summary

QueryChromosomeSize(bp)CoordinatesBLASTp HitBLASTp %ID
SRKCM068153.15630327638584351-38585608,38584046-38584306,
38583932-38584022,38583621-38583879,
38583315-38583538,38583076-38583226,
38582690-38582998
SRKb|AB052756.1_prot_BAB40987.136.45
SCRCM068153.15630327637825101-37825170,37825269-37825483XP_018438641.187.67

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences