Brassica juncea ASM1870372v1 Assembly & Annotation

Overview

Analysis Name Brassica juncea ASM1870372v1 Assembly & Annotation
Sequencing technology PacBio Sequel; Illumina HiSeq
Assembly method FALCON v. 2018
Release Date 2021-06-04
Reference Publication(s)

Kang L, Qian L, Zheng M, Chen L, Chen H, Yang L, You L, Yang B, Yan M, Gu Y, Wang T, Schiessl SV, An H, Blischak P, Liu X, Lu H, Zhang D, Rao Y, Jia D, Zhou D, Xiao H, Wang Y, Xiong X, Mason AS, Chris Pires J, Snowdon RJ, Hua W, Liu Z. Genomic insights into the origin, domestication and diversification of Brassica juncea. Nat Genet. 2021 Sep;53(9):1392-1402. doi: 10.1038/s41588-021-00922-y.

Abstract

Despite early domestication around 3000 BC, the evolutionary history of the ancient allotetraploid species Brassica juncea (L.) Czern & Coss remains uncertain. Here, we report a chromosome-scale de novo assembly of a yellow-seeded B. juncea genome by integrating long-read and short-read sequencing, optical mapping and Hi-C technologies. Nuclear and organelle phylogenies of 480 accessions worldwide supported that B. juncea is most likely a single origin in West Asia, 8,000-14,000 years ago, via natural interspecific hybridization. Subsequently, new crop types evolved through spontaneous gene mutations and introgressions along three independent routes of eastward expansion. Selective sweeps, genome-wide trait associations and tissue-specific RNA-sequencing analysis shed light on the domestication history of flowering time and seed weight, and on human selection for morphological diversification in this versatile species. Our data provide a comprehensive insight into the origin and domestication and a foundation for genomics-based breeding of B. juncea.

Assembly statistics

Genome size 933.5 Mb
Total ungapped length 889.1 Mb
Number of chromosomes 18
Number of scaffolds 1,544
Scaffold N50 59.3 Mb
Scaffold L50 7
Number of contigs 2,522
Contig N50 1.9 Mb
Contig L50 101
GC percent 37.5
Genome coverage 98.0x
Assembly level Chromosome
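
The page does not say how the N50 and L50 values were computed. A minimal sketch, assuming the standard definitions: N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total length, and L50 is the number of sequences in that set. The function name and the toy lengths are illustrative only and are not taken from the assembly.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of sequence lengths."""
    ordered = sorted(lengths, reverse=True)  # longest first
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            # 'length' is the N50, 'count' is the L50
            return length, count
    raise ValueError("lengths must be non-empty")


# Toy lengths (hypothetical): total is 300, so half is 150.
toy = [80, 70, 50, 40, 30, 20, 10]
print(n50_l50(toy))  # (70, 2)
```

Read this way, a scaffold L50 of 7 would mean that the seven longest scaffolds hold at least half of the assembled sequence.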

Assembly

The Brassica juncea ASM1870372v1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_018703725.1_ASM1870372v1_genomic.fna.gz
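
The download is a gzip-compressed FASTA file, so it can be read without decompressing it on disk. A minimal sketch using only the Python standard library; the function name fasta_stats is hypothetical, and the sketch assumes the file has already been saved locally.

```python
import gzip


def fasta_stats(path):
    """Map each record ID to (length, GC count) for a gzipped FASTA file."""
    stats = {}
    name = None
    with gzip.open(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                # Record ID is the first whitespace-delimited token of the header
                parts = line[1:].split()
                name = parts[0] if parts else ""
                stats[name] = [0, 0]
            elif name is not None:
                seq = line.upper()
                stats[name][0] += len(seq)
                stats[name][1] += seq.count("G") + seq.count("C")
    return {key: tuple(value) for key, value in stats.items()}
```

With the file in the working directory, fasta_stats("GCA_018703725.1_ASM1870372v1_genomic.fna.gz") would return one entry per sequence record, from which per-chromosome lengths and GC fractions can be derived.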

Gene Predictions

The Brassica juncea ASM1870372v1 genome gene prediction files are not available.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) -
Protein sequences (FASTA file) -

Functional Analysis

Functional annotation for the Brassica juncea ASM1870372v1 assembly is not available.

Downloads

Domain from InterProScan -

S genes

Summary

Query SRK1
Chromosome CM031932.1 (A07)
Size (bp) 30007521
Coordinates 24187269-24188586, 24191483-24191605, 24191707-24191888, 24193013-24193223, 24193314-24193551, 24193644-24193794, 24193844-24194182
BLASTp Hit sp|Q09092|SRK6_BRAOV
BLASTp %ID 67

Query SCR1
Chromosome CM031932.1 (A07)
Size (bp) 30007521
Coordinates 24176283-24176367, 24176004-24176194
BLASTp Hit BAB86356.1
BLASTp %ID 82

Query SRK2
Chromosome CM031941.1 (B07)
Size (bp) 59341207
Coordinates 51220064-51221345, 51219784-51219969, 51219640-51219737, 51219359-51219569, 51219030-51219267, 51218800-51218950, 51218420-51218719
BLASTp Hit sp|P0DH86|SRK_ARATH
BLASTp %ID 33

Query SCR2
Chromosome CM031941.1 (B07)
Size (bp) 59341207
Coordinates 51019941-51020010, 51020083-51020300
BLASTp Hit XP_006301215
BLASTp %ID 45
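
Each Coordinates value in the summary is a comma-separated list of start-end segments. A minimal sketch for parsing such a value and summing the segment lengths follows. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function names are hypothetical. The order in which segments are listed (ascending or descending along the chromosome) may reflect strand, but that is an inference, not something the page confirms.

```python
def parse_segments(coords):
    """Parse 'start-end,start-end,...' into a list of (start, end) integer pairs."""
    segments = []
    for part in coords.replace("\n", "").split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def total_length(segments):
    # Assumption: 1-based, inclusive coordinates
    return sum(end - start + 1 for start, end in segments)


def listing_order(segments):
    """Report whether segment starts are listed in ascending or descending order."""
    starts = [start for start, _ in segments]
    if starts == sorted(starts):
        return "ascending"
    if starts == sorted(starts, reverse=True):
        return "descending"
    return "mixed"


# SCR1 coordinates as listed in the summary
scr1 = parse_segments("24176283-24176367,24176004-24176194")
print(total_length(scr1), listing_order(scr1))  # 276 descending
```

Under the inclusive-coordinate assumption, the SCR1 segments total 276 bp, a multiple of three, which is consistent with the segments being coding exons.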

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences