Analysis Name | Brassica juncea ASM1870372v1 Assembly & Annotation |
Sequencing technology | PacBio Sequel; Illumina HiSeq |
Assembly method | FALCON v. 2018 |
Release Date | 2021-06-04 |
Kang L, Qian L, Zheng M, Chen L, Chen H, Yang L, You L, Yang B, Yan M, Gu Y, Wang T, Schiessl SV, An H, Blischak P, Liu X, Lu H, Zhang D, Rao Y, Jia D, Zhou D, Xiao H, Wang Y, Xiong X, Mason AS, Chris Pires J, Snowdon RJ, Hua W, Liu Z. Genomic insights into the origin, domestication and diversification of Brassica juncea. Nat Genet. 2021 Sep;53(9):1392-1402. doi: 10.1038/s41588-021-00922-y.
AbstractDespite early domestication around 3000 BC, the evolutionary history of the ancient allotetraploid species Brassica juncea (L.) Czern & Coss remains uncertain. Here, we report a chromosome-scale de novo assembly of a yellow-seeded B. juncea genome by integrating long-read and short-read sequencing, optical mapping and Hi-C technologies. Nuclear and organelle phylogenies of 480 accessions worldwide supported that B. juncea is most likely a single origin in West Asia, 8,000-14,000 years ago, via natural interspecific hybridization. Subsequently, new crop types evolved through spontaneous gene mutations and introgressions along three independent routes of eastward expansion. Selective sweeps, genome-wide trait associations and tissue-specific RNA-sequencing analysis shed light on the domestication history of flowering time and seed weight, and on human selection for morphological diversification in this versatile species. Our data provide a comprehensive insight into the origin and domestication and a foundation for genomics-based breeding of B. juncea.
Assembly statistics
Genome size | 933.5 Mb |
Total ungapped length | 889.1 Mb |
Number of chromosomes | 18 |
Number of scaffolds | 1,544 |
Scaffold N50 | 59.3 Mb |
Scaffold L50 | 7 |
Number of contigs | 2,522 |
Contig N50 | 1.9 Mb |
Contig L50 | 101 |
GC percent | 37.5 |
Genome coverage | 98.0x |
Assembly level | Chromosome |
The Brassica juncea ASM1870372v1 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | GCA_018703725.1_ASM1870372v1_genomic.fna.gz |
The Brassica juncea ASM1870372v1 genome gene prediction files are not available.
Downloads
Genes (GFF3 file) | - |
CDS sequences (FASTA file) | - |
Protein sequences (FASTA file) | - |
Functional annotation for the Brassica juncea ASM1870372v1 is not available.
Downloads
Domain from InterProScan | - |
Summary
Query | Chromosome | Size(bp) | Coordinates | BLASTp Hit | BLASTp %ID |
SRK1 | CM031932.1 (A07) | 30007521 | 24187269-24188586,24191483-24191605,24191707-24191888,24193013-24193223,24193314-24193551,24193644-24193794,24193844-24194182 | spQ09092SRK6_BRAOV | 67 |
SCR1 | CM031932.1 (A07) | 30007521 | 24176283-24176367,24176004-24176194 | BAB86356.1 | 82 |
SRK2 | CM031941.1 (B07) | 59341207 | 51220064-51221345,51219784-51219969,51219640-51219737,51219359-51219569,51219030-51219267,51218800-51218950,51218420-51218719 | spP0DH86SRK_ARATH | 33 |
SCR2 | CM031941.1 (B07) | 59341207 | 51019941-51020010,51020083-51020300 | XP_006301215 | 45 |
Nucleotide
Protein