Brassica cretica B_cretica_A_v2 Assembly & Annotation

Overview

Analysis Name Brassica cretica B_cretica_A_v2 Assembly & Annotation
Sequencing technology Illumina HiSeq
Assembly method SOAPdenovo v. 2-bin-LINUX-generic-r240
Release Date 2020-03-24
Reference Publication(s)

Kioukis A, Michalopoulou VA, Briers L, Pirintsos S, Studholme DJ, Pavlidis P, Sarris PF. Intraspecific diversification of the crop wild relative Brassica cretica Lam. using demographic model selection. BMC Genomics. 2020 Jan 14;21(1):48. doi: 10.1186/s12864-019-6439-x.

Summary

Background: Crop wild relatives (CWRs) contain genetic diversity, representing an invaluable resource for crop improvement. Many of their traits have the potential to help crops to adapt to changing conditions that they experience due to climate change. An impressive global effort for the conservation of various CWR will facilitate their use in crop breeding for food security. The genus Brassica is listed in Annex I of the International Treaty on Plant Genetic Resources for Food and Agriculture. Brassica oleracea (or wild cabbage), a species native to southern and western Europe, has become established as an important human food crop plant because of its large reserves stored over the winter in its leaves. Brassica cretica Lam. (Bc) is a CWR in the brassica group and B. cretica subsp. nivea (Bcn) has been suggested as a separate subspecies. The species Bc has been proposed as a potential gene donor to brassica crops, including broccoli, cabbage, cauliflower, oilseed rape, etc.

Results: We sequenced genomes of four Bc individuals, including two Bcn and two Bc. Demographic analysis based on our whole-genome sequence data suggests that populations of Bc are not isolated. Classification of the Bc into distinct subspecies is not supported by the data. Using only the non-coding part of the data (thus, the parts of the genome that has evolved nearly neutrally), we find the gene flow between different Bc population is recent and its genomic diversity is high.

Conclusions: Despite predictions on the disruptive effect of gene flow in adaptation, when selection is not strong enough to prevent the loss of locally adapted alleles, studies show that gene flow can promote adaptation, that local adaptations can be maintained despite high gene flow, and that genetic architecture plays a fundamental role in the origin and maintenance of local adaptation with gene flow. Thus, in the genomic era it is important to link the selected demographic models with the underlying processes of genomic variation because, if this variation is largely selectively neutral, we cannot assume that a diverse population of crop wild relatives will necessarily exhibit the wide-ranging adaptive diversity required for further crop improvement.

Assembly statistics

Genome size332.3 Mb
Total ungapped length330.4 Mb
Number of scaffolds2,934
Scaffold N5036.1 Mb
Scaffold L505
Number of contigs25,197
Contig N5025.5 kb
Contig L503,731
GC percent36
Genome coverage55.0x
Assembly levelScaffold

Assembly

The Brassica cretica B_cretica_A_v2 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_003260655.2_B_cretica_A_v2_genomic.fna.gz

Gene Predictions

The Brassica cretica B_cretica_A_v2 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) genomic.gff.gz
CDS sequences (FASTA file) cds_from_genomic.fna.gz
Protein sequences (FASTA file) protein.faa.gz

Functional Analysis

Functional annotation for the Brassica cretica B_cretica_A_v2 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Brassica_cretica.Pfam.tsv.gz

S genes

Summary

QueryScaffoldSize(bp)CoordinatesBLASTp HitBLASTp %ID
SRKQGKV02001556.12710351621707467-21708778,21709307-21709435,
21709590-21709780,21709854-21710064,
21710153-21710390,21710509-21710659,
21710769-21711110
sp|Q09092|SRK6_BRAOV80
SCRQGKV02001556.12710351613414064-13414139,13414262-13414485XP_006414465.176

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences