Bothriochloa decipiens ASM2333362v1 Assembly & Annotation

Overview

Analysis Name Bothriochloa decipiens ASM2333362v1 Assembly & Annotation
Sequencing technology Illumina HiSeq
Assembly method HiRise v. Jan-2020
Release Date 2022-05-12
Reference Publication(s)

De Silva NP, Lee C, Battlay P, Fournier-Level A, Moore JL, Hodgins KA. Genome assembly of an Australian native grass species reveals a recent whole-genome duplication and biased gene retention of genes involved in stress response. Gigascience. 2022 Dec 28;12:giad034. doi: 10.1093/gigascience/giad034.

Abstract

Background: The adaptive significance of polyploidy has been extensively debated, and chromosome-level genome assemblies of polyploids can provide insight into this. The Australian grass Bothriochloa decipiens belongs to the BCD clade, a group with a complex history of hybridization and polyploidy. This is the first genome assembly and annotation of a species that belongs to this fascinating yet complex group.

Findings: Using Illumina short reads, 10X Genomics linked reads, and Hi-C sequencing data, we assembled a highly contiguous genome of B. decipiens, with a total length of 1,218.22 Mb and scaffold N50 of 42.637 Mb. Comparative analysis revealed that the species experienced a relatively recent whole-genome duplication. We clustered the 20 major scaffolds, representing the 20 chromosomes, into the 2 subgenomes of the parental species using unique repeat signatures. We found evidence of biased fractionation and differences in the activity of transposable elements between the subgenomes prior to hybridization. Duplicates were enriched for genes involved in transcription and response to external stimuli, supporting a biased retention of duplicated genes following whole-genome duplication.

Conclusions: Our results support the hypotheses of a biased retention of duplicated genes following polyploidy and point to differences in repeat activity associated with subgenome dominance. B. decipiens is a widespread species with the ability to establish across many soil types, making it a prime candidate for climate change-resilient ecological restoration of Australian grasslands. This reference genome is a valuable resource for future population genomic research on Australian grasses.

Assembly statistics

Genome size 1.2 Gb
Number of chromosomes 20
Number of scaffolds 14,671
Scaffold N50 54 Mb
Scaffold L50 10
Number of contigs 44,114
Contig N50 80.5 kb
Contig L50 4,103
Assembly level Chromosome
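
For readers unfamiliar with the N50 and L50 figures above, the following is a minimal sketch of how such values are conventionally derived from a list of sequence lengths. The function name `n50_l50` and the toy lengths are illustrative assumptions; this is not the pipeline used to produce the statistics on this page.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a collection of sequence lengths.

    N50 is the length of the shortest sequence in the smallest set of
    longest sequences that together cover at least half of the total
    assembly length; L50 is the number of sequences in that set.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            return length, count
    raise ValueError("lengths must be non-empty")


# Toy example (hypothetical lengths, not taken from this assembly).
toy_lengths = [50, 40, 30, 20, 10]
n50, l50 = n50_l50(toy_lengths)
print(f"N50 = {n50}, L50 = {l50}")
```

Applied to scaffold lengths this gives the scaffold N50 and L50; applied to contig lengths it gives the contig equivalents.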

Assembly

The Bothriochloa decipiens ASM2333362v1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) Bdec_final_genome_assembly.fasta.gz
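
As a rough illustration of how the gzip-compressed FASTA download might be inspected, the sketch below streams the file and records the length of each sequence using only the Python standard library. The function name `fasta_lengths` is an assumption made for this example, and the record names inside the file are not documented on this page.

```python
import gzip


def fasta_lengths(path):
    """Map each FASTA record ID to its sequence length.

    The record ID is taken as the first whitespace-delimited token
    after '>' on the header line.
    """
    lengths = {}
    name = None
    with gzip.open(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                tokens = line[1:].split()
                name = tokens[0] if tokens else ""
                lengths[name] = 0
            elif name is not None:
                lengths[name] += len(line)
    return lengths
```

Calling `fasta_lengths("Bdec_final_genome_assembly.fasta.gz")` on a local copy of the download would return a dictionary of scaffold names and lengths, which could then be compared against the assembly statistics listed above.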

Gene Predictions

The Bothriochloa decipiens ASM2333362v1 genome gene prediction files are not available.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) -
Protein sequences (FASTA file) -

Functional Analysis

Functional annotation for the Bothriochloa decipiens ASM2333362v1 assembly is not available.

Downloads

Domain from InterProScan -

S genes

Summary

Query        Chromosome   Size (bp)  Coordinates                         tBLASTn Hit  tBLASTn %ID  Domain
DUF247I-SΨ   Scaffold_17  44964298   22133905-22134828                   Shybrid      90           DUF247
DUF247II-S   Scaffold_17  44964298   22034878-22036488                   Pvirgatum    58           DUF247
DUF247I-ZΨ   Scaffold_13  48222736   4521470-4522201                     Sspontaneum  92           DUF247
DUF247II-ZΨ  Scaffold_13  48222736   4516062-4516346                     Sspontaneum  89           DUF247
HPS10-Z      Scaffold_13  48222736   4517629-4517741, 4517814-4517973    Pmiliaceum   47           -
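
The Coordinates column of the S genes table can be turned into sequence slices along the following lines. This is a sketch under stated assumptions: the page does not say whether coordinates are 1-based and inclusive (assumed here), it gives no strand information (so no reverse-complementing is attempted), and the helper names `parse_coordinates` and `extract_region` are hypothetical.

```python
def parse_coordinates(text):
    """Parse 'start-end' or 'start-end, start-end' into (start, end) tuples."""
    spans = []
    for part in text.split(","):
        start, end = part.strip().split("-")
        spans.append((int(start), int(end)))
    return spans


def extract_region(sequence, spans):
    """Concatenate the sub-sequences covered by each span.

    ASSUMPTION: spans are 1-based and inclusive, so the span
    (start, end) maps to the Python slice [start - 1:end].
    """
    return "".join(sequence[start - 1:end] for start, end in spans)


# The HPS10-Z entry lists two segments on the same scaffold.
hps10_spans = parse_coordinates("4517629-4517741, 4517814-4517973")
print(hps10_spans)

# Toy sequence standing in for a scaffold (not real data).
print(extract_region("ACGTACGTAC", [(2, 4), (7, 8)]))
```

With a scaffold sequence loaded from the assembly FASTA, `extract_region(scaffold_sequence, hps10_spans)` would return the two HPS10-Z segments joined in the order listed.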


© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences