Hordeum marinum H559 Assembly & Annotation

Overview

Analysis Name Hordeum marinum H559 Assembly & Annotation
Sequencing technology PacBio, Illumina HiSeq 4000, Hi-C, 10X Genomics
Assembly method Falcon+Quiver+Plion+Bowtie+Lachesis 1.0
Release Date 2022-05-10
Reference Publication(s)

Kuang L, Shen Q, Chen L, Ye L, Yan T, Chen ZH, Waugh R, Li Q, Huang L, Cai S, Fu L, Xing P, Wang K, Shao J, Wu F, Jiang L, Wu D, Zhang G. The genome and gene editing system of sea barleygrass provide a novel platform for cereal domestication and stress tolerance studies. Plant Commun. 2022 Sep 12;3(5):100333. doi: 10.1016/j.xplc.2022.100333.

Abstract

The tribe Triticeae provides important staple cereal crops and contains elite wild species with wide genetic diversity and high tolerance to abiotic stresses. Sea barleygrass (Hordeum marinum Huds.), a wild Triticeae species, thrives in saline marshlands and is well known for its high tolerance to salinity and waterlogging. Here, a 3.82-Gb high-quality reference genome of sea barleygrass is assembled de novo, with 3.69 Gb (96.8%) of its sequences anchored onto seven chromosomes. In total, 41 045 high-confidence (HC) genes are annotated by homology, de novo prediction, and transcriptome analysis. Phylogenetics, non-synonymous/synonymous mutation ratios (Ka/Ks), and transcriptomic and functional analyses provide genetic evidence for the divergence in morphology and salt tolerance among sea barleygrass, barley, and wheat. The large variation in post-domestication genes (e.g. IPA1 and MOC1) may cause interspecies differences in plant morphology. The extremely high salt tolerance of sea barleygrass is mainly attributed to low Na+ uptake and root-to-shoot translocation, which are mainly controlled by SOS1, HKT, and NHX transporters. Agrobacterium-mediated transformation and CRISPR/Cas9-mediated gene editing systems were developed for sea barleygrass to promote its utilization for exploration and functional studies of hub genes and for the genetic improvement of cereal crops.

Assembly statistics

Genome size (bp)3,815,965,547
GC content44.53%
Genome sequence No.1,197
Maximum genome sequence length (bp)588,092,371
Minimum genome sequence length (bp)242
Average genome sequence length (bp)3,187,941
Genome sequence N50 (bp)524,471,465
Genome sequence N90 (bp)450,127,416
Assembly levelChromosome

Assembly

The Hordeum marinum H559 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHBJBH00000000.genome.fasta.gz

Gene Predictions

The Hordeum marinum H559 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHBJBH00000000.gff.gz
CDS sequences (FASTA file) GWHBJBH00000000.RNA.fasta.gz
Protein sequences (FASTA file) GWHBJBH00000000.Protein.faa.gz

Functional Analysis

Functional annotation for the Hordeum marinum H559 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Hordeum_marinum.Pfam.tsv.gz

S genes

Summary

QueryChromosomeSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
DUF247II-SGWHBJBH0000000148845380471140986-71142605LpSDUF247-II_chromosome174DUF247
HPS10-ZGWHBJBH00000002588092371513591575-513591713,
513591845-513591987
Amyosuroides37-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences