Analysis Name | Hordeum marinum H559 Assembly & Annotation |
Sequencing technology | PacBio, Illumina HiSeq 4000, Hi-C, 10X Genomics |
Assembly method | Falcon+Quiver+Plion+Bowtie+Lachesis 1.0 |
Release Date | 2022-05-10 |
Kuang L, Shen Q, Chen L, Ye L, Yan T, Chen ZH, Waugh R, Li Q, Huang L, Cai S, Fu L, Xing P, Wang K, Shao J, Wu F, Jiang L, Wu D, Zhang G. The genome and gene editing system of sea barleygrass provide a novel platform for cereal domestication and stress tolerance studies. Plant Commun. 2022 Sep 12;3(5):100333. doi: 10.1016/j.xplc.2022.100333.
AbstractThe tribe Triticeae provides important staple cereal crops and contains elite wild species with wide genetic diversity and high tolerance to abiotic stresses. Sea barleygrass (Hordeum marinum Huds.), a wild Triticeae species, thrives in saline marshlands and is well known for its high tolerance to salinity and waterlogging. Here, a 3.82-Gb high-quality reference genome of sea barleygrass is assembled de novo, with 3.69 Gb (96.8%) of its sequences anchored onto seven chromosomes. In total, 41 045 high-confidence (HC) genes are annotated by homology, de novo prediction, and transcriptome analysis. Phylogenetics, non-synonymous/synonymous mutation ratios (Ka/Ks), and transcriptomic and functional analyses provide genetic evidence for the divergence in morphology and salt tolerance among sea barleygrass, barley, and wheat. The large variation in post-domestication genes (e.g. IPA1 and MOC1) may cause interspecies differences in plant morphology. The extremely high salt tolerance of sea barleygrass is mainly attributed to low Na+ uptake and root-to-shoot translocation, which are mainly controlled by SOS1, HKT, and NHX transporters. Agrobacterium-mediated transformation and CRISPR/Cas9-mediated gene editing systems were developed for sea barleygrass to promote its utilization for exploration and functional studies of hub genes and for the genetic improvement of cereal crops.
Assembly statistics
Genome size (bp) | 3,815,965,547 |
GC content | 44.53% |
Genome sequence No. | 1,197 |
Maximum genome sequence length (bp) | 588,092,371 |
Minimum genome sequence length (bp) | 242 |
Average genome sequence length (bp) | 3,187,941 |
Genome sequence N50 (bp) | 524,471,465 |
Genome sequence N90 (bp) | 450,127,416 |
Assembly level | Chromosome |
The Hordeum marinum H559 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | GWHBJBH00000000.genome.fasta.gz |
The Hordeum marinum H559 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | GWHBJBH00000000.gff.gz |
CDS sequences (FASTA file) | GWHBJBH00000000.RNA.fasta.gz |
Protein sequences (FASTA file) | GWHBJBH00000000.Protein.faa.gz |
Functional annotation for the Hordeum marinum H559 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).
Downloads
Domain from InterProScan | Hordeum_marinum.Pfam.tsv.gz |
Summary
Query | Chromosome | Size(bp) | Coordinates | tBLASTn Hit | tBLASTn %ID | Domain |
DUF247II-S | GWHBJBH00000001 | 488453804 | 71140986-71142605 | LpSDUF247-II_chromosome1 | 74 | DUF247 |
HPS10-Z | GWHBJBH00000002 | 588092371 | 513591575-513591713,513591845-513591987 | Amyosuroides | 37 | - |
Nucleotide
Protein