Coix lacryma-jobi BJ Coix v1 Assembly & Annotation

Overview

Analysis Name Coix lacryma-jobi BJ Coix v1 Assembly & Annotation
Sequencing technology PacBio
Assembly method HERA 2018
Release Date 2019-06-04
Reference Publication(s)

Hua Y, Liu Q, Zhai Y, Zhao L, Zhu J, Zhang X, Jia Q, Liang Z, Wang D. Genome-wide analysis of the HSP20 gene family and its response to heat and drought stress in Coix (Coix lacryma-jobi L.). BMC Genomics. 2023 Aug 24;24(1):478. doi: 10.1186/s12864-023-09580-2.

Abstract

Background: Heat shock protein 20 (HSP20) is a member of the heat stress-related protein family, which plays critical roles in plant growth, development, and response to abiotic stresses. Although many HSP20 genes have been associated with heat stress in numerous types of plants, little is known about the details of the HSP20 gene family in Coix. To investigate the mechanisms of the ClHSP20 response to heat and drought stresses, the ClHSP20 gene family in Coix was identified and characterized based on genome-wide analysis.

Results: A total of 32 putative ClHSP20 genes were identified and characterized in Coix. Phylogenetic analysis indicated that ClHSP20s were grouped into 11 subfamilies. The duplicated event analysis demonstrated that tandem duplication and segment duplication events played crucial roles in promoting the expansion of the ClHSP20 gene family. Synteny analysis showed that Coix shared the highest homology in 36 HSP20 gene pairs with wheat, followed by 22, 19, 15, and 15 homologous gene pairs with maize, sorghum, barley, and rice, respectively. The expression profile analysis showed that almost all ClHSP20 genes had different expression levels in at least one tissue. Furthermore, 22 of the 32 ClHSP20 genes responded to heat stress, with 11 ClHSP20 genes being significantly upregulated and 11 ClHSP20 genes being significantly downregulated. Furthermore, 13 of the 32 ClHSP20 genes responded to drought stress, with 6 ClHSP20 genes being significantly upregulated and 5 ClHSP20 genes being significantly downregulated.

Conclusions: Thirty-two ClHSP20 genes were identified and characterized in the genome of Coix. Tandem and segmental duplication were identified as having caused the expansion of the ClHSP20 gene family. The expression patterns of the ClHSP20 genes suggested that they play a critical role in growth, development, and response to heat and drought stress. The current study provides a theoretical basis for further research on ClHSP20s and will facilitate the functional characterization of ClHSP20 genes.

Assembly statistics

Genome size (bp)1,731,458,735
GC content46.68%
Chromosomes sequence No.10
Genome sequence No.1,233
Maximum genome sequence length (bp)190,792,110
Minimum genome sequence length (bp)649
Average genome sequence length (bp)1,404,265
Genome sequence N50 (bp)161,246,920
Genome sequence N90 (bp)135,077,776
Assembly level Chromosome

Assembly

The Coix lacryma-jobi BJ Coix v1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHAAYR00000000.genome.fasta.gz

Gene Predictions

The Coix lacryma-jobi BJ Coix v1 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHAAYR00000000.gff.gz
CDS sequences (FASTA file) GWHAAYR00000000.RNA.fasta.gz
Protein sequences (FASTA file) GWHAAYR00000000.Protein.faa.gz

Functional Analysis

Functional annotation for the Coix lacryma-jobi BJ Coix v1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Coix_lacryma-jobi.Pfam.tsv.gz

S genes

Summary

QueryChromosomeSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
DUF247I-SΨGWHAAYR00000003179537263106822829-106823872Ecrus-galli59DUF247
DUF247II-SΨGWHAAYR0000000317953726399645733-99646311Shybrid93DUF247
HPS10-SGWHAAYR00000003179537263105666312-105666427,
105666525-105666687
Pvirgatum61-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences