Stipa capillata TSU_Scap_1 Assembly & Annotation

Overview

Analysis Name Stipa capillata TSU_Scap_1 Assembly & Annotation
Sequencing technology PacBio Sequel
Assembly method Flye v. 2.4
Release Date 2021-07-13
Reference Publication(s)

Baiakhmetov E, Guyomar C, Shelest E, Nobis M, Gudkova PD. The first draft genome of feather grasses using SMRT sequencing and its implications in molecular studies of Stipa. Sci Rep. 2021 Jul 28;11(1):15345. doi: 10.1038/s41598-021-94068-w.

Abstract

The Eurasian plant Stipa capillata is the most widespread species within feather grasses. Many taxa of the genus are dominants in steppe plant communities and can be used for their classification and in studies related to climate change. Moreover, some species are of economic importance mainly as fodder plants and can be used for soil remediation processes. Although large-scale molecular data has begun to appear, there is still no complete or draft genome for any Stipa species. Thus, here we present a single-molecule long-read sequencing dataset generated using the Pacific Biosciences Sequel System. A draft genome of about 1004 Mb was obtained with a contig N50 length of 351 kb. Importantly, here we report 81,224 annotated protein-coding genes, present 77,614 perfect and 58 unique imperfect SSRs, reveal the putative allopolyploid nature of S. capillata, investigate the evolutionary history of the genus, demonstrate structural heteroplasmy of the chloroplast genome and announce for the first time the mitochondrial genome in Stipa. The assembled nuclear, mitochondrial and chloroplast genomes provide a significant source of genetic data for further works on phylogeny, hybridisation and population studies within Stipa and the grass family Poaceae.

Assembly statistics

Genome size1 Gb
Total ungapped length1 Gb
Number of contigs5,931
Contig N50350.5 kb
Contig L50837
GC percent46
Genome coverage23.0x
Assembly levelContig

Assembly

The Stipa capillata TSU_Scap_1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_019208055.1_TSU_Scap_1_genomic.fna.gz

Gene Predictions

The Stipa capillata TSU_Scap_1 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) 41598_2021_94068_MOESM1_ESM.gff.gz
CDS sequences (FASTA file) Sc_cds.fa.gz
Protein sequences (FASTA file) Sc_pep.fa.gz

Functional Analysis

Functional annotation for the Stipa capillata TSU_Scap_1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Stipa_capillata.Pfam.tsv.gz

S genes

Summary

QueryContigSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
HPS10-Scontig_499875886198912-199021,199156-199306Bsylvaticum_HPS10-S64-
DUF247I-ZΨcontig_467833111665826-66050AlongiglumisDUF247I-Z67DUF247
HPS10-Z1contig_3263266841213595-213757,214022-214125LpsZ_contig5509754-
HPS10-Z2contig_3173533808266680-266774,266961-267087contig_317355-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences