Analysis Name | Stipa capillata TSU_Scap_1 Assembly & Annotation |
Sequencing technology | PacBio Sequel |
Assembly method | Flye v. 2.4 |
Release Date | 2021-07-13 |
Baiakhmetov E, Guyomar C, Shelest E, Nobis M, Gudkova PD. The first draft genome of feather grasses using SMRT sequencing and its implications in molecular studies of Stipa. Sci Rep. 2021 Jul 28;11(1):15345. doi: 10.1038/s41598-021-94068-w.
Abstract
The Eurasian plant Stipa capillata is the most widespread species within feather grasses. Many taxa of the genus are dominants in steppe plant communities and can be used for their classification and in studies related to climate change. Moreover, some species are of economic importance mainly as fodder plants and can be used for soil remediation processes. Although large-scale molecular data has begun to appear, there is still no complete or draft genome for any Stipa species. Thus, here we present a single-molecule long-read sequencing dataset generated using the Pacific Biosciences Sequel System. A draft genome of about 1004 Mb was obtained with a contig N50 length of 351 kb. Importantly, here we report 81,224 annotated protein-coding genes, present 77,614 perfect and 58 unique imperfect SSRs, reveal the putative allopolyploid nature of S. capillata, investigate the evolutionary history of the genus, demonstrate structural heteroplasmy of the chloroplast genome and announce for the first time the mitochondrial genome in Stipa. The assembled nuclear, mitochondrial and chloroplast genomes provide a significant source of genetic data for further works on phylogeny, hybridisation and population studies within Stipa and the grass family Poaceae.
Assembly statistics
Genome size | 1 Gb |
Total ungapped length | 1 Gb |
Number of contigs | 5,931 |
Contig N50 | 350.5 kb |
Contig L50 | 837 |
GC percent | 46 |
Genome coverage | 23.0x |
Assembly level | Contig |
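The Contig N50 and Contig L50 values above can be illustrated with a short sketch. It assumes the usual definitions: N50 is the length of the contig at which the running total of contig lengths, sorted from longest to shortest, first reaches half of the assembly length, and L50 is the number of contigs needed to get there. The function name `n50_l50` and the toy lengths are hypothetical, and the page does not state which tool produced its reported figures.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of contig lengths in bp."""
    if not lengths:
        raise ValueError("need at least one contig length")
    # Walk the contigs from longest to shortest.
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # The first contig that brings the running total to at least
        # half of the assembly length defines N50 (its length) and
        # L50 (how many contigs were needed).
        if running >= half_total:
            return length, count


# Toy lengths (not taken from the TSU_Scap_1 assembly).
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
toy_n50, toy_l50 = n50_l50(toy_lengths)
```

Applied to the lengths of all 5,931 contigs in the assembly FASTA file, a calculation of this kind should give values close to those in the table, provided the same definitions were used.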
The Stipa capillata TSU_Scap_1 assembly file is available in FASTA format.
Downloads
Contigs (FASTA file) | GCA_019208055.1_TSU_Scap_1_genomic.fna.gz |
The Stipa capillata TSU_Scap_1 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | 41598_2021_94068_MOESM1_ESM.gff.gz |
CDS sequences (FASTA file) | Sc_cds.fa.gz |
Protein sequences (FASTA file) | Sc_pep.fa.gz |
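A quick sanity check on a GFF3 download is to tally the feature types in its third column. The sketch below is a minimal example, assuming a standard nine-column, tab-separated GFF3 layout. The function name `count_feature_types` and the sample records are hypothetical, and the feature type names actually used in the Stipa capillata file (for example whether genes are labeled `gene`) have not been verified here.

```python
from collections import Counter


def count_feature_types(lines):
    """Count GFF3 feature types (column 3) in an iterable of text lines."""
    counts = Counter()
    for line in lines:
        line = line.rstrip("\n")
        # Skip blank lines, directives and comments.
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        # A GFF3 feature line has exactly nine tab-separated columns;
        # anything else (e.g. embedded FASTA text) is ignored.
        if len(fields) != 9:
            continue
        counts[fields[2]] += 1
    return counts


# Hypothetical records for illustration only.
sample = [
    "##gff-version 3",
    "contig_1\texample\tgene\t100\t900\t.\t+\t.\tID=gene1",
    "contig_1\texample\tmRNA\t100\t900\t.\t+\t.\tID=mrna1;Parent=gene1",
    "contig_1\texample\tCDS\t100\t400\t.\t+\t0\tID=cds1;Parent=mrna1",
    "contig_1\texample\tCDS\t600\t900\t.\t+\t2\tID=cds2;Parent=mrna1",
]
feature_counts = count_feature_types(sample)
```

For the compressed download, the same function can be fed the lines of a file opened with `gzip.open(path, "rt")` from the Python standard library.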
Functional annotation for the Stipa capillata TSU_Scap_1 assembly is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains (Pfam).
Downloads
Domain from InterProScan | Stipa_capillata.Pfam.tsv.gz |
Summary
Query | Contig | Contig size (bp) | Coordinates | tBLASTn hit | tBLASTn %ID | Domain |
HPS10-S | contig_499 | 875886 | 198912-199021,199156-199306 | Bsylvaticum_HPS10-S | 64 | - |
DUF247I-ZΨ | contig_4678 | 331116 | 65826-66050 | AlongiglumisDUF247I-Z | 67 | DUF247 |
HPS10-Z1 | contig_3263 | 266841 | 213595-213757,214022-214125 | LpsZ_contig55097 | 54 | - |
HPS10-Z2 | contig_3173 | 533808 | 266680-266774,266961-267087 | contig_3173 | 55 | - |
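The Coordinates column in the summary table lists one or more `start-end` segments separated by commas. The sketch below shows one way to parse such a string and total the covered length. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function names are hypothetical.

```python
def parse_segments(coordinates):
    """Parse 'start-end,start-end' into a list of (start, end) tuples."""
    segments = []
    for part in coordinates.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"segment start exceeds end: {part}")
        segments.append((start, end))
    return segments


def total_span(segments):
    """Sum segment lengths, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


def fits_on_contig(segments, contig_size):
    """Check that every segment lies within 1..contig_size."""
    return all(1 <= start and end <= contig_size for start, end in segments)


# Values copied from the HPS10-S row of the table.
hps10_s = parse_segments("198912-199021,199156-199306")
hps10_s_span = total_span(hps10_s)
hps10_s_ok = fits_on_contig(hps10_s, 875886)
```

Under the inclusive-coordinate assumption, the two HPS10-S segments cover 110 bp and 151 bp, for 261 bp in total, and both fall within the 875,886 bp of contig_499.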