Solanum rostratum IEDA_Sros_1.0 Assembly & Annotation

Overview

Analysis Name Solanum rostratum IEDA_Sros_1.0 Assembly & Annotation
Sequencing technology PacBio Sequel
Assembly method Hifiasm v. 0.16.0
Release Date 2023-05-08
Reference Publication(s)

Zhang Y, Guo W, Yuan Z, Song Z, Wang Z, Gao J, Fu W, Zhang G. Chromosome-level genome assembly and annotation of the prickly nightshade Solanum rostratum Dunal. Sci Data. 2023 Jun 1;10(1):341. doi: 10.1038/s41597-023-02247-3.

Abstract

The prickly nightshade Solanum rostratum, an annual malignant weed, is native to North America and has globally invaded 34 countries, causing serious threats to ecosystems, agriculture, animal husbandry, and human health. In this study, we constructed a chromosome-level genome assembly and annotation of S. rostratum. The contig-level genome was initially assembled in 898.42 Mb with a contig N50 of 62.00 Mb from PacBio high-fidelity reads. With Hi-C sequencing data scaffolding, 96.80% of the initially assembled sequences were anchored and orientated onto 12 pseudo-chromosomes, generating a genome of 869.69 Mb with a contig N50 of 72.15 Mb. We identified 649.92 Mb (72.26%) of repetitive sequences and 3,588 non-coding RNAs in the genome. A total of 29,694 protein-coding genes were predicted, with 28,154 (94.81%) functionally annotated genes. We found 99.5% and 91.3% complete embryophyta_odb10 genes in the pseudo-chromosomes genome and predicted gene datasets by BUSCO assessment. The present genomic resource provides essential information for subsequent research on the mechanisms of environmental adaptation of S. rostratum and host shift in Colorado potato beetles.

Assembly statistics

Genome size 898.4 Mb
Number of chromosomes 12
Number of scaffolds 221
Scaffold N50 72.1 Mb
Scaffold L50 6
Number of contigs 311
Contig N50 46.7 Mb
Contig L50 7
GC percent 37
Assembly level Chromosome

Assembly

The Solanum rostratum IEDA_Sros_1.0 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_029948455.1_IEDA_Sros_1.0_genomic.fna.gz

Gene Predictions

The Solanum rostratum IEDA_Sros_1.0 genome gene prediction files are not available.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) -
Protein sequences (FASTA file) -

Functional Analysis

Functional annotation for the Solanum rostratum IEDA_Sros_1.0 is not available.

Downloads

Domain from InterProScan -

S genes

Summary

QueryChrSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF15CM057045.192,283,8342980826-2979561Solanum tuberosum DM8.1, SLF1584.6 F-box domain
SLF16ΨCM057045.192,283,8343693830-3692662Solanum tuberosum DM8.1, SLF1686.4 -
SLF18CM057045.192,283,83458068855-58069967Solanum tuberosum DM8.1, SLF18-287.3 F-box domain
SLF19ΨCM057045.192,283,83458976297-58975336Solanum tuberosum DM8.1, SLF1985.7 -

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences