Analysis Name | Solanum rostratum IEDA_Sros_1.0 Assembly & Annotation |
Sequencing technology | PacBio Sequel |
Assembly method | Hifiasm v. 0.16.0 |
Release Date | 2023-05-08 |
Zhang Y, Guo W, Yuan Z, Song Z, Wang Z, Gao J, Fu W, Zhang G. Chromosome-level genome assembly and annotation of the prickly nightshade Solanum rostratum Dunal. Sci Data. 2023 Jun 1;10(1):341. doi: 10.1038/s41597-023-02247-3.
AbstractThe prickly nightshade Solanum rostratum, an annual malignant weed, is native to North America and has globally invaded 34 countries, causing serious threats to ecosystems, agriculture, animal husbandry, and human health. In this study, we constructed a chromosome-level genome assembly and annotation of S. rostratum. The contig-level genome was initially assembled in 898.42 Mb with a contig N50 of 62.00 Mb from PacBio high-fidelity reads. With Hi-C sequencing data scaffolding, 96.80% of the initially assembled sequences were anchored and orientated onto 12 pseudo-chromosomes, generating a genome of 869.69 Mb with a contig N50 of 72.15 Mb. We identified 649.92 Mb (72.26%) of repetitive sequences and 3,588 non-coding RNAs in the genome. A total of 29,694 protein-coding genes were predicted, with 28,154 (94.81%) functionally annotated genes. We found 99.5% and 91.3% complete embryophyta_odb10 genes in the pseudo-chromosomes genome and predicted gene datasets by BUSCO assessment. The present genomic resource provides essential information for subsequent research on the mechanisms of environmental adaptation of S. rostratum and host shift in Colorado potato beetles.
Assembly statistics
Genome size | 898.4 Mb |
Number of chromosomes | 12 |
Number of scaffolds | 221 |
Scaffold N50 | 72.1 Mb |
Scaffold L50 | 6 |
Number of contigs | 311 |
Contig N50 | 46.7 Mb |
Contig L50 | 7 |
GC percent | 37 |
Assembly level | Chromosome |
The Solanum rostratum IEDA_Sros_1.0 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | GCA_029948455.1_IEDA_Sros_1.0_genomic.fna.gz |
The Solanum rostratum IEDA_Sros_1.0 genome gene prediction files are not available.
Downloads
Genes (GFF3 file) | - |
CDS sequences (FASTA file) | - |
Protein sequences (FASTA file) | - |
Functional annotation for the Solanum rostratum IEDA_Sros_1.0 is not available.
Downloads
Domain from InterProScan | - |
Summary
Query | Chr | Size(bp) | Coordinates | BLASTn Hit | BLASTn %ID | Domain |
SLF15 | CM057045.1 | 92,283,834 | 2980826-2979561 | Solanum tuberosum DM8.1, SLF15 | 84.6 | F-box domain |
SLF16Ψ | CM057045.1 | 92,283,834 | 3693830-3692662 | Solanum tuberosum DM8.1, SLF16 | 86.4 | - |
SLF18 | CM057045.1 | 92,283,834 | 58068855-58069967 | Solanum tuberosum DM8.1, SLF18-2 | 87.3 | F-box domain |
SLF19Ψ | CM057045.1 | 92,283,834 | 58976297-58975336 | Solanum tuberosum DM8.1, SLF19 | 85.7 | - |
Nucleotide
Protein