Rubus argutus Hillquist Genome v1.0 Assembly & Annotation

Overview

Analysis Name Rubus argutus Hillquist Genome v1.0 Assembly & Annotation
Sequencing technology PacBio, Hi-C, 10x Genomics
Assembly method FALCON, Juicer
Release Date 2022-04-20
Reference Publication(s)

Brůna T, Aryal R, Dudchenko O, Sargent DJ, Mead D, Buti M, Cavallini A, Hytönen T, Andrés J, Pham M, Weisz D, Mascagni F, Usai G, Natali L, Bassil N, Fernandez GE, Lomsadze A, Armour M, Olukolu B, Poorten T, Britton C, Davik J, Ashrafi H, Aiden EL, Borodovsky M, Worthington M. A chromosome-length genome assembly and annotation of blackberry (Rubus argutus, cv. "Hillquist"). G3 (Bethesda). 2023 Feb 9;13(2):jkac289. doi: 10.1093/g3journal/jkac289.

Abstract

Blackberries (Rubus spp.) are the fourth most economically important berry crop worldwide. Genome assemblies and annotations have been developed for Rubus species in subgenus Idaeobatus, including black raspberry (R. occidentalis), red raspberry (R. idaeus), and R. chingii, but very few genomic resources exist for blackberries and their relatives in subgenus Rubus. Here we present a chromosome length assembly and annotation of the diploid blackberry germplasm accession “Hillquist” (R. argutus). “Hillquist” is the only known source of primocane-fruiting (annual-fruiting) in tetraploid fresh-market blackberry breeding programs and is represented in the pedigree of many important cultivars worldwide. The “Hillquist” assembly, generated using Pacific Biosciences long reads scaffolded with high-throughput chromosome conformation capture sequencing, consisted of 298Mb, of which 270Mb (90%) was placed on 7 chromosome-length scaffolds with an average length of 38.6Mb. Approximately 52.8% of the genome was composed of repetitive elements. The genome se quencewashighly collinear with a novel maternal haplotype-resolved linkage map of the tetraploid blackberry selection A-2551TN and ge nome assemblies of R. chingii and red raspberry. A total of 38,503 protein-coding genes were predicted, of which 72% were functionally annotated. Eighteen flowering gene homologs within a previously mapped locus aligning to an 11.2Mb region on chromosome Ra02 were identified as potential candidate genes for primocane-fruiting. The utility of the “Hillquist” genome has been demonstrated here by the development of the first genotyping-by-sequencing-based linkage map of tetraploid blackberry and the identification of possible can didate genes for primocane-fruiting. This chromosome-length assembly will facilitate future studies in Rubus biology, genetics, and geno mics and strengthen applied breeding programs.

Assembly

The Rubus argutus Hillquist Genome v1.0 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) genome_softmasked_release2.fasta.gz

Gene Predictions

The Rubus argutus Hillquist Genome v1.0 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Hillquist_release2.gff.gz
CDS sequences (FASTA file) -
Protein sequences (FASTA file) Hillquist_proteins_release2.fasta.gz

Functional Analysis

Functional annotation for the Rubus argutus Hillquist Genome v1.0 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Rubus_argutus_Hillquist_v1.0.Pfam.tsv.gz
© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences