Physalis grisea Phygri_v1.0 Assembly & Annotation

Overview

Analysis Name Physalis grisea Phygri_v1.0 Assembly & Annotation
Sequencing technology ONT, PacBio HiFi and Hi-C
Assembly method HiCan, Flye
Release Date 2023-01-02
Reference Publication(s)

He J, Alonge M, Ramakrishnan S, Benoit M, Soyk S, Reem NT, Hendelman A, Van Eck J, Schatz MC, Lippman ZB. Establishing Physalis as a Solanaceae model system enables genetic reevaluation of the inflated calyx syndrome. Plant Cell. 2023 Jan 2;35(1):351-368. doi: 10.1093/plcell/koac305.

Abstract

The highly diverse Solanaceae family contains several widely studied models and crop species. Fully exploring, appreciating, and exploiting this diversity requires additional model systems. Particularly promising are orphan fruit crops in the genus Physalis, which occupy a key evolutionary position in the Solanaceae and capture understudied variation in traits such as inflorescence complexity, fruit ripening and metabolites, disease and insect resistance, self-compatibility, and most notable, the striking inflated calyx syndrome (ICS), an evolutionary novelty found across angiosperms where sepals grow exceptionally large to encapsulate fruits in a protective husk. We recently developed transformation and genome editing in Physalis grisea (groundcherry). However, to systematically explore and unlock the potential of this and related Physalis as genetic systems, high-quality genome assemblies are needed. Here, we present chromosome-scale references for P. grisea and its close relative Physalis pruinosa and use these resources to study natural and engineered variations in floral traits. We first rapidly identified a natural structural variant in a bHLH gene that causes petal color variation. Further, and against expectations, we found that CRISPR–Cas9-targeted mutagenesis of 11 MADS-box genes, including purported essential regulators of ICS, had no effect on inflation. In a forward genetics screen, we identified huskless, which lacks ICS due to mutation of an AP2-like gene that causes sepals and petals to merge into a single whorl of mixed identity. These resources and findings elevate Physalis to a new Solanaceae model system and establish a paradigm in the search for factors driving ICS.

Assembly statistics

Genome size (bp) 1,373,559,025
Genome sequence No. 460
Maximum genome sequence length (bp) 132,441,007
Minimum genome sequence length (bp) 2388
Genome sequence N50 (bp) 112,874,881
Genome sequence N90 (bp) 100,751,416
Assembly level Chromosome

Assembly

The Physalis grisea Phygri_v1.0 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) Phygri_1.0.fasta.gz

Gene Predictions

The Physalis grisea Phygri_v1.0 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Phygri1.3.2_gene_models.gff.gz
CDS sequences (FASTA file) Phygri1.3.2.CDS.fasta.gz
Protein sequences (FASTA file) Phygri1.3.2.proteins.fasta.gz

Functional Analysis

Functional annotation for the Physalis grisea Phygri_v1.0 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Physalis_grisea.Pfam.tsv.gz

S genes

Summary

QueryChromosomeSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF18chr1118889174113544749-113543625Solanum lycopersicum SL2.31, SLF1883.4 F-box domain
SLF19chr3112874881105607410-105606298Solanum tuberosum DM8.1, SLF1987.7 F-box domain

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences