Petunia x hybrida 'S3LS3L (cultivar)' Assembly & Annotation

Overview

Analysis Name Petunia x hybrida 'S3LS3L (cultivar)' Assembly & Annotation
Sequencing technology Pacbio HiFi, ONT, Hi-C
Assembly method hifiasm, NextDenovo
Release Date 2026-02-03
Reference Publication(s)

Wang C, Zhao H, Wu H, Sun S, Zhang H, Xue Y. The Gap-free Petunia Genome Assemblies Reveal the Evolutionary Dynamics of the S-locus Supergene. Genomics Proteomics Bioinformatics. 2026 Feb 3:qzag011. doi: 10.1093/gpbjnl/qzag011.

Abstract

Petunia hybrida is a key genetic model for investigating self-incompatibility (SI), a reproductive barrier governed by the multi-allelic S-locus, which encodes a pistil-specific S-RNase and multiple S-locus F-box (SLF) genes. Due to high heterozygosity and abundant repetitive sequences, previous S-locus assemblies in reference genomes have been fragmented and collapsed. Here, we present the telomere-to-telomere (T2T), haplotype-resolved genomes of two homozygous SI lines (P. hybrida S3LS3L and SVSV), enabling the complete reconstruction of both S-loci. Population genomic analyses delineated their boundaries, spanning approximately 14.01 Mb and 20.83 Mb, respectively. Remarkably, both S-loci exhibited extremely low nucleotide polymorphism and structural variation compared with the remainder of the genome. In addition to the S-RNase and the complete repertoire of SLF genes, we identified two pollen-specific genes, ubiquitin-like and MYB, which may contribute to SI regulation. Our results demonstrate that the genomic architecture of the Petunia S-locus continues to evolve dynamically while retaining the core genetic components essential for SI. Furthermore, we propose six evolutionary scenarios, providing new insights into the processes driving the generation, diversification, loss, functional maintenance, and structural reorganization of SLF genes in Petunia. Overall, the T2T genomes reported here establish P. hybrida as a premier model for comparative genomics and SI research in the Solanaceae family.

Assembly statistics

SummaryPhS3LS3L haplotype1PhS3LS3L haplotype2
Total length (bp)13068597471305949475
contigs651317
Largest contig (bp)211457805211393694
GC (%)38.4138.34
N50 (bp)164991823164021228
N90 (bp)13785324843152564
L5044
L9078
T2T Contig Num.23
T2T Chromosome Num.77

Assembly

The Petunia x hybrida 'S3LS3L (cultivar)' Assembly files are available in FASTA format.

Downloads

Chromosomes (FASTA file) S3L_hap1_T2T.fa.gz S3L_hap2_T2T.fa.gz

Gene Predictions

The Petunia x hybrida 'S3LS3L (cultivar)' genome gene prediction files are available in GTF and FASTA format.

Downloads

Genes (GTF file) S3L_hap1_braker.gtf.gzS3L_hap2_braker.gtf.gz
CDS sequences (FASTA file) S3L_hap1_BRAKER3.cds.fa.gzS3L_hap2_BRAKER3.cds.fa.gz
Protein sequences (FASTA file) S3L_hap1_BRAKER3.pro.fa.gzS3L_hap2_BRAKER3.pro.fa.gz

Functional Analysis

Functional annotations for the Petunia x hybrida 'S3LS3L (cultivar)' are available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan interproscan.S3L_hap1.Pfam.tsv.zipinterproscan.S3L_hap2.Pfam.tsv.zip

S genes

Summary

QueryChromosomeStartEndDomain
S3L-SLF9AS3L_hap1_Chr3118429932118431077F-box domain
S3L-SLF5S3L_hap1_Chr3119505303119506469F-box domain
S3L-SLF6S3L_hap1_Chr3120546403120547584F-box domain
S3L-SLF14S3L_hap1_Chr3120923981120922812F-box domain
S3L-SLF16AS3L_hap1_Chr3124169019124167844F-box domain
S3L-SLF11S3L_hap1_Chr3125366501125367658F-box domain
S3L-SLF9BS3L_hap1_Chr3125772471125773616F-box domain
S3L-SLF4S3L_hap1_Chr3126163055126161847F-box domain
S3L-RNaseS3L_hap1_Chr3127512365127513108Ribonuclease T2 family
S3L-SLF16BS3L_hap1_Chr3127919219127920394F-box domain
S3L-FBX1S3L_hap1_Chr3128483288128482113F-box domain
S3L-SLFLike1-1S3L_hap1_Chr3129028523129029725F-box domain
S3L-SLF4LikeS3L_hap1_Chr3130083028130081934F-box domain
S3L-SLF13S3L_hap1_Chr3130198476130197310F-box domain
S3L-SLF3S3L_hap1_Chr3130277485130276328F-box domain
S3L-SLF12ψS3L_hap1_Chr3130433821130432613-
S3L-SLF1S3L_hap1_Chr3130492570130491401F-box domain
S3L-SLF8S3L_hap1_Chr3130535200130533986F-box domain
S3L-SLF12S3L_hap1_Chr3131181353131180172F-box domain
S3L-SLF15S3L_hap1_Chr3131752363131753541F-box domain
S3L-SLF7S3L_hap1_Chr3132438886132440064F-box domain
S3L-SLFLike1-2S3L_hap1_Chr3132662932132661845F-box domain
S3L-SLFLike1-3S3L_hap1_Chr3132731324132730107F-box domain
S3L-SLF10S3L_hap1_Chr3142803547142802375F-box domain
© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences