Papaver somniferum Pso_HiFi_v3 Assembly & Annotation

Overview

Analysis Name Papaver somniferum Pso_HiFi_v3 Assembly & Annotation
Sequencing technology PacBio, Oxford Nanopore
Assembly method hifiasm 0.16.1, Shasta 0.10
Release Date 2024-03-01
Reference Publication(s)

Yang X, Gao S, Guo L, Wang B, Jia Y, Zhou J, Che Y, Jia P, Lin J, Xu T, Sun J, Ye K. Three chromosome-scale Papaver genomes reveal punctuated patchwork evolution of the morphinan and noscapine biosynthesis pathway. Nat Commun. 2021 Oct 15;12(1):6030. doi: 10.1038/s41467-021-26330-8.

Abstract

For millions of years, plants evolve plenty of structurally diverse secondary metabolites (SM) to support their sessile lifestyles through continuous biochemical pathway innovation. While new genes commonly drive the evolution of plant SM pathway, how a full biosynthetic pathway evolves remains poorly understood. The evolution of pathway involves recruiting new genes along the reaction cascade forwardly, backwardly, or in a patchwork manner. With three chromosome-scale Papaver genome assemblies, we here reveal whole-genome duplications (WGDs) apparently accelerate chromosomal rearrangements with a nonrandom distribution towards SM optimization. A burst of structural variants involving fusions, translocations and duplications within 7.7 million years have assembled nine genes into the benzylisoquinoline alkaloids gene cluster, following a punctuated patchwork model. Biosynthetic gene copies and their total expression matter to morphinan production. Our results demonstrate how new genes have been recruited from a WGD-induced repertoire of unregulated enzymes with promiscuous reactivities to innovate efficient metabolic pathways with spatiotemporal constraint.

Assembly statistics

Genome size (bp) 2,786,512,425
Number of chromosome sequences 11
Number of genome sequences 513
Maximum genome sequence length (bp) 344,816,055
Minimum genome sequence length (bp) 19,064
Average genome sequence length (bp) 5,431,798
Genome sequence N50 (bp) 261,977,153
Genome sequence N90 (bp) 186,096,664
Assembly level Chromosome
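
The N50 and N90 figures above follow the usual convention: sort the sequences from longest to shortest and report the length of the sequence at which the running total first reaches 50% (or 90%) of the assembly size. The sketch below illustrates that calculation on a toy list of lengths, not on the real 513 sequences. The function name `nx` and the toy values are illustrative assumptions. Only the genome size and sequence count are taken from the table.

```python
def nx(lengths, fraction):
    """Return the Nx length: the length of the sequence at which the
    cumulative sum (longest first) reaches `fraction` of the total."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    threshold = sum(lengths) * fraction
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= threshold:
            return length
    return min(lengths)  # guard against floating-point edge cases


# Hypothetical lengths for illustration (total 300 bp)
toy_lengths = [100, 80, 60, 40, 20]
n50 = nx(toy_lengths, 0.5)   # running totals: 100, 180 -> reaches 150 at 80
n90 = nx(toy_lengths, 0.9)   # running totals: 100, 180, 240, 280 -> reaches 270 at 40

# Consistency check using the genome size and sequence count reported above
reported_mean = 2_786_512_425 // 513
```

The integer division of the reported genome size by the reported sequence count gives 5,431,798, which agrees with the average sequence length in the table.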

Assembly

The Papaver somniferum Pso_HiFi_v3 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHAZPJ00000000.1.genome.fasta.gz

Gene Predictions

Gene prediction files for the Papaver somniferum Pso_HiFi_v3 genome are available in GFF3 and FASTA formats.

Downloads

Genes (GFF3 file) Pso.HiFi.gene.gff3.tar.gz
CDS sequences (FASTA file) Pso.HiFi.cds.fa.tar.gz
Protein sequences (FASTA file) Pso.HiFi.pep.fa.tar.gz
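
As a quick sanity check on a GFF3 download, it can help to tally the feature types in column 3. The sketch below assumes the standard nine-column, tab-separated GFF3 layout. The example records are hypothetical and are not taken from Pso.HiFi.gene.gff3, and the function name `count_feature_types` is illustrative.

```python
from collections import Counter


def count_feature_types(lines):
    """Count GFF3 feature types (column 3), skipping comments and blank lines."""
    counts = Counter()
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) != 9:
            continue  # not a well-formed GFF3 feature line
        counts[fields[2]] += 1
    return counts


# Hypothetical records for illustration only
example = [
    "##gff-version 3",
    "chr1\tsketch\tgene\t100\t900\t.\t+\t.\tID=gene1",
    "chr1\tsketch\tmRNA\t100\t900\t.\t+\t.\tID=mrna1;Parent=gene1",
    "chr1\tsketch\tCDS\t100\t400\t.\t+\t0\tID=cds1;Parent=mrna1",
    "chr1\tsketch\tCDS\t600\t900\t.\t+\t2\tID=cds1;Parent=mrna1",
]
feature_counts = count_feature_types(example)
```

After unpacking the archive, an open text file handle for the GFF3 file can be passed to the function in place of the example list.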

Functional Analysis

Functional annotation for the Papaver somniferum Pso_HiFi_v3 assembly is available for download below. The predicted proteins were analyzed with InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan Papaver_somniferum_Pso_HiFi_v3.Pfam.tsv.gz
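
The Pfam assignments can be grouped by protein once the file is read. The sketch below assumes the file uses the standard InterProScan TSV column order (protein accession, sequence MD5, sequence length, analysis, signature accession, signature description, start, stop, score, status, date). That layout has not been verified against this particular download. The example rows and the function name `pfam_domains_by_protein` are hypothetical.

```python
import csv
import io
from collections import defaultdict


def pfam_domains_by_protein(handle):
    """Map protein ID -> list of (Pfam accession, start, end), assuming the
    standard InterProScan TSV column order."""
    domains = defaultdict(list)
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 8 or row[3] != "Pfam":
            continue  # skip short rows and non-Pfam analyses
        protein_id, accession = row[0], row[4]
        start, end = int(row[6]), int(row[7])
        domains[protein_id].append((accession, start, end))
    return dict(domains)


# Hypothetical rows for illustration only
example_tsv = io.StringIO(
    "prot1\tmd5a\t300\tPfam\tPF00067\tCytochrome P450\t30\t280\t1.0E-50\tT\t01-01-2024\n"
    "prot1\tmd5a\t300\tOtherDB\tX0001\tSome signature\t10\t50\t-\tT\t01-01-2024\n"
    "prot2\tmd5b\t150\tPfam\tPF00067\tCytochrome P450\t5\t140\t2.0E-20\tT\t01-01-2024\n"
)
pfam_hits = pfam_domains_by_protein(example_tsv)
```

For the compressed download, a handle from `gzip.open(path, "rt")` can be passed in place of the `io.StringIO` object.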

S genes

Summary

Query | Chromosome | Size (bp) | Coordinates | BLASTn Hit | BLASTn %ID
S1-like | GWHAZPJ00000002.1 | 344816055 | 25312164-25311613 | Papaver somniferum XM_026594847.1, S1-like | 100
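
In the summary row the start coordinate is larger than the end coordinate, which commonly indicates a hit on the reverse strand. The sketch below assumes 1-based inclusive coordinates and that reversed coordinates mean the minus strand. Neither convention is stated on this page. The toy sequence and the function name `extract_region` are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def extract_region(sequence, start, end):
    """Return the 1-based inclusive region; if start > end, return its
    reverse complement (assumed to mean the minus strand)."""
    lo, hi = min(start, end), max(start, end)
    region = sequence[lo - 1:hi]
    if start > end:
        region = region.translate(_COMPLEMENT)[::-1]
    return region


# Span implied by the coordinates in the summary row
region_length = abs(25312164 - 25311613) + 1

# Hypothetical sequence for illustration only
toy = "AACCGGTTAC"
forward = extract_region(toy, 2, 5)   # positions 2..5 as written
reverse = extract_region(toy, 5, 2)   # same positions, reverse-complemented
```

Under these assumptions the S1-like hit spans 552 bp on GWHAZPJ00000002.1.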

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences