Papaver rhoeas Prh_HiFi_v2 Assembly & Annotation

Overview

Analysis Name Papaver rhoeas Prh_HiFi_v2 Assembly & Annotation
Sequencing technology PacBio
Assembly method hifiasm 0.16.1
Release Date 2024-03-01
Reference Publication(s)

Yang X, Gao S, Guo L, Wang B, Jia Y, Zhou J, Che Y, Jia P, Lin J, Xu T, Sun J, Ye K. Three chromosome-scale Papaver genomes reveal punctuated patchwork evolution of the morphinan and noscapine biosynthesis pathway. Nat Commun. 2021 Oct 15;12(1):6030. doi: 10.1038/s41467-021-26330-8.

Abstract

For millions of years, plants evolve plenty of structurally diverse secondary metabolites (SM) to support their sessile lifestyles through continuous biochemical pathway innovation. While new genes commonly drive the evolution of plant SM pathway, how a full biosynthetic pathway evolves remains poorly understood. The evolution of pathway involves recruiting new genes along the reaction cascade forwardly, backwardly, or in a patchwork manner. With three chromosome-scale Papaver genome assemblies, we here reveal whole-genome duplications (WGDs) apparently accelerate chromosomal rearrangements with a nonrandom distribution towards SM optimization. A burst of structural variants involving fusions, translocations and duplications within 7.7 million years have assembled nine genes into the benzylisoquinoline alkaloids gene cluster, following a punctuated patchwork model. Biosynthetic gene copies and their total expression matter to morphinan production. Our results demonstrate how new genes have been recruited from a WGD-induced repertoire of unregulated enzymes with promiscuous reactivities to innovate efficient metabolic pathways with spatiotemporal constraint.

Assembly statistics

Genome size (bp) 2,291,049,518
Number of chromosome sequences 7
Number of genome sequences 93
Maximum genome sequence length (bp) 380,013,634
Minimum genome sequence length (bp) 3,790
Average genome sequence length (bp) 24,634,941
Genome sequence N50 (bp) 295,707,834
Genome sequence N90 (bp) 275,865,493
Assembly level Chromosome
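
The statistics above can be recomputed from a list of sequence lengths. The following is a minimal sketch, not the pipeline used to produce this table: the function names `nx` and `summarize` are hypothetical, and it assumes the common definition of Nx (the length of the shortest sequence in the smallest set of longest sequences that together cover at least x% of the assembly).

```python
def nx(lengths, fraction):
    """Return the Nx length for a fraction in (0, 1], e.g. 0.5 for N50.

    Assumed definition: sort lengths from longest to shortest and return
    the length at which the running total first reaches fraction * total.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    ordered = sorted(lengths, reverse=True)
    threshold = fraction * sum(ordered)
    running = 0
    for length in ordered:
        running += length
        if running >= threshold:
            return length
    return ordered[-1]  # only reached through floating-point rounding


def summarize(lengths):
    """Compute the same fields as the assembly statistics table."""
    total = sum(lengths)
    return {
        "genome_size": total,
        "sequence_count": len(lengths),
        "max_length": max(lengths),
        "min_length": min(lengths),
        "mean_length": total // len(lengths),  # floor of the mean
        "n50": nx(lengths, 0.5),
        "n90": nx(lengths, 0.9),
    }
```

As a consistency check on the table itself, 2,291,049,518 bp divided by 93 sequences and rounded down gives 24,634,941 bp, which matches the listed average sequence length.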

Assembly

The Papaver rhoeas Prh_HiFi_v2 genome assembly is available for download in FASTA format.

Downloads

Chromosomes (FASTA file) GWHAZPI00000000.1.genome.fasta.gz
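
The gzip-compressed FASTA file can be read as a stream without decompressing it to disk first. The sketch below assumes the file is standard gzip-compressed FASTA. The helper names `fasta_lengths` and `fasta_lengths_from_gzip` are hypothetical, and only the Python standard library is used.

```python
import gzip


def fasta_lengths(handle):
    """Yield (sequence_id, length) pairs from an iterable of FASTA text lines.

    The sequence ID is taken as the first whitespace-delimited token
    after '>' on each header line.
    """
    name, length = None, 0
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, length
            tokens = line[1:].split()
            name, length = (tokens[0] if tokens else ""), 0
        elif name is not None:
            length += len(line)
    if name is not None:
        yield name, length


def fasta_lengths_from_gzip(path):
    """Stream (sequence_id, length) pairs from a gzip-compressed FASTA file."""
    with gzip.open(path, "rt") as handle:
        yield from fasta_lengths(handle)
```

For example, `dict(fasta_lengths_from_gzip("GWHAZPI00000000.1.genome.fasta.gz"))` would give a mapping of sequence IDs to lengths that could then be compared with the assembly statistics listed on this page.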

Gene Predictions

The Papaver rhoeas Prh_HiFi_v2 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Prh.HiFi.gene.gff3.tar.gz
CDS sequences (FASTA file) Prh.HiFi.cds.fa.gz
Protein sequences (FASTA file) Prh.HiFi.pep.fa.tar.gz
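
The gene models are distributed as a tar archive, so the GFF3 content has to be read from the archive members. The sketch below assumes the members follow the standard nine-column, tab-separated GFF3 layout. The member names inside the archive are not documented on this page, so the code reads every regular file it finds. All function names here are hypothetical.

```python
import io
import tarfile
from collections import Counter


def parse_gff3_line(line):
    """Parse one GFF3 feature line into a dict, or return None for
    comments, directives, blank lines, and lines without nine columns."""
    line = line.rstrip("\n")
    if not line or line.startswith("#"):
        return None
    fields = line.split("\t")
    if len(fields) != 9:
        return None
    seqid, source, ftype, start, end, score, strand, phase, attrs = fields
    attributes = dict(
        pair.strip().split("=", 1) for pair in attrs.split(";") if "=" in pair
    )
    return {
        "seqid": seqid,
        "source": source,
        "type": ftype,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "attributes": attributes,
    }


def count_feature_types(lines):
    """Count features per type (gene, mRNA, CDS, ...) in GFF3 lines."""
    counts = Counter()
    for line in lines:
        record = parse_gff3_line(line)
        if record is not None:
            counts[record["type"]] += 1
    return counts


def gff3_lines_from_tar(path):
    """Yield text lines from every regular file in a .tar.gz archive."""
    with tarfile.open(path, "r:gz") as archive:
        for member in archive:
            if not member.isfile():
                continue
            raw = archive.extractfile(member)
            if raw is None:
                continue
            yield from io.TextIOWrapper(raw, encoding="utf-8")
```

A call such as `count_feature_types(gff3_lines_from_tar("Prh.HiFi.gene.gff3.tar.gz"))` would then report how many features of each type the annotation contains.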

Functional Analysis

Functional annotation for the Papaver rhoeas Prh_HiFi_v2 genome is available for download below. The predicted proteins were analyzed with InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan Papaver_rhoeas_Prh_HiFi_v2.Pfam.tsv.gz
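
The domain table can be grouped by protein for downstream use. The sketch below assumes the file follows the standard InterProScan TSV column order (protein accession, sequence MD5, sequence length, analysis, signature accession, signature description, start, stop, and so on). The actual column layout of this file is not documented on this page and may differ, so check it before relying on the result. The function names are hypothetical.

```python
import gzip
from collections import defaultdict


def domains_by_protein(lines, analysis="Pfam"):
    """Group domain hits by protein accession.

    Assumes the standard InterProScan TSV column order:
    0 protein, 3 analysis, 4 signature accession, 6 start, 7 stop.
    Lines that do not fit this layout are skipped.
    """
    hits = defaultdict(list)
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 8 or fields[3] != analysis:
            continue
        try:
            start, stop = int(fields[6]), int(fields[7])
        except ValueError:
            continue
        hits[fields[0]].append((fields[4], start, stop))
    return dict(hits)


def domains_from_gzip(path, analysis="Pfam"):
    """Read a gzip-compressed TSV file and group its domain hits."""
    with gzip.open(path, "rt") as handle:
        return domains_by_protein(handle, analysis)
```

The result maps each protein accession to a list of `(signature_accession, start, stop)` tuples, which can be joined with the gene models by protein identifier.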