Analysis Name | Papaver setigerum Pse_HiFi_v2 Assembly & Annotation |
Sequencing technology | PacBio |
Assembly method | hifiasm 0.16.1 |
Release Date | 2024-03-01 |
Yang X, Gao S, Guo L, Wang B, Jia Y, Zhou J, Che Y, Jia P, Lin J, Xu T, Sun J, Ye K. Three chromosome-scale Papaver genomes reveal punctuated patchwork evolution of the morphinan and noscapine biosynthesis pathway. Nat Commun. 2021 Oct 15;12(1):6030. doi: 10.1038/s41467-021-26330-8.
Abstract
For millions of years, plants evolve plenty of structurally diverse secondary metabolites (SM) to support their sessile lifestyles through continuous biochemical pathway innovation. While new genes commonly drive the evolution of plant SM pathway, how a full biosynthetic pathway evolves remains poorly understood. The evolution of pathway involves recruiting new genes along the reaction cascade forwardly, backwardly, or in a patchwork manner. With three chromosome-scale Papaver genome assemblies, we here reveal whole-genome duplications (WGDs) apparently accelerate chromosomal rearrangements with a nonrandom distribution towards SM optimization. A burst of structural variants involving fusions, translocations and duplications within 7.7 million years have assembled nine genes into the benzylisoquinoline alkaloids gene cluster, following a punctuated patchwork model. Biosynthetic gene copies and their total expression matter to morphinan production. Our results demonstrate how new genes have been recruited from a WGD-induced repertoire of unregulated enzymes with promiscuous reactivities to innovate efficient metabolic pathways with spatiotemporal constraint.
Assembly statistics
Genome size (bp) | 4,691,551,335 |
Number of chromosome sequences | 22 |
Number of genome sequences | 1,269 |
Maximum genome sequence length (bp) | 342,945,758 |
Minimum genome sequence length (bp) | 13,936 |
Average genome sequence length (bp) | 3,697,046 |
Genome sequence N50 (bp) | 217,393,329 |
Genome sequence N90 (bp) | 158,403,959 |
Assembly level | Chromosome |
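As a reading aid for the N50 and N90 rows above, the following is a minimal sketch of one common way to compute Nx values from a list of sequence lengths. The function name `nx` and the toy lengths are illustrative assumptions and do not come from the assembly pipeline; the only figures taken from the table are the genome size and sequence count, used here to re-derive the average sequence length.

```python
def nx(lengths, percent):
    """Return the Nx length for a list of sequence lengths.

    Nx is the length L of the shortest sequence in the smallest set of
    longest sequences whose combined length reaches `percent` of the total.
    `percent` is an integer such as 50 (N50) or 90 (N90).
    """
    if not lengths:
        raise ValueError("no sequence lengths given")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence to the shortest, accumulating length.
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer comparison avoids floating-point rounding at the threshold.
        if running * 100 >= total * percent:
            return length
    return min(lengths)  # not reached for 0 < percent <= 100


# Toy lengths (hypothetical, not from this assembly).
toy_lengths = [100, 80, 60, 40, 20]
toy_n50 = nx(toy_lengths, 50)
toy_n90 = nx(toy_lengths, 90)

# Average sequence length re-derived from the table values:
# genome size (bp) divided by the number of genome sequences.
average_length = round(4_691_551_335 / 1_269)

print(toy_n50, toy_n90, average_length)
```

Applied to the real assembly, the same function would take the lengths of all 1,269 sequences in the FASTA file as input.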
The Papaver setigerum Pse_HiFi_v2 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | GWHAZPH00000000.1.genome.fasta.gz |
The Papaver setigerum Pse_HiFi_v2 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | Pse.HiFi.gene.gff3.tar.gz |
CDS sequences (FASTA file) | Pse.HiFi.cds.fa.tar.gz |
Protein sequences (FASTA file) | Pse.HiFi.pep.fa.tar.gz |
Functional annotation for the Papaver setigerum Pse_HiFi_v2 assembly is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains (Pfam).
Downloads
Domain from InterProScan | Papaver_setigerum_Pse_HiFi_v2.Pfam.tsv.gz |
Summary
Query | Chromosome | Size (bp) | Coordinates | BLASTn Hit | BLASTn %ID |
S1-like | GWHAZPH00000001.1 | 342945758 | 319059255-319059806 | Papaver somniferum XM_026594847.1, S1-like | 99.3 |
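The Coordinates column can be turned into a sequence slice as sketched below. This assumes the coordinates are 1-based and inclusive, which the page does not state; the helper names `parse_region` and `extract_region` and the toy sequence are hypothetical. Under that assumption the S1-like interval spans 552 bp.

```python
def parse_region(text):
    """Parse a 'start-end' coordinate string into a pair of integers."""
    start_text, end_text = text.split("-")
    start, end = int(start_text), int(end_text)
    if start < 1 or end < start:
        raise ValueError(f"invalid region: {text!r}")
    return start, end


def extract_region(sequence, start, end):
    """Return the subsequence for 1-based, inclusive coordinates (assumed)."""
    # Python slices are 0-based and end-exclusive, so shift only the start.
    return sequence[start - 1:end]


# Coordinates of the S1-like hit as listed in the Summary table.
start, end = parse_region("319059255-319059806")
region_length = end - start + 1  # inclusive interval length

# Toy sequence (hypothetical) to show the slicing convention.
toy_sequence = "ACGTACGTAC"
toy_slice = extract_region(toy_sequence, 3, 6)

print(region_length, toy_slice)
```

For the real chromosome, `sequence` would be the GWHAZPH00000001.1 record read from the genome FASTA file.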