Pyrus betulifolia ASM784424v1 Assembly & Annotation

Overview

Analysis Name Pyrus betulifolia ASM784424v1 Assembly & Annotation
Sequencing technology PacBio
Assembly method HGAP v. 1.0
Release Date 2019-08-08
Reference Publication(s)

Dong X, Wang Z, Tian L, Zhang Y, Qi D, Huo H, Xu J, Li Z, Liao R, Shi M, Wahocho SA, Liu C, Zhang S, Tian Z, Cao Y. De novo assembly of a wild pear (Pyrus betuleafolia) genome. Plant Biotechnol J. 2020 Feb;18(2):581-595. doi: 10.1111/pbi.13226.

Abstract

China is the origin and evolutionary centre of Oriental pears. Pyrus betuleafolia is a wild species native to China and distributed in the northern region, and it is widely used as rootstock. Here, we report the de novo assembly of the genome of P. betuleafolia-Shanxi Duli using an integrated strategy that combines PacBio sequencing, BioNano mapping and chromosome conformation capture (Hi-C) sequencing. The genome assembly size was 532.7 Mb, with a contig N50 of 1.57 Mb. A total of 59 552 protein-coding genes and 247.4 Mb of repetitive sequences were annotated for this genome. The expansion genes in P. betuleafolia were significantly enriched in secondary metabolism, which may account for the organism's considerable environmental adaptability. An alignment analysis of orthologous genes showed that fruit size, sugar metabolism and transport, and photosynthetic efficiency were positively selected in Oriental pear during domestication. A total of 573 nucleotide-binding site (NBS)-type resistance gene analogues (RGAs) were identified in the P. betuleafolia genome, 150 of which are TIR-NBS-LRR (TNL)-type genes, which represented the greatest number of TNL-type genes among the published Rosaceae genomes and explained the strong disease resistance of this wild species. The study of flavour metabolism-related genes showed that the anthocyanidin reductase (ANR) metabolic pathway affected the astringency of pear fruit and that sorbitol transporter (SOT) transmembrane transport may be the main factor affecting the accumulation of soluble organic matter. This high-quality P. betuleafolia genome provides a valuable resource for the utilization of wild pear in fundamental pear studies and breeding.

Assembly statistics

Genome size 532.2 Mb
Total ungapped length 496.5 Mb
Number of chromosomes 17
Number of organelles 1
Number of scaffolds 136
Scaffold N50 28.1 Mb
Scaffold L50 8
Number of contigs 592
Contig N50 1.6 Mb
Contig L50 93
GC percent 37.5
Genome coverage 95.9x
Assembly level Chromosome
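
For readers unfamiliar with the N50 and L50 figures above, the following is a minimal sketch of how such values are commonly derived from a list of sequence lengths. The function name `n50_l50` and the toy lengths are illustrative assumptions, not part of this record, and the definition used (the first length at which the running total reaches half of the total) is the common convention rather than a documented detail of how these statistics were produced.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a collection of sequence lengths.

    N50 is the length of the sequence at which the running total,
    taken from longest to shortest, first reaches half of the total
    assembly length. L50 is how many sequences that took.
    """
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, count
    raise ValueError("lengths must not be empty")


# Toy lengths (not taken from this assembly): total 300, half 150.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50_l50(toy_lengths))  # (70, 2)
```

Applied to contig lengths this gives the contig N50 and L50; applied to scaffold lengths it gives the scaffold values.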

Assembly

The Pyrus betulifolia ASM784424v1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHAAYT00000000.genome.fasta.gz
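
As a rough illustration of how the downloaded FASTA file might be inspected, the sketch below streams a gzip-compressed FASTA and reports the length and GC fraction of each record. The helper name `fasta_stats` is hypothetical, the demo data are made up, and the GC fraction here is computed over all characters including any N bases, which may differ from how the GC percent in the statistics was calculated.

```python
import gzip
import io


def fasta_stats(handle):
    """Yield (name, length, gc_fraction) for each record in a text-mode FASTA handle."""
    name, length, gc = None, 0, 0
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, length, (gc / length if length else 0.0)
            # Keep only the first whitespace-delimited token of the header.
            name = (line[1:].split() or [""])[0]
            length, gc = 0, 0
        else:
            seq = line.upper()
            length += len(seq)
            gc += seq.count("G") + seq.count("C")
    if name is not None:
        yield name, length, (gc / length if length else 0.0)


# Made-up two-record FASTA, compressed in memory so the sketch runs anywhere.
demo = gzip.compress(b">chrA demo\nACGT\nGGCC\n>chrB\nAATT\n")
with gzip.open(io.BytesIO(demo), "rt") as handle:
    for record in fasta_stats(handle):
        print(record)

# For the real download, the same function could be used as:
# with gzip.open("GWHAAYT00000000.genome.fasta.gz", "rt") as handle:
#     for name, length, gc in fasta_stats(handle):
#         print(name, length, round(gc, 3))
```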

Gene Predictions

The Pyrus betulifolia ASM784424v1 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHAAYT00000000.gff.gz
CDS sequences (FASTA file) GWHAAYT00000000.CDS.fasta.gz
Protein sequences (FASTA file) GWHAAYT00000000.Protein.faa.gz
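
A quick sanity check on a GFF3 download is to count features by type. The sketch below assumes the file follows the standard nine-column, tab-separated GFF3 layout; the function name `count_feature_types` and the sample lines (sequence ID, source and gene IDs) are invented for illustration and are not taken from the actual annotation file.

```python
from collections import Counter


def count_feature_types(lines):
    """Count GFF3 features by the value in column 3 (feature type)."""
    counts = Counter()
    for line in lines:
        if line.startswith("##FASTA"):
            break  # an embedded sequence section follows; stop parsing features
        if line.startswith("#") or not line.strip():
            continue  # skip directives, comments and blank lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 9:
            continue  # not a well-formed feature line
        counts[fields[2]] += 1
    return counts


# Invented sample lines; the IDs and source name are placeholders.
sample = [
    "##gff-version 3\n",
    "Chr17\tdemo\tgene\t100\t900\t.\t+\t.\tID=demo_gene1\n",
    "Chr17\tdemo\tmRNA\t100\t900\t.\t+\t.\tID=demo_mrna1;Parent=demo_gene1\n",
    "Chr17\tdemo\tCDS\t100\t400\t.\t+\t0\tParent=demo_mrna1\n",
    "Chr17\tdemo\tCDS\t600\t900\t.\t+\t2\tParent=demo_mrna1\n",
]
print(count_feature_types(sample))

# For the real download (opened with gzip in text mode):
# import gzip
# with gzip.open("GWHAAYT00000000.gff.gz", "rt") as handle:
#     print(count_feature_types(handle))
```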

Functional Analysis

Functional annotation for the Pyrus betulifolia ASM784424v1 assembly is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan Pyrus_betulifolia_ASM784424v1.Pfam.tsv.gz
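
One way the domain table might be summarized is to group Pfam hits by protein. The sketch below assumes the file uses the standard InterProScan TSV column order (protein accession, MD5, sequence length, analysis, signature accession, signature description, start, stop, and so on); the layout of this particular download has not been verified here, so the column indices may need adjusting. The function name `pfam_domains_by_protein` and all sample rows are placeholders.

```python
import csv
from collections import defaultdict


def pfam_domains_by_protein(lines, analysis="Pfam"):
    """Map each protein to its sorted (start, stop, accession, description) hits.

    Assumes standard InterProScan TSV columns; adjust the indices if the
    downloaded file is laid out differently.
    """
    domains = defaultdict(list)
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 8 or row[3] != analysis:
            continue
        protein, accession, description = row[0], row[4], row[5]
        start, stop = int(row[6]), int(row[7])
        domains[protein].append((start, stop, accession, description))
    for hits in domains.values():
        hits.sort()
    return dict(domains)


# Placeholder rows: protein IDs, checksums and accessions are invented.
rows = [
    ["demo_prot1", "md5a", "390", "Pfam", "PF_B", "second domain", "120", "300", "1e-20", "T", "01-01-2023"],
    ["demo_prot1", "md5a", "390", "Pfam", "PF_A", "first domain", "5", "50", "1e-10", "T", "01-01-2023"],
    ["demo_prot2", "md5b", "228", "OtherDB", "X1", "ignored hit", "10", "200", "1e-30", "T", "01-01-2023"],
]
sample = ["\t".join(row) + "\n" for row in rows]
for protein, hits in pfam_domains_by_protein(sample).items():
    print(protein, hits)
```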

S genes

Summary

Query              Chr     Size (bp)  Coordinates                          Domain
PbeSFBB.XVI-S67    Chr 17  29412830   26093227-26094414                    F-box; F_box_assoc
PbeSFBB.X-S67      Chr 17  29412830   26110389-26111567                    F-box; F_box_assoc
PbeSFBB.XI-S67     Chr 17  29412830   26131630-26132814                    F-box; F_box_assoc
PbeSFBB.III-S67    Chr 17  29412830   26250993-26252174                    F-box; F_box_assoc
PbeSFBB.IV-S67     Chr 17  29412830   26336706-26337890                    F-box; F_box_assoc
PbeSFBB.Ia-S67     Chr 17  29412830   26371301-26372503                    F-box; F_box_assoc
PbeSFBB.XV-S67     Chr 17  29412830   26397872-26399086                    F-box; F_box_assoc
PbeSFBB.VI-S67     Chr 17  29412830   26403481-26404659                    F-box; F_box_assoc
PbeSFBB.Ib-S67     Chr 17  29412830   26563583-26564785                    F-box; F_box_assoc
PbeSFBB.XIII-S67   Chr 17  29412830   26708911-26710092                    F-box; F_box_assoc
PbeSFBB.XII-S67    Chr 17  29412830   26783023-26784195                    F-box; F_box_assoc
PbeSFBB.XIV-S67    Chr 17  29412830   26856722-26857942                    F-box; F_box_assoc
S67-RNase          Chr 17  29412830   26886516-26886767,26886918-26887352  RNase_T2
PbeSFBB.IX-S67     Chr 17  29412830   26965220-26966398                    F-box; F_box_assoc
PbeSFBB.VII-S67    Chr 17  29412830   27014738-27015910                    F-box; F_box_assoc
PbeSFBB.XIX-S67    Chr 17  29412830   27098826-27100058                    F-box; F_box_assoc
PbeSFBB.XVIII-S67  Chr 17  29412830   27111438-27112622                    F-box; F_box_assoc
PbeSFBB.VIII-S67   Chr 17  29412830   27139919-27141109                    F-box; F_box_assoc
PbeSFBB.XXI-S67    Chr 17  29412830   27470492-27471841                    F-box; F_box_assoc
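
The coordinate strings in the table above can be turned into intervals to compute gene lengths or the extent of the S locus. The sketch below is one possible approach; it assumes the coordinates are 1-based and inclusive (the table does not state this), that comma-separated ranges represent separate segments of one gene, and it uses the hypothetical helper names `parse_coordinates` and `covered_length`. Only three rows of the table are copied in for brevity.

```python
def parse_coordinates(text):
    """Parse 'start-end[,start-end...]' into a list of (start, end) tuples."""
    intervals = []
    for part in text.split(","):
        start, end = (int(value) for value in part.split("-"))
        intervals.append((start, end))
    return intervals


def covered_length(intervals):
    """Total bases covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in intervals)


# Three rows copied from the S genes table (first, S-RNase, last).
s_locus = {
    "PbeSFBB.XVI-S67": "26093227-26094414",
    "S67-RNase": "26886516-26886767,26886918-26887352",
    "PbeSFBB.XXI-S67": "27470492-27471841",
}

for gene, text in s_locus.items():
    print(gene, covered_length(parse_coordinates(text)))

# Extent of the region spanned by the listed genes on Chr 17.
all_intervals = [iv for text in s_locus.values() for iv in parse_coordinates(text)]
region_start = min(start for start, _ in all_intervals)
region_end = max(end for _, end in all_intervals)
print(region_start, region_end, region_end - region_start + 1)
```

Under the inclusive-coordinate assumption, each of the three lengths printed is a multiple of three, which is consistent with these ranges describing coding sequence.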

Pyrus betulifolia ASM784424v1 S genes Nucleotide

Pyrus betulifolia ASM784424v1 S genes Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences