Oryza alta PPR1 Assembly & Annotation

Overview

Analysis Name Oryza alta PPR1 Assembly & Annotation
Sequencing technology PacBio
Assembly method CANU version1.3
Release Date 2021-02-04
Reference Publication(s)

Yu H, Lin T, Meng X, Du H, Zhang J, Liu G, Chen M, Jing Y, Kou L, Li X, Gao Q, Liang Y, Liu X, Fan Z, Liang Y, Cheng Z, Chen M, Tian Z, Wang Y, Chu C, Zuo J, Wan J, Qian Q, Han B, Zuccolo A, Wing RA, Gao C, Liang C, Li J. A route to de novo domestication of wild allotetraploid rice. Cell. 2021 Mar 4;184(5):1156-1170.e14. doi: 10.1016/j.cell.2021.01.013.

Summary

Cultivated rice varieties are all diploid, and polyploidization of rice has long been desired because of its advantages in genome buffering, vigorousness, and environmental robustness. However, a workable route remains elusive. Here, we describe a practical strategy, namely de novo domestication of wild allotetraploid rice. By screening allotetraploid wild rice inventory, we identified one genotype of Oryza alta (CCDD), polyploid rice 1 (PPR1), and established two important resources for its de novo domestication: (1) an efficient tissue culture, transformation, and genome editing system and (2) a high-quality genome assembly discriminated into two subgenomes of 12 chromosomes apiece. With these resources, we show that six agronomically important traits could be rapidly improved by editing O. alta homologs of the genes controlling these traits in diploid rice. Our results demonstrate the possibility that de novo domesticated allotetraploid rice can be developed into a new staple cereal to strengthen world food security.

Assembly statistics

Genome size (bp)894,549,829
GC content43.88%
Chromosomes sequence No.24
Genome sequence No.274
Maximum genome sequence length (bp)50,345,809
Minimum genome sequence length (bp)15,000
Average genome sequence length (bp)3,264,780
Genome sequence N50 (bp)37,066,527
Genome sequence N90 (bp)27,781,296
Assembly levelChromosome

Assembly

The Oryza alta PPR1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHAZTO00000000.genome.fasta.gz

Gene Predictions

The Oryza alta PPR1 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHAZTO00000000.gff.gz
CDS sequences (FASTA file) GWHAZTO00000000.RNA.fasta.gz
Protein sequences (FASTA file) GWHAZTO00000000.Protein.faa.gz

Functional Analysis

Functional annotation for the Oryza alta PPR1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Oryza_alta.Pfam.tsv.gz

S genes

Summary

QueryChromosomeSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
DUF247I-SΨGWHAZTO00000005354713876448868-6449179Olongistaminata77DUF247
DUF247II-ZΨGWHAZTO000000043706652734139204-34139425TturgidumZ261DUF247
HPS10-ZGWHAZTO000000043706652734135947-34135977,
34136068-34136159
LpsZ_contig5509764-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences