Oryza punctata OpunRS2 Assembly & Annotation

Overview

Analysis Name Oryza punctata OpunRS2 Assembly & Annotation
Sequencing technology PacBio
Assembly method MECAT v. 1.3; CANU v. 1.5; FALCON v. 2017.06.28-18.01-py2.7-ucs4
Release Date 2021-03-10
Reference Publication(s)

Stein JC, Yu Y, Copetti D, Zwickl DJ, Zhang L, Zhang C, Chougule K, Gao D, Iwata A, Goicoechea JL, Wei S, Wang J, Liao Y, Wang M, Jacquemin J, Becker C, Kudrna D, Zhang J, Londono CEM, Song X, Lee S, Sanchez P, Zuccolo A, Ammiraju JSS, Talag J, Danowitz A, Rivera LF, Gschwend AR, Noutsos C, Wu CC, Kao SM, Zeng JW, Wei FJ, Zhao Q, Feng Q, El Baidouri M, Carpentier MC, Lasserre E, Cooke R, Rosa Farias DD, da Maia LC, Dos Santos RS, Nyberg KG, McNally KL, Mauleon R, Alexandrov N, Schmutz J, Flowers D, Fan C, Weigel D, Jena KK, Wicker T, Chen M, Han B, Henry R, Hsing YC, Kurata N, de Oliveira AC, Panaud O, Jackson SA, Machado CA, Sanderson MJ, Long M, Ware D, Wing RA. Genomes of 13 domesticated and wild rice relatives highlight genetic conservation, turnover and innovation across the genus Oryza. Nat Genet. 2018 Feb;50(2):285-296. doi: 10.1038/s41588-018-0040-0.

Abstract

The genus Oryza is a model system for the study of molecular evolution over time scales ranging from a few thousand to 15 million years. Using 13 reference genomes spanning the Oryza species tree, we show that despite few large-scale chromosomal rearrangements rapid species diversification is mirrored by lineage-specific emergence and turnover of many novel elements, including transposons, and potential new coding and noncoding genes. Our study resolves controversial areas of the Oryza phylogeny, showing a complex history of introgression among different chromosomes in the young ‘AA’ subclade containing the two domesticated species. This study highlights the prevalence of functionally coupled disease resistance genes and identifies many new haplotypes of potential use for future crop protection. Finally, this study marks a milestone in modern rice research with the release of a complete long-read assembly of IR 8 ‘Miracle Rice’, which relieved famine and drove the Green Revolution in Asia 50 years ago.

Assembly statistics

Genome size 422.4 Mb
Total ungapped length 422.4 Mb
Gaps between scaffolds 16
Number of chromosomes 12
Number of scaffolds 77
Scaffold N50 18.6 Mb
Scaffold L50 10
Number of contigs 77
Contig N50 18.6 Mb
Contig L50 10
GC percent 43.5
Genome coverage 165.0x
Assembly level Chromosome
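
For readers unfamiliar with the N50 and L50 figures listed above, the following is a minimal sketch of how these statistics are conventionally computed from a list of sequence lengths. The function name `n50_l50` and the toy lengths are illustrative assumptions, not values or code from the OpunRS2 pipeline.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of sequence lengths.

    N50 is the length of the shortest sequence in the smallest set of
    longest sequences that together cover at least half of the total
    length; L50 is the number of sequences in that set.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            return length, count


# Toy lengths (hypothetical, not taken from the assembly).
toy_lengths = [50, 40, 30, 20, 10]
n50, l50 = n50_l50(toy_lengths)
print(n50, l50)
```

Applied to the real scaffold lengths, the same calculation would be expected to give figures comparable to the Scaffold N50 and L50 reported here, subject to rounding.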

Assembly

The Oryza punctata OpunRS2 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_000573905.2_OpunRS2_genomic.fna.gz
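
As an illustration of how the gzipped FASTA download might be inspected, the sketch below streams a file and reports the length and G+C count of each record using only the Python standard library. The function name `fasta_lengths_and_gc` is hypothetical, and the sketch assumes a standard multi-record FASTA layout.

```python
import gzip


def fasta_lengths_and_gc(fasta_path):
    """Yield (record_id, length, gc_count) for each record in a gzipped FASTA file."""
    record_id, length, gc_count = None, 0, 0
    with gzip.open(fasta_path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                # Emit the previous record before starting a new one.
                if record_id is not None:
                    yield record_id, length, gc_count
                header = line[1:].split()
                record_id = header[0] if header else ""
                length, gc_count = 0, 0
            else:
                sequence = line.upper()
                length += len(sequence)
                gc_count += sequence.count("G") + sequence.count("C")
    if record_id is not None:
        yield record_id, length, gc_count


# Example call (assumes the file has been downloaded to the working directory):
# for name, length, gc in fasta_lengths_and_gc("GCA_000573905.2_OpunRS2_genomic.fna.gz"):
#     print(name, length, round(100 * gc / length, 1))
```

Note that the percentage in the commented example divides by the full record length, so any N or other ambiguous bases count toward the denominator.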

Gene Predictions

The Oryza punctata OpunRS2 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Oryza_punctata.maker.gff.gz
CDS sequences (FASTA file) Oryza_punctata.coding.fasta.gz
Protein sequences (FASTA file) Oryza_punctata.protein.fasta.gz
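
A quick way to sanity-check the GFF3 download is to tally its feature types (gene, mRNA, exon, and so on). The sketch below does this with the standard library, assuming the file follows the nine-column, tab-separated GFF3 layout. The function name `count_feature_types` is hypothetical.

```python
import gzip
from collections import Counter


def count_feature_types(gff_path):
    """Count GFF3 features by type (column 3) in a gzipped GFF3 file."""
    counts = Counter()
    with gzip.open(gff_path, "rt") as handle:
        for line in handle:
            # An optional embedded sequence section ends the feature table.
            if line.startswith("##FASTA"):
                break
            # Skip comments, directives, and blank lines.
            if line.startswith("#") or not line.strip():
                continue
            fields = line.rstrip("\n").split("\t")
            if len(fields) < 9:
                continue  # not a well-formed feature line
            counts[fields[2]] += 1
    return counts


# Example call (assumes the file has been downloaded to the working directory):
# print(count_feature_types("Oryza_punctata.maker.gff.gz").most_common())
```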

Functional Analysis

Functional annotation for the Oryza punctata OpunRS2 assembly is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan Oryza_punctata.Pfam.tsv.gz

S genes

Summary

Query        Chromosome  Size (bp)  Coordinates                           tBLASTn Hit       tBLASTn %ID  Domain
DUF247I-SΨ   CM002492.2  32501981   6383914-6384396                       Olongistaminata   73           DUF247
DUF247II-SΨ  CM002492.2  32501981   6380474-6380620                       Olongistaminata   68           DUF247
HPS10-S      CM002492.2  32501981   6381576-6381657, 6381876-6382000      OcoarctataS1      60           -
DUF247I-ZΨ   CM002491.2  34321815   31603253-31603642                     AsativaDUF247I-Z  73           DUF247
DUF247II-Z   CM002491.2  34321815   31608630-31609940                     Ocoarctata        86           DUF247
HPS10-Z      CM002491.2  34321815   31605786-31605939, 31605990-31606135  Omeridionalis     43           -
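
The Coordinates column lists one or more comma-separated ranges per gene. The sketch below shows one way to parse such a string, total its length, and pull the matching bases out of a chromosome sequence. It assumes the coordinates are 1-based and inclusive, which the page does not state, and it ignores strand, which the table does not report. All function names are hypothetical.

```python
def parse_segments(coords):
    """Parse a string such as '100-200, 300-400' into a list of (start, end) tuples."""
    segments = []
    for part in coords.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start exceeds end in segment: {part.strip()}")
        segments.append((start, end))
    return segments


def total_length(segments):
    """Sum segment lengths, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


def extract(sequence, segments):
    """Concatenate the bases covered by each segment (1-based inclusive, forward strand)."""
    return "".join(sequence[start - 1:end] for start, end in segments)


# Coordinates of HPS10-S as listed in the table above.
hps10_s = parse_segments("6381576-6381657, 6381876-6382000")
print(total_length(hps10_s))
```

Under the inclusive assumption, the two HPS10-S segments are 82 bp and 125 bp long, 207 bp in total.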

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences