Oryza meridionalis OmerRS3 Assembly & Annotation

Overview

Analysis Name Oryza meridionalis OmerRS3 Assembly & Annotation
Sequencing technology PacBio
Assembly method CANU v. 1.5; MECAT v. 1.3; FALCON v. 2017.06.28-18.01-py2.7-ucs4
Release Date 2021-03-10
Reference Publication(s)

Stein JC, Yu Y, Copetti D, Zwickl DJ, Zhang L, Zhang C, Chougule K, Gao D, Iwata A, Goicoechea JL, Wei S, Wang J, Liao Y, Wang M, Jacquemin J, Becker C, Kudrna D, Zhang J, Londono CEM, Song X, Lee S, Sanchez P, Zuccolo A, Ammiraju JSS, Talag J, Danowitz A, Rivera LF, Gschwend AR, Noutsos C, Wu CC, Kao SM, Zeng JW, Wei FJ, Zhao Q, Feng Q, El Baidouri M, Carpentier MC, Lasserre E, Cooke R, Rosa Farias DD, da Maia LC, Dos Santos RS, Nyberg KG, McNally KL, Mauleon R, Alexandrov N, Schmutz J, Flowers D, Fan C, Weigel D, Jena KK, Wicker T, Chen M, Han B, Henry R, Hsing YC, Kurata N, de Oliveira AC, Panaud O, Jackson SA, Machado CA, Sanderson MJ, Long M, Ware D, Wing RA. Genomes of 13 domesticated and wild rice relatives highlight genetic conservation, turnover and innovation across the genus Oryza. Nat Genet. 2018 Feb;50(2):285-296. doi: 10.1038/s41588-018-0040-0.

Abstract

The genus Oryza is a model system for the study of molecular evolution over time scales ranging from a few thousand to 15 million years. Using 13 reference genomes spanning the Oryza species tree, we show that despite few large-scale chromosomal rearrangements rapid species diversification is mirrored by lineage-specific emergence and turnover of many novel elements, including transposons, and potential new coding and noncoding genes. Our study resolves controversial areas of the Oryza phylogeny, showing a complex history of introgression among different chromosomes in the young ‘AA’ subclade containing the two domesticated species. This study highlights the prevalence of functionally coupled disease resistance genes and identifies many new haplotypes of potential use for future crop protection. Finally, this study marks a milestone in modern rice research with the release of a complete long-read assembly of IR 8 ‘Miracle Rice’, which relieved famine and drove the Green Revolution in Asia 50 years ago.

Assembly statistics

Genome size393.6 Mb
Total ungapped length393.6 Mb
Gaps between scaffolds18
Number of chromosomes12
Number of scaffolds198
Scaffold N5016.1 Mb
Scaffold L5010
Number of contigs198
Contig N5016.1 Mb
Contig L5010
GC percent43.5
Genome coverage165.0x
Assembly levelChromosome

Assembly

The Oryza meridionalis OmerRS3 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_000338895.3_OmerRS3_genomic.fna.gz

Gene Predictions

The Oryza meridionalis OmerRS3 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Oryza_meridionalis.maker.gff.gz
CDS sequences (FASTA file) Oryza_meridionalis.coding.fasta.gz
Protein sequences (FASTA file) Oryza_meridionalis.protein.fasta.gz

Functional Analysis

Functional annotation for the Oryza meridionalis OmerRS3 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Oryza_meridionalis.Pfam.tsv.gz

S genes

Summary

QueryChromosomeSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
DUF247I-SΨCM002528.2300451315986893-5987822Olongistaminata69DUF247
DUF247II-SΨCM002528.2300451315976278-5976823TaestivumS164DUF247
HPS10-SCM002528.2300451315984790-5984952,
5985140-5985264
LpsS_chromosome143-
DUF247I-ZΨCM002527.23427279131769277-31770383Psupina55DUF247
DUF247II-ZΨCM002527.23427279131778314-31779480AsativaDUF247II-Z166DUF247
HPS10-ZCM002527.23427279131777207-31777360,
31777505-31777584
LpsZ_chromosome262-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences