Oryza glumaepatula OgluRS3 Assembly & Annotation

Overview

Analysis Name Oryza glumaepatula OgluRS3 Assembly & Annotation
Sequencing technology PacBio
Assembly method CANU v. 1.5; MECAT v. 1.3; FALCON v. 2017.06.28-18.01-py2.7-ucs4
Release Date 2021-03-10
Reference Publication(s)

Stein JC, Yu Y, Copetti D, Zwickl DJ, Zhang L, Zhang C, Chougule K, Gao D, Iwata A, Goicoechea JL, Wei S, Wang J, Liao Y, Wang M, Jacquemin J, Becker C, Kudrna D, Zhang J, Londono CEM, Song X, Lee S, Sanchez P, Zuccolo A, Ammiraju JSS, Talag J, Danowitz A, Rivera LF, Gschwend AR, Noutsos C, Wu CC, Kao SM, Zeng JW, Wei FJ, Zhao Q, Feng Q, El Baidouri M, Carpentier MC, Lasserre E, Cooke R, Rosa Farias DD, da Maia LC, Dos Santos RS, Nyberg KG, McNally KL, Mauleon R, Alexandrov N, Schmutz J, Flowers D, Fan C, Weigel D, Jena KK, Wicker T, Chen M, Han B, Henry R, Hsing YC, Kurata N, de Oliveira AC, Panaud O, Jackson SA, Machado CA, Sanderson MJ, Long M, Ware D, Wing RA. Genomes of 13 domesticated and wild rice relatives highlight genetic conservation, turnover and innovation across the genus Oryza. Nat Genet. 2018 Feb;50(2):285-296. doi: 10.1038/s41588-018-0040-0.

Abstract

The genus Oryza is a model system for the study of molecular evolution over time scales ranging from a few thousand to 15 million years. Using 13 reference genomes spanning the Oryza species tree, we show that despite few large-scale chromosomal rearrangements rapid species diversification is mirrored by lineage-specific emergence and turnover of many novel elements, including transposons, and potential new coding and noncoding genes. Our study resolves controversial areas of the Oryza phylogeny, showing a complex history of introgression among different chromosomes in the young ‘AA’ subclade containing the two domesticated species. This study highlights the prevalence of functionally coupled disease resistance genes and identifies many new haplotypes of potential use for future crop protection. Finally, this study marks a milestone in modern rice research with the release of a complete long-read assembly of IR 8 ‘Miracle Rice’, which relieved famine and drove the Green Revolution in Asia 50 years ago.

Assembly statistics

Genome size388.6 Mb
Total ungapped length388.6 Mb
Gaps between scaffolds21
Number of chromosomes12
Number of scaffolds68
Scaffold N5018.2 Mb
Scaffold L508
Number of contigs68
Contig N5018.2 Mb
Contig L508
GC percent44
Genome coverage115.0x
Assembly levelChromosome

Assembly

The Oryza glumaepatula OgluRS3 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_000576495.2_OgluRS3_genomic.fna.gz

Gene Predictions

The Oryza glumaepatula OgluRS3 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Oryza_glumaepatula.maker.gff.gz
CDS sequences (FASTA file) Oryza_glumaepatula.coding.fasta.gz
Protein sequences (FASTA file) Oryza_glumaepatula.protein.fasta.gz

Functional Analysis

Functional annotation for the Oryza glumaepatula OgluRS3 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Oryza_glumaepatula.Pfam.tsv.gz

S genes

Summary

QueryChromosomeSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
DUF247I-SΨCM002519.2302577045446624-5446773LpSDUF247-I_chromosome167DUF247
DUF247II-SΨCM002519.2302577045434216-5435142LpSDUF247-II_chromosome168DUF247
HPS10-SCM002519.2302577045437812-5437914,
5438017-5438153
LpsS_contig1102938-
DUF247I-ZΨCM002518.23431693031627791-31628144Olongistaminata62DUF247

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences