Analysis Name | Oryza longistaminata w11 Assembly & Annotation |
Sequencing technology | Illumina HiSeq; PacBio RSII |
Assembly method | AllPaths v. Release 37250; PBjelly v. 2 |
Release Date | 2019-12-31 |
Li W, Li K, Zhang QJ, Zhu T, Zhang Y, Shi C, Liu YL, Xia EH, Jiang JJ, Shi C, Zhang LP, Huang H, Tong Y, Liu Y, Zhang D, Zhao Y, Jiang WK, Zhao YJ, Mao SY, Jiao JY, Xu PZ, Yang LL, Yin GY, Gao LZ. Improved hybrid de novo genome assembly and annotation of African wild rice, Oryza longistaminata, from Illumina and PacBio sequencing reads. Plant Genome. 2020 Mar;13(1):e20001. doi: 10.1002/tpg2.20001.
AbstractAfrican wild rice Oryza longistaminata, one of the eight AA- genome species in the genus Oryza, possesses highly valued traits, such as the rhizomatousness for perennial rice breeding, strong tolerance to biotic and abiotic stresses, and high biomass production on poor soils. To obtain the high-quality reference genome for O. longistaminata we employed a hybrid assembly approach through incorporating Illumina and PacBio sequencing datasets. The final genome assembly comprised only 107 scaffolds and was approximately ∼363.5 Mb, representing ∼92.7% of the estimated African wild rice genome (∼392 Mb). The N50 lengths of the assembled contigs and scaffolds were ∼46.49 Kb and ∼6.83 Mb, indicating ∼3.72-fold and ∼18.8-fold improvement in length compared to the earlier released assembly (∼12.5 Kb and 364 Kb, respectively). Aided with Hi-C data and syntenic relationship with O. sativa, these assembled scaffolds were anchored into 12 pseudo-chromosomes. Genome annotation and comparative genomic analysis reveal that lineage-specific expansion of gene families that respond to biotic- and abiotic stresses are of great potential for mining novel alleles to overcome major diseases and abiotic adaptation in rice breeding programs. This reference genome of African wild rice will greatly enlarge the existing database of rice genome resources and unquestionably form a solid base to understand genomic basis underlying highly valued phenotypic traits and search for novel gene sources in O. longistaminata for the future rice breeding programs.
Assembly statistics
Genome size | 371.3 Mb |
Total ungapped length | 362.8 Mb |
Number of chromosomes | 12 |
Number of scaffolds | 1,009 |
Scaffold N50 | 33.9 Mb |
Scaffold L50 | 5 |
Number of contigs | 14,614 |
Contig N50 | 45.3 kb |
Contig L50 | 2,381 |
GC percent | 42 |
Genome coverage | 442.0x |
Assembly level | Chromosome |
The Oryza longistaminata w11 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | GCA_009805545.1_ASM980554v1_genomic.fna.gz |
The Oryza longistaminata w11 genome gene prediction files are not available.
Downloads
Genes (GFF3 file) | - |
CDS sequences (FASTA file) | - |
Protein sequences (FASTA file) | - |
Functional annotation for the Oryza longistaminata w11 is not available.
Downloads
Domain from InterProScan | - |
Nucleotide
Protein