Analysis Name | Oropetium thomaeum V2 Assembly & Annotation |
Sequencing technology | Pacbio, Illumina, Hi-C |
Assembly method | Canu (V1.4) |
Release Date | 2018-11-15 |
VanBuren R, Wai CM, Keilwagen J, Pardo J. A chromosome-scale assembly of the model desiccation tolerant grass Oropetium thomaeum. Plant Direct. 2018 Nov 15;2(11):e00096. doi: 10.1002/pld3.96.
AbstractOropetium thomaeum is an emerging model for desiccation tolerance and genome size evolution in grasses. A draft genome of Oropetium was recently sequenced, but the lack of a chromosome-scale assembly has hindered comparative analyses and downstream functional genomics. Here, we reassembled Oropetium, and anchored the genome into 10 chromosomes using high-throughput chromatin conformation capture (Hi-C) based chromatin interactions. A combination of high-resolution RNAseq data and homology-based gene prediction identified thousands of new, conserved gene models that were absent from the V1 assembly. This includes thousands of new genes with high expression across a desiccation timecourse. Comparison between the Sorghum and Oropetium genomes revealed a surprising degree of chromosome-level collinearity, and several chromosome pairs have near perfect synteny. Other chromosomes are collinear in the gene rich chromosome arms but have experienced pericentric translocations. Together, these resources will be useful for the grass-comparative genomic community and further establish Oropetium as a model resurrection plant.
Assembly statistics
Number of contigs | 436 |
Contig N50 | 2.02 Mb |
Scaffold N50 | 20.5 Mb |
Total assembly size | 236 Mb |
Gene models | 28,835 |
BUSCO | 98.9% |
Assembly level | Chromosome |
The Oropetium thomaeum V2 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | Oropetium_thomaeum.faa.gz |
The Oropetium thomaeum V2 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | Oropetium_thomaeum.gff.gz |
CDS sequences (FASTA file) | Ot_cds.fa.gz |
Protein sequences (FASTA file) | Ot_pep.fa.gz |
Functional annotation for the Oropetium thomaeum V2 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).
Downloads
Domain from InterProScan | Oropetium_thomaeum.Pfam.tsv.gz |
Summary
Query | ? | Size(bp) | Coordinates | tBLASTn Hit | tBLASTn %ID | Domain |
DUF247II-SΨ | 2 | 31289883 | 13312260-13313252 | Shybrid | 59 | DUF247 |
HPS10-S | 2 | 31289883 | 13336971-13337044,13337440-13337542 | ShybridS1 | 72 | - |
DUF247I-ZΨ | 6 | 20568207 | 19269559-19270131 | Pvaginatum | 80 | DUF247 |
DUF247II-ZΨ | 6 | 20568207 | 19265373-19265732 | Ttriandra | 61 | DUF247 |
HPS10-Z | 6 | 20568207 | 19267731-19267840,19268031-19268136 | Lperrieri | 65 | - |
Nucleotide
Protein