Lolium rigidum APGP_CSIRO_Lrig_0.1 Assembly & Annotation

Overview

Analysis Name Lolium rigidum APGP_CSIRO_Lrig_0.1 Assembly & Annotation
Sequencing technology Oxford Nanopore PromethION; Oxford Nanopore MinION
Assembly method Flye v. 2.9; PurgeDups v. 1.2.5; Racon v. 1.4.22; Polca v. 4.0.7; ALLHiC v. 1
Release Date 2022-03-14
Reference Publication(s)

Paril J, Pandey G, Barnett EM, Rane RV, Court L, Walsh T, Fournier-Level A. Rounding up the annual ryegrass genome: High-quality reference genome of Lolium rigidum. Front Genet. 2022 Nov 1;13:1012694. doi: 10.3389/fgene.2022.1012694.

Abstract

The genome of the major agricultural weed species, annual ryegrass (Lolium rigidum) was assembled, annotated and analysed. Annual ryegrass is a major weed in grain cropping, and has the remarkable capacity to evolve resistance to herbicides with various modes of action. The chromosome-level assembly was achieved using short- and long-read sequencing in combination with Hi-C mapping. The assembly size is 2.44 Gb with N50 = 361.79 Mb across 1,764 scaffolds where the seven longest sequences correspond to the seven chromosomes. Genome completeness assessed through BUSCO returned a 99.8% score for complete (unique and duplicated) and fragmented genes using the Viridiplantae set. We found evidence for the expansion of herbicide resistance-related gene families including detoxification genes. The reference genome of L. rigidum is a critical asset for leveraging genetic information for the management of this highly problematic weed species.

Assembly statistics

Genome size 2.4 Gb
Number of chromosomes 7
Number of scaffolds 1,764
Scaffold N50 361.8 Mb
Scaffold L50 4
Number of contigs 33,683
Contig N50 113.7 kb
Contig L50 6,160
Assembly level Chromosome

Assembly

The Lolium rigidum APGP_CSIRO_Lrig_0.1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCF_022539505.1_APGP_CSIRO_Lrig_0.1_genomic.fna.gz

Gene Predictions

The Lolium rigidum APGP_CSIRO_Lrig_0.1 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GCF_022539505.1_APGP_CSIRO_Lrig_0.1_genomic.gff.gz
CDS sequences (FASTA file) GCF_022539505.1_APGP_CSIRO_Lrig_0.1_cds_from_genomic.fna.gz
Protein sequences (FASTA file) GCF_022539505.1_APGP_CSIRO_Lrig_0.1_protein.faa.gz

Functional Analysis

Functional annotation for the Lolium rigidum APGP_CSIRO_Lrig_0.1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Lolium_rigidum_APGP_CSIRO_Lrig_0.1.Pfam.tsv.gz

S genes

Summary

QueryChromosomeSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
DUF247I-SChromosome 428309249916399366-16400958LpSDUF247-I_chromosome180DUF247
DUF247II-SChromosome 428309249916322133-16323692LpSDUF247-II_chromosome172DUF247
HPS10-SChromosome 428309249916263416-16263549,
16263681-16263834
LpsS_chromosome152-
DUF247I-SChromosome 428309249968194051-68195664LpSDUF247-I_chromosome181DUF247
DUF247II-SChromosome 428309249968642908-68644527LpSDUF247-II_chromosome176DUF247
HPS10-SChromosome 428309249968197062-68197198,
68197292-68197427
LpsS_chromosome145-
DUF247I-ZChromosome 736245227220343269-20344894LpZDUF247-I_chromosome258DUF247
DUF247II-ZChromosome 736245227220380880-20382550LpZDUF247-II_chromosome248DUF247
HPS10-ZChromosome 736245227220345720-20345906,
20346031-20346113
LpsZ_contig5509739-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences