Analysis Name | Saccharum officinarum x spontaneum R570 v2.1 Assembly & Annotation |
Sequencing technology | PacBio Sequel II; Illumina HiSeq-2500 |
Assembly method | HiFiAsm v. 1; RACON v. 1.4.10 |
Release Date | 2023-03-22 |
Healey AL, Garsmeur O, Lovell JT, Shengquiang S, Sreedasyam A, Jenkins J, Plott CB, Piperidis N, Pompidor N, Llaca V, Metcalfe CJ, Doležel J, Cápal P, Carlson JW, Hoarau JY, Hervouet C, Zini C, Dievart A, Lipzen A, Williams M, Boston LB, Webber J, Keymanesh K, Tejomurthula S, Rajasekar S, Suchecki R, Furtado A, May G, Parakkal P, Simmons BA, Barry K, Henry RJ, Grimwood J, Aitken KS, Schmutz J, D'Hont A. The complex polyploid genome architecture of sugarcane. Nature. 2024 Apr;628(8009):804-810. doi: 10.1038/s41586-024-07231-4.
Abstract: Sugarcane, the world’s most harvested crop by tonnage, has shaped global history, trade and geopolitics, and is currently responsible for 80% of sugar production worldwide [1]. While traditional sugarcane breeding methods have effectively generated cultivars adapted to new environments and pathogens, sugar yield improvements have recently plateaued [2]. The cessation of yield gains may be due to limited genetic diversity within breeding populations, long breeding cycles and the complexity of its genome, the latter preventing breeders from taking advantage of the recent explosion of whole-genome sequencing that has benefited many other crops. Thus, modern sugarcane hybrids are the last remaining major crop without a reference-quality genome. Here we take a major step towards advancing sugarcane biotechnology by generating a polyploid reference genome for R570, a typical modern cultivar derived from interspecific hybridization between the domesticated species (Saccharum officinarum) and the wild species (Saccharum spontaneum). In contrast to the existing single haplotype (‘monoploid’) representation of R570, our 8.7 billion base assembly contains a complete representation of unique DNA sequences across the approximately 12 chromosome copies in this polyploid genome. Using this highly contiguous genome assembly, we filled a previously unsized gap within an R570 physical genetic map to describe the likely causal genes underlying the single-copy Bru1 brown rust resistance locus. This polyploid genome assembly with fine-grain descriptions of genome architecture and molecular targets for biotechnology will help accelerate molecular and transgenic breeding and adaptation of sugarcane to future environmental conditions.
Assembly statistics
Assembly Source: | JGI |
Assembly Version: | v2.0 |
Annotation Source: | JGI |
Annotation Version: | v2.1 |
Total Scaffold Length (bp): | 5,046,770,891 |
Number of Scaffolds: | 144 |
Min. Number of Scaffolds containing half of assembly (L50): | 28 |
Shortest Scaffold from L50 set (N50): | 79,221,035 |
Total Contig Length (bp): | 5,042,101,904 |
Number of Contigs: | 842 |
Min. Number of Contigs containing half of assembly (L50): | 99 |
Shortest Contig from L50 set (N50): | 15,340,496 |
Number of Protein-coding Transcripts: | 299,731 |
Number of Protein-coding Genes: | 194,593 |
Percentage of Eukaryote BUSCO Genes: | 98.7 |
Percentage of Embryophyte BUSCO Genes: | 99.8 |
Assembly level: | Chromosome |
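The L50 and N50 rows above follow the usual definitions: sort the sequences from longest to shortest, accumulate lengths until at least half of the total is reached, then report how many sequences were needed (L50) and the length of the last one added (N50). The sketch below is a minimal illustration of that calculation, not the pipeline JGI used; the function name `n50_l50` is hypothetical and the lengths in the usage line are toy values.

```python
from typing import Iterable, Tuple


def n50_l50(lengths: Iterable[int]) -> Tuple[int, int]:
    """Return (N50, L50) for a collection of scaffold or contig lengths."""
    ordered = sorted(lengths, reverse=True)  # longest sequence first
    if not ordered:
        raise ValueError("no sequence lengths given")
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            # 'length' is the shortest sequence in the set covering half
            # of the assembly (N50); 'count' is the size of that set (L50).
            return length, count
    raise AssertionError("unreachable for non-empty input")


# Toy example: total = 30, half = 15; 10 + 8 = 18 reaches it.
print(n50_l50([10, 8, 6, 4, 2]))  # (8, 2)
```

Applied to the scaffold lengths of the released FASTA file, this calculation would be expected to reproduce the N50 and L50 values in the table, assuming the table was computed with the same definitions.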
The Saccharum officinarum x spontaneum R570 v2.1 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | SofficinarumxspontaneumR570_771_v2.0.fa.gz |
The Saccharum officinarum x spontaneum R570 v2.1 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | SofficinarumxspontaneumR570_771_v2.1.gene.gff3.gz |
CDS sequences (FASTA file) | SofficinarumxspontaneumR570_771_v2.1.cds.fa.gz |
Protein sequences (FASTA file) | SofficinarumxspontaneumR570_771_v2.1.protein.fa.gz |
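The FASTA downloads listed above are gzip-compressed and can be read without unpacking them first. The following is a minimal sketch using only the Python standard library; the helper name `fasta_lengths` is hypothetical, and it assumes ordinary FASTA layout (a `>` header line followed by one or more sequence lines).

```python
import gzip
from typing import Dict


def fasta_lengths(path: str) -> Dict[str, int]:
    """Map each record name in a gzipped FASTA file to its sequence length."""
    lengths: Dict[str, int] = {}
    name = None
    with gzip.open(path, "rt") as handle:  # "rt" decompresses to text
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                # Use the first whitespace-delimited token as the record name.
                parts = line[1:].split()
                name = parts[0] if parts else ""
                lengths[name] = 0
            elif name is not None:
                lengths[name] += len(line)
    return lengths
```

For example, `fasta_lengths("SofficinarumxspontaneumR570_771_v2.1.protein.fa.gz")` would return one entry per protein record, assuming the file has been downloaded to the working directory.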
Functional annotation for the Saccharum officinarum x spontaneum R570 v2.1 annotation is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains (Pfam).
Downloads
Domain from InterProScan | Saccharum_x_sp.Pfam.tsv.gz |
Summary
Query | Chromosome | Size(bp) | Coordinates | tBLASTn Hit | tBLASTn %ID | Domain |
DUF247I-S1 | Chr4A | 82147227 | 45451459-45453150 | Pnotatum | 68 | DUF247 |
DUF247I-S2 | Chr4B | 67335593 | 35359872-35361542 | Pnotatum | 72 | DUF247 |
DUF247I-S3Ψ | Chr4B | 67335593 | 35352426-35353532 | Pnotatum | 76 | DUF247 |
DUF247I-S4 | Chr4C | 66740618 | 34973006-34974700 | Pnotatum | 62 | DUF247 |
DUF247I-S5 | Chr4E | 51224956 | 17425601-17427268 | Pnotatum | 64 | DUF247 |
DUF247I-S6 | Chr4F | 50041630 | 18076523-18078217 | Pnotatum | 62 | DUF247 |
DUF247II-S1 | Chr4A | 82147227 | 45118168-45119784 | Pnotatum | 64 | DUF247 |
DUF247II-S2 | Chr4B | 67335593 | 35646182-35647807 | Pnotatum | 71 | DUF247 |
DUF247II-S3 | Chr4C | 66740618 | 34944043-34945665 | Pnotatum | 65 | DUF247 |
DUF247II-S4 | Chr4D | 56614552 | 37553438-37555051 | Pnotatum | 60 | DUF247 |
DUF247II-S5 | Chr4E | 51224956 | 17162933-17164552 | Pnotatum | 61 | DUF247 |
DUF247II-S6 | Chr4F | 50041630 | 16872511-16874133 | Pnotatum | 65 | DUF247 |
HPS10-S1 | Chr4A | 82147227 | 46096791-46096953,46097063-46097160 | Pvaginatum | 62 | - |
HPS10-S2 | Chr4B | 67335593 | 35356085-35356194,35356279-35356444 | Pvaginatum | 47 | - |
HPS10-S3 | Chr4B | 67335593 | 35348653-35348762,35348847-35349012 | Pvaginatum | 47 | - |
HPS10-S4 | Chr4C | 66740618 | 34988926-34989044,34989165-34989303 | Pvaginatum | 51 | - |
HPS10-S5 | Chr4E | 51224956 | 17723108-17723220,17723323-17723464 | Pvaginatum | 50 | - |
HPS10-S6 | Chr4F | 50041630 | 18096861-18096979,18097100-18097268 | Pvaginatum | 51 | - |
DUF247I-Z1Ψ | Chr7A | 71344650 | 63428166-63428759 | Pnotatum | 60 | DUF247 |
DUF247I-Z2 | Chr7B | 68479205 | 61210945-61212525 | Pnotatum | 58 | DUF247 |
DUF247I-Z3 | Chr7C | 63301622 | 56642183-56643748 | Pnotatum | 61 | DUF247 |
DUF247I-Z4 | Chr7D | 62522272 | 55330487-55332142 | Pnotatum | 53 | DUF247 |
DUF247I-Z5Ψ | Chr7E | 57896127 | 50081317-50082054 | Pvaginatum | 71 | DUF247 |
DUF247I-Z6 | Chr7_10A | 92477847 | 86187493-86189121 | Pvaginatum | 47 | DUF247 |
DUF247I-Z7 | Chr7os1 | 28450685 | 24061534-24063138 | Pnotatum | 59 | DUF247 |
DUF247II-Z1Ψ | Chr7A | 71344650 | 63465565-63465750 | Trufipilum | 53 | DUF247 |
DUF247II-Z2 | Chr7B | 68479205 | 61216457-61218127 | Pnotatum | 49 | DUF247 |
DUF247II-Z3 | Chr7C | 63301622 | 56674475-56676148 | Pnotatum | 51 | DUF247 |
DUF247II-Z4 | Chr7D | 62522272 | 55326178-55327854 | Pnotatum | 50 | DUF247 |
DUF247II-Z5 | Chr7_10A | 92477847 | 86233541-86235211 | Pnotatum | 47 | DUF247 |
DUF247II-Z6 | Chr7os1 | 28450685 | 24109889-24111520 | Pnotatum | 48 | DUF247 |
HPS10-Z1 | Chr7B | 68479205 | 61214485-61214656,61214963-61215117 | Pvaginatum | 52 | - |
HPS10-Z2 | Chr7C | 63301622 | 56646243-56646334,56646512-56646650 | Olongistaminata | 75 | - |
HPS10-Z3 | Chr7D | 62522272 | 55328126-55328211,55328481-55328667 | Ocoarctata | 53 | - |
HPS10-Z4 | Chr7E | 57896127 | 50083900-50083994,50084317-50084455 | Orufipogon | 46 | - |
HPS10-Z5 | Chr7_10A | 92477847 | 86232194-86232356,86232455-86232564 | Pnotatum | 61 | - |
HPS10-Z6 | Chr7os1 | 28450685 | 24100177-24100348,24100476-24100582 | Pvaginatum | 60 | - |
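The Coordinates column in the table above holds either a single `start-end` range or several comma-separated ranges (as in the HPS10 rows). The sketch below shows one way to parse such a field and sum the covered bases. The function names are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the table itself does not state.

```python
from typing import List, Tuple


def parse_coordinates(field: str) -> List[Tuple[int, int]]:
    """Parse 'start-end' or 'start-end,start-end' into (start, end) tuples."""
    segments = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start exceeds end in segment: {part!r}")
        segments.append((start, end))
    return segments


def total_length(segments: List[Tuple[int, int]]) -> int:
    """Sum segment lengths, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


# DUF247I-S1 on Chr4A (single range)
print(total_length(parse_coordinates("45451459-45453150")))  # 1692
# HPS10-S1 on Chr4A (two ranges)
print(total_length(parse_coordinates("46096791-46096953,46097063-46097160")))  # 261
```

If the coordinates were instead 0-based and half-open, the `+ 1` in `total_length` would need to be dropped.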