Authors: | Zijian Wang, Lingfeng Miao, Kaiwen Tan, Weilong Guo, Beibei Xin, Rudi Appels, Jizeng Jia, Jinsheng Lai, Fei Lu, Zhongfu Ni, Xiangdong Fu, Qixin Sun, Jian Chen |
---|---|
Journal: | Molecular Plant |
DOI: | |
Corresponding author: | |
Corresponding author (English): | |
Publication date: | 2025-02-18 |
Volume: | |
Abstract: | A complete reference genome is crucial for biology research and genetic improvement. Owing to its large size and highly repetitive nature, there are numerous gaps in the globally used wheat Chinese Spring (CS) genome. Here, we generated a 14.46 Gb near-complete assembly of the CS genome, with a contig N50 over 266 Mb and an overall base accuracy of 99.9963%. Among the 290 gaps that remained (26, 257 and 7 gaps from the A, B and D subgenomes, respectively), 278 gaps were extremely high-copy tandem repeats, whereas the remaining 12 were TE-associated gaps. Four chromosomes were completely gap-free: chr1D, chr3D, chr4D and chr5D. Extensive annotation of the near-complete genome revealed 151,405 high-confidence genes, of which 59,180 were newly annotated, including 7,602 newly assembled genes. Except for the centromere of chr1B, which has a gap associated with superlong GAA repeat arrays, the centromeric sequences of the remaining 20 chromosomes were completely assembled. Our near-complete assembly revealed that the extent of tandem repeats, such as SSRs, was highly uneven among different subgenomes. Similarly, the repeat compositions of the centromeres also varied among the three subgenomes. With the genome sequences of all six types of seed storage proteins (SSPs) fully assembled, the expression of ω-gliadin was found to be contributed entirely by the B subgenome, whereas the expression of the other five types of SSPs was most abundant from the D subgenome. The near-complete CS genome will serve as a valuable resource for the research and breeding of wheat as well as its related species. Keywords: wheat genome, Chinese Spring, near-complete assembly, seed storage proteins, tandem repeats, centromeres |