|
Accurate and Fast Cell Marker Gene Identification with COSG
Min Dai, Xiaobing Pei, Xiu-Jie Wang
Briefings in Bioinformatic
Abstract
Accurate cell classification is the groundwork for downstream analysis of single-cell sequencing data, yet how to identify true marker genes for different cell types still remains a big challenge. Here, we report COSine similarity-based marker Gene identification (COSG) as a cosine similarity-based method for more accurate and scalable marker gene identification. COSG is applicable to single-cell RNA sequencing data, single-cell ATAC sequencing data and spatially resolved transcriptome data. COSG is fast and scalable for ultra-large datasets of million-scale cells. Application on both simulated and real experimental datasets showed that the marker genes or genomic regions identified by COSG have greater cell-type specificity, demonstrating the superior performance of COSG in terms of both accuracy and efficiency as compared with other available methods.
|
论文编号: |
DOI:10.1093/bib/bbab579 |
论文题目: |
Accurate and Fast Cell Marker Gene Identification with COSG |
英文论文题目: |
Accurate and Fast Cell Marker Gene Identification with COSG |
第一作者: |
Min Dai, Xiaobing Pei, Xiu-Jie Wang |
英文第一作者: |
Min Dai, Xiaobing Pei, Xiu-Jie Wang |
联系作者: |
|
英文联系作者: |
|
外单位作者单位: |
|
英文外单位作者单位: |
|
发表年度: |
2022-01-25 |
卷: |
|
期: |
|
页码: |
|
摘要: |
Accurate cell classification is the groundwork for downstream analysis of single-cell sequencing data, yet how to identify true marker genes for different cell types still remains a big challenge. Here, we report COSine similarity-based marker Gene identification (COSG) as a cosine similarity-based method for more accurate and scalable marker gene identification. COSG is applicable to single-cell RNA sequencing data, single-cell ATAC sequencing data and spatially resolved transcriptome data. COSG is fast and scalable for ultra-large datasets of million-scale cells. Application on both simulated and real experimental datasets showed that the marker genes or genomic regions identified by COSG have greater cell-type specificity, demonstrating the superior performance of COSG in terms of both accuracy and efficiency as compared with other available methods. |
英文摘要: |
Accurate cell classification is the groundwork for downstream analysis of single-cell sequencing data, yet how to identify true marker genes for different cell types still remains a big challenge. Here, we report COSine similarity-based marker Gene identification (COSG) as a cosine similarity-based method for more accurate and scalable marker gene identification. COSG is applicable to single-cell RNA sequencing data, single-cell ATAC sequencing data and spatially resolved transcriptome data. COSG is fast and scalable for ultra-large datasets of million-scale cells. Application on both simulated and real experimental datasets showed that the marker genes or genomic regions identified by COSG have greater cell-type specificity, demonstrating the superior performance of COSG in terms of both accuracy and efficiency as compared with other available methods. |
刊物名称: |
Briefings in Bioinformatic |
英文刊物名称: |
Briefings in Bioinformatic |
论文全文: |
|
英文论文全文: |
|
全文链接: |
|
其它备注: |
Min Dai, Xiaobing Pei, Xiu-Jie Wang. Accurate and Fast Cell Marker Gene Identification with COSG. Briefings in Bioinformatic. DOI:10.1093/bib/bbab579 |
英文其它备注: |
|
学科: |
|
英文学科: |
|
影响因子: |
|
第一作者所在部门: |
|
英文第一作者所在部门: |
|
论文出处: |
|
英文论文出处: |
|
论文类别: |
|
英文论文类别: |
|
参与作者: |
|
英文参与作者: |
|
|