期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

G2S3: A gene graph-based imputation method for single-cell RNA sequencing data

Weimiao Wu Yunqing Liu Qile Dai Xiting Yan Zuoheng Wang 《PLoS computational biology》2021,17(5)

相似文献

2.

scIMC: a platform for benchmarking comparison and visualization analysis of scRNA-seq data imputation methods

Chichi Dai Yi Jiang Chenglin Yin Ran Su Xiangxiang Zeng Quan Zou Kenta Nakai Leyi Wei 《Nucleic acids research》2022,50(9):4877

相似文献

3.

SwarnSeq: An improved statistical approach for differential expression analysis of single-cell RNA-seq data

《Genomics》2021,113(3):1308-1324

Single-cell RNA sequencing (scRNA-seq) is a powerful technology that is capable of generating gene expression data at the resolution of individual cell. The scRNA-seq data is characterized by the presence of dropout events, which severely bias the results if they remain unaddressed. There are limited Differential Expression (DE) approaches which consider the biological processes, which lead to dropout events, in the modeling process. So, we develop, SwarnSeq, an improved method for DE, and other downstream analysis that considers the molecular capture process in scRNA-seq data modeling. The performance of the proposed method is benchmarked with 11 existing methods on 10 different real scRNA-seq datasets under three comparison settings. We demonstrate that SwarnSeq method has improved performance over the 11 existing methods. This improvement is consistently observed across several public scRNA-seq datasets generated using different scRNA-seq protocols. The external spike-ins data can be used in the SwarnSeq method to enhance its performance.Availability and implementationThe method is implemented as a publicly available R package available at https://github.com/sam-uofl/SwarnSeq. 相似文献

4.

An Overview of Algorithms and Associated Applications for Single Cell RNA-Seq Data Imputation

Zarrin Basharat Sania Majeed Humaira Saleem Ishtiaq Ahmad Khan Azra Yasmin 《Current Genomics》2021,22(5):319

相似文献

5.

Non-linear archetypal analysis of single-cell RNA-seq data by deep autoencoders

Yuge Wang Hongyu Zhao 《PLoS computational biology》2022,18(4)

Advances in single-cell RNA sequencing (scRNA-seq) have led to successes in discovering novel cell types and understanding cellular heterogeneity among complex cell populations through cluster analysis. However, cluster analysis is not able to reveal continuous spectrum of states and underlying gene expression programs (GEPs) shared across cell types. We introduce scAAnet, an autoencoder for single-cell non-linear archetypal analysis, to identify GEPs and infer the relative activity of each GEP across cells. We use a count distribution-based loss term to account for the sparsity and overdispersion of the raw count data and add an archetypal constraint to the loss function of scAAnet. We first show that scAAnet outperforms existing methods for archetypal analysis across different metrics through simulations. We then demonstrate the ability of scAAnet to extract biologically meaningful GEPs using publicly available scRNA-seq datasets including a pancreatic islet dataset, a lung idiopathic pulmonary fibrosis dataset and a prefrontal cortex dataset. 相似文献

6.

Independent component analysis based gene co-expression network inference (ICAnet) to decipher functional modules for better single-cell clustering and batch integration

Weixu Wang Huanhuan Tan Mingwan Sun Yiqing Han Wei Chen Shengnu Qiu Ke Zheng Gang Wei Ting Ni 《Nucleic acids research》2021,49(9):e54

With the tremendous increase of publicly available single-cell RNA-sequencing (scRNA-seq) datasets, bioinformatics methods based on gene co-expression network are becoming efficient tools for analyzing scRNA-seq data, improving cell type prediction accuracy and in turn facilitating biological discovery. However, the current methods are mainly based on overall co-expression correlation and overlook co-expression that exists in only a subset of cells, thus fail to discover certain rare cell types and sensitive to batch effect. Here, we developed independent component analysis-based gene co-expression network inference (ICAnet) that decomposed scRNA-seq data into a series of independent gene expression components and inferred co-expression modules, which improved cell clustering and rare cell-type discovery. ICAnet showed efficient performance for cell clustering and batch integration using scRNA-seq datasets spanning multiple cells/tissues/donors/library types. It works stably on datasets produced by different library construction strategies and with different sequencing depths and cell numbers. We demonstrated the capability of ICAnet to discover rare cell types in multiple independent scRNA-seq datasets from different sources. Importantly, the identified modules activated in acute myeloid leukemia scRNA-seq datasets have the potential to serve as new diagnostic markers. Thus, ICAnet is a competitive tool for cell clustering and biological interpretations of single-cell RNA-seq data analysis. 相似文献

7.

Regulatory network-based imputation of dropouts in single-cell RNA sequencing data

Ana Carolina Leote Xiaohui Wu Andreas Beyer 《PLoS computational biology》2022,18(2)

相似文献

8.

Estrogen regulates divergent transcriptional and epigenetic cell states in breast cancer

Aysegul Ors Alex Daniel Chitsazan Aaron Reid Doe Ryan M Mulqueen Cigdem Ak Yahong Wen Syber Haverlack Mithila Handu Spandana Naldiga Joshua C Saldivar Hisham Mohammed 《Nucleic acids research》2022,50(20):11492

相似文献

9.

Approximate distance correlation for selecting highly interrelated genes across datasets

Qunlun Shen Shihua Zhang 《PLoS computational biology》2021,17(11)

With the rapid accumulation of biological omics datasets, decoding the underlying relationships of cross-dataset genes becomes an important issue. Previous studies have attempted to identify differentially expressed genes across datasets. However, it is hard for them to detect interrelated ones. Moreover, existing correlation-based algorithms can only measure the relationship between genes within a single dataset or two multi-modal datasets from the same samples. It is still unclear how to quantify the strength of association of the same gene across two biological datasets with different samples. To this end, we propose Approximate Distance Correlation (ADC) to select interrelated genes with statistical significance across two different biological datasets. ADC first obtains the k most correlated genes for each target gene as its approximate observations, and then calculates the distance correlation (DC) for the target gene across two datasets. ADC repeats this process for all genes and then performs the Benjamini-Hochberg adjustment to control the false discovery rate. We demonstrate the effectiveness of ADC with simulation data and four real applications to select highly interrelated genes across two datasets. These four applications including 21 cancer RNA-seq datasets of different tissues; six single-cell RNA-seq (scRNA-seq) datasets of mouse hematopoietic cells across six different cell types along the hematopoietic cell lineage; five scRNA-seq datasets of pancreatic islet cells across five different technologies; coupled single-cell ATAC-seq (scATAC-seq) and scRNA-seq data of peripheral blood mononuclear cells (PBMC). Extensive results demonstrate that ADC is a powerful tool to uncover interrelated genes with strong biological implications and is scalable to large-scale datasets. Moreover, the number of such genes can serve as a metric to measure the similarity between two datasets, which could characterize the relative difference of diverse cell types and technologies. 相似文献

10.

SSRE: Cell Type Detection Based on Sparse Subspace Representation and Similarity Enhancement

Zhenlan Liang Min Li Ruiqing Zheng Yu Tian Xuhua Yan Jin Chen Fang-Xiang Wu Jianxin Wang 《基因组蛋白质组与生物信息学报(英文版)》2021,19(2):282-291

Accurate identification of cell types from single-cell RNA sequencing(scRNA-seq) data plays a critical role in a variety of scRNA-seq analysis studies. This task corresponds to solving an unsupervised clustering problem, in which the similarity measurement between cells affects the result significantly. Although many approaches for cell type identification have been proposed,the accuracy still needs to be improved. In this study, we proposed a novel single-cell clustering framework based on similarity learning, called SSRE. SSRE models the relationships between cells based on subspace assumption, and generates a sparse representation of the cell-to-cell similarity.The sparse representation retains the most similar neighbors for each cell. Besides, three classical pairwise similarities are incorporated with a gene selection and enhancement strategy to further improve the effectiveness of SSRE. Tested on ten real scRNA-seq datasets and five simulated datasets, SSRE achieved the superior performance in most cases compared to several state-of-the-art single-cell clustering methods. In addition, SSRE can be extended to visualization of scRNA-seq data and identification of differentially expressed genes. The matlab and python implementations of SSRE are available at https://github.com/CSUBioGroup/SSRE. 相似文献

11.

scGET: Predicting Cell Fate Transition During Early Embryonic Development by Single-cell Graph Entropy

Jiayuan Zhong Chongyin Han Xuhang Zhang Pei Chen Rui Liu 《基因组蛋白质组与生物信息学报(英文版)》2021,19(3):461-474

During early embryonic development, cell fate commitment represents a critical transition or"tipping point"of embryonic differentiation, at which there is a drastic and qualitative shift of the cell populations. In this study, we presented a computational approach, scGET, to explore the gene–gene associations based on single-cell RNA sequencing (scRNA-seq) data for critical transition prediction. Specifically, by transforming the gene expression data to the local network entropy, the single-cell graph entropy (SGE) value quantitatively characterizes the stability and criticality of gene regu-latory networks among cell populations and thus can be employed to detect the critical signal of cell fate or lineage commitment at the single-cell level. Being applied to five scRNA-seq datasets of embryonic differentiation, scGET accurately predicts all the impending cell fate transitions. After identifying the"dark genes"that are non-differentially expressed genes but sensitive to the SGE value, the underlying signaling mechanisms were revealed, suggesting that the synergy of dark genes and their downstream targets may play a key role in various cell development processes. The application in all five datasets demonstrates the effectiveness of scGET in analyzing scRNA-seq data from a network perspective and its potential to track the dynamics of cell differentiation. The source code of scGET is accessible at https://github.com/zhongjiayuna/scGET_Project. 相似文献

12.

CellDepot: A Unified Repository for scRNA-seq Data and Visual Exploration

《Journal of molecular biology》2022,434(11):167425

CellDepot containing over 270 datasets from 8 species and many tissues serves as an integrated web application to empower scientists in exploring single-cell RNA-seq (scRNA-seq) datasets and comparing the datasets among various studies through a user-friendly interface with advanced visualization and analytical capabilities. To begin with, it provides an efficient data management system that users can upload single cell datasets and query the database by multiple attributes such as species and cell types. In addition, the graphical multi-logic, multi-condition query builder and convenient filtering tool backed by MySQL database system, allows users to quickly find the datasets of interest and compare the expression of gene(s) across these. Moreover, by embedding the cellxgene VIP tool, CellDepot enables fast exploration of individual dataset in the manner of interactivity and scalability to gain more refined insights such as cell composition, gene expression profiles, and differentially expressed genes among cell types by leveraging more than 20 frequently applied plotting functions and high-level analysis methods in single cell research. In summary, the web portal available at http://celldepot.bxgenomics.com, prompts large scale single cell data sharing, facilitates meta-analysis and visualization, and encourages scientists to contribute to the single-cell community in a tractable and collaborative way. Finally, CellDepot is released as open-source software under MIT license to motivate crowd contribution, broad adoption, and local deployment for private datasets. 相似文献

13.

Cell function and identity revealed by comparative scRNA-seq analysis in human nasal,bronchial and epididymis epithelia

《European journal of cell biology》2022,101(3):151231

相似文献

14.

Joint gene network construction by single-cell RNA sequencing data

Meichen Dong Yiping He Yuchao Jiang Fei Zou 《Biometrics》2023,79(2):915-925

相似文献

15.

scDeepSort: a pre-trained cell-type annotation method for single-cell transcriptomics using deep learning with a weighted graph neural network

Xin Shao Haihong Yang Xiang Zhuang Jie Liao Penghui Yang Junyun Cheng Xiaoyan Lu Huajun Chen Xiaohui Fan 《Nucleic acids research》2021,49(21):e122

相似文献

16.

TSEE: an elastic embedding method to visualize the dynamic gene expression patterns of time series single-cell RNA sequencing data

An Shaokun Ma Liang Wan Lin 《BMC genomics》2019,20(2):77-92

Background

Time series single-cell RNA sequencing (scRNA-seq) data are emerging. However, the analysis of time series scRNA-seq data could be compromised by 1) distortion created by assorted sources of data collection and generation across time samples and 2) inheritance of cell-to-cell variations by stochastic dynamic patterns of gene expression. This calls for the development of an algorithm able to visualize time series scRNA-seq data in order to reveal latent structures and uncover dynamic transition processes.

Results

In this study, we propose an algorithm, termed time series elastic embedding (TSEE), by incorporating experimental temporal information into the elastic embedding (EE) method, in order to visualize time series scRNA-seq data. TSEE extends the EE algorithm by penalizing the proximal placement of latent points that correspond to data points otherwise separated by experimental time intervals. TSEE is herein used to visualize time series scRNA-seq datasets of embryonic developmental processed in human and zebrafish. We demonstrate that TSEE outperforms existing methods (e.g. PCA, tSNE and EE) in preserving local and global structures as well as enhancing the temporal resolution of samples. Meanwhile, TSEE reveals the dynamic oscillation patterns of gene expression waves during zebrafish embryogenesis.

Conclusions

TSEE can efficiently visualize time series scRNA-seq data by diluting the distortions of assorted sources of data variation across time stages and achieve the temporal resolution enhancement by preserving temporal order and structure. TSEE uncovers the subtle dynamic structures of gene expression patterns, facilitating further downstream dynamic modeling and analysis of gene expression processes. The computational framework of TSEE is generalizable by allowing the incorporation of other sources of information.

相似文献

17.

单细胞RNA测序技术在病毒研究中的应用

屈亮李素仇华吉《遗传》2020,(3):269-277

单细胞RNA测序(single-cell RNA sequencing, scRNA-seq)技术已经成为不同领域中研究细胞异质性的有效工具。在病毒研究领域中,利用该技术分析病毒和细胞的转录组,可以在单细胞水平上检测病毒感染的动态变化,了解病毒与细胞间复杂的相互作用。本文简述了scRNA-seq技术,着重介绍病毒感染宿主细胞后scRNA-seq研究的最新进展,同时也描述了细胞周期、基因表达、细胞状态等细胞异质性对病毒感染过程的影响,以及病毒变异对其本身感染过程的影响。此外,本文还分析了scRNA-seq在研究病毒–宿主互作动态变化方面具有的独特优势,及其在病毒研究领域中广阔的应用前景,为揭示病毒的感染与致病机制、抗病毒靶标的开发等提供参考。相似文献

18.

coupleCoC+: An information-theoretic co-clustering-based transfer learning framework for the integrative analysis of single-cell genomic data

Pengcheng Zeng Zhixiang Lin 《PLoS computational biology》2021,17(6)

Technological advances have enabled us to profile multiple molecular layers at unprecedented single-cell resolution and the available datasets from multiple samples or domains are growing. These datasets, including scRNA-seq data, scATAC-seq data and sc-methylation data, usually have different powers in identifying the unknown cell types through clustering. So, methods that integrate multiple datasets can potentially lead to a better clustering performance. Here we propose coupleCoC+ for the integrative analysis of single-cell genomic data. coupleCoC+ is a transfer learning method based on the information-theoretic co-clustering framework. In coupleCoC+, we utilize the information in one dataset, the source data, to facilitate the analysis of another dataset, the target data. coupleCoC+ uses the linked features in the two datasets for effective knowledge transfer, and it also uses the information of the features in the target data that are unlinked with the source data. In addition, coupleCoC+ matches similar cell types across the source data and the target data. By applying coupleCoC+ to the integrative clustering of mouse cortex scATAC-seq data and scRNA-seq data, mouse and human scRNA-seq data, mouse cortex sc-methylation and scRNA-seq data, and human blood dendritic cells scRNA-seq data from two batches, we demonstrate that coupleCoC+ improves the overall clustering performance and matches the cell subpopulations across multimodal single-cell genomic datasets. coupleCoC+ has fast convergence and it is computationally efficient. The software is available at https://github.com/cuhklinlab/coupleCoC_plus. 相似文献

19.

Integrated analysis of multimodal single-cell data with structural similarity

Yingxin Cao Laiyi Fu Jie Wu Qinke Peng Qing Nie Jing Zhang Xiaohui Xie 《Nucleic acids research》2022,50(21):e121

Multimodal single-cell sequencing technologies provide unprecedented information on cellular heterogeneity from multiple layers of genomic readouts. However, joint analysis of two modalities without properly handling the noise often leads to overfitting of one modality by the other and worse clustering results than vanilla single-modality analysis. How to efficiently utilize the extra information from single cell multi-omics to delineate cell states and identify meaningful signal remains as a significant computational challenge. In this work, we propose a deep learning framework, named SAILERX, for efficient, robust, and flexible analysis of multi-modal single-cell data. SAILERX consists of a variational autoencoder with invariant representation learning to correct technical noises from sequencing process, and a multimodal data alignment mechanism to integrate information from different modalities. Instead of performing hard alignment by projecting both modalities to a shared latent space, SAILERX encourages the local structures of two modalities measured by pairwise similarities to be similar. This strategy is more robust against overfitting of noises, which facilitates various downstream analysis such as clustering, imputation, and marker gene detection. Furthermore, the invariant representation learning part enables SAILERX to perform integrative analysis on both multi- and single-modal datasets, making it an applicable and scalable tool for more general scenarios. 相似文献

20.

单细胞转录组测序技术在心脏发育、疾病以及医学中的应用

朱庆元李天晴《生物技术通报》2021,(1):145-154

单细胞转录组测序(Single-cell RNA sequencing,scRNA-seq)可以在单细胞水平描绘出每个细胞同一基因的表达量在不同细胞间的表达水平差异,使得在单细胞水平重新认识各种组织器官成为可能.目前对心脏的测序研究正从传统的普通转录组水平过渡到单细胞水平,对小鼠和人的心脏的测序陆续地发表出来.概述了s... 相似文献