1.

Differential expression analyses for single-cell RNA-Seq: old questions on new data

Zhun Miao Xuegong Zhang 《Quantitative Biology》2016,4(4):243

Background: Single-cell RNA sequencing (scRNA-seq) is an emerging technology that enables high-resolution detection of heterogeneity between cells. One important application of scRNA-seq data is the detection of differential expression (DE) of genes. Currently, some researchers still apply DE analysis methods developed for bulk RNA-seq data to single-cell data, while new methods designed for scRNA-seq data have also been developed. Because bulk and single-cell RNA-seq data have different characteristics, a systematic evaluation of the two types of methods on scRNA-seq data is needed. Results: In this study, we conducted a series of experiments on scRNA-seq data to quantitatively evaluate 14 popular DE analysis methods, including both traditional methods developed for bulk RNA-seq data and new methods specifically designed for scRNA-seq data. We obtained observations and recommendations for the methods under different situations. Conclusions: DE analysis methods for scRNA-seq data should be chosen with great caution, taking the characteristics of the data into account. Different strategies should be adopted for data with different sample sizes and/or different strengths of the expected signals. Several methods for scRNA-seq data show advantages in some aspects, and DEGSeq tends to outperform other methods with respect to consistency, reproducibility and accuracy of predictions on scRNA-seq data.

2.

EPIC: Inferring relevant cell types for complex traits by integrating genome-wide association studies and single-cell RNA sequencing

Rujin Wang Dan-Yu Lin Yuchao Jiang 《PLoS genetics》2022,18(6)

More than a decade of genome-wide association studies (GWASs) have identified genetic risk variants that are significantly associated with complex traits. Emerging evidence suggests that the function of trait-associated variants likely acts in a tissue- or cell-type-specific fashion. Yet, it remains challenging to prioritize trait-relevant tissues or cell types to elucidate disease etiology. Here, we present EPIC (cEll tyPe enrIChment), a statistical framework that relates large-scale GWAS summary statistics to cell-type-specific gene expression measurements from single-cell RNA sequencing (scRNA-seq). We derive powerful gene-level test statistics for common and rare variants, separately and jointly, and adopt generalized least squares to prioritize trait-relevant cell types while accounting for the correlation structures both within and between genes. Using enrichment of loci associated with four lipid traits in the liver and enrichment of loci associated with three neurological disorders in the brain as ground truths, we show that EPIC outperforms existing methods. We apply our framework to multiple scRNA-seq datasets from different platforms and identify cell types underlying type 2 diabetes and schizophrenia. The enrichment is replicated using independent GWAS and scRNA-seq datasets and further validated using PubMed search and existing bulk case-control testing results.

3.

Direct Comparative Analyses of 10X Genomics Chromium and Smart-seq2

Xiliang Wang Yao He Qiming Zhang Xianwen Ren Zemin Zhang 《Genomics, Proteomics & Bioinformatics》2021,19(2):253-266

4.

A resource of single-cell gene expression profiles in a planarian Dugesia japonica

Makoto Kashima Rei Komura Yuki Sato Chikara Hashimoto Hiromi Hirata 《Development, growth & differentiation》2024,66(1):43-55

The freshwater planarian Dugesia japonica maintains an abundant heterogeneous cell population called neoblasts, which include adult pluripotent stem cells. Thus, it is an excellent model organism for stem cell and regeneration research. Recently, many single-cell RNA sequencing (scRNA-seq) databases of several model organisms, including other planarian species, have become publicly available; these are powerful and useful resources to search for gene expression in various tissues and cells. However, the only scRNA-seq dataset for D. japonica has been limited by the number of genes detected. Herein, we collected D. japonica cells and conducted an scRNA-seq analysis. A novel, automatic, iterative cell clustering strategy produced a dataset of 3,404 cells, which could be classified into 63 cell types based on gene expression profiles. We present two examples of how this D. japonica scRNA-seq dataset can be used. First, the dataset provided results consistent with previous studies as well as novel functionally relevant insights, namely the expression of the DjMTA and DjP2X-A genes in neoblasts that give rise to differentiated cells. Second, we conducted an integrative analysis of the scRNA-seq dataset and time-course bulk RNA-seq of irradiated animals, demonstrating that the dataset can help interpret differentially expressed genes captured via bulk RNA-seq. Using the R package “Seurat” and GSE223927, researchers can easily access and utilize this dataset.

5.

SDImpute: A statistical block imputation method based on cell-level and gene-level information for dropouts in single-cell RNA-seq data

Jing Qi Yang Zhou Zicen Zhao Shuilin Jin 《PLoS computational biology》2021,17(6)

Single-cell RNA sequencing (scRNA-seq) technologies measure gene expression at single-cell resolution and provide a tool for exploring cell heterogeneity and cell types. Because of the low number of mRNA copies extracted per cell, scRNA-seq data exhibit a large number of dropouts, which hinders downstream analysis of the data. We propose a statistical method, SDImpute (Single-cell RNA-seq Dropout Imputation), to implement block imputation for dropout events in scRNA-seq data. SDImpute automatically identifies the dropout events based on the gene expression levels and the variations of gene expression across similar cells and similar genes, and it implements block imputation for dropouts by utilizing gene expression unaffected by dropouts from similar cells. In the experiments, the results on simulated and real datasets suggest that SDImpute is an effective tool to recover the data and preserve the heterogeneity of gene expression across cells. Compared with state-of-the-art imputation methods, SDImpute improves the accuracy of downstream analyses, including clustering, visualization, and differential expression analysis.

6.

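Applications of single-cell transcriptome sequencing in heart development, disease, and medicine

Qingyuan Zhu Tianqing Li 《Biotechnology Bulletin》2021,(1):145-154

Single-cell RNA sequencing (scRNA-seq) can profile, at single-cell resolution, how the expression level of the same gene differs between cells, making it possible to re-examine tissues and organs at the single-cell level. Sequencing studies of the heart are currently moving from conventional bulk transcriptomics to the single-cell level, and sequencing studies of mouse and human hearts have been published in succession. This review outlines s...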

7.

Joint gene network construction by single-cell RNA sequencing data

Meichen Dong Yiping He Yuchao Jiang Fei Zou 《Biometrics》2023,79(2):915-925

8.

Single-Cell Toolkits Opening a New Era for Cell Engineering

Sean Lee Jireh Kim Jong-Eun Park 《Molecules and cells》2021,44(3):127

9.

Analyzing allele specific RNA expression using mixture models

Rong Lu Ryan M Smith Michal Seweryn Danxin Wang Katherine Hartmann Amy Webb Wolfgang Sadee Grzegorz A Rempala 《BMC genomics》2015,16(1)

10.

c-CSN: Single-cell RNA Sequencing Data Analysis by Conditional Cell-specific Network

Lin Li Hao Dai Zhaoyuan Fang Luonan Chen 《Genomics, Proteomics & Bioinformatics》2021,19(2):319-329

The rapid advancement of single-cell technologies has shed new light on the complex mechanisms of cellular heterogeneity. However, compared to bulk RNA sequencing (RNA-seq), single-cell RNA-seq (scRNA-seq) suffers from higher noise and lower coverage, which brings new computational difficulties. Based on statistical independence, the cell-specific network (CSN) is able to quantify the overall associations between genes for each cell, yet it suffers from overestimation related to indirect effects. To overcome this problem, we propose the c-CSN method, which can construct the conditional cell-specific network (CCSN) for each cell. The c-CSN method can measure the direct associations between genes by eliminating the indirect associations. c-CSN can be used for cell clustering and dimension reduction on a network basis of single cells. Intuitively, each CCSN can be viewed as the transformation from less "reliable" gene expression to more "reliable" gene–gene associations in a cell. Based on CCSN, we further design network flow entropy (NFE) to estimate the differentiation potency of a single cell. A number of scRNA-seq datasets were used to demonstrate the advantages of our approach. 1) One direct association network is generated for one cell. 2) Most existing scRNA-seq methods designed for gene expression matrices are also applicable to c-CSN-transformed degree matrices. 3) CCSN-based NFE helps resolve the direction of differentiation trajectories by quantifying the potency of each cell. c-CSN is publicly available at https://github.com/LinLi-0909/c-CSN.

11.

Single-cell RNA sequencing of batch Chlamydomonas cultures reveals heterogeneity in their diurnal cycle phase

Feiyang Ma Patrice A Salom Sabeeha S Merchant Matteo Pellegrini 《The Plant cell》2021,33(4):1042

12.

scDeepSort: a pre-trained cell-type annotation method for single-cell transcriptomics using deep learning with a weighted graph neural network

Xin Shao Haihong Yang Xiang Zhuang Jie Liao Penghui Yang Junyun Cheng Xiaoyan Lu Huajun Chen Xiaohui Fan 《Nucleic acids research》2021,49(21):e122

13.

Independent component analysis based gene co-expression network inference (ICAnet) to decipher functional modules for better single-cell clustering and batch integration

Weixu Wang Huanhuan Tan Mingwan Sun Yiqing Han Wei Chen Shengnu Qiu Ke Zheng Gang Wei Ting Ni 《Nucleic acids research》2021,49(9):e54

With the tremendous increase in publicly available single-cell RNA-sequencing (scRNA-seq) datasets, bioinformatics methods based on gene co-expression networks are becoming efficient tools for analyzing scRNA-seq data, improving cell type prediction accuracy and in turn facilitating biological discovery. However, current methods are mainly based on overall co-expression correlation and overlook co-expression that exists in only a subset of cells; they thus fail to discover certain rare cell types and are sensitive to batch effects. Here, we developed independent component analysis-based gene co-expression network inference (ICAnet), which decomposes scRNA-seq data into a series of independent gene expression components and infers co-expression modules, improving cell clustering and rare cell-type discovery. ICAnet showed efficient performance for cell clustering and batch integration using scRNA-seq datasets spanning multiple cells/tissues/donors/library types. It works stably on datasets produced by different library construction strategies and with different sequencing depths and cell numbers. We demonstrated the capability of ICAnet to discover rare cell types in multiple independent scRNA-seq datasets from different sources. Importantly, the identified modules activated in acute myeloid leukemia scRNA-seq datasets have the potential to serve as new diagnostic markers. Thus, ICAnet is a competitive tool for cell clustering and biological interpretation in single-cell RNA-seq data analysis.

14.

Single-cell RNA sequencing reveals a high-resolution cell atlas of xylem in Populus

Hui Li Xinren Dai Xiong Huang Mengxuan Xu Qiao Wang Xiaojing Yan Ronald R. Sederoff Quanzi Li 《Journal of Integrative Plant Biology》2021,63(11):1906-1921

High-throughput single-cell RNA sequencing (scRNA-seq) has advantages over traditional RNA-seq for exploring spatiotemporal information on dynamic gene expression in heterogeneous tissues. We performed Drop-seq, a method for the dropwise sequestration of single cells for sequencing, on protoplasts from the differentiating xylem of Populus alba × Populus glandulosa. The scRNA-seq profiled 9,798 cells, which were grouped into 12 clusters. Through characterization of differentially expressed genes in each cluster and RNA in situ hybridizations, we identified vessel cells, fiber cells, ray parenchyma cells and xylem precursor cells. Diffusion pseudotime analyses revealed the differentiation trajectories of vessels, fiber cells and ray parenchyma cells and indicated a different differentiation process between vessels and fiber cells, and a similar differentiation process between fiber cells and ray parenchyma cells. We identified marker genes for each cell type (cluster) and key candidate regulators during developmental stages of xylem cell differentiation. Our study generates a high-resolution expression atlas of wood formation at the single-cell level and provides valuable information on wood formation.

15.

VASC: Dimension Reduction and Visualization of Single-cell RNA-seq Data by Deep Variational Autoencoder

Dongfang Wang Jin Gu 《Genomics, Proteomics & Bioinformatics》2018,16(5):320-331

16.

New technologies to study helminth development and host-parasite interactions

《International journal for parasitology》2023,53(8):393-403

How parasites develop and survive, and how they stimulate or modulate host immune responses are important in understanding disease pathology and for the design of new control strategies. Microarray analysis and bulk RNA sequencing have provided a wealth of data on gene expression as parasites develop through different life-cycle stages and on host cell responses to infection. These techniques have enabled gene expression in the whole organism or host tissue to be detailed, but do not take account of the heterogeneity between cells of different types or developmental stages, nor the spatial organisation of these cells. Single-cell RNA-seq (scRNA-seq) adds a new dimension to studying parasite biology and host immunity by enabling gene profiling at the individual cell level. Here we review the application of scRNA-seq to establish gene expression cell atlases for multicellular helminths and to explore the expansion and molecular profile of individual host cell types involved in parasite immunity and tissue repair. Studying host-parasite interactions in vivo is challenging and we conclude this review by briefly discussing the applications of organoids (stem-cell derived mini-tissues) to examine host-parasite interactions at the local level, and as a potential system to study parasite development in vitro. Organoid technology and its applications have developed rapidly, and the elegant studies performed to date support the use of organoids as an alternative in vitro system for research on helminth parasites.

17.

Estimating cell type composition using isoform expression one gene at a time

Hillary M. Heiling Douglas R. Wilson Naim U. Rashid Wei Sun Joseph G. Ibrahim 《Biometrics》2023,79(2):854-865

Human tissue samples are often mixtures of heterogeneous cell types, which can confound the analyses of gene expression data derived from such tissues. The cell type composition of a tissue sample may itself be of interest and is needed for proper analysis of differential gene expression. A variety of computational methods have been developed to estimate cell type proportions using gene-level expression data. However, RNA isoforms can also be differentially expressed across cell types, and isoform-level expression could be equally or more informative for determining cell type origin than gene-level expression. We propose a new computational method, IsoDeconvMM, which estimates cell type fractions using isoform-level gene expression data. A novel and useful feature of IsoDeconvMM is that it can estimate cell type proportions using only a single gene, though in practice we recommend aggregating estimates of a few dozen genes to obtain more accurate results. We demonstrate the performance of IsoDeconvMM using a unique data set with cell type–specific RNA-seq data across more than 135 individuals. This data set allows us to evaluate different methods given the biological variation of cell type–specific gene expression data across individuals. We further complement this analysis with additional simulations.
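
The claim that isoform usage within a single gene can carry information about cell type fractions can be illustrated with a toy calculation. The sketch below is not IsoDeconvMM and does not reproduce its mixture model. It assumes only two cell types with known reference isoform-usage vectors, equal total expression of the gene in both types (so that the mixed sample's isoform usage is a convex combination of the references), and no noise model. The function names `estimate_fraction` and `aggregate_fractions` are hypothetical, and both the least-squares fit and the median aggregation are choices made here for illustration.

```python
import statistics


def estimate_fraction(mixture, type_a, type_b):
    """Least-squares estimate of the fraction p of cell type A for one gene.

    Assumes mixture ~= p * type_a + (1 - p) * type_b, where each argument is a
    list of isoform-usage proportions for the same gene (illustrative model).
    """
    if not (len(mixture) == len(type_a) == len(type_b)):
        raise ValueError("all isoform vectors must have the same length")
    diff = [a - b for a, b in zip(type_a, type_b)]
    norm2 = sum(d * d for d in diff)
    if norm2 == 0:
        # Identical isoform usage: this gene carries no information about p.
        raise ValueError("reference isoform usage is identical in both types")
    p = sum((m - b) * d for m, b, d in zip(mixture, type_b, diff)) / norm2
    # Clip to the valid range for a proportion.
    return min(1.0, max(0.0, p))


def aggregate_fractions(per_gene_estimates):
    """Combine per-gene estimates; the median is an arbitrary robust choice."""
    return statistics.median(per_gene_estimates)


# One gene with three isoforms; the sample is 25% type A and 75% type B.
ref_a = [0.7, 0.2, 0.1]
ref_b = [0.1, 0.3, 0.6]
mixed = [0.25 * a + 0.75 * b for a, b in zip(ref_a, ref_b)]
single_gene_estimate = estimate_fraction(mixed, ref_a, ref_b)
```

In this noise-free toy case a single gene recovers the mixing fraction; with noisy data, combining per-gene estimates through `aggregate_fractions` reflects the abstract's advice to aggregate over a few dozen genes.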

18.

Cell function and identity revealed by comparative scRNA-seq analysis in human nasal, bronchial and epididymis epithelia

《European journal of cell biology》2022,101(3):151231

19.

Approximate distance correlation for selecting highly interrelated genes across datasets

Qunlun Shen Shihua Zhang 《PLoS computational biology》2021,17(11)

With the rapid accumulation of biological omics datasets, decoding the underlying relationships of cross-dataset genes becomes an important issue. Previous studies have attempted to identify differentially expressed genes across datasets. However, it is hard for them to detect interrelated ones. Moreover, existing correlation-based algorithms can only measure the relationship between genes within a single dataset or two multi-modal datasets from the same samples. It is still unclear how to quantify the strength of association of the same gene across two biological datasets with different samples. To this end, we propose Approximate Distance Correlation (ADC) to select interrelated genes with statistical significance across two different biological datasets. ADC first obtains the k most correlated genes for each target gene as its approximate observations, and then calculates the distance correlation (DC) for the target gene across two datasets. ADC repeats this process for all genes and then performs the Benjamini-Hochberg adjustment to control the false discovery rate. We demonstrate the effectiveness of ADC with simulation data and four real applications to select highly interrelated genes across two datasets. These four applications include 21 cancer RNA-seq datasets of different tissues; six single-cell RNA-seq (scRNA-seq) datasets of mouse hematopoietic cells across six different cell types along the hematopoietic cell lineage; five scRNA-seq datasets of pancreatic islet cells across five different technologies; and coupled single-cell ATAC-seq (scATAC-seq) and scRNA-seq data of peripheral blood mononuclear cells (PBMC). Extensive results demonstrate that ADC is a powerful tool to uncover interrelated genes with strong biological implications and is scalable to large-scale datasets. Moreover, the number of such genes can serve as a metric to measure the similarity between two datasets, which could characterize the relative difference of diverse cell types and technologies.
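
Two standard building blocks named in this abstract, the sample distance correlation and the Benjamini-Hochberg adjustment, can be sketched in a few lines. The code below is a minimal pure-Python illustration of those two textbook procedures for one-dimensional samples; it is not the authors' ADC implementation and omits the step that builds approximate observations from the k most correlated genes. The function names are hypothetical.

```python
import math


def _centered_distances(values):
    """Double-centered matrix of pairwise absolute differences."""
    n = len(values)
    d = [[abs(values[i] - values[j]) for j in range(n)] for i in range(n)]
    row_means = [sum(row) / n for row in d]
    grand_mean = sum(row_means) / n
    # The distance matrix is symmetric, so column means equal row means.
    return [
        [d[i][j] - row_means[i] - row_means[j] + grand_mean for j in range(n)]
        for i in range(n)
    ]


def distance_correlation(x, y):
    """Sample distance correlation between two equal-length 1-D samples."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    a = _centered_distances(x)
    b = _centered_distances(y)
    dcov2 = sum(a[i][j] * b[i][j] for i in range(n) for j in range(n)) / n**2
    dvar_x = sum(a[i][j] ** 2 for i in range(n) for j in range(n)) / n**2
    dvar_y = sum(b[i][j] ** 2 for i in range(n) for j in range(n)) / n**2
    denom = math.sqrt(dvar_x * dvar_y)
    if denom == 0:
        # At least one sample is constant; the statistic is defined as 0.
        return 0.0
    return math.sqrt(max(dcov2, 0.0) / denom)


def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


dcor_linear = distance_correlation([1, 2, 3, 4], [3, 5, 7, 9])
adjusted = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
```

For an exactly linear relationship the distance correlation equals 1 up to floating-point error, and genes whose adjusted p-value falls below a chosen threshold would be the ones reported as significant at that false discovery rate.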

20.

Single-cell RNA-seq analysis of mouse preimplantation embryos by third-generation sequencing

Xiaoying Fan Dong Tang Yuhan Liao Pidong Li Yu Zhang Minxia Wang Fan Liang Xiao Wang Yun Gao Lu Wen Depeng Wang Yang Wang Fuchou Tang 《PLoS biology》2020,18(12)
