期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

MISC: missing imputation for single-cell RNA sequencing data

Mary Qu Yang Sherman M. Weissman William Yang Jialing Zhang Allon Canaann Renchu Guan 《BMC systems biology》2018,12(7):114

Background

Single-cell RNA sequencing (scRNA-seq) technology provides an effective way to study cell heterogeneity. However, due to the low capture efficiency and stochastic gene expression, scRNA-seq data often contains a high percentage of missing values. It has been showed that the missing rate can reach approximately 30% even after noise reduction. To accurately recover missing values in scRNA-seq data, we need to know where the missing data is; how much data is missing; and what are the values of these data.

Methods

To solve these three problems, we propose a novel model with a hybrid machine learning method, namely, missing imputation for single-cell RNA-seq (MISC). To solve the first problem, we transformed it to a binary classification problem on the RNA-seq expression matrix. Then, for the second problem, we searched for the intersection of the classification results, zero-inflated model and false negative model results. Finally, we used the regression model to recover the data in the missing elements.

Results

We compared the raw data without imputation, the mean-smooth neighbor cell trajectory, MISC on chronic myeloid leukemia data (CML), the primary somatosensory cortex and the hippocampal CA1 region of mouse brain cells. On the CML data, MISC discovered a trajectory branch from the CP-CML to the BC-CML, which provides direct evidence of evolution from CP to BC stem cells. On the mouse brain data, MISC clearly divides the pyramidal CA1 into different branches, and it is direct evidence of pyramidal CA1 in the subpopulations. In the meantime, with MISC, the oligodendrocyte cells became an independent group with an apparent boundary.

Conclusions

Our results showed that the MISC model improved the cell type classification and could be instrumental to study cellular heterogeneity. Overall, MISC is a robust missing data imputation model for single-cell RNA-seq data.

相似文献

2.

Regulatory network-based imputation of dropouts in single-cell RNA sequencing data

Ana Carolina Leote Xiaohui Wu Andreas Beyer 《PLoS computational biology》2022,18(2)

相似文献

3.

Joint gene network construction by single-cell RNA sequencing data

Meichen Dong Yiping He Yuchao Jiang Fei Zou 《Biometrics》2023,79(2):915-925

相似文献

4.

Highly Regional Genes: graph-based gene selection for single-cell RNA-seq data

《遗传学报》2022,49(9):891-899

相似文献

5.

TSEE: an elastic embedding method to visualize the dynamic gene expression patterns of time series single-cell RNA sequencing data

An Shaokun Ma Liang Wan Lin 《BMC genomics》2019,20(2):77-92

Background

Time series single-cell RNA sequencing (scRNA-seq) data are emerging. However, the analysis of time series scRNA-seq data could be compromised by 1) distortion created by assorted sources of data collection and generation across time samples and 2) inheritance of cell-to-cell variations by stochastic dynamic patterns of gene expression. This calls for the development of an algorithm able to visualize time series scRNA-seq data in order to reveal latent structures and uncover dynamic transition processes.

Results

In this study, we propose an algorithm, termed time series elastic embedding (TSEE), by incorporating experimental temporal information into the elastic embedding (EE) method, in order to visualize time series scRNA-seq data. TSEE extends the EE algorithm by penalizing the proximal placement of latent points that correspond to data points otherwise separated by experimental time intervals. TSEE is herein used to visualize time series scRNA-seq datasets of embryonic developmental processed in human and zebrafish. We demonstrate that TSEE outperforms existing methods (e.g. PCA, tSNE and EE) in preserving local and global structures as well as enhancing the temporal resolution of samples. Meanwhile, TSEE reveals the dynamic oscillation patterns of gene expression waves during zebrafish embryogenesis.

Conclusions

TSEE can efficiently visualize time series scRNA-seq data by diluting the distortions of assorted sources of data variation across time stages and achieve the temporal resolution enhancement by preserving temporal order and structure. TSEE uncovers the subtle dynamic structures of gene expression patterns, facilitating further downstream dynamic modeling and analysis of gene expression processes. The computational framework of TSEE is generalizable by allowing the incorporation of other sources of information.

相似文献

6.

Crafting a blueprint for single-cell RNA sequencing

《Trends in plant science》2022,27(1):92-103

相似文献

7.

SDImpute: A statistical block imputation method based on cell-level and gene-level information for dropouts in single-cell RNA-seq data

Jing Qi Yang Zhou Zicen Zhao Shuilin Jin 《PLoS computational biology》2021,17(6)

The single-cell RNA sequencing (scRNA-seq) technologies obtain gene expression at single-cell resolution and provide a tool for exploring cell heterogeneity and cell types. As the low amount of extracted mRNA copies per cell, scRNA-seq data exhibit a large number of dropouts, which hinders the downstream analysis of the scRNA-seq data. We propose a statistical method, SDImpute (Single-cell RNA-seq Dropout Imputation), to implement block imputation for dropout events in scRNA-seq data. SDImpute automatically identifies the dropout events based on the gene expression levels and the variations of gene expression across similar cells and similar genes, and it implements block imputation for dropouts by utilizing gene expression unaffected by dropouts from similar cells. In the experiments, the results of the simulated datasets and real datasets suggest that SDImpute is an effective tool to recover the data and preserve the heterogeneity of gene expression across cells. Compared with the state-of-the-art imputation methods, SDImpute improves the accuracy of the downstream analysis including clustering, visualization, and differential expression analysis. 相似文献

8.

Scalable microfluidics for single-cell RNA printing and sequencing

Sayantan Bose Zhenmao Wan Ambrose Carr Abbas H. Rizvi Gregory Vieira Dana Pe’er Peter A. Sims 《Genome biology》2015,16(1)

相似文献

9.

Correction: SDImpute: A statistical block imputation method based on cell-level and gene-level information for dropouts in single-cell RNA-seq data

Jing Qi Yang Zhou Zicen Zhao Shuilin Jin 《PLoS computational biology》2022,18(1)

相似文献

10.

TedSim: temporal dynamics simulation of single-cell RNA sequencing data and cell division history

Xinhai Pan Hechen Li Xiuwei Zhang 《Nucleic acids research》2022,50(8):4272

Recently, lineage tracing technology using CRISPR/Cas9 genome editing has enabled simultaneous readouts of gene expressions and lineage barcodes, which allows for the reconstruction of the cell division tree and makes it possible to reconstruct ancestral cell types and trace the origin of each cell type. Meanwhile, trajectory inference methods are widely used to infer cell trajectories and pseudotime in a dynamic process using gene expression data of present-day cells. Here, we present TedSim (single-cell temporal dynamics simulator), which simulates the cell division events from the root cell to present-day cells, simultaneously generating two data modalities for each single cell: the lineage barcode and gene expression data. TedSim is a framework that connects the two problems: lineage tracing and trajectory inference. Using TedSim, we conducted analysis to show that (i) TedSim generates realistic gene expression and barcode data, as well as realistic relationships between these two data modalities; (ii) trajectory inference methods can recover the underlying cell state transition mechanism with balanced cell type compositions; and (iii) integrating gene expression and barcode data can provide more insights into the temporal dynamics in cell differentiation compared to using only one type of data, but better integration methods need to be developed. 相似文献

11.

A novel method for determining microflora composition using dynamic phylogenetic analysis of 16S ribosomal RNA deep sequencing data

Chan ER Hester J Kalady M Xiao H Li X Serre D 《Genomics》2011,98(4):253-259

Deep sequencing of the 16S rRNA gene provides a comprehensive view of bacterial communities in a particular environment and has expanded our ability to study the impact of the microflora on human health and disease. Current analysis methods rely on comparisons of the sequences generated with an expanding but limited set of annotated 16S rRNA sequences or phylogenic clustering of sequences based on arbitrary similarity cutoffs. We describe a novel approach to characterize bacterial composition using deep sequencing of 16S rRNA gene. Our method defines operational taxonomic units based on phylogenetic tree reconstruction and dynamic clustering of sequences using solely sequencing data. These OTUs can be used to identify differences in bacteria abundance between environments. This approach can perform better than previous phylogenetic methods and will significantly improve our understanding of the microfloral role on human diseases by providing a comprehensive analysis of the microbial composition from various bacterial communities. 相似文献

12.

Miscellanea. A robust imputation method for surrogate outcome data

Chen Y-H 《Biometrika》2000,87(3):711-716

相似文献

13.

Quartz-Seq: a highly reproducible and sensitive single-cell RNA sequencing method,reveals non-genetic gene-expression heterogeneity

Yohei Sasagawa Itoshi Nikaido Tetsutaro Hayashi Hiroki Danno Kenichiro D Uno Takeshi Imai Hiroki R Ueda 《Genome biology》2013,14(4):R31

Development of a highly reproducible and sensitive single-cell RNA sequencing (RNA-seq) method would facilitate the understanding of the biological roles and underlying mechanisms of non-genetic cellular heterogeneity. In this study, we report a novel single-cell RNA-seq method called Quartz-Seq that has a simpler protocol and higher reproducibility and sensitivity than existing methods. We show that single-cell Quartz-Seq can quantitatively detect various kinds of non-genetic cellular heterogeneity, and can detect different cell types and different cell-cycle phases of a single cell type. Moreover, this method can comprehensively reveal gene-expression heterogeneity between single cells of the same cell type in the same cell-cycle phase. 相似文献

14.

A high sensitivity method for sequencing RNA: application to ribosomal RNA.

A Bakin J Ofengand 《BioTechniques》1992,13(5):682-684

相似文献

15.

Detecting cell-type-specific allelic expression imbalance by integrative analysis of bulk and single-cell RNA sequencing data

Jiaxin Fan Xuran Wang Rui Xiao Mingyao Li 《PLoS genetics》2021,17(3)

相似文献

16.

SCOPE-Seq: a scalable technology for linking live cell imaging and single-cell RNA sequencing

Jinzhou Yuan Jenny Sheng Peter A. Sims 《Genome biology》2018,19(1):227

Optically decodable beads link the identity of a sample to a measurement through an optical barcode, enabling libraries of biomolecules to be captured on beads in solution and decoded by fluorescence. This approach has been foundational to microarray, sequencing, and flow-based expression profiling technologies. We combine microfluidics with optically decodable beads and show that phenotypic analysis of living cells can be linked to single-cell sequencing. As a proof-of-concept, we demonstrate the accuracy and scalability of our tool called Single Cell Optical Phenotyping and Expression sequencing (SCOPE-Seq) to combine live cell imaging with single-cell RNA sequencing. 相似文献

17.

Author Corrections: Reconstruction of cell spatial organization from single-cell RNA sequencing data based on ligand-receptor mediated self-assembly

Xianwen Ren Guojie Zhong Qiming Zhang Lei Zhang Yujie Sun Zemin Zhang 《Cell research》2021,31(12):1319

相似文献

18.

STRIDE: accurately decomposing and integrating spatial transcriptomics using single-cell RNA sequencing

Dongqing Sun Zhaoyang Liu Taiwen Li Qiu Wu Chenfei Wang 《Nucleic acids research》2022,50(7):e42

相似文献

19.

Studying the mechanism underlying lipid metabolism in osteosarcoma based on transcriptomic RNA sequencing and single-cell data

Zhe Li Chi Jin Xinchang Lu Yan Zhang Yi Zhang Jia Wen Yongkui Liu Xiaoting Liu Jiazhen Li 《The journal of gene medicine》2023,25(6):e3491

相似文献

20.

Q-RRBS: a quantitative reduced representation bisulfite sequencing method for single-cell methylome analyses

Kangli Wang Xianfeng Li Shanshan Dong Jialong Liang Fengbiao Mao Cheng Zeng Honghu Wu Jinyu Wu Wanshi Cai Zhong Sheng Sun 《Epigenetics》2015,10(9):775-783

Reduced representation bisulfite sequencing (RRBS) is a powerful method of DNA methylome profiling that can be applied to single cells. However, no previous report has described how PCR-based duplication-induced artifacts affect the accuracy of this method when measuring DNA methylation levels. For quantifying the effects of duplication-induced artifacts on methylome profiling when using ultra-trace amounts of starting material, we developed a novel method, namely quantitative RRBS (Q-RRBS), in which PCR-induced duplication is excluded through the use of unique molecular identifiers (UMIs). By performing Q-RRBS on varying amounts of starting material, we determined that duplication-induced artifacts were more severe when small quantities of the starting material were used. However, through using the UMIs, we successfully eliminated these artifacts. In addition, Q-RRBS could accurately detect allele-specific methylation in absence of allele-specific genetic variants. Our results demonstrate that Q-RRBS is an optimal strategy for DNA methylation profiling of single cells or samples containing ultra-trace amounts of cells. 相似文献