E-spatial

Beta

New application is live now

E-spatial

Single-cell spatial explorer

Notebooks

Premium

expiMap: Biologically informed deep learning to query gene programs in single-cell atlases
lock icon

BioTuring

The development of large-scale single-cell atlases has allowed describing cell states in a more detailed manner. Meanwhile, current deep leanring methods enable rapid analysis of newly generated query datasets by mapping them into reference atlases. expiMap (‘explainable programmable mapper’) Lotfollahi, Mohammad, et al. is one of the methods proposed for single-cell reference mapping. Furthermore, it incorporates prior knowledge from gene sets databases or users to analyze query data in the context of known gene programs (GPs).
Required GPU
expiMap
Spatial charting of single-cell transcriptomes in tissues - celltrek
lock icon

BioTuring

Single-cell RNA sequencing methods can profile the transcriptomes of single cells but cannot preserve spatial information. Conversely, spatial transcriptomics assays can profile spatial regions in tissue sections but do not have single-cell resolution. Here, Runmin Wei (Siyuan He, Shanshan Bai, Emi Sei, Min Hu, Alastair Thompson, Ken Chen, Savitri Krishnamurthy & Nicholas E. Navin) developed a computational method called CellTrek that combines these two datasets to achieve single-cell spatial mapping through coembedding and metric learning approaches. They benchmarked CellTrek using simulation and in situ hybridization datasets, which demonstrated its accuracy and robustness. They then applied CellTrek to existing mouse brain and kidney datasets and showed that CellTrek can detect topological patterns of different cell types and cell states. They performed single-cell RNA sequencing and spatial transcriptomics experiments on two ductal carcinoma in situ tissues and applied CellTrek to identify tumor subclones that were restricted to different ducts, and specific T-cell states adjacent to the tumor areas.
Only CPU
CellTrek
SCEVAN: Single CEll Variational ANeuploidy analysis
lock icon

BioTuring

In the realm of cancer research, grasping the intricacies of intratumor heterogeneity and its interplay with the immune system is paramount for deciphering treatment resistance and tumor progression. While single-cell RNA sequencing unveils diverse transcriptional programs, the challenge persists in automatically discerning malignant cells from non-malignant ones within complex datasets featuring varying coverage depths. Thus, there arises a compelling need for an automated solution to this classification conundrum. SCEVAN (De Falco et al., 2023), a variational algorithm, is designed to autonomously identify the clonal copy number substructure of tumors using single-cell data. It automatically separates malignant cells from non-malignant ones, and subsequently, groups of malignant cells are examined through an optimization-driven joint segmentation process.
Required GPU
scevan
CopyKAT: Delineating copy number and clonal substructure in human tumors from single-cell transcriptomes
lock icon

BioTuring

Classification of tumor and normal cells in the tumor microenvironment from scRNA-seq data is an ongoing challenge in human cancer study. Copy number karyotyping of aneuploid tumors (***copyKAT***) (Gao, Ruli, et al., 2021) is a method proposed for identifying copy number variations in single-cell transcriptomics data. It is used to predict aneuploid tumor cells and delineate the clonal substructure of different subpopulations that coexist within the tumor mass. In this notebook, we will illustrate a basic workflow of CopyKAT based on the tutorial provided on CopyKAT's repository. We will use a dataset of triple negative cancer tumors sequenced by 10X Chromium 3'-scRNAseq (GSM4476486) as an example. The dataset contains 20,990 features across 1,097 cells. We have modified the notebook to demonstrate how the tool works on BioTuring's platform.

Trends

Multimodal single-cell chromatin analysis with Signac

BioTuring

The recent development of experimental methods for measuring chromatin state at single-cell resolution has created a need for computational tools capable of analyzing these datasets. Here we developed Signac, a framework for the analysis of single-cell chromatin data, as an extension of the Seurat R toolkit for single-cell multimodal analysis. **Signac** enables an end-to-end analysis of single-cell chromatin data, including peak calling, quantification, quality control, dimension reduction, clustering, integration with single-cell gene expression datasets, DNA motif analysis, and interactive visualization. Furthermore, Signac facilitates the analysis of multimodal single-cell chromatin data, including datasets that co-assay DNA accessibility with gene expression, protein abundance, and mitochondrial genotype. We demonstrate scaling of the Signac framework to datasets containing over 700,000 cells.
Only CPU
Required PFP
signac
scGPT: Towards Building a Foundational Model for Single-Cell Multi-omics Using Generative AI

BioTuring

Generative pre-trained models have demonstrated exceptional success in various fields, including natural language processing and computer vision. In line with this progress, scGPT has been developed as a foundational model tailored specifically for the field of single-cell biology. It employs the generative pre-training transformer framework on an extensive dataset comprising more than 33 million cells. scGPT effectively extracts valuable biological insights related to genes and cells and can be fine-tuned to excel in numerous downstream applications.
Required GPU
scgpt
Seurat
Bioturing Massive-scale Analysis Solution for CellChat: Running analysis for massive-scale data from Seurat dataset

BioTuring

This tool provides a user-friendly and automated way to analyze large-scale single-cell RNA-seq datasets stored in RDS (Seurat) format. It allows users to run various analysis tools on their data in one command, streamlining the analysis workflow and saving time. Note that this notebook is only for the demonstration of the tool. Users can run the tool directly through the command line. Currently, we support: - CellChat - Inference and analysis of cell-cell communication using CellChat
Only CPU
CellChat
BioTuring Data Converter: Seurat <=> Scanpy for single-cell data transcriptomic and spatial transcriptomics

BioTuring

This notebook illustrates how to convert data from a Seurat object into a Scanpy annotation data and a Scanpy annotation data into a Seurat object using the BioStudio data transformation library (currently under development). It facilitates continued research using libraries that interact with Scanpy in Python and Seurat in R. seurat.to.adata function can retain information about reductions (such as PCA, t-SNE, UMAP and Seurat Clusters) and spatial information.
SoupX: removing ambient RNA contamination from droplet-based single-cell RNA sequencing data

BioTuring

Droplet-based single-cell RNA sequence analyses assume that all acquired RNAs are endogenous to cells. However, there is a certain amount of cell-free mRNAs floating in the input solution (referred to as 'the soup'), created from cells in the input solution being lysed. These background mRNAs are then distributed into the droplets with cells and sequenced alongside them, resulting in background contamination that confounds the biological interpretation of single-cell transcriptomic data. SoupX (Young and Behjati, 2020) is one of the methods proposed for ambient mRNA removal. In this notebook, we will illustrate a workflow example that applies SoupX to correct the ambient RNA in a dataset of 10k PBMC cells. The output of SoupX is a modified counts matrix, which can be used for any downstream analysis tool.
Only CPU
SoupX
Seurat - Integrated analysis of multimodal single-cell data

BioTuring

The simultaneous measurement of multiple modalities represents an exciting frontier for single-cell genomics and necessitates computational methods that can define cellular states based on multimodal data. Here, we introduce "weighted-nearest neighbor" analysis, an unsupervised framework to learn the relative utility of each data type in each cell, enabling an integrative analysis of multiple modalities. We apply our procedure to a CITE-seq dataset of 211,000 human peripheral blood mononuclear cells (PBMCs) with panels extending to 228 antibodies to construct a multimodal reference atlas of the circulating immune system. Multimodal analysis substantially improves our ability to resolve cell states, allowing us to identify and validate previously unreported lymphoid subpopulations. Moreover, we demonstrate how to leverage this reference to rapidly map new datasets and to interpret immune responses to vaccination and coronavirus disease 2019 (COVID-19). Our approach represents a broadly applicable strategy to analyze single-cell multimodal datasets and to look beyond the transcriptome toward a unified and multimodal definition of cellular identity.
Only CPU
Seurat
A workflow to analyze cell-cell communications on Visium data

BioTuring

Single-cell RNA data allows cell-cell communications (***CCC***) methods to infer CCC at either the individual cell or cell cluster/cell type level, but physical distances between cells are not preserved Almet, Axel A., et al., (2021). On the other hand, spatial data provides spatial distances between cells, but single-cell or gene resolution is potentially lost. Therefore, integrating two types of data in a proper manner can complement their strengths and limitations, from that improve CCC analysis. In this pipeline, we analyze CCC on Visium data with single-cell data as a reference. The pipeline includes 4 sub-notebooks as following 01-deconvolution: This step involves deconvolution and cell type annotation for Visium data, with cell type information obtained from a relevant single-cell dataset. The deconvolution method is SpatialDWLS which is integrated in Giotto package. 02-giotto: performs spatial based CCC and expression based CCC on Visium data using Giotto method. 03-nichenet: performs spatial based CCC and expression based CCC on Visium data using NicheNet method. 04-visualization: visualizes CCC results obtained from Giotto and NicheNet.
Scanpy is a scalable toolkit for analyzing single-cell gene expression data built jointly with anndata.

BioTuring

SCANPY integrates the analysis possibilities of established R-based frameworks and provides them in a scalable and modular form. Specifically, SCANPY provides preprocessing comparable to SEURAT and CELL RANGER, visualization through TSNE, graph-drawing and diffusion maps, clustering similar to PHENOGRAPH, identification of marker genes for clusters via differential expression tests and pseudotemporal ordering via diffusion pseudotime, which compares favorably with MONOCLE 2, and WISHBONE.
Only CPU
Scanpy
Monocle3 - An analysis toolkit for single-cell RNA-seq

BioTuring

Build single-cell trajectories with the software that introduced **pseudotime**. Find out about cell fate decisions and the genes regulated as they're made. Group and classify your cells based on gene expression. Identify new cell types and states and the genes that distinguish them. Find genes that vary between cell types and states, over trajectories, or in response to perturbations using statistically robust, flexible differential analysis. In development, disease, and throughout life, cells transition from one state to another. Monocle introduced the concept of **pseudotime**, which is a measure of how far a cell has moved through biological progress. Many researchers are using single-cell RNA-Seq to discover new cell types. Monocle 3 can help you purify them or characterize them further by identifying key marker genes that you can use in follow-up experiments such as immunofluorescence or flow sorting. **Single-cell trajectory analysis** shows how cells choose between one of several possible end states. The new reconstruction algorithms introduced in Monocle 3 can robustly reveal branching trajectories, along with the genes that cells use to navigate these decisions.
CellPhoneDB: inferring cell–cell communication from combined expression of multi-subunit ligand–receptor complexes

BioTuring

Cell–cell communication mediated by ligand–receptor complexes is critical to coordinating diverse biological processes, such as development, differentiation and inflammation. To investigate how the context-dependent crosstalk of different cell types enables physiological processes to proceed, we developed CellPhoneDB, a novel repository of ligands, receptors and their interactions. In contrast to other repositories, our database takes into account the subunit architecture of both ligands and receptors, representing heteromeric complexes accurately. We integrated our resource with a statistical framework that predicts enriched cellular interactions between two cell types from single-cell transcriptomics data. Here, we outline the structure and content of our repository, provide procedures for inferring cell–cell communication networks from single-cell RNA sequencing data and present a practical step-by-step guide to help implement the protocol. CellPhoneDB v.2.0 is an updated version of our resource that incorporates additional functionalities to enable users to introduce new interacting molecules and reduces the time and resources needed to interrogate large datasets. CellPhoneDB v.2.0 is publicly available, both as code and as a user-friendly web interface; it can be used by both experts and researchers with little experience in computational genomics. In our protocol, we demonstrate how to evaluate meaningful biological interactions with CellPhoneDB v.2.0 using published datasets. This protocol typically takes ~2 h to complete, from installation to statistical analysis and visualization, for a dataset of ~10 GB, 10,000 cells and 19 cell types, and using five threads.
Inference and analysis of cell-cell communication using CellChat

BioTuring

Understanding global communications among cells requires accurate representation of cell-cell signaling links and effective systems-level analyses of those links. We construct a database of interactions among ligands, receptors and their cofactors that accurately represent known heteromeric molecular complexes. We then develop **CellChat**, a tool that is able to quantitatively infer and analyze intercellular communication networks from single-cell RNA-sequencing (scRNA-seq) data. CellChat predicts major signaling inputs and outputs for cells and how those cells and signals coordinate for functions using network analysis and pattern recognition approaches. Through manifold learning and quantitative contrasts, CellChat classifies signaling pathways and delineates conserved and context-specific pathways across different datasets. Applying **CellChat** to mouse and human skin datasets shows its ability to extract complex signaling patterns.
Required GPU
CellChat
pySCENIC: Single-Cell rEgulatory Network Inference and Clustering

BioTuring

SCENIC Suite is a set of tools to study and decipher gene regulation. Its core is based on SCENIC (Single-Cell Regulatory Network Inference and Clustering) which enables you to infer transcription factors, gene regulatory networks and cell types from single-cell RNA-seq data. pySCENIC is a lightning-fast python implementation of the SCENIC pipeline (Single-Cell Regulatory Network Inference and Clustering) which enables biologists to infer transcription factors, gene regulatory networks and cell types from single-cell RNA-seq data.
Only CPU
pySCENIC
Monorail-pipeline and Recount3

BioTuring

Monorail can be used to process local and/or private data, allowing results to be directly compared to any study in recount3. Taken together, Monorail-pipeline tools help biologists maximize the utility of publicly available RNA-seq data, especially to improve their understanding of newly collected data. This is for helping potential users of the Monorail RNA-seq processing pipeline (alignment/quantification) get started running their own data through it.
Only CPU
recount3
Bioturing Massive-scale Analysis Solution: Running analysis for massive-scale data from Seurat dataset

BioTuring

This tool provides a user-friendly and automated way to analyze large-scale single-cell RNA-seq datasets stored in RDS (Seurat) format. It allows users to run various analysis tools on their data in one command, streamlining the analysis workflow and saving time. Note that this notebook is only for the demonstration of the tool. User can run the tool directly through the command line. Currently, we support: - InferCNV - Identifying tumor cells at the single-cell level using machine learning
Only CPU
inferCNV
The comprehensive pipeline for single-cell rnaseq - cellgeni

BioTuring

It is a comprehensive set of premade notebooks available to users on BioColab. These notebooks are designed to guide users through various stages of downstream analysis and to aid them in inspecting their own data. We have five notebooks available, each of which we believe covers an important aspect of downstream analysis. Creation of the Seurat object and some basic exploration of its properties. Estimation of mitochondrial and ribosomal protein percentages among all reads. Calculation of cell cycle scores for each cell. Log-normalization, highly variable gene selection, dimensionality reduction, clustering, and general exploration of the dataset. Quality control and careful removal of low-quality cells. Normalization, scaling, and highly variable gene selection via SCTransform. Clustering with parameter tuning allows the identification of smaller clusters. Marker gene identification for each cluster. Automated cell type annotation using singleR
Notebooks
Required PFP
Only CPU
signac
Required GPU
scgpt
Seurat
Only CPU
CellChat
Only CPU
SoupX
Only CPU
Seurat
Only CPU
Scanpy
Required GPU
CellChat
Only CPU
pySCENIC
Only CPU
recount3
Only CPU
inferCNV