Imaging Mass Cytometry Pipeline
Pipeline Overview
Raw MCD/TIFF Files ──> Image Processing ──> Cell Masks │ ▼ ┌─────────────────────────────────────────────┐ │ imc-pipeline │ ├─────────────────────────────────────────────┤ │ 1. Data Preprocessing (spillover, hot px) │ │ 2. Cell Segmentation (Cellpose/Mesmer) │ │ 3. Single-cell Quantification │ │ 4. Clustering & Phenotyping │ │ 5. Spatial Analysis │ │ 6. Visualization │ └─────────────────────────────────────────────┘ │ ▼ Cell Types + Spatial Neighborhoods
Complete steinbock Workflow
Step 1: Setup and Preprocessing
Initialize steinbock project
steinbock preprocess imc
--mcd data/*.mcd
--panel panel.csv
--output raw/
Hot pixel filtering
steinbock preprocess imc hotpixel
--input raw/
--output img/
--threshold 50
Create nuclear and membrane channels
steinbock preprocess mosaic
--input img/
--channels panel.csv
--output mosaics/
Step 2: Cell Segmentation
Using Cellpose
steinbock segment cellpose
--input img/
--panel panel.csv
--channel DNA1 DNA2
--output masks/
--diameter 20
Alternative: Using Mesmer
steinbock segment mesmer
--input img/
--panel panel.csv
--nuclear DNA1 DNA2
--membrane CD45
--output masks/
Step 3: Single-cell Quantification
Extract intensities
steinbock measure intensities
--input img/
--masks masks/
--panel panel.csv
--output intensities/
Measure cell properties (area, etc.)
steinbock measure regionprops
--masks masks/
--output regionprops/
Extract neighbor relationships
steinbock measure neighbors
--masks masks/
--output neighbors/
--distance 15
Complete Python Workflow
import pandas as pd import numpy as np import anndata as ad import scanpy as sc import squidpy as sq from pathlib import Path
=== 1. LOAD DATA ===
data_dir = Path('steinbock_output')
intensities = pd.read_csv(data_dir / 'intensities.csv', index_col=0) regionprops = pd.read_csv(data_dir / 'regionprops.csv', index_col=0) neighbors = pd.read_csv(data_dir / 'neighbors.csv')
print(f'Loaded {len(intensities)} cells')
=== 2. CREATE ANNDATA ===
adata = ad.AnnData(X=intensities.values, obs=regionprops, var=pd.DataFrame(index=intensities.columns)) adata.obs['image_id'] = [idx.split('_')[0] for idx in intensities.index] adata.obs['cell_id'] = intensities.index
Add spatial coordinates
adata.obsm['spatial'] = regionprops[['centroid_y', 'centroid_x']].values
=== 3. PREPROCESSING ===
Arcsinh transform (cofactor 5 for IMC)
adata.X = np.arcsinh(adata.X / 5)
Scale for clustering
sc.pp.scale(adata, max_value=10) adata.raw = adata.copy()
=== 4. DIMENSIONALITY REDUCTION ===
sc.pp.pca(adata, n_comps=20) sc.pp.neighbors(adata, n_neighbors=15) sc.tl.umap(adata)
=== 5. CLUSTERING ===
sc.tl.leiden(adata, resolution=0.8) print(f'Found {adata.obs["leiden"].nunique()} clusters')
=== 6. PHENOTYPING ===
Marker expression per cluster
sc.tl.rank_genes_groups(adata, 'leiden', method='wilcoxon') marker_genes = sc.get.rank_genes_groups_df(adata, group=None)
Annotate clusters based on markers
cluster_annotations = { '0': 'T cells', '1': 'Macrophages', '2': 'Tumor', '3': 'B cells', '4': 'Stromal' } adata.obs['cell_type'] = adata.obs['leiden'].map(cluster_annotations)
=== 7. SPATIAL ANALYSIS ===
Build spatial graph
sq.gr.spatial_neighbors(adata, coord_type='generic', delaunay=True)
Neighborhood enrichment
sq.gr.nhood_enrichment(adata, cluster_key='cell_type')
Co-occurrence analysis
sq.gr.co_occurrence(adata, cluster_key='cell_type')
Ripley's statistics
sq.gr.ripley(adata, cluster_key='cell_type', mode='L')
=== 8. VISUALIZATION ===
import matplotlib.pyplot as plt
UMAP by cell type
fig, axes = plt.subplots(1, 2, figsize=(14, 5)) sc.pl.umap(adata, color='cell_type', ax=axes[0], show=False) sc.pl.umap(adata, color='leiden', ax=axes[1], show=False) plt.savefig('umap_celltypes.png', dpi=150, bbox_inches='tight')
Spatial plot
fig, ax = plt.subplots(figsize=(10, 10)) sq.pl.spatial_scatter(adata[adata.obs['image_id'] == 'image1'], color='cell_type', shape=None, size=10, ax=ax) plt.savefig('spatial_celltypes.png', dpi=150, bbox_inches='tight')
Neighborhood enrichment heatmap
sq.pl.nhood_enrichment(adata, cluster_key='cell_type') plt.savefig('neighborhood_enrichment.png', dpi=150, bbox_inches='tight')
=== 9. DIFFERENTIAL ANALYSIS ===
Compare conditions
adata.obs['condition'] = adata.obs['image_id'].map({ 'image1': 'Control', 'image2': 'Control', 'image3': 'Treatment', 'image4': 'Treatment' })
Cell type proportions
proportions = adata.obs.groupby(['image_id', 'condition', 'cell_type']).size().unstack(fill_value=0) proportions = proportions.div(proportions.sum(axis=1), axis=0)
Save results
adata.write('imc_analysis.h5ad') proportions.to_csv('cell_type_proportions.csv') print('Analysis complete!')
R Alternative (imcRtools)
library(imcRtools) library(cytomapper) library(CATALYST)
Read steinbock output
spe <- read_steinbock('steinbock_output/')
Transform
assay(spe, 'exprs') <- asinh(counts(spe) / 5)
Cluster
spe <- runDR(spe, features = rownames(spe), exprs_values = 'exprs', dr = 'UMAP') spe <- cluster(spe, features = rownames(spe), exprs_values = 'exprs', xdim = 10, ydim = 10, maxK = 20)
Spatial analysis
spe <- buildSpatialGraph(spe, img_id = 'image_id', type = 'expansion', threshold = 20) spe <- aggregateNeighbors(spe, colPairName = 'neighborhood', by = 'cluster_id')
Spatial context
cn <- detectCommunity(spe, colPairName = 'neighborhood', size_threshold = 10, group_by = 'image_id')
Plot
plotSpatial(spe, img_id = 'image1', node_color_by = 'cluster_id')
QC Checkpoints
Stage Check Action if Failed
Preprocessing No hot pixel streaks Lower threshold
Segmentation
80% cells detected Adjust diameter
Quantification All markers extracted Check panel.csv
Clustering 5-20 clusters Adjust resolution
Spatial Neighbors detected Check distance
Workflow Variants
High-plex Panels (40+ markers)
Use batch-aware clustering
import scvi
scvi.model.SCVI.setup_anndata(adata, batch_key='image_id') model = scvi.model.SCVI(adata) model.train() adata.obsm['X_scvi'] = model.get_latent_representation() sc.pp.neighbors(adata, use_rep='X_scvi')
Tumor Microenvironment Analysis
Spatial interactions with tumor
tumor_cells = adata[adata.obs['cell_type'] == 'Tumor'].obs_names sq.gr.ligrec(adata, cluster_key='cell_type', source_groups=['Tumor'], target_groups=['T cells', 'Macrophages'])
Related Skills
-
imaging-mass-cytometry/data-preprocessing - Hot pixel, spillover
-
imaging-mass-cytometry/cell-segmentation - Cellpose/Mesmer details
-
imaging-mass-cytometry/phenotyping - Cluster annotation
-
imaging-mass-cytometry/spatial-analysis - Spatial statistics
-
imaging-mass-cytometry/interactive-annotation - Manual cell labeling
-
imaging-mass-cytometry/quality-metrics - QC metrics
-
single-cell/clustering - Clustering methods
-
spatial-transcriptomics/spatial-statistics - Related spatial methods