scRNA Orchestrator¶
End-to-end single-cell RNA-seq pipeline built on Scanpy. QC filtering, normalisation, dimensionality reduction, clustering, marker gene identification, and doublet removal.
Quick Demo¶
CLI Reference¶
python3 skills/scrna-orchestrator/scrna_orchestrator.py \
--input <data.h5ad> \
--output <report_dir>
# With doublet detection
python3 skills/scrna-orchestrator/scrna_orchestrator.py \
--demo --doublet-method scrublet --output /tmp/scrna_doublet_demo
| Argument | Required | Description |
|---|---|---|
--input |
Yes* | Path to AnnData (.h5ad) file |
--output |
Yes | Output directory |
--demo |
No | Use built-in PBMC3k demo data |
--doublet-method |
No | Doublet detection method (scrublet) |
Pipeline Steps¶
- QC Filtering — mitochondrial %, gene counts, cell counts
- Normalisation — library size normalisation + log1p
- HVG Selection — highly variable genes
- PCA — dimensionality reduction
- Neighbourhood Graph — k-NN graph construction
- Clustering — Leiden community detection
- UMAP — 2D embedding visualisation
- Marker Genes — rank_genes_groups per cluster
- Doublet Detection — optional Scrublet integration
Output¶
report.md— Analysis report with QC stats and cluster annotationsfigures/— UMAP plots, QC violin plots, marker gene dotplotsprocessed.h5ad— Processed AnnData object- Reproducibility bundle