Can foundation models (scGPT, Geneformer, scBERT) pre-trained on single-cell immune atlases predict individual patient drug responses from pre-treatment transcriptomic profiles — and do they outperfor

Question

Accepted Answer

Single-cell foundation models (scFMs) such as **scGPT**, **Geneformer**, and **scFoundation** can predict individual patient drug responses and sensitivity from pre-treatment transcriptomic profiles (Direct, High; PMID: 41044630). While these models demonstrate superior performance over some traditional baselines in drug sensitivity tasks, their superiority is inconsistent across different types of prediction (e.g., transcriptomic perturbation vs. clinical sensitivity) and often depends on the specific biological context or the use of external biological knowledge (Direct, High; PMID: 40269681, PMID: 41044630).

## Performance in Drug Sensitivity and Response Prediction
Large-scale pre-training on single-cell atlases allows foundation models to capture universal biological knowledge, which can be adapted for drug sensitivity tasks (Direct, High; PMID: 41044630).

*   **Sensitivity Benchmarks:** In zero-shot and fine-tuning benchmarks, **scFoundation** and **scGPT** emerged as top performers for drug sensitivity prediction, achieving AUROCs of 0.755 and 0.737, respectively (Direct, High; PMID: 41044630).
*   **Comparison of scFMs:** **LangCell** (AUROC = 0.667) and **Geneformer** (AUROC = 0.601) showed lower performance in clinical sensitivity tasks compared to scFoundation and scGPT in the same benchmark (Direct, High; PMID: 41044630).
*   **Predicting Clinical Outcomes:** The **ENLIGHT–DeepPT** framework, which uses deep learning to impute gene expression from H&E slides and then applies genetic interaction networks, successfully predicted patient response to targeted and immune therapies across five clinical trials with an aggregate odds ratio (OR) of 2.28 (Direct, High; PMID: 38961276).

## Comparisons to Conventional Genomic and Expression Biomarkers
The evidence regarding whether scFMs outperform traditional methods is mixed and depends on the evaluation metric and task type.

*   **Superiority in Sensitivity:** Benchmarking on clinical scenarios found that all evaluated scFMs (scFoundation, scGPT, LangCell, Geneformer, scCello, UCE) outperformed baseline models trained on raw expression counts when predicting drug sensitivity (Direct, High; PMID: 41044630).
*   **Inferiority in Perturbation Prediction:** In tasks predicting post-perturbation RNA-seq vectors (the actual expression change following treatment), scGPT and scFoundation generally underperformed compared to a simple "Train Mean" baseline or standard machine learning models (e.g., Random Forest) that incorporated biological prior knowledge such as Gene Ontology (GO) terms (Direct, High; PMID: 40269681).
*   **Traditional ML Advantages:** Random Forest models using GO features or scELMo (language model-based gene embeddings) outperformed scFMs by a large margin in predicting responses to unseen perturbations in datasets like Adamson or Replogle (Direct, High; PMID: 40269681).

## Clinical Target Identification and Virtual Screening
Foundation models have moved beyond simple sensitivity prediction to identifying novel therapeutic targets through *in silico* perturbations.

*   **Geneformer Case Study:** Geneformer predicted candidate therapeutic targets for cardiomyopathy (e.g., *GSN*, *PLN*); CRISPR-mediated inhibition of these targets significantly improved cardiomyocyte contraction in human iPSC-based disease models (Direct, High; PMID: 37258680).
*   **C2S-Scale Virtual Screen:** Using a multimodal Large Language Model (LLM) framework, researchers identified **silmitasertib** (a CK2 inhibitor) as a context-specific amplifier of MHC-I antigen presentation. This was validated experimentally in primary tumor fragments and neuroendocrine Merkel cell models, where it enhanced immune visibility in the presence of interferon signaling (Direct, High; PMID: 41279114).
*   **DREEP Utility:** The DREEP tool, which utilizes enrichment analysis of single-cell profiles against bulk-derived drug sensitivity signatures, outperformed other single-cell drug prediction tools like **scDRUG** and **scDEAL** in detecting sensitivity differences among individual cells within the same tumor (Direct, High; PMID: 38041118).

## Factors Influencing Model Selection
The choice between a foundation model and a simpler alternative is guided by task complexity and available data (Direct, High; PMID: 41044630).

*   **Roughness Index (ROGI):** Model performance is strongly correlated with the smoothness of the cell-property landscape (ROGI). scFMs often provide a smoother landscape that reduces the difficulty of training task-specific models compared to raw noisy counts (Direct, High; PMID: 41044630).
*   **Design Trade-offs:** scFoundation's strength in drug sensitivity is attributed to read-depth aware pre-training and absolute value projection, making it sensitive to tumor microenvironment heterogeneity (Direct, High; PMID: 41044630).
*   **Data Limitations:** Benchmarking is often limited by low inter-sample variance in common datasets (e.g., Perturb-seq), which can make simple mean-based predictors appear more effective than complex foundation models (Derived, Medium; PMID: 40269681, PMID: 41044630).

Overall, while foundation models offer powerful out-of-the-box capabilities for identifying rare cell populations and context-specific drug targets, they do not universally replace traditional biomarkers or simpler machine learning models, which remain superior for certain transcriptomic prediction tasks (Derived, High; PMID: 41044630, PMID: 40269681).

---

### Unverified Citations

The following sources failed to support their assigned claims after 3 verification rounds designed to ensure only high-confidence, relevant references are retained:

- **PMID:38614094** — **   **Immune Checkpoint Biomarkers:** For anti-PD-1 therapy, the **T cell InteractPrint** score (leveraging weighted int...*  
  Failed: entities — The claim attributes 'T cell InteractPrint' to foundation models (scFMs), but the paper develops this score using traditional UCell signature scoring and receptor-ligand pairing analysis, not scFMs.  
  Possible alternatives (unverified): PMID:39592577 (60% topic match); PMID:34062119 (52% topic match)

Molecular Factor	Link Type	Target	Effect	Context / Mechanism	Reference
Silmitasertib (CK2 inhibitor)	synergy	IFN-beta signaling	amplification	Silmitasertib conditionally amplifies MHC-I antigen presentation in the presence of interferon, as predicted by a dual-context virtual screen using C2S-Scale.	PMID: 41279114
GSN	inhibition	Cardiomyocyte contraction	impairment	In silico deletion of GSN predicted improvement in contractile stress for dilated cardiomyopathy models, which was validated through CRISPR-mediated knockout.	PMID: 37258680
Nutlin	inhibition	MDM2	TP53 stabilization	DREEP accurately predicted that TP53 wild-type cells exhibit sensitivity to Nutlin-induced cell cycle arrest while missense mutants are resistant.	PMID: 38041118
WNT6	activation	FZD10/LRP6 complex	canonical signaling	Epidermal-derived Wnt6 acts as a dominant source for paracrine signaling to dermal trajectory cells during embryonic skin specification.	PMID: 33597522
Afatinib	inhibition	EGFR/HER2	growth inhibition	Retrospective lineage tracing identified distinct tolerant subpopulations in MDA-MB-468 cells that could be targeted using IGF1R or HDAC inhibitors.	PMID: 38041118
OCT4, SOX2, KLF4, MYC	reprogramming	Fibroblast state	iPSC state shift	In silico activation by moving these gene ranks to the front of cell encodings shifted the gene embedding landscape towards an iPSC state in Geneformer.	PMID: 37258680
CSF1	activation	CSF1R	microglial proliferation	Spatial transcriptomics (CosMx) revealed that CSF1-CSF1R signaling strength increases as microglia approach amyloid plaques, driving DAM development.	PMID: 38819990
GAD1/GAD2 and SLC6A11	regulation	GABBR2	hyperpolarization	Increased astrocytic GABA-related signaling to neurons in microglia-dense plaque niches contributes to an imbalance in neuronal synaptic signaling.	PMID: 38819990
Bortezomib	inhibition	26S proteasome	transcriptome rewiring	DREEP captured the shift from a heterogeneous sensitive population to a homogeneous resistant population in MCF7 cells over 96 hours of exposure.	PMID: 38041118
CXCL13	attraction	CXCR5+ B cells	TLS formation	Prenatal LTo-like cells express CXCL13 to facilitate immune cell recruitment and organized lymphoid structure development in the human skin and gut.	PMID: 40993240

Can foundation models (scGPT, Geneformer, scBERT) pre-trained on single-cell immune atlases predict individual patient drug responses from pre-treatment transcriptomic profiles — and do they outperfor

Performance in Drug Sensitivity and Response Prediction

Comparisons to Conventional Genomic and Expression Biomarkers

Clinical Target Identification and Virtual Screening

Factors Influencing Model Selection

Unverified Citations

1. Phases of Evidence Evolution

2. Network Structure and Relationships

3. Mechanisms → Therapies → Outcomes

4. Biases and Reliability

Significance Assessment

Unverified Citations

Hypothesis 1

Mechanistic rationale

Predictions

Study design

Confounders & controls

Risks/limitations

Falsification criteria

Unverified Citations