Foundation model pathology (Computational Pathology AI)

一行要約

Foundation model pathology は self-supervised vision transformer を 数十万-数百万 WSI で pre-train し、downstream task (driver mutation prediction, IO biomarker, prognosis, rare disease classification) に fine-tune / few-shot で展開する computational pathology 手法。UNI / CONCH / Virchow / Prov-GigaPath が 2024 年前後に相次いで release され、hand-crafted radiomic feature を凌駕する WSI embedding を提供。Digital pathology の paradigm を「task-specific CNN」から「general-purpose embedding + 軽量 classifier」へ転換、稀少疾患・少 label setting で特に強み。

原理

(1) Pre-training corpus: TCGA + 病院 PHI-stripped WSI の数十万-数百万枚 (Prov-GigaPath: 1.3M slides from Providence)。(2) SSL strategy: DINOv2 / iBOT / MAE 等 self-supervised vision transformer で patch-level embedding 学習。(3) Aggregation: WSI-level prediction には MIL (multiple instance learning, CLAM / TransMIL) で patch embedding を slide-level に集約。(4) Multi-modal: CONCH (vision-language) で病理画像 + 報告書 contrastive learning、PathChat / GPFM で interactive Q&A。(5) Downstream: linear probe / LoRA / few-shot で task-specific 適応。

主要エビデンス / 適用領域

UNI (Chen et al. SSRN 2024 Nat Med): 100K WSI (>20 organ) で DINOv2 学習、34 task で SOTA、tissue type / driver mutation / prognosis predictive
Virchow (Vorontsov 2024 Nat Med): 1.5M WSI 学習、cancer detection / biomarker / rare cancer で baseline 大幅超え
Prov-GigaPath (Xu et al. CellDeathDis 2023 Nature): 1.3M slide (Providence health system) で gigapixel WSI に対応、whole-slide embedding が組織レベル prediction で improvement
CONCH (Lu 2024 Nat Med): H&E + report 1.17M pair で contrastive learning、zero-shot classification 能力
NSCLC application: H&E から EGFR / ALK / KRAS mutation prediction, TMB prediction, IO response prediction, brain metastasis primary tracing (lung 起源 vs その他)
Rare cancer: thymic / mesothelioma 等 small cohort task で foundation model が hand-crafted feature を凌駕

適用分野と限界

強み: data-hungry な hand-crafted CNN を凌駕, transfer learning で rare task / small dataset 対応, multi-task / multi-organ 適用, AI scribe / interactive analysis (PathChat) の臨床導入加速
限界: pre-training corpus の demographic / institution bias (TCGA 偏向), regulatory / reproducibility (model versioning), explainability (transformer attention の臨床解釈), large compute / model size (deployment infra), foundation model の domain shift (FFPE vs frozen, scanner brand, stain variation), bias monitoring framework 不足

Open Questions

Multi-modal foundation model: pathology + radiology + genomics + clinical text 統合
Federated pre-training: PHI 保護下で multi-institution pre-training
Bias / fairness: race / ethnicity / institution bias の systematic audit
Regulatory pathway: FDA / PMDA の SaMD 承認 (foundation model の continuous learning 監督)
Clinical workflow integration: pathologist co-pilot として導入の outcome benefit prospective evidence

Research Wiki

エクスプローラー

Foundation-model-pathology

Foundation model pathology (Computational Pathology AI)

一行要約

原理

主要エビデンス / 適用領域

適用分野と限界

Open Questions

関連エンティティ

グラフビュー

目次

バックリンク

Research Wiki

エクスプローラー

Foundation-model-pathology

Foundation model pathology (Computational Pathology AI)

一行要約

原理

主要エビデンス / 適用領域

適用分野と限界

Open Questions

関連エンティティ

Basic-Neutrophil and myeloid cell

Basic-Others

Basic-Tumor microenvironment

Clinical-Methodology

Clinical-NSCLC-Driver negative

Clinical-SCLC

Clinical-Supportive care

合成 (Outputs)

グラフビュー

目次

バックリンク