Education / AI-Assisted Biomarker Assessment

AI-Assisted Biomarker
Assessment

How machine learning transforms manual slide scoring into consistent, quantitative measurement — and what that means for reproducibility across labs and clinical trials.

Digital Pathology Automated Scoring Whole Slide Image Algorithmic Scoring Inter-observer Variability Assistive AI
LEARNING OBJECTIVES
Section 01

The Problem AI Is Solving

Manual IHC interpretation — a pathologist examining a stained slide under a microscope and estimating what percentage of cells stain positive, and at what intensity — has been the standard for decades. It works, but it has a known limitation: it’s inherently subjective.

That subjectivity has real clinical consequences. Inter-observer variabilityDisagreement between two pathologists interpreting the same slide — a known limitation of manual IHC scoring that AI tools help reduce. — two pathologists looking at the same slide and arriving at different scores — is a documented feature of manual IHC, not an edge case. For biomarkers where a scoring threshold determines treatment eligibility (HER2 status for targeted therapy, MMR status for immunotherapy, PD-L1 for checkpoint blockade), these inconsistencies affect which patients receive which treatments. The same pathologist may also score a slide differently on two different days, compounding the reliability problem.

Scale adds a third constraint. A single whole slide imageA digital scan of an entire glass slide at high resolution, enabling viewing at any magnification on a computer rather than a physical microscope. can contain millions of cells. Manual scoring requires a pathologist to estimate from a representative sample rather than evaluate every cell — a practical necessity that introduces sampling variability alongside the observer variability already present. The more heterogeneous the tumor, the more consequential that sampling limitation becomes.

Section 02

What AI-Assisted Assessment Does

Machine learning allows algorithms to recognize patterns in annotated datasets and generalize those patterns to new inputs. Applied to digital pathologyThe practice of converting glass slides into high-resolution digital images (whole slide images, or WSIs) that can be viewed, shared, and analyzed computationally., this enables automated image analysisSoftware that scans a digital slide and quantifies features — like staining intensity or cell count — without requiring a human to score each cell manually.: an algorithm scans the entire digitized slide, identifies individual cells, classifies each one by staining intensity, and produces a quantitative score that covers the full tissue section rather than a sampled subset — in a fraction of the time manual scoring would require.

A direct comparison between manual pathological analysis and one such tool (IHC Profiler) resolved in a match rate of 88.6% — demonstrating that automated scoring can closely align with expert human interpretation while offering substantially greater consistency across repeated runs and participating laboratories.

Manual Scoring
  • Pathologist visually scans a representative sample of cells
  • Estimates staining intensity and positive cell percentage
  • Records a score based on a trained visual judgment
Estimate · Observer-dependent
VS
AI-Assisted Scoring
  • Algorithm scans the entire digitized slide at cell level
  • Classifies every cell by staining intensity category
  • Outputs a precise quantitative score from full-section data
Quantitative · Consistent

The goal isn’t to replace the pathologist — it’s to give them a more consistent, quantitative starting point. Most regulatory-approved diagnostic tools using AI are designated as assistive AIAI tools designed to support and augment a pathologist’s interpretation rather than replace it — currently the standard model for regulatory-approved diagnostic AI tools.: they provide a score, the pathologist reviews it, applies clinical context, and makes the final determination. The human remains the decision-maker.

i AI as Assistant, Not Replacement

Current regulatory-approved AI diagnostic tools are designated as assistive — they provide a quantitative score as input to the pathologist’s review, not an independent diagnosis. The pathologist reviews the AI output, applies clinical context, and makes the final call. This is a deliberate regulatory design choice, not a temporary limitation: the goal is to augment expert judgment with quantitative data, not to automate away the expertise.

Section 03

Why Standardization Matters

One of the most significant advantages of algorithmic scoringA numerical score assigned by a machine learning model based on patterns it has learned from large datasets of annotated slides. is reproducibility across sites. A manual score produced in one laboratory may not be directly comparable to a manual score produced in another — even using the same antibody and protocol — because different pathologists calibrate their visual estimates differently. An AI model, once validated, applies the same criteria consistently regardless of who is operating it or where it is deployed.

This matters most for clinical trials, where biomarker scores from patients enrolled at different institutions need to be directly comparable for the trial data to be interpretable. It also matters for emerging biomarkers where scoring thresholds are still being established — consistency in measurement is critical to generating reliable evidence about where those thresholds should fall.

Consistent across observers

The same criteria applied to every slide, every time — eliminating the variability inherent in pathologist-to-pathologist interpretation of staining intensity.

Full tissue coverage

AI scores every cell in the section rather than estimating from a representative sample — reducing sampling bias, especially in heterogeneous tumors.

Reproducible across sites

A validated AI model produces the same output at any participating laboratory, making cross-institution clinical trial data directly comparable.

Section 04

Where This Is Headed

The next generation of biomarkers may be defined through AI-first or AI-only assays — tests designed from the outset without a manual scoring equivalent. Rather than automating an existing manual score, these systems would identify entirely new patterns in slide data that human visual inspection wouldn’t detect: spatial relationships between cell types, subtle texture features, microenvironmental signatures, and correlations between morphological features and treatment response or survival outcomes.

These capabilities build on the same underlying infrastructure — whole slide imaging, standardized tissue preparation, validated antibody staining — that supports current AI-assisted scoring. The direction is toward increasing integration of computational analysis into standard pathology workflow, not as a replacement for expertise, but as a tool that expands what’s measurable.

AI

Reliable AI starts with reliable staining

An AI model trained to score a biomarker at a specific staining intensity can only perform consistently if the staining itself is consistent run to run. PCI-AI’s TriControl™, QuadControl™, and DualControl™ panels are designed to meet that requirement — providing a standardized, validated reference point that ensures the staining input to an AI scoring system falls within the expected range every time.

View control panels →
Sources: MDPI Currents in Oncology (2026) — Machine Learning in Biomarker-Driven Precision Oncology: Automated IHC Scoring in Genitourinary Cancers; PMC (2022) — Digital Pathology: New Initiative in Pathology; PMC (2025) — Augmented Reality Microscopy to Bridge Trust Between AI and Pathologists (npj Precision Oncology); PMC (2014) — IHC Profiler: Open Source Plugin for Quantitative Evaluation and Automated Scoring of IHC Images (match rate 88.6% cited from this source); The Lancet Digital Health (2025) — Application of Artificial Intelligence and Digital Tools in Cancer Pathology.