← Back to Home

rfab-harness

Campaign orchestration for de novo antibody design against cancer and rare disease targets

February 2026

📄 Read the Scientific Preprint

Why this exists

In January 2025, the Baker Lab published RFAntibody—the first open-source pipeline for de novo antibody design using diffusion models. It demonstrated atomically accurate design of single-domain antibodies against five therapeutic targets.

The pipeline works. But running it against a new target requires stitching together three separate tools, converting file formats, manually selecting hotspot residues, tuning CDR loop lengths, and parsing raw Quiver score files. Each campaign is an ad hoc scripts-and-prayer affair.

This harness turns a campaign into a single YAML file. Define your target, pick your antibody format, set your thresholds, and run one command. We provide 21 pre-built configs—6 reproducing the original paper, 10 for cancer immunotherapy targets, and 4 for rare diseases—so you can start designing antibodies on day one.

The Pipeline

RFAntibody designs antibodies in three stages, each a separate ML model:

YAML
Campaign Config
Target Prep
Fetch • Truncate • Validate
RFdiffusion
Backbone generation
ProteinMPNN
Sequence design
RF2
Structure prediction
Analysis
Filter • Rank • Export

Stage 1 (RFdiffusion) generates antibody backbone structures using SE(3)-equivariant denoising diffusion. You specify which target residues to contact (hotspots) and how long each CDR loop should be. It produces thousands of diverse backbone conformations.

Stage 2 (ProteinMPNN) fills in amino acid sequences for each backbone, designing multiple sequence variants per structure. The CDR-specific masking ensures the framework regions stay fixed while loop sequences are optimized.

Stage 3 (RF2) predicts the structure of each designed antibody-antigen complex and scores it. Three metrics determine whether a design is worth pursuing:

MetricThresholdWhat it measures
pAE< 10 ÅPredicted aligned error—confidence that the binding interface is real
RMSD< 2 ÅHow well the predicted structure matches the designed backbone
ddG< −20 REUBinding free energy—lower means tighter predicted binding

Designs passing all three filters are ranked by a composite score (0.4×pAE + 0.3×RMSD + 0.3×ddG, min-max normalized) and exported as individual PDB files ready for experimental validation.

What the Harness Adds

21
Pre-built campaigns
33
Unit tests
1 cmd
Config → candidates

Quick Start

# Install
git clone https://github.com/inventcures/repro_rfantibody_for-cancer-targets.git
cd repro_rfantibody_for-cancer-targets
pip install -e .

# Validate a campaign config (no GPU needed)
rfab validate campaigns/smoke_test.yaml

# Dry run — prepare inputs, check everything works
rfab run campaigns/smoke_test.yaml --dry-run --rfantibody-root ./RFAntibody

# Full campaign
rfab run campaigns/cancer/pdl1_vhh.yaml --rfantibody-root ./RFAntibody

# Re-analyze with different thresholds
rfab analyze campaigns/cancer/pdl1_vhh.yaml

Paper Reproductions

Six configs reproduce the targets from Bennett et al. (2025) with exact parameters from the paper:

TargetFormatPDBDesigns
Influenza HA stemVHH4BGW9,000
C. difficile TcdBVHH7UMN10,000
C. difficile TcdBscFv7UMN10,000
RSV Site IIIVHH4JHW10,000
PHOX2B-HLA neoantigenscFvmodeled10,000
SARS-CoV-2 RBDVHH6M0J10,000

Cancer Targets

Ten campaigns targeting validated cancer antigens, prioritized by structural data quality and therapeutic precedent:

Immune Checkpoints

TargetIndicationPDBStrategy
PD-L1Broad solid tumors5N2CVHH targeting BC/DE loop interface
CTLA-4Melanoma, renal1I8LVHH blocking B7 ligand binding
TIGITEmerging checkpoint6V33VHH blocking PVR interaction

Receptor Tyrosine Kinases & Surface Antigens

TargetIndicationPDBStrategy
HER2Breast, gastric1N8ZVHH domain IV (trastuzumab-like)
EGFRNSCLC, colorectal1NQLVHH domain III blocking EGF
TROP-2Solid tumors (ADC)7E5MVHH for ADC conjugation
GPC3Hepatocellular carcinoma7YIOVHH targeting heparan sulfate site
Claudin-18.2Gastric, pancreatic7RFBVHH extracellular loop (exploratory)

B-cell Antigens

TargetIndicationPDBStrategy
CD20B-cell lymphoma6Y4AVHH extracellular loop
CD19B-cell malignancies6AL5scFv (bispecific potential)

Rare Disease Targets

4 Campaigns

TargetIndicationPDBStrategy
Complement C5PNH / aHUS3CU7VHH blocking C5 convertase cleavage
PCSK9Familial hypercholesterolemia3BPSVHH blocking LDLR interaction
IL-6RSystemic JIA1N26VHH blocking IL-6 binding
GNEGNE myopathy4WMNVHH enzyme stabilizer (unconventional)

Technical Details

Campaign Config Schema

Each campaign is a YAML file with six sections:

# campaigns/cancer/pdl1_vhh.yaml
campaign:
  name: "pdl1_vhh"

target:
  pdb_id: "5N2C"
  chain_id: "A"
  epitope_residues: [54, 56, 58, 60, 62, ...]
  hotspot_residues: [56, 60, 115]
  truncation:
    enabled: true
    buffer_angstroms: 10.0

antibody:
  format: "vhh"
  framework: "builtin:NbBCII10"
  cdr_loops:
    H1: "7"
    H2: "6"
    H3: "5-13"    # variable length range

pipeline:
  rfdiffusion:
    num_designs: 10000
  proteinmpnn:
    sequences_per_backbone: 5
    temperature: 0.2

filtering:
  pae_threshold: 10.0
  rmsd_threshold: 2.0
  ddg_threshold: -20.0

Validation Rules

The harness validates 15 rules before any GPU computation:

Antibody Formats

FormatChainsCDR LoopsFrameworkUse Case
VHH (nanobody)H onlyH1, H2, H3NbBCII10Single-domain, small (~15 kDa), stable
scFvH + LH1-H3, L1-L3hu4D5-8Full variable region (~27 kDa), bispecific building block

Composite Scoring

Designs passing all filters are ranked by:

score = 0.4 × norm(pAE) + 0.3 × norm(RMSD) + 0.3 × norm(ddG)

where norm() is min-max normalization across all passing designs (lower composite score = better candidate). Weights reflect that binding confidence (pAE) is the most informative single metric per the original paper.

From Computation to Experiment

The harness includes experimental planning modules that generate protocols for the complete design-to-validation cycle:

This mirrors the validation workflow from the RFAntibody paper, where YSD screening followed by SPR confirmation identified binders from 1–2% of computationally passing designs.

Source code & all campaign configs:
github.com/inventcures/repro_rfantibody_for-cancer-targets

Related: Antibodies from Thin Air — comparison of five de novo antibody design platforms