Soft control

Where a causal probe switches a component fully off, soft control turns a dial. Instead of masking experts, it gently tilts the router’s internal preference scores (“logits”) toward a task and away from the rest, by a small tunable amount — soft router-logit biasing. The size of the nudge, written ε, is the steering knob.

Calibration — planned, not measured. Everything on this page describes a pre-registered experiment that has not been run. The conditions, gates, and decision rule below are design intent, locked in advance. There are no results yet, and the numbers (ε values, condition counts) are the planned sweep, not findings.

Question: can we move smoothly from “no perturbation” to “task-shifted output” while preserving competence?

Soft control = bias-magnitude knob, dose-response, coherence preserved across the transition. Hard ablation answers causal questions; soft control answers steering questions. This is the intended deployable primitive.

S2.1 — Soft router-logit bias

The deployable steering primitive. Same calibration source as S2.0 (humaneval-p99 / wikitext-p99 top-13/bottom-13 per layer), but the application is:

router_logits[task_top_experts]    += ε
router_logits[task_bottom_experts] -= ε

Top-k still operates normally. No expert is hard-skipped. ε is the steering knob.

Conditions (13)

Decision gates

SHIP = soft-control regime exists with quality preserved.

What soft control gives

Why this is the deployable primitive

Per 2026-05-04 strategy note: hard task masks are expensive and blunt. Soft bias amortizes calibration across domains and gives explicit dose-response control. Enables runtime task-conditional policy without per-domain calibration burden.

Status