Soft control
Where a causal probe switches a component fully off, soft control turns a dial. Instead of masking experts, it gently tilts the router’s internal preference scores (“logits”) toward a task and away from the rest, by a small tunable amount — soft router-logit biasing. The size of the nudge, written ε, is the steering knob.
Calibration — planned, not measured. Everything on this page describes a pre-registered experiment that has not been run. The conditions, gates, and decision rule below are design intent, locked in advance. There are no results yet, and the numbers (ε values, condition counts) are the planned sweep, not findings.
Question: can we move smoothly from “no perturbation” to “task-shifted output” while preserving competence?
Soft control = bias-magnitude knob, dose-response, coherence preserved across the transition. Hard ablation answers causal questions; soft control answers steering questions. This is the intended deployable primitive.
S2.1 — Soft router-logit bias
The deployable steering primitive. Same calibration source as S2.0 (humaneval-p99 / wikitext-p99 top-13/bottom-13 per layer), but the application is:
router_logits[task_top_experts] += ε
router_logits[task_bottom_experts] -= ε
Top-k still operates normally. No expert is hard-skipped. ε is the steering knob.
Conditions (13)
- Baseline ε=0
- Humaneval-direction: ε ∈ {0.5, 1.0, 2.0, 4.0}
- Anti-direction: ε ∈ {1.0, 2.0}
- Cross-corpus (wikitext-direction): ε ∈ {1.0, 2.0}
- Random-direction control at every primary ε (budget-matched per-ε)
Decision gates
- G1 dose-monotonicity (principled humaneval-direction)
- G2 smallest-ε task-selective at quality metric (not just drift)
- G3 coherence floor (no degeneracy collapse at the working ε)
- G4 cross-corpus reversal at matched ε
- G5 anti-direction symmetry
- G6 per-ε random-bias trap
- G7 ε=0 sanity
SHIP = soft-control regime exists with quality preserved.
What soft control gives
- Continuous knob, not binary on/off
- Coherence-preserving steering (selection of small ε)
- Composability with task detector (S3.0) for runtime adaptive policy
Why this is the deployable primitive
Per 2026-05-04 strategy note: hard task masks are expensive and blunt. Soft bias amortizes calibration across domains and gives explicit dose-response control. Enables runtime task-conditional policy without per-domain calibration burden.
Status
- S2.1 prereg:
cross-check/preregistry/s2_1_soft_bias_router_steering_2026-05-04/ - Run: gated on S2.0 completion (provides hard-mask ceiling reference)