Practical steering

Practical steering is the runtime system the whole ladder is built toward: a prompt arrives, the model reads its own routing to guess the task (a calibration-free routing readout, S3.0), and that guess picks a soft-bias direction (S2.1) — all with no per-prompt setup and no retraining. The two halves are the task-agnostic substrate (computed once, shared) and the task-conditioned overlay (thin, per-prompt).

Calibration — planned, not measured. This page describes the intended end state. S3.0 and S4.0 are pre-registered and not yet run; S4.0 is additionally gated on S2.1 and S3.0 shipping first. Nothing here is a demonstrated result — it is the program’s target configuration with decision rules written in advance.

Question: can we steer at runtime, with no per-prompt calibration, preserving quality?

Practical steering = task-agnostic substrate + lightweight task-conditioned overlay. Builds on causal probes (S1.0, S2.0) and soft control (S2.1) and is intended to ship the deployable runtime configuration.

S3.0 — Routing-readout task detector

Per-prompt forward → top-k expert distribution per layer → cosine similarity vs cached task priors → predicted task. No calibration data per prompt, no probe training, sub-second overhead.

If S3.0 ships: cheap inference-time task inference enables prompt-conditional soft-bias direction without forcing the user to label the prompt.

Prereg: cross-check/preregistry/s3_0_routing_readout_task_detector_2026-05-04/.

S4.0 — Runtime composition (S3.0 ⨂ S2.1)

prompt arrives
  ↓
S3.0 fingerprint forward pass
  ↓
predicted task (humaneval-aligned vs wikitext-aligned vs ...)
  ↓
S2.1 selects bias direction at small ε (from S0.0 ε_safe atlas)
  ↓
soft-steered generation

Zero training. Reusable substrate. Lightweight task overlay. Composable.

Prereg: deferred until S2.1 SHIP and S3.0 SHIP land.

What practical steering gives

Runtime control without per-domain calibration
Coherence-preserving task adaptation
Cheap to deploy (no probe training, no SAE infrastructure)

What is required before practical steering ships

S0.0 universal safety map: identify ε_safe per (component, layer) — substrate
S2.1 soft-bias dose-response: confirm soft control regime exists
S3.0 routing-readout: confirm task detection at sub-second overhead
S4.0 composition: confirm composing S3.0 ⨂ S2.1 ⨂ S0.0 produces equivalent selectivity to oracle-task S2.1 within 1.2× ratio

Each gate is pre-registered. Practical steering claim depends on all four passing.

Status

S0.0 prereg: cross-check/preregistry/s0_0_universal_safety_map_2026-05-04/
S2.1 prereg: cross-check/preregistry/s2_1_soft_bias_router_steering_2026-05-04/
S3.0 prereg: cross-check/preregistry/s3_0_routing_readout_task_detector_2026-05-04/
S4.0: deferred, gated