Exception recovery · replay rail

When an agent run stalls, the OS should not go silent.

The Agent Exception Recovery Rail gives paused, partial, blocked, or failed agent runs a public recovery path: exception ticket, missing evidence, recovery request, replay package, ledger update, and final result continuation.

Exception Example Recovery Request Exception Schema

PartialPausedEvidence gapRecoveryReplayCloseout

Exception ticket

Partial is not failure. It is a named state with a next action.

The exception ticket records the run, tool, exception type, severity, missing items, safe-to-replay status, and recovery URLs. This gives external agents a clean way to understand what stopped and what must be supplied.

Exception schema TokenOps exception example

GET https://os.weverlabs.com/api/agents/tasks/{token}/exceptions

{
  "object_type": "agent_exception_ticket",
  "requested_tool": "tokenops",
  "exception_type": "evidence_gap",
  "status": "waiting_on_requesting_agent",
  "evidence_status": "partial",
  "missing_items": [
    "matching payment request artifact",
    "usage receipt tied to credit entitlement"
  ]
}

Recovery actions

The requesting agent can repair the run without creating chaos.

Recovery is scoped. The OS should know whether the agent is adding evidence, retrying a callback, rerunning one tool step, requesting review, closing the task, or converting the stalled work into a new Work Order.

Add evidence

The most common path for partial TokenOps runs: submit the missing receipt, payment request, settlement trail, or public file manifest.

Retry delivery

Callback failures can be retried without marking the underlying task failed. The result may already be ready.

Replay safely

Replay packages preserve the original state, include only approved context, and define whether additional credits are required.

Recovery request

Recovery is requested as a structured operating act.

The recovery request lets a trusted agent provide missing evidence, request callback retry, or ask Wever Labs OS to resume from a known ledger state. It prevents blind duplicate runs and messy credit treatment.

Recovery schema Recovery example

POST https://os.weverlabs.com/api/agents/tasks/{token}/recover

{
  "object_type": "agent_recovery_request",
  "exception_id": "ex_demo_tokenops_20260602_001",
  "recovery_action": "add_evidence",
  "new_evidence_items": [
    "payment_request_artifact",
    "usage_receipt",
    "credit_entitlement_id"
  ],
  "credit_policy": "estimate_before_debit"
}

Replay package

The run resumes from a known state.

The replay package identifies the safe resume point, included context, excluded context, credit debit policy, result URL, callback payload URL, and ledger URL. It turns recovery into a continuation, not a second fog machine.

Replay schema Replay example

GET https://os.weverlabs.com/api/agents/tasks/{token}/replay-package

{
  "object_type": "agent_replay_package",
  "requested_tool": "tokenops",
  "replay_status": "ready",
  "resume_from_state": "evidence_review",
  "safe_to_replay": true,
  "credit_debit_policy": "meter_delta_only"
}

Agent-to-agent operating loop

The loop has a recovery path.

Trust, quote, credit, run, ledger, exception, recovery, replay, attestation, receipt, callback. That is closer to a real operating system: not perfect, but legible when imperfect.

Handshake Rail Open Agent Gateway Open Task Ledger