Better task starts
Prior execution gives the next similar task a stronger first action instead of another generic cold start.
Aionis Runtime learns from every run to improve task starts, stabilize handoffs, reuse successful workflows, and forget intelligently.
npx @ostinato/aionis-runtime startAionis Runtime turns task start, handoff, replay, policy memory, and semantic forgetting into one unified execution-memory loop, so agent systems do not restart from zero every time. They can continue from prior execution, improve over time, and reuse what already worked.
The public product shape today includes the standalone runtime package, Lite, the public TypeScript SDK, the local Inspector and Playground interfaces, and the docs site that explains the product, mechanisms, proof path, and route surfaces.
Aionis learns from real task starts, handoffs, replays, repairs, and governance instead of treating chat history as the product.
The next start, the next resume, and the next reuse path are exposed as explicit runtime surfaces.
Each run can strengthen the next run through better task starts, better replay, and clearer policy memory.
Memory is managed through demotion, archive, relocation, and on-demand restoration instead of growing without control.
| Capability | What it does | Primary surface |
|---|---|---|
| Task Start | Produces a stronger first action for the next similar task | memory.taskStart(...), memory.planningContext(...) |
| Task Handoff | Stores structured recovery state across runs, including target files and next action | handoff.store(...), handoff.recover(...) |
| Task Replay | Records successful execution, promotes stable workflows, and reuses playbooks | memory.replay.run.*, memory.replay.playbooks.* |
| Action Retrieval | Exposes the explicit next-action retrieval layer with evidence, source kind, and retrieval surfaces | memory.actionRetrieval(...), memory.experienceIntelligence(...) |
| Uncertainty Layer | Turns weak retrieval into explicit gates such as inspect, widen recall, rehydrate payload, or request review | memory.taskStart(...), memory.planningContext(...), operator_projection.action_hints[] |
| Policy Memory | Materializes repeated successful execution into governable policy memory | memory.tools.feedback(...), memory.reviewPacks.evolution(...) |
| Semantic Forgetting | Moves memory through retain / demote / archive / review and supports differential rehydration | memory.archive.rehydrate(...), memory.anchors.rehydratePayload(...) |
| Session / Review / Inspect | Exposes continuity state, evolution state, and review entry points | memory.sessions.*, memory.agent.*, memory.executionIntrospect(...) |
| Sandbox / Automation | Executes local shell, playbook, and automation flows | Lite runtime, sandbox, automation routes |
Aionis does not stop at remembering prior execution. It exposes an explicit action-retrieval layer that answers the runtime question directly: what should the agent do next, what evidence supports it, and how certain is that recommendation.
The runtime can now return explicit retrieval evidence, source-kind signals, uncertainty levels, gate actions, and operator hints instead of flattening everything into one overconfident first step.
This means weak retrieval can now escalate task start instead of pretending to be a confident answer. Hosts and operators can inspect the gate surface directly and decide whether to inspect context, widen recall, or rehydrate colder payload before acting.
Read next: Action Retrieval, Uncertainty and Gates, Operator Projection and Action Hints
The self-evolving loop is already defended by six live Lite proofs: better second task start, policy memory materialization, governance loop, continuity provenance preservation, session continuity promotion, and semantic forgetting with differential rehydration.
The same task moved from a generic cold start to a learned file-aware startup.
tool_selection → warm: experience_intelligencesrc/services/billing.tsRepeated positive execution feedback became persisted policy memory instead of staying as loose hints.
materialization_state = "persisted"selected_policy_memory_state = "active"The runtime retired policy memory and reactivated it with fresh live evidence through the public route.
active → retired → activelive_policy_selected_tool = "bash"Stable workflow guidance kept explicit continuity provenance instead of erasing where the learning signal came from.
distillation=handoff_continuity_carrierdistillation=session_event_continuity_carrierRepeated session state now counts as distinct workflow observations instead of depending on an event-only path.
distillation=session_continuity_carrierobserved_count = 2Archived workflow memory now surfaces lifecycle decay, cold relocation, and differential payload restore instead of silently disappearing.
semantic_forgetting.action = "archive"rehydration_mode = "differential"Aionis treats forgetting as lifecycle control. Memory is cooled, relocated, and restored when needed. It is not left to accumulate forever, and it is not reduced to blind deletion either.
Prior execution improves the kickoff for the next similar task instead of forcing another generic first step.
Handoff packets store recovery anchors, target files, next actions, and resume context as runtime state.
Replay runs feed playbook promotion, repair review, and stable workflow reuse across later execution.
Continuity, self-evolution, forgetting, and governance all live on explicit runtime seams. The SDK, routes, stores, sandbox, and automation path are visible and inspectable by design.
| Metric | Current result | Entry point |
|---|---|---|
| Runnable self-evolving proofs | 6 | Proof By Evidence |
| Real-provider A/B snapshot | 31 / 31 vs thin · 25 / 25 vs chat · 25 / 25 vs vector | Validation and Benchmarks |
| Strict replay reuse | 3 / 3, 0 replay tokens | Validation and Benchmarks |
| Lite runtime test suite | 211 / 211 | npm run -s lite:test |
| Public SDK test suite | 14 / 14 | npm run -s sdk:test |
Task start improvement, policy memory, governance, provenance, session continuity, and semantic forgetting are all defended by runnable Lite evidence.
Real-provider A/B coverage now spans repeated starts, forgetting, replay, multi-cycle refinement, and production simulation.
The public SDK surface is validated directly instead of only being implied by route behavior.
The current Lite runtime baseline stays green across replay, recall, handoff, policy, forgetting, and automation.
npx @ostinato/aionis-runtime startIf you are working from a source checkout instead of the published runtime package:
npm install
npm run lite:startnpm install @ostinato/aionisimport { createAionisClient } from "@ostinato/aionis";
const aionis = createAionisClient({ baseUrl: "http://127.0.0.1:3001" });
const taskStart = await aionis.memory.taskStart({
tenant_id: "default",
scope: "docs-home",
query_text: "fix flaky retry in worker.ts",
});
await aionis.handoff.store({
tenant_id: "default",
scope: "docs-home",
anchor: "task:retry-fix",
summary: "Pause after diagnosis",
handoff_text: "Resume in src/worker.ts and patch retry handling.",
target_files: ["src/worker.ts"],
next_action: taskStart.first_action?.next_action ?? "Patch retry handling in src/worker.ts",
});
await aionis.memory.replay.run.start({
tenant_id: "default",
scope: "docs-home",
actor: "docs-home",
run_id: "retry-fix-run-1",
goal: "fix flaky retry in worker.ts",
});npm run example:sdk:core-path| Path | What To Prove | Primary Surfaces |
|---|---|---|
| Core | Continuity works at all | memory.write(...), memory.taskStart(...) or memory.planningContext(...), handoff.store(...), memory.replay.run.* |
| Enhanced | Continuity improves over time | memory.archive.rehydrate(...), memory.nodes.activate(...), memory.reviewPacks.*, memory.sessions.* |
| Advanced | The runtime exposes deeper learning and control | memory.experienceIntelligence(...), memory.executionIntrospect(...), memory.delegationRecords.*, memory.tools.*, memory.rules.*, memory.patterns.* |
Recommended order:
Fastest repository proof:
npm run example:sdk:core-pathBoot the runtime, hit the health route, and confirm the local runtime shape.
Integrate · 10 minWrite memory, ask for task start, store handoff, and move into replay from TypeScript.
Understand · 15 minSee how Lite is organized across shell, bootstrap, host, subsystems, and SQLite-backed stores.