Publikation

Kintsugi: Learning Policies by Repairing Executable Knowledge Bases

Teng Cao; Yu Deng; Hikaru Shindo; Quentin Delfosse; Lanxi Wen; Suli Wang; Jannis Blüml; Christopher Tauchmann; Kristian Kersting

In: Computing Research Repository eprint Journal (CoRR), Vol. abs/2605.09487, Pages 1-35, arXiv, 2026.

Zusammenfassung

Modern embodied agents achieve impressive performance, but their task knowl- edge is often stored in neural weights, latent state, or prompt-bound memory, making individual policy knowledge difficult to inspect, validate, recombine, and reuse. We introduce Kintsugi, a white-box policy-learning framework that treats embodied policy improvement as verifier-gated construction of a typed executable Knowledge Base (KB). Kintsugi represents task-level policy knowledge as com- posable typed entries—predicates, operators, policy schemas, monitors, recovery rules, experience records, and goals—and improves this artifact through local- ized typed edits induced from rollout evidence, rather than relying on test-time language-model reasoning. Between rollouts, a tool-constrained agentic editing loop diagnoses trajectory failures, localizes them to editable KB layers, and pro- poses candidate edits. A deterministic verification gate admits an edit only when the candidate type-checks, the resulting KB executes, and focused validation suc- cess or trajectory-health metrics improve without violating protected-regression checks. At inference, the accepted KB is executed by a deterministic symbolic executor with zero LLM calls. Across long-horizon text-agent benchmarks and rep- resentative object-centric manipulation settings, Kintsugi achieves strong endpoint performance while preserving inspectability, local editability, and verifier-gated deployment. These results suggest that embodied policy improvement can be organized around executable task knowledge.

Weitere Links

https://doi.org/10.48550/arXiv.2605.09487

2605.09487v1.pdf (pdf, 4 MB )