Compiled from ## References sections across docs, plus README and ongoing research backlog entries.
- Pi - The best open-source harness, huge source of inspiration. (README)
- oh-my-pi - An awesome customization of Pi with so many goodies and tricks. (README)
- The Harness Problem - Can Bölük, 2026. Benchmark of edit formats across 16 models showing hashline outperforms str_replace and patch. (edit)
- oh-my-pi react-edit-benchmark - Benchmark code and per-run reports. (edit)
- Diff-XYZ benchmark - JetBrains. No single edit format dominates across models and use cases. (edit)
- EDIT-Bench - Only one model achieves over 60% pass@1 on realistic editing tasks. (edit)
- Aider benchmarks - Format choice swung GPT-4 Turbo from 26% to 59%. (edit)
- Cursor Instant Apply - Fine-tuned 70B model for edit application; full rewrite outperforms diffs for files under 400 lines. (edit)
- Erlang `gen_statem` - OTP state machine behaviour used by `Opal.Agent`. (agent-loop)
- Elixir `GenServer` - Messaging model still used by sibling subsystems and APIs around the loop. (agent-loop)
- Erlang/OTP Supervisor Principles - Supervision strategy used by session-local processes and tool tasks. (agent-loop)
- Erlang Distribution Protocol - Official docs covering node naming, cookies, and EPMD. (erlang)
- Erlang Distribution Security Guide - How to enable TLS for inter-node traffic. (erlang)
- LLMDB - Model database powering auto-discovery of models, context windows, and capabilities. (providers)
- OpenAI Reasoning Guide - Official docs for the `reasoning.effort` and `reasoning.summary` parameters on the Responses API. (reasoning)
- Anthropic Extended Thinking - Official docs for budget-based and adaptive thinking modes, including `output_config.effort` levels. (reasoning)
- opencode#6864 - Confirms the Copilot proxy does not return `reasoning_content` for Claude models. Other tools experience the same limitation. (reasoning)
- Handle approvals and user input - Anthropic, 2025. Claude Agent SDK documentation for surfacing approval requests and clarifying questions. Informed the `ask_user` tool design and the planning approach. (user-input, planning)
- Zod JSON Schema - Zod 4's built-in `z.fromJsonSchema()` for deriving Zod schemas (with full type inference) from JSON Schema at runtime. (rpc)
- RFC 8628: OAuth 2.0 Device Authorization Grant - GitHub device-code OAuth flow used by Opal. (installing)
- Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents? - Gloaguen et al., 2026. Finds that AGENTS.md context files tend to reduce task success rates while increasing inference cost by 20%+; recommends minimal requirements only. (arxiv)
- GitHub Actions: macOS 13 runner image is closing down - macos-13 retired Dec 2025. Use `macos-15-intel` for x86_64 builds. Intel macOS support ends Fall 2027 when the macos-15 image retires. (GitHub changelog)
Papers and resources to review and potentially integrate:
- LCM: Lossless Context Management - "We introduce Lossless Context Management (LCM), a deterministic architecture for LLM memory that outperforms Claude Code on long-context tasks. When benchmarked using Opus 4.6, our LCM-augmented coding agent, Volt, achieves higher scores than Claude Code on the OOLONG long-context eval, including at every context length between 32K and 1M tokens."