feat(lucebox): autotune sweep + profile/smoke diagnostics (stacked on #335) by easel · Pull Request #5 · easel/lucebox-hub

easel · 2026-06-16T20:49:08Z

Stacked on Luce-Org#335 (core lucebox CLI). Base is `feat/lucebox-cli`, so this PR's diff is only the tuning/diagnostics cluster. Review/merge after Luce-Org#335.

Summary

Restores the second-order tuning + diagnostics surface deferred from Luce-Org#335:

autotune.py: the empirical sweep machinery — `candidate_configs`, the `Profile` registry + per-arch coding-agent-loop brackets, `get_profile`. (`recommend_preset` stays in `download.py` from the feat(lucebox): hub CLI + autotune/sweep/profile + harness adapters + shell wrapper Luce-Org/lucebox-hub#335 split.)
sweep.py: the per-cell sweep driver (config set → restart → luce-bench snapshot → winner pick).
profile.py / smoke.py + the `autotune`, `profile`, `smoke` CLI commands.
Wrapper: the `autotune --sweep` exec-routing special case + autotune/profile/smoke in usage/completion/exec-set.

Tests

sweep / profile / smoke / autotune-cli / candidate-configs suites + the Profile/bracket tests restored. `test_cli` guards that the client launcher verbs stay deferred while autotune/profile/smoke register.

lucebox 122 passed (123 with the inherited recommend_preset test), wrapper 55 passed, ruff + mypy + shellcheck clean.

🤖 Generated with Claude Code

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 39087d8adb

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-06-16T20:53:14Z

    local cmd="$1"; shift
    case "$cmd" in
-        config|models|check|print-run|print-serve-argv)
+        config|smoke|models|check|profile|print-run|print-serve-argv)


Keep profile off the exec fast path

When the server container is already running, this routes lucebox profile through cmd_exec_in_container. But profile.run_profile immediately shells out to docker inspect/docker exec to run luce-bench in the server container, while the long-running server container is started without the Docker socket mounted. In the normal “server running” case the nested profile command therefore cannot see Docker and exits before taking a snapshot; route profile through the docker-run orchestrator like autotune --sweep.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-06-16T20:53:14Z

+    if rc is not None:
+        return rc
+
+    cfg = config_mod.load() or config_mod.live_config()


Preserve scalar env overrides during sweeps

When config.toml exists, this raw load ignores the LUCEBOX_PORT, LUCEBOX_CONTAINER, and LUCEBOX_MODELS values that the wrapper exports and the systemd unit uses. With a non-default port/container, the sweep restarts the right service but then waits on cfg.port or profiles cfg.container_name from the stale/default file values, causing every cell to fail readiness or target the wrong container. Use the same env overlay path as the CLI before building candidates and probing the server.

Useful? React with 👍 / 👎.

Stacked on the core lucebox CLI (Luce-Org#335). Restores the second-order tuning and diagnostics surface deferred from that PR: - autotune.py: the empirical sweep machinery — candidate_configs, the Profile registry + per-arch coding-agent-loop brackets, get_profile. (recommend_preset stays in download.py from the core split.) - sweep.py: the per-cell config sweep driver (config set -> restart -> luce-bench snapshot -> winner pick). - profile.py / smoke.py + the `autotune`, `profile`, `smoke` CLI commands. - Wrapper: the `autotune --sweep` exec-routing special case (sweeps must stay on docker run, not exec into the container they'd restart) and the autotune/profile/smoke entries in usage + completion + the exec set. Tests: sweep / profile / smoke / autotune-cli / candidate-configs suites and the Profile/bracket tests restored; test_cli guards that the client launcher verbs are still deferred while autotune/profile/smoke register. lucebox 122 passed, wrapper 55 passed, ruff + mypy + shellcheck clean. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

easel mentioned this pull request Jun 16, 2026

feat(lucebox): agent-client launchers + harness adapter package (stacked on tuning) #6

Open

chatgpt-codex-connector Bot reviewed Jun 16, 2026

View reviewed changes

easel force-pushed the feat/lucebox-tuning branch from 39087d8 to 9f63eea Compare June 16, 2026 23:12

easel force-pushed the feat/lucebox-tuning branch from 9f63eea to a9a1f30 Compare June 17, 2026 05:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(lucebox): autotune sweep + profile/smoke diagnostics (stacked on #335)#5

feat(lucebox): autotune sweep + profile/smoke diagnostics (stacked on #335)#5
easel wants to merge 1 commit into
feat/lucebox-clifrom
feat/lucebox-tuning

easel commented Jun 16, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Jun 16, 2026

Uh oh!

chatgpt-codex-connector Bot Jun 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

easel commented Jun 16, 2026

Summary

Tests

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant