Add Cosmos3-Nano LIBERO-10 action-policy SFT recipe, config, eval harness, and doc by fwd4 · Pull Request #61 · NVIDIA/cosmos-framework

fwd4 · 2026-06-26T12:59:55Z

What

Adds the Cosmos3-Nano LIBERO-10 action-policy SFT surface, mirroring the existing DROID counterpart (action_policy_droid_nano + action_policy_droid_repro.toml + launch_sft_action_policy_droid.sh + doc).

Feature (net-new)

- Experiment config: action_policy_libero_nano (gen + action heads from the public Cosmos3-Nano GA base).
- Dataset: LIBEROLeRobotDataset + get_action_libero_sft_dataset, with frame_wise_relative rot6d, quantile_rot, concat_view (third-person + wrist), at 20 fps.
  - base_dataset tasks.parquet fallback for community LIBERO layouts.
  - Resample-on-decode-failure guard so a single undecodable packed-mp4 frame can't crash a multi-node run (matches i4 behavior).
- Closed-loop eval harness with vectorized sim, plus a batched /predict_batch path and single-rank no_dist checkpoint load in the policy server.
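
The resample-on-decode-failure guard listed above can be illustrated with a minimal sketch. This is not the code in this PR: `DecodeError`, `get_with_resample`, and the `load` callback are hypothetical names, and the retry limit is an assumed parameter. The idea is only that a sample whose frame fails to decode is swapped for another randomly drawn index instead of raising out of the dataloader.

```python
import random
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


class DecodeError(Exception):
    """Hypothetical error raised when a video frame cannot be decoded."""


def get_with_resample(
    load: Callable[[int], T],
    index: int,
    num_samples: int,
    max_retries: int = 8,
    rng: Optional[random.Random] = None,
) -> T:
    """Load `index`; on a decode failure, retry with a randomly drawn index.

    `load` stands in for the dataset's per-sample loader (an assumption).
    After `max_retries` resamples the error is surfaced, so a fully broken
    dataset still fails loudly instead of looping forever.
    """
    rng = rng or random.Random()
    current = index
    for attempt in range(max_retries + 1):
        try:
            return load(current)
        except DecodeError:
            if attempt == max_retries:
                break
            # Draw a replacement index uniformly from the dataset.
            current = rng.randrange(num_samples)
    raise RuntimeError(
        f"no decodable sample after {max_retries} resamples (started at index {index})"
    )
```

Bounding the retries is a design choice of this sketch: it keeps one bad frame from killing a run while still exposing a systematically corrupt shard.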

Recipe + doc

- Canonical examples/toml/sft_config/action_policy_libero_repro.toml + examples/launch_sft_action_policy_libero.sh: lr 5e-5, warmup 500, cycle 16000, global batch 2048 (HSDP 2x8).
- docs/action_policy_libero_sft.md.
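
The global batch figure can be sanity-checked with a little arithmetic. The sketch below assumes that "HSDP 2x8" means 2 replicas of 8 shards (16 GPUs) and that the global batch is the product of replicas, shards, per-GPU batch, and gradient-accumulation steps; the function names are hypothetical and not part of the recipe.

```python
def global_batch_size(replicate: int, shard: int, per_gpu_batch: int, grad_accum: int = 1) -> int:
    """Global batch = data-parallel ranks x per-GPU batch x accumulation steps."""
    return replicate * shard * per_gpu_batch * grad_accum


def per_gpu_batch(global_batch: int, replicate: int, shard: int, grad_accum: int = 1) -> int:
    """Invert the relation above; reject layouts that do not divide evenly."""
    denom = replicate * shard * grad_accum
    if global_batch % denom != 0:
        raise ValueError(f"global batch {global_batch} is not divisible by {denom}")
    return global_batch // denom


# HSDP 2x8 with grad_accum=1 (the canonical recipe, under the assumptions above).
print(per_gpu_batch(2048, replicate=2, shard=8))  # 128
```

Under the same assumptions, a single 8-GPU node with grad_accum=2 and 128 samples per GPU also reaches 2048, as does an 8x8 layout with 32 samples per GPU.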

Notes

- Scoped to LIBERO only; the broader action-dataloader/model changes are intentionally not included here.
- Based on main.

🤖 Generated with Claude Code

…ness, and doc

Mirrors the DROID action-policy counterpart (action_policy_droid_nano + repro toml + launch + doc).

Net-new LIBERO feature:
- experiment config: action_policy_libero_nano
- dataset: LIBEROLeRobotDataset + get_action_libero_sft_dataset (frame_wise_relative rot6d, quantile_rot, concat_view, 20fps); base_dataset tasks.parquet fallback for community LIBERO layouts; resample-on-decode-failure guard (matches i4 behavior)
- closed-loop eval harness (vectorized sim) + batched /predict_batch inference path + single-rank no_dist checkpoint load for the policy server
- canonical recipe action_policy_libero_repro.toml + launch_sft_action_policy_libero.sh (lr 5e-5, warmup 500, cycle 16000, global batch 2048; ~95% libero_10 500-ep eval)
- docs/action_policy_libero_sft.md

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

- action_sft_dataset.py: rebuild as origin/main + libero-only (drop the speedup-era ShardedDROIDLeRobotDataset import that broke config load on a clean main).
- remove dataset_reply_action_server.py (GT-replay debug tool, not part of the recipe).
- drop DROID/LoRA references from libero docstrings/comments/doc/launch.

fwd4 and others added 9 commits June 26, 2026 20:59

libero(doc): align markdown tables (rumdl-fmt / MD060)

3ecd0a7

libero: trim recipe/doc comments to essentials; HSDP 2x8 ga1 canonical

6b22dd5

Lean the toml/config/launch/doc comments (drop SR numbers and experimental detail), and set the canonical recipe to HSDP 2x8 with grad_accum=1 (global batch 2048) instead of single-node grad_accum=2.

libero: model_loader = origin/main + no_dist only (drop unrelated deletions); EGL setup optional in doc

dd78c68

libero: canonical recipe = HSDP 8x8 (replicate 8, max_samples 32 in launch -> gbs 2048)

5f1847e

libero: recipe = minimum HSDP 2x8 (gbs 2048, grad_accum 1); doc/launch synced

21d34ca

libero: move lower-mem caveat to Heads-up section; drop all-suites mention

82a5a84

libero: lint launch headers (drop GPU counts), drop sweep mention

4d351dd

fwd4 wants to merge 9 commits into NVIDIA:main from fwd4:haolia/libero-action-policy-sft
