feat(vlm): add Nemotron-Omni RADIO post-load patches by yuekaizhang · Pull Request #2311 · NVIDIA-NeMo/Automodel

yuekaizhang · 2026-05-25T09:31:14Z

This PR improves nemotron-3-omni:

enable_radio_vit_fused_attn(): route RADIO timm ViT attention through F.scaled_dot_product_attention so the (B, H, seq, seq) attention tensor (~5 GiB per block at RADIO-v2-H + dynamic-resolution patch counts) is not materialized.
apply_parameter_freezing(): new freeze_video_embedder knob (default False). patch_generator.video_embedder is only exercised on video inputs; on image-only training it sits in the optimizer without state (no grad → no lazy init), so dcp.load on resume raises a missing-key error. Independent of freeze_vision_tower so the image encoder can stay trainable while the video branch is frozen out.

copy-pr-bot · 2026-05-25T09:31:17Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

HuiyingLi

LGTM thank you @yuekaizhang !

HuiyingLi · 2026-05-25T20:37:59Z

/ok to test 29bae71

HuiyingLi · 2026-05-25T20:41:32Z

Hi @yuekaizhang could you please fix the ci errors, thank you~

- enable_radio_vit_fused_attn(): route RADIO timm ViT attention through F.scaled_dot_product_attention so the (B, H, seq, seq) attention tensor (~5 GiB per block at RADIO-v2-H + dynamic-resolution patch counts) is not materialized. Mirrors the Megatron-Bridge path's vision_config.use_flash_attn=True. No-op on non-RADIO models; invoked unconditionally from apply_model_infrastructure(). - apply_parameter_freezing(): new freeze_video_embedder knob (default False). patch_generator.video_embedder is only exercised on video inputs; on image-only training it sits in the optimizer without state (no grad → no lazy init), so dcp.load on resume raises a missing-key error. Independent of freeze_vision_tower so the image encoder can stay trainable while the video branch is frozen out. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> Signed-off-by: root <zhangyuekai@foxmail.com>

yuekaizhang · 2026-05-26T03:05:23Z

Hi @yuekaizhang could you please fix the ci errors, thank you~

Done. Thanks!

HuiyingLi · 2026-05-26T13:06:46Z

Thank you @yuekaizhang , I think the codecov is still failing. Would you mind fixing that? Appreciate it!

- enable_radio_vit_fused_attn: flips fused_attn on all blocks, resolves vision_model via both top-level and nested model.model paths, no-ops when RADIO is absent, and tolerates blocks that lack an attn attribute. - apply_parameter_freezing(freeze_video_embedder=...): True freezes only patch_generator.video_embedder.* and leaves the rest of patch_generator trainable; False (default) keeps it trainable. Pushes codecov/patch on these additions over the 80% threshold. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> Signed-off-by: root <zhangyuekai@foxmail.com>

yuekaizhang · 2026-05-26T13:49:40Z

/ok to test 9bd5d2c

yuekaizhang · 2026-05-27T00:12:56Z

Thank you @yuekaizhang , I think the codecov is still failing. Would you mind fixing that? Appreciate it!

@HuiyingLi Fixed it now.

HuiyingLi

Thank you!

Pulls in "feat(vlm): add Nemotron-Omni RADIO post-load patches" (NVIDIA-NeMo/Automodel#2311). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

yuekaizhang requested review from HuiyingLi, ZhiyuLi-Nvidia, adil-a, akoumpa, athitten, hemildesai, pthombre and zyzhou5 as code owners May 25, 2026 09:31

HuiyingLi previously approved these changes May 25, 2026

View reviewed changes

copy-pr-bot Bot temporarily deployed to nemo-ci May 25, 2026 20:38 Inactive

copy-pr-bot Bot temporarily deployed to test May 25, 2026 20:38 Inactive

copy-pr-bot Bot temporarily deployed to public May 25, 2026 20:38 Inactive

copy-pr-bot Bot temporarily deployed to public May 25, 2026 20:40 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci May 25, 2026 20:43 Inactive

copy-pr-bot Bot temporarily deployed to public May 25, 2026 20:47 Inactive

yuekaizhang dismissed HuiyingLi’s stale review via 60760f8 May 26, 2026 01:16

yuekaizhang force-pushed the n3-omni-fix branch from 29bae71 to 60760f8 Compare May 26, 2026 01:16

copy-pr-bot Bot temporarily deployed to nemo-ci May 26, 2026 01:22 Inactive

copy-pr-bot Bot temporarily deployed to public May 26, 2026 01:26 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci May 26, 2026 13:50 Inactive

copy-pr-bot Bot temporarily deployed to test May 26, 2026 13:50 Inactive

copy-pr-bot Bot temporarily deployed to public May 26, 2026 13:50 Inactive

copy-pr-bot Bot temporarily deployed to public May 26, 2026 13:53 Inactive

copy-pr-bot Bot temporarily deployed to public May 26, 2026 13:54 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci May 26, 2026 13:55 Inactive

copy-pr-bot Bot temporarily deployed to public May 26, 2026 14:00 Inactive

HuiyingLi enabled auto-merge (squash) May 27, 2026 00:58

HuiyingLi approved these changes May 27, 2026

View reviewed changes

HuiyingLi merged commit 2610809 into NVIDIA-NeMo:main May 27, 2026
77 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(vlm): add Nemotron-Omni RADIO post-load patches#2311

feat(vlm): add Nemotron-Omni RADIO post-load patches#2311
HuiyingLi merged 2 commits into
NVIDIA-NeMo:mainfrom
yuekaizhang:n3-omni-fix

yuekaizhang commented May 25, 2026

Uh oh!

copy-pr-bot Bot commented May 25, 2026

Uh oh!

HuiyingLi left a comment

Uh oh!

HuiyingLi commented May 25, 2026

Uh oh!

HuiyingLi commented May 25, 2026

Uh oh!

yuekaizhang commented May 26, 2026

Uh oh!

HuiyingLi commented May 26, 2026

Uh oh!

yuekaizhang commented May 26, 2026

Uh oh!

yuekaizhang commented May 27, 2026

Uh oh!

HuiyingLi left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

yuekaizhang commented May 25, 2026

Uh oh!

copy-pr-bot Bot commented May 25, 2026

Uh oh!

HuiyingLi left a comment

Choose a reason for hiding this comment

Uh oh!

HuiyingLi commented May 25, 2026

Uh oh!

HuiyingLi commented May 25, 2026

Uh oh!

yuekaizhang commented May 26, 2026

Uh oh!

HuiyingLi commented May 26, 2026

Uh oh!

yuekaizhang commented May 26, 2026

Uh oh!

yuekaizhang commented May 27, 2026

Uh oh!

HuiyingLi left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants