Vitrine

Video-to-structured-3D-scene pipeline built on LichtFeld Studio. Point it at a folder of videos and it produces a richly-annotated USD scene graph — a textured environment mesh plus individually-reconstructed, correctly-placed 3D hulls of the key objects in the scene — with full video→frame→object lineage, and a compressed Gaussian splat (.ksplat) for web delivery.

This is an isolated fork. Upstream sync is one-way pull only; we never push to or open PRs against the upstream repository. See BOUNDARIES.md.

Pipeline

Drive/video ingest  →  frame extraction + per-image metadata sidecar
   →  COLMAP SfM (ALIKED+LightGlue, SIFT fallback)
   →  3DGS training (LichtFeld: ImprovedGS+ / MRNF / MCMC)
   →  .ksplat (splat-transform)              [web delivery]
   →  SAM3 concept segmentation  →  key-item ranking (min_object_gaussians + keyness)
   →  per-object hull: orbit render → FLUX.2 recovery → TRELLIS.2 / Hunyuan3D-2.1 (textured GLB)
   →  environment mesh (CoMe; MILo / GaussianWrapping / TSDF fallbacks)
   →  USD scene graph (native LichtFeld export; v2g:* metadata + lineage)  +  .ksplat

The pipeline is driven by a pre-run exhibit.toml manifest (exhibit identity, object list, Drive URL, env:-indirected secrets) and orchestrated by an in-container agent. A SOTA idiot-check (pipeline/sota_registry.py, wired into preflight.py) validates the host — staged weights, VRAM fit, version pins, licence posture — before any run.

SOTA stack (research/non-commercial posture)

Web-verified mid-2026; weights staged + verified on the reference host. Run python -m pipeline.sota_registry check to validate.

Element	Model	Licence	VRAM	Notes
Inpaint / recovery	FLUX.2-dev	non-commercial	~34 GB fp8	masked recovery of unseen object faces
Local VLM	gemma-4-26B-A4B-it (Q8_0)	Apache-2.0	~28 GB	artifact analysis + recovery oversight
3D hull (primary)	TRELLIS.2-4B	MIT	~24 GB	single-image → textured PBR mesh
3D hull (fallback)	Hunyuan3D-2.1	Tencent-community	~29 GB	multiview, matches the orbit renderer
GS→surface mesh	CoMe (default)	CC BY-NC-ND	~20 GB	best indoor F1; MILo / PGSR / TSDF fallbacks
Training	ImprovedGS+ (native)	—	—	−27% time vs MCMC; MRNF/MCMC also native
SfM	ALIKED+LightGlue	—	—	via LichtFeld COLMAP plugin; SIFT fallback
USD	native `scene.export_usd`	—	—	LichtFeld v0.5.1+; custom assembler fallback

Commercial deployment requires swapping the non-commercial models (CoMe→PGSR, FLUX.2→Qwen-Image-Edit); the idiot-check fails the run under --commercial if a non-commercial model is selected. See research/decisions/work-order-sota-modernisation.md.

Deployment

# 1. Provision the exhibit manifest (web wizard or hand-edit exhibit.example.toml)
cd onboarding && cargo build --release && ./target/release/vitrine-onboarding   # http://localhost:8088

# 2. Bring up the pipeline containers
docker compose -f docker-compose.consolidated.yml up -d

# 3. Bring up the canonical ComfyUI (owner install + staged model tree, over v2g-net)
scripts/run_comfyui.sh                                                          # http://localhost:8200

# 4. Web UI
# http://localhost:7860

Container	Base	GPU	Purpose
`gaussian-toolkit`	Ubuntu 24.04 / CUDA 12.8 / Py 3.12	0	COLMAP, LichtFeld 3DGS, web UI, Blender, SAM3, pipeline
`vitrine-comfyui`	—	0	FLUX.2 / TRELLIS.2 / Hunyuan / SAM3D ComfyUI (`scripts/run_comfyui.sh`)
`milo`	Ubuntu 22.04 / CUDA 11.8 / Py 3.10	1	MILo (+ optional GaussianWrapping) mesh extraction
`come`	Ubuntu 22.04 / CUDA 12.1 / Py 3.10	1	CoMe mesh extraction (gated: `INSTALL_COME=1`)

Containers share a v2g-net bridge; the pipeline reaches ComfyUI as V2G_COMFYUI_URL=http://vitrine-comfyui:8188.

Port	Service
7860	Web UI (Flask)
8088	Onboarding wizard (Rust/Axum)
8200	Canonical ComfyUI
7681	Web terminal (ttyd / Claude Code orchestrator)
45677	LichtFeld MCP server (70+ tools)
5901	VNC (Blender remote desktop)

Pipeline modules (`src/pipeline/`)

Category	Modules
Core	`stages.py`, `cli.py`, `config.py`, `preflight.py`, `sota_registry.py`
Manifest / infra	`manifest.py` (exhibit.toml), `model_lifecycle.py` (serial VRAM), `endpoints.py` (v2g-net)
Ingestion	`drive_ingestor.py`, `frame_selector.py`, `frame_quality.py`, `fibonacci_sampler.py`
Reconstruction	`colmap_parser.py`, `coordinate_transform.py`, `mcp_client.py`, `gsplat_trainer.py`
Segmentation	`sam2_segmentor.py`, `sam3_segmentor.py`, `sam3d_client.py`, `mask_projector.py`
Hull / recovery	`multiview_renderer.py`, `comfyui_inpainter.py`, `comfyui_control.py`, `hunyuan3d_client.py`
Mesh	`mesh_extractor.py` (TSDF), `milo_extractor.py`, `come_extractor.py`, `gaussianwrapping_extractor.py`, `mesh_cleaner.py`
Delivery / scene	`splat_optimizer.py`, `texture_baker.py`, `material_assigner.py`, `blender_assembler.py`, `usd_assembler.py`
Quality	`quality_gates.py`, `person_remover.py`

Web UI in src/web/ (Flask :7860). Onboarding wizard in onboarding/ (Rust/Axum).

Hardware

Reference host: 2× NVIDIA RTX 6000 Ada (48 GB each), AMD Threadripper PRO, 256 GB RAM, NVMe. The serial model lifecycle keeps peak VRAM at max(stage), not the sum, so a single 48 GB GPU is the practical floor for the full SOTA stack (TSDF-only mesh works on 12 GB).

Status

Validated end-to-end on real data (June 2026). A real 80-frame indoor scene (milo_run) now travels the full path — LichtFeld 3DGS training → SAM3 concept segmentation (5 objects) → depth-aware multi-view object isolation → per-object meshing → native USD + composed textured USD — producing object-resolved dual-USD output. First demonstration of the object-resolved promise on real data; it fixed eight defects surfaced by the run (SAM3 #507 dtype, gsplat SH-degree, Hunyuan kwargs, LichtFeld runtime libs, the D10 projection, native-USD CLI path, Blender bake + USD-export). Caveat: the validation scene was reused (not a fresh Drive→ingest→COLMAP capture); meshes use the gsplat-TSDF fallback (not the SOTA hulls); isolation quality is first-pass (sparse SAM3 detection, no per-view depth test).

Built and host-validated (v3 scaffolding): exhibit.toml manifest + loader, SOTA registry + idiot-check (wired into preflight), serial model lifecycle, v2g-net endpoints, agent-controlled ComfyUI client, native USD export (LichtFeld CLI convert), key-item ranking, depth-aware mask→3D projection (ADR-010 D10), per-image metadata sidecar, secret-stripped config snapshots, the onboarding wizard, and the full SOTA weight set staged (FLUX.2, gemma-4, TRELLIS.2, Hunyuan-2.1, SAM3D). model_lifecycle + comfyui_control suites pass (31/31).

In progress / pending: SOTA single-image hulls (TRELLIS.2/Hunyuan ComfyUI custom-node builds — currently gsplat-TSDF fallback), the FLUX.2 generative recovery loop, gemma VLM serving (agent-vlm), v2g:* USD metadata + native-export parity probe, fresh-capture end-to-end (Drive→ingest→COLMAP), isolation-quality tuning (denser sampling + per-view depth occlusion), automated acceptance gates + golden fixture, and persistence chores (fold libz-ng into the image, mount host-staged weights). See docs/engineering-log.md and the current-state report.

Source video quality is the dominant quality bottleneck: lossy/featureless/reflective footage breaks reconstruction regardless of downstream method. See docs/capture-methodology.md.

Boundaries & license

Fork of LichtFeld Studio (GPL-3.0). Upstream directories (src/core/, src/app/, src/mcp/, src/rendering/, src/training/, src/geometry/, src/io/, …) are not modified; all additions live in src/pipeline/, src/web/, onboarding/, docker/, and scripts/. Pipeline additions: GPL-3.0 (derivative work). See BOUNDARIES.md.

Name		Name	Last commit message	Last commit date
Latest commit History 2,220 Commits
.github		.github
.vscode		.vscode
cmake		cmake
docker		docker
docs		docs
eval		eval
external		external
onboarding		onboarding
report		report
research		research
resources		resources
scripts		scripts
secrets		secrets
src		src
tests		tests
tools		tools
.clang-format		.clang-format
.clangd		.clangd
.dockerignore		.dockerignore
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
.mcp.json		.mcp.json
AGENTS.md		AGENTS.md
BOUNDARIES.md		BOUNDARIES.md
CLAUDE_CONTAINER.md		CLAUDE_CONTAINER.md
CMakeLists.txt		CMakeLists.txt
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile.consolidated		Dockerfile.consolidated
LICENSE		LICENSE
README.md		README.md
THIRD_PARTY_LICENSES.md		THIRD_PARTY_LICENSES.md
build_lichtfeld.ps1		build_lichtfeld.ps1
docker-compose.consolidated.yml		docker-compose.consolidated.yml
exhibit.example.toml		exhibit.example.toml
pins.lock.toml		pins.lock.toml
pins.resolved.toml		pins.resolved.toml
vcpkg-configuration.json		vcpkg-configuration.json
vcpkg.json		vcpkg.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vitrine

Pipeline

SOTA stack (research/non-commercial posture)

Deployment

Pipeline modules (`src/pipeline/`)

Hardware

Status

Boundaries & license

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Vitrine

Pipeline

SOTA stack (research/non-commercial posture)

Deployment

Pipeline modules (src/pipeline/)

Hardware

Status

Boundaries & license

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Pipeline modules (`src/pipeline/`)

Packages