This repository is a PyTorch/GPU implementation of Fast Spatial Memory with Elastic Test-Time Training, together with an unofficial reimplementation of 4D-LRM.
Please note that this repository is not distributed under a single uniform license: the terms applicable to a given file or implementation path depend on that file's provenance. Comply with the license terms applicable to each file, directory, or implementation path, as described in LICENSE.md and in the corresponding license files and file headers.
We recommend using a virtual environment to manage dependencies. Create one with:
```shell
virtualenv --no-download "venv/fsm" --prompt "fsm"  # Or: python3.10 -m venv venv/fsm
source venv/fsm/bin/activate
```

Then, install the required dependencies:
```shell
pip install --upgrade pip
pip install -r envs/requirements.txt
```

Alternatively, use conda to create an environment:
```shell
conda env create -f envs/environment.yml
```

Pretrained weights are available on Hugging Face.
- Release the `res256` 4D-LVSM models
- Release the `res256` 4D-LRM models
- Release the `res128` base models
- Add detailed model cards
For example, to download the `res256` 4D-LVSM checkpoint:

```python
import os
import shutil

from huggingface_hub import hf_hub_download

repo_id = "marstin/fast-spatial-mem"
local_path = "static/weights"
path_in_repo = "lvsm_checkpoints/fsm_4dlvsm_patch8_res256.pth"

# Download (cached under ~/.cache/huggingface/hub)
cached_path = hf_hub_download(
    repo_id=repo_id,
    filename=path_in_repo,
    repo_type="model",
)

# Copy to your desired local folder
os.makedirs(local_path, exist_ok=True)
target_path = os.path.join(local_path, os.path.basename(path_in_repo))
shutil.copy(cached_path, target_path)
```
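If you need several checkpoints, the cache-then-copy step above generalizes to a small helper. This is a hypothetical convenience wrapper, not part of the repository; it only factors out the copy logic from the snippet above:

```python
import os
import shutil


def place_checkpoint(cached_path: str, local_dir: str) -> str:
    """Copy a cached checkpoint file into local_dir (created if needed).

    Hypothetical helper mirroring the download snippet; returns the
    destination path of the copied file.
    """
    os.makedirs(local_dir, exist_ok=True)
    target = os.path.join(local_dir, os.path.basename(cached_path))
    shutil.copy(cached_path, target)
    return target
```

Combined with `hf_hub_download`, this becomes `place_checkpoint(hf_hub_download(repo_id, filename), "static/weights")` for each checkpoint you need.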
The VGG-19 weights below are required by `fsm/model/losses/perceptual_loss.py`:
```shell
mkdir -p static/weights
wget -O static/weights/imagenet-vgg-verydeep-19.mat https://www.vlfeat.org/matconvnet/models/imagenet-vgg-verydeep-19.mat
cd static/weights
actual="$(md5sum imagenet-vgg-verydeep-19.mat)"
[[ "$actual" == "106118b7cf60435e6d8e04f6a6dc3657  imagenet-vgg-verydeep-19.mat" ]] && echo "OK" || { echo "Mismatch: $actual"; exit 1; }
```

The checksum step should print "OK" if the download is intact.

We provide `quickstart_training.ipynb` and `quickstart_inference.ipynb` for a quick start.
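If `md5sum` is unavailable (macOS ships `md5` instead), the checksum verification above can be done with Python's standard library. This is a sketch, not part of the repository; the expected digest is the one from the shell snippet:

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Stream the file through MD5 in chunks so large weights stay out of RAM."""
    h = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


EXPECTED = "106118b7cf60435e6d8e04f6a6dc3657"
# After downloading, run:
# print("OK" if md5_of_file("static/weights/imagenet-vgg-verydeep-19.mat") == EXPECTED else "Mismatch")
```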
See `static/datasets/example` for the data structure.
- Provide data processing scripts
- Add detailed data cards
To pretrain FSM-LVSM from scratch, run:

```shell
bash scripts/launch_fsm_lvsm_pretrain.sh
```

To pretrain FSM-LRM from scratch, run:

```shell
bash scripts/launch_fsm_lrm_pretrain.sh
```

To upscale the resolution of FSM-LVSM from an existing checkpoint, run:

```shell
bash scripts/launch_fsm_lvsm_finetune.sh
```

To upscale the resolution of FSM-LRM from an existing checkpoint, run:

```shell
bash scripts/launch_fsm_lrm_finetune.sh
```

To evaluate the FSM-LVSM model on the Stereo4D test set, run:

```shell
bash scripts/launch_fsm_lvsm_eval.sh
```

To evaluate the FSM-LRM model on the Stereo4D test set, run:

```shell
bash scripts/launch_fsm_lrm_eval.sh
```

Ziqiao Ma*, Xueyang Yu*, Haoyu Zhen, Yuncong Yang, Joyce Chai, Chuang Gan
```bibtex
@article{ma2026fast,
  title={Fast Spatial Memory with Elastic Test-Time Training},
  author={Ma, Ziqiao and Yu, Xueyang and Zhen, Haoyu and Yang, Yuncong and Chai, Joyce and Gan, Chuang},
  journal={arXiv preprint arXiv:2604.07350},
  year={2026}
}
```

Ziqiao Ma, Xuweiyi Chen, Shoubin Yu, Sai Bi, Kai Zhang, Chen Ziwen, Sihan Xu, Jianing Yang, Zexiang Xu, Kalyan Sunkavalli, Mohit Bansal, Joyce Chai, Hao Tan
```bibtex
@inproceedings{ma20254dlrm,
  title={4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time},
  author={Ma, Ziqiao and Chen, Xuweiyi and Yu, Shoubin and Bi, Sai and Zhang, Kai and Ziwen, Chen and Xu, Sihan and Yang, Jianing and Xu, Zexiang and Sunkavalli, Kalyan and others},
  booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
  year={2025}
}
```