This repository provides full reproduction code for all experiments, figures, and tables in the paper. Given the same datasets and pretrained model weights, running the pipeline will produce results ...
"""Preprocess 122 isometric Grok videos for SCD training. Encodes MP4+TXT pairs into precomputed latents + text embeddings for SCD LoRA training. Uses combined ...