"""Preprocess 122 isometric Grok videos for SCD training. Encodes MP4+TXT pairs into precomputed latents + text embeddings for SCD LoRA training. Uses combined ...
"""Preprocess a Ditto-1M subset for SCD training. Selects N video editing pairs from Ditto-1M, encodes edited videos to VAE latents, uv run python scripts/preprocess_ditto_subset.py --subset-size 500 ...