Representation Alignment for Just Image Transformers is not Easier than You Think

Mar 17, 2026 · Jaeyo Shin*, Jiwook Kim*, Hyunjung Shim

Type: Preprint
Publication: arXiv preprint arXiv:2603.14366
Last updated on Mar 30, 2026

Generative Modeling · Diffusion Models