Generalist Vision Models for Any-to-Any Image-to-Video Understanding