Soran
clip_lora_vision_encoder & visual_projector with youtube dataset.
c4b1330 verified