SLIP: Self-supervision meets Language-Image Pre-training • Paper 2112.12750 • Published Dec 23, 2021
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation • Paper 2411.04709 • Published Nov 5, 2024