matlok's Collections
LMM

Papers - World Sim - Encoder - Video - Phenaki

Phenaki combines an encoder-decoder model that compresses videos into discrete embeddings (tokens) with a transformer model that translates text embeddings into video tokens.
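To make "compresses videos to discrete embeddings (tokens)" concrete, here is a minimal sketch of the general idea, assuming a simple nearest-neighbor lookup against a fixed codebook. It is not Phenaki's actual tokenizer: the names `encode_to_tokens` and `decode_from_tokens`, the 2-D codebook, and the patch values are all hypothetical and chosen only for illustration.

```python
from typing import List, Sequence

Vector = Sequence[float]


def squared_distance(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def encode_to_tokens(patches: List[Vector], codebook: List[Vector]) -> List[int]:
    """Map each continuous patch embedding to the index of its nearest codebook entry."""
    return [
        min(range(len(codebook)), key=lambda i: squared_distance(patch, codebook[i]))
        for patch in patches
    ]


def decode_from_tokens(tokens: List[int], codebook: List[Vector]) -> List[List[float]]:
    """Recover approximate embeddings by looking each token up in the codebook."""
    return [list(codebook[t]) for t in tokens]


# Hypothetical codebook with three 2-D entries (assumption, for illustration only).
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]]

# Hypothetical patch embeddings, standing in for encoder outputs on video patches.
patches = [[0.1, -0.1], [0.9, 1.2], [-0.8, 0.7], [0.2, 0.1]]

tokens = encode_to_tokens(patches, codebook)
reconstruction = decode_from_tokens(tokens, codebook)

print(tokens)
print(reconstruction)
```

In this toy setup, the integer list `tokens` is the discrete representation of the video; a text-conditioned transformer would then be trained to predict such token sequences from text embeddings, and the decoder would turn the predicted tokens back into frames.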