Multimodal_Autoencoder / README.md

wajahatalikhan

Update README.md

94f2bc2 verified 5 months ago

preview code

raw

history blame contribute delete

318 Bytes

metadata

license: apache-2.0
base_model:
  - openai/clip-vit-base-patch32

Multimodal Learning for Autoencoders

Repository of my SIGGRAPH Asia publication. In Multimodal Autoencoder the image is reconstructed using image and text inputs rather than just only image input.

https://dl.acm.org/doi/10.1145/3681756.3697974