vicenteor/sbu_captions
Updated
•
10
•
15
Note Popular Transformer for Caption Gen and Visual Question Answering. Trained on sbu_captions
Note Seems to be multi-modal
Note Maybe what we want too?
Note This one looks promising for visual question answering