metadata
license: mit
datasets:
- conceptual_captions
- sbu_captions
- visual_genome
language:
- en
tags:
- BridgeTower
Model weights for AAAI 2023 Oral Paper: BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning.
Additional materials: Code, Slides, Video(EN), Video(CN), Blog(CN), Tweet(EN).
BridgeTower has also been integrated into Transformers.
- Model Hub, Code and Documentation are available.