--- license: mit datasets: - conceptual_captions - sbu_captions - visual_genome language: - en tags: - BridgeTower --- Model weights for AAAI 2023 Oral Paper: [BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning](https://arxiv.org/abs/2206.08657). Additional materials: [Code](https://github.com/microsoft/BridgeTower), [Slides](https://looperxx.github.io/files/BridgeTower-AAAI23-PPT-2023-02-08.pdf), [Video(EN)](https://youtu.be/VoHS6RB9LIg), [Video(CN)](https://www.bilibili.com/video/BV1sT411d7Cr), [Blog(CN)](http://looperxx.github.io/blog/BridgeTower), [Tweet(EN)](https://twitter.com/looperxx27/status/1621862912422993921). BridgeTower has also been integrated into [Transformers](https://github.com/huggingface/transformers/). - [Model Hub](https://huggingface.co/BridgeTower), [Code](https://github.com/huggingface/transformers/tree/main/src/transformers/models/bridgetower) and [Documentation](https://huggingface.co/docs/transformers/main/en/model_doc/bridgetower) are available.