metadata
language:
  - zh
license: apache-2.0

Mengzi-oscar-base-caption (Chinese Multi-modal Image Caption model)

Mengzi-oscar-base-caption is fine-tuned from the Chinese multi-modal pre-trained model Mengzi-Oscar on the AIC-ICC Chinese image caption dataset.

Usage

Installation

Check INSTALL.md for installation instructions.

Pretrain & fine-tune

See Mengzi-Oscar.md for details.

Citation

If you find the technical report or this resource useful, please cite the following technical report in your paper.

example