---
datasets:
  - Reza8848/MUFFIN_68k
language:
  - en
license: mit
---

This repository contains the model weights of MUFFIN-T5-11B (MUFFIN: Multi-Faceted Instructions).

We fine-tune the T5-11B model on our MUFFIN dataset.

We released both 3B and 11B models:

| Model | Number of parameters |
| --- | --- |
| MUFFIN-T5-3B | 3 billion |
| MUFFIN-T5-11B | 11 billion |

Please refer to MUFFIN-T5-3B for detailed documentation.
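
For quick orientation, here is a minimal loading sketch using the Hugging Face `transformers` library. It is not taken from the official documentation. The repository id `Reza8848/MUFFIN-T5-11B` is an assumption inferred from the dataset namespace above, and the `bfloat16` and `device_map="auto"` settings are illustrative choices for fitting an 11B checkpoint in memory. The prompt in `main()` is hypothetical; the expected input format is described in the MUFFIN-T5-3B documentation.

```python
"""Minimal loading sketch for MUFFIN-T5-11B (assumptions flagged inline)."""

# Assumption: Hub repository id inferred from the dataset namespace
# (Reza8848/MUFFIN_68k); verify it on the Hub before use.
MODEL_ID = "Reza8848/MUFFIN-T5-11B"


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and seq2seq model; this downloads the full checkpoint."""
    # Imported lazily so the helpers can be imported without the heavy dependencies.
    import torch
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # assumption: half precision to reduce memory
        device_map="auto",           # requires the `accelerate` package
    )
    return tokenizer, model


def generate(tokenizer, model, prompt: str, max_new_tokens: int = 64) -> str:
    """Tokenize a prompt, run generation, and decode the first output sequence."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


def main() -> None:
    tokenizer, model = load_model()
    # Hypothetical prompt, for illustration only.
    print(generate(tokenizer, model, "Instruction: Translate to French.\nInput: Hello."))
```

Calling `main()` downloads the weights on first use, so it is left uninvoked here. The same helpers should work for the 3B model by passing its repository id to `load_model`.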

🥳 Citation

Please cite our paper if you use any resources in this repository:

@inproceedings{Lou2023MUFFIN,
   title={{MUFFIN}: Curating Multi-Faceted Instructions for Improving Instruction Following},
   author={Renze Lou and Kai Zhang and Jian Xie and Yuxuan Sun and Janice Ahn and Hanzi Xu and Yu Su and Wenpeng Yin},
   booktitle={The Twelfth International Conference on Learning Representations},
   year={2024},
   url={https://openreview.net/forum?id=1vrS1zwekw}
}