uwnlp/guanaco-playground-tgi · How to fine-tune the Guanaco (7B, 13B) model?

Jun 1, 2023

edited Jun 1, 2023

I have read this post, https://huggingface.co/blog/4bit-transformers-bitsandbytes, which ends with a demo of the Guanaco Playground: https://huggingface.co/spaces/uwnlp/guanaco-playground-tgi. It works really nicely, but I would like to fine-tune it to my needs. The same article links to a notebook on fine-tuning a QLoRA model (resources section -> https://colab.research.google.com/drive/1VoYNfYDKcKRQRor98Zbf2-9VQTtGJ24k?usp=sharing), but that seems to be about fine-tuning EleutherAI/gpt-neox-20b, a completion model, not the Guanaco chat-instruction model, right? Is there a Colab available that shows how to fine-tune the model used in the guanaco-playground?

artidoro

University of Washington NLP org Jun 1, 2023

We just uploaded scripts to replicate the Guanaco fine-tuning. Take a look at: https://github.com/artidoro/qlora/tree/main/scripts

Let me know if you have questions.

Ichsan2895

Jun 10, 2023

How can I fine-tune Guanaco on a new dataset? For example, is there training code for fine-tuning Guanaco 33B with QLoRA on my own dataset, which contains a new language?
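
For readers with the same question, here is a minimal sketch of one possible first step: converting your own prompt/response pairs into a JSONL file of single-turn chat records. It assumes the "### Human: ... ### Assistant: ..." turn markers and a single "text" field per record, which is how the Guanaco training data appears to be laid out; verify this against the dataset and scripts you actually use. The function names, the output file name, and the sample pairs are hypothetical and only illustrate the idea.

```python
import json
from pathlib import Path

# Assumed turn markers; check them against the dataset your training script expects.
HUMAN = "### Human: "
ASSISTANT = "### Assistant: "


def format_example(prompt, response):
    """Join one prompt/response pair into a single training string."""
    return f"{HUMAN}{prompt.strip()}{ASSISTANT}{response.strip()}"


def write_jsonl(pairs, path):
    """Write (prompt, response) pairs as JSON Lines, one {"text": ...} record per line."""
    path = Path(path)
    with path.open("w", encoding="utf-8") as f:
        for prompt, response in pairs:
            record = {"text": format_example(prompt, response)}
            # ensure_ascii=False keeps non-English text readable in the file.
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
    return path


if __name__ == "__main__":
    # Hypothetical sample data in the new language (Indonesian here).
    pairs = [
        ("Apa ibu kota Indonesia?", "Ibu kota Indonesia adalah Jakarta."),
        ("Terjemahkan 'terima kasih' ke bahasa Inggris.", "Thank you."),
    ]
    write_jsonl(pairs, "my_dataset.jsonl")
```

How the training script is pointed at a file like this depends on the version of the repository, so check the scripts linked above for the dataset options they accept.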