Finetuning llama 2 on our celebrity news dataset located here

Disclaimer: This is still work in progress as we need to preprocess our celebrity news dataset to match Llama 2's prompt format as described here

Reserve GPU on g5k

Log into your Grid5000 account using ssh and run the following code in the terminal

oarsub -p "cluster='graffiti'" -l gpu=1 -I -q production

Wait till GPUs are available and assigned to you, if you need more information about g5k, you can refer to here

Create a virtual environment

  • Installing PIP
pip install virtualenv
  • Creating environment
virtualenv venv
  • Activating environment
source venv/bin/activate

Install requirements file

pip install -r requirements.txt

Running the script to finetune Llama-2-7b-chat-hf and push to huggingface model repository

python makebelieve.py
Downloads last month
13
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.