File size: 1,113 Bytes
a62cc8b 161eed6 ee90e72 161eed6 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 |
---
license: apache-2.0
---
# Finetuning llama 2 on our celebrity news dataset located [here](https://huggingface.co/datasets/2nji/makebelieve-480)
Disclaimer: This is still work in progress as we need to preprocess our celebrity news dataset to match Llama 2's prompt format as described [here](https://huggingface.co/blog/llama2#how-to-prompt-llama-2)
## Reserve GPU on g5k
Log into your Grid5000 account using ssh and run the following code in the terminal
```script
oarsub -p "cluster='graffiti'" -l gpu=1 -I -q production
```
Wait till GPUs are available and assigned to you, if you need more information about g5k, you can refer to [here](https://www.grid5000.fr/w/Getting_Started)
### Create a virtual environment
- Installing PIP
```script
pip install virtualenv
```
- Creating environment
```script
virtualenv venv
```
- Activating environment
```script
source venv/bin/activate
```
## Install requirements file
```script
pip install -r requirements.txt
```
## Running the script to finetune Llama-2-7b-chat-hf and push to huggingface model repository
```script
python makebelieve.py
```
|