gradjitta/llama2-7b-merged-finnish-alpaca-buggy


What's this merge about

It's a 500-step checkpoint of the following run:

python ./trl/examples/scripts/sft_trainer.py --model_name meta-llama/Llama-2-7b-hf --dataset_name datacrunch/finnish_alpaca --load_in_4bit --use_peft --batch_size 4 --gradient_accumulation_steps 2

The run used the script at https://github.com/lvwerra/trl/blob/main/examples/scripts/sft_trainer.py
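As a rough sanity check on how much data a 500-step checkpoint has seen, the flags above can be turned into a quick calculation. This is only a sketch: it assumes a step means one optimizer update (as the Hugging Face Trainer counts steps) and that the run used a single device, neither of which is stated above. `num_devices` is a hypothetical name introduced for illustration.

```python
# Values taken from the command line above.
batch_size = 4
gradient_accumulation_steps = 2
checkpoint_step = 500

# Assumption: single-device run (not stated in the model card).
num_devices = 1

# Sequences contributing to one optimizer update.
sequences_per_step = batch_size * gradient_accumulation_steps * num_devices

# Sequences processed by the time the checkpoint was saved.
sequences_seen = sequences_per_step * checkpoint_step

print(f"sequences per optimizer step: {sequences_per_step}")
print(f"sequences seen at step {checkpoint_step}: {sequences_seen}")
```

Under those assumptions the checkpoint has seen about 4,000 training sequences, so it is likely an early snapshot rather than a fully trained model.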

I am still figuring out an efficient way of doing this. In the meantime, you can test it.

Here is an example prompt you can try, which should return a response in Finnish:

"Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request. ### Instruction: Anna kolme vinkkiä terveenä pysymiseen. ###Response:"
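A minimal sketch of how one might build that prompt and run it with the `transformers` library. It assumes the repository holds merged full weights that load with `AutoModelForCausalLM` (the model name suggests this, but it is not confirmed here). The helper names `build_prompt` and `generate`, and the `max_new_tokens` default, are hypothetical choices for illustration. The template reproduces the prompt above exactly, including the missing space in `###Response:`.

```python
# Prompt template copied from the example prompt above, spacing included.
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request. ### Instruction: {instruction} ###Response:"
)


def build_prompt(instruction: str) -> str:
    """Insert a single instruction into the Alpaca-style template."""
    return PROMPT_TEMPLATE.format(instruction=instruction.strip())


def generate(
    instruction: str,
    model_id: str = "gradjitta/llama2-7b-merged-finnish-alpaca-buggy",
    max_new_tokens: int = 256,  # hypothetical default, tune as needed
) -> str:
    """Generate a response. Assumes the repo loads as a plain causal LM."""
    # Imported lazily so that build_prompt works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    inputs = tokenizer(build_prompt(instruction), return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens and decode only the newly generated part.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


if __name__ == "__main__":
    print(build_prompt("Anna kolme vinkkiä terveenä pysymiseen."))
```

Calling `generate("Anna kolme vinkkiä terveenä pysymiseen.")` downloads the 7B weights, so expect a large download and significant memory use. `build_prompt` alone is enough if you are sending the prompt to another inference setup.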
