---
library_name: transformers
base_model:
  - Sao10K/MN-12B-Lyra-v1
datasets:
  - jondurbin/gutenberg-dpo-v0.1
license: apache-2.0
---

Mostly quanting this to try it out; I didn't see any other EXL2 quants for this model, so here we are.

This is the 8bpw EXL2 version of this model. Find the original here.

- For the 6bpw version, go here.
- For the 4bpw version, go here.
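
If you want to load one of these EXL2 quants directly from Python, a minimal sketch with the exllamav2 API looks roughly like the following. The repo id, local path, and sampling settings are placeholders for illustration, not values taken from this card.

```python
from huggingface_hub import snapshot_download
from exllamav2 import ExLlamaV2, ExLlamaV2Cache, ExLlamaV2Config, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

# Download the quantized weights locally (replace with the repo id of the bpw you want)
model_dir = snapshot_download(repo_id="<this-exl2-repo>")

config = ExLlamaV2Config()
config.model_dir = model_dir
config.prepare()

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)  # allocate the KV cache as the model loads
model.load_autosplit(cache)               # split layers across available GPU memory

tokenizer = ExLlamaV2Tokenizer(config)
generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)

settings = ExLlamaV2Sampler.Settings()
settings.temperature = 0.8
settings.top_p = 0.9

print(generator.generate_simple("Once upon a time,", settings, 200))
```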

# mistral-nemo-gutenberg-12B-v4

Sao10K/MN-12B-Lyra-v1 finetuned on jondurbin/gutenberg-dpo-v0.1.

## Method

Finetuned using an A100 on Google Colab for 3 epochs.

Fine-tune Llama 3 with ORPO
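
For context, a minimal sketch of what an ORPO run over jondurbin/gutenberg-dpo-v0.1 on top of Sao10K/MN-12B-Lyra-v1 might look like with trl, in the spirit of the "Fine-tune Llama 3 with ORPO" guide referenced above. The hyperparameters and memory settings below are illustrative assumptions; the card only states 3 epochs on a single A100.

```python
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

base = "Sao10K/MN-12B-Lyra-v1"
model = AutoModelForCausalLM.from_pretrained(base, torch_dtype=torch.bfloat16, device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(base)

# gutenberg-dpo-v0.1 already provides prompt / chosen / rejected columns,
# which is the format ORPOTrainer expects
dataset = load_dataset("jondurbin/gutenberg-dpo-v0.1", split="train")

args = ORPOConfig(
    output_dir="mistral-nemo-gutenberg-12B-v4",
    num_train_epochs=3,             # stated in the card
    per_device_train_batch_size=1,  # assumption
    gradient_accumulation_steps=8,  # assumption
    learning_rate=5e-6,             # assumption
    beta=0.1,                       # ORPO lambda, assumption
    max_length=2048,
    max_prompt_length=1024,
    bf16=True,
)

trainer = ORPOTrainer(
    model=model,
    args=args,
    train_dataset=dataset,
    tokenizer=tokenizer,
)
trainer.train()
trainer.save_model()
```

Note that a full bf16 fine-tune of a 12B model generally needs extra memory savings on a single A100 (LoRA/QLoRA, gradient checkpointing, or an 8-bit optimizer); the card doesn't say which, if any, were used here.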