metadata
license: apache-2.0
library_name: transformers
base_model:
  - Qwen/Qwen2.5-14B-Instruct
datasets:
  - jondurbin/gutenberg-dpo-v0.1
  - nbeerbower/gutenberg2-dpo
  - nbeerbower/gutenberg-moderne-dpo


mistral-nemo-gutenberg3-12B

Qwen/Qwen2.5-14B-Instruct fine-tuned on the jondurbin/gutenberg-dpo-v0.1, nbeerbower/gutenberg2-dpo, and nbeerbower/gutenberg-moderne-dpo datasets.

Method

Fine-tuned with ORPO on 8x A100 GPUs for 2 epochs.
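
For readers unfamiliar with ORPO (Odds Ratio Preference Optimization), the sketch below illustrates the shape of the objective for a single preference pair: a standard negative log-likelihood term on the chosen response plus a weighted odds-ratio penalty that favors the chosen response over the rejected one. This card does not publish the training script or hyperparameters, so the function names and the weight `lam=0.1` are hypothetical and chosen for illustration only; this is not the code used to train this model.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) where p = exp(avg_logp).

    avg_logp is the length-normalized (per-token average) log-probability
    of a response, so it must be strictly negative.
    """
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_loss(chosen_avg_logp: float, rejected_avg_logp: float, lam: float = 0.1) -> float:
    """Illustrative ORPO loss for one (chosen, rejected) pair.

    lam is a hypothetical weighting value, not taken from this model card.
    """
    # Supervised term: negative log-likelihood of the chosen response.
    nll = -chosen_avg_logp
    # Log odds ratio between the chosen and rejected responses.
    log_odds_ratio = log_odds(chosen_avg_logp) - log_odds(rejected_avg_logp)
    # Odds-ratio term: -log(sigmoid(x)) written as log(1 + exp(-x)).
    odds_ratio_term = math.log1p(math.exp(-log_odds_ratio))
    return nll + lam * odds_ratio_term


if __name__ == "__main__":
    # The loss is lower when the model assigns more probability to the chosen response.
    print(orpo_loss(-0.5, -2.0))
    print(orpo_loss(-2.0, -0.5))
```

In practice the log-probabilities would come from the model being trained, and a library trainer would typically handle batching and tokenization; the sketch only shows how the two terms combine.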