Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Mousaicv
/
selfrag-lora
like
1
Text Generation
Transformers
Safetensors
gpt4_reward_with_format
mistral
alignment-handbook
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
4-bit precision
bitsandbytes
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
725f617
selfrag-lora
/
eval_results.json
Mousaicv
selfrag zephyr-7b-sft-lora
8f6f1b7
8 months ago
raw
Copy download link
history
blame
No virus
190 Bytes
{
"epoch"
:
2.99
,
"eval_loss"
:
0.09112608432769775
,
"eval_runtime"
:
1063.6125
,
"eval_samples"
:
4693
,
"eval_samples_per_second"
:
4.412
,
"eval_steps_per_second"
:
2.207
}