Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Mousaicv
/
selfrag-lora
like
1
Text Generation
Transformers
Safetensors
gpt4_reward_with_format
mistral
alignment-handbook
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
4-bit precision
bitsandbytes
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
725f617
selfrag-lora
/
train_results.json
Mousaicv
selfrag zephyr-7b-sft-lora
8f6f1b7
8 months ago
raw
Copy download link
history
blame
No virus
196 Bytes
{
"epoch"
:
2.99
,
"train_loss"
:
0.11520915337848155
,
"train_runtime"
:
83607.727
,
"train_samples"
:
42230
,
"train_samples_per_second"
:
1.515
,
"train_steps_per_second"
:
0.008
}