Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Ahjeong
/
dpo_gemma_7b_bf16_lr5e-7_origindset_default_kl0.01_epoch4-epoch4
like
0
Text Generation
Transformers
Safetensors
gemma
conversational
text-generation-inference
Inference Endpoints
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
8b32890
dpo_gemma_7b_bf16_lr5e-7_origindset_default_kl0.01_epoch4-epoch4
1 contributor
History:
2 commits
Ahjeong
Upload GemmaForCausalLM
8b32890
verified
about 16 hours ago
.gitattributes
1.52 kB
initial commit
about 16 hours ago
README.md
5.17 kB
Upload GemmaForCausalLM
about 16 hours ago
config.json
754 Bytes
Upload GemmaForCausalLM
about 16 hours ago
generation_config.json
132 Bytes
Upload GemmaForCausalLM
about 16 hours ago
model-00001-of-00004.safetensors
5 GB
LFS
Upload GemmaForCausalLM
about 16 hours ago
model-00002-of-00004.safetensors
4.98 GB
LFS
Upload GemmaForCausalLM
about 16 hours ago
model-00003-of-00004.safetensors
4.98 GB
LFS
Upload GemmaForCausalLM
about 16 hours ago
model-00004-of-00004.safetensors
2.11 GB
LFS
Upload GemmaForCausalLM
about 16 hours ago
model.safetensors.index.json
20.9 kB
Upload GemmaForCausalLM
about 16 hours ago