YxBxRyXJx/QAPhi4_GRPO_250220_merged_16bit
Tags: Text Generation, Transformers, Safetensors, English, llama, text-generation-inference, unsloth, trl, grpo, conversational, Inference Endpoints
License: apache-2.0
Branch: main
1 contributor
History: 4 commits
Latest commit: 8e229ab (verified), "Trained with Unsloth", by YxBxRyXJx, 18 days ago
File                              Size       Storage  Scan  Last commit           Updated
.gitattributes                    1.52 kB    -        Safe  initial commit        18 days ago
README.md                         611 Bytes  -        -     Trained with Unsloth  18 days ago
config.json                       883 Bytes  -        -     Trained with Unsloth  18 days ago
generation_config.json            165 Bytes  -        -     Trained with Unsloth  18 days ago
merges.txt                        917 kB     -        Safe  Upload tokenizer      18 days ago
model-00001-of-00008.safetensors  4.07 GB    LFS      -     Trained with Unsloth  18 days ago
model-00002-of-00008.safetensors  3.91 GB    LFS      -     Trained with Unsloth  18 days ago
model-00003-of-00008.safetensors  4.04 GB    LFS      -     Trained with Unsloth  18 days ago
model-00004-of-00008.safetensors  4.08 GB    LFS      -     Trained with Unsloth  18 days ago
model-00005-of-00008.safetensors  4.08 GB    LFS      -     Trained with Unsloth  18 days ago
model-00006-of-00008.safetensors  4.04 GB    LFS      -     Trained with Unsloth  18 days ago
model-00007-of-00008.safetensors  3.91 GB    LFS      -     Trained with Unsloth  18 days ago
model-00008-of-00008.safetensors  1.21 GB    LFS      -     Trained with Unsloth  18 days ago
model.safetensors.index.json      29.9 kB    -        -     Trained with Unsloth  18 days ago
special_tokens_map.json           570 Bytes  -        Safe  Upload tokenizer      18 days ago
tokenizer_config.json             18.1 kB    -        Safe  Upload tokenizer      18 days ago
vocab.json                        2.01 MB    -        Safe  Upload tokenizer      18 days ago