Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Pedro13543
/
test_grpo
like
0
Text Generation
Transformers
PyTorch
openai/gsm8k
llama
unsloth
trl
grpo
conversational
text-generation-inference
Inference Endpoints
arxiv:
1910.09700
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
test_grpo
1 contributor
History:
9 commits
Pedro13543
Trained with Unsloth
b300421
verified
10 days ago
.gitattributes
Safe
1.57 kB
Upload tokenizer
12 days ago
README.md
5.36 kB
Update README.md
10 days ago
config.json
773 Bytes
Trained with Unsloth
10 days ago
generation_config.json
154 Bytes
Trained with Unsloth
12 days ago
pytorch_model.bin
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"torch.HalfStorage"
,
"collections.OrderedDict"
What is a pickle import?
3.33 GB
LFS
Trained with Unsloth
10 days ago
special_tokens_map.json
Safe
636 Bytes
Upload tokenizer
12 days ago
tokenizer.json
Safe
34.4 MB
LFS
Upload tokenizer
12 days ago
tokenizer.model
Safe
4.24 MB
LFS
Upload tokenizer
12 days ago
tokenizer_config.json
40.5 kB
Upload tokenizer
10 days ago