RTO-RL/Llama3-8B-RTO
Reinforced Token Optimization
Safetensors
weqweasdas/ultra_train
llama
Commit History (branch: main)
Update README.md
7d45fa0
verified
zkshan2002
committed on
Feb 11
Update README.md
76e1665
verified
zkshan2002
committed on
Feb 11
Create README.md
71c49be
verified
zkshan2002
committed on
Dec 29, 2024
initial commit
2e81574
verified
zkshan2002
committed on
Dec 29, 2024
initial commit
fba0fe9
verified
zkshan2002
committed on
Dec 29, 2024