Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
mkurman
/
Llama-3.2-MedIT-SUN-2.5B-BT-GRPO
like
3
Safetensors
GGUF
open-thoughts/OpenThoughts-114k
Jiayi-Pan/Countdown-Tasks-3to4
FreedomIntelligence/medical-o1-verifiable-problem
llama
Inference Endpoints
conversational
License:
llama3.2
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
Llama-3.2-MedIT-SUN-2.5B-BT-GRPO
1 contributor
History:
7 commits
mkurman
Update README.md
0b8cce0
verified
about 1 month ago
.gitattributes
Safe
1.72 kB
GGUF
about 1 month ago
Llama-3.2-MedIT-SUN-2.5B-R1-Q4_K_M.gguf
1.54 GB
LFS
v-1.0.0
about 1 month ago
Llama-3.2-MedIT-SUN-2.5B-R1-Q8_0.gguf
2.63 GB
LFS
v-1.0.0
about 1 month ago
README.md
4.17 kB
Update README.md
about 1 month ago
config.json
1.06 kB
v-1.0.0
about 1 month ago
generation_config.json
200 Bytes
v-1.0.0
about 1 month ago
model.safetensors
4.94 GB
LFS
v-1.0.0
about 1 month ago
special_tokens_map.json
Safe
454 Bytes
v.0.1.0
about 1 month ago
tokenizer.json
17.2 MB
LFS
v-1.0.0
about 1 month ago
tokenizer_config.json
55.1 kB
v-1.0.0
about 1 month ago