yuansui
/

Meta-Llama-3.1-8B-Instruct-PPO-tuned

Reinforcement Learning

Inference Endpoints

Model card Files Files and versions Community

Meta-Llama-3.1-8B-Instruct-PPO-tuned

1 contributor

History: 3 commits

yuansui's picture

Push model using huggingface_hub.

d11c2b6 verified 3 months ago

.gitattributes

1.52 kB

initial commit 3 months ago
README.md

1.33 kB

Push model using huggingface_hub. 3 months ago
adapter_config.json

737 Bytes

Push model using huggingface_hub. 3 months ago
adapter_model.safetensors

83.9 MB
LFS

Push model using huggingface_hub. 3 months ago
config.json

1.3 kB

Push model using huggingface_hub. 3 months ago
pytorch_model.bin
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "torch.BFloat16Storage"
What is a pickle import?
83.9 MB
LFS

Push model using huggingface_hub. 3 months ago
special_tokens_map.json

325 Bytes

Push model using huggingface_hub. 3 months ago
tokenizer.json

9.09 MB

Push model using huggingface_hub. 3 months ago
tokenizer_config.json

55.4 kB

Push model using huggingface_hub. 3 months ago