taku-yoshioka
/

rlhf_llm_custom_rm

Reinforcement Learning

Inference Endpoints

Model card Files Files and versions Community

rlhf_llm_custom_rm

1 contributor

History: 4 commits

taku-yoshioka's picture

Push model using huggingface_hub.

e792766 verified 9 months ago

.gitattributes

1.52 kB

initial commit 10 months ago
README.md

1.28 kB

Push model using huggingface_hub. 9 months ago
adapter_config.json

621 Bytes

Push model using huggingface_hub. 9 months ago
adapter_model.safetensors

14.2 MB
LFS

Push model using huggingface_hub. 9 months ago
pytorch_model.bin
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage"
What is a pickle import?
10.7 kB
LFS

Push model using huggingface_hub. 9 months ago
special_tokens_map.json

397 Bytes

Push model using huggingface_hub. 10 months ago
spiece.model

1.21 MB
LFS

Push model using huggingface_hub. 10 months ago
tokenizer.json

3.73 MB

Push model using huggingface_hub. 10 months ago
tokenizer_config.json

1.65 kB

Push model using huggingface_hub. 10 months ago