plaguss/Mistral-7B-v0.1-Math-Shepherd-PRM-0.2

Token Classification

Generated from Trainer

stepwise-reward-trainer

text-generation-inference

Inference Endpoints

1 contributor

History: 2 commits

plaguss HF staff

Training in progress, step 500

dc8ac7a verified 26 days ago

.gitattributes

1.52 kB

initial commit 26 days ago
config.json

673 Bytes

Training in progress, step 500 26 days ago
model-00001-of-00003.safetensors

4.94 GB
LFS

Training in progress, step 500 26 days ago
model-00002-of-00003.safetensors

5 GB
LFS

Training in progress, step 500 26 days ago
model-00003-of-00003.safetensors

4.28 GB
LFS

Training in progress, step 500 26 days ago
model.safetensors.index.json

24 kB

Training in progress, step 500 26 days ago
special_tokens_map.json

437 Bytes

Training in progress, step 500 26 days ago
tokenizer.json

3.51 MB

Training in progress, step 500 26 days ago
tokenizer_config.json

1.03 kB

Training in progress, step 500 26 days ago
training_args.bin
Detected Pickle imports (14)
- "trl.trainer.stepwise_reward_config.StepwiseRewardConfig",
- "transformers.trainer_utils.IntervalStrategy",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.trainer_utils.SchedulerType",
- "transformers.trainer_utils.SaveStrategy",
- "transformers.training_args.OptimizerNames",
- "transformers.integrations.deepspeed.HfDeepSpeedConfig",
- "accelerate.state.PartialState",
- "torch.device",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "torch.bfloat16",
- "transformers.trainer_utils.HubStrategy",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig"
6.78 kB
LFS

Training in progress, step 500 26 days ago
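
The "Detected Pickle imports" list for training_args.bin names the globals a pickle would import when loaded. The sketch below shows one way such a list could be derived with the standard-library pickletools module, without unpickling anything. It is not the Hub's scanner: the function name list_pickle_imports is hypothetical, and the STACK_GLOBAL handling is a heuristic that assumes the module and attribute names are the two most recently pushed strings. Files written by recent versions of torch.save are normally zip archives that wrap the pickle stream, so the raw pickle bytes would need to be extracted from the archive first.

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals referenced by a pickle stream.

    The stream is only disassembled, never loaded, so no code is executed.
    """
    imports = set()
    recent_strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Older protocols: the argument is "module name" in one string.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name were pushed as strings beforehand.
            # Heuristic (assumption): they are the last two strings seen.
            if len(recent_strings) >= 2:
                imports.add(f"{recent_strings[-2]}.{recent_strings[-1]}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return imports


if __name__ == "__main__":
    # A harmless stand-in payload; a real check would read the file's bytes.
    payload = pickle.dumps(OrderedDict(a=1))
    print(sorted(list_pickle_imports(payload)))
```

Disassembling instead of loading is the point of the design: a listing like the one above can be reviewed before deciding whether the file is safe to unpickle.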