1 contributor

History: 4 commits

slackingfred

TypeError: LoraConfig.__init__() got an unexpected keyword argument 'layer_replication'

09a80bf 3 months ago

.gitattributes

1.52 kB

initial commit 3 months ago
README.md

5.12 kB

After 5 stages / 150K steps 3 months ago
adapter_config.json

715 Bytes

TypeError: LoraConfig.__init__() got an unexpected keyword argument 'layer_replication' 3 months ago
adapter_model.safetensors

120 MB
LFS

After 5 stages / 150K steps 3 months ago
command.txt

98 Bytes

Add required configs from math-deepseek-lora-arith-simple-hard-5-step 3 months ago
config.json

631 Bytes

Add required configs from math-deepseek-lora-arith-simple-hard-5-step 3 months ago
optimizer.pt
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
240 MB
LFS

After 5 stages / 150K steps 3 months ago
rng_state.pth
Detected Pickle imports (7)
- "numpy.core.multiarray._reconstruct",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "numpy.ndarray",
- "numpy.dtype",
- "_codecs.encode",
- "torch.ByteStorage"
14.2 kB
LFS

After 5 stages / 150K steps 3 months ago
run_config.json

2.11 kB

Add required configs from math-deepseek-lora-arith-simple-hard-5-step 3 months ago
scheduler.pt

1.06 kB
LFS

After 5 stages / 150K steps 3 months ago
tokenizer.json

1.37 MB

Add required configs from math-deepseek-lora-arith-simple-hard-5-step 3 months ago
tokenizer_config.json

1.87 kB

Add required configs from math-deepseek-lora-arith-simple-hard-5-step 3 months ago
trainer_state.json

54.5 kB

After 5 stages / 150K steps 3 months ago
training_args.bin
Detected Pickle imports (9)
- "transformers.training_args.TrainingArguments",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.training_args.OptimizerNames",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.trainer_utils.SchedulerType",
- "torch.device",
- "accelerate.state.PartialState",
- "transformers.trainer_utils.HubStrategy"
4.86 kB
LFS

After 5 stages / 150K steps 3 months ago
