CMU-AIR2/math-deepseek-lora-arith-curriculum-per-subject



1 contributor

History: 2 commits

Latest commit: c3e0053 by slackingfred, 9 months ago: "After 5 stages / 150K steps"

.gitattributes

1.52 kB

initial commit 9 months ago
README.md

5.12 kB

After 5 stages / 150K steps 9 months ago
adapter_config.json

743 Bytes

After 5 stages / 150K steps 9 months ago
adapter_model.safetensors

120 MB
LFS

After 5 stages / 150K steps 9 months ago
optimizer.pt
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
240 MB
LFS

After 5 stages / 150K steps 9 months ago
rng_state.pth
Detected Pickle imports (7)
- "numpy.core.multiarray._reconstruct",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "numpy.ndarray",
- "numpy.dtype",
- "_codecs.encode",
- "torch.ByteStorage"
14.2 kB
LFS

After 5 stages / 150K steps 9 months ago
scheduler.pt
Pickle imports
- No problematic imports detected
1.06 kB
LFS

After 5 stages / 150K steps 9 months ago
trainer_state.json

54.5 kB

After 5 stages / 150K steps 9 months ago
training_args.bin
Detected Pickle imports (9)
- "transformers.training_args.TrainingArguments",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.training_args.OptimizerNames",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.trainer_utils.SchedulerType",
- "torch.device",
- "accelerate.state.PartialState",
- "transformers.trainer_utils.HubStrategy"
4.86 kB
LFS

After 5 stages / 150K steps 9 months ago
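
The "Detected Pickle imports" entries above name the modules and classes that each pickle-based file would import when loaded. As a rough illustration of how such a list can be derived without unpickling anything, the sketch below walks a raw pickle stream with the standard-library `pickletools` module and collects the targets of `GLOBAL` and `STACK_GLOBAL` opcodes. The function name `list_pickle_imports` is hypothetical, the `STACK_GLOBAL` handling is a heuristic (it assumes the module and attribute names are the two most recently pushed strings), and this is not necessarily how the Hub's own scanner works. It also assumes raw pickle bytes; a `.pt` file saved in PyTorch's zip-based format would first need its embedded pickle extracted.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set[str]:
    """Return 'module.name' strings referenced by a raw pickle stream.

    The stream is only disassembled, never executed.
    """
    imports = set()
    recent_strings = []  # string arguments seen so far, in order
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: argument is "module name" separated by a space.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are taken from the stack.
            # Heuristic: assume they are the last two strings pushed.
            if len(recent_strings) >= 2:
                module, name = recent_strings[-2:]
                imports.add(f"{module}.{name}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return imports


# Usage: an OrderedDict pickles as a reference to collections.OrderedDict.
payload = pickle.dumps(collections.OrderedDict(), protocol=4)
print(list_pickle_imports(payload))
```

Because the bytes are only disassembled, this kind of inspection is safe to run on untrusted files, whereas calling `pickle.load` on them is not.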