runtime error
ram with shape torch.Size([8388608, 1]) from checkpoint, the shape in current model is torch.Size([4096, 4096]).
size mismatch for model.layers.30.mlp.gate_proj.weight: copying a param with shape torch.Size([29360128, 1]) from checkpoint, the shape in current model is torch.Size([14336, 4096]).
size mismatch for model.layers.30.mlp.up_proj.weight: copying a param with shape torch.Size([29360128, 1]) from checkpoint, the shape in current model is torch.Size([14336, 4096]).
size mismatch for model.layers.30.mlp.down_proj.weight: copying a param with shape torch.Size([29360128, 1]) from checkpoint, the shape in current model is torch.Size([4096, 14336]).
size mismatch for model.layers.31.self_attn.q_proj.weight: copying a param with shape torch.Size([8388608, 1]) from checkpoint, the shape in current model is torch.Size([4096, 4096]).
size mismatch for model.layers.31.self_attn.k_proj.weight: copying a param with shape torch.Size([2097152, 1]) from checkpoint, the shape in current model is torch.Size([1024, 4096]).
size mismatch for model.layers.31.self_attn.v_proj.weight: copying a param with shape torch.Size([2097152, 1]) from checkpoint, the shape in current model is torch.Size([1024, 4096]).
size mismatch for model.layers.31.self_attn.o_proj.weight: copying a param with shape torch.Size([8388608, 1]) from checkpoint, the shape in current model is torch.Size([4096, 4096]).
size mismatch for model.layers.31.mlp.gate_proj.weight: copying a param with shape torch.Size([29360128, 1]) from checkpoint, the shape in current model is torch.Size([14336, 4096]).
size mismatch for model.layers.31.mlp.up_proj.weight: copying a param with shape torch.Size([29360128, 1]) from checkpoint, the shape in current model is torch.Size([14336, 4096]).
size mismatch for model.layers.31.mlp.down_proj.weight: copying a param with shape torch.Size([29360128, 1]) from checkpoint, the shape in current model is torch.Size([4096, 14336]).
You may consider adding `ignore_mismatched_sizes=True` in the model `from_pretrained` method.
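
Note on the shapes in the log: every checkpoint tensor is a flat `[N, 1]` column, while the model expects a 2-D weight matrix. The short check below compares element counts for the shape pairs copied from the log. The grouping labels and the helper name `packing_ratio` are illustrative only. If the ratio comes out as exactly 2 for every pair, that would be consistent with a checkpoint that packs two 4-bit values into each stored byte (for example a 4-bit quantized export). That interpretation is an assumption; the log itself does not state how the checkpoint was produced.

```python
import math

# (checkpoint shape, model shape) pairs copied from the log lines above.
# The dictionary keys are just labels for this sketch.
pairs = {
    "q_proj / o_proj": ((8388608, 1), (4096, 4096)),
    "k_proj / v_proj": ((2097152, 1), (1024, 4096)),
    "gate_proj / up_proj": ((29360128, 1), (14336, 4096)),
    "down_proj": ((29360128, 1), (4096, 14336)),
}

def packing_ratio(ckpt_shape, model_shape):
    """Return how many model elements correspond to one checkpoint element."""
    return math.prod(model_shape) / math.prod(ckpt_shape)

ratios = {name: packing_ratio(c, m) for name, (c, m) in pairs.items()}
for name, ratio in ratios.items():
    print(f"{name}: model elements / checkpoint elements = {ratio}")
```

If the checkpoint is in fact a packed quantized one, `ignore_mismatched_sizes=True` would probably not help: it skips the mismatched tensors and leaves those weights newly initialized instead of loading them. Loading the model with the same quantization settings that were used to save it is the more likely fix, though this cannot be confirmed from the log alone.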