Fine-tuning the model begins with zero loss

#85
by Imran1 - opened

```python
import torch
import transformers
from transformers import TrainingArguments
from peft import LoraConfig, prepare_model_for_kbit_training, get_peft_model

lora_config = LoraConfig(
    r=32,
    lora_alpha=16,
    target_modules=[
        "q_proj",
        "k_proj",
        "v_proj",
        "dense",
        "fc1",
        "fc2",
    ],
    bias="none",
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

HAS_BFLOAT16 = torch.cuda.is_bf16_supported()

training_args = TrainingArguments(
    output_dir="phib",
    max_steps=100,
    per_device_train_batch_size=1,
    gradient_accumulation_steps=4,
    optim="paged_adamw_32bit",
    warmup_steps=10,
    logging_steps=1,
    logging_strategy="steps",
    learning_rate=2e-4,
    fp16=not HAS_BFLOAT16,
    bf16=HAS_BFLOAT16,
    weight_decay=0.01,
    lr_scheduler_type="linear",
    group_by_length=True,
    # disable_tqdm=False,
    report_to="none",
    seed=3407,
)
```
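For reference, here is a minimal sketch of how these pieces are typically wired together, assuming a 4-bit base model (consistent with the `prepare_model_for_kbit_training` import and the `paged_adamw_32bit` optimizer above). The `model_id` and `train_dataset` names are placeholders, not values from this thread:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig, Trainer

model_id = "microsoft/phi-2"  # assumption: swap in the checkpoint you are tuning

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16 if HAS_BFLOAT16 else torch.float16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    trust_remote_code=True,
)

model = prepare_model_for_kbit_training(model)  # casts norms to fp32, enables input grads
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,  # placeholder: your tokenized dataset
    data_collator=transformers.DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```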

Check the loss:

| Step | Training Loss |
|------|---------------|
| 1 | 0.000000 |
| 2 | 0.000000 |
| 3 | 0.000000 |
| 4 | 0.000000 |
| 5 | 0.000000 |
| 6 | 0.000000 |
| 7 | 0.000000 |

Got the same issue with similar settings.

Microsoft org

Could you please try with microsoft/phi-1_5 and report if you are seeing the same issue?

Can't try that right now, but it looks like revision "refs/pr/23" is working. The total number of LoRA trainable parameters is somehow two times higher than before while keeping the same settings. I am wondering whether this is expected (refs/pr/23 vs. latest, Jan 16).
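For anyone wanting to reproduce that comparison, a hedged sketch: load each revision via the standard `revision=` argument of `from_pretrained`, wrap it with the same `lora_config`, and print the PEFT parameter summary (`model_id` is again a placeholder):

```python
from transformers import AutoModelForCausalLM
from peft import get_peft_model

# Sketch: compare LoRA trainable-parameter counts across repo revisions.
# "refs/pr/23" is the revision mentioned above; "main" is the latest.
for rev in ["refs/pr/23", "main"]:
    base = AutoModelForCausalLM.from_pretrained(
        model_id, revision=rev, trust_remote_code=True
    )
    peft_model = get_peft_model(base, lora_config)
    print(rev)
    peft_model.print_trainable_parameters()
```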

Microsoft org

Could you please re-run with the latest update?

We updated the modeling_phi.py file and disabled auto-casting on the Attention layer. This is the same fix that the previous code had.
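To illustrate what disabling auto-casting means here (this is the general pattern, not the actual modeling_phi.py diff): under fp16 autocast the attention logits can overflow, which can surface as degenerate losses like the zeros reported above, so the attention math is forced to run in full precision:

```python
import torch

# Illustration of the pattern only, not the actual modeling_phi.py change:
# opt the attention math out of autocast and compute it in fp32.
def attention(q, k, v, scale):
    with torch.autocast(device_type="cuda", enabled=False):
        q, k, v = q.float(), k.float(), v.float()
        scores = (q @ k.transpose(-2, -1)) * scale
        return torch.softmax(scores, dim=-1) @ v
```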

@gugarosa

Could you please re-run with the latest update?

Great, that works fine. Thanks!

Microsoft org

No problem! Please let me know if you see anything else.

gugarosa changed discussion status to closed
