Just out of curiosity

#1
by prudant - opened

How much time did the final training process take?

Owner

@prudant The training speed of AQLM fine-tuning is around 26.25s/example for a 70B model, so it requires ~15h to fine-tune the model over 2000 examples.
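The ~15 h figure follows directly from the reported per-example speed; a quick back-of-the-envelope check (a sketch, using only the numbers from the comment above):

```python
# Back-of-the-envelope check of the fine-tuning time estimate.
seconds_per_example = 26.25   # reported AQLM fine-tuning speed for a 70B model
num_examples = 2000           # size of the fine-tuning set mentioned above

total_hours = seconds_per_example * num_examples / 3600
print(f"{total_hours:.2f} h")  # ≈ 14.58 h, i.e. roughly 15 hours
```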

Thanks!

Can you share the command line and config files?

Why does this take so few resources compared to accelerate/fsdp_config.yaml?
I am working with AlexWortega/miqu-1-70b-AQLM-2Bit-1x16-hf.

(Sorry, I am a beginner.)

@hiyouga Can you please explain how to format the data for training?

Can I convert Mixtral 8x22 to AQLM and then train it using this method on 2x 3090?

@bittamer I think the AQLM quantization process requires a lot of GPU compute (more than 4 GPUs running for a couple of days).

Owner

@bittamer I think FSDP+QLoRA should be more suitable for your case
