Not able to run the LLM Jais

#1
by decodingdatascience - opened


Even though I got access.

@decodingdatascience you may try to access it now.

Thanks Samta Kamboj

I am using Google Colab, and I am using accelerate as well, but I still get the issue. Am I doing something wrong?

Restart your notebook and install accelerate before importing transformers. This may resolve the issue. The order should be (example below):

  • pip install accelerate
  • from transformers import AutoTokenizer, AutoModelForCausalLM
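
A minimal Colab sequence following that order might look like this (the checkpoint name is only an example; substitute the Jais repo you were granted access to):

!pip install accelerate transformers  # restart the runtime after installing

from transformers import AutoTokenizer, AutoModelForCausalLM

model_path = "core42/jais-13b-chat"  # example checkpoint; use your own
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    device_map="auto",       # requires accelerate
    trust_remote_code=True,  # Jais ships custom modeling code
)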

Thanks Samta, I will restart and let you know if it works. Thanks for the prompt reply.

Still the same error; I will try it in SageMaker later. Thanks Samta.

I was getting errors using it with lower-end GPUs, but got it working on a 48 GB GPU.

Core42 org

You should be able to load it on a smaller V100 (32GB) or A100 (40GB) GPU by using bfloat16 precision. You can achieve this by adding the dtype argument to the from_pretrained call. Additionally, you can further reduce the memory requirement to 13GB (1 x T4) by using int8 precision or 4-bit precision with the help of the bitsandbytes library, but be aware that this may lead to degradation in quality. We have not tested that yet.
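
A sketch of both options, assuming a recent transformers release with bitsandbytes installed (the dtype argument is spelled torch_dtype in most versions, and model_path is whatever Jais checkpoint you are loading):

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# bfloat16: roughly half the memory of float32, fits a 32-40GB card
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    device_map="auto",
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
)

# int8 via bitsandbytes (pip install bitsandbytes), ~13GB on a T4;
# swap in load_in_4bit=True for 4-bit at a larger potential quality cost
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    device_map="auto",
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    trust_remote_code=True,
)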

Thanks! I just did. With int8 the model was sitting at around 21 GB; in my limited tests there is no difference in the quality of the response.
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.54.03              Driver Version: 535.54.03    CUDA Version: 12.2     |
|-----------------------------------------+----------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                                         |                      |               MIG M. |
|=========================================+======================+======================|
|   0  NVIDIA A10G                    On  | 00000000:00:1E.0 Off |                    0 |
|  0%   36C    P0              71W / 300W |  21192MiB / 23028MiB |      9%      Default |
|                                         |                      |                  N/A |
+-----------------------------------------+----------------------+----------------------+

+---------------------------------------------------------------------------------------+
| Processes:                                                                             |
|  GPU   GI   CI        PID   Type   Process name                            GPU Memory |
|        ID   ID                                                             Usage      |
|=======================================================================================|
|    0   N/A  N/A      2547      C   /usr/bin/python3                          21184MiB |
+---------------------------------------------------------------------------------------+

Update the model loading call to add an offload folder:

model = AutoModelForCausalLM.from_pretrained(
    model_path,
    device_map="auto",
    offload_folder="offload",
    offload_state_dict=False,
    trust_remote_code=True,
)

Also add:

model.to(device)
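
For context: offload_folder gives accelerate a place on disk to spill layers that do not fit in GPU memory. Note that with device_map="auto" the model is already placed by accelerate, so the extra model.to(device) call should only be needed when loading without a device map.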

samta-kamboj changed discussion status to closed

@oafzal @Ahmes91 Where do I set the lower precision when loading the model? Are there any parameters for that?
