How to run on Colab's CPU?

#4
by deepakkaura26 - opened

Can someone suggest or show me, through a piece of code, how to run this model (i.e. MPT-30B-Chat) on Colab's CPU?

Colab has only 12.7 GB of RAM and the MPT-30B-Chat files are almost 60 GB, so it's not possible.

@beoswindvip Can you suggest which other models I can use?

You can run 7B models (with 4-bit or 8-bit quantization) on the Colab free-plan GPU, such as https://huggingface.co/TheBloke/vicuna-7B-v1.3-GPTQ .
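
For example, a rough sketch (untested) that loads a 7B model in 8-bit via transformers + bitsandbytes rather than the GPTQ repo linked above; the model id and generation settings are just placeholders:

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "lmsys/vicuna-7b-v1.3"  # placeholder; any 7B chat model should work similarly

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",   # place layers on the Colab GPU automatically
    load_in_8bit=True,   # 8-bit quantization via bitsandbytes
)

prompt = "Write a job description for a Data Scientist."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))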

@swulling Can these 7B models also run easily on a CPU?

You can use the GGML versions of the models to run on a CPU.

Try GPT4All or llama.cpp.
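
A rough sketch with llama-cpp-python (assuming it is installed and you have downloaded a GGML model file; the filename below is just a placeholder):

from llama_cpp import Llama

# Load a GGML model from disk and run it entirely on the CPU
llm = Llama(model_path="./vicuna-7b-v1.3.ggmlv3.q4_0.bin", n_ctx=2048)

output = llm("Write a job description for a Data Scientist.", max_tokens=256)
print(output["choices"][0]["text"])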

@swulling Firstly, thank you so much. One last question:

from text_generation import InferenceAPIClient

# Client for the model hosted on the Hugging Face Inference API
client = InferenceAPIClient("OpenAssistant/oasst-sft-4-pythia-12b-epoch-3.5")

# Stream tokens as they are generated and accumulate the full answer
complete_answer = ""
for response in client.generate_stream("<|prompter|>Write Job Description for Data Scientist<|endoftext|><|assistant|>"):
    print(response.token)
    complete_answer += response.token.text

print(complete_answer)

Apart from the OpenAssistant/oasst-sft-4-pythia-12b-epoch-3.5 model used in the piece of code above, which other models can I use?

I suggest choosing a Chat model with a higher ranking to achieve better results.

Ref: https://chat.lmsys.org/ Leaderboard
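
If the model you pick is served by the hosted Inference API, the snippet above only needs the repo id swapped out, e.g. (placeholder repo id; check the model card for its own prompt format):

from text_generation import InferenceAPIClient

# Swap in any text-generation model served by the Inference API (placeholder repo id)
client = InferenceAPIClient("some-org/some-chat-model")

complete_answer = ""
for response in client.generate_stream("Write a job description for a Data Scientist."):
    complete_answer += response.token.text

print(complete_answer)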
