safetyllm
/

quickertype

text-generation-inference

large-language-model

on-device-computing

Model card Files Files and versions Community

safetyllm commited on Jan 21

Commit

1ec07a0

•

1 Parent(s): 5231dbf

Update README.md

Files changed (1) hide show

README.md +5 -1

README.md CHANGED Viewed

@@ -1,4 +1,8 @@
 ---
 license: mit
 ---
-This is a simple 15M-parameter GPT model for auto dialogue completion. (More model details to be added later)

 ---
 license: mit
 ---
+QuicktypeGPT is an on-device C-written large language model (LLM) to assist you typing quicker and carrying out meaningful conversations.
+This model only has 15M parameters (dim = 288, 6 layers, 6 heads and 6 kv heads) and 27MB. The model is pre-trained on a single A40 GPU and can be inferenced through a pure C program on a laptop CPU (e.g. AMD, Intel) with decent quality and speed. This project is to demonstrate that (1) we do not need to train a very sophisticated LLM but can still achieve santisfactory performance if the LLM is only focused on a small and dedicated domain or task, (2) we can deploy small LLMs on edge devices (e.g. desktop, laptop, tablet or phone) to perform inference tasks without relying on the servers in the cloud.
+For more details, please refer to [quicktypeGPT](https://github.com/chaoluond/quicktypeGPT) github project.