Mrw33554432 committed on
Commit
b4239f2
1 Parent(s): 4ee4162

Update README.md

Files changed (1)
  1. README.md +2 -1
README.md CHANGED
@@ -22,13 +22,14 @@ Please notice the kernel is not optimzed for 1-bit matrix yet.
  
  The model is trained on a 3090(24GB) for 16 hours.
  
+ ### For faster(3x) inference, check https://github.com/Mrw33554432/Bitlinear4HF and install custom kernel
  ### For training code, check https://github.com/Mrw33554432/Bitlinear4HF.
  
  The training code should be compatible with most of the LLMs in huggingface.
  
  Using pretrained model weight (normal models) for training will not work due to gradient explosion.
  
- ## Sample inference code
+ ## Sample inference code (slow)
  
  
  ```python