shibing624 commited on
Commit
42bd84d
1 Parent(s): 2e1633e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -34,8 +34,8 @@ llama-3-8b-instruct-262k-chinese基于[Llama-3-8B-Instruct-262k](https://hugging
34
 
35
  Quantization | Peak Usage for Encoding 2048 Tokens | Peak Usage for Generating 8192 Tokens
36
  -- | -- | --
37
- FP16/BF16 | 17.66GB | 22.58GB
38
- Int4 | 8.21GB | 13.62GB
39
 
40
 
41
  缺点:
 
34
 
35
  Quantization | Peak Usage for Encoding 2048 Tokens | Peak Usage for Generating 8192 Tokens
36
  -- | -- | --
37
+ FP16/BF16 | 18.66GB | 24.58GB
38
+ Int4 | 9.21GB | 14.62GB
39
 
40
 
41
  缺点: