mgoin commited on
Commit
09cfe83
1 Parent(s): 5daf216

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -3,6 +3,9 @@ tags:
3
  - fp8
4
  ---
5
 
 
 
 
6
  ```
7
  vllm (pretrained=nm-testing/Mixtral-8x7B-Instruct-v0.1-FP8), gen_kwargs: (None), limit: None, num_fewshot: 5, batch_size: 1
8
  | Groups |Version|Filter|n-shot|Metric|Value | |Stderr|
 
3
  - fp8
4
  ---
5
 
6
+ Mixtral-8x7B-Instruct-v0.1 quantized to FP8 weights and activations, meant to be deployed in vLLM.
7
+
8
+ Accuracy on MMLU:
9
  ```
10
  vllm (pretrained=nm-testing/Mixtral-8x7B-Instruct-v0.1-FP8), gen_kwargs: (None), limit: None, num_fewshot: 5, batch_size: 1
11
  | Groups |Version|Filter|n-shot|Metric|Value | |Stderr|