comaniac commited on
Commit
3e9d5ca
1 Parent(s): 41b0326

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -0
README.md ADDED
@@ -0,0 +1,11 @@
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ## Llama-3-70B-Instruct-FP8-v1
2
+
3
+ * Weights and activations are per-tensor quantized to float8_e4m3.
4
+ * Quantization with AutoFP8.
5
+ * Calibration dataset: Ultrachat (mgoin/ultrachat_2k)
6
+ * Samples: 1024
7
+ * Sequence length: 4096
8
+
9
+ ## Evaluation
10
+
11
+ TBA