5dimension committed · Commit 8c0c0fd · verified · 1 Parent(s): 9714aa3

Initial commit: README.md

  1. README.md +44 -0
README.md ADDED
@@ -0,0 +1,44 @@
# 🎯 Sentinel Quantization

## Overview

**Sentinel Quantization** quantizes model weights using **dynamical constants** from the Sentinel function iteration.

## Key Innovation

Proven dynamical properties of F(z) = Σ zⁿ/nⁿ supply the quantization parameters:

- **Zero-point**: C₁ = −0.007994021805953 (attracting fixed point)
- **Scale factor**: 1/e = 0.367879441171442 (Gradient Axiom limit), multiplied by max|w|

```python
import math  # w: one weight; max_abs_w: max|w| over the tensor; C1: the zero-point above
q = round((w - C1) / (max_abs_w / math.e))
```

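
The formula above can be sketched end to end in plain Python. This is a minimal illustration under stated assumptions, not the code in `sentinel_quantization.py`: the function names `quantize` and `dequantize`, the all-zero fallback, and the inverse mapping `w ≈ q · scale + C₁` are assumptions that the README does not spell out.

```python
import math

# Constants quoted in the README.
C1 = -0.007994021805953      # zero-point
INV_E = 1.0 / math.e         # scale factor, 1/e


def quantize(weights):
    """Map floats to integer codes with q = round((w - C1) / (max|w| * 1/e))."""
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        # Assumption: fall back to a unit range for an all-zero tensor.
        max_abs = 1.0
    scale = max_abs * INV_E
    codes = [round((w - C1) / scale) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Assumed inverse of the mapping: w ≈ q * scale + C1."""
    return [q * scale + C1 for q in codes]


# Illustrative weights only, not data from the repository.
weights = [0.5, -0.25, 0.1, 0.0, -0.5]
codes, scale = quantize(weights)
restored = dequantize(codes, scale)
```

With this scale, `(w - C1) / scale` stays within roughly ±e whenever max|w| is well above |C₁|, so the sketch produces codes between −3 and 3. The README does not say whether the repository rescales further to fill the INT8 range.
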
## Verified Results

| Property | Standard INT8 | Sentinel INT8 |
|----------|---------------|---------------|
| Zero-point | Manual (0 or learned) | **C₁ = −0.007994…** |
| Scale | Empirical (max/255) | **max·(1/e)** |
| Theoretical basis | None | **Dynamical constants** |
| Compression | 4.0× | **4.0×** |
| Dequantization error | ~0.01 | **0.0044** |

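
The README does not state the baseline precision or the error metric behind the last two rows. As a hedged sketch, the 4.0× figure is consistent with storing 8-bit codes in place of 32-bit floats, and a dequantization error could be measured as a mean absolute difference. Both readings are assumptions, the helper names are hypothetical, and the sample numbers are made up for illustration rather than taken from the repository.

```python
def compression_ratio(source_bits=32, target_bits=8):
    """Storage ratio when each weight shrinks from source_bits to target_bits."""
    return source_bits / target_bits


def mean_abs_error(original, restored):
    """Mean absolute difference between original and dequantized weights."""
    return sum(abs(a - b) for a, b in zip(original, restored)) / len(original)


# Assumed FP32 -> INT8 baseline.
ratio = compression_ratio()

# Made-up values that only exercise the metric.
err = mean_abs_error([0.5, -0.25, 0.1], [0.49, -0.26, 0.12])
```
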
## Code

See `sentinel_quantization.py` for:
- `SentinelQuantizer`
- `SentinelQuantizedLinear`
- `demo_sentinel_quantization()` (synthetic model)

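
The README does not show the interfaces of these classes. To give a rough idea of what a quantized linear layer built on the formula from Key Innovation might do, here is a hypothetical pure-Python stand-in. The class name `SketchQuantizedLinear`, its constructor arguments, and its `forward` method are all assumptions, not the API of `SentinelQuantizedLinear`.

```python
import math

C1 = -0.007994021805953  # zero-point quoted in the README


class SketchQuantizedLinear:
    """Hypothetical stand-in, not the repository's SentinelQuantizedLinear."""

    def __init__(self, weight, bias):
        # weight: list of rows (out_features x in_features); bias: list of floats.
        # Assumption: use 1.0 as the range when every weight is zero.
        max_abs = max(abs(w) for row in weight for w in row) or 1.0
        self.scale = max_abs / math.e
        # Store integer codes instead of the float weights.
        self.codes = [[round((w - C1) / self.scale) for w in row] for row in weight]
        self.bias = list(bias)

    def forward(self, x):
        # Dequantize each row on the fly, then apply y = W x + b.
        out = []
        for row, b in zip(self.codes, self.bias):
            w_row = [q * self.scale + C1 for q in row]
            out.append(sum(w * xi for w, xi in zip(w_row, x)) + b)
        return out


# Illustrative 2x2 layer, not a model from the repository.
layer = SketchQuantizedLinear(weight=[[0.5, -0.25], [0.1, 0.0]], bias=[0.0, 1.0])
y = layer.forward([1.0, 2.0])
```

Dequantizing inside `forward` keeps only the integer codes, one scale, and the bias in memory, at the cost of extra arithmetic per call. A real layer would more likely operate on tensors than on Python lists.
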
## Repository

[5dimension/sentinel-quantization](https://huggingface.co/5dimension/sentinel-quantization)

## Applications

- Edge deployment (mobile, IoT)
- Real-time inference
- Memory-constrained environments
- Model compression for LLMs