INC4AI commited on
Commit
84f0ef7
·
verified ·
1 Parent(s): fb0d19b

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ base_model:
4
+ - unsloth/DeepSeek-R1-BF16
5
+ ---
6
+ ## Model Details
7
+
8
+ This model card is for mxfp4 quantization of [unsloth/DeepSeek-R1-BF16](https://huggingface.co/unsloth/DeepSeek-R1-BF16) based on [intel/auto-round](https://github.com/intel/auto-round) saved with llm_compressor format.
9
+ Please follow the license of the original model.
10
+
11
+ ## Run Inference
12
+ ```
13
+ compressed-tensors 0.14.0.1
14
+ transformers 4.57.6
15
+ torch 2.10.0
16
+ vllm 0.19.0
17
+ ```
18
+ ```
19
+ cd vllm
20
+ model_name=INCModel/DeepSeek-R1-MXFP4-LLMC
21
+ python vllm/examples/basic/offline_inference/generate.py \
22
+ --model $model_name \
23
+ --max-model-len 2048 \
24
+ -tp 8 \
25
+ --trust-remote-code \
26
+ --enforce-eager \
27
+ --gpu-memory-utilization 0.8
28
+ ```
29
+ ## Ethical Considerations and Limitations
30
+
31
+ The model can produce factually incorrect output, and should not be relied on to produce factually accurate information.
32
+ Because of the limitations of the pretrained model and the finetuning datasets, it is possible that this model could generate lewd, biased or otherwise offensive outputs.
33
+
34
+ Therefore, before deploying any applications of the model, developers should perform safety testing.
35
+
36
+ ## Caveats and Recommendations
37
+
38
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model.
39
+
40
+ Here are a couple of useful links to learn more about Intel's AI software:
41
+
42
+ - [Intel Neural Compressor](https://github.com/intel/neural-compressor)
43
+ - [AutoRound](https://github.com/intel/auto-round)
44
+
45
+ ## Disclaimer
46
+
47
+ The license on this model does not constitute legal advice.
48
+ We are not responsible for the actions of third parties who use this model.
49
+ Please consult an attorney before using this model for commercial purposes.