wang7776 committed on
Commit
289d408
1 Parent(s): 4f7d536

Update README.md

  1. README.md +4 -0
README.md CHANGED
@@ -2,6 +2,10 @@
  inference: false
  license: apache-2.0
  ---
+ # Overview
+ The attention layers of this model have been pruned to 10% sparsity using the [Wanda pruning method](https://arxiv.org/abs/2306.11695). Wanda requires no retraining or weight updates and still achieves competitive performance. The base model is available [here](https://huggingface.co/lmsys/vicuna-7b-v1.3).
+
+
  # Vicuna Model Card
 
  ## Model Details
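
The Overview added in this commit names Wanda as the pruning method without showing what it does. As a rough illustration only, the sketch below applies the Wanda criterion (weight magnitude multiplied by the L2 norm of the matching input activation, compared within each output row) to a toy weight matrix in plain Python. The function name `wanda_prune`, its parameters, and the toy data are hypothetical and are not taken from this repository or from the code used to produce this checkpoint, which would operate on the model's attention projection tensors with real calibration data.

```python
import math


def wanda_prune(weights, activations, sparsity):
    """Zero the lowest-scoring fraction of weights in each output row.

    weights:     list of rows, shape (out_features, in_features)
    activations: calibration inputs, shape (n_samples, in_features)
    sparsity:    fraction of weights to remove per row, e.g. 0.1

    Hypothetical helper for illustration; not the author's pruning script.
    """
    n_in = len(weights[0])
    # L2 norm of each input feature across the calibration samples.
    feat_norms = [
        math.sqrt(sum(sample[j] ** 2 for sample in activations))
        for j in range(n_in)
    ]
    n_prune = int(n_in * sparsity)  # weights to drop in every row

    pruned = []
    for row in weights:
        # Wanda score: |weight| * norm of the input feature it multiplies.
        scores = [abs(w) * feat_norms[j] for j, w in enumerate(row)]
        # Indices of the n_prune smallest scores within this row.
        drop = set(sorted(range(n_in), key=lambda j: scores[j])[:n_prune])
        pruned.append([0.0 if j in drop else w for j, w in enumerate(row)])
    return pruned


# Toy example: the larger weight (4.0) sees a tiny activation, so it is the
# one removed, which plain magnitude pruning would not do.
toy = wanda_prune([[4.0, 1.0]], [[0.1, 10.0]], 0.5)
```

No weights are updated after masking, which matches the Overview's statement that the method needs no retraining. At 10% sparsity, one in ten weights per output row would be zeroed under this sketch.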