CompactAI committed · verified
Commit 6b1e7e2 · 1 Parent(s): 388e358

Upload folder using huggingface_hub
README.md CHANGED
@@ -5,7 +5,6 @@ tags:
  - python
  - optimized
  - wanda
- - activation-pruning
  base_model: Qwen/Qwen2.5-3B-Instruct
  pipeline_tag: text-generation
  ---
@@ -14,35 +13,28 @@ pipeline_tag: text-generation
 
  > 🎯 **PYTHON-optimized** | 📦 **Safe** pruning | ⚡ **1% weights pruned**
 
- This model is a **conservatively pruned** version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct), specialized for **PYTHON** tasks using activation-aware weight pruning (Wanda-style).
+ This model is a **conservatively pruned** version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct).
 
- ## ✨ Key Features
-
- - **Specialization**: Optimized for Python tasks
- - **Pruning Method**: Wanda-style (|W| × |activation|) importance scoring
- - **Size Reduction**: 1% weights pruned
- - **Use Case**: High accuracy retention, ideal for production use
-
- ## 📊 Performance Comparison
+ ## Performance Comparison
 
  | Category | Original | Pruned | Change |
  |----------|----------|--------|--------|
- | **Python** | 40.0% | 40.0% ⭐ | → |
- | Html | 6.7% | 6.7% | → |
- | Trivia | 88.9% | 88.9% | → |
- | Math | 57.8% | 57.8% | → |
- | Reasoning | 33.3% | 33.3% | → |
- | Medical | 93.3% | 93.3% | → |
- | Linux | 95.6% | 95.6% | → |
- | Writing | 62.2% | 62.2% | → |
+ | **Python** | 92.3% | 92.3% ⭐ | → |
+ | Html | 40.0% | 40.0% | → |
+ | Trivia | 100.0% | 100.0% | → |
+ | Math | 100.0% | 100.0% | → |
+ | Reasoning | 91.7% | 91.7% | → |
+ | Medical | 64.3% | 64.3% | → |
+ | Linux | 69.2% | 69.2% | → |
+ | Writing | 54.5% | 54.5% | → |
 
- **Average**: 59.7% → 59.7% (+0.0%)
+ **Average**: 76.5% → 76.5% (+0.0%)
 
- **Python Retention**: 100.0% of original performance
+ **Python Retention**: 100.0%
 
  ![Comparison Graph](comparison_graph.png)
 
- ## 🚀 Quick Start
+ ## Quick Start
 
  ```python
  from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -50,31 +42,20 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
  model = AutoModelForCausalLM.from_pretrained("CompactAI/Qwen2.5-3B-Instruct-python-safe")
  tokenizer = AutoTokenizer.from_pretrained("CompactAI/Qwen2.5-3B-Instruct-python-safe")
 
- # Example usage
  inputs = tokenizer("Your prompt here", return_tensors="pt")
  outputs = model.generate(**inputs, max_new_tokens=100)
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
  ```
 
- ## 📋 Technical Details
+ ## Technical Details
 
  | Property | Value |
  |----------|-------|
  | Base Model | [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct) |
  | Specialization | Python |
  | Prune Mode | Safe |
- | Pruning Method | Activation-based weight pruning (Wanda) |
  | Weight Reduction | 1% weights pruned |
 
- ## 🔗 Related Models
-
- This model is part of the **Qwen2.5-3B-Instruct** pruned model collection. Variants:
- - **Safe** - Conservative pruning (~10-20%), high accuracy retention
- - **Aggressive** - Maximum compression (~40-50%), best for edge deployment
+ ## License
 
- ## 📜 License
-
- This model inherits the license from the base model [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct).
-
- ---
- *Generated by ZANNPS [Zeto Automatic Neural Network Pruning System]*
+ This model inherits the license from the base model.
comparison_graph.png CHANGED
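The card text removed by this commit described the method as Wanda-style (|W| × |activation|) importance scoring. For reference, below is a minimal sketch of how such a score is typically computed and applied to a single linear layer; the layer shapes, random calibration batch, per-row grouping, and 1% target are illustrative assumptions, not the actual ZANNPS pipeline.

```python
import torch

def wanda_scores(weight: torch.Tensor, calib_inputs: torch.Tensor) -> torch.Tensor:
    """Score each weight as |w_ij| * ||x_j||_2 over a calibration batch.

    weight:       (out_features, in_features) matrix of one linear layer
    calib_inputs: (num_tokens, in_features) activations feeding that layer
    """
    act_norm = calib_inputs.norm(p=2, dim=0)        # (in_features,)
    return weight.abs() * act_norm.unsqueeze(0)     # (out_features, in_features)

def prune_by_score(weight: torch.Tensor, scores: torch.Tensor, sparsity: float) -> torch.Tensor:
    """Zero the lowest-scoring fraction of weights within each output row."""
    k = int(weight.shape[1] * sparsity)
    if k == 0:
        return weight.clone()
    drop = scores.topk(k, dim=1, largest=False).indices  # least important per row
    pruned = weight.clone()
    pruned.scatter_(1, drop, 0.0)
    return pruned

# Toy demonstration with random tensors (the card's "safe" mode corresponds to ~1% sparsity).
W = torch.randn(8, 400)
X = torch.randn(64, 400)
W_pruned = prune_by_score(W, wanda_scores(W, X), sparsity=0.01)
print(f"fraction of zeroed weights: {(W_pruned == 0).float().mean().item():.2%}")
```

In the published Wanda recipe, scores are computed layer by layer from a small calibration set and the lowest-scoring weights are simply zeroed in place, with no retraining.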
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:51ce3c8d9e197c1a236cb403694d93183bd53b93c3029bdc05041e3376301e60
+ oid sha256:c3ede79e8ebee2c46c136dacacad35481dd742eb95aef761b98f535c3e155c5a
  size 3995916600
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:aef660264e5bb20c8e345d07279d54aa9f93db8f82899c2d2f3d47aff8462879
+ oid sha256:6f0b7b954afd6db4f60f9c3cf3d65145d04b588f533f2d33c200edb503e93e1f
  size 2176009944
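The two safetensors shards above hold the pruned weights. One quick way to sanity-check the card's "1% weights pruned" figure after downloading is to count exact zeros in the 2-D weight matrices; this is only a sketch, assuming `torch` and `transformers` as in the Quick Start, and exact zeros merely approximate the true pruning mask.

```python
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "CompactAI/Qwen2.5-3B-Instruct-python-safe",
    torch_dtype=torch.bfloat16,
)

# Count exactly-zero entries across all 2-D parameters (linear / embedding weights).
zeros = total = 0
for param in model.parameters():
    if param.ndim == 2:
        zeros += (param == 0).sum().item()
        total += param.numel()

print(f"zeroed entries in 2-D weights: {zeros / total:.2%}")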
tokenizer.json CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:51354673edf4300eb841665e1fb684cc1badea87c49d5de6ef09981151683508
+ oid sha256:7b3e3adf18710ac3bd97b384b0d01b58205c4c5cd37c6c56d24c8fff86b0561d
  size 11422159
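Each LFS pointer above records the new file's `oid sha256:`. If a downloaded shard looks corrupted, its hash can be recomputed locally and compared against that value; the sketch below uses only the Python standard library, and the filename is a placeholder for wherever the file was saved.

```python
import hashlib

def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so multi-GB shards never sit in memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

# Compare against the oid recorded in the LFS pointer above (placeholder local path).
expected = "c3ede79e8ebee2c46c136dacacad35481dd742eb95aef761b98f535c3e155c5a"
print(sha256_of("model-00001-of-00002.safetensors") == expected)
```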