Update README.md
README.md CHANGED
@@ -23,6 +23,9 @@ This model is not intended to be used directly, but rather to be fine-tuned for
 - **License:** Same as original model
 - **Developed by:** [Pere Martra](https://huggingface.co/oopere)
 
+These models are part of the study "[Exploring GLU Expansion Ratios: Structured Pruning in Llama-3.2 Models](https://doi.org/10.31219/osf.io/qgxea)". They explore structured pruning in GLU-based architectures using Llama-3.2 (1B and 3B variants). The pruning experiments target optimal expansion ratios to balance performance, computational efficiency, and environmental sustainability. The models were evaluated across multiple benchmarks, including BoolQ, ARC-Easy, and MUSR, and demonstrate significant efficiency gains while maintaining robust task performance.
+
+
 ### Performance on Standard Benchmarks
 | Benchmark | Original Model | Pruned Model | Relative Change |
 | ---- | ---- | ---- | ---- |
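For context on the "expansion ratio" the added paragraph refers to: in a Llama-style GLU block, `gate_proj` and `up_proj` expand the hidden size to an intermediate size and `down_proj` projects back, so the ratio is `intermediate_size / hidden_size`. Below is a minimal, hypothetical PyTorch sketch of shrinking that ratio by dropping intermediate neurons. The `prune_glu_mlp` helper and its weight-L1-norm importance score are illustrative assumptions, not the criterion or code used in the study.

```python
import torch
import torch.nn as nn

def prune_glu_mlp(gate_proj: nn.Linear, up_proj: nn.Linear,
                  down_proj: nn.Linear, expansion_ratio: float):
    """Shrink a GLU MLP's intermediate size to hidden_size * expansion_ratio.

    Neuron importance is approximated by the L1 norm of each intermediate
    neuron's weights across gate_proj and up_proj (an illustrative
    heuristic; the paper's actual selection criterion may differ).
    """
    hidden_size = gate_proj.in_features
    new_inter = int(hidden_size * expansion_ratio)

    # Score each intermediate neuron; rows of gate_proj/up_proj and
    # columns of down_proj all index the same neuron.
    scores = (gate_proj.weight.abs().sum(dim=1)
              + up_proj.weight.abs().sum(dim=1))
    keep = torch.topk(scores, new_inter).indices.sort().values

    def shrink_rows(linear: nn.Linear) -> nn.Linear:
        new = nn.Linear(linear.in_features, new_inter,
                        bias=linear.bias is not None)
        new.weight.data = linear.weight.data[keep]
        if linear.bias is not None:
            new.bias.data = linear.bias.data[keep]
        return new

    new_gate, new_up = shrink_rows(gate_proj), shrink_rows(up_proj)
    new_down = nn.Linear(new_inter, down_proj.out_features,
                         bias=down_proj.bias is not None)
    new_down.weight.data = down_proj.weight.data[:, keep]
    if down_proj.bias is not None:
        new_down.bias.data = down_proj.bias.data
    return new_gate, new_up, new_down

# Llama-3.2-1B uses hidden_size=2048, intermediate_size=8192 (ratio 4.0);
# pruning to a 2.0 ratio halves the MLP's parameter count.
gate = nn.Linear(2048, 8192, bias=False)
up = nn.Linear(2048, 8192, bias=False)
down = nn.Linear(8192, 2048, bias=False)
gate, up, down = prune_glu_mlp(gate, up, down, expansion_ratio=2.0)
```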