PEFT
Safetensors
English
jinjieyuan commited on
Commit
a45b7db
·
verified ·
1 Parent(s): d38d1a0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -5
README.md CHANGED
@@ -3,17 +3,23 @@ language: en
3
  license: apache-2.0
4
  ---
5
 
6
- # Shears Model Card: shears-llama-7b-50-cs-super-adapter
7
 
8
- The super-adapter fine-tuned on sparsified LLaMA-7B with some commonsense reasoning datasets using Shears.
9
 
10
  The release of the super-network is to facilitate users to apply their own search algorithms and evaluation indicators to extract subnetworks suitable for their specific needs.
11
 
 
 
 
12
  ## Model Details
13
 
 
 
 
14
  ### Information
15
 
16
- - **Model name:** shears-llama-7b-50-cs-super-adapter
17
  - **Base model:** Sparsified [LLaMA-7B](https://huggingface.co/yahma/llama-7b-hf)
18
  - **Sparsity:** 50%
19
  - **Domain:** Commonsense
@@ -134,13 +140,25 @@ Results of the heuristic sub-network discoverd from the super-network:
134
  - **Repository:** [https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/Shears](https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/Shears)
135
  - **Paper:** [Shears: Unstructured Sparsity with Neural Low-rank Adapter Search](https://arxiv.org/abs/2404.10934)
136
 
 
 
 
 
 
 
 
 
 
 
 
 
137
  ## Citation
138
 
139
  ```bash
140
- @article{munoz2024shears,
141
  title = {Shears: Unstructured Sparsity with Neural Low-rank Adapter Search},
142
  author={J. Pablo Munoz and Jinjie Yuan and Nilesh Jain},
143
- journal={The 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-2024)},
144
  year={2024}
145
  }
146
  ```
 
3
  license: apache-2.0
4
  ---
5
 
6
+ # Shears Adapter Card: shears-llama-7b-50-cs-super-adapter
7
 
8
+ The super-adapter-network fine-tuned on sparsified LLaMA-7B with some commonsense reasoning datasets using Shears.
9
 
10
  The release of the super-network is to facilitate users to apply their own search algorithms and evaluation indicators to extract subnetworks suitable for their specific needs.
11
 
12
+ ## Paper Abstract
13
+ Recently, several approaches successfully demonstrated that weight-sharing Neural Architecture Search (NAS) can effectively explore a search space of elastic low-rank adapters (LoRA), allowing the parameter-efficient fine-tuning (PEFT) and compression of large language models. In this paper, we introduce a novel approach called Shears, demonstrating how the integration of cost-effective sparsity and a proposed Neural Low-rank adapter Search (NLS) algorithm can further improve the efficiency of PEFT approaches. Results demonstrate the benefits of Shears compared to other methods, reaching high sparsity levels while improving or with little drop in accuracy, utilizing a single GPU for a pair of hours.
14
+
15
  ## Model Details
16
 
17
+ ### Note
18
+ Please note, we only provide the model adapter and do not provide a copy of the base [yahma/llama-7b-hf](https://huggingface.co/yahma/llama-7b-hf) model or its sparsified one. Any use of this adapter requires a separate download of the base model and follow [this instruction](#sparsified-base-model) to sparse the base model.
19
+
20
  ### Information
21
 
22
+ - **Adapter name:** shears-llama-7b-50-cs-super-adapter
23
  - **Base model:** Sparsified [LLaMA-7B](https://huggingface.co/yahma/llama-7b-hf)
24
  - **Sparsity:** 50%
25
  - **Domain:** Commonsense
 
140
  - **Repository:** [https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/Shears](https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/Shears)
141
  - **Paper:** [Shears: Unstructured Sparsity with Neural Low-rank Adapter Search](https://arxiv.org/abs/2404.10934)
142
 
143
+ ## Ethical Considerations
144
+
145
+ Intel is committed to respecting human rights and avoiding causing or contributing to adverse impacts on human rights. See [Intel’s Global Human Rights Principles](https://www.intel.com/content/dam/www/central-libraries/us/en/documents/policy-human-rights.pdf). Intel’s products and software are intended only to be used in applications that do not cause or contribute to adverse impacts on human rights.
146
+
147
+ | Ethical Considerations | Description |
148
+ | ----------- | ----------- |
149
+ | Data | The adapter was trained using the commonsense_15k data mixture as described above. |
150
+ | Human life | The model is not intended to inform decisions central to human life or flourishing. |
151
+ | Mitigations | No additional risk mitigation strategies were considered during model development. |
152
+ | Risks and harms | This model has not been assessed for harm or biases, and should not be used for sensitive applications where it may cause harm. |
153
+ | Use cases | - |
154
+
155
  ## Citation
156
 
157
  ```bash
158
+ @inproceedings{munoz2024shears,
159
  title = {Shears: Unstructured Sparsity with Neural Low-rank Adapter Search},
160
  author={J. Pablo Munoz and Jinjie Yuan and Nilesh Jain},
161
+ booktitle={The 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-2024)},
162
  year={2024}
163
  }
164
  ```