Update README.md
Browse files
README.md
CHANGED
@@ -1,6 +1,8 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
pipeline_tag: text2text-generation
|
|
|
|
|
4 |
---
|
5 |
# SliceX AI™ ELM (Efficient Language Models)
|
6 |
**ELM** (which stands for **E**fficient **L**anguage **M**odels) is the first version in the series of cutting-edge language models from [SliceX AI](https://slicex.ai) that is designed to achieve the best in class performance in terms of _quality_, _throughput_ & _memory_.
|
@@ -27,7 +29,7 @@ _Fast Inference with Customization:_ Once trained, the ELM model architecture pe
|
|
27 |
This repository contains code to run our ELM models. The current ELM model `elm-v0.1` (named _Rambutan_) was pre-trained (an intermediate checkpoint was used) and then instruction fine-tuned for downstream tasks.
|
28 |
|
29 |
ELM models (in the `models` folder) in this repository come in three sizes (elm-1.0, elm-0.75 and elm-0.25). **All these different slices are derived from the same ELM-1.0 finetuned checkpoint** and supports the following use-case.
|
30 |
-
- news_classification
|
31 |
|
32 |
```note
|
33 |
NOTE: ELM-v0.1 is an early version finetuned from an intermediate pretrained checkpoint & without any KV caching, decoding optimizations, or quantization applied.
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
pipeline_tag: text2text-generation
|
4 |
+
datasets:
|
5 |
+
- ag_news
|
6 |
---
|
7 |
# SliceX AI™ ELM (Efficient Language Models)
|
8 |
**ELM** (which stands for **E**fficient **L**anguage **M**odels) is the first version in the series of cutting-edge language models from [SliceX AI](https://slicex.ai) that is designed to achieve the best in class performance in terms of _quality_, _throughput_ & _memory_.
|
|
|
29 |
This repository contains code to run our ELM models. The current ELM model `elm-v0.1` (named _Rambutan_) was pre-trained (an intermediate checkpoint was used) and then instruction fine-tuned for downstream tasks.
|
30 |
|
31 |
ELM models (in the `models` folder) in this repository come in three sizes (elm-1.0, elm-0.75 and elm-0.25). **All these different slices are derived from the same ELM-1.0 finetuned checkpoint** and supports the following use-case.
|
32 |
+
- news_classification (ag_news)
|
33 |
|
34 |
```note
|
35 |
NOTE: ELM-v0.1 is an early version finetuned from an intermediate pretrained checkpoint & without any KV caching, decoding optimizations, or quantization applied.
|