dev-slx committed on
Commit 6d067ed · verified · 1 Parent(s): ca0b3d6

Update README.md

  1. README.md +3 -1
README.md CHANGED
@@ -1,6 +1,8 @@
 ---
 license: apache-2.0
 pipeline_tag: text2text-generation
+datasets:
+- ag_news
 ---
 # SliceX AI™ ELM (Efficient Language Models)
 **ELM** (which stands for **E**fficient **L**anguage **M**odels) is the first version in the series of cutting-edge language models from [SliceX AI](https://slicex.ai) that is designed to achieve the best in class performance in terms of _quality_, _throughput_ & _memory_.
@@ -27,7 +29,7 @@ _Fast Inference with Customization:_ Once trained, the ELM model architecture pe
 This repository contains code to run our ELM models. The current ELM model `elm-v0.1` (named _Rambutan_) was pre-trained (an intermediate checkpoint was used) and then instruction fine-tuned for downstream tasks.
 
 ELM models (in the `models` folder) in this repository come in three sizes (elm-1.0, elm-0.75 and elm-0.25). **All these different slices are derived from the same ELM-1.0 finetuned checkpoint** and supports the following use-case.
-- news_classification
+- news_classification (ag_news)
 
 ```note
 NOTE: ELM-v0.1 is an early version finetuned from an intermediate pretrained checkpoint & without any KV caching, decoding optimizations, or quantization applied.