slicexai
/

elm-v0.1_news_classification

Text Generation

Model card Files Files and versions Community

dev-slx commited on Apr 18, 2024

Commit

6d067ed

·

verified ·

1 Parent(s): ca0b3d6

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -1,6 +1,8 @@
 ---
 license: apache-2.0
 pipeline_tag: text2text-generation
 ---
 # SliceX AI™ ELM (Efficient Language Models)
 **ELM** (which stands for **E**fficient **L**anguage **M**odels) is the first version in the series of cutting-edge language models from [SliceX AI](https://slicex.ai) that is designed to achieve the best in class performance in terms of _quality_, _throughput_ & _memory_.
@@ -27,7 +29,7 @@ _Fast Inference with Customization:_ Once trained, the ELM model architecture pe
 This repository contains code to run our ELM models. The current ELM model `elm-v0.1` (named _Rambutan_) was pre-trained (an intermediate checkpoint was used) and then instruction fine-tuned for downstream tasks.
 ELM models (in the `models` folder) in this repository come in three sizes (elm-1.0, elm-0.75 and elm-0.25). **All these different slices are derived from the same ELM-1.0 finetuned checkpoint** and supports the following use-case.
-- news_classification
 ```note
 NOTE: ELM-v0.1 is an early version finetuned from an intermediate pretrained checkpoint & without any KV caching, decoding optimizations, or quantization applied.

 ---
 license: apache-2.0
 pipeline_tag: text2text-generation
+datasets:
+- ag_news
 ---
 # SliceX AI™ ELM (Efficient Language Models)
 **ELM** (which stands for **E**fficient **L**anguage **M**odels) is the first version in the series of cutting-edge language models from [SliceX AI](https://slicex.ai) that is designed to achieve the best in class performance in terms of _quality_, _throughput_ & _memory_.
 This repository contains code to run our ELM models. The current ELM model `elm-v0.1` (named _Rambutan_) was pre-trained (an intermediate checkpoint was used) and then instruction fine-tuned for downstream tasks.
 ELM models (in the `models` folder) in this repository come in three sizes (elm-1.0, elm-0.75 and elm-0.25). **All these different slices are derived from the same ELM-1.0 finetuned checkpoint** and supports the following use-case.
+- news_classification (ag_news)
 ```note
 NOTE: ELM-v0.1 is an early version finetuned from an intermediate pretrained checkpoint & without any KV caching, decoding optimizations, or quantization applied.