slicexai
/

elm-v0.1_news_classification

Text Generation

Model card Files Files and versions Community

dev-slx commited on Apr 18, 2024

Commit

ca0b3d6

·

verified ·

1 Parent(s): 327cae5

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -26,9 +26,12 @@ _Fast Inference with Customization:_ Once trained, the ELM model architecture pe
 ## ELM-v0.1 Model Release
 This repository contains code to run our ELM models. The current ELM model `elm-v0.1` (named _Rambutan_) was pre-trained (an intermediate checkpoint was used) and then instruction fine-tuned for downstream tasks.
-Models are located in the `models` folder. ELM models in this repository comes in three sizes (elm-1.0, elm-0.75 and elm-0.25) and supports the following use-case.
 - news_classification
 ## Setup ELM
 ### Download ELM repo

 ## ELM-v0.1 Model Release
 This repository contains code to run our ELM models. The current ELM model `elm-v0.1` (named _Rambutan_) was pre-trained (an intermediate checkpoint was used) and then instruction fine-tuned for downstream tasks.
+ELM models (in the `models` folder) in this repository come in three sizes (elm-1.0, elm-0.75 and elm-0.25). **All these different slices are derived from the same ELM-1.0 finetuned checkpoint** and supports the following use-case.
 - news_classification
+```note
+NOTE: ELM-v0.1 is an early version finetuned from an intermediate pretrained checkpoint & without any KV caching, decoding optimizations, or quantization applied.
+```
 ## Setup ELM
 ### Download ELM repo