---
# Model Summary

MobileLLaMA-1.4B-Base is a Transformer with 1.4 billion parameters. We downscale LLaMA to facilitate off-the-shelf deployment. To make our work reproducible, all the models are trained on 1.3T tokens from the [RedPajama v1](https://www.together.ai/blog/redpajama) dataset only. This benefits further research by enabling controlled experiments.
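
For a quick smoke test, the base model can be loaded with Hugging Face `transformers`. The snippet below is a minimal sketch, assuming the checkpoint is hosted on the Hub under the repo id `mtgv/MobileLLaMA-1.4B-Base` (the repo id and prompt are illustrative assumptions; substitute the actual checkpoint path):

```python
# Minimal text-generation sketch using the standard transformers API.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mtgv/MobileLLaMA-1.4B-Base"  # assumed Hub repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16)

prompt = "Mobile devices can run language models because"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```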

We extensively assess our models on two standard natural language benchmarks, for language understanding and common-sense reasoning respectively. Experimental results show that our MobileLLaMA 1.4B is on par with the most recent open-source models.