nicholasKluge
/

TeenyTinyLlama-160m

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

nicholasKluge commited on Jan 15

Commit

b995a94

•

1 Parent(s): e35d098

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -50,7 +50,7 @@ Also, TeenyTinyLlama models were trained by leveraging [scaling laws](https://ar
 - **Context length:** 2048 tokens
 - **Dataset:** [Portuguese-Corpus-v3](https://huggingface.co/datasets/nicholasKluge/portuguese-corpus-v3) (6.2B tokens)
 - **Language:** Portuguese
-- **Number of steps:** 457,969 (3.7B tokens)
 - **GPU:** 1 NVIDIA A100-SXM4-40GB
 - **Training time**: ~ 36 hours
 - **Emissions:** 5.6 KgCO2 (Germany)
@@ -178,7 +178,7 @@ for i, completion in enumerate(completions):
 | [Bloom-560m](https://huggingface.co/bigscience/bloom-560m)*                         | 32.13   | 24.74                                   | 37.15                                         | 24.22                                    | 42.44                                          |
 | [Multilingual GPT](https://huggingface.co/ai-forever/mGPT)*                         | 28.73   | 23.81                                   | 26.37                                         | 25.17                                    | 39.62                                          |
-* Evaluations on benchmarks were performed using the [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) (by [EleutherAI](https://www.eleuther.ai/)). Thanks to [Laiviet](https://github.com/laiviet/lm-evaluation-harness) for translating some of the tasks in the LM-Evaluation-Harness. The results of models marked with an "*" were retirved from the [Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard).
 ## Fine-Tuning Comparisons

 - **Context length:** 2048 tokens
 - **Dataset:** [Portuguese-Corpus-v3](https://huggingface.co/datasets/nicholasKluge/portuguese-corpus-v3) (6.2B tokens)
 - **Language:** Portuguese
+- **Number of steps:** 457,969
 - **GPU:** 1 NVIDIA A100-SXM4-40GB
 - **Training time**: ~ 36 hours
 - **Emissions:** 5.6 KgCO2 (Germany)
 | [Bloom-560m](https://huggingface.co/bigscience/bloom-560m)*                         | 32.13   | 24.74                                   | 37.15                                         | 24.22                                    | 42.44                                          |
 | [Multilingual GPT](https://huggingface.co/ai-forever/mGPT)*                         | 28.73   | 23.81                                   | 26.37                                         | 25.17                                    | 39.62                                          |
+- Evaluations on benchmarks were performed using the [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) (by [EleutherAI](https://www.eleuther.ai/)). Thanks to [Laiviet](https://github.com/laiviet/lm-evaluation-harness) for translating some of the tasks in the LM-Evaluation-Harness. The results of models marked with an "*" were retirved from the [Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard).
 ## Fine-Tuning Comparisons