## Model description

**Llama-3.1-Carballo** is an 8B-parameter transformer-based causal language model for Galician, Portuguese, Spanish, Catalan and English.
It is the result of continual pretraining of [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B) on a multilingual corpus of almost 20B tokens, with an emphasis on Galician texts.

This model is part of the **Carballo family**, a family of LLMs specialized in Galician. Smaller models can be found [here](https://huggingface.co/collections/proxectonos/text-models-65d49fa54e358ce02a9699c8).
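
As a quick illustration (a hedged sketch, not part of the original card), the model can be loaded with the standard `transformers` causal-LM API. The repository ID below is an assumption based on this card's collection; substitute the actual model ID:

```python
# Minimal sketch: load the model and generate Galician text.
# "proxectonos/Llama-3.1-Carballo" is an assumed placeholder ID.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "proxectonos/Llama-3.1-Carballo"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

inputs = tokenizer("Galicia é unha terra", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50, do_sample=True, top_p=0.95)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```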
## Intended uses and limitations
### Training data

The training corpus consists of texts in 5 languages, with an emphasis on Galician. The main aim is to ensure that the model learns to handle Galician well, while maintaining its knowledge of languages it already covers (Spanish, English), learning new ones (Catalan), and adapting to other varieties of known languages (European Portuguese instead of Brazilian Portuguese).

The corpus is structured as follows:

The training was conducted at the Galicia Supercomputing Center ([CESGA](https://www.cesga.es/en/home-2/)), using 5 nodes with 2 NVIDIA A100 GPUs each.
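
Since the card states the model was trained with HuggingFace Transformers and PyTorch using the causal modeling objective, a rough sketch of such a continual-pretraining run is shown below. The data file, sequence length, and hyperparameters are illustrative assumptions, not the project's actual recipe:

```python
# Hedged sketch of continual pretraining with the HF Trainer.
# "corpus_gl.txt" and all hyperparameters are placeholders.
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

base = "meta-llama/Llama-3.1-8B"
tokenizer = AutoTokenizer.from_pretrained(base)
tokenizer.pad_token = tokenizer.eos_token  # Llama tokenizers ship without a pad token
model = AutoModelForCausalLM.from_pretrained(base)

# Tokenize a plain-text corpus for causal language modeling.
dataset = load_dataset("text", data_files={"train": "corpus_gl.txt"})["train"]
dataset = dataset.map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=2048),
    batched=True, remove_columns=["text"],
)

args = TrainingArguments(
    output_dir="llama31-carballo-cpt",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=16,
    bf16=True,
    num_train_epochs=1,
)
trainer = Trainer(
    model=model,
    args=args,
    train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),  # causal LM
)
trainer.train()
```

Launched with `torchrun` or `accelerate launch`, each process would drive one of the 2 GPUs on each of the 5 nodes.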
## Evaluation

In progress...
## Additional information