Update README.md
Browse files
README.md
CHANGED
@@ -23,6 +23,9 @@ This model is part of the CroissantLLM initiative, and corresponds to the checkp
|
|
23 |
To play with the final model, we recommend using the Chat version: https://huggingface.co/croissantllm/CroissantLLMChat-v0.1.
|
24 |
|
25 |
|
|
|
|
|
|
|
26 |
|
27 |
## Abstract
|
28 |
We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware.
|
@@ -35,7 +38,14 @@ This work enriches the NLP landscape, breaking away from previous English-centri
|
|
35 |
Our work can be cited as:
|
36 |
|
37 |
```bash
|
38 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
39 |
```
|
40 |
|
41 |
## Usage
|
|
|
23 |
To play with the final model, we recommend using the Chat version: https://huggingface.co/croissantllm/CroissantLLMChat-v0.1.
|
24 |
|
25 |
|
26 |
+
https://arxiv.org/abs/2402.00786
|
27 |
+
|
28 |
+
|
29 |
|
30 |
## Abstract
|
31 |
We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware.
|
|
|
38 |
Our work can be cited as:
|
39 |
|
40 |
```bash
|
41 |
+
@misc{faysse2024croissantllm,
|
42 |
+
title={CroissantLLM: A Truly Bilingual French-English Language Model},
|
43 |
+
author={Manuel Faysse and Patrick Fernandes and Nuno Guerreiro and Ant贸nio Loison and Duarte Alves and Caio Corro and Nicolas Boizard and Jo茫o Alves and Ricardo Rei and Pedro Martins and Antoni Bigata Casademunt and Fran莽ois Yvon and Andr茅 Martins and Gautier Viaud and C茅line Hudelot and Pierre Colombo},
|
44 |
+
year={2024},
|
45 |
+
eprint={2402.00786},
|
46 |
+
archivePrefix={arXiv},
|
47 |
+
primaryClass={cs.CL}
|
48 |
+
}
|
49 |
```
|
50 |
|
51 |
## Usage
|