manu commited on
Commit
394a79e
verified
1 Parent(s): 434580e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -1
README.md CHANGED
@@ -23,6 +23,9 @@ This model is part of the CroissantLLM initiative, and corresponds to the checkp
23
  To play with the final model, we recommend using the Chat version: https://huggingface.co/croissantllm/CroissantLLMChat-v0.1.
24
 
25
 
 
 
 
26
 
27
  ## Abstract
28
  We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware.
@@ -35,7 +38,14 @@ This work enriches the NLP landscape, breaking away from previous English-centri
35
  Our work can be cited as:
36
 
37
  ```bash
38
- Coming soon
 
 
 
 
 
 
 
39
  ```
40
 
41
  ## Usage
 
23
  To play with the final model, we recommend using the Chat version: https://huggingface.co/croissantllm/CroissantLLMChat-v0.1.
24
 
25
 
26
+ https://arxiv.org/abs/2402.00786
27
+
28
+
29
 
30
  ## Abstract
31
  We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware.
 
38
  Our work can be cited as:
39
 
40
  ```bash
41
+ @misc{faysse2024croissantllm,
42
+ title={CroissantLLM: A Truly Bilingual French-English Language Model},
43
+ author={Manuel Faysse and Patrick Fernandes and Nuno Guerreiro and Ant贸nio Loison and Duarte Alves and Caio Corro and Nicolas Boizard and Jo茫o Alves and Ricardo Rei and Pedro Martins and Antoni Bigata Casademunt and Fran莽ois Yvon and Andr茅 Martins and Gautier Viaud and C茅line Hudelot and Pierre Colombo},
44
+ year={2024},
45
+ eprint={2402.00786},
46
+ archivePrefix={arXiv},
47
+ primaryClass={cs.CL}
48
+ }
49
  ```
50
 
51
  ## Usage