asi commited on
Commit
9155609
1 Parent(s): 27c3dea

:books: fix typo in README

Browse files
Files changed (1) hide show
  1. README.md +8 -6
README.md CHANGED
@@ -2,7 +2,7 @@
2
  language:
3
  - fr
4
 
5
- thumbnail: https://github.com/AntoineSimoulin/gpt-fr/blob/main/imgs/logo.png?raw=true
6
  tags:
7
  - tf
8
  - pytorch
@@ -22,7 +22,7 @@ license: apache-2.0
22
  | Model name | Number of layers | Attention Heads | Embedding Dimension | Total Parameters |
23
  | :------: | :---: | :---: | :---: | :---: |
24
  | `gpt-fr-cased-small` | 12 | 12 | 768 | 124 M |
25
- | `gpt-fr-cased-base` | 24 | 14 | 1,792 | 1,017 B |
26
 
27
  ## Intended uses & limitations
28
 
@@ -46,10 +46,9 @@ input_ids = tokenizer.encode(input_sentence, return_tensors='pt')
46
 
47
  beam_outputs = model.generate(
48
  input_ids,
49
- max_length=200,
50
  do_sample=True,
51
  top_k=50,
52
- max_length=100,
53
  top_p=0.95,
54
  num_return_sequences=1
55
  )
@@ -83,8 +82,11 @@ In line with the [WikiText](https://blog.einstein.ai/the-wikitext-long-term-depe
83
  ### BibTeX entry and citation info
84
 
85
  ```bibtex
86
- @inproceedings{...,
87
- year={2020}
 
 
 
88
  }
89
  ```
90
 
 
2
  language:
3
  - fr
4
 
5
+ thumbnail: https://raw.githubusercontent.com/AntoineSimoulin/gpt-fr/main/imgs/logo.png
6
  tags:
7
  - tf
8
  - pytorch
 
22
  | Model name | Number of layers | Attention Heads | Embedding Dimension | Total Parameters |
23
  | :------: | :---: | :---: | :---: | :---: |
24
  | `gpt-fr-cased-small` | 12 | 12 | 768 | 124 M |
25
+ | `gpt-fr-cased-base` | 24 | 14 | 1792 | 1,017 B |
26
 
27
  ## Intended uses & limitations
28
 
 
46
 
47
  beam_outputs = model.generate(
48
  input_ids,
49
+ max_length=100,
50
  do_sample=True,
51
  top_k=50,
 
52
  top_p=0.95,
53
  num_return_sequences=1
54
  )
 
82
  ### BibTeX entry and citation info
83
 
84
  ```bibtex
85
+ @inproceedings{simoulin_2020_gptfr,
86
+ title = {Un modèle Transformer Génératif Pré-entraîné pour le ______ français},
87
+ author = {Simoulin, Antoine and Crabbé, Benoit},
88
+ year = {2021},
89
+ pubstate = {forthcoming},
90
  }
91
  ```
92