Sashkanik13 commited on
Commit
576a6d3
1 Parent(s): 6e7acb7

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -0
README.md ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - ru
4
+ tags:
5
+ - PyTorch
6
+ - Transformers
7
+ thumbnail: "https://github.com/sberbank-ai/ru-gpts"
8
+ ---
9
+
10
+ # rugpt3small\_based\_on\_gpt2
11
+ Model was trained with sequence length 1024 using transformers by [SberDevices](https://sberdevices.ru/) team on 80B tokens around 3 epoch. After that model was finetuned on 2048 context.
12
+
13
+ Total training time took around one week on 32 GPUs.
14
+
15
+ # Authors
16
+ + NLP core team RnD [Telegram channel](https://t.me/nlpcoreteam):
17
+ + Dmitry Zmitrovich