Philip May commited on
Commit
46a949f
1 Parent(s): cf598af

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -0
README.md CHANGED
@@ -12,3 +12,9 @@ tags:
12
 
13
  # Evaluation
14
  see <https://github.com/GermanT5/german-t5-eval>
 
 
 
 
 
 
 
12
 
13
  # Evaluation
14
  see <https://github.com/GermanT5/german-t5-eval>
15
+
16
+ # Tips for training on GPUs
17
+ This model is too big to fit on a normal 16GB GPU in FP32 mode.
18
+ For various reasons, T5 models cannot be trained in FP16 mode.
19
+ However, mixed precision training is not yet supported on many GPUs.
20
+ For example, it does not work on V100 GPUs. On A100, however, it does.