Update README.md
README.md
CHANGED
@@ -94,6 +94,9 @@ Details:
 - **Filtered vs Unfiltered Data**: Minor improvements seen with filtered data, but not significant enough to justify filtering in every case.
 - **Computation Limitations**: Fixed memory allocation on TPUs required processing data as a single stream, unlike GPU training which preserves document boundaries. Training was performed in 32-bit mode due to framework limitations, increasing memory usage.

+## Fairseq Checkpoints
+Get the fairseq checkpoints [here](https://drive.proton.me/urls/CFSGE8ZK9R#1F1G727lv77k).
+
 ## Citations
 If you use GottBERT in your research, please cite the following paper:
 ```bibtex