---
language: en
license: cc-by-4.0
tags:
- Clinical notes
- Discharge summaries
- longformer
datasets:
- MIMIC-III
---

* This model was obtained by continuing the pre-training of RoBERTa-base on discharge summaries from the MIMIC-III dataset.

* Details can be found in the following paper:

> Xiang Dai, Ilias Chalkidis, Sune Darkner, and Desmond Elliott. 2022. Revisiting Transformer-based Models for Long Document Classification. (https://arxiv.org/abs/2204.06683)

* Important hyper-parameters:

| Hyper-parameter | Value |
|---|---|
| Max sequence length | 4096 |
| Batch size | 8 |
| Learning rate | 5e-5 |
| Training epochs | 6 |
| Training time | 130 GPU-hours |
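
* A minimal usage sketch with the Hugging Face `transformers` library is given below. The model identifier `path/to/this-model` is a placeholder (the actual repository name is not stated here), and the snippet assumes the checkpoint loads through the standard Auto classes.

```python
from transformers import AutoTokenizer, AutoModel

# Placeholder -- replace with this model's actual Hub repository name.
model_name = "path/to/this-model"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)

# Encode a (long) discharge summary; the model was pre-trained with
# sequences of up to 4096 tokens (see the hyper-parameter table above).
text = "DISCHARGE SUMMARY: The patient was admitted for ..."
inputs = tokenizer(text, truncation=True, max_length=4096, return_tensors="pt")

outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (batch, seq_len, hidden_size)
```

For downstream document classification, as studied in the paper above, the same checkpoint would typically be loaded with `AutoModelForSequenceClassification` and fine-tuned on labeled notes.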