Update README.md
Browse files
README.md
CHANGED
|
@@ -73,7 +73,7 @@ The training process involved a per-device batch size of 92, gradient accumulati
|
|
| 73 |
|
| 74 |
## Results
|
| 75 |
|
| 76 |
-
|
| 77 |
|
| 78 |
| Metric | Value | Description |
|
| 79 |
|--------------------|---------|---------------------------------------------------------|
|
|
@@ -82,7 +82,8 @@ The training process involved a per-device batch size of 92, gradient accumulati
|
|
| 82 |
| **Perplexity** | 1.3849 | Indicates how well the model predicts the next word. |
|
| 83 |
|
| 84 |
|
| 85 |
-
|
|
|
|
| 86 |
Named Entity Recognition (NER)
|
| 87 |
|
| 88 |
| Metric | Value |
|
|
|
|
| 73 |
|
| 74 |
## Results
|
| 75 |
|
| 76 |
+
#### Training and Evaluation Metrics
|
| 77 |
|
| 78 |
| Metric | Value | Description |
|
| 79 |
|--------------------|---------|---------------------------------------------------------|
|
|
|
|
| 82 |
| **Perplexity** | 1.3849 | Indicates how well the model predicts the next word. |
|
| 83 |
|
| 84 |
|
| 85 |
+
#### Fine-tuned For Downstream Task
|
| 86 |
+
|
| 87 |
Named Entity Recognition (NER)
|
| 88 |
|
| 89 |
| Metric | Value |
|