gokulsrinivasagan/processed_wikitext-103-raw-v1-ld
Viewer • Updated • 230k • 4
How to use gokulsrinivasagan/bert_base_train_book_ent_2 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("fill-mask", model="gokulsrinivasagan/bert_base_train_book_ent_2") # Load model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM
tokenizer = AutoTokenizer.from_pretrained("gokulsrinivasagan/bert_base_train_book_ent_2")
model = AutoModelForMaskedLM.from_pretrained("gokulsrinivasagan/bert_base_train_book_ent_2")This model is a fine-tuned version of distilbert-base-uncased on the gokulsrinivasagan/processed_wikitext-103-raw-v1-ld dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|---|---|---|---|---|
| 1.9291 | 4.1982 | 10000 | 2.1578 | 0.7109 |
| 1.4523 | 8.3963 | 20000 | 2.2038 | 0.7109 |
| 0.9904 | 12.5945 | 30000 | 2.5061 | 0.7046 |
| 0.5702 | 16.7926 | 40000 | 2.9523 | 0.6994 |
| 0.2197 | 20.9908 | 50000 | 3.2009 | 0.6952 |