Update README.md
README.md
CHANGED
@@ -116,4 +116,10 @@ one P6000 GPU
 
 #### Software
 
-Pytorch and HuggingFace
+Pytorch and HuggingFace
+
+### Citation
+
+Misra, Rishabh. "News Category Dataset." arXiv preprint arXiv:2209.11429 (2022).
+Misra, Rishabh and Jigyasa Grover. "Sculpting Data for ML: The first act of Machine Learning." ISBN 9798585463570 (2021).
+Tandon, Karan. "This LLM is based on BERT (2018) a bidirectional Transformer. BERT was finetuned using AdamW with the help of NVIDIA AMP and trained in 45 minutes on one P6000 GPU. This model accepts news summary/news headlines/news article and classifies into one of 40 categories"