Update README.md
README.md CHANGED
@@ -5,9 +5,7 @@ Google Research's [ELECTRA](https://github.com/google-research/electra).

Tokenization and pre-training CoLab: https://colab.research.google.com/drive/1GngBFn_Ge5Hd2XI2febBhZyU7GDiqw5w

-V2 (current): 190,000 steps
-
-V1: 100,000 steps;
+V2 (current): 190,000 steps; (V1 was 100,000 steps)

## Usage

@@ -17,10 +15,10 @@ https://www.kaggle.com/sudalairajkumar/tamil-nlp

Notebook: https://colab.research.google.com/drive/1_rW9HZb6G87-5DraxHvhPOzGmSMUc67_?usp=sharin

The model outperformed mBERT on news classification:
-(Random: 16.7%, mBERT: 53.0%, TaMillion:
+(Random: 16.7%, mBERT: 53.0%, TaMillion: 69.6%)

The model slightly outperformed mBERT on movie reviews:
-(RMSE - mBERT: 0.657, TaMillion: 0.
+(RMSE - mBERT: 0.657, TaMillion: 0.627)

Equivalent accuracy on the Tirukkural topic task.
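Since the README's Usage section only links to the Kaggle dataset and the CoLab notebook, here is a minimal sketch of loading the model with the Hugging Face transformers library. The model identifier `monsoon-nlp/tamillion` and the AutoModel/AutoTokenizer loading pattern are assumptions not stated in this diff; check the linked notebook for the actual setup.

```python
# Minimal sketch (assumed): load TaMillion with Hugging Face transformers.
# The model id "monsoon-nlp/tamillion" is an assumption, not taken from this diff.
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("monsoon-nlp/tamillion")
model = AutoModel.from_pretrained("monsoon-nlp/tamillion")

# Encode a short Tamil sentence and inspect the contextual embeddings.
inputs = tokenizer("தமிழ் ஒரு செம்மொழி.", return_tensors="pt")
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (batch, sequence length, hidden size)
```

From there, the embeddings (or a fine-tuning head on top of them) can be applied to tasks like the news classification and movie review regression reported above.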