pascalrai commited on
Commit
6490298
1 Parent(s): 7090d4e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -3
README.md CHANGED
@@ -37,8 +37,8 @@ This may be due to the base model being trained for emoji classification and the
37
  This model is better if emojis are to be also included for sentiment analysis.
38
  No Evaluation is done for data with only text and no emojis.
39
 
40
- The model was fine-tuned with dataset: mteb/tweet_sentiment_extraction from huggingface
41
- converted to hinglish text.
42
 
43
  The model has a test loss of 0.6 and an f1 score of 0.74 on the unseen data from the dataset.
44
 
@@ -78,4 +78,7 @@ Text: tu mujhe pasandh heh
78
  Negative: 0.01
79
  Neutral: 0.22
80
  Positive: 0.76
81
- ```
 
 
 
 
37
  This model is better if emojis are to be also included for sentiment analysis.
38
  No Evaluation is done for data with only text and no emojis.
39
 
40
+ The model was fine-tuned with the dataset: mteb/tweet_sentiment_extraction from hugging face
41
+ converted to Hinglish text.
42
 
43
  The model has a test loss of 0.6 and an f1 score of 0.74 on the unseen data from the dataset.
44
 
 
78
  Negative: 0.01
79
  Neutral: 0.22
80
  Positive: 0.76
81
+ ```
82
+ Possible Future Direction:
83
+
84
+ 1. Pre-train the Hinglish model with both Hindi, Hinglish, and English datasets. Current tokens for hinlish have very small sizes i.e. low-priority vocabs are used mostly.