Edit model card

Model Details

Model Description

The NusaBERT-base-Indonesian-Plutchik-emotion-analysis-v2 is a model designed to identify and analyze emotions in Indonesian texts based on Plutchik's eight basic emotions: Anticipation, Anger, Disgust, Fear, Joy, Sadness, Surprise, and Trust. This model is developed using the NusaBERT-base and trained using Indonesian tweets categorized into eight emotion categories. The evaluation results of this model can be utilized to analyze emotions in social media, providing insights into users' emotional responses.

Bias

Keep in mind that this model is trained using certain data which may cause bias in the emotion classification process. Therefore, it is important to consider and account for such biases when using this model.

Evaluation Results

The model was trained using the Hyperparameter Tuning technique with Optuna. In this process, Optuna conducted five trials to determine the optimal combination of learning rate (1e-6 to 1e-4) and weight decay (1e-6 to 1e-2). Each trial trained the BERT model with different hyperparameter configurations on the training dataset and then evaluated using the validation dataset. After all the experiments are completed, the best hyperparameter combination is used to train the final model. 

Epoch Training Loss Validation Loss Accuracy F1 Precision Recall
1 0.758400 0.583508 0.829932 0.830203 0.833136 0.829932
2 0.370100 0.394630 0.866213 0.865496 0.870364 0.866213
3 0.231500 0.355294 0.884354 0.884585 0.888140 0.884354
4 0.071000 0.322376 0.902494 0.902801 0.904842 0.902494
5 0.129900 0.308596 0.900227 0.900340 0.902132 0.900227

Citation

@misc{Ardiyanto_Mikhael_2024,
    author    = {Mikhael Ardiyanto},
    title     = {NusaBERT-base-Indonesian-Plutchik-emotion-analysis-v2},
    year      = {2024},
    URL       = {https://huggingface.co/Aardiiiiy/NusaBERT-base-Indonesian-Plutchik-emotion-analysis-v2},
    publisher = {Hugging Face}
}
Downloads last month
14
Safetensors
Model size
111M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.