Update README.md
print(f"{response = }")
```

The `gemma2_inference_hf.py` module is provided for download with the model files.
112 |
|
113 |
### Evaluation
|
114 |
|
115 |
+
Model evaluation metrics and results on test dataset containing 3k samples. Note: The test dataset is purposely withheld due to the
|
116 |
+
nature and sensitivity of the messages.
|
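The reported metrics are Accuracy and AUC. The evaluation script itself is not included here, so as an illustration only, this is a minimal pure-Python sketch of how those two metrics are typically computed from binary labels and model scores (the function names and toy data are hypothetical, not from the withheld test set):

```python
def accuracy(labels, preds):
    """Fraction of predictions matching the true binary labels."""
    return sum(p == y for p, y in zip(preds, labels)) / len(labels)

def roc_auc(labels, scores):
    """AUC via the rank-sum (Mann-Whitney U) formulation: the probability
    that a randomly chosen positive is scored above a randomly chosen
    negative, counting ties as half a win."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Toy example with 6 samples
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.2]
preds = [1 if s >= 0.5 else 0 for s in scores]
print(accuracy(labels, preds))  # 4/6 correct -> ~0.667
print(roc_auc(labels, scores))  # 8 of 9 pos/neg pairs ranked correctly -> ~0.889
```

Because accuracy depends on a decision threshold while AUC is threshold-free, the two numbers can diverge, as they do in the benchmark table below.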
### Benchmark Results

The finetuned Gemma-2 model was evaluated against the GPT-3.5-Turbo, GPT-4o-mini, and GPT-4o models:
| Metric   | gemma-2-2b-it-ud | GPT-3.5-Turbo-1106 | GPT-4o-mini-2024-07-18 | GPT-4o-2024-08-06 |
| -------- | ---------------- | ------------------ | ---------------------- | ----------------- |
| Accuracy | 0.87             | 0.83               | 0.90                   | 0.92              |
| AUC      | 0.84             | 0.83               | 0.91                   | 0.92              |
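For quick comparisons, the values in the table above can be encoded as plain data; a small sketch (numbers copied verbatim from the table):

```python
# Benchmark values from the table above
results = {
    "gemma-2-2b-it-ud": {"Accuracy": 0.87, "AUC": 0.84},
    "GPT-3.5-Turbo-1106": {"Accuracy": 0.83, "AUC": 0.83},
    "GPT-4o-mini-2024-07-18": {"Accuracy": 0.90, "AUC": 0.91},
    "GPT-4o-2024-08-06": {"Accuracy": 0.92, "AUC": 0.92},
}

# Report the top-scoring model for each metric
for metric in ("Accuracy", "AUC"):
    best = max(results, key=lambda m: results[m][metric])
    print(f"{metric}: best = {best} ({results[best][metric]})")
```

As the table shows, the finetuned 2B model trails the larger GPT-4o models but outperforms GPT-3.5-Turbo on both metrics.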
## Usage and Limitations