sethuiyer
/

Dr_Samantha-7b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

sethuiyer commited on Jan 5

Commit

7dbdaab

•

1 Parent(s): b1a643e

Update README.md

Files changed (1) hide show

README.md +12 -5

README.md CHANGED Viewed

@@ -69,11 +69,18 @@ What is your name?
 My name is Samantha.
 ```
-# Doctor Samantha's Performance Evaluation by GPT-4 across 25 random prompts from ChatDoctor-200k Dataset
-## Overall Rating: 83.5/100
-### Pros:
 - Demonstrates extensive medical knowledge through accurate identification of potential causes for various symptoms.
 - Responses consistently emphasize the importance of seeking professional diagnoses and treatments.
@@ -83,7 +90,7 @@ My name is Samantha.
 - Clear and understandable explanations of conditions and treatment options.
 - Prompt responses addressing all aspects of medical inquiries.
-### Cons:
 - Could occasionally place stronger emphasis on urgency when symptoms indicate potential emergencies.
 - Discussion of differential diagnoses could explore a broader range of less common causes.
@@ -92,4 +99,4 @@ My name is Samantha.
 - Consider exploring full medical histories to improve diagnostic context where relevant.
 - Caution levels and risk factors associated with certain conditions could be underscored more.
-Overall, Dr. Samantha performs at a very high level through knowledgeable, empathetic virtual care. While response enhancements could optimize patient support further, she effectively conveys critical medical insights and guidance. The average rating reflects strong competency with room for targeted refinements to achieve maximum ratings. But Dr. Samantha demonstrates the qualities essential for exemplary telehealth services.

 My name is Samantha.
 ```
+## OpenLLM Leaderboard Performance
+| T | Model                            | Average | ARC   | Hellaswag | MMLU  | TruthfulQA | Winogrande | GSM8K |
+|---|----------------------------------|---------|-------|-----------|-------|------------|------------|-------|
+| 1 | sethuiyer/Dr_Samantha-7b         | 52.95   | 53.84 | 77.95     | 47.94 | 45.58      | 73.56      | 18.8  |
+| 2 | togethercomputer/LLaMA-2-7B-32K-Instruct | 50.02   | 51.11 | 78.51     | 46.11 | 44.86      | 73.88      | 5.69  |
+| 3 | togethercomputer/LLaMA-2-7B-32K  | 47.07   | 47.53 | 76.14     | 43.33 | 39.23      | 71.9       | 4.32  |
+## Evaluation by GPT-4 across 25 random prompts from ChatDoctor-200k Dataset
+### Overall Rating: 83.5/100
+#### Pros:
 - Demonstrates extensive medical knowledge through accurate identification of potential causes for various symptoms.
 - Responses consistently emphasize the importance of seeking professional diagnoses and treatments.
 - Clear and understandable explanations of conditions and treatment options.
 - Prompt responses addressing all aspects of medical inquiries.
+#### Cons:
 - Could occasionally place stronger emphasis on urgency when symptoms indicate potential emergencies.
 - Discussion of differential diagnoses could explore a broader range of less common causes.
 - Consider exploring full medical histories to improve diagnostic context where relevant.
 - Caution levels and risk factors associated with certain conditions could be underscored more.