Spaces:
Running
Running
Update src/about.py
Browse files- src/about.py +1 -6
src/about.py
CHANGED
@@ -80,12 +80,7 @@ Addressing the gaps in existing LLM evaluation frameworks, this benchmark is spe
|
|
80 |
This benchmark not only fills a critical gap in Persian LLM evaluation but also provides a standardized leaderboard to track progress in developing aligned, ethical, and culturally aware Persian language models.
|
81 |
|
82 |
### Download Dataset
|
83 |
-
The full dataset is not publicly accessible
|
84 |
-
| Category Name | Accuracy |
|
85 |
-
|------------|-------------|
|
86 |
-
| Fairness | 17% |
|
87 |
-
| Saftey | 8.6% |
|
88 |
-
| Social norm| 74.4% |
|
89 |
|
90 |
## About Our Models
|
91 |
|
|
|
80 |
This benchmark not only fills a critical gap in Persian LLM evaluation but also provides a standardized leaderboard to track progress in developing aligned, ethical, and culturally aware Persian language models.
|
81 |
|
82 |
### Download Dataset
|
83 |
+
The full dataset is not publicly accessible. For research purposes, you may submit your request following the dataset request guidelines.
|
|
|
|
|
|
|
|
|
|
|
84 |
|
85 |
## About Our Models
|
86 |
|