Update README.md
Browse files
README.md
CHANGED
|
@@ -24,6 +24,8 @@ base_model:
|
|
| 24 |
Unlike traditional guardian models that screen for a fixed set of harms (e.g., violence or self-harm) , DynaGuard can enforce bespoke, application-specific rules. This includes scenarios like preventing a customer service bot from mistakenly issuing refunds or ensuring a medical bot avoids giving unauthorized advice.
|
| 25 |
The DynaGuard series achieves state-of-the-art performance across a wide range of safety and compliance benchmarks, with the flagship **DynaGuard-8B** model outperforming other guardian models and even strong generalist models like GPT-4o-mini.
|
| 26 |
|
|
|
|
|
|
|
| 27 |
## Model Details
|
| 28 |
|
| 29 |
* **Developed by:** University of Maryland, Capital One
|
|
@@ -121,6 +123,10 @@ DynaGuard-8B achieves state-of-the-art performance, outperforming other dedicate
|
|
| 121 |
If you use DynaGuard or the DynaBench dataset in your research, please cite our work:
|
| 122 |
```
|
| 123 |
@article{hoover2025dynaguard,
|
| 124 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 125 |
}
|
| 126 |
```
|
|
|
|
| 24 |
Unlike traditional guardian models that screen for a fixed set of harms (e.g., violence or self-harm) , DynaGuard can enforce bespoke, application-specific rules. This includes scenarios like preventing a customer service bot from mistakenly issuing refunds or ensuring a medical bot avoids giving unauthorized advice.
|
| 25 |
The DynaGuard series achieves state-of-the-art performance across a wide range of safety and compliance benchmarks, with the flagship **DynaGuard-8B** model outperforming other guardian models and even strong generalist models like GPT-4o-mini.
|
| 26 |
|
| 27 |
+
[https://arxiv.org/abs/2509.02563](https://arxiv.org/abs/2509.02563)
|
| 28 |
+
|
| 29 |
## Model Details
|
| 30 |
|
| 31 |
* **Developed by:** University of Maryland, Capital One
|
|
|
|
| 123 |
If you use DynaGuard or the DynaBench dataset in your research, please cite our work:
|
| 124 |
```
|
| 125 |
@article{hoover2025dynaguard,
|
| 126 |
+
title={DynaGuard: A Dynamic Guardrail Model With User-Defined Policies},
|
| 127 |
+
author={Monte Hoover and Vatsal Baherwani and Neel Jain and Khalid Saifullah and Joseph Vincent and Chirag Jain and Melissa Kazemi Rad and C. Bayan Bruss and Ashwinee Panda and Tom Goldstein},
|
| 128 |
+
journal={arXiv preprint},
|
| 129 |
+
year={2025},
|
| 130 |
+
url={https://arxiv.org/abs/2509.02563},
|
| 131 |
}
|
| 132 |
```
|