bfcl

dvilasuero (HF Staff) committed 11 days ago

Commit 0825398 (verified) · 1 parent: 7652fd3

Upload README.md with huggingface_hub

Files changed (1): README.md

@@ -8,16 +8,36 @@ sdk_version: "latest"
 pinned: false
 ---
-# Inspect Evals/bfcl
-Live log viewer for eval results stored in [dvilasuero/bfcl](https://huggingface.co/dvilasuero/bfcl).
-This Space runs `inspect view` to display real-time evaluation logs from the dataset.
-## View Logs
-Logs are automatically displayed from: `hf://datasets/dvilasuero/bfcl/logs`
-## Dataset
-Results are stored in: [dvilasuero/bfcl](https://huggingface.co/dvilasuero/bfcl)
+# bfcl
+This eval was run using [evaljobs](https://github.com/dvsrepo/evaljobs).
+## Command
+```bash
+evaljobs inspect_evals/bfcl \
+  --model hf-inference-providers/meta-llama/Llama-3.1-8B-Instruct \
+  --name bfcl
+```
+## Run with other models
+To run this eval with a different model, use:
+```bash
+evaljobs inspect_evals/bfcl \
+  --model <your-model> \
+  --name <your-name> \
+  --flavor cpu-basic
+```
+## Inspect eval command
+The eval was executed with:
+```bash
+inspect eval inspect_evals/bfcl \
+  --model hf-inference-providers/meta-llama/Llama-3.1-8B-Instruct \
+  --log-shared \
+  --log-buffer 100
+```
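
The "Run with other models" command above can be repeated over several models. The following is a minimal sketch, not part of the commit: it uses only the `--model`, `--name`, and `--flavor` flags shown in the README. The second model id, the `run_name` helper, the `bfcl-<model>` naming scheme, and the `DRY_RUN` switch are all hypothetical, and whether `evaljobs` accepts dots in `--name` is an assumption worth checking before a real run.

```bash
set -eu

# Hypothetical helper: build a run name from the last path component
# of the model id, lowercased (e.g. "org/Foo-1B" -> "bfcl-foo-1b").
run_name() {
  base="${1##*/}"
  printf 'bfcl-%s\n' "$(printf '%s' "$base" | tr '[:upper:]' '[:lower:]')"
}

# The second model id is a placeholder; replace it with a real one.
for model in \
  "hf-inference-providers/meta-llama/Llama-3.1-8B-Instruct" \
  "hf-inference-providers/example-org/example-model"
do
  name="$(run_name "$model")"
  if [ "${DRY_RUN:-1}" = "1" ]; then
    # Default: only print the command that would be run.
    printf 'evaljobs inspect_evals/bfcl --model %s --name %s --flavor cpu-basic\n' "$model" "$name"
  else
    # DRY_RUN=0: actually launch the job with the flags from the README.
    evaljobs inspect_evals/bfcl --model "$model" --name "$name" --flavor cpu-basic
  fi
done
```

By default the script only prints the commands, so the list can be reviewed before anything is launched. Setting `DRY_RUN=0` runs them.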