dvilasuero HF Staff committed on
Commit 0825398 · verified · 1 Parent(s): 7652fd3

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +27 -7
README.md CHANGED
@@ -8,16 +8,36 @@ sdk_version: "latest"
  pinned: false
  ---

- # Inspect Evals/bfcl
+ # bfcl

- Live log viewer for eval results stored in [dvilasuero/bfcl](https://huggingface.co/dvilasuero/bfcl).
+ This eval was run using [evaljobs](https://github.com/dvsrepo/evaljobs).

- This Space runs `inspect view` to display real-time evaluation logs from the dataset.
+ ## Command

- ## View Logs
+ ```bash
+ evaljobs inspect_evals/bfcl \
+ --model hf-inference-providers/meta-llama/Llama-3.1-8B-Instruct \
+ --name bfcl
+ ```

- Logs are automatically displayed from: `hf://datasets/dvilasuero/bfcl/logs`
+ ## Run with other models

- ## Dataset
+ To run this eval with a different model, use:

- Results are stored in: [dvilasuero/bfcl](https://huggingface.co/dvilasuero/bfcl)
+ ```bash
+ evaljobs inspect_evals/bfcl \
+ --model <your-model> \
+ --name <your-name> \
+ --flavor cpu-basic
+ ```
+
+ ## Inspect eval command
+
+ The eval was executed with:
+
+ ```bash
+ inspect eval inspect_evals/bfcl \
+ --model hf-inference-providers/meta-llama/Llama-3.1-8B-Instruct \
+ --log-shared \
+ --log-buffer 100
+ ```
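
The "Run with other models" template in the new README leaves `<your-model>` and `<your-name>` as placeholders. As a rough illustration of how they might be filled in, the sketch below only prints the resulting command rather than running it. The `build_cmd` helper and the `bfcl-<model basename>` run-name scheme are hypothetical and not part of `evaljobs`; only the `--model`, `--name`, and `--flavor` flags are taken from the diff above, and whether `evaljobs` accepts every generated name is an assumption.

```shell
#!/usr/bin/env bash
# Dry-run sketch: print an evaljobs command instead of executing it.
# Flags (--model, --name, --flavor) come from the README in this commit.
# The helper name and the run-name scheme are hypothetical conventions.

build_cmd() {
  local model="$1"
  local flavor="${2:-cpu-basic}"   # default flavor shown in the README template

  # Use the last path component of the model id, lowercased, as the run name.
  local base="${model##*/}"
  local name
  name="bfcl-$(printf '%s' "$base" | tr '[:upper:]' '[:lower:]')"

  printf 'evaljobs inspect_evals/bfcl --model %s --name %s --flavor %s\n' \
    "$model" "$name" "$flavor"
}

# Print the command for the model used in this commit.
build_cmd "hf-inference-providers/meta-llama/Llama-3.1-8B-Instruct"
```

Printing first makes it possible to review the command before submitting a job; once it looks right, the printed line can be copied into a terminal.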