DavidGF committed on
Commit
37127c4
1 Parent(s): 4a77598

Update README.md

Files changed (1)
  1. README.md +7 -6
README.md CHANGED
@@ -334,15 +334,16 @@ Llama-3-SauerkrautLM-8b-Instruct 7.65625
 ```
 
 **German RAG LLM Evaluation**
+corrected result after FIX: https://github.com/huggingface/lighteval/pull/171
 ```
 |                         Task                         |Version|Metric|Value|   |Stderr|
 |------------------------------------------------------|------:|------|----:|---|-----:|
-|all                                                   |       |acc   |0.905|±  |0.0086|
-|community:german_rag_eval:_average:0                  |       |acc   |0.905|±  |0.0086|
-|community:german_rag_eval:choose_context_by_question:0|      0|acc   |0.896|±  |0.0097|
-|community:german_rag_eval:choose_question_by_context:0|      0|acc   |0.826|±  |0.0120|
-|community:german_rag_eval:context_question_match:0    |      0|acc   |0.987|±  |0.0036|
-|community:german_rag_eval:question_answer_match:0     |      0|acc   |0.911|±  |0.0090|
+|all                                                   |       |acc   |0.910|±  |0.0084|
+|community:german_rag_eval:_average:0                  |       |acc   |0.910|±  |0.0084|
+|community:german_rag_eval:choose_context_by_question:0|      0|acc   |0.928|±  |0.0082|
+|community:german_rag_eval:choose_question_by_context:0|      0|acc   |0.824|±  |0.0120|
+|community:german_rag_eval:context_question_match:0    |      0|acc   |0.982|±  |0.0042|
+|community:german_rag_eval:question_answer_match:0     |      0|acc   |0.906|±  |0.0092|
 ```
 
 ## Disclaimer
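
As a sanity check on the numbers in this diff, the `_average` and `all` rows appear to be the unweighted mean of the four per-task rows, for both accuracy and stderr. The sketch below recomputes them from the values copied out of the diff. The aggregation rule is an assumption inferred from the numbers, not taken from the lighteval source, and the helper name `macro_average` is hypothetical.

```python
# Per-task (accuracy, stderr) pairs copied from the diff: before and after the fix.
before = {
    "choose_context_by_question": (0.896, 0.0097),
    "choose_question_by_context": (0.826, 0.0120),
    "context_question_match": (0.987, 0.0036),
    "question_answer_match": (0.911, 0.0090),
}
after = {
    "choose_context_by_question": (0.928, 0.0082),
    "choose_question_by_context": (0.824, 0.0120),
    "context_question_match": (0.982, 0.0042),
    "question_answer_match": (0.906, 0.0092),
}


def macro_average(rows):
    """Unweighted mean of accuracy and of stderr across tasks.

    Assumption: this mirrors how the `_average` row is aggregated;
    it is inferred from the reported values, not from the tool's code.
    """
    accs = [acc for acc, _ in rows.values()]
    errs = [err for _, err in rows.values()]
    return sum(accs) / len(accs), sum(errs) / len(errs)


for label, rows in (("before", before), ("after", after)):
    acc, err = macro_average(rows)
    print(f"{label}: acc={acc:.4f} stderr={err:.6f}")
```

Under that assumption the recomputed accuracies are 0.905 before and 0.910 after, and the mean stderr is roughly 0.0086 before and 0.0084 after, which agrees with the reported `_average` rows to the printed precision. Most of the change in the average comes from `choose_context_by_question` (0.896 to 0.928); the other three tasks move by at most 0.005.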