Update README.md
Browse files
README.md
CHANGED
@@ -334,15 +334,16 @@ Llama-3-SauerkrautLM-8b-Instruct 7.65625
|
|
334 |
```
|
335 |
|
336 |
**German RAG LLM Evaluation**
|
|
|
337 |
```
|
338 |
| Task |Version|Metric|Value| |Stderr|
|
339 |
|------------------------------------------------------|------:|------|----:|---|-----:|
|
340 |
-
|all | |acc |0.
|
341 |
-
|community:german_rag_eval:_average:0 | |acc |0.
|
342 |
-
|community:german_rag_eval:choose_context_by_question:0| 0|acc |0.
|
343 |
-
|community:german_rag_eval:choose_question_by_context:0| 0|acc |0.
|
344 |
-
|community:german_rag_eval:context_question_match:0 | 0|acc |0.
|
345 |
-
|community:german_rag_eval:question_answer_match:0 | 0|acc |0.
|
346 |
```
|
347 |
|
348 |
## Disclaimer
|
|
|
334 |
```
|
335 |
|
336 |
**German RAG LLM Evaluation**
|
337 |
+
corrected result after FIX: https://github.com/huggingface/lighteval/pull/171
|
338 |
```
|
339 |
| Task |Version|Metric|Value| |Stderr|
|
340 |
|------------------------------------------------------|------:|------|----:|---|-----:|
|
341 |
+
|all | |acc |0.910|± |0.0084|
|
342 |
+
|community:german_rag_eval:_average:0 | |acc |0.910|± |0.0084|
|
343 |
+
|community:german_rag_eval:choose_context_by_question:0| 0|acc |0.928|± |0.0082|
|
344 |
+
|community:german_rag_eval:choose_question_by_context:0| 0|acc |0.824|± |0.0120|
|
345 |
+
|community:german_rag_eval:context_question_match:0 | 0|acc |0.982|± |0.0042|
|
346 |
+
|community:german_rag_eval:question_answer_match:0 | 0|acc |0.906|± |0.0092|
|
347 |
```
|
348 |
|
349 |
## Disclaimer
|