llmware/dragon-llama-7b-v0

doberst committed on Nov 15, 2023

Commit 49f3a6c • 1 Parent(s): 7864fe7

Update README.md

Files changed (1)

README.md +4 -10

@@ -16,10 +16,10 @@ DRAGON models are fine-tuned with high-quality custom instruct datasets, designe
 Evaluated against the benchmark test:   [RAG-Instruct-Benchmark-Tester](https://www.huggingface.co/datasets/llmware/rag_instruct_benchmark_tester)
 Average of 2 Test Runs with 1 point for correct answer, 0.5 point for partial correct or blank / NF, 0.0 points for incorrect, and -1 points for hallucinations.
---**Accuracy Score**:  **99.0** correct out of 100
---Not Found Classification:  95.0%
---Boolean:  82.5%
---Math/Logic:  70.0%
+--**Accuracy Score**:  **97.25** correct out of 100
+--Not Found Classification:  92.50%
+--Boolean:  95.00%
+--Math/Logic:  63.75%
 --Complex Questions (1-5):  4 (Low-Medium)
 --Summarization Quality (1-5):  4 (Coherent, extractive)
 --Hallucinations:  No hallucinations observed in test runs.
@@ -113,12 +113,6 @@ If you are using a HuggingFace generation script:
     output_only = tokenizer.decode(outputs[0][start_of_output:],skip_special_tokens=True)
-    #   note: due to artifact of the fine-tuning, use this post-processing with HF generation
-    eot = output_only.find("<|endoftext|>")
-    if eot > -1:
-        output_only = output_only[:eot]
 ## Model Card Contact
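
The scoring rule quoted in the README text (1 point for a correct answer, 0.5 for a partial or blank / not-found answer, 0 for an incorrect one, -1 for a hallucination, averaged over 2 test runs) can be sketched roughly as below. This is only an illustration of the arithmetic, not the benchmark's actual scoring script: the label names, the `score_run` and `average_score` helpers, and the sample runs are all hypothetical.

```python
from statistics import mean

# Point values as described in the README; the label names are hypothetical.
POINTS = {
    "correct": 1.0,
    "partial": 0.5,
    "blank_or_not_found": 0.5,
    "incorrect": 0.0,
    "hallucination": -1.0,
}


def score_run(labels):
    """Sum the points for one test run, given one label per question."""
    return sum(POINTS[label] for label in labels)


def average_score(runs):
    """Average the per-run totals across all test runs."""
    return mean(score_run(labels) for labels in runs)


# Illustrative labels only; these are not actual benchmark results.
run_a = ["correct"] * 97 + ["partial"] * 2 + ["incorrect"]
run_b = ["correct"] * 96 + ["partial"] * 2 + ["incorrect"] * 2

print(average_score([run_a, run_b]))
```

Under this reading, a fractional score such as 97.25 can arise from half-point answers combined with averaging over the two runs.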