llmware
/

dragon-stablelm-7b-v0

Text Generation

Model card Files Files and versions Community

doberst commited on Nov 6, 2023

Commit

6e16d34

•

1 Parent(s): 78162d0

Update README.md

Files changed (1) hide show

README.md +8 -8

README.md CHANGED Viewed

@@ -6,7 +6,7 @@ license: apache-2.0
 <!-- Provide a quick summary of what the model is/does. -->
-dragon-llama-7b-0.1 part of the dRAGon ("Delivering RAG On Private Cloud") model series, RAG-instruct trained on top of a LLama-2 base model.
 DRAGON models are fine-tuned with high-quality custom instruct datasets, designed for production quality use in RAG scenarios.
@@ -16,11 +16,11 @@ DRAGON models are fine-tuned with high-quality custom instruct datasets, designe
 Evaluated against the benchmark test:   [RAG-Instruct-Benchmark-Tester](https://www.huggingface.co/datasets/llmware/rag_instruct_benchmark_tester)
 Average of 2 Test Runs with 1 point for correct answer, 0.5 point for partial correct or blank / NF, 0.0 points for incorrect, and -1 points for hallucinations.
---**Accuracy Score**:  **99.0** correct out of 100
---Not Found Classification:  95.0%
---Boolean:  82.5%
---Math/Logic:  70.0%
---Complex Questions (1-5):  4 (Low-Medium)
 --Summarization Quality (1-5):  4 (Coherent, extractive)
 --Hallucinations:  No hallucinations observed in test runs.
@@ -31,10 +31,10 @@ For test run results (and good indicator of target use cases), please see the fi
 <!-- Provide a longer summary of what this model is. -->
 - **Developed by:** llmware
-- **Model type:** LLama-2
 - **Language(s) (NLP):** English
 - **License:** Apache 2.0
-- **Finetuned from model:** Llama-2-7B-Base
 ## Uses

 <!-- Provide a quick summary of what the model is/does. -->
+dragon-llama-7b-0.1 part of the dRAGon ("Delivering RAG On Private Cloud") model series, RAG-instruct trained on top of a StableLM-7B base model.
 DRAGON models are fine-tuned with high-quality custom instruct datasets, designed for production quality use in RAG scenarios.
 Evaluated against the benchmark test:   [RAG-Instruct-Benchmark-Tester](https://www.huggingface.co/datasets/llmware/rag_instruct_benchmark_tester)
 Average of 2 Test Runs with 1 point for correct answer, 0.5 point for partial correct or blank / NF, 0.0 points for incorrect, and -1 points for hallucinations.
+--**Accuracy Score**:  **96.25** correct out of 100
+--Not Found Classification:  45.0%
+--Boolean:  81.25%
+--Math/Logic:  57.50%
+--Complex Questions (1-5):  3 (Low-Medium)
 --Summarization Quality (1-5):  4 (Coherent, extractive)
 --Hallucinations:  No hallucinations observed in test runs.
 <!-- Provide a longer summary of what this model is. -->
 - **Developed by:** llmware
+- **Model type:** StableLM-7B
 - **Language(s) (NLP):** English
 - **License:** Apache 2.0
+- **Finetuned from model:** StableLM-7B
 ## Uses