llmware
/

dragon-yi-9b-gguf

Model card Files Files and versions

doberst commited on Aug 22, 2024

Commit

efcd7b9

·

verified ·

1 Parent(s): 0b4bdf2

Update README.md

Files changed (1) hide show

README.md +17 -0

README.md CHANGED Viewed

@@ -10,6 +10,23 @@ license: other
 [**dragon-yi-6b**](https://huggingface.co/llmware/dragon-yi-6b-v0) is a fact-based question-answering model, optimized for complex business documents.
 To pull the model via API:
     from huggingface_hub import snapshot_download

 [**dragon-yi-6b**](https://huggingface.co/llmware/dragon-yi-6b-v0) is a fact-based question-answering model, optimized for complex business documents.
+## Benchmark Tests
+Evaluated against the benchmark test: RAG-Instruct-Benchmark-Tester
+1 Test Run (temperature=0.0, sample=False) with 1 point for correct answer, 0.5 point for partial correct or blank / NF, 0.0 points for incorrect, and -1 points for hallucinations.
+--Accuracy Score: 98.0 correct out of 100
+--Not Found Classification: 90.0%
+--Boolean: 97.5%
+--Math/Logic: 95%
+--Complex Questions (1-5): 5 (Very Strong)
+--Summarization Quality (1-5): 4 (Above Average)
+--Hallucinations: No hallucinations observed in test runs.
+For test run results (and good indicator of target use cases), please see the files ("core_rag_test" and "answer_sheet" in this repo).
 To pull the model via API:
     from huggingface_hub import snapshot_download