Spaces:
Runtime error
Runtime error
Update README.md
Browse files
README.md
CHANGED
|
@@ -10,16 +10,18 @@ pinned: false
|
|
| 10 |
license: mit
|
| 11 |
---
|
| 12 |
|
|
|
|
|
|
|
| 13 |
This is the Inference module of a 3-part FTI feature-training-inference RAG-framework LLMOps course. \
|
| 14 |
-
In this iteration, I've replaced Falcon 7B Instruct with the currently-SoTa (Jan '24) Mistral-7B-Instruct-v0.2
|
| 15 |
-
fine-tuned using Unsloth on financial questions and answers generated with the help of GPT-4, quantized \
|
| 16 |
and augmented with a 4bit QLoRa. \
|
| 17 |
\
|
| 18 |
-
Prompt analysis and model registry is handled by Comet LLM
|
| 19 |
-
by Bytewax
|
| 20 |
most relevant news article to provide answers with real-time finance information embedded within the output. \
|
| 21 |
\
|
| 22 |
-
|
| 23 |
|
| 24 |
I have contributed to the original MIT licensed (ka-ching!) course which can be found here: \
|
| 25 |
[https://medium.com/decoding-ml/the-llms-kit-build-a-production-ready-real-time-financial-advisor-system-using-streaming-ffdcb2b50714]
|
|
|
|
| 10 |
license: mit
|
| 11 |
---
|
| 12 |
|
| 13 |
+
## Friendly Fincancial Bot
|
| 14 |
+
\
|
| 15 |
This is the Inference module of a 3-part FTI feature-training-inference RAG-framework LLMOps course. \
|
| 16 |
+
In this iteration, I've **replaced Falcon 7B Instruct with the currently-SoTa (Jan '24) Mistral-7B-Instruct-v0.2**, \
|
| 17 |
+
fine-tuned using **Unsloth** on financial questions and answers generated with the help of GPT-4, quantized \
|
| 18 |
and augmented with a 4bit QLoRa. \
|
| 19 |
\
|
| 20 |
+
Prompt analysis and model registry is handled by **Comet LLM**, and finance news is pulled via an **Alpaca API**, processed \
|
| 21 |
+
by **Bytewax**, and then sent as a vector embedding to **Qdrant**'s serverless vector store. **LangChain** chains the prompt and \
|
| 22 |
most relevant news article to provide answers with real-time finance information embedded within the output. \
|
| 23 |
\
|
| 24 |
+
**#TODO:** Add citations to output to show end-user which article has been used to generate the output.
|
| 25 |
|
| 26 |
I have contributed to the original MIT licensed (ka-ching!) course which can be found here: \
|
| 27 |
[https://medium.com/decoding-ml/the-llms-kit-build-a-production-ready-real-time-financial-advisor-system-using-streaming-ffdcb2b50714]
|