PlantBasedTen commited on
Commit
e689974
1 Parent(s): 3c02c06

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -5
README.md CHANGED
@@ -10,16 +10,18 @@ pinned: false
10
  license: mit
11
  ---
12
 
 
 
13
  This is the Inference module of a 3-part FTI feature-training-inference RAG-framework LLMOps course. \
14
- In this iteration, I've replaced Falcon 7B Instruct with the currently-SoTa (Jan '24) Mistral-7B-Instruct-v0.2, \
15
- fine-tuned using Unsloth on financial questions and answers generated with the help of GPT-4, quantized \
16
  and augmented with a 4bit QLoRa. \
17
  \
18
- Prompt analysis and model registry is handled by Comet LLM, and finance news is pulled via an Alpaca API, processed \
19
- by Bytewax, and then sent as a vector embedding to Qdrant's serverless vector store. LangChain chains the prompt and \
20
  most relevant news article to provide answers with real-time finance information embedded within the output. \
21
  \
22
- #TODO: Add citations to output to show end-user which article has been used to generate the output.
23
 
24
  I have contributed to the original MIT licensed (ka-ching!) course which can be found here: \
25
  [https://medium.com/decoding-ml/the-llms-kit-build-a-production-ready-real-time-financial-advisor-system-using-streaming-ffdcb2b50714]
 
10
  license: mit
11
  ---
12
 
13
+ ## Friendly Fincancial Bot
14
+ \
15
  This is the Inference module of a 3-part FTI feature-training-inference RAG-framework LLMOps course. \
16
+ In this iteration, I've **replaced Falcon 7B Instruct with the currently-SoTa (Jan '24) Mistral-7B-Instruct-v0.2**, \
17
+ fine-tuned using **Unsloth** on financial questions and answers generated with the help of GPT-4, quantized \
18
  and augmented with a 4bit QLoRa. \
19
  \
20
+ Prompt analysis and model registry is handled by **Comet LLM**, and finance news is pulled via an **Alpaca API**, processed \
21
+ by **Bytewax**, and then sent as a vector embedding to **Qdrant**'s serverless vector store. **LangChain** chains the prompt and \
22
  most relevant news article to provide answers with real-time finance information embedded within the output. \
23
  \
24
+ **#TODO:** Add citations to output to show end-user which article has been used to generate the output.
25
 
26
  I have contributed to the original MIT licensed (ka-ching!) course which can be found here: \
27
  [https://medium.com/decoding-ml/the-llms-kit-build-a-production-ready-real-time-financial-advisor-system-using-streaming-ffdcb2b50714]