PlantBasedTen committed on
Commit
b1deddd
1 Parent(s): 11694f8

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -14,8 +14,8 @@ license: mit
 
  This is the **Inference component** of a 3-part **prod-ready** FTI feature-training-inference **RAG-framework LLMOps** course. \
  In this iteration, I've **replaced Falcon 7B Instruct** with the **currently-SoTa (Jan'24) Mistral-7B-Instruct-v0.2**, \
- fine-tuned using **Unsloth** on financial questions and answers generated with the help of GPT-4, quantized \
- and augmented with a 4bit QLoRa. \
+ fine-tuned using **Unsloth** on an expanded dataset of financial questions and answers generated with the help of GPT-4,
+ quantized and augmented with a 4bit QLoRa. \
  \
  Prompt analysis and model registry is handled by **Comet LLM**, and finance news is streamed via **Bytewax** using an \
  **Alpaca API**, and then sent as a vector embedding to **Qdrant**'s serverless vector store. **LangChain** chains the prompt and \