lucifertrj committed
Commit 735c151 • 1 Parent(s): f9149b6
Update README.md

README.md CHANGED
@@ -1,17 +1,20 @@
 ---
 license: apache-2.0
+pipeline_tag: text-generation
 ---
 
 <p align="center" style="font-size:34px;"><b>Buddhi 7B</b></p>
 
 # Buddhi-7B vLLM Inference: [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/11_8W8FpKK-856QdRVJLyzbu9g-DMxNfg?usp=sharing)
 
-
+## Model Description
 
 <!-- Provide a quick summary of what the model is/does. -->
 
 Buddhi is a general-purpose chat model, meticulously fine-tuned from Mistral 7B Instruct and optimised to handle an extended context length of up to 128,000 tokens using the [YaRN (Yet another RoPE extensioN)](https://arxiv.org/abs/2309.00071) technique. This enhancement allows Buddhi to maintain a deeper understanding of context in long documents or conversations, making it particularly adept at tasks requiring extensive context retention, such as comprehensive document summarization, detailed narrative generation, and intricate question answering.
 
+## Dataset Creation
+
 ## Architecture
 
 ### Hardware requirements:
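The card's title points readers to vLLM for serving the model. As a minimal sketch of what that looks like, assuming the hub id `aiplanet/Buddhi-128K-Chat` (taken from the citation URL at the end of the card) and illustrative sampling settings of my own:

```python
# Minimal sketch (not from the card): serving Buddhi with vLLM.
# The hub id is assumed from the citation URL; sampling settings are illustrative.
from vllm import LLM, SamplingParams

llm = LLM(model="aiplanet/Buddhi-128K-Chat", max_model_len=128000)
params = SamplingParams(temperature=0.7, max_tokens=256)

# Mistral-style instruction wrapping, per the prompt-template section below.
outputs = llm.generate(["[INST] Summarize this document: ... [/INST]"], params)
print(outputs[0].outputs[0].text)
```

A 128K-token window implies a correspondingly large KV cache; on smaller GPUs, `max_model_len` can be lowered without reloading weights differently.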
@@ -114,7 +117,15 @@ Why don't scientists trust atoms?
 Because they make up everything.
 ```
 
-##
+## Evaluation
+
+| Model                             | HellaSWAG | ARC-Challenge | MMLU  | TruthfulQA | Winogrande |
+|-----------------------------------|-----------|---------------|-------|------------|------------|
+| Buddhi-128K-Chat                  | 82.78     | 57.51         | 57.39 | 55.44      | 78.37      |
+| NousResearch/Yarn-Mistral-7b-128k | 80.58     | 58.87         | 60.64 | 42.46      | 72.85      |
+
+
+## Prompt Template for Buddhi-128K-Chat
 
 In order to leverage instruction fine-tuning, your prompt should be surrounded by [INST] and [/INST] tokens. The very first instruction should begin with a begin-of-sentence (BOS) token id; subsequent instructions should not. The assistant generation will be ended by the end-of-sentence (EOS) token id.
 
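The Evaluation table added above covers the standard Open LLM Leaderboard tasks. The card does not state how the numbers were produced; one common route is EleutherAI's lm-evaluation-harness (v0.4+), sketched here under that assumption, with the same assumed hub id:

```python
# Illustrative only: the card does not name the harness, version, or
# few-shot settings behind the table, so these defaults will not
# necessarily reproduce the reported numbers.
from lm_eval import simple_evaluate

results = simple_evaluate(
    model="hf",
    model_args="pretrained=aiplanet/Buddhi-128K-Chat",  # assumed hub id
    tasks=["hellaswag", "arc_challenge", "mmlu", "truthfulqa_mc2", "winogrande"],
)
print(results["results"])
```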
@@ -124,18 +135,8 @@ In order to leverage instruction fine-tuning, your prompt should be surrounded b
 "[INST] Do you have mayonnaise recipes? [/INST]"
 
 ```
-## Key Features:
-
-Precision and Efficiency: The model is tailored for accuracy, ensuring your code is not just functional but also efficient.
-
-Unleash Creativity: Whether you're a novice or an expert coder, Panda-Coder is here to support your coding journey, offering creative solutions to your programming challenges.
-
-Evol Instruct Code: It's built on the robust Evol Instruct Code 80k-v1 dataset, guaranteeing top-notch code generation.
-
-What's Next?: We believe in continuous improvement and are excited to announce that in our next release, Panda-Coder will be enhanced with a custom dataset. This dataset will not only expand the language support but also include hardware programming languages like MATLAB, Embedded C, and Verilog.
-
 
-
+## Get in Touch
 
 You can schedule a 1:1 meeting with our DevRel & Community Team to get started with AI Planet Open Source LLMs and GenAI Stack. Schedule the call here: [https://calendly.com/jaintarun](https://calendly.com/jaintarun)
 
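To make the prompt rules above concrete (BOS only before the first instruction, EOS closing each completed assistant turn), here is a small hypothetical helper; `<s>` and `</s>` stand in for the Mistral special tokens, which a tokenizer would normally add as token ids rather than literal strings:

```python
# Hypothetical helper (not from the card) applying the stated rules:
# BOS once at the start, [INST] ... [/INST] around every user turn,
# EOS after each completed assistant reply.
def build_prompt(turns, bos="<s>", eos="</s>"):
    """turns: list of (user_message, assistant_reply or None) pairs."""
    prompt = bos
    for user, assistant in turns:
        prompt += f"[INST] {user} [/INST]"
        if assistant is not None:
            prompt += f" {assistant}{eos}"
    return prompt

print(build_prompt([
    ("What is your favourite condiment?", "I'm quite partial to mayonnaise."),
    ("Do you have mayonnaise recipes?", None),
]))
```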
@@ -153,8 +154,8 @@ In order to leverage instruction fine-tuning, your prompt should be surrounded b
 ### Citation
 
 ```
 @misc {Chaitanya890,
-  author = { {Chaitanya Singhal} },
+  author = { {Chaitanya Singhal} and {Tarun Jain} },
   title = { Buddhi-128k-Chat by AI Planet},
   year = 2024,
   url = { https://huggingface.co/aiplanet/Buddhi-128K-Chat },
|