---
tags:
- axolotl
- generated_from_trainer
base_model: openchat/openchat-3.5-0106
datasets:
- hendrycks/competition_math
- allenai/ai2_arc
- camel-ai/physics
- camel-ai/chemistry
- camel-ai/biology
- camel-ai/math
- STEM-AI-mtl/Electrical-engineering
- openbookqa
- piqa
- metaeval/reclor
- mandyyyyii/scibench
- derek-thomas/ScienceQA
- sciq
- TIGER-Lab/ScienceEval
---
23 |
+
(logo)
|
24 |
|
25 |
+
# 🔬👩🔬 Newton-7B
|
26 |
+
|
27 |
+
This model is a fine-tuned version of [openchat/openchat-3.5-0106](https://huggingface.co/openchat/openchat-3.5-0106) on datasets related to science.
|
28 |
+
|
29 |
+
This model is fine-tuned using [QLoRa](https://arxiv.org/abs/2305.14314) and [axolotl](https://github.com/OpenAccess-AI-Collective/axolotl).
|
30 |
+
|
31 |
+
This model's training was sponsored by [sablo.ai](https://sablo.ai).
|
32 |
|
|
|
33 |
<details><summary>See axolotl config</summary>
|
34 |
|
35 |
+
axolotl version: `0.3.0`
|
36 |
```yaml
|
37 |
base_model: openchat/openchat-3.5-0106
|
38 |
model_type: MistralForCausalLM
|
|
|
134 |
tokens:
|
135 |
- "<|end_of_turn|>"
|
136 |
- "<|pad_0|>"
|
137 |
+
```
|
138 |
+
|
139 |
+
</details><br>
|
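The collapsed config above omits most of the training settings. For readers new to axolotl, a QLoRA run is typically declared with keys like the following; these are illustrative placeholder values, not the actual (elided) settings used for Newton-7B:

```yaml
# Illustrative axolotl QLoRA keys — placeholder values, NOT this model's actual config
adapter: qlora
load_in_4bit: true
lora_r: 32
lora_alpha: 16
lora_dropout: 0.05
lora_target_linear: true
```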
# 📊 Datasets

The following datasets were used to fine-tune this model:

- [MATH](https://huggingface.co/datasets/hendrycks/competition_math)
- [ARC](https://huggingface.co/datasets/allenai/ai2_arc) (Note: only the **train** split)
- [camel-ai/physics](https://huggingface.co/datasets/camel-ai/physics)
- [camel-ai/chemistry](https://huggingface.co/datasets/camel-ai/chemistry)
- [camel-ai/biology](https://huggingface.co/datasets/camel-ai/biology)
- [camel-ai/math](https://huggingface.co/datasets/camel-ai/math)
- [STEM-AI-mtl/Electrical-engineering](https://huggingface.co/datasets/STEM-AI-mtl/Electrical-engineering)
- [openbookqa](https://huggingface.co/datasets/openbookqa)
- [piqa](https://huggingface.co/datasets/piqa)
- [reclor](https://huggingface.co/datasets/metaeval/reclor)
- [scibench](https://github.com/mandyyyyii/scibench)
- [ScienceQA](https://huggingface.co/datasets/derek-thomas/ScienceQA)
- [sciq](https://huggingface.co/datasets/sciq)
- [ScienceEval](https://huggingface.co/datasets/TIGER-Lab/ScienceEval)
# 💬 Prompt Template

You can use this prompt template with the model:

### GPT4 Correct [(Openchat)](https://huggingface.co/openchat/openchat-3.5-0106#conversation-templates)

```
GPT4 Correct User: {user}<|end_of_turn|>GPT4 Correct Assistant: {assistant}<|end_of_turn|>GPT4 Correct User: {user}<|end_of_turn|>GPT4 Correct Assistant:
```

You can also use the chat template method from the tokenizer config, like so:

```python
messages = [
    {"role": "user", "content": "Hello"},
    {"role": "assistant", "content": "Hi"},
    {"role": "user", "content": "How are you today?"}
]
tokens = tokenizer.apply_chat_template(messages, add_generation_prompt=True)
```
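To make the template concrete, the string that the chat template builds for a message list can be sketched in pure Python. `render_gpt4_correct` below is a hypothetical helper written for this card, not part of transformers; in practice `tokenizer.apply_chat_template` produces the same format and tokenizes it in one step:

```python
def render_gpt4_correct(messages, add_generation_prompt=True):
    """Render a message list into the GPT4 Correct prompt format shown above."""
    role_names = {"user": "GPT4 Correct User", "assistant": "GPT4 Correct Assistant"}
    prompt = ""
    for m in messages:
        # Each turn is "<role name>: <content>" terminated by the end-of-turn token.
        prompt += f"{role_names[m['role']]}: {m['content']}<|end_of_turn|>"
    if add_generation_prompt:
        # Open an assistant turn so the model continues from here.
        prompt += "GPT4 Correct Assistant:"
    return prompt

messages = [
    {"role": "user", "content": "Hello"},
    {"role": "assistant", "content": "Hi"},
    {"role": "user", "content": "How are you today?"}
]
print(render_gpt4_correct(messages))
# → GPT4 Correct User: Hello<|end_of_turn|>GPT4 Correct Assistant: Hi<|end_of_turn|>GPT4 Correct User: How are you today?<|end_of_turn|>GPT4 Correct Assistant:
```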
# 🤝 Acknowledgments

Thanks to [@jondurbin](https://hf.co/jondurbin) for the code that reformats some of these datasets: [bagel/data_sources](https://github.com/jondurbin/bagel/tree/main/bagel/data_sources)

Thanks to [Together AI](https://www.together.ai) for providing everyone with free credits, which I used to convert a multiple-choice dataset into an explanations format.

Thanks to all the dataset authors mentioned in the datasets section.

Thanks to the [axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) project for the training framework used to make this model.

[<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)

If you would like to support me:

[☕ Buy Me a Coffee](https://www.buymeacoffee.com/weyaxi)