---
license: llama2
datasets:
- Photolens/oasst1-langchain-llama-2-formatted
---
## Model Overview
Model license: Llama-2<br>
This model is trained based on [NousResearch/Llama-2-7b-chat-hf](https://huggingface.co/NousResearch/Llama-2-7b-chat-hf).
## Intended Use
The dataset used to finetune the base model is optimized for LangChain applications,<br>
so this model is intended for use as a LangChain LLM.
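As a sketch of how a prompt would be assembled for this model inside a LangChain application (the Llama-2 `[INST]`/`<<SYS>>` chat template is assumed from the base model; `build_prompt` is a hypothetical helper, not part of this repository):

```python
def build_prompt(system: str, user: str) -> str:
    # Llama-2 chat template (assumed from the base model's format;
    # verify against the oasst1-langchain-llama-2-formatted dataset).
    return f"<s>[INST] <<SYS>>\n{system}\n<</SYS>>\n\n{user} [/INST]"

prompt = build_prompt("You are a helpful assistant.", "What is LangChain?")
print(prompt)
```

The formatted string can then be passed to any LangChain LLM wrapper (for example `HuggingFacePipeline`) that serves this model.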
## Training Details
This model took `1:14:16` to train with QLoRA on a single `A100 40gb` GPU.<br>
- *epochs*: `1`
- *train batch size*: `8`
- *eval batch size*: `8`
- *gradient accumulation steps*: `1`
- *maximum gradient norm*: `0.3`
- *learning rate*: `2e-4`
- *weight decay*: `0.001`
- *optimizer*: `paged_adamw_32bit`
- *learning rate schedule*: `cosine`
- *warmup ratio (linear)*: `0.03`
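The hyperparameters above map naturally onto `transformers.TrainingArguments` keyword arguments. A minimal sketch of that mapping, assuming the run used the standard Hugging Face Trainer API (the actual training script is not published, so the argument names are an assumption):

```python
# Hypothetical reconstruction of the QLoRA run's trainer settings as
# `transformers.TrainingArguments` keyword arguments; names follow the
# HF Trainer API, values come from the Training Details list above.
training_kwargs = dict(
    num_train_epochs=1,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    gradient_accumulation_steps=1,
    max_grad_norm=0.3,
    learning_rate=2e-4,
    weight_decay=0.001,
    optim="paged_adamw_32bit",
    lr_scheduler_type="cosine",
    warmup_ratio=0.03,
)

# Effective batch size = per-device batch * accumulation steps
print(training_kwargs["per_device_train_batch_size"]
      * training_kwargs["gradient_accumulation_steps"])  # 8
```

With `gradient_accumulation_steps=1` the effective batch size equals the per-device batch size of 8.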
## Models in this series
| Model | Train time | Size (in params) | Base Model |
|---|---|---|---|
| [llama-2-7b-langchain-chat](https://huggingface.co/Photolens/llama-2-7b-langchain-chat/) | 1:14:16 | 7 billion | [NousResearch/Llama-2-7b-chat-hf](https://huggingface.co/NousResearch/Llama-2-7b-chat-hf) |
| [llama-2-13b-langchain-chat](https://huggingface.co/Photolens/llama-2-13b-langchain-chat/) | 2:50:27 | 13 billion | [TheBloke/Llama-2-13B-Chat-fp16](https://huggingface.co/TheBloke/Llama-2-13B-Chat-fp16) |
| [Photolens/OpenOrcaxOpenChat-2-13b-langchain-chat](https://huggingface.co/Photolens/OpenOrcaxOpenChat-2-13b-langchain-chat/) | 2:56:54 | 13 billion | [Open-Orca/OpenOrcaxOpenChat-Preview2-13B](https://huggingface.co/Open-Orca/OpenOrcaxOpenChat-Preview2-13B) |

Model by [Photolens/llama-2-7b-langchain-chat](https://huggingface.co/Photolens/llama-2-7b-langchain-chat), converted to GGUF format.