shahules786 committed on
Commit 21edae8
1 Parent(s): b7d3b70

Update README.md

Files changed (1): README.md +30 -3
README.md CHANGED
@@ -1,3 +1,30 @@
- ---
- license: apache-2.0
- ---
+ ## Training details
+ - Datasets used: explanation-style datasets from psmathur/WizardLM_Orca and Dahoas/cot_gsm8k
+ - Techniques: fp16 precision training + LoRA + DeepSpeed (see the configuration sketch after this list)
+ - Machine: 2 × V100 (16GB)
+
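A minimal sketch of what an fp16 + LoRA + DeepSpeed setup like the one listed above might look like with `peft` and `transformers`; the base checkpoint name, LoRA hyperparameters, batch size, and DeepSpeed config path are all illustrative assumptions, not values from this commit:

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, TrainingArguments

# Hypothetical base checkpoint; the actual one is recorded in adapter_config.json.
model = AutoModelForCausalLM.from_pretrained("base-model-name")

# Wrap the base model with LoRA adapters (rank, alpha, and target modules are assumed values).
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# fp16 mixed precision plus a DeepSpeed config is what lets a 3B model train on 16GB V100s.
training_args = TrainingArguments(
    output_dir="checkpoints",
    per_device_train_batch_size=4,  # assumed batch size
    fp16=True,                      # fp16 precision training
    deepspeed="ds_config.json",     # path to a DeepSpeed config (assumed)
)
```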
+ ## Inference
+
+ ```python
+ import json
+
+ from peft import PeftModel
+ from huggingface_hub import hf_hub_download
+ from transformers import LlamaTokenizer, LlamaForCausalLM
+
+ model_name = "shahules786/Redpajama-3B-orcastyle"
+
+ # Download the adapter config to find the base model the LoRA weights sit on.
+ config_path = hf_hub_download(repo_id=model_name, filename="adapter_config.json", local_dir=".")
+ config = json.load(open(config_path))
+ base_model = config["base_model_name_or_path"]
+
+ tokenizer = LlamaTokenizer.from_pretrained(model_name)
+ model = LlamaForCausalLM.from_pretrained(base_model)
+ model.resize_token_embeddings(len(tokenizer))  # match vocab size to tokens added during fine-tuning
+
+ # Attach the LoRA adapter and switch to eval mode.
+ model = PeftModel.from_pretrained(model, model_name).eval()
+ tokenizer.padding_side = "left"
+
+ inputs = tokenizer("This is a sample run", return_tensors="pt")
+ outputs = model.generate(**inputs)
+ ```
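`model.generate` returns token ids rather than text; a short follow-up to the snippet above, where `skip_special_tokens` is an assumed (but common) choice:

```python
# Decode the first (and only) sequence in the batch back into text.
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```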
+
+ Check out the training and inference code [here](https://github.com/explodinggradients/Funtuner/tree/main/funtuner).