add model

Files changed (7) hide show

README.md ADDED Viewed

+---
+license: apache-2.0
+tags:
+- trl
+- transformers
+- reinforcement-learning
+---
+# TRL Model
+This is a [TRL language model](https://github.com/lvwerra/trl) that has been fine-tuned with reinforcement learning to
+ guide the model outputs according to a value, function, or human feedback. The model can be used for text generation.
+## Usage
+To use this model for inference, first install the TRL library:
+```bash
+python -m pip install trl
+```
+You can then generate text as follows:
+```python
+from transformers import pipeline
+generator = pipeline("text-generation", model="lvwerra/runs_truncate/step_350")
+outputs = generator("Hello, my llama is cute")
+```
+If you want to use the model for training or to obtain the outputs from the value head, load the model as follows:
+```python
+from transformers import AutoTokenizer
+from trl import AutoModelForCausalLMWithValueHead
+tokenizer = AutoTokenizer.from_pretrained("lvwerra/runs_truncate/step_350")
+model = AutoModelForCausalLMWithValueHead.from_pretrained("lvwerra/runs_truncate/step_350")
+inputs = tokenizer("Hello, my llama is cute", return_tensors="pt")
+outputs = model(**inputs, labels=inputs["input_ids"])
+```

adapter_config.json ADDED Viewed

+{
+  "base_model_name_or_path": "trl-lib/llama-se-merged",
+  "bias": "none",
+  "enable_lora": null,
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "lora_alpha": 32,
+  "lora_dropout": 0.05,
+  "merge_weights": false,
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 16,
+  "target_modules": [
+    "q_proj",
+    "v_proj"
+  ],
+  "task_type": "CAUSAL_LM"
+}

adapter_model.bin ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:e90eae24a9d0992ff9271c648d5a2a07ba2b4f23877da1a2ae70f4635afcc1dd
+size 33600461

pytorch_model.bin ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:84ef5d11766acdf820ca5f23eb77373b3091c105845d69b8821af40581ca9c69
+size 17471

special_tokens_map.json ADDED Viewed

+{
+  "bos_token": "</s>",
+  "eos_token": "</s>",
+  "pad_token": "[PAD]",
+  "unk_token": "</s>"
+}

tokenizer.model ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347
+size 499723

tokenizer_config.json ADDED Viewed

+{
+  "bos_token": "",
+  "eos_token": "",
+  "model_max_length": 1000000000000000019884624838656,
+  "special_tokens_map_file": "/home/sgugger/tmp/llama/llama-7b-tmp/tokenizer/special_tokens_map.json",
+  "tokenizer_class": "LlamaTokenizer",
+  "unk_token": ""
+}