Upload 3 files

Files changed (3)

README.md +102 -1
adapter_config.json +26 -0
adapter_model.safetensors +3 -0

README.md CHANGED

@@ -1,3 +1,104 @@
 ---
-license: apache-2.0
 ---

 ---
+library_name: peft
+base_model: mistralai/Mistral-7B-v0.1
+datasets:
+- ise-uiuc/Magicoder-OSS-Instruct-75K
 ---
+# Model Card for Model ID
+Trained with [Ludwig.ai](https://ludwig.ai) and [Predibase](https://predibase.com)!
+Given a programming problem and a target language, generate a solution.
+Try it in [LoRAX](https://github.com/predibase/lorax):
+```python
+from lorax import Client
+client = Client("http://<your_endpoint>")
+problem = "<your programming problem>"
+lang = "<your programming language>"
+prompt = f"""
+Below is a programming problem, paired with a language in which the solution
+should be written. Write a solution in the provided that appropriately
+solves the programming problem.
+### Problem: {problem}
+### Language: {lang}
+### Solution:
+"""
+adapter_id = "tgaddair/mistral-7b-magicoder-lora-r8"
+resp = client.generate(prompt, max_new_tokens=64, adapter_id=adapter_id)
+print(resp.generated_text)
+```
+## Model Details
+### Model Description
+Ludwig config (v0.9.3):
+```yaml
+model_type: llm
+input_features:
+  - name: prompt
+    type: text
+    preprocessing:
+      max_sequence_length: null
+    column: prompt
+output_features:
+  - name: solution
+    type: text
+    preprocessing:
+      max_sequence_length: null
+    column: solution
+prompt:
+  template: >-
+    Below is a programming problem, paired with a language in which the solution
+    should be written. Write a solution in the provided that appropriately
+    solves the programming problem.
+    ### Problem: {problem}
+    ### Language: {lang}
+    ### Solution:
+preprocessing:
+  split:
+    type: fixed
+    column: split
+  global_max_sequence_length: 2048
+adapter:
+  type: lora
+generation:
+  max_new_tokens: 64
+trainer:
+  type: finetune
+  epochs: 1
+  optimizer:
+    type: paged_adam
+  batch_size: 1
+  eval_steps: 100
+  learning_rate: 0.0002
+  eval_batch_size: 2
+  steps_per_checkpoint: 1000
+  learning_rate_scheduler:
+    decay: cosine
+    warmup_fraction: 0.03
+  gradient_accumulation_steps: 16
+  enable_gradient_checkpointing: true
+base_model: mistralai/Mistral-7B-v0.1
+quantization:
+  bits: 4
+```
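
Review note on the trainer settings above: with `batch_size: 1` and `gradient_accumulation_steps: 16`, each optimizer update sees 16 examples per device. The sketch below works through that arithmetic and estimates the warmup length. `num_train_examples` is a hypothetical value (the diff does not state the size of the training split). It also assumes `warmup_fraction` applies to the total number of optimizer steps, which the source does not confirm.

```python
# Values copied from the trainer section of the Ludwig config above.
batch_size = 1
gradient_accumulation_steps = 16
warmup_fraction = 0.03
epochs = 1

# Hypothetical: size of the training split, not stated in the source.
num_train_examples = 70_000

# Examples contributing to one optimizer update (per device).
effective_batch_size = batch_size * gradient_accumulation_steps

# Assumption: one optimizer step per accumulated batch, partial batch dropped.
optimizer_steps = (num_train_examples // effective_batch_size) * epochs

# Assumption: warmup covers a fraction of the total optimizer steps.
warmup_steps = round(optimizer_steps * warmup_fraction)

print(f"effective batch size: {effective_batch_size}")
print(f"optimizer steps:      {optimizer_steps}")
print(f"warmup steps:         {warmup_steps}")
```

Substituting the real split size changes the step counts but not the effective batch size of 16.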

adapter_config.json ADDED

@@ -0,0 +1,26 @@

+{
+  "alpha_pattern": {},
+  "auto_mapping": null,
+  "base_model_name_or_path": "mistralai/Mistral-7B-v0.1",
+  "bias": "none",
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 16,
+  "lora_dropout": 0.05,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 8,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "v_proj",
+    "q_proj"
+  ],
+  "task_type": "CAUSAL_LM"
+}
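
Review note on the adapter config above: `r`, `lora_alpha` and `target_modules` determine both the LoRA scaling factor and the number of trainable parameters. The following is a rough sketch of that calculation. The model dimensions (`hidden_size`, `num_layers`, `num_kv_heads`, `head_dim`) are assumptions taken from the published Mistral-7B-v0.1 architecture, not from this diff, and the scaling uses the standard `lora_alpha / r` rule.

```python
# Values copied from adapter_config.json above.
lora_alpha = 16
r = 8
target_modules = ["v_proj", "q_proj"]

# Assumed Mistral-7B-v0.1 dimensions (not part of this diff).
hidden_size = 4096
num_layers = 32
num_kv_heads = 8   # grouped-query attention: fewer key/value heads than query heads
head_dim = 128

# Output width of each targeted projection; both take hidden_size as input.
out_features = {
    "q_proj": hidden_size,
    "v_proj": num_kv_heads * head_dim,
}

# Standard LoRA scaling applied to the low-rank update B @ A.
scaling = lora_alpha / r


def lora_param_count(in_features: int, out_features: int, rank: int) -> int:
    """Parameters in A (rank x in_features) plus B (out_features x rank)."""
    return rank * in_features + out_features * rank


per_layer = sum(
    lora_param_count(hidden_size, out_features[name], r) for name in target_modules
)
total_params = per_layer * num_layers

print(f"scaling: {scaling}")
print(f"LoRA parameters per layer: {per_layer:,}")
print(f"LoRA parameters in total:  {total_params:,}")
```

Under these assumptions the adapter holds about 3.4M parameters, a small fraction of the base model's roughly 7B.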

adapter_model.safetensors ADDED

@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fe11768d3f772a05274cbeee8dcc0ce8fe80f20d9cecd46373077365ef12d852
+size 13648432
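
Review note on the LFS pointer above: the three added lines are a Git LFS pointer, not the weights themselves, and the `size` field gives the real file size in bytes. The sketch below parses the pointer and compares that size with a rough estimate of the tensor payload. The parameter count (3,407,872) and the 4-byte float32 storage are assumptions: the count follows from rank-8 LoRA on `q_proj` and `v_proj` across 32 layers with the published Mistral-7B-v0.1 dimensions, and the dtype is not stated anywhere in the diff.

```python
# Pointer contents copied from the diff above (without the leading "+").
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:fe11768d3f772a05274cbeee8dcc0ce8fe80f20d9cecd46373077365ef12d852
size 13648432
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each non-empty 'key value' line of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


pointer = parse_lfs_pointer(pointer_text)
size_bytes = int(pointer["size"])
hash_algo, _, digest = pointer["oid"].partition(":")

# Assumptions, not stated in the diff: parameter count and float32 storage.
assumed_param_count = 3_407_872
assumed_bytes_per_param = 4

payload_bytes = assumed_param_count * assumed_bytes_per_param
remaining_bytes = size_bytes - payload_bytes  # would be the safetensors header

print(f"hash algorithm: {hash_algo}, digest length: {len(digest)} hex chars")
print(f"file size:      {size_bytes:,} bytes")
print(f"tensor payload: {payload_bytes:,} bytes (assumed)")
print(f"remainder:      {remaining_bytes:,} bytes")
```

A remainder of a few kilobytes is consistent with float32 adapter weights plus a safetensors metadata header, though the pointer alone cannot confirm the dtype.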