Vipitis committed
Commit 5bef32f
1 Parent(s): 8a46ae3

added training params and results

Files changed (1)
  1. README.md +46 -2
README.md CHANGED
@@ -1,15 +1,59 @@
 ---
+language:
+- code
 license: bigcode-openrail-m
 datasets:
 - bigcode/the-stack-dedup
 pipeline_tag: text-generation
 tags:
 - code
+- shader
+widget:
+- text: void mainImage( out vec4 fragColor, in vec2 fragCoord )\n{
+  example_title: mainImage
+  group: Shadertoy
+model-index:
+- name: santacoder-finetuned-the-stack-glsl
+  results:
+  - task:
+      type: text-generation
+      name: ShaderEval
+    dataset:
+      type: Vipitis/Shadertoys-fine
+      name: Shadertoys-fine
+      config: return_completion
+      revision: 0.0.2
+    metrics:
+    - type: exact_match
+      value: 0.380
+      name: 300 samples, greedy decoding
+      verified: false
 ---
 
 [Santacoder](https://huggingface.co/bigcode/santacoder) finetuned on [Shadertoys](https://huggingface.co/datasets/Vipitis/Shadertoys) for 1000 steps with a batch size of 2 and full sequence length of 2048.
-Origianl finetuning script from found [here](https://github.com/loubnabnl/santacoder-finetuning), adapted version to follow (soon^^).
+The adapted finetuning script can be found [here](./train.py).
+
+### Finetuning parameters
+```sh
+python3 train.py --model_path "bigcode/santacoder" \
+        --dataset_name "bigcode/the-stack-dedup" \
+        --subset "data/glsl" \
+        --data_column "content" \
+        --split "train" \
+        --seq_length 2048 \
+        --max_steps 1000 \
+        --batch_size 2 \
+        --gradient_accumulation_steps 4 \
+        --learning_rate 5e-5 \
+        --num_warmup_steps 100 \
+        --eval_freq 100 \
+        --save_freq 100 \
+        --log_freq 1 \
+        --output_dir "checkpoint_dir" \
+        --no_fp16
+
+```
 
-Main purpose of this model is to explore if finetuning models improves performance on [ShaderEval](https://huggingface.co/spaces/Vipitis/ShaderEval), results to follow (sooner).
+The main purpose of this model is to explore whether finetuning improves performance on [ShaderEval](https://huggingface.co/spaces/Vipitis/ShaderEval); this checkpoint reaches an exact_match of 0.380 on 300 samples with greedy decoding.
 
 License carried over from the base model; the finetuning dataset holds the same license.
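
A note on the parameters above: with `--batch_size 2` and `--gradient_accumulation_steps 4`, each optimizer step sees an effective batch of 2 × 4 = 8 sequences of 2048 tokens, so the 1000 steps cover roughly 8,000 sequences (assuming `--max_steps` counts optimizer steps, as in the Hugging Face `Trainer`).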
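
The new `widget` prompt can also be tried locally. Below is a minimal sketch with 🤗 Transformers; the repo id `Vipitis/santacoder-finetuned-the-stack-glsl` is assumed from the `model-index` name, and `trust_remote_code=True` is carried over from the base Santacoder, which ships custom modeling code.

```python
# Minimal sketch: complete the widget prompt with the finetuned checkpoint.
# The repo id below is assumed from the model-index name, not confirmed.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "Vipitis/santacoder-finetuned-the-stack-glsl"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint, trust_remote_code=True)

# The prompt from the widget: the opening of a Shadertoy entry point.
prompt = "void mainImage( out vec4 fragColor, in vec2 fragCoord )\n{"
inputs = tokenizer(prompt, return_tensors="pt")
# Greedy decoding, matching the setting reported in the model-index.
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```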
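
On the reported metric: `exact_match` only credits a generation that reproduces the reference string character for character, which makes the 0.380 on 300 greedy samples a strict score. A rough illustration with the `evaluate` library (this is not the actual ShaderEval harness, and the strings are made up):

```python
# Illustration of exact_match scoring via the `evaluate` library;
# not the ShaderEval harness itself, and the strings are hypothetical.
import evaluate

exact_match = evaluate.load("exact_match")
predictions = ["return length(p) - r;", "return vec3(1.0);"]  # model outputs
references = ["return length(p) - r;", "return vec3(0.0);"]   # ground truth
result = exact_match.compute(predictions=predictions, references=references)
print(result["exact_match"])  # one of two matches, so 0.5
```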