Nekochu
/

Luminia-13B-v4-QLora

Generated from Trainer

stable diffusion

Model card Files Files and versions Community

Nekochu commited on May 7

Commit

9402b49

•

1 Parent(s): 2cf4940

Update README.md

Files changed (1) hide show

README.md +40 -3

README.md CHANGED Viewed

@@ -1,3 +1,40 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+pretty_name: Luminia
+model_type: llama2
+tags:
+  - llama-factory
+  - lora
+  - generated_from_trainer
+  - llama2
+  - llama
+  - instruct
+  - finetune
+  - gpt4
+  - synthetic data
+  - stable diffusion
+  - alpaca
+  - llm
+datasets:
+  - Nekochu/discord-unstable-diffusion-SD-prompts
+  - Nekochu/Luminia-mixture
+  - AstraMindAI/RLAIF-Nectar
+  - hiyouga/DPO-En-Zh-20k
+---
+Training resume from [Luminia-13B-v3](https://huggingface.co/Nekochu/Luminia-13B-v3).
+<!-- [05/24] This should include all dataset from LLaMA-Factory, and more.-->
+# Luminia-v4 Lora only
+Luminia-13B-v4-QLora-sft (rank 32) barelly can handle new [Luminia-mixture](https://huggingface.co/datasets/Nekochu/Luminia-mixture)
+and [ExtendedPrompts](https://huggingface.co/datasets/Nekochu/discord-unstable-diffusion-SD-prompts) should give more flexible when ask prompt, e.g.:
+```
+### Instruction:
+Create stable diffusion prompt based on the given english description.
+### Input:
+City street, night, raining, drone shot, cyberpunk
+### Response:
+```
+And so Stage-B DPO: I do **NOT** recommend use QLora-orpo, poor lora failed to learn more**.** :<