Nekochu commited on
Commit
9402b49
1 Parent(s): 2cf4940

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +40 -3
README.md CHANGED
@@ -1,3 +1,40 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ pretty_name: Luminia
4
+ model_type: llama2
5
+ tags:
6
+ - llama-factory
7
+ - lora
8
+ - generated_from_trainer
9
+ - llama2
10
+ - llama
11
+ - instruct
12
+ - finetune
13
+ - gpt4
14
+ - synthetic data
15
+ - stable diffusion
16
+ - alpaca
17
+ - llm
18
+ datasets:
19
+ - Nekochu/discord-unstable-diffusion-SD-prompts
20
+ - Nekochu/Luminia-mixture
21
+ - AstraMindAI/RLAIF-Nectar
22
+ - hiyouga/DPO-En-Zh-20k
23
+ ---
24
+ Training resume from [Luminia-13B-v3](https://huggingface.co/Nekochu/Luminia-13B-v3).
25
+ <!-- [05/24] This should include all dataset from LLaMA-Factory, and more.-->
26
+
27
+ # Luminia-v4 Lora only
28
+ Luminia-13B-v4-QLora-sft (rank 32) barelly can handle new [Luminia-mixture](https://huggingface.co/datasets/Nekochu/Luminia-mixture)
29
+ and [ExtendedPrompts](https://huggingface.co/datasets/Nekochu/discord-unstable-diffusion-SD-prompts) should give more flexible when ask prompt, e.g.:
30
+ ```
31
+ ### Instruction:
32
+ Create stable diffusion prompt based on the given english description.
33
+
34
+ ### Input:
35
+ City street, night, raining, drone shot, cyberpunk
36
+
37
+ ### Response:
38
+ ```
39
+
40
+ And so Stage-B DPO: I do **NOT** recommend use QLora-orpo, poor lora failed to learn more**.** :<