khanon commited on
Commit
3e163ff
1 Parent(s): 4e17b4a

updates Kazusa tagging methodology dog

Browse files
Files changed (1) hide show
  1. kazusa/tagging methodology.md +60 -27
kazusa/tagging methodology.md CHANGED
@@ -1,14 +1,37 @@
1
- Tagging methodology for Kazusa (blue archive)
2
 
3
- Start with WD1.4 Swinv2 at 0.25 confidence.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4
 
5
  - Tag unique features
6
  - `halo` / `demon horns` / `low wings`
7
  - Remove when not present or out of view. WD1.4 likes putting `halo` even on images where no halo is visible.
8
- - Kazusa: `halo` / `animal ears`
9
- - Pruned `extra ears` as it seems redundant.
10
  - Tag outfit variants with a single master tag
11
- - Kazusa:
12
  - Uniform: `school uniform` / `black jacket`
13
  - Sometimes the jacket appears without anything else, which was not tagged `school uniform`
14
  - Non-canon costumes
@@ -16,24 +39,26 @@ Start with WD1.4 Swinv2 at 0.25 confidence.
16
  - Nudity (WD1.4 usually does this accurately)
17
  - `nude` / `completely nude`
18
  - Prune eye colors
19
- - Keep tags which describe unusual eye features (`multicolored eyes`, `heterochromia`, `slit pupils`) as they can otherwise be too subtle and inconsistently drawn for the AI to learn
20
  - Prune hair colors
21
  - This includes `two-toned hair`, `gradiant hair`, etc. The AI learns all of these very consistently without the tags, likely because artists tend to draw them consistently
22
  - Partially prune hair styles
23
- - Leave key style tags like `twintails`, `ponytail`, `short hair with long locks`.
24
  - Prune exceedingly common tags like `bangs` / `sidelocks` / `eyebrows visible through hair` / `hair between eyes`, etc.
 
25
  - Prune length, except for images which differ from the character's usual length
26
- - Add `alternate hairstyle` and/or `alternate hair length` for these
27
- - Kazusa: `short hair, colored inner hair` -- while I would usually prune these, they're really her only defining hairstyle traits
 
28
  - Fixup hair ornaments
29
  - Prune generic `hair ornament` in favor of more specificity
30
  - `hairclip` / `black headband` / `hair flower` / `hair ribbon`, etc.
31
  - Consolidate tags that have color variants (`headband` >> `black headband`)
32
- - Kazusa: `hairclip`
33
  - Consolidate outfits
34
  - Only tag an item when it is actually visible. If it is only barely visible along the edge of an image, keep in mind it may be cropped during bucketing.
35
- - Danbooru's wiki entry for a character often provides a good list of tags commonly used to describe a character's outfits.
36
- - Kazusa outfits:
37
  - School Uniform
38
  - `black choker`
39
  - `hooded jacket`
@@ -47,42 +72,50 @@ Start with WD1.4 Swinv2 at 0.25 confidence.
47
  - `sneakers`
48
  - Fixup sleeves
49
  - ie. `long sleeves` / `puffy long sleeves` / `detached sleeves`
50
- - You only need one, but pick one and be consistent. If sleeves aren't tagged properly the tends to add them inappropriately (such as when prompting for sleeveless outfits or nudity)
51
  - Fixup collars
52
  - ie. `detached collar` / `collared shirt` / `choker` / etc.
53
- - Same deal as sleeves, they tend to appear when unwanted if not consistently tagged according to visibility in the training data.
54
  - Fixup clothing state
55
  - ie. `open jacket` / `open shirt` / `partially undressed` / `off shoulder`
56
- - The tagger is generally pretty good at this.
57
  - Tag expressions
58
- - WD1.4 rarely tags these, but doing them manually can help the AI reproduce a character's iconic expressions well.
59
  - Start by searching for images without one of these, and add them.
60
  - `open mouth`
61
  - `closed mouth`
62
  - `parted lips`
63
- - Add less common expressions
64
- - `smile` / `light smile` / `:d`
 
 
65
  - `wavy mouth` / `embarrassed`
66
- - `flustered` / `panicked` / `swirly eyes` / `@_@`
 
 
67
  - `surprised` / `o_o` / `wide-eyed`
68
  - `upset` / `annoyed` / `frustrated` / `v-shaped eyebrows`
69
  - `naughty face` / `seductive smile`
70
- - `smug` / `:3`
 
71
  - `eyes closed` / `one eye closed`
72
- - WD1.4 usually gets these for you.
73
  - Tag camera angles/composition
 
74
  - `cowboy shot`
75
  - `upper body`
76
  - `full body`
77
  - `portrait`
78
- - `cropped torso` / `cropped legs`
79
  - `feet out of frame`
 
80
  - `from side` / `from above` / `from below` / `from behind`
81
- - Tag iconic poses/actions
82
- - ie. `v` / `standing on one leg`
83
- - Kazusa
 
 
84
  - `mouth hold`
85
  - `eating`
86
- - `macaroon`
87
  - Flip through each image and use Hydrus's "related tags" feature to quickly identify important tags that might be missing.
88
- - This feature looks at other images with similar tags to provide suggestions.
 
1
+ # Tagging methodology for Kazusa (blue archive)
2
 
3
+ ## README / Intro
4
+ Since I've seen a few people share this already I'll provide this disclaimer.
5
+
6
+ This is not really intended to be a guide, it's just an log/checklist of my process, for my own benefit, since I repeat this for a lot of LoRAs and I got tired of winging it every single time. I've put only the slightest amount of effort into making it accessible to others.
7
+
8
+ I don't claim that any or all of these optimal, nor can I confidently put them forth as recommendations. They're literally just a record of the steps I follow while tagging, gradually developed after ~16 characters using some version of the below process.
9
+
10
+ Still, I can at least point to my pre-Koharu LoRAs (which used pure WD1.4 tags) and the ones that came after (where I started heavily editing tags) and see a steady progression in quality and prompting flexibility despite using mostly the same training settings for each one.
11
+
12
+ Yes, it takes forever to do all of this shit. No, I don't recommend it unless you're extremely autistic; raw WD1.4 tags are probably good enough for most people. If you intend to do this for more than a few characters, I strongly recommend learning [Hydrus](https://hydrusnetwork.github.io/hydrus/introduction.html) it makes all of this way, way less tedious compared to doing it with crappier tools.
13
+
14
+ ---
15
+
16
+ ## Prep
17
+
18
+ - Scraped `1girl kazusa_(blue_archive) order:popularity` from sancom, curated for quality, then exported from Hydrus to feed into WD1.4 Tagger.
19
+ - Kazusa has a shitload of good art so I had to be very picky to get down to 280 images, which is still a lot. In hindsight I think huge datasets aren't really a problem; they let you train for longer without overfitting.
20
+ - Gelbooru is probably fine too. Danbooru sucks for loli unless you have Gold.
21
+ - I also got a few newer images from pixiv, don't remember which ones.
22
+ - Exported final images from Hydrus to feed into WD1.4 Tagger
23
+ - Auto-tagged with WD1.4 Swinv2 at 0.25 confidence
24
+ - Reimported images+tags into Hydrus using the .txt sidecar feature. I strongly recommend putting WD1.4 tags in a separate tag domain so they aren't mixed in with shit scraped from boorus.
25
+
26
+ ## Tagging
27
 
28
  - Tag unique features
29
  - `halo` / `demon horns` / `low wings`
30
  - Remove when not present or out of view. WD1.4 likes putting `halo` even on images where no halo is visible.
31
+ - **Kazusa**: `halo` / `animal ears`
32
+ - Pruned `extra ears` as it seems redundant and intrinsic to the character.
33
  - Tag outfit variants with a single master tag
34
+ - **Kazusa**:
35
  - Uniform: `school uniform` / `black jacket`
36
  - Sometimes the jacket appears without anything else, which was not tagged `school uniform`
37
  - Non-canon costumes
 
39
  - Nudity (WD1.4 usually does this accurately)
40
  - `nude` / `completely nude`
41
  - Prune eye colors
42
+ - Keep tags which describe unusual eye features (`multicolored eyes`, `heterochromia`, `slit pupils`) as they can otherwise be too subtle and inconsistently drawn for the AI to notice
43
  - Prune hair colors
44
  - This includes `two-toned hair`, `gradiant hair`, etc. The AI learns all of these very consistently without the tags, likely because artists tend to draw them consistently
45
  - Partially prune hair styles
46
+ - Leave key, defining style tags like `twintails`, `ponytail`, `short hair with long locks`, `twin braids`, etc.
47
  - Prune exceedingly common tags like `bangs` / `sidelocks` / `eyebrows visible through hair` / `hair between eyes`, etc.
48
+ - Somewhat arbitrary, but I just don't think there's much value in them because they're ubiquitous and caption space is limited
49
  - Prune length, except for images which differ from the character's usual length
50
+ - If you don't do this, it's more likely to get the hair length wrong when not prompted, which isn't a huge deal.
51
+ - Add `alternate hairstyle` and/or `alternate hair length` on applicable images, which can be used to more easily change styles while prompting
52
+ - **Kazusa**: `short hair, colored inner hair` -- while I would usually prune these, they're really her only defining hairstyle traits
53
  - Fixup hair ornaments
54
  - Prune generic `hair ornament` in favor of more specificity
55
  - `hairclip` / `black headband` / `hair flower` / `hair ribbon`, etc.
56
  - Consolidate tags that have color variants (`headband` >> `black headband`)
57
+ - **Kazusa**: `hairclip`
58
  - Consolidate outfits
59
  - Only tag an item when it is actually visible. If it is only barely visible along the edge of an image, keep in mind it may be cropped during bucketing.
60
+ - Danbooru's wiki entry for a character often provides a good list of tags for a character's entire outfit.
61
+ - **Kazusa outfits**:
62
  - School Uniform
63
  - `black choker`
64
  - `hooded jacket`
 
72
  - `sneakers`
73
  - Fixup sleeves
74
  - ie. `long sleeves` / `puffy long sleeves` / `detached sleeves`
75
+ - You only need one, but pick one and be consistent. If sleeves aren't tagged the AI tends to add them inappropriately (such as when prompting for sleeveless outfits or nudity)
76
  - Fixup collars
77
  - ie. `detached collar` / `collared shirt` / `choker` / etc.
78
+ - Same deal as sleeves, they tend to appear when unwanted if not consistently tagged according to actual visibility
79
  - Fixup clothing state
80
  - ie. `open jacket` / `open shirt` / `partially undressed` / `off shoulder`
81
+ - The tagger is generally good at this but it can help to double-check for weird outfits
82
  - Tag expressions
83
+ - This is tedious and the autotagger doesn't help you out much, but tagging these can really help the AI nail multiple iconic expressions for a character
84
  - Start by searching for images without one of these, and add them.
85
  - `open mouth`
86
  - `closed mouth`
87
  - `parted lips`
88
+ - Sometimes applies with `open mouth`
89
+ - Then proceed through each image and add one of these
90
+ - `smile` / `light smile` / `:d` / `grin` (exposed teeth only)
91
+ - `:o` / `:<` / `expressionless` / `serious`
92
  - `wavy mouth` / `embarrassed`
93
+ - `pout` / `:t` / `tsundere`
94
+ - `nervous` / `nervous smile`
95
+ - `flustered` / `swirly eyes` / `@_@`
96
  - `surprised` / `o_o` / `wide-eyed`
97
  - `upset` / `annoyed` / `frustrated` / `v-shaped eyebrows`
98
  - `naughty face` / `seductive smile`
99
+ - `smug` / `:3` / `smirk`
100
+ - `yelling` / `frown`
101
  - `eyes closed` / `one eye closed`
102
+ - WD1.4 almost always gets these two
103
  - Tag camera angles/composition
104
+ - Most of these aren't very high value, but `from x` can be helpful.
105
  - `cowboy shot`
106
  - `upper body`
107
  - `full body`
108
  - `portrait`
 
109
  - `feet out of frame`
110
+ - `cropped torso` / `cropped legs`
111
  - `from side` / `from above` / `from below` / `from behind`
112
+ - Tag iconic poses, actions, or props
113
+ - Props need to show up often in training data for this to be worth it.
114
+ - `v` / `peace sign` / `standing on one leg`
115
+ - `holding dango` / `weapon case` / `fashion magazine`
116
+ - **Kazusa**
117
  - `mouth hold`
118
  - `eating`
119
+ - `macaron`
120
  - Flip through each image and use Hydrus's "related tags" feature to quickly identify important tags that might be missing.
121
+ - This feature looks at other images with similar tags to provide suggestions. Good for spotting things you or the tagger might have missed.