SeeSee21 commited on
Commit
de7688d
·
verified ·
1 Parent(s): cf7785f

Add README

Browse files
Files changed (1) hide show
  1. README.md +520 -0
README.md ADDED
@@ -0,0 +1,520 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ tags:
6
+ - text-to-image
7
+ - image-generation
8
+ - diffusion
9
+ - anime
10
+ - z-image
11
+ - z-anime
12
+ - comfyui
13
+ - gguf
14
+ - fp8
15
+ - bf16
16
+ - aio
17
+ pipeline_tag: text-to-image
18
+ library_name: diffusers
19
+ base_model:
20
+ - Tongyi-MAI/Z-Image
21
+ base_model_relation: finetune
22
+ ---
23
+
24
+ # 🎌 Z-Anime | Full Anime Fine-Tune on Z-Image Base
25
+
26
+ <div align="center">
27
+ <img src="images/cover.png" width="380" alt="Z-Anime" />
28
+ </div>
29
+
30
+ <div align="center">
31
+
32
+ **Full Fine-Tune • Rich Aesthetics • Strong Diversity • Full Negative Prompt Support**
33
+ **BF16 & FP8 & GGUF & AIO • Natural Language Prompts • 8GB VRAM**
34
+
35
+ </div>
36
+
37
+ ---
38
+
39
+ ## 🖼️ Preview Gallery
40
+
41
+ <table>
42
+ <tr>
43
+ <td><img src="images/1.png" alt="Z-Anime preview 1" /></td>
44
+ <td><img src="images/2.png" alt="Z-Anime preview 2" /></td>
45
+ <td><img src="images/3.png" alt="Z-Anime preview 3" /></td>
46
+ </tr>
47
+ <tr>
48
+ <td><img src="images/4.png" alt="Z-Anime preview 4" /></td>
49
+ <td><img src="images/5.png" alt="Z-Anime preview 5" /></td>
50
+ <td><img src="images/6.png" alt="Z-Anime preview 6" /></td>
51
+ </tr>
52
+ <tr>
53
+ <td><img src="images/7.png" alt="Z-Anime preview 7" /></td>
54
+ <td><img src="images/8.png" alt="Z-Anime preview 8" /></td>
55
+ <td><img src="images/9.png" alt="Z-Anime preview 9" /></td>
56
+ </tr>
57
+ </table>
58
+
59
+ ---
60
+
61
+ ## ✨ What is Z-Anime?
62
+
63
+ **Z-Anime** is a full fine-tune of Alibaba's **Z-Image Base** architecture — **not a LoRA merge**, but a fully trained anime-focused model family built from the ground up.
64
+
65
+ Built on the **S3-DiT (Single-Stream Diffusion Transformer, 6B parameters)**, Z-Anime inherits the strong foundation of Z-Image Base: rich diversity, strong controllability, full negative prompt support, and a high ceiling for fine-tuning — now adapted for anime-style generation.
66
+
67
+ This repository contains the full **Z-Anime family**:
68
+
69
+ | Variant | Focus | Best For |
70
+ |---|---|---|
71
+ | 🎌 **Z-Anime Base** | Highest quality | Final renders, full control |
72
+ | ⚡ **Z-Anime Distill-8-Step** | Speed + quality balance | Everyday generation |
73
+ | 🚀 **Z-Anime Distill-4-Step** | Maximum speed | Fast iteration, batches |
74
+ | 📦 **GGUF Variants** | Lower memory usage | Low VRAM / CPU / AMD-friendly workflows |
75
+ | 📦 **AIO Variants** | Single-file convenience | Easy ComfyUI setup |
76
+ | 🐍 **Diffusers Folder** | `from_pretrained()` ready | Python pipelines, further fine-tuning |
77
+
78
+ ---
79
+
80
+ ## 🎯 Key Features
81
+
82
+ - ✅ Full fine-tune on Z-Image Base — **not** a LoRA merge
83
+ - ✅ Rich anime aesthetics with strong style diversity
84
+ - ✅ Natural language prompting — works best with descriptive prompts, not tag lists
85
+ - ✅ High diversity across characters, poses, compositions, and layouts
86
+ - ✅ LoRA training ready — strong base for further fine-tuning
87
+ - ✅ Partially NSFW capable
88
+ - ✅ 8GB VRAM compatible
89
+ - ✅ GGUF variants available
90
+ - ✅ AIO variants available (Base, 4-Step, 8-Step)
91
+
92
+ ---
93
+
94
+ ## 🗺️ Z-Anime Roadmap
95
+
96
+ ### ✅ Released
97
+
98
+ #### 🎌 Z-Anime Base
99
+ Full fine-tune on Z-Image Base — **BF16 & FP8**
100
+
101
+ #### ⚡ Z-Anime Distill-8-Step
102
+ **BF16 & FP8** — fast anime generation in **8 steps**, **CFG 1.0**
103
+
104
+ #### 🚀 Z-Anime Distill-4-Step
105
+ **BF16 & FP8** — ultra-fast anime generation in **4 steps**, **CFG 1.0**
106
+
107
+ #### 📦 GGUF Variants
108
+ Available for **low VRAM**, **CPU inference**, and **AMD-friendly** workflows.
109
+
110
+ - **Z-Anime-Base-Q8_0** — Q8_0 quantization (**~6.73 GB**)
111
+ - **Z-Anime-Base-Q4_K_S** — Q4_K_S quantization (**~4.2 GB**)
112
+
113
+ #### 📦 AIO Variants
114
+ All-in-one checkpoints with **image model + VAE + Text Encoder integrated** in a single file.
115
+ Available for **Base**, **Distill-4-Step** and **Distill-8-Step** — each in **BF16 & FP8**.
116
+
117
+ #### 🧩 VAE & Text Encoder
118
+ The required **VAE** (`ae.safetensors`) and **Text Encoder** (`qwen_3_4b.safetensors`) are also included in this repository for users running the standard (non-AIO) variants.
119
+
120
+ #### 🐍 Diffusers Folder
121
+ The full **Diffusers-format folder** (`diffusers/`) is included — drop-in compatible with `ZImagePipeline.from_pretrained()` for Python users who want to run inference outside ComfyUI or use Z-Anime as a starting point for further fine-tuning.
122
+
123
+ More updates coming — follow to stay notified! 🎌
124
+
125
+ ---
126
+
127
+ ## 📦 Versions Overview
128
+
129
+ ### 🟢 BF16 (~12GB)
130
+ Maximum precision. **BFloat16** format with minimal quality compromise. Best for final renders, careful work, and LoRA training.
131
+
132
+ ### 🟡 FP8 (~6GB)
133
+ Recommended for most users. Smaller files, faster downloads, and excellent quality with only minor tradeoffs compared to BF16.
134
+
135
+ ### 🔵 GGUF
136
+ Optimized for lightweight inference setups, especially useful for low VRAM, CPU inference, or alternative backends.
137
+
138
+ ### 🟣 AIO
139
+ All-in-one checkpoints with **image model + Text Encoder + VAE integrated** into a single file for the easiest setup. Available for Base, Distill-4-Step and Distill-8-Step.
140
+
141
+ ---
142
+
143
+ ## 🎌 Z-Anime Base
144
+
145
+ The foundation of the Z-Anime family.
146
+
147
+ A full fine-tune with the **highest quality ceiling**, the **widest creative range**, and **full negative prompt support**.
148
+
149
+ ### Recommended Settings
150
+
151
+ ```yaml
152
+ steps: 28-50
153
+ cfg: 3.0-5.0 # up to 9.0 possible
154
+ sampler: euler_ancestral
155
+ scheduler: beta
156
+ negative_prompt: strongly recommended
157
+ ```
158
+
159
+ ### CFG Guide
160
+
161
+ - **3.0–5.0** → sweet spot for balanced quality and creativity
162
+ - **5.0–7.0** → tighter prompt adherence
163
+ - **7.0–9.0** → maximum control, but watch for oversaturation
164
+ - **Above 9.0** → not recommended
165
+
166
+ Negative prompts have **full effect** on Z-Anime Base and are highly recommended.
167
+
168
+ ---
169
+
170
+ ## ⚡ Z-Anime Distill-8-Step
171
+
172
+ The sweet spot of the family.
173
+
174
+ Distilled from Z-Anime Base, this version delivers strong anime results in just **8 steps** while keeping most of the quality.
175
+
176
+ ### Recommended Settings
177
+
178
+ ```yaml
179
+ steps: 8
180
+ cfg: 1.0 # max ~1.5
181
+ sampler: euler_ancestral
182
+ scheduler: beta
183
+ negative_prompt: limited effect
184
+ ```
185
+
186
+ ### CFG Guide
187
+
188
+ - Best at **CFG 1.0**
189
+ - Small increases to **1.3–1.5** are possible
190
+ - Do **not** go above **1.5** — artifacts may appear
191
+
192
+ Negative prompts have only **limited effect** at this distillation level. If your workflow includes **ConditioningZeroOut**, prefer that instead of a large negative prompt.
193
+
194
+ ---
195
+
196
+ ## 🚀 Z-Anime Distill-4-Step
197
+
198
+ The fastest Z-Anime variant.
199
+
200
+ Built for **maximum throughput** — ideal for rapid prototyping, quick batch generation, and speed-focused workflows.
201
+
202
+ ### Recommended Settings
203
+
204
+ ```yaml
205
+ steps: 4
206
+ cfg: 1.0 # max ~1.5
207
+ sampler: euler_ancestral
208
+ scheduler: beta
209
+ negative_prompt: limited effect
210
+ ```
211
+
212
+ ### Tips for 4-Step
213
+
214
+ - Stay at **CFG 1.0** for the most stable results
215
+ - Put the most important visual details **early** in the prompt
216
+ - An optional upscaler such as hires fix or SeedVR2 can help recover fine detail
217
+
218
+ ---
219
+
220
+ ## 📐 Resolution Guide
221
+
222
+ | Use Case | Resolution |
223
+ |---|---|
224
+ | Portrait / character art | **832 × 1216** |
225
+ | Landscape / scenes / backgrounds | **1216 × 832** |
226
+ | Square / general purpose | **1024 × 1024** |
227
+ | Tall / full body / wallpaper | **768 × 1344** |
228
+ | Cinematic / wide scenes | **1920 × 1088** |
229
+ | Detailed portraits | **1024 × 1536** |
230
+
231
+ **Supported range:** approximately **512 × 512 to 2048 × 2048**, any aspect ratio.
232
+ All main variants are designed to run on **8GB VRAM**.
233
+
234
+ ---
235
+
236
+ ## 💡 Prompting Guide
237
+
238
+ **Natural language works best — not tag lists.**
239
+
240
+ ### ✅ Good
241
+
242
+ ```text
243
+ A young anime girl with long silver hair and golden eyes, wearing a traditional shrine maiden outfit with white haori and red hakama. She stands in a sunlit bamboo forest, cherry blossoms falling softly around her. Warm afternoon light filtering through the trees, detailed fabric shading, expressive face, calm serene expression, high quality anime illustration with fine line work.
244
+ ```
245
+
246
+ ### ❌ Avoid
247
+
248
+ ```text
249
+ anime girl, silver hair, shrine maiden, bamboo, cherry blossom, warm light
250
+ ```
251
+
252
+ ### Character Portraits
253
+
254
+ ```text
255
+ Detailed anime portrait of [character], soft rim lighting, expressive eyes with detailed reflections, fine hair strands, clean linework, professional anime illustration quality.
256
+ ```
257
+
258
+ ### Action Scenes
259
+
260
+ ```text
261
+ Dynamic anime [scene], dramatic angle, motion energy, speed lines, particle effects, cinematic composition, detailed shading, high quality anime art.
262
+ ```
263
+
264
+ ### Backgrounds & Landscapes
265
+
266
+ ```text
267
+ Anime [location] at [time of day], [lighting], [atmosphere], beautiful background art, wallpaper quality, highly detailed environment.
268
+ ```
269
+
270
+ ---
271
+
272
+ ## 🔧 Installation
273
+
274
+ ### Step 1 — Download the version you want
275
+
276
+ Choose between:
277
+
278
+ - **Standard / Distill models** in **BF16** or **FP8** (+ VAE + Text Encoder)
279
+ - **GGUF variants** for low VRAM / CPU / AMD-friendly inference (+ VAE + Text Encoder)
280
+ - **AIO variants** for single-file convenience (no extra VAE / Text Encoder needed)
281
+
282
+ ### Step 2 — Place the files
283
+
284
+ #### Standard BF16 / FP8 models
285
+ ```text
286
+ ComfyUI/models/diffusion_models/
287
+ ├── z-anime-base-bf16.safetensors
288
+ ├── z-anime-base-fp8.safetensors
289
+ ├── z-anime-distill-8step-bf16.safetensors
290
+ ├── z-anime-distill-8step-fp8.safetensors
291
+ ├── z-anime-distill-4step-bf16.safetensors
292
+ └── z-anime-distill-4step-fp8.safetensors
293
+ ```
294
+
295
+ #### GGUF variants
296
+ ```text
297
+ ComfyUI/models/unet/
298
+ ├── z-anime-base-q8_0.gguf
299
+ └── z-anime-base-q4_k_s.gguf
300
+ ```
301
+
302
+ #### Text Encoder
303
+ Two text encoders are included — pick **one**:
304
+
305
+ ```text
306
+ ComfyUI/models/clip/
307
+ └── qwen_3_4b-bf16.safetensors # default (Z-Image standard, BF16)
308
+ or
309
+ └── qwen_3_4b-fp8.safetensors # default (Z-Image standard, FP8)
310
+ or
311
+ └── qwen_3_4b-engineer-v4-bf16.safetensors # alternative (Engineer V4, BF16)
312
+ or
313
+ └── qwen_3_4b-engineer-v4-fp8.safetensors # alternative (Engineer V4, FP8)
314
+ ```
315
+
316
+ - **Default (`qwen_3_4b-*`)** — the standard Z-Image text encoder, repackaged as a single `.safetensors` file (BF16 + FP8). This is what the model was trained against.
317
+ - **Engineer V4 (`qwen_3_4b-engineer-v4-*`)** — an alternative full fine-tune of the Z-Image text encoder by **BennyDaBall**, drop-in compatible. Often produces more varied outputs from the same seed. See *Credits* below for the original repo.
318
+
319
+ #### VAE
320
+ ```text
321
+ ComfyUI/models/vae/
322
+ └── ae.safetensors
323
+ ```
324
+
325
+ #### AIO variants
326
+ For the AIO versions, you only need the single checkpoint file — no extra VAE or Text Encoder required:
327
+
328
+ ```text
329
+ ComfyUI/models/checkpoints/
330
+ ├── z-anime-base-aio-bf16.safetensors
331
+ ├── z-anime-base-aio-fp8.safetensors
332
+ ├── z-anime-distill-8step-aio-bf16.safetensors
333
+ ├── z-anime-distill-8step-aio-fp8.safetensors
334
+ ├── z-anime-distill-4step-aio-bf16.safetensors
335
+ └── z-anime-distill-4step-aio-fp8.safetensors
336
+ ```
337
+
338
+ ### Step 3 — Load in ComfyUI
339
+
340
+ #### For standard BF16 / FP8 versions
341
+ Use:
342
+
343
+ - **Load Diffusion Model** for the model file
344
+ - **CLIP Loader** for the text encoder
345
+ - **VAE Loader** for the VAE
346
+
347
+ #### For GGUF versions
348
+ - Load the **GGUF model from the `models/unet/` folder**
349
+ - Use the same **CLIP** and **VAE** files as above
350
+
351
+ #### For AIO versions
352
+ Use a standard **Checkpoint Loader** — no extra CLIP or VAE loading required.
353
+
354
+ ---
355
+
356
+ ## 📦 Custom Nodes
357
+
358
+ - **rgthree-comfy**
359
+ - **ComfyUI-Lora-Manager**
360
+ - **ComfyUI-GGUF** *(only for the GGUF variants)*
361
+ - **ComfyUI-SeedVR2_VideoUpscaler** *(optional, only for SeedVR2 upscale)*
362
+
363
+ ---
364
+
365
+ ## 🐍 Using the Diffusers Folder
366
+
367
+ For Python users, the full Diffusers-format folder is included under `diffusers/` and can be loaded directly with the `subfolder` argument:
368
+
369
+ ```python
370
+ import torch
371
+ from diffusers import ZImagePipeline
372
+
373
+ pipe = ZImagePipeline.from_pretrained(
374
+ "SeeSee21/Z-Anime",
375
+ subfolder="diffusers",
376
+ torch_dtype=torch.bfloat16,
377
+ ).to("cuda")
378
+
379
+ image = pipe(
380
+ prompt="A young anime girl with long silver hair and golden eyes, "
381
+ "shrine maiden outfit, sunlit bamboo forest, cherry blossoms, "
382
+ "professional anime illustration, fine line work.",
383
+ num_inference_steps=40,
384
+ guidance_scale=4.0,
385
+ ).images[0]
386
+
387
+ image.save("z-anime-output.png")
388
+ ```
389
+
390
+ This format is also a clean starting point for further fine-tuning (LoRA or full fine-tune) with frameworks like **OneTrainer**, **diffusers**, or **kohya-ss**.
391
+
392
+ ---
393
+
394
+ ## 🧩 Official Workflow
395
+
396
+ <div align="center">
397
+ <img src="images/workflow-cover.png" width="380" alt="Z-Anime Workflow" />
398
+ </div>
399
+
400
+ A ready-to-use ComfyUI workflow that supports **all variants** (Base / Distill-8 / Distill-4, BF16 / FP8 / GGUF / AIO) is included in [`workflows/Z-Anime-Workflow-v1.json`](workflows/Z-Anime-Workflow-v1.json).
401
+
402
+ It includes:
403
+ - 📦 Model switch (Diffusion / GGUF / AIO loaders — toggle one at a time)
404
+ - 📖 Optional LoRA loader
405
+ - ✍️ Positive + Negative prompt nodes (with default anime negative)
406
+ - 📐 Resolution presets
407
+ - 🎨 Generate + 🔼 Optional 1.5× upscale with side-by-side compare
408
+ - 📚 Built-in MarkdownNote guide with settings per variant
409
+
410
+ <div align="center">
411
+ <img src="images/workflow-overview.png" alt="Z-Anime Workflow overview" />
412
+ </div>
413
+
414
+ ---
415
+
416
+ ## 📁 Repository Structure
417
+
418
+ ```text
419
+ Z-Anime/
420
+ ├── README.md
421
+ ├── config.json
422
+
423
+ ├── diffusion_models/
424
+ │ ├── z-anime-base-bf16.safetensors
425
+ │ ├── z-anime-base-fp8.safetensors
426
+ │ ├── z-anime-distill-8step-bf16.safetensors
427
+ │ ├── z-anime-distill-8step-fp8.safetensors
428
+ │ ├── z-anime-distill-4step-bf16.safetensors
429
+ │ └── z-anime-distill-4step-fp8.safetensors
430
+
431
+ ├── gguf/
432
+ │ ├── z-anime-base-q8_0.gguf
433
+ │ └── z-anime-base-q4_k_s.gguf
434
+
435
+ ├── aio/
436
+ │ ├── z-anime-base-aio-bf16.safetensors
437
+ │ ├── z-anime-base-aio-fp8.safetensors
438
+ │ ├── z-anime-distill-8step-aio-bf16.safetensors
439
+ │ ├── z-anime-distill-8step-aio-fp8.safetensors
440
+ │ ├── z-anime-distill-4step-aio-bf16.safetensors
441
+ │ └── z-anime-distill-4step-aio-fp8.safetensors
442
+
443
+ ├── text_encoder/
444
+ │ ├── qwen_3_4b-bf16.safetensors # default
445
+ │ ├── qwen_3_4b-fp8.safetensors # default
446
+ │ ├── qwen_3_4b-engineer-v4-bf16.safetensors # alternative (BennyDaBall)
447
+ │ └── qwen_3_4b-engineer-v4-fp8.safetensors # alternative (BennyDaBall)
448
+
449
+ ├── vae/
450
+ │ └── ae.safetensors
451
+
452
+ ├── diffusers/
453
+ │ ├── model_index.json
454
+ │ ├── scheduler/
455
+ │ ├── tokenizer/
456
+ │ ├── text_encoder/
457
+ │ ├── transformer/ (sharded safetensors + index)
458
+ │ └── vae/
459
+
460
+ ├── images/
461
+ │ ├── cover.png
462
+ │ ├── workflow-cover.png
463
+ │ ├── workflow-overview.png
464
+ │ ├── 1.png
465
+ │ ├── 2.png
466
+ │ ├── 3.png
467
+ │ ├── 4.png
468
+ │ ├── 5.png
469
+ │ ├── 6.png
470
+ │ ├── 7.png
471
+ │ ├── 8.png
472
+ │ └── 9.png
473
+ └── workflows/
474
+ └── Z-Anime-Workflow-v1.json
475
+ ```
476
+
477
+ ---
478
+
479
+ ## 📈 Version History
480
+
481
+ ### v1.0 — Initial Release
482
+
483
+ - **Z-Anime Base** released in **BF16 & FP8**
484
+ - **Z-Anime Distill-8-Step** released in **BF16 & FP8**
485
+ - **Z-Anime Distill-4-Step** released in **BF16 & FP8**
486
+ - **GGUF variants added**
487
+ - **Z-Anime-Base-Q8_0** — Q8_0 quantization (~6.73 GB)
488
+ - **Z-Anime-Base-Q4_K_S** — Q4_K_S quantization (~4.2 GB)
489
+ - **AIO variants added** — Base, Distill-4-Step and Distill-8-Step (each in BF16 & FP8)
490
+ - **VAE** (`ae.safetensors`) and **Text Encoder** (`qwen_3_4b.safetensors`) included
491
+ - Optimized for **euler_ancestral**, **euler + beta**, and simple practical use across the family
492
+
493
+ ---
494
+
495
+ ## 🔗 Links
496
+
497
+ - **CivitAI Page:** [civitai.red/models/2483351](https://civitai.red/models/2483351)
498
+ - **Base Model:** [Tongyi-MAI/Z-Image](https://huggingface.co/Tongyi-MAI/Z-Image)
499
+ - **Author:** [SeeSee21 on Hugging Face](https://huggingface.co/SeeSee21)
500
+
501
+ ---
502
+
503
+ ## 🙏 Credits
504
+
505
+ - **Base Architecture:** Tongyi Lab (Alibaba) — Z-Image
506
+ - **Fine-Tune:** SeeSee21
507
+ - **License:** Apache 2.0
508
+ - **Architecture:** S3-DiT (Single-Stream Diffusion Transformer, 6B parameters)
509
+ - **Base Model:** [`Tongyi-MAI/Z-Image`](https://huggingface.co/Tongyi-MAI/Z-Image)
510
+ - **Engineer V4 Text Encoder:** [`BennyDaBall/Qwen3-4b-Z-Image-Engineer-V4`](https://huggingface.co/BennyDaBall/Qwen3-4b-Z-Image-Engineer-V4) — full fine-tune with SMART training, included as alternative text encoder
511
+
512
+ ---
513
+
514
+ ## ❤️ Notes
515
+
516
+ Z-Anime is an experimental anime-focused model family built to explore what a full fine-tune on Z-Image Base can achieve in this space.
517
+
518
+ It is already strong for anime aesthetics, character work, and fast iteration, and future versions will continue to improve diversity, character handling, prompting flexibility, and overall quality.
519
+
520
+ **Z-Anime — anime at its finest, powered by Z-Image Base. 🎌**