Linaqruf commited on
Commit
5c9af6f
1 Parent(s): 044e028

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +55 -3
README.md CHANGED
@@ -33,7 +33,7 @@ widget:
33
  }
34
 
35
  .title {
36
- font-size: 2vw;
37
  text-align: center;
38
  color: #333;
39
  font-family: 'Helvetica Neue', sans-serif;
@@ -127,8 +127,18 @@ widget:
127
  /* Fallback for browsers that do not support this effect */
128
  text-shadow: 2px 2px 4px rgba(0, 0, 0, 0.7);
129
  /* Enhanced text shadow for better legibility */
 
 
 
 
 
130
  }
131
-
 
 
 
 
 
132
  </style>
133
 
134
  <h1 class="title">
@@ -282,6 +292,7 @@ image = pipe(
282
  ## Usage Guidelines
283
 
284
  ### Prompt Guidelines
 
285
  Animagine XL 2.0 responds effectively to natural language descriptions for image generation. For example:
286
  ```
287
  A girl with mesmerizing blue eyes looks at the viewer. Her long, white hair is adorned with blue butterfly hair ornaments.
@@ -327,6 +338,18 @@ For higher quality outcomes, prepend prompts with:
327
  masterpiece, best quality
328
  ```
329
 
 
 
 
 
 
 
 
 
 
 
 
 
330
  ### Multi Aspect Resolution
331
 
332
  This model supports generating images at the following dimensions:
@@ -342,6 +365,8 @@ This model supports generating images at the following dimensions:
342
  | 1536 x 640 | 12:5 Horizontal |
343
  | 640 x 1536 | 5:12 Vertical |
344
 
 
 
345
 
346
  ## Training and Hyperparameters
347
 
@@ -360,6 +385,29 @@ This model supports generating images at the following dimensions:
360
 
361
  *Note: The model's training configuration is subject to future enhancements.*
362
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
363
  ## Direct Use
364
 
365
  The Animagine XL 2.0 model, with its advanced text-to-image diffusion capabilities, is highly versatile and can be applied in various fields:
@@ -390,4 +438,8 @@ We extend our gratitude to:
390
  - **Camenduru Server Community:** For invaluable insights and support.
391
  - **NovelAI:** For inspiring the Quality Tags feature.
392
  - **Waifu DIffusion Team:** for inspiring the optimal training pipeline with bigger datasets.
393
- - **Shadow Lilac:** For the image classification model ([Hugging Face - shadowlilac/aesthetic-shadow](https://huggingface.co/shadowlilac/aesthetic-shadow)) crucial in our quality assessment process.
 
 
 
 
 
33
  }
34
 
35
  .title {
36
+ font-size: 2.5em;
37
  text-align: center;
38
  color: #333;
39
  font-family: 'Helvetica Neue', sans-serif;
 
127
  /* Fallback for browsers that do not support this effect */
128
  text-shadow: 2px 2px 4px rgba(0, 0, 0, 0.7);
129
  /* Enhanced text shadow for better legibility */
130
+
131
+ .overlay-subtext {
132
+ font-size: 0.75em;
133
+ margin-top: 0.5em;
134
+ font-style: italic;
135
  }
136
+
137
+ .overlay,
138
+ .overlay-subtext {
139
+ text-shadow: 2px 2px 4px rgba(0, 0, 0, 0.5);
140
+ }
141
+
142
  </style>
143
 
144
  <h1 class="title">
 
292
  ## Usage Guidelines
293
 
294
  ### Prompt Guidelines
295
+
296
  Animagine XL 2.0 responds effectively to natural language descriptions for image generation. For example:
297
  ```
298
  A girl with mesmerizing blue eyes looks at the viewer. Her long, white hair is adorned with blue butterfly hair ornaments.
 
338
  masterpiece, best quality
339
  ```
340
 
341
+ <table class="custom-table">
342
+ <tr>
343
+ <td>
344
+ <div class="custom-image-container">
345
+ <img class="custom-image" src="https://cdn-uploads.huggingface.co/production/uploads/6365c8dbf31ef76df4042821/m6BGzrJgYTb9QrZprVAqZ.png" alt="sample1">
346
+ <div class="overlay" style="font-size: 3vw;"> Twilight Contemplation <div class="overlay-subtext" style="font-size: 0.75em; font-style: italic;">"Stelle, Amidst Shooting Stars and Mountain Silhouettes"</div>
347
+ </div>
348
+ </div>
349
+ </td>
350
+ </tr>
351
+ </table>
352
+
353
  ### Multi Aspect Resolution
354
 
355
  This model supports generating images at the following dimensions:
 
365
  | 1536 x 640 | 12:5 Horizontal |
366
  | 640 x 1536 | 5:12 Vertical |
367
 
368
+ ## Examples
369
+
370
 
371
  ## Training and Hyperparameters
372
 
 
385
 
386
  *Note: The model's training configuration is subject to future enhancements.*
387
 
388
+ ## Model Comparison (Animagine XL 1.0 vs Animagine XL 2.0)
389
+
390
+ ### Image Comparison
391
+
392
+ In the second iteration (Animagine XL 2.0), we have addressed the 'broken neck' issue prevalent in poses like "looking back" and "from behind". Now, characters are consistently "looking at viewer" by default, enhancing the naturalism and accuracy of the generated images.
393
+
394
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6365c8dbf31ef76df4042821/oSssetgmuLEV6RlaSC5Tr.png)
395
+
396
+ ### Training Config
397
+
398
+ | Configuration Item | Animagine XL 1.0 | Animagine XL 2.0 |
399
+ |-----------------------|--------------------|--------------------------|
400
+ | **GPU** | A100 40G | A100 80G |
401
+ | **Dataset** | 8000 images | 170k + 83k images |
402
+ | **Global Epochs** | Not Applicable | 20 |
403
+ | **Learning Rate** | 4e-7 | 1e-6 |
404
+ | **Batch Size** | 16 | 32 |
405
+ | **Train Text Encoder**| False | True |
406
+ | **Train Special Tags**| False | True |
407
+ | **Image Resolution** | 1024 | 1024 |
408
+ | **Bucket Resolution** | 1024 x 256 | 2048 x 512 |
409
+ | **Caption Dropout** | 0.5 | 0 |
410
+
411
  ## Direct Use
412
 
413
  The Animagine XL 2.0 model, with its advanced text-to-image diffusion capabilities, is highly versatile and can be applied in various fields:
 
438
  - **Camenduru Server Community:** For invaluable insights and support.
439
  - **NovelAI:** For inspiring the Quality Tags feature.
440
  - **Waifu DIffusion Team:** for inspiring the optimal training pipeline with bigger datasets.
441
+ - **Shadow Lilac:** For the image classification model ([shadowlilac/aesthetic-shadow](https://huggingface.co/shadowlilac/aesthetic-shadow)) crucial in our quality assessment process.
442
+
443
+ <h1 class="title">
444
+ <span>Anything you can Imagine!</span>
445
+ </h1>