pdr209 committed on
Commit
00fab53
1 Parent(s): c97d61d

Update Readme

Files changed (1)
  1. README.md +35 -31
README.md CHANGED
@@ -49,7 +49,7 @@ See the [usage instructions](#usage-example) for how to run the SDXL pipeline wi
  | A100 | 0.27 images/sec | 0.36 images/sec | ~33% |
  | H100 | 0.40 images/sec | 0.68 images/sec | ~70% |
 
- #### Timings for LCM version for 4 steps at 1024x1024
+ #### Timings for Latent Consistency Model(LCM) version for 4 steps at 1024x1024
 
  | Accelerator | CLIP | Unet | VAE |Total |
  |-------------|--------------------------|-----------------------------|------------------------|------------------------|
@@ -84,33 +84,37 @@ pip3 install -r requirements.txt
  python3 -m pip install --pre --upgrade --extra-index-url https://pypi.nvidia.com tensorrt
  ```
 
- 4. Perform TensorRT optimized inference for the sdxl
- * The first invocation produces plan files in `engine_xl_base` and `engine_xl_refiner` specific to the accelerator being run on and are reused for later invocations.
-
- ```
- python3 demo_txt2img_xl.py \
- "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" \
- --build-static-batch \
- --use-cuda-graph \
- --num-warmup-runs 1 \
- --width 1024 \
- --height 1024 \
- --denoising-steps 30 \
- --onnx-base-dir /workspace/stable-diffusion-xl-1.0-tensorrt/sdxl-1.0-base \
- --onnx-refiner-dir /workspace/stable-diffusion-xl-1.0-tensorrt/sdxl-1.0-refiner
- ```
-
- 4. Perform TensorRT optimized inference for the sdxl Latent Consistency Model (LCM) version
- * The first invocation produces plan files in --engine-dir specific to the accelerator being run on and are reused for later invocations.
- ```
- python3 demo_txt2img_xl.py \
- ""Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"" \
- --version=xl-1.0 \
- --onnx-dir /workspace/stable-diffusion-xl-1.0-tensorrt/lcm \
- --engine-dir /workspace/stable-diffusion-xl-1.0-tensorrt/lcm/engine-sdxl-lcm-nocfg \
- --scheduler LCM \
- --denoising-steps 4 \
- --guidance-scale 0.0 \
- --seed 42
-
- ```
+ 4. Perform TensorRT optimized inference:
+
+ - **SDXL**
+
+ The first invocation produces plan files in `engine_xl_base` and `engine_xl_refiner` specific to the accelerator being run on and are reused for later invocations.
+
+ ```
+ python3 demo_txt2img_xl.py \
+ "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" \
+ --build-static-batch \
+ --use-cuda-graph \
+ --num-warmup-runs 1 \
+ --width 1024 \
+ --height 1024 \
+ --denoising-steps 30 \
+ --onnx-base-dir /workspace/stable-diffusion-xl-1.0-tensorrt/sdxl-1.0-base \
+ --onnx-refiner-dir /workspace/stable-diffusion-xl-1.0-tensorrt/sdxl-1.0-refiner
+ ```
+
+ - **SDXL-LCM**
+
+ The first invocation produces plan files in --engine-dir specific to the accelerator being run on and are reused for later invocations.
+ ```
+ python3 demo_txt2img_xl.py \
+ ""Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"" \
+ --version=xl-1.0 \
+ --onnx-dir /workspace/stable-diffusion-xl-1.0-tensorrt/lcm \
+ --engine-dir /workspace/stable-diffusion-xl-1.0-tensorrt/lcm/engine-sdxl-lcm-nocfg \
+ --scheduler LCM \
+ --denoising-steps 4 \
+ --guidance-scale 0.0 \
+ --seed 42
+
+ ```
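
A note on the SDXL-LCM command carried over by this commit: its prompt is wrapped in doubled double quotes (`""…""`), while the SDXL command uses a single pair. In a POSIX shell each `""` is an empty string, so the words between them are effectively unquoted and are split on spaces. The sketch below illustrates the difference with a hypothetical helper, `count_args`, which is not part of the repository and only reports how many arguments the shell delivered. How `demo_txt2img_xl.py` treats the extra positional arguments depends on its argument parser and is not shown here.

```shell
# Hypothetical helper (not part of the repository): prints how many
# arguments the shell passed to it after quoting and word splitting.
count_args() {
  echo "$#"
}

# Doubled quotes, as written in the SDXL-LCM command: each "" is an empty
# string, so the prompt words are unquoted and split into separate arguments.
doubled_count=$(count_args ""Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"")

# One pair of quotes, as written in the SDXL command: a single argument.
paired_count=$(count_args "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k")

echo "doubled quotes:     $doubled_count argument(s)"
echo "one pair of quotes: $paired_count argument(s)"
```

If the prompt is meant to reach the script as one string, writing it with a single pair of quotes, as the SDXL command does, would be the likely fix.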