ashawkey's picture
draft imagedream, before merge
af95a32
|
raw
history blame
1.53 kB
# MVDream-hf
modified from https://github.com/KokeCacao/mvdream-hf.
### convert weights
```bash
# dependency
pip install -U omegaconf diffusers safetensors huggingface_hub transformers accelerate
# download original ckpt
cd models
wget https://huggingface.co/MVDream/MVDream/resolve/main/sd-v2.1-base-4view.pt
wget https://raw.githubusercontent.com/bytedance/MVDream/main/mvdream/configs/sd-v2-base.yaml
cd ..
# convert
python convert_mvdream_to_diffusers.py --checkpoint_path models/sd-v2.1-base-4view.pt --dump_path ./weights_mvdream --original_config_file models/sd-v2-base.yaml --half --to_safetensors --test
```
```bash
# download original ckpt
wget https://huggingface.co/Peng-Wang/ImageDream/resolve/main/sd-v2.1-base-4view-ipmv-local.pt
wget https://raw.githubusercontent.com/bytedance/ImageDream/main/extern/ImageDream/imagedream/configs/sd_v2_base_ipmv_local.yaml
# convert
python convert_imagedream_to_diffusers.py --checkpoint_path models/sd-v2.1-base-4view-ipmv-local.pt --dump_path ./weights_imagedream --original_config_file models/sd-v2-base_ipmv_local.yaml --half --to_safetensors --test
```
### usage
example:
```bash
python main.py "a cute owl"
```
detailed usage:
```python
import torch
import kiui
from mvdream.pipeline_mvdream import MVDreamPipeline
pipe = MVDreamPipeline.from_pretrained('./weights', torch_dtype=torch.float16)
pipe = pipe.to("cuda")
prompt = "a photo of an astronaut riding a horse on mars"
image = pipe(prompt) # np.ndarray [4, 256, 256, 3]
kiui.vis.plot_image(image)
```