Text-to-Image
English
wala
text-to-multiview
Hooman commited on
Commit
27cc6a7
·
verified ·
1 Parent(s): 8997d54

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +11 -2
README.md CHANGED
@@ -91,5 +91,14 @@ The model is based on the MVDream architecture, fine-tuned to generate four-view
91
  The model was trained on NVIDIA H100 GPUs.
92
 
93
  ## Citation
94
-
95
- [Citation information to be added after paper publication]
 
 
 
 
 
 
 
 
 
 
91
  The model was trained on NVIDIA H100 GPUs.
92
 
93
  ## Citation
94
+ ```
95
+ @misc{sanghi2024waveletlatentdiffusionwala,
96
+ title={Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings},
97
+ author={Aditya Sanghi and Aliasghar Khani and Pradyumna Reddy and Arianna Rampini and Derek Cheung and Kamal Rahimi Malekshan and Kanika Madan and Hooman Shayani},
98
+ year={2024},
99
+ eprint={2411.08017},
100
+ archivePrefix={arXiv},
101
+ primaryClass={cs.CV},
102
+ url={https://arxiv.org/abs/2411.08017},
103
+ }
104
+ ```