FoundationVision
/

LlamaGen

Model card Files Files and versions Community

ShoufaChen commited on Jun 11, 2024

Commit

7e43842

·

verified ·

1 Parent(s): 710d7a4

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -7,4 +7,6 @@ license: mit
 We introduce LlamaGen, a new family of image generation models that apply original next-token prediction paradigm of large language models to visual generation domain. It is an affirmative answer to whether vanilla autoregressive models, e.g., Llama, without inductive biases on visual signals can achieve state-of-the-art image generation performance if scaling properly. We reexamine design spaces of image tokenizers, scalability properties of image generation models, and their training data quality.
-This repo is used for hosting LlamaGen's checkpoints. For more details or tutorials see https://github.com/FoundationVision/LlamaGen

 We introduce LlamaGen, a new family of image generation models that apply original next-token prediction paradigm of large language models to visual generation domain. It is an affirmative answer to whether vanilla autoregressive models, e.g., Llama, without inductive biases on visual signals can achieve state-of-the-art image generation performance if scaling properly. We reexamine design spaces of image tokenizers, scalability properties of image generation models, and their training data quality.
+This repo is used for hosting LlamaGen's checkpoints. For more details or tutorials see https://github.com/FoundationVision/LlamaGen
+Paper:arxiv.org/abs/2406.06525