m1ngcheng nielsr HF Staff committed on
Commit d7083f3 · verified · 1 Parent(s): 88acb1f

Improve model card with paper, code, project links and pipeline tag (#3)


- Improve model card with paper, code, project links and pipeline tag (64a200fd87066bf0e54e87bd0c91518ac44c6177)


Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

Files changed (1)
  1. README.md +9 -5
README.md CHANGED
@@ -1,14 +1,22 @@
 ---
+library_name: transformers
 license: apache-2.0
 tags:
 - dllm
 - diffusion
 - llm
 - text_generation
-library_name: transformers
+pipeline_tag: text-generation
 ---
+
 # LLaDA-MoE
 
+This model is based on the principles described in the paper [Large Language Diffusion Models](https://huggingface.co/papers/2502.09992).
+
+- 📚 [Paper](https://huggingface.co/papers/2502.09992)
+- 🏠 [Project Page](https://ml-gsai.github.io/LLaDA-demo/)
+- 💻 [Code](https://github.com/ML-GSAI/LLaDA)
+
 **LLaDA-MoE** is a new and upgraded series of the LLaDA diffusion language model. This pre-release includes two cutting-edge models:
 
 - `LLaDA-MoE-7B-A1B-Base`: A base pre-trained model designed for research and secondary development.
@@ -223,10 +231,6 @@ input_ids = torch.tensor(input_ids).to(device).unsqueeze(0)
 
 text = generate(model, input_ids, steps=128, gen_length=128, block_length=32, temperature=0., cfg_scale=0., remasking='low_confidence')
 print(tokenizer.batch_decode(text[:, input_ids.shape[1]:], skip_special_tokens=False)[0])
-
-
-
-
 ```
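The metadata added here changes how the Hub treats the repo: `library_name: transformers` tells it which library loads the checkpoint, and `pipeline_tag: text-generation` files the model under text generation. A minimal loading sketch follows; the repo id `inclusionAI/LLaDA-MoE-7B-A1B-Base` and the `trust_remote_code` usage are assumptions for illustration and are not part of this commit — only the `generate(...)` call is taken from the diff context above.

```python
# Minimal loading sketch (assumptions: hypothetical repo id, custom remote code).
import torch
from transformers import AutoModel, AutoTokenizer

model_id = "inclusionAI/LLaDA-MoE-7B-A1B-Base"  # hypothetical repo id, for illustration only
device = "cuda"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModel.from_pretrained(
    model_id,
    trust_remote_code=True,
    torch_dtype=torch.bfloat16,
).to(device).eval()

# Tokenize a prompt and build the input tensor, matching the hunk context
# `input_ids = torch.tensor(input_ids).to(device).unsqueeze(0)`.
prompt = "What is a masked diffusion language model?"
input_ids = tokenizer(prompt)["input_ids"]
input_ids = torch.tensor(input_ids).to(device).unsqueeze(0)

# The README's diffusion sampler (shown in the diff context) is then called as:
# text = generate(model, input_ids, steps=128, gen_length=128, block_length=32,
#                 temperature=0., cfg_scale=0., remasking='low_confidence')
# print(tokenizer.batch_decode(text[:, input_ids.shape[1]:], skip_special_tokens=False)[0])
```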