Continuous-Rivals-Discrete
/

langflow-owt

Text Generation

Model card Files Files and versions

chumengl commited on 1 day ago

Commit

a08f933

·

verified ·

1 Parent(s): d48a15b

Update README.md

Files changed (1) hide show

README.md +15 -0

README.md CHANGED Viewed

@@ -16,6 +16,9 @@ pipeline_tag: text-generation
 LangFlow is a continuous diffusion language model that operates in embedding space. Unlike discrete diffusion models (MDLM, SEDD, DUO), LangFlow performs diffusion directly on continuous token embeddings, enabling smoother denoising dynamics.
 ## Using LangFlow
 To use the pre-trained model for text generation, use the following snippet:
@@ -41,6 +44,18 @@ for text in texts:
 - **Training**: 1M steps on OpenWebText corpus
 - **Tokenizer**: GPT-2 tokenizer (50,257 vocab size)
 ## Model Card Contact
 Chumeng Liang (chumengl@illinois.edu)

 LangFlow is a continuous diffusion language model that operates in embedding space. Unlike discrete diffusion models (MDLM, SEDD, DUO), LangFlow performs diffusion directly on continuous token embeddings, enabling smoother denoising dynamics.
+For more details, please see our paper: [LangFlow: Continuous Diffusion Rivals Discrete in Language Modeling](https://arxiv.org/abs/2604.11748).
 ## Using LangFlow
 To use the pre-trained model for text generation, use the following snippet:
 - **Training**: 1M steps on OpenWebText corpus
 - **Tokenizer**: GPT-2 tokenizer (50,257 vocab size)
+## Citation
+```
+@article{chen2026langflow,
+  title={LangFlow: Continuous Diffusion Rivals Discrete in Language Modeling},
+  author={Chen, Yuxin and Liang, Chumeng and Sui, Hangke and Guo, Ruihan and Cheng, Chaoran and You, Jiaxuan and Liu, Ge},
+  journal={arXiv preprint arXiv:2604.11748},
+  year={2026}
+}
+```
 ## Model Card Contact
 Chumeng Liang (chumengl@illinois.edu)