chumengl commited on
Commit
a08f933
·
verified ·
1 Parent(s): d48a15b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md CHANGED
@@ -16,6 +16,9 @@ pipeline_tag: text-generation
16
 
17
  LangFlow is a continuous diffusion language model that operates in embedding space. Unlike discrete diffusion models (MDLM, SEDD, DUO), LangFlow performs diffusion directly on continuous token embeddings, enabling smoother denoising dynamics.
18
 
 
 
 
19
  ## Using LangFlow
20
 
21
  To use the pre-trained model for text generation, use the following snippet:
@@ -41,6 +44,18 @@ for text in texts:
41
  - **Training**: 1M steps on OpenWebText corpus
42
  - **Tokenizer**: GPT-2 tokenizer (50,257 vocab size)
43
 
 
 
 
 
 
 
 
 
 
 
 
44
  ## Model Card Contact
45
 
46
  Chumeng Liang (chumengl@illinois.edu)
 
 
16
 
17
  LangFlow is a continuous diffusion language model that operates in embedding space. Unlike discrete diffusion models (MDLM, SEDD, DUO), LangFlow performs diffusion directly on continuous token embeddings, enabling smoother denoising dynamics.
18
 
19
+ For more details, please see our paper: [LangFlow: Continuous Diffusion Rivals Discrete in Language Modeling](https://arxiv.org/abs/2604.11748).
20
+
21
+
22
  ## Using LangFlow
23
 
24
  To use the pre-trained model for text generation, use the following snippet:
 
44
  - **Training**: 1M steps on OpenWebText corpus
45
  - **Tokenizer**: GPT-2 tokenizer (50,257 vocab size)
46
 
47
+ ## Citation
48
+
49
+ ```
50
+ @article{chen2026langflow,
51
+ title={LangFlow: Continuous Diffusion Rivals Discrete in Language Modeling},
52
+ author={Chen, Yuxin and Liang, Chumeng and Sui, Hangke and Guo, Ruihan and Cheng, Chaoran and You, Jiaxuan and Liu, Ge},
53
+ journal={arXiv preprint arXiv:2604.11748},
54
+ year={2026}
55
+ }
56
+ ```
57
+
58
  ## Model Card Contact
59
 
60
  Chumeng Liang (chumengl@illinois.edu)
61
+