Commit
•
65fe965
1
Parent(s):
49c8c9e
Update README.md
Browse files
README.md
CHANGED
@@ -3,7 +3,7 @@ license: mit
|
|
3 |
---
|
4 |
# Beyond Language Models: Byte Models are Digital World Simulators
|
5 |
|
6 |
-
This repository contains the weights for the bGPT model as described in the paper [Beyond Language Models: Byte Models are Digital World Simulators](https://arxiv.org/
|
7 |
|
8 |
bGPT supports generative modelling via next byte prediction on any type of data and can perform any task executable on a computer, showcasing the capability to simulate all activities within the digital world, with its potential only limited by computational resources and our imagination.
|
9 |
|
@@ -144,4 +144,12 @@ The CPU States Dataset contains 2.1 million instances, each featuring a 1KB memo
|
|
144 |
|
145 |
## BibTeX entry and citation info
|
146 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
147 |
```
|
|
|
3 |
---
|
4 |
# Beyond Language Models: Byte Models are Digital World Simulators
|
5 |
|
6 |
+
This repository contains the weights for the bGPT model as described in the paper [Beyond Language Models: Byte Models are Digital World Simulators](https://arxiv.org/abs/2402.19155).
|
7 |
|
8 |
bGPT supports generative modelling via next byte prediction on any type of data and can perform any task executable on a computer, showcasing the capability to simulate all activities within the digital world, with its potential only limited by computational resources and our imagination.
|
9 |
|
|
|
144 |
|
145 |
## BibTeX entry and citation info
|
146 |
```
|
147 |
+
@misc{wu2024language,
|
148 |
+
title={Beyond Language Models: Byte Models are Digital World Simulators},
|
149 |
+
author={Shangda Wu and Xu Tan and Zili Wang and Rui Wang and Xiaobing Li and Maosong Sun},
|
150 |
+
year={2024},
|
151 |
+
eprint={2402.19155},
|
152 |
+
archivePrefix={arXiv},
|
153 |
+
primaryClass={cs.LG}
|
154 |
+
}
|
155 |
```
|