
8 contributors

History: 48 commits

versae

Model at 210k steps, mlm acc 0.6537

7ec1ea9 almost 4 years ago

configs
Changed and added vocab and tokenizer almost 4 years ago
mc4
Fixes to mc4 fork almost 4 years ago
.gitattributes

736 Bytes

Update .gitattributes almost 4 years ago
.gitignore

1.84 kB

Initial test with BETO's corpus almost 4 years ago
README.md

1.84 kB

Fixed widget example almost 4 years ago
config.json

618 Bytes

Fix config for checkpoint almost 4 years ago
config.py

256 Bytes

Preparing code for final runs almost 4 years ago
convert.py

876 Bytes

Improved version of conversion script Flax → PyTorch almost 4 years ago
flax_model.msgpack

250 MB
LFS

Model at 210k steps, mlm acc 0.6537 almost 4 years ago
get_embeddings_and_perplexity.py

1.53 kB

Add script to generate dataset of embeddings and perplexities. Add script to generate t-SNE plot for embedding and perplexity visualization. almost 4 years ago
merges.txt

505 kB

Changed and added vocab and tokenizer almost 4 years ago
perplexity.py

751 Bytes

Adding checkpointing, wandb, and new mlm script almost 4 years ago
pytorch_model.bin
Detected Pickle imports (4)
- "torch.LongStorage",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict"
499 MB
LFS

Model at 210k steps, mlm acc 0.6537 almost 4 years ago
run.sh

883 Bytes

Adding base config and organizing configs almost 4 years ago
run_mlm_flax.py

30 kB

Adding sampling to mc4 almost 4 years ago
run_mlm_flax_stream.py

30.8 kB

Adding pad_to_multiple_of=16 almost 4 years ago
run_stream.sh

932 Bytes

Preparing code for final runs almost 4 years ago
special_tokens_map.json

239 Bytes

Changed and added vocab and tokenizer almost 4 years ago
tokenizer.json

1.45 MB

Changed and added vocab and tokenizer almost 4 years ago
tokenizer_config.json

292 Bytes

Changed and added vocab and tokenizer almost 4 years ago
tokens.py

649 Bytes

Scripts for perplexity sampling and fixes almost 4 years ago
tokens.py.orig

899 Bytes

Adjust batch size for extracting tokens almost 4 years ago
tsne_plot.py

3.02 kB

Remove unused imports almost 4 years ago
vocab.json

846 kB

Changed and added vocab and tokenizer almost 4 years ago
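
The pickle imports listed for pytorch_model.bin are the module-level names a pickle stream asks the loader to import. As a rough illustration only (not necessarily how the listing on this page was produced), such names can be read out of raw pickle bytes with the standard-library `pickletools` module, without unpickling anything. The helper name `list_pickle_imports` is hypothetical, and the sketch assumes the simple case where the module and attribute strings are pushed directly before `STACK_GLOBAL`; it does not resolve strings fetched from the pickle memo. A zip-based checkpoint would first need its inner pickle file extracted.

```python
import collections
import pickle
import pickletools
from typing import List

# Opcodes that push a text string onto the pickle stack.
_STRING_OPS = {"UNICODE", "BINUNICODE", "SHORT_BINUNICODE", "BINUNICODE8"}


def list_pickle_imports(data: bytes) -> List[str]:
    """Return the 'module.name' globals referenced by a pickle stream.

    Hypothetical helper: it walks the opcodes only and never executes
    the pickle, so it is safe to run on untrusted bytes.
    """
    imports = []
    recent_strings = []
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name in _STRING_OPS:
            recent_strings.append(arg)
        elif opcode.name == "GLOBAL":
            # Protocols 0-3: pickletools reports the argument as "module name".
            imports.append(arg.replace(" ", ".", 1))
        elif opcode.name == "STACK_GLOBAL" and len(recent_strings) >= 2:
            # Protocol 4+: module and name are the two most recent strings
            # (simplifying assumption, see the note above).
            imports.append(f"{recent_strings[-2]}.{recent_strings[-1]}")
    # Drop duplicates while keeping first-seen order.
    return list(dict.fromkeys(imports))


# Small self-contained demonstration on an in-memory pickle.
payload = pickle.dumps(collections.OrderedDict(), protocol=4)
found = list_pickle_imports(payload)
print(found)
```

Because the scan never calls `pickle.load`, it can be used to inspect a file before deciding whether to trust it.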