license: apache-2.0
datasets:
  - 922-Narra/lt_08312023_test_5j1

A pretrained toy model, made with Andrej Karpathy's nanoGPT around 2023.

Parameters:

  • batch_size = 64
  • block_size = 256
  • n_layer = 8
  • n_head = 8
  • n_embd = 768

All other settings are left unchanged from nanoGPT.
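
To give a rough sense of scale, the parameters listed above can be plugged into a few back-of-the-envelope calculations. This is only a sketch: `ToyConfig` and the helper functions are hypothetical names used for illustration and are not part of nanoGPT. The parameter estimate assumes the standard GPT-2 style block (about `12 * n_embd^2` weights per layer) and ignores embeddings, biases, and layer norms, since the vocabulary size is not stated here. The token count assumes no gradient accumulation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyConfig:
    """Hypothetical container; values copied from the parameter list above."""
    batch_size: int = 64
    block_size: int = 256
    n_layer: int = 8
    n_head: int = 8
    n_embd: int = 768


def head_dim(cfg: ToyConfig) -> int:
    # Each attention head gets an equal slice of the embedding dimension.
    assert cfg.n_embd % cfg.n_head == 0, "n_embd must be divisible by n_head"
    return cfg.n_embd // cfg.n_head


def tokens_per_batch(cfg: ToyConfig) -> int:
    # Tokens in one batch (assumes no gradient accumulation).
    return cfg.batch_size * cfg.block_size


def approx_block_params(cfg: ToyConfig) -> int:
    # Assumed GPT-2 style block: 4*n_embd^2 (attention) + 8*n_embd^2 (MLP).
    # Embeddings, biases, and layer norms are deliberately left out.
    return 12 * cfg.n_layer * cfg.n_embd ** 2


cfg = ToyConfig()
print("head dimension:", head_dim(cfg))
print("tokens per batch:", tokens_per_batch(cfg))
print("approx. transformer-block parameters:", approx_block_params(cfg))
```

Under these assumptions the transformer blocks hold roughly 57 million weights; the real total depends on the tokenizer's vocabulary size and on the settings left unchanged.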