mingpt-openwebtext / README.md
license: mit
datasets:
  - Skylion007/openwebtext

MinGPT

A standard GPT-2 architecture, trained on a subset of OpenWebText with the configuration below:

    n_ctx = 256,
    n_positions = 256,
    n_layer = 6,
    n_embd = 384,
    n_head = 6,
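
To give a feel for the model size this configuration implies, here is a minimal sketch that counts parameters for a GPT-2 style decoder with these values. It assumes the stock GPT-2 vocabulary of 50257 tokens, a 4x MLP expansion, and an output head tied to the token embedding; none of these are stated above, so treat the result as an estimate rather than the checkpoint's exact size. The names `CONFIG`, `VOCAB_SIZE` and `count_gpt2_params` are illustrative only.

```python
# Values taken from the configuration listed above.
CONFIG = {
    "n_ctx": 256,
    "n_positions": 256,
    "n_layer": 6,
    "n_embd": 384,
    "n_head": 6,
}

# ASSUMPTION: the standard GPT-2 BPE vocabulary size; not stated in this README.
VOCAB_SIZE = 50257


def count_gpt2_params(cfg, vocab_size):
    """Estimate parameters of a GPT-2 style model with a tied output head."""
    d = cfg["n_embd"]

    token_embedding = vocab_size * d
    position_embedding = cfg["n_positions"] * d

    # One transformer block (weights + biases):
    layer_norms = 2 * (2 * d)              # ln_1 and ln_2, scale and shift each
    attn_qkv = d * (3 * d) + 3 * d         # fused query/key/value projection
    attn_out = d * d + d                   # attention output projection
    mlp_in = d * (4 * d) + 4 * d           # ASSUMPTION: 4x hidden expansion
    mlp_out = (4 * d) * d + d
    per_block = layer_norms + attn_qkv + attn_out + mlp_in + mlp_out

    final_layer_norm = 2 * d

    return (
        token_embedding
        + position_embedding
        + cfg["n_layer"] * per_block
        + final_layer_norm
    )


# The embedding width must split evenly across the attention heads.
assert CONFIG["n_embd"] % CONFIG["n_head"] == 0
head_dim = CONFIG["n_embd"] // CONFIG["n_head"]
total_params = count_gpt2_params(CONFIG, VOCAB_SIZE)

print(f"head dimension: {head_dim}")
print(f"estimated parameters: {total_params:,}")
```

Under these assumptions each attention head works on 64 dimensions, and the estimate comes to roughly 30 million parameters, most of them in the token embedding table.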