custom_gpt2 / README.md
mmoffatt's picture
Update README.md
9df72c0 verified
|
raw
history blame
311 Bytes
metadata
license: mit
datasets:
  - Salesforce/wikitext
language:
  - en

This is a custom implementation of gpt2, where we replace attention with our implementation. Currently, we don't replace softmax, but in future submits we would like to replace the softmax function in attention with other softmax variations.