Kristijan's picture
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
bcb66e3