OG = only_gpt
This is a model which should produce exactly the same behavior as GPT2. It shows that GenKaLM model falls back to GPT2 when the graph part is disabled.
OG = only_gpt
This is a model which should produce exactly the same behavior as GPT2. It shows that GenKaLM model falls back to GPT2 when the graph part is disabled.