Avelina/lovelace-medium-alpha1
Text Generation
•
Updated
•
14
•
1
This collection contains the pre-trained, fine-tuned and aligned models for the Direct Preference Heads paper.
Note Pretrained Transformer-XL model with 550M parameters, trained on 100B tokens from The Pile.