This is a UNA version of UNA-SOLAR-10.7B-Instruct-1.0, trained with DPO over MathPILE Books.

I used the outstanding MathPILE dataset of great mathematical material to produce this beautiful model :)

UNA-DPO was applied over the Attention and MLP layers.

  • PEFT 0.7.1
  • Transformers 4.36.2-UNA
  • PyTorch 2.1.2+cu121
  • Datasets 2.16.0
  • Tokenizers 0.15.
10.7B params
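
To compare a local environment against the framework versions listed above, a minimal sketch like the following may help. It is not part of the original training setup. The `LISTED` table and the helper names `installed_version` and `find_mismatches` are hypothetical, and the pip package names (`peft`, `transformers`, `torch`, `datasets`, `tokenizers`) are assumed to be the usual ones. Because the Transformers entry carries a custom `-UNA` suffix and the Tokenizers entry is cut off, the check only compares version prefixes.

```python
from importlib import metadata

# Versions as listed in this card. Assumptions: the "-UNA" suffix marks a
# custom Transformers build, so only the upstream base version is compared;
# the Tokenizers entry is cut off, so only the "0.15." prefix is checked.
LISTED = {
    "peft": "0.7.1",
    "transformers": "4.36.2",
    "torch": "2.1.2",
    "datasets": "2.16.0",
    "tokenizers": "0.15.",
}


def installed_version(package):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(listed, lookup=installed_version):
    """Return {package: found_version} for packages that do not match.

    A package matches when its installed version starts with the listed
    prefix. This is a loose check (for example "0.7.1" also matches "0.7.10").
    """
    mismatches = {}
    for package, prefix in listed.items():
        found = lookup(package)
        if found is None or not found.startswith(prefix):
            mismatches[package] = found
    return mismatches


if __name__ == "__main__":
    for package, found in find_mismatches(LISTED).items():
        print(f"{package}: listed {LISTED[package]}, found {found}")
```

Running the script prints one line per package whose installed version differs from the list, and prints nothing when everything matches. A prefix comparison was chosen over strict equality so that local build tags such as `+cu121` do not produce false alarms.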
