This collection includes variants of the base phi3-mini model: A) DPO-aligned; B) SFT on the ARC train split (3 epochs); C) a quantised version of B) using GPTQ.
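To give a feel for what the GPTQ bit-widths listed on this page (2, 3, 4 and 8 bits) mean in practice, here is a rough sketch of the idealised weight storage at each setting. It assumes roughly 3.8 billion parameters for phi3-mini and a 16-bit unquantised baseline; both are assumptions, not values taken from these repositories. The helper name `weight_size_gib` is hypothetical. Real GPTQ checkpoints also store per-group scales and zero points, and typically leave some layers unquantised, so actual files will be somewhat larger.

```python
# Rough, idealised estimate of weight storage at different bit-widths.
# ASSUMPTION: ~3.8e9 parameters for phi3-mini (not stated on this page).
PHI3_MINI_PARAMS = 3.8e9


def weight_size_gib(bits_per_weight: int, n_params: float = PHI3_MINI_PARAMS) -> float:
    """Return weight storage in GiB if every parameter used `bits_per_weight` bits.

    Ignores GPTQ metadata (scales, zero points) and unquantised layers,
    so this is a lower bound rather than a real file size.
    """
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    # 16 bits is an ASSUMED unquantised baseline for comparison.
    for bits in (2, 3, 4, 8, 16):
        print(f"{bits:>2}-bit: ~{weight_size_gib(bits):.2f} GiB")
```

The estimate scales linearly with the bit-width, so under these assumptions the 4-bit variant needs about half the weight storage of the 8-bit one.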

MLP (classroom)
AI & ML interests: Building ChatGPT for EPFL Course Contents
Models: 28

cs552-mlp/phi3-lora-arc3-gptq-2bits
Text Generation • Updated • 107

cs552-mlp/phi3-lora-arc3-gptq-3bits
Text Generation • Updated • 109

cs552-mlp/phi3-lora-arc3-gptq-4bits
Text Generation • Updated • 111

cs552-mlp/phi3-lora-sciq3-gptq
Text Generation • Updated • 108

cs552-mlp/phi3-lora-gptq-8bits
Text Generation • Updated • 124

cs552-mlp/phi3-lora-mcq3
Updated • 2

cs552-mlp/phi3-gptq-4bits
Text Generation • Updated • 111

cs552-mlp/phi3-lora-openbookqa3
Updated • 1

cs552-mlp/phi3-lora-sciq3
Updated • 3

cs552-mlp/phi3-gptq-8bits
Text Generation • Updated • 89
Datasets: None public yet