Edit model card

Model Card for Model ID

This modelcard aims to be a base template for new models. It has been generated using this raw template.

Model Details

Model Description

  • Developed by: [More Information Needed]
  • Funded by [optional]: [More Information Needed]
  • Shared by [optional]: [More Information Needed]
  • Model type: [More Information Needed]
  • Language(s) (NLP): [More Information Needed]
  • License: [More Information Needed]
  • Finetuned from model [optional]: [More Information Needed]

Model Sources [optional]

  • Repository: [More Information Needed]
  • Paper [optional]: [More Information Needed]
  • Demo [optional]: [More Information Needed]

Training Details

Training Data

[More Information Needed]

Training Procedure

Supervised Fine-Tuning (SFT) on chosen examples and Direct Preference Optimiazion (DPO) on preference data.

Preprocessing [optional]

[More Information Needed]

Training Hyperparameters

DPO hyperparameters

  • beta=0.1
  • learning_rate=5e-6
  • gradient_accumulation=8
  • num_train_epochs=2

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

[More Information Needed]

Technical Specifications

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Model Card Authors and Contacts

DebuggingFace Antonio Mari (antonio.mari@epfl.ch) Matteo Santelmo (matteo.santelmo@epfl.ch) Stefano Viel (stefano.viel@epfl.ch)

Downloads last month
3
Safetensors
Model size
2.51B params
Tensor type
F32
·
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.