---
# For reference on model card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/modelcard.md?plain=1
# Doc / guide: https://huggingface.co/docs/hub/model-cards
{}
---

# Model Card for Model ID

This model card aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).

## Model Details

### Model Description

- **Developed by:** [More Information Needed]
- **Funded by [optional]:** [More Information Needed]
- **Shared by [optional]:** [More Information Needed]
- **Model type:** [More Information Needed]
- **Language(s) (NLP):** [More Information Needed]
- **License:** [More Information Needed]
- **Finetuned from model [optional]:** [More Information Needed]

### Model Sources [optional]

- **Repository:** [More Information Needed]
- **Paper [optional]:** [More Information Needed]
- **Demo [optional]:** [More Information Needed]

## Training Details

### Training Data

[More Information Needed]

### Training Procedure

Supervised Fine-Tuning (SFT) on the chosen examples, followed by Direct Preference Optimization (DPO) on the preference data.

#### Preprocessing [optional]

[More Information Needed]

#### Training Hyperparameters

DPO hyperparameters:

* `beta=0.1`
* `learning_rate=5e-6`
* `gradient_accumulation=8`
* `num_train_epochs=2`

### Testing Data, Factors & Metrics

#### Testing Data

[More Information Needed]

#### Metrics

[More Information Needed]

### Results

[More Information Needed]

#### Summary

[More Information Needed]

## Technical Specifications

### Compute Infrastructure

[More Information Needed]

#### Hardware

[More Information Needed]

#### Software

[More Information Needed]

## Model Card Authors and Contacts

**DebuggingFace**

- Antonio Mari (antonio.mari@epfl.ch)
- Matteo Santelmo (matteo.santelmo@epfl.ch)
- Stefano Viel (stefano.viel@epfl.ch)
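
## Appendix: Sketch of the DPO Objective

To make the role of `beta` from the Training Hyperparameters section concrete, here is a minimal sketch of the standard DPO loss for a single preference pair. This card does not state which training library or implementation was used, so the `DPOHyperparameters` class and the `dpo_loss` function are hypothetical names introduced only for illustration, and the log-probabilities in the usage lines are made-up inputs, not results from this model.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class DPOHyperparameters:
    """Hypothetical container; values copied from the Training Hyperparameters section."""
    beta: float = 0.1
    learning_rate: float = 5e-6
    gradient_accumulation: int = 8
    num_train_epochs: int = 2


def dpo_loss(policy_chosen_logp: float, policy_rejected_logp: float,
             ref_chosen_logp: float, ref_rejected_logp: float,
             beta: float) -> float:
    """Standard DPO loss for one pair, given sequence-level log-probabilities."""
    # How much more likely each response is under the policy than under the reference.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    # beta scales the implicit reward margin between chosen and rejected responses.
    margin = beta * (chosen_logratio - rejected_logratio)
    # -log(sigmoid(margin)) == log(1 + exp(-margin)), in a numerically stable form.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


hparams = DPOHyperparameters()

# Illustrative inputs only: a policy identical to the reference gives a zero margin.
loss_at_init = dpo_loss(-7.0, -9.0, -7.0, -9.0, hparams.beta)

# Illustrative inputs only: the policy has shifted probability toward the chosen response.
loss_after_shift = dpo_loss(-5.0, -10.0, -6.0, -8.0, hparams.beta)
```

When the policy equals the reference model the margin is zero and the loss is `log(2)`. The loss falls below that value as the policy assigns relatively more probability to the chosen response than the reference does. A smaller `beta` makes the loss less sensitive to a given log-ratio gap.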