Quantization made by Richard Erkhov.
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta - bnb 4bits
- Model creator: https://huggingface.co/EpistemeAI/
- Original model: https://huggingface.co/EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta/
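As a rough illustration of what "bnb 4bits" means in practice, the sketch below loads the original model with bitsandbytes 4-bit quantization applied at load time through `transformers`. The repository id of the pre-quantized upload is not stated here, so the example points at the original repository instead. The specific settings (`nf4`, `bfloat16` compute) are assumptions, not values confirmed by this card, and the helper names `load_4bit` and `complete` are hypothetical.

```python
# Original (unquantized) repository named in this card.
MODEL_ID = "EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta"


def load_4bit(model_id: str = MODEL_ID):
    """Load a causal LM with bitsandbytes 4-bit weights (needs a CUDA GPU)."""
    # Imported lazily so this file can be read or imported without the heavy deps.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

    # Assumed settings: the card only says "bnb 4bits".
    quant_config = BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_compute_dtype=torch.bfloat16,
    )
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        quantization_config=quant_config,
        device_map="auto",
    )
    return tokenizer, model


def complete(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a continuation for `prompt` with the 4-bit model."""
    tokenizer, model = load_4bit()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `complete("My name is Julien and I like to")` would reuse one of the widget prompts from the original model description. It requires `transformers`, `accelerate`, `bitsandbytes`, and a CUDA-capable GPU.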
Original model description:
language:
- en
widget:
- text: "My name is Julien and I like to"
  example_title: "Julien"
- text: "My name is Merve and my favorite"
  example_title: "Merve"
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- trl
base_model: EpistemeAI2/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math
model-index:
- name: Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
  results:
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: IFEval (0-Shot)
      type: HuggingFaceH4/ifeval
      args:
        num_few_shot: 0
    metrics:
    - type: inst_level_strict_acc and prompt_level_strict_acc
      value: 72.74
      name: strict accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: BBH (3-Shot)
      type: BBH
      args:
        num_few_shot: 3
    metrics:
    - type: acc_norm
      value: 26.9
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MATH Lvl 5 (4-Shot)
      type: hendrycks/competition_math
      args:
        num_few_shot: 4
    metrics:
    - type: exact_match
      value: 13.22
      name: exact match
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: GPQA (0-shot)
      type: Idavidrein/gpqa
      args:
        num_few_shot: 0
    metrics:
    - type: acc_norm
      value: 4.03
      name: acc_norm
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MuSR (0-shot)
      type: TAUR-Lab/MuSR
      args:
        num_few_shot: 0
    metrics:
    - type: acc_norm
      value: 4.28
      name: acc_norm
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MMLU-PRO (5-shot)
      type: TIGER-Lab/MMLU-Pro
      config: main
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 28.26
      name: accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
      name: Open LLM Leaderboard
KTO Fine tuning!
A KTO fine-tuned version of EpistemeAI2/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math.
Uploaded model
- Developed by: EpistemeAI2
- License: apache-2.0
- Finetuned from model: EpistemeAI2/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math
This Llama model was trained 2x faster with Unsloth and Hugging Face's TRL library.
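KTO (Kahneman-Tversky Optimization) trains on unpaired examples, each marked as desirable or undesirable, rather than on chosen/rejected pairs. The sketch below shows the record shape that TRL's `KTOTrainer` expects (`prompt`, `completion`, `label`). The data used to train this model is not described in this card, so the sample rows and the helper `unpair` are purely hypothetical.

```python
from typing import Dict, List, Union

# One KTO training record: a prompt, a completion, and a desirability flag.
KTOExample = Dict[str, Union[str, bool]]


def unpair(prompt: str, chosen: str, rejected: str) -> List[KTOExample]:
    """Split one paired preference into two unpaired KTO records."""
    return [
        {"prompt": prompt, "completion": chosen, "label": True},     # desirable
        {"prompt": prompt, "completion": rejected, "label": False},  # undesirable
    ]


# Hypothetical sample, not taken from the model's actual training set.
rows = unpair(
    prompt="What is 7 * 8?",
    chosen="7 * 8 = 56.",
    rejected="7 * 8 = 54.",
)

for row in rows:
    print(row)
```

Records of this shape can be collected into a dataset and passed to a KTO trainer. The desirable and undesirable examples do not need to come in matched pairs, which is the main practical difference from DPO-style data.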
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 24.90 |
IFEval (0-Shot) | 72.74 |
BBH (3-Shot) | 26.90 |
MATH Lvl 5 (4-Shot) | 13.22 |
GPQA (0-shot) | 4.03 |
MuSR (0-shot) | 4.28 |
MMLU-PRO (5-shot) | 28.26 |
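The "Avg." row appears to be the unweighted mean of the six benchmark scores. This is an inference from the numbers in the table, not something the card states. A quick check:

```python
# Scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 72.74,
    "BBH (3-Shot)": 26.90,
    "MATH Lvl 5 (4-Shot)": 13.22,
    "GPQA (0-shot)": 4.03,
    "MuSR (0-shot)": 4.28,
    "MMLU-PRO (5-shot)": 28.26,
}

# Unweighted mean across the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"Mean of {len(scores)} benchmarks: {average:.3f}")
```

The computed mean agrees with the reported 24.90 to within rounding.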