BashirRP/llm_judge_fiddler
BashirRP/llm_judge_bashir
vwxyzjn/online_dpo_llmjudge
Text Generation
• 1B • Updated • 3
vwxyzjn/online_dpo_llmjudge_tldr
Updated
vwxyzjn/online_dpo_llmjudge_tldr_6.9b
Text Generation
• 7B • Updated • 2
Wonder-Griffin/XL-Judge-LLM
Text Generation
• Updated • 147
abhiraj1/llm_judge_lora_model
Updated
abhiraj1/llm-judge-gemma-2-9b-4bit
Text Generation
• 10B • Updated • 1
abhiraj1/llm-judge-gemma-9B-4bit-int
Text Generation
• Updated • 2
abhiraj1/llm_judge_lora_model_llama3.1
Updated
abhiraj1/llm-judge-llama-3.1-4bit
Text Generation
• Updated • 1
abhiraj1/llm_judge_lora_model_llama3.1_v2
Updated
abhiraj1/llm-judge-llama-3.1-4bit_v2
Text Generation
• Updated • 3
hongji-s/fall-as4-llm-as-judge-ft-model
Updated
hellomomiji/dpo_trained_model_llm_judge
Updated
xzhe121/Llama-3.2-3B-llm-judge-dpo-finetuned
Updated
wanyuhe499/llm_judge_dpo_peft
Updated
PaceAhh/llama-3.2-dpo-lora-adapter-llm-judge
wanyuhe499/llm_judge_dpo_peft_iter1
Updated
wanyuhe499/llm_judge_dpo_peft_iter2
Updated
wanyuhe499/llm_judge_dpo_peft_iter3
Updated
yagebin/fine-tuned-distilbert-base-uncased-LLM-Judge
Text Classification
• 67.3M • Updated • 3
mattzcarey/zeval-llm-as-judge
• 1B • Updated
MrezaPRZ/Qwen2.5-Coder-3B-grpo-llm_judge-750-bad-checkpoint
Text Generation
• 3B • Updated • 1