-
-
-
-
-
-
Inference Providers
Active filters:
full
mlfoundations-dev/hp_ablations_mistral_lr8e-6_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_mistral_scheduler_constant_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr1e-6_dcftv1.2
Text Generation
•
Updated
•
8
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr5e-7_dcftv1.2
Text Generation
•
Updated
•
6
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_dcftv1.2
Text Generation
•
Updated
•
7
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr1e-7_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_mistral_bsz1024_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.05_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.15_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_mistral_scheduler_inverse_sqrt_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.10_dcftv1.2
Text Generation
•
Updated
•
6
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-6_dcftv1.2
Text Generation
•
Updated
•
5
tensorblock/oh-dcft-v1.2_no-curation_gpt-4o-mini-GGUF
Updated
•
132
mlfoundations-dev/hp_ablations_mistral_bsz2048_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr5e-7_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-7_dcftv1.2
Text Generation
•
Updated
•
5
lightblue/qwen2.5-7B-instruct-orpo2
Text Generation
•
Updated
•
7
QuantFactory/LLaMA-O1-Base-1127-GGUF
Updated
•
273
•
2
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.9_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.95_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.92_dcftv1.2
Text Generation
•
Updated
•
9
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.95_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.98_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.85_dcftv1.2
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.999_dcftv1.2
Text Generation
•
Updated
•
7
mlfoundations-dev/hp_ablations_qwen_bsz256_dcftv1.2
Text Generation
•
Updated
•
7
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.995_dcftv1.2
Text Generation
•
Updated
•
10
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.9995_dcftv1.2
Text Generation
•
Updated
•
6
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.99_dcftv1.2
Text Generation
•
Updated
•
9