Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

Reinforcement Learning

Inference Endpoints

text-generation-inference

Misc with no match

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

10

Full-text search

Active filters: Reinforcement Learning

mrlijun/SMR-R1

Updated Apr 2 • 8 • 2

HUANG1993/GreedRL-VRP-pretrained-v1

Reinforcement Learning • Updated Apr 26, 2023 • 4

Hawk91/PongNoFrameskip-v4_DQN

Updated Aug 21, 2023 • 1

ledmands/ALE-Pacman-v5

Reinforcement Learning • Updated Jun 2, 2024 • 50 • 1

Daemontatox/Cogito-R1

Text Generation • Updated Feb 19 • 5 • 5

mradermacher/Cogito-R1-GGUF

Updated Feb 12 • 388

mradermacher/Cogito-R1-i1-GGUF

Updated Feb 13 • 477

omreab/SoccerTwos

Updated Apr 3 • 12

mradermacher/SMR-R1-GGUF

Updated 24 days ago • 161

mradermacher/SMR-R1-i1-GGUF

Updated 24 days ago • 269