Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
igormolybog
's Collections
Domain spec fine-tuning
Inference speed
llama + WebWork
evals
Solver training
Datasets
Reasoning
Hetero training
Long context
Open
Agents
LM economy
Scaling laws
compression
robotics
Alignment
Imagen
Agents
updated
Feb 7
Upvote
-
Efficient Exploration for LLMs
Paper
•
2402.00396
•
Published
Feb 1
•
21
Upvote
-
Share collection
View history
Collection guide
Browse collections