Hugging Face
Models
83
Sort: Trending
Active filters: rl
sryu1/jsbgym_models • Reinforcement Learning • Updated Jun 23, 2023
d-byrne/snake-v1_training_state • Updated Jun 8, 2023
InstaDeepAI/jumanji-benchmark-a2c-BinPack-v2 • Updated Jun 8, 2023
InstaDeepAI/jumanji-benchmark-a2c-CVRP-v1 • Updated Jun 8, 2023 • 1
ContextualAI/archangel_sft_pythia1-4b • Text Generation • Updated Jan 11 • 62
ContextualAI/archangel_sft_pythia2-8b • Text Generation • Updated Jan 11 • 30
ContextualAI/archangel_sft_pythia6-9b • Text Generation • Updated Jan 11 • 15
ContextualAI/archangel_sft_pythia12-0b • Text Generation • Updated Jan 11 • 17
ContextualAI/archangel_sft_llama7b • Text Generation • Updated Jan 11 • 414 • 1
ContextualAI/archangel_sft_llama13b • Text Generation • Updated Jan 11 • 199
ContextualAI/archangel_sft_llama30b • Text Generation • Updated Jan 11 • 2
ContextualAI/archangel_slic_llama30b • Text Generation • Updated Jan 11 • 2
ContextualAI/archangel_slic_pythia1-4b • Text Generation • Updated Jan 11 • 1
ContextualAI/archangel_slic_pythia2-8b • Text Generation • Updated Jan 11 • 1
ContextualAI/archangel_slic_pythia6-9b • Text Generation • Updated Jan 11 • 1
ContextualAI/archangel_slic_pythia12-0b • Text Generation • Updated Jan 11 • 1
ContextualAI/archangel_slic_llama7b • Text Generation • Updated Jan 11 • 3 • 1
ContextualAI/archangel_slic_llama13b • Text Generation • Updated Jan 11 • 2
ContextualAI/archangel_dpo_pythia1-4b • Text Generation • Updated Jan 11 • 1
ContextualAI/archangel_dpo_pythia2-8b • Text Generation • Updated Jan 11 • 2
ContextualAI/archangel_dpo_pythia6-9b • Text Generation • Updated Jan 11 • 3
ContextualAI/archangel_dpo_pythia12-0b • Text Generation • Updated Jan 11 • 1
ContextualAI/archangel_dpo_llama7b • Text Generation • Updated Jan 11 • 8
ContextualAI/archangel_dpo_llama13b • Text Generation • Updated Jan 11 • 3
ContextualAI/archangel_dpo_llama30b • Text Generation • Updated Jan 11 • 2
ContextualAI/archangel_kto_pythia1-4b • Text Generation • Updated Jan 11 • 1
ContextualAI/archangel_kto_pythia2-8b • Text Generation • Updated Jan 11 • 61
ContextualAI/archangel_kto_pythia6-9b • Text Generation • Updated Jan 11 • 2
ContextualAI/archangel_kto_pythia12-0b • Text Generation • Updated Jan 11 • 1
ContextualAI/archangel_kto_llama7b • Text Generation • Updated Jan 11 • 5 • 1