Collection of papers that utilize reinforcement learning to enhance tool usage and function calling.
August Moharrami
August4293
·
AI & ML interests
None yet
Recent Activity
updated
a collection
2 days ago
RL Fine-tuning Reasoning
updated
a collection
2 days ago
RL Fine-tuning Tool Usage
updated
a collection
2 days ago
RL Fine-tuning Tool Usage
Organizations
models
4
datasets
8
August4293/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
•
16.6k
•
43
August4293/sentiment
Viewer
•
Updated
•
6.26k
•
49
August4293/hello_world
Viewer
•
Updated
•
3
•
35
August4293/tool_sample_dataset
Viewer
•
Updated
•
200
•
52
August4293/gsm8k_preference_dataset_it_2
Viewer
•
Updated
•
379
•
40
August4293/gsm8k_preference_dataset_it_1
Viewer
•
Updated
•
895
•
37
August4293/Self_Alignment_Preference-Dataset
Viewer
•
Updated
•
4.45k
•
42
August4293/CS_QA
Viewer
•
Updated
•
969
•
8