- JudgeLM: Fine-tuned Large Language Models are Scalable Judges
  Paper • 2310.17631 • Published • 33
- AgentTuning: Enabling Generalized Agent Abilities for LLMs
  Paper • 2310.12823 • Published • 35
- G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment
  Paper • 2303.16634 • Published • 3
- GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems
  Paper • 2310.12397 • Published • 1
Vinay Mimani
wiredmau5
Recent Activity
Liked a model about 1 month ago: lion-ai/MedImageInsights
Collections
1