1 5 6

jinhaoduan

jhao

AI & ML interests

None yet

Recent Activity

upvoted a paper 23 days ago

SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially?

upvoted a paper about 1 month ago

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

commented on a paper about 1 month ago

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

View all activity

Organizations

jhao's activity

upvoted a paper 23 days ago

SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially?

Paper • 2503.12349 • Published Mar 16 • 41

upvoted a paper about 1 month ago

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

Paper • 2503.10602 • Published Mar 13 • 4

commented a paper about 1 month ago

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

Paper • 2503.10602 • Published Mar 13 • 4 •

liked a dataset 5 months ago

openbmb/RLHF-V-Dataset

Viewer • Updated May 28, 2024 • 5.73k • 589 • 63

updated a Space 6 months ago

llm-autobiography

🚀

liked a Space 6 months ago

llm-autobiography

🚀

liked a Space 7 months ago

llm-autobiography

🚀

liked a dataset 8 months ago

Henrychur/MedS-Bench

Updated Nov 15, 2024 • 69 • 8

liked a Space 8 months ago

369

Open Medical-LLM Leaderboard

🥇

Browse and submit LLM evaluations

upvoted a paper about 1 year ago

GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations

Paper • 2402.12348 • Published Feb 19, 2024 • 1

authored 2 papers about 1 year ago

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Paper • 2403.15447 • Published Mar 18, 2024 • 16

GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations

Paper • 2402.12348 • Published Feb 19, 2024 • 1

upvoted 2 papers about 1 year ago

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Paper • 2403.15447 • Published Mar 18, 2024 • 16

Neural Network Diffusion

Paper • 2402.13144 • Published Feb 20, 2024 • 97

updated a Space about 1 year ago

GTBench

😻

Explore and filter model evaluation results

liked a Space about 1 year ago

GTBench

😻

Explore and filter model evaluation results

updated 4 models over 1 year ago