2 2 6

Kaijie Zhu

March07

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

upvoted a paper 2 months ago

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

liked a Space 6 months ago

lmsys/mt-bench

View all activity

Organizations

March07's activity

upvoted a paper 5 days ago

THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

Paper • 2504.13367 • Published 10 days ago • 24

upvoted a paper 2 months ago

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

Paper • 2502.14296 • Published Feb 20 • 46

liked 2 Spaces 6 months ago

187

MT Bench

📊

Compare model answers to questions

AgentReview

🎓

EMNLP 2024

liked a model 8 months ago

stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12, 2024 • 15.4k • • 4.76k

authored 5 papers 12 months ago

PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts

Paper • 2306.04528 • Published Jun 7, 2023 • 3

A Survey on Evaluation of Large Language Models

Paper • 2307.03109 • Published Jul 6, 2023 • 42

Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning

Paper • 2308.02533 • Published Aug 1, 2023

Large Language Models Understand and Can be Enhanced by Emotional Stimuli

Paper • 2307.11760 • Published Jul 14, 2023 • 1

DyVal: Dynamic Evaluation of Large Language Models for Reasoning Tasks

Paper • 2309.17167 • Published Sep 29, 2023 • 1

liked a Space about 1 year ago

11.1k

Stable Diffusion 2-1

🔥

Generate images from text descriptions

liked a dataset over 1 year ago

lukaemon/bbh

Viewer • Updated Feb 2, 2023 • 6.51k • 23.3k • 61

authored a paper over 1 year ago

PromptBench: A Unified Library for Evaluation of Large Language Models

Paper • 2312.07910 • Published Dec 13, 2023 • 19

New activity in microsoft/phi-1_5 over 1 year ago

Module Not Found

#19 opened over 1 year ago by

March07

liked a Space almost 2 years ago

PromptBench

🏃