Yida Lu

lrxl

AI & ML interests

None yet

Organizations

None yet

lrxl's activity

upvoted a paper 6 days ago

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Paper • 2412.14470 • Published 11 days ago • 8

authored a paper 13 days ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published 14 days ago • 15

upvoted a paper 13 days ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published 14 days ago • 15

authored a paper 6 months ago

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

Paper • 2406.16714 • Published Jun 24 • 10

updated a dataset 6 months ago

lrxl/AutoDetect-results

Viewer • Updated Jun 25 • 15 • 43 • 1

liked 4 models 10 months ago

thu-coai/ShieldLM-6B-chatglm3

Feature Extraction • Updated Feb 27 • 43 • 3

thu-coai/ShieldLM-13B-baichuan2

Text Generation • Updated Feb 27 • 49 • 3

thu-coai/ShieldLM-7B-internlm2

Feature Extraction • Updated Feb 27 • 554 • 9

thu-coai/ShieldLM-14B-qwen

Text Generation • Updated Feb 27 • 281 • 13