4 3 39

Yuxiang Zhang

Joelzhang

AI & ML interests

None yet

Recent Activity

updated a dataset 29 days ago

Joelzhang/ToolBeHonest

New activity 29 days ago

Joelzhang/ToolBeHonest:[bot] Conversion to Parquet

New activity 29 days ago

Joelzhang/ToolBeHonest:Upload 2 files

View all activity

Organizations

Joelzhang's activity

updated a dataset 29 days ago

Joelzhang/ToolBeHonest

Viewer • Updated 29 days ago • 700 • 118 • 2

New activity in Joelzhang/ToolBeHonest 29 days ago

[bot] Conversion to Parquet

#1 opened 5 months ago by

parquet-converter

Upload 2 files

#3 opened 29 days ago by

wanng

Update README.md

#4 opened 29 days ago by

wanng

Update README.md

#2 opened 29 days ago by

wanng

authored a paper about 2 months ago

ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models

Paper • 2406.20015 • Published Jun 28 • 1

upvoted 2 papers about 2 months ago

ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models

Paper • 2406.20015 • Published Jun 28 • 1

A Survey on the Honesty of Large Language Models

Paper • 2409.18786 • Published Sep 27 • 31

liked a dataset 3 months ago

BAAI/Infinity-Instruct

Viewer • Updated 23 days ago • 20.4M • 8.72k • 557

authored 6 papers 4 months ago

Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence

Paper • 2209.02970 • Published Sep 7, 2022

Solving Math Word Problems via Cooperative Reasoning induced Language Models

Paper • 2210.16257 • Published Oct 28, 2022

EALM: Introducing Multidimensional Ethical Alignment in Conversational Information Retrieval

Paper • 2310.00970 • Published Oct 2, 2023

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Paper • 2405.19327 • Published May 29 • 46

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

Paper • 2406.09961 • Published Jun 14 • 54

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents

Paper • 2406.13923 • Published Jun 20 • 21

upvoted a paper 5 months ago

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

Paper • 2406.09961 • Published Jun 14 • 54

liked a dataset 9 months ago

abacusai/SystemChat

Viewer • Updated Mar 4 • 7.02k • 64 • 124

liked a dataset 10 months ago

DrNicefellow/CHAT-ALL-IN-ONE-v1

Viewer • Updated Feb 6 • 1.24M • 110 • 5

liked 2 datasets 12 months ago

yuyijiong/LongPaper_multitask

Preview • Updated Dec 4, 2023 • 77 • 14

gaia-benchmark/GAIA

Viewer • Updated Mar 26 • 932 • 584 • 156