2 2 1

Yao Fu

yaofu

https://franxyao.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

authored a paper 12 months ago

Retrieval Head Mechanistically Explains Long-Context Factuality

authored a paper 12 months ago

Toward Inference-optimal Mixture-of-Expert Large Language Models

View all activity

Organizations

None yet

yaofu's activity

upvoted a paper 6 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 114

authored 10 papers 12 months ago

C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models

Paper • 2305.08322 • Published May 15, 2023

Data-to-text Generation with Variational Sequential Planning

Paper • 2202.13756 • Published Feb 28, 2022

Just-DREAM-about-it: Figurative Language Understanding with DREAM-FLUTE

Paper • 2210.16407 • Published Oct 28, 2022

commented 2 papers about 1 year ago

Data Engineering for Scaling Language Models to 128K Context

Paper • 2402.10171 • Published Feb 15, 2024 • 26 •

Data Engineering for Scaling Language Models to 128K Context

Paper • 2402.10171 • Published Feb 15, 2024 • 26 •

authored a paper about 1 year ago

Data Engineering for Scaling Language Models to 128K Context

Paper • 2402.10171 • Published Feb 15, 2024 • 26

updated a dataset about 1 year ago

yaofu/slimpajama-per-source-length-upsample

Viewer • Updated Feb 15, 2024 • 84.7k • 207 • 18

updated 2 models about 1 year ago

yaofu/llama-2-13b-64k

Text Generation • Updated Feb 15, 2024 • 4

yaofu/llama-2-7b-80k

Text Generation • Updated Feb 14, 2024 • 3.31k • 12

liked a Space over 1 year ago

347

Yi-34B-Chat

🔥

upvoted a paper over 1 year ago

Specializing Smaller Language Models towards Multi-Step Reasoning

Paper • 2301.12726 • Published Jan 30, 2023 • 1

authored a paper over 1 year ago

Specializing Smaller Language Models towards Multi-Step Reasoning

Paper • 2301.12726 • Published Jan 30, 2023 • 1