Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research Paper • 2402.00159 • Published Jan 31 • 56
An Efficient Memory-Augmented Transformer for Knowledge-Intensive NLP Tasks Paper • 2210.16773 • Published Oct 30, 2022 • 1
Structured Packing in LLM Training Improves Long Context Utilization Paper • 2312.17296 • Published Dec 28, 2023 • 1
A Simple and Effective L_2 Norm-Based Strategy for KV Cache Compression Paper • 2406.11430 • Published Jun 17 • 22
Analysing The Impact of Sequence Composition on Language Model Pre-Training Paper • 2402.13991 • Published Feb 21 • 1
The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models Paper • 2404.05904 • Published Apr 8 • 5
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation Paper • 2407.06135 • Published Jul 2024 • 19
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages Paper • 2407.05975 • Published Jul 2024 • 32
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 2024 • 84
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale Paper • 2406.19280 • Published Jun 2024 • 56
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Paper • 2406.16860 • Published Jun 24 • 53
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25 • 75
Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges Paper • 2406.12624 • Published Jun 18 • 35
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs Paper • 2406.15319 • Published Jun 21 • 57
VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models Paper • 2406.16338 • Published Jun 24 • 23
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published Jun 22 • 43
PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents Paper • 2406.13923 • Published Jun 20 • 21
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs Paper • 2406.14544 • Published Jun 20 • 34
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20 • 78
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Paper • 2406.11896 • Published Jun 14 • 18
Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities Paper • 2406.14562 • Published Jun 20 • 27
MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding Paper • 2406.14515 • Published Jun 20 • 29
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI Paper • 2406.12753 • Published Jun 18 • 14
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning Paper • 2406.12050 • Published Jun 17 • 10
WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences Paper • 2406.11069 • Published Jun 16 • 12
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs Paper • 2406.11833 • Published Jun 17 • 61
Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning Paper • 2406.12742 • Published Jun 18 • 14
AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology Paper • 2406.11912 • Published Jun 16 • 25
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools Paper • 2406.12793 • Published Jun 18 • 30
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Paper • 2406.11931 • Published Jun 17 • 54
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation Paper • 2406.09961 • Published Jun 14 • 54
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices Paper • 2406.08451 • Published Jun 12 • 23
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text Paper • 2406.08418 • Published Jun 12 • 28
Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper • 2406.06592 • Published Jun 5 • 17
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models Paper • 2406.06563 • Published Jun 3 • 17
The Prompt Report: A Systematic Survey of Prompting Techniques Paper • 2406.06608 • Published Jun 6 • 48
HelpSteer2: Open-source dataset for training top-performing reward models Paper • 2406.08673 • Published Jun 12 • 14
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning Paper • 2406.06469 • Published Jun 10 • 22
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos Paper • 2406.08407 • Published Jun 12 • 24
mOSCAR: A Large-scale Multilingual and Multimodal Document-level Corpus Paper • 2406.08707 • Published Jun 13 • 14
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models Paper • 2406.09403 • Published Jun 13 • 18
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning Paper • 2406.09170 • Published Jun 13 • 24
Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach Paper • 2406.04594 • Published Jun 7 • 4
GenAI Arena: An Open Evaluation Platform for Generative Models Paper • 2406.04485 • Published Jun 6 • 19
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild Paper • 2406.04770 • Published Jun 7 • 24
Mixture-of-Agents Enhances Large Language Model Capabilities Paper • 2406.04692 • Published Jun 7 • 50
Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration Paper • 2406.01014 • Published Jun 3 • 29
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments Paper • 2406.04151 • Published Jun 6 • 14
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models Paper • 2406.04271 • Published Jun 6 • 27
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions Paper • 2406.04325 • Published Jun 6 • 69
RULER: What's the Real Context Size of Your Long-Context Language Models? Paper • 2404.06654 • Published Apr 9 • 32
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments Paper • 2404.07972 • Published Apr 11 • 41