Jian Liao

imjliao

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

SparkAudio/Spark-TTS-0.5B

upvoted a paper 7 days ago

Chain of Draft: Thinking Faster by Writing Less

upvoted a paper 7 days ago

SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers

imjliao's activity

upvoted 3 papers 7 days ago

upvoted a collection 17 days ago

Awesome Computer Use Agents

Collection

https://github.com/ranpox/awesome-computer-use • 25 items • Updated Dec 18, 2024 • 11

upvoted a paper 17 days ago

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 64

upvoted a collection about 1 month ago

Qwen2.5-VL (All Versions)

Collection

All versions of Qwen2.5-VL including 4-bit, 16-bit and more! • 9 items • Updated 11 days ago • 8

upvoted a paper 4 months ago

FlowMind: Automatic Workflow Generation with LLMs

Paper • 2404.13050 • Published Mar 17, 2024 • 34

upvoted an article 4 months ago

Visually Multilingual: Introducing mcdse-2b

Article • Oct 27, 2024 • 38

upvoted 2 articles 5 months ago

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Article • May 14, 2024 • 243

Docmatix - a huge dataset for Document Visual Question Answering

Article • Jul 18, 2024 • 72

upvoted a paper 9 months ago

THOUGHTSCULPT: Reasoning with Intermediate Revision and Search

Paper • 2404.05966 • Published Apr 9, 2024 • 2

upvoted a collection 11 months ago

🤖 Agents

Collection

21 items • Updated Dec 31, 2024 • 139

upvoted 8 papers about 1 year ago

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16, 2024 • 78

WebLINX: Real-World Website Navigation with Multi-Turn Dialogue

Paper • 2402.05930 • Published Feb 8, 2024 • 39

Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding

Paper • 2401.12954 • Published Jan 23, 2024 • 30

Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6, 2024 • 115

Scaling Laws for Downstream Task Performance of Large Language Models

Paper • 2402.04177 • Published Feb 6, 2024 • 18

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Paper • 2402.04858 • Published Feb 7, 2024 • 15

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 43

Direct Language Model Alignment from Online AI Feedback

Paper • 2402.04792 • Published Feb 7, 2024 • 31