
Symbol-LLM

https://xufangzhi.github.io/symbol-llm-page/

https://github.com/xufangzhi/Symbol-LLM

AI & ML interests

Natural Language Processing, Large Language Models, Neuro-Symbolic

Recent Activity

upvoted a paper 11 days ago

FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning

upvoted a paper 16 days ago

UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning

upvoted a paper 20 days ago

MAPS: A Multi-Agent Framework Based on Big Seven Personality and Socratic Guidance for Multimodal Scientific Problem Solving



Posts 6

Post


🥳 Thrilled to introduce our recent work on bootstrapping VLMs for multi-modal chain-of-thought reasoning!

📕 Title: Vision-Language Models Can Self-Improve Reasoning via Reflection

🔗 Link: Vision-Language Models Can Self-Improve Reasoning via Reflection (2411.00855)

😇 Takeaways:

- We found that VLMs can self-improve their reasoning performance through a reflection mechanism and, importantly, that this approach scales through test-time computing.

- Evaluations on comprehensive and diverse vision-language reasoning tasks are included!

Post


🚀 Excited to introduce a new member of the OS-Copilot family: OS-Atlas - an open-sourced foundational action model for GUI agents

📘 Paper: OS-ATLAS: A Foundation Action Model for Generalist GUI Agents (2410.23218)
🔗 Website: https://osatlas.github.io

😇 TL;DR: OS-Atlas offers:
1. State-of-the-Art GUI Grounding: Helps GUI agents accurately locate GUI elements.
2. Strong OOD Performance and Cross-platform Compatibility: Excels in out-of-domain agentic tasks across MacOS, Windows, Linux, Android, and Web.
3. Complete Infrastructure for GUI Data Synthesis: You can easily build your own OS agent upon it!


Collections 1

Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models

Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Vision-Language Models Can Self-Improve Reasoning via Reflection

Papers 1

arxiv:2411.00855

models 10

Symbol-LLM/Symbol-LLM-8B-Instruct-v1.1

Symbol-LLM/Symbol-LLM-8B-Instruct

Symbol-LLM/ENVISIONS_7B_miniwob_iter5

Symbol-LLM/ENVISIONS_13B_miniwob_iter5

Symbol-LLM/ENVISIONS_13B_logic_iter8

Symbol-LLM/ENVISIONS_7B_logic_iter8

Symbol-LLM/ENVISIONS_13B_math_iter10

Symbol-LLM/ENVISIONS_7B_math_iter10

Symbol-LLM/Symbol-LLM-7B-Instruct

Symbol-LLM/Symbol-LLM-13B-Instruct

datasets 1

Symbol-LLM/Symbolic_Collection
