NLP Group of The University of Hong Kong

university

https://nlp.cs.hku.hk/

HKUNLP

Activity Feed Request to join this org

AI & ML interests

Pretraining algorithms, Semantic parsing, Dialog systems, Machine Translation

Recent Activity

tianbaoxiexxx authored a paper about 1 month ago

Qwen2.5-VL Technical Report

ranpox authored a paper about 1 month ago

Qwen2.5-VL Technical Report

multi-train authored a paper 2 months ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

View all activity

hkunlp's activity

tianbaoxiexxx

authored a paper about 1 month ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 173

ranpox

authored a paper about 1 month ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 173

multi-train

authored a paper 2 months ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published Jan 18 • 25

ranpox

authored a paper 3 months ago

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Paper • 2412.09605 • Published Dec 12, 2024 • 29

ranpox

authored a paper 4 months ago

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 65

tianbaoxiexxx

authored a paper 4 months ago

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 65

ranpox

authored a paper 5 months ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11, 2024 • 48

halfrot

authored a paper 7 months ago

HAF-RM: A Hybrid Alignment Framework for Reward Model Training

Paper • 2407.04185 • Published Jul 4, 2024

multi-train

authored 4 papers 8 months ago

tianbaoxiexxx

authored a paper 8 months ago

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Paper • 2407.10956 • Published Jul 15, 2024 • 7

multi-train

authored a paper 8 months ago

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Paper • 2407.12883 • Published Jul 16, 2024 • 9

taoyds

authored a paper 11 months ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11, 2024 • 48

yushihu

authored a paper 11 months ago

BLINK: Multimodal Large Language Models Can See but Not Perceive

Paper • 2404.12390 • Published Apr 18, 2024 • 26

yushihu

authored 3 papers 12 months ago

Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models

Paper • 2312.03052 • Published Dec 5, 2023

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Paper • 2303.11897 • Published Mar 21, 2023

Training Language Models to Generate Text with Citations via Fine-grained Rewards

Paper • 2402.04315 • Published Feb 6, 2024

tianbaoxiexxx

authored a paper 12 months ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11, 2024 • 48

AI & ML interests

Recent Activity

Team members 14

hkunlp's activity