2 10

Faria Huq

oaishi

https://oaishi.github.io

AI & ML interests

LLM Agents, Personalization, User Preference Modeling

Recent Activity

upvoted a paper 15 days ago

Magma: A Foundation Model for Multimodal AI Agents

commented on a paper about 1 month ago

CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation

upvoted a paper about 1 month ago

Fixing Imbalanced Attention to Mitigate In-Context Hallucination of Large Vision-Language Model

View all activity

Organizations

None yet

oaishi's activity

upvoted a paper 15 days ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published 24 days ago • 56

commented a paper about 1 month ago

CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation

Paper • 2501.16609 • Published Jan 28 • 7 •

upvoted a paper about 1 month ago

Fixing Imbalanced Attention to Mitigate In-Context Hallucination of Large Vision-Language Model

Paper • 2501.12206 • Published Jan 21 • 4

upvoted a paper 3 months ago

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published Dec 6, 2024 • 21

upvoted a paper 4 months ago

HumanEval-V: Benchmarking High-Level Visual Reasoning with Complex Diagrams in Coding Tasks

Paper • 2410.12381 • Published Oct 16, 2024 • 44

upvoted 2 papers 5 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 171

Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code

Paper • 2409.19715 • Published Sep 29, 2024 • 10

authored a paper 6 months ago

NoTeeline: Supporting Real-Time Notetaking from Keypoints with Large Language Models

Paper • 2409.16493 • Published Sep 24, 2024 • 11

upvoted a paper 6 months ago

NoTeeline: Supporting Real-Time Notetaking from Keypoints with Large Language Models

Paper • 2409.16493 • Published Sep 24, 2024 • 11

upvoted a paper 9 months ago

The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6, 2024 • 60

upvoted a collection 11 months ago

Agent

Collection

112 items • Updated Sep 9, 2024 • 21

New activity in adept/fuyu-8b over 1 year ago

How to get Image embedding using Fuyu

#37 opened over 1 year ago by

oaishi

upvoted a paper over 1 year ago

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Paper • 2309.10020 • Published Sep 18, 2023 • 41