MAIR Lab

university

mair-lab

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

rabiulawal updated a dataset about 9 hours ago

mair-lab/omniedit-got-tokenized-256

BAJUKA authored a paper 4 days ago

UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction

rabiulawal published a dataset 8 days ago

mair-lab/omniedit-got-tokenized-256

View all activity

mair-lab's activity

rabiulawal

updated a dataset about 9 hours ago

mair-lab/omniedit-got-tokenized-256

Updated about 9 hours ago • 31

BAJUKA

authored a paper 4 days ago

UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction

Paper • 2503.15661 • Published 30 days ago • 1

rabiulawal

published a dataset 8 days ago

mair-lab/omniedit-got-tokenized-256

Updated about 9 hours ago • 31

BAJUKA

authored a paper about 1 month ago

LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces

Paper • 2503.01894 • Published Feb 27 • 2

BAJUKA

updated a dataset 2 months ago

mair-lab/CulturalVQA

Viewer • Updated Feb 17 • 2.37k • 346 • 6

oscmansan

authored a paper 3 months ago

Consistency-diversity-realism Pareto fronts of conditional image generative models

Paper • 2406.10429 • Published Jun 14, 2024

joanrodai

authored a paper 4 months ago

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Paper • 2412.04626 • Published Dec 5, 2024 • 14

rabiulawal

authored a paper 4 months ago

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Paper • 2412.04626 • Published Dec 5, 2024 • 14

BAJUKA

authored a paper 4 months ago

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Paper • 2412.04626 • Published Dec 5, 2024 • 14

aagrawal

authored a paper 11 months ago

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 90

oscmansan

authored a paper 11 months ago

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 90

oscmansan

authored a paper about 1 year ago

Improving Text-to-Image Consistency via Automatic Prompt Optimization

Paper • 2403.17804 • Published Mar 26, 2024 • 18

aagrawal

authored a paper about 1 year ago

Improving Text-to-Image Consistency via Automatic Prompt Optimization

Paper • 2403.17804 • Published Mar 26, 2024 • 18

oscmansan

authored 2 papers about 1 year ago

MAPL: Parameter-Efficient Adaptation of Unimodal Pre-Trained Models for Vision-Language Few-Shot Prompting

Paper • 2210.07179 • Published Oct 13, 2022 • 3

Improving Automatic VQA Evaluation Using Large Language Models

Paper • 2310.02567 • Published Oct 4, 2023 • 3

joanrodai

authored a paper over 1 year ago

StarVector: Generating Scalable Vector Graphics Code from Images

Paper • 2312.11556 • Published Dec 17, 2023 • 35

joanrodai

authored 2 papers almost 2 years ago

OCR-VQGAN: Taming Text-within-Image Generation

Paper • 2210.11248 • Published Oct 19, 2022

FigGen: Text to Scientific Figure Generation

Paper • 2306.00800 • Published Jun 1, 2023

aagrawal

authored a paper almost 2 years ago

Measuring Progress in Fine-grained Vision-and-Language Understanding

Paper • 2305.07558 • Published May 12, 2023 • 1

oscmansan

updated a Space almost 2 years ago

MAPL

🔥

AI & ML interests

Recent Activity

Team members 8

mair-lab's activity

MAPL