S M Jishanul Islam's picture

14 16

S M Jishanul Islam

smji

·

https://s-m-j-i.github.io/Personal-CV/

S-M-J-I

AI & ML interests

Computer Vision, NLP, LLMs, Multimodal Deep Learning

Recent Activity

liked a dataset 7 days ago

nyu-visionx/VSI-Bench

liked a model 17 days ago

nvidia/GR00T-N1-2B

liked a dataset 20 days ago

TIGER-Lab/VisualWebInstruct

View all activity

Organizations

smji's activity

upvoted 8 collections 3 months ago

Multimodal Benchmarks

102 items • Updated 1 day ago • 9

Multimodal Dataset

43 items • Updated 8 days ago • 3

LLM context length

1 item • Updated Jan 10, 2024 • 1

LLM

2 items • Updated Jan 9, 2024 • 1

PEFT

1 item • Updated Oct 17, 2023 • 1

Multimodal Analysis

7 items • Updated 12 days ago • 1

Multimodal Alignment

17 items • Updated 10 days ago • 2

Multimodal LLM

184 items • Updated 1 day ago • 16

upvoted an article 9 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11, 2024

• 118

upvoted a paper 12 months ago

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8, 2024 • 22

upvoted a collection about 1 year ago

PDF Document / OCR Datasets

Document datasets with .pdf files that are usable with pixparse libraries and tools. • 2 items • Updated Mar 30, 2024 • 48

upvoted 2 papers about 1 year ago

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20, 2024 • 24

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14, 2024 • 127

upvoted a collection about 1 year ago

Bengali Regional Text to IPA Models

A collection of models for transcribing Bengali Regional Text to the International Phonetic Alphabets (IPA). • 4 items • Updated Jan 20 • 1