IndustryAI's picture

IndustryAI

AI4Industry

·

AI & ML interests

None yet

Recent Activity

new activity 6 days ago

ds4sd/SubGrapher:Similar findings in MolParser

liked a dataset 7 days ago

allenai/olmOCR-mix-0225

updated a dataset about 1 month ago

AI4Industry/MolParser-7M

View all activity

Organizations

None yet

AI4Industry's activity

upvoted an article about 2 months ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20

• 232

upvoted an article 2 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.22k

upvoted an article 3 months ago

Article

Timm ❤️ Transformers: Use any timm model with transformers

Jan 16

• 46

upvoted a paper 4 months ago

MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild

Paper • 2411.11098 • Published Nov 17, 2024 • 1

upvoted 2 collections 7 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 211

MobileNetV4 pretrained weights

Weights for MobileNet-V4 pretrained in timm • 17 items • Updated Sep 22, 2024 • 18

upvoted a paper 8 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 163

upvoted a collection 9 months ago

🍃 MINT-1T

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24, 2024 • 59

upvoted a collection 11 months ago

Searching for Better ViT Baselines

Exploring ViT hparams and model shapes for the GPU poor (between tiny and base). • 28 items • Updated Feb 14 • 17

upvoted an article 12 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

• 176

upvoted a paper about 1 year ago

Uni-SMART: Universal Science Multimodal Analysis and Research Transformer

Paper • 2403.10301 • Published Mar 15, 2024 • 54