Alara Dirik

adirik · alaradirik

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

pyannote/segmentation

liked a model 3 days ago

pyannote/speaker-diarization

liked a model 4 days ago

neuralwork/gemma-2-9b-it-tr

Articles

A Dive into Text-to-Video Models

New ViT and ALIGN Models From Kakao Brain

Using Machine Learning to Aid Survivors and Race through Time

A Dive into Pretraining Strategies for Vision-Language Models

Universal Image Segmentation with Mask2Former and OneFormer

Organizations

adirik's activity

upvoted an article 29 days ago

Article

Halo: Open Source Health Tracking with Wearables

Nov 19 • 99

upvoted 2 collections about 2 months ago

Cosmos Tokenizer

A suite of image and video tokenizers • 12 items • Updated 14 days ago • 28

LLM2CLIP

LLM2CLIP makes SOTA pretrained CLIP models more SOTA than ever. • 10 items • Updated 20 days ago • 49

upvoted a collection 2 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 9 days ago • 195

upvoted 2 articles 4 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22 • 86

Article

A Dive into Text-to-Video Models

May 8, 2023 • 23

upvoted a collection 6 months ago

Embedding Model Datasets

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 67 items • Updated Jul 3 • 89

upvoted a paper 9 months ago

L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects

Paper • 2402.09052 • Published Feb 14 • 17

upvoted a collection 9 months ago

Candle Wasm Examples

11 items • Updated Apr 3 • 17

upvoted 2 papers 11 months ago

IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation

Paper • 2402.08682 • Published Feb 13 • 12

StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis

Paper • 2401.17093 • Published Jan 30 • 19

upvoted 2 collections 11 months ago

Text to 3D and Motion Papers

50 items • Updated Feb 13 • 3

TURNA

8 items • Updated May 1 • 8

upvoted 2 papers about 1 year ago

Kosmos-G: Generating Images in Context with Multimodal Large Language Models

Paper • 2310.02992 • Published Oct 4, 2023 • 4

RealFill: Reference-Driven Generation for Authentic Image Completion

Paper • 2309.16668 • Published Sep 28, 2023 • 14

upvoted a paper over 1 year ago

Simple and Controllable Music Generation

Paper • 2306.05284 • Published Jun 8, 2023 • 146