4 3 51

Attashe

attashe

attashe

AI & ML interests

Neural Network, Object detection, Generative Art

Recent Activity

updated a model about 1 hour ago

attashe/uno_converted

published a model about 5 hours ago

attashe/uno_converted

liked a model 7 days ago

bytedance-research/UNO

View all activity

Organizations

attashe's activity

updated a model about 1 hour ago

attashe/uno_converted

Updated about 1 hour ago

published a model about 5 hours ago

attashe/uno_converted

Updated about 1 hour ago

liked a model 7 days ago

bytedance-research/UNO

Text-to-Image • Updated 12 days ago • 100

reacted to jjokah's post with 🔥 8 days ago

Post

2319

# Video Tokenization — for efficient AI video processing

Meet 𝐕𝐢𝐝𝐓𝐨𝐤, a new open-source video tokenization technique developed by Microsoft Research to address the computational challenges of processing large volumes of video data. The core problem VidTok tackles is the inefficiency caused by redundant information in raw video pixels.

VidTok converts complex video footage into compact, structured units called tokens, making it easier and more efficient for AI systems to analyze, understand, and generate video content.

Research Paper: https://arxiv.org/abs/2412.13061
VidTok Code: https://github.com/microsoft/VidTok

liked a model 12 days ago

google/gemma-3-27b-it-qat-q4_0-gguf

Image-Text-to-Text • Updated 6 days ago • 63k • 172

liked a model 16 days ago

ZySec-AI/gemma-3-27b-tools

Image-Text-to-Text • Updated 27 days ago • 174 • 6

updated a model 18 days ago

attashe/gemma-3-27b-tools-Q5_K_M-GGUF

Image-Text-to-Text • Updated 18 days ago • 54 • 1

published a model 18 days ago

attashe/gemma-3-27b-tools-Q5_K_M-GGUF

Image-Text-to-Text • Updated 18 days ago • 54 • 1

liked a model 25 days ago

stepfun-ai/GOT-OCR-2.0-hf

Image-Text-to-Text • Updated Jan 31 • 27.8k • 189

liked a model about 2 months ago

Wan-AI/Wan2.1-T2V-14B

Text-to-Video • Updated Mar 12 • 66.6k • • 1.2k

reacted to prithivMLmods's post with 👍 about 2 months ago

Post

3942

Dino: The Minimalist Multipurpose Chat System 🌠
Agent-Dino : prithivMLmods/Agent-Dino
Github: https://github.com/PRITHIVSAKTHIUR/Agent-Dino

By default, it performs the following tasks:
{Text-to-Text Generation}, {Image-Text-Text Generation}
@image: Generates an image using Stable Diffusion xL.
@3d: Generates a 3D mesh.
@web: Web search agents.
@rAgent: Initiates a reasoning chain using Llama mode for coding explanations.
@tts1-♀, @tts2-♂: Voice generation (Female and Male voices).
@yolo : Object Detection

liked a model about 2 months ago

lemonilia/Mistral-Small-3-Reasoner-s1

Updated Feb 8 • 815 • 17

liked a Space about 2 months ago

144

MatAnyone

🤡

Gradio demo for MatAnyone

liked a dataset 2 months ago

simplescaling/s1K

Viewer • Updated Feb 11 • 1k • 2.01k • 211

liked a Space 2 months ago

Video To Canny Edge

🏢

Convert video to Canny edge effect

liked a dataset 2 months ago

GAIR/LIMO

Viewer • Updated Feb 10 • 817 • 4.62k • 150

liked 2 models 3 months ago

bartowski/Mistral-Small-24B-Instruct-2501-GGUF

Text Generation • Updated Jan 30 • 17.9k • 110

mistralai/Mistral-Small-24B-Instruct-2501

Text Generation • Updated Feb 2 • 829k • • 893

reacted to AdinaY's post with 👍 3 months ago

Post

3206

It’s not just a flood of model releases, papers are dropping just as fast 🚀

Here are the 10 most upvoted papers from the Chinese community:
👉 zh-ai-community/2025-january-papers-679933cbf0f3ced11f5a168a

liked a dataset 3 months ago

ServiceNow-AI/R1-Distill-SFT

Viewer • Updated Feb 8 • 1.85M • 2.6k • 294