Attashe

attashe

AI & ML interests

Neural Network, Object detection, Generative Art

Recent Activity

updated a model about 1 hour ago
attashe/uno_converted
published a model about 5 hours ago
attashe/uno_converted
liked a model 7 days ago
bytedance-research/UNO
View all activity

Organizations

Storia AI's profile picture

attashe's activity

published a model about 5 hours ago
reacted to jjokah's post with πŸ”₯ 8 days ago
view post
Post
2319
# Video Tokenization β€” for efficient AI video processing

Meet 𝐕𝐒𝐝𝐓𝐨𝐀, a new open-source video tokenization technique developed by Microsoft Research to address the computational challenges of processing large volumes of video data. The core problem VidTok tackles is the inefficiency caused by redundant information in raw video pixels.

VidTok converts complex video footage into compact, structured units called tokens, making it easier and more efficient for AI systems to analyze, understand, and generate video content.

Research Paper: https://arxiv.org/abs/2412.13061
VidTok Code: https://github.com/microsoft/VidTok
reacted to prithivMLmods's post with πŸ‘ about 2 months ago
view post
Post
3942
Dino: The Minimalist Multipurpose Chat System 🌠
Agent-Dino : prithivMLmods/Agent-Dino
Github: https://github.com/PRITHIVSAKTHIUR/Agent-Dino

By default, it performs the following tasks:
{Text-to-Text Generation}, {Image-Text-Text Generation}
@image: Generates an image using Stable Diffusion xL.
@3d: Generates a 3D mesh.
@web: Web search agents.
@rAgent: Initiates a reasoning chain using Llama mode for coding explanations.
@tts1-♀, @tts2-β™‚: Voice generation (Female and Male voices).
@yolo : Object Detection
reacted to AdinaY's post with πŸ‘ 3 months ago