C. Emre Karataş

emredeveloper

AI & ML interests

LLM

Recent Activity

liked a model about 6 hours ago
microsoft/OmniParser-v2.0
liked a model 1 day ago
tomg-group-umd/huginn-0125
liked a model 1 day ago
amazon/chronos-bolt-base
View all activity

Organizations

None yet

emredeveloper's activity

reacted to merve's post with 🔥 3 days ago
view post
Post
4285
Your weekly recap of open AI is here, and it's packed with models! merve/feb-14-releases-67af876b404cc27c6d837767

👀 Multimodal
> OpenGVLab released InternVideo 2.5 Chat models, new video LMs with long context
> AIDC released Ovis2 model family along with Ovis dataset, new vision LMs in different sizes (1B, 2B, 4B, 8B, 16B, 34B), with video and OCR support
> ColQwenStella-2b is a multilingual visual retrieval model that is sota in it's size
> Hoags-2B-Exp is a new multilingual vision LM with contextual reasoning, long context video understanding

💬 LLMs
A lot of math models!
> Open-R1 team released OpenR1-Math-220k large scale math reasoning dataset, along with Qwen2.5-220K-Math fine-tuned on the dataset, OpenR1-Qwen-7B
> Nomic AI released new Nomic Embed multilingual retrieval model, a MoE with 500 params with 305M active params, outperforming other models
> DeepScaleR-1.5B-Preview is a new DeepSeek-R1-Distill fine-tune using distributed RL on math
> LIMO is a new fine-tune of Qwen2.5-32B-Instruct on Math

🗣️ Audio
> Zonos-v0.1 is a new family of speech recognition models, which contains the model itself and embeddings

🖼️ Vision and Image Generation
> We have ported DepthPro of Apple to transformers for your convenience!
> illustrious-xl-v1.0 is a new illustration generation model
·
replied to merve's post 3 days ago
upvoted an article 5 days ago
view article
Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

43
upvoted an article 6 days ago
view article
Article

Build awesome datasets for video generation

23