
Mert Erbak PRO

merterbak

AI & ML interests

Currently NLP and Image Processing

Recent Activity

liked a Space 11 days ago
merterbak/grok
liked a dataset 11 days ago
ylecun/mnist
reacted to m-ric's post with 🚀 11 days ago

Organizations

MLX Community, Social Post Explorers, Hugging Face Discord Community, AI Starter Pack

merterbak's activity

reacted to m-ric's post with 🚀 11 days ago
Post
2361
💥 Google releases Gemini 2.0, starting with a Flash model that steamrolls GPT-4o and Claude-3.6 Sonnet! And they are starting a huge effort on agentic capabilities.

🚀 The performance improvements are crazy for such a fast model:
‣ Gemini 2.0 Flash outperforms the previous 1.5 Pro model at twice the speed
‣ Now supports both input AND output of images, video, audio and text
‣ Can natively use tools like Google Search and execute code

➡️ If the price is on par with the previous Flash iteration ($0.30 / M tokens, compared with GPT-4o's $1.25), the competition will have a big problem with this 4x cheaper model that gets better benchmarks 🤯

🤖 What about the agentic capabilities?

‣ Project Astra: A universal AI assistant that can use Google Search, Lens and Maps
‣ Project Mariner: A Chrome extension that can complete complex web tasks (83.5% success rate on the WebVoyager benchmark, this is really impressive!)
‣ Jules: An AI coding agent that integrates with GitHub workflows

I'll be eagerly awaiting further news from Google!

Read their blogpost here 👉 https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/
reacted to samjulien's post with 🔥 16 days ago
Post
1469
🔥 RAG in just a few lines of code?!

Try out our Hacker News Listener with new built-in RAG capabilities and Palmyra X 004 from the team at Writer!

This Writer Framework app:

- Scrapes up to 500 HN stories and comments
- Uploads them to a Knowledge Graph
- Enables interactive chat with the content using graph-based RAG
- Provides source attribution with every response

The best part? Setting up RAG is now incredibly simple - just a few lines of code to connect your Knowledge Graph as a tool with Palmyra X 004.

🤗 Space: samjulien/hacker-news-listener
💻 Code: https://github.com/writer/framework-tutorials/tree/main/hacker-news-social-listener
upvoted an article 23 days ago
Article

Use Models from the Hugging Face Hub in LM Studio

By yagilb • 127
reacted to merve's post with 🔥 26 days ago
Post
3842
Small yet mighty! 💫

We are releasing SmolVLM: a new 2B small vision language model made for on-device use, fine-tunable on a consumer GPU, immensely memory efficient 🤠

We release three checkpoints under Apache 2.0: SmolVLM-Instruct, SmolVLM-Synthetic and SmolVLM-Base HuggingFaceTB/smolvlm-6740bd584b2dcbf51ecb1f39

Learn more from our blog here: huggingface.co/blog/smolvlm
This release comes with a demo, fine-tuning code, MLX integration and TRL integration for DPO 💝
Try the demo: HuggingFaceTB/SmolVLM
Fine-tuning Recipe: https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb
Also TRL integration for DPO 💗